Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kastenmüller G, Schenk ME, Gasteiger J, Mewes HW. Uncovering metabolic pathways relevant to phenotypic traits of microbial genomes. Genome Biol 2009;10:R28. [PMID: 19284550 PMCID: PMC2690999 DOI: 10.1186/gb-2009-10-3-r28] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2008] [Revised: 02/12/2009] [Accepted: 03/10/2009] [Indexed: 01/20/2023] Open

For:	Kastenmüller G, Schenk ME, Gasteiger J, Mewes HW. Uncovering metabolic pathways relevant to phenotypic traits of microbial genomes. Genome Biol 2009;10:R28. [PMID: 19284550 PMCID: PMC2690999 DOI: 10.1186/gb-2009-10-3-r28] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2008] [Revised: 02/12/2009] [Accepted: 03/10/2009] [Indexed: 01/20/2023] Open

Number

Cited by Other Article(s)

Siddharth T, Lewis NE. Predicting pathways for old and new metabolites through clustering. J Theor Biol 2024;578:111684. [PMID: 38048983 PMCID: PMC11139542 DOI: 10.1016/j.jtbi.2023.111684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 12/06/2023]

Karp PD, Paley S, Caspi R, Kothari A, Krummenacker M, Midford PE, Moore LR, Subhraveti P, Gama-Castro S, Tierrafria VH, Lara P, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Sun G, Ahn-Horst TA, Choi H, Covert MW, Collado-Vides J, Paulsen I. The EcoCyc Database (2023). EcoSal Plus 2023;11:eesp00022023. [PMID: 37220074 PMCID: PMC10729931 DOI: 10.1128/ecosalplus.esp-0002-2023] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 04/04/2023] [Indexed: 01/28/2024]

Affiliation(s)

Peter D. Karp Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Suzanne Paley Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Ron Caspi Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Anamika Kothari Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Markus Krummenacker Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Peter E. Midford Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Lisa R. Moore Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Pallavi Subhraveti Bioinformatics Research Group, SRI International, Menlo Park, California, USA
Socorro Gama-Castro Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
Victor H. Tierrafria Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
Paloma Lara Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
Luis Muñiz-Rascado Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
César Bonavides-Martinez Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
Alberto Santos-Zavaleta Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
Amanda Mackie Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, New South Wales, Australia
Gwanggyu Sun Department of Bioengineering, Stanford University, Stanford, California, USA
Travis A. Ahn-Horst Department of Bioengineering, Stanford University, Stanford, California, USA
Heejo Choi Department of Bioengineering, Stanford University, Stanford, California, USA
Markus W. Covert Department of Bioengineering, Stanford University, Stanford, California, USA
Julio Collado-Vides Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
Ian Paulsen School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia

Collapse

Singh DP, Bisen MS, Shukla R, Prabha R, Maurya S, Reddy YS, Singh PM, Rai N, Chaubey T, Chaturvedi KK, Srivastava S, Farooqi MS, Gupta VK, Sarma BK, Rai A, Behera TK. Metabolomics-Driven Mining of Metabolite Resources: Applications and Prospects for Improving Vegetable Crops. Int J Mol Sci 2022;23:ijms232012062. [PMID: 36292920 PMCID: PMC9603451 DOI: 10.3390/ijms232012062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 09/13/2022] [Accepted: 09/23/2022] [Indexed: 11/16/2022] Open

Affiliation(s)

Dhananjaya Pratap Singh ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India Correspondence:
Mansi Singh Bisen ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India
Renu Shukla Indian Council of Agricultural Research (ICAR), Krishi Bhawan, Dr. Rajendra Prasad Road, New Delhi 110001, India
Ratna Prabha ICAR-Indian Agricultural Statistics Research Institute, Centre for Agricultural Bioinformatics, Library Avenue, Pusa, New Delhi 110012, India
Sudarshan Maurya ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India
Yesaru S. Reddy ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India
Prabhakar Mohan Singh ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India
Nagendra Rai ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India
Tribhuwan Chaubey ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India
Krishna Kumar Chaturvedi ICAR-Indian Agricultural Statistics Research Institute, Centre for Agricultural Bioinformatics, Library Avenue, Pusa, New Delhi 110012, India
Sudhir Srivastava ICAR-Indian Agricultural Statistics Research Institute, Centre for Agricultural Bioinformatics, Library Avenue, Pusa, New Delhi 110012, India
Mohammad Samir Farooqi ICAR-Indian Agricultural Statistics Research Institute, Centre for Agricultural Bioinformatics, Library Avenue, Pusa, New Delhi 110012, India
Vijai Kumar Gupta Biorefining and Advanced Materials Research Centre, Scotland’s Rural College, Kings Buildings, West Mains Road, Edinburgh EH9 3JG, UK
Birinchi K. Sarma Department of Mycology and Plant Pathology, Institute of Agricultural Sciences, Banaras Hindu University, Varanasi 221005, India
Anil Rai ICAR-Indian Agricultural Statistics Research Institute, Centre for Agricultural Bioinformatics, Library Avenue, Pusa, New Delhi 110012, India
Tusar Kanti Behera ICAR-Indian Institute of Vegetable Research, Jakhini, Shahanshahpur, Varanasi 221305, India

Collapse

Chen B, Rupani PF, Azman S, Dewil R, Appels L. A redox-based strategy to enhance propionic and butyric acid production during anaerobic fermentation. BIORESOURCE TECHNOLOGY 2022;361:127672. [PMID: 35878771 DOI: 10.1016/j.biortech.2022.127672] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 07/17/2022] [Accepted: 07/18/2022] [Indexed: 06/15/2023]

Gasteiger J. Chemistry in Times of Artificial Intelligence. Chemphyschem 2020;21:2233-2242. [PMID: 32808729 PMCID: PMC7702165 DOI: 10.1002/cphc.202000518] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 08/14/2020] [Indexed: 11/09/2022]

Karp PD, Ong WK, Paley S, Billington R, Caspi R, Fulcher C, Kothari A, Krummenacker M, Latendresse M, Midford PE, Subhraveti P, Gama-Castro S, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Collado-Vides J, Keseler IM, Paulsen I. The EcoCyc Database. EcoSal Plus 2018;8:10.1128/ecosalplus.ESP-0006-2018. [PMID: 30406744 PMCID: PMC6504970 DOI: 10.1128/ecosalplus.esp-0006-2018] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Indexed: 01/28/2023]

Affiliation(s)

Peter D Karp Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Wai Kit Ong Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Suzanne Paley Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Richard Billington Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Ron Caspi Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Carol Fulcher Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Anamika Kothari Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Markus Krummenacker Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Mario Latendresse Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Peter E Midford Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Pallavi Subhraveti Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Socorro Gama-Castro Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Luis Muñiz-Rascado Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
César Bonavides-Martinez Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Alberto Santos-Zavaleta Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Amanda Mackie Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia
Julio Collado-Vides Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Ingrid M Keseler Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Ian Paulsen Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia

Collapse

Burns JA, Pittis AA, Kim E. Gene-based predictive models of trophic modes suggest Asgard archaea are not phagocytotic. Nat Ecol Evol 2018;2:697-704. [DOI: 10.1038/s41559-018-0477-7] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 01/11/2018] [Indexed: 12/24/2022]

Kaur H, Das C, Mande SS. In Silico Analysis of Putrefaction Pathways in Bacteria and Its Implication in Colorectal Cancer. Front Microbiol 2017;8:2166. [PMID: 29163445 PMCID: PMC5682003 DOI: 10.3389/fmicb.2017.02166] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 10/23/2017] [Indexed: 12/15/2022] Open

From Genomes to Phenotypes: Traitar, the Microbial Trait Analyzer. mSystems 2016;1:mSystems00101-16. [PMID: 28066816 PMCID: PMC5192078 DOI: 10.1128/msystems.00101-16] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Accepted: 11/12/2016] [Indexed: 01/17/2023] Open

Abstract

Bacteria are ubiquitous in our ecosystem and have a major impact on human health, e.g., by supporting digestion in the human gut. Bacterial communities can also aid in biotechnological processes such as wastewater treatment or decontamination of polluted soils. Diverse bacteria contribute with their unique capabilities to the functioning of such ecosystems, but lab experiments to investigate those capabilities are labor-intensive. Major advances in sequencing techniques open up the opportunity to study bacteria by their genome sequences. For this purpose, we have developed Traitar, software that predicts traits of bacteria on the basis of their genomes. It is applicable to studies with tens or hundreds of bacterial genomes. Traitar may help researchers in microbiology to pinpoint the traits of interest, reducing the amount of wet lab work required.

The number of sequenced genomes is growing exponentially, profoundly shifting the bottleneck from data generation to genome interpretation. Traits are often used to characterize and distinguish bacteria and are likely a driving factor in microbial community composition, yet little is known about the traits of most microbes. We describe Traitar, the microbial trait analyzer, which is a fully automated software package for deriving phenotypes from a genome sequence. Traitar provides phenotype classifiers to predict 67 traits related to the use of various substrates as carbon and energy sources, oxygen requirement, morphology, antibiotic susceptibility, proteolysis, and enzymatic activities. Furthermore, it suggests protein families associated with the presence of particular phenotypes. Our method uses L1-regularized L2-loss support vector machines for phenotype assignments based on phyletic patterns of protein families and their evolutionary histories across a diverse set of microbial species. We demonstrate reliable phenotype assignment for Traitar to bacterial genomes from 572 species of eight phyla, also based on incomplete single-cell genomes and simulated draft genomes. We also showcase its application in metagenomics by verifying and complementing a manual metabolic reconstruction of two novel Clostridiales species based on draft genomes recovered from commercial biogas reactors. Traitar is available at https://github.com/hzi-bifo/traitar.

IMPORTANCE Bacteria are ubiquitous in our ecosystem and have a major impact on human health, e.g., by supporting digestion in the human gut. Bacterial communities can also aid in biotechnological processes such as wastewater treatment or decontamination of polluted soils. Diverse bacteria contribute with their unique capabilities to the functioning of such ecosystems, but lab experiments to investigate those capabilities are labor-intensive. Major advances in sequencing techniques open up the opportunity to study bacteria by their genome sequences. For this purpose, we have developed Traitar, software that predicts traits of bacteria on the basis of their genomes. It is applicable to studies with tens or hundreds of bacterial genomes. Traitar may help researchers in microbiology to pinpoint the traits of interest, reducing the amount of wet lab work required.

Collapse

Brbić M, Piškorec M, Vidulin V, Kriško A, Šmuc T, Supek F. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 2016;44:10074-10090. [PMID: 27915291 PMCID: PMC5137458 DOI: 10.1093/nar/gkw964] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Revised: 09/21/2016] [Accepted: 10/11/2016] [Indexed: 12/31/2022] Open

Gasteiger J. Explorations into Chemical Reactions and Biochemical Pathways. Mol Inform 2016;35:588-592. [DOI: 10.1002/minf.201600038] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 04/25/2016] [Indexed: 11/07/2022]

Tamames J, Sánchez PD, Nikel PI, Pedrós-Alió C. Quantifying the Relative Importance of Phylogeny and Environmental Preferences As Drivers of Gene Content in Prokaryotic Microorganisms. Front Microbiol 2016;7:433. [PMID: 27065987 PMCID: PMC4814473 DOI: 10.3389/fmicb.2016.00433] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 03/17/2016] [Indexed: 01/15/2023] Open

Chemoinformatics: Achievements and Challenges, a Personal View. Molecules 2016;21:151. [PMID: 26828468 PMCID: PMC6273366 DOI: 10.3390/molecules21020151] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Revised: 01/14/2016] [Accepted: 01/20/2016] [Indexed: 11/16/2022] Open

Burns JA, Paasch A, Narechania A, Kim E. Comparative Genomics of a Bacterivorous Green Alga Reveals Evolutionary Causalities and Consequences of Phago-Mixotrophic Mode of Nutrition. Genome Biol Evol 2015. [PMID: 26224703 PMCID: PMC5741210 DOI: 10.1093/gbe/evv144] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Karp PD, Weaver D, Paley S, Fulcher C, Kubo A, Kothari A, Krummenacker M, Subhraveti P, Weerasinghe D, Gama-Castro S, Huerta AM, Muñiz-Rascado L, Bonavides-Martinez C, Weiss V, Peralta-Gil M, Santos-Zavaleta A, Schröder I, Mackie A, Gunsalus R, Collado-Vides J, Keseler IM, Paulsen I. The EcoCyc Database. EcoSal Plus 2014;6:10.1128/ecosalplus.ESP-0009-2013. [PMID: 26442933 PMCID: PMC4243172 DOI: 10.1128/ecosalplus.esp-0009-2013] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Indexed: 11/20/2022]

Affiliation(s)

Peter D Karp Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Daniel Weaver Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Suzanne Paley Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Carol Fulcher Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Aya Kubo Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Anamika Kothari Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Markus Krummenacker Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Pallavi Subhraveti Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Deepika Weerasinghe Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Socorro Gama-Castro Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Araceli M Huerta Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Luis Muñiz-Rascado Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
César Bonavides-Martinez Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Verena Weiss Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Martin Peralta-Gil Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Alberto Santos-Zavaleta Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Imke Schröder Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095 UCLA Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095
Amanda Mackie Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia
Robert Gunsalus Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095
Julio Collado-Vides Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
Ingrid M Keseler Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
Ian Paulsen Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia

Collapse

Gasteiger J. Some solved and unsolved problems of chemoinformatics. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2014;25:443-455. [PMID: 24716817 DOI: 10.1080/1062936x.2014.898688] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Metatranscriptomics of the human oral microbiome during health and disease. mBio 2014;5:e01012-14. [PMID: 24692635 PMCID: PMC3977359 DOI: 10.1128/mbio.01012-14] [Citation(s) in RCA: 239] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open

Abstract

The human microbiome plays important roles in health, but when disrupted, these same indigenous microbes can cause disease. The composition of the microbiome changes during the transition from health to disease; however, these changes are often not conserved among patients. Since microbiome-associated diseases like periodontitis cause similar patient symptoms despite interpatient variability in microbial community composition, we hypothesized that human-associated microbial communities undergo conserved changes in metabolism during disease. Here, we used patient-matched healthy and diseased samples to compare gene expression of 160,000 genes in healthy and diseased periodontal communities. We show that health- and disease-associated communities exhibit defined differences in metabolism that are conserved between patients. In contrast, the metabolic gene expression of individual species was highly variable between patients. These results demonstrate that despite high interpatient variability in microbial composition, disease-associated communities display conserved metabolic profiles that are generally accomplished by a patient-specific cohort of microbes. IMPORTANCE The human microbiome project has shown that shifts in our microbiota are associated with many diseases, including obesity, Crohn's disease, diabetes, and periodontitis. While changes in microbial populations are apparent during these diseases, the species associated with each disease can vary from patient to patient. Taking into account this interpatient variability, we hypothesized that specific microbiota-associated diseases would be marked by conserved microbial community behaviors. Here, we use gene expression analyses of patient-matched healthy and diseased human periodontal plaque to show that microbial communities have highly conserved metabolic gene expression profiles, whereas individual species within the community do not. Furthermore, disease-associated communities exhibit conserved changes in metabolic and virulence gene expression.

Collapse

Combining chemoinformatics with bioinformatics: in silico prediction of bacterial flavor-forming pathways by a chemical systems biology approach "reverse pathway engineering". PLoS One 2014;9:e84769. [PMID: 24416282 PMCID: PMC3885609 DOI: 10.1371/journal.pone.0084769] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 11/18/2013] [Indexed: 12/05/2022] Open

Boon E, Meehan CJ, Whidden C, Wong DHJ, Langille MGI, Beiko RG. Interactions in the microbiome: communities of organisms and communities of genes. FEMS Microbiol Rev 2014;38:90-118. [PMID: 23909933 PMCID: PMC4298764 DOI: 10.1111/1574-6976.12035] [Citation(s) in RCA: 119] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2013] [Revised: 07/02/2013] [Accepted: 07/10/2013] [Indexed: 12/17/2022] Open

Konietzny SGA, Pope PB, Weimann A, McHardy AC. Inference of phenotype-defining functional modules of protein families for microbial plant biomass degraders. BIOTECHNOLOGY FOR BIOFUELS 2014;7:124. [PMID: 25342967 PMCID: PMC4189754 DOI: 10.1186/s13068-014-0124-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2014] [Accepted: 08/05/2014] [Indexed: 05/14/2023]

Abstract

BACKGROUND

Efficient industrial processes for converting plant lignocellulosic materials into biofuels are a key to global efforts to come up with alternative energy sources to fossil fuels. Novel cellulolytic enzymes have been discovered in microbial genomes and metagenomes of microbial communities. However, the identification of relevant genes without known homologs, and the elucidation of the lignocellulolytic pathways and protein complexes for different microorganisms remain challenging.

RESULTS

We describe a new computational method for the targeted discovery of functional modules of plant biomass-degrading protein families, based on their co-occurrence patterns across genomes and metagenome datasets, and the strength of association of these modules with the genomes of known degraders. From approximately 6.4 million family annotations for 2,884 microbial genomes, and 332 taxonomic bins from 18 metagenomes, we identified 5 functional modules that are distinctive for plant biomass degraders, which we term "plant biomass degradation modules" (PDMs). These modules incorporate protein families involved in the degradation of cellulose, hemicelluloses, and pectins, structural components of the cellulosome, and additional families with potential functions in plant biomass degradation. The PDMs were linked to 81 gene clusters in genomes of known lignocellulose degraders, including previously described clusters of lignocellulolytic genes. On average, 70% of the families of each PDM were found to map to gene clusters in known degraders, which served as an additional confirmation of their functional relationships. The presence of a PDM in a genome or taxonomic metagenome bin furthermore allowed us to accurately predict the ability of any particular organism to degrade plant biomass. For 15 draft genomes of a cow rumen metagenome, we used cross-referencing to confirmed cellulolytic enzymes to validate that the PDMs identified plant biomass degraders within a complex microbial community.

CONCLUSIONS

Functional modules of protein families that are involved in different aspects of plant cell wall degradation can be inferred from co-occurrence patterns across (meta-)genomes with a probabilistic topic model. PDMs represent a new resource of protein families and candidate genes implicated in microbial plant biomass degradation. They can also be used to predict the plant biomass degradation ability for a genome or taxonomic bin. The method is also suitable for characterizing other microbial phenotypes.

Collapse

Psomopoulos FE, Mitkas PA, Ouzounis CA. Detection of genomic idiosyncrasies using fuzzy phylogenetic profiles. PLoS One 2013;8:e52854. [PMID: 23341912 PMCID: PMC3544837 DOI: 10.1371/journal.pone.0052854] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 11/22/2012] [Indexed: 11/18/2022] Open

In-silico identification of phenotype-biased functional modules. Proteome Sci 2012;10 Suppl 1:S2. [PMID: 22759578 PMCID: PMC3380726 DOI: 10.1186/1477-5956-10-s1-s2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Schmidt MC, Rocha AM, Padmanabhan K, Shpanskaya Y, Banfield J, Scott K, Mihelcic JR, Samatova NF. NIBBS-search for fast and accurate prediction of phenotype-biased metabolic systems. PLoS Comput Biol 2012;8:e1002490. [PMID: 22589706 PMCID: PMC3349732 DOI: 10.1371/journal.pcbi.1002490] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2011] [Accepted: 03/08/2012] [Indexed: 02/07/2023] Open

Abstract

Understanding of genotype-phenotype associations is important not only for furthering our knowledge on internal cellular processes, but also essential for providing the foundation necessary for genetic engineering of microorganisms for industrial use (e.g., production of bioenergy or biofuels). However, genotype-phenotype associations alone do not provide enough information to alter an organism's genome to either suppress or exhibit a phenotype. It is important to look at the phenotype-related genes in the context of the genome-scale network to understand how the genes interact with other genes in the organism. Identification of metabolic subsystems involved in the expression of the phenotype is one way of placing the phenotype-related genes in the context of the entire network. A metabolic system refers to a metabolic network subgraph; nodes are compounds and edges labels are the enzymes that catalyze the reaction. The metabolic subsystem could be part of a single metabolic pathway or span parts of multiple pathways. Arguably, comparative genome-scale metabolic network analysis is a promising strategy to identify these phenotype-related metabolic subsystems. Network Instance-Based Biased Subgraph Search (NIBBS) is a graph-theoretic method for genome-scale metabolic network comparative analysis that can identify metabolic systems that are statistically biased toward phenotype-expressing organismal networks. We set up experiments with target phenotypes like hydrogen production, TCA expression, and acid-tolerance. We show via extensive literature search that some of the resulting metabolic subsystems are indeed phenotype-related and formulate hypotheses for other systems in terms of their role in phenotype expression. NIBBS is also orders of magnitude faster than MULE, one of the most efficient maximal frequent subgraph mining algorithms that could be adjusted for this problem. Also, the set of phenotype-biased metabolic systems output by NIBBS comes very close to the set of phenotype-biased subgraphs output by an exact maximally-biased subgraph enumeration algorithm ( MBS-Enum ). The code (NIBBS and the module to visualize the identified subsystems) is available at http://freescience.org/cs/NIBBS.

Collapse

Use of comparative genomics approaches to characterize interspecies differences in response to environmental chemicals: challenges, opportunities, and research needs. Toxicol Appl Pharmacol 2011;271:372-85. [PMID: 22142766 DOI: 10.1016/j.taap.2011.11.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2011] [Revised: 11/11/2011] [Accepted: 11/16/2011] [Indexed: 01/12/2023]

Abstract

A critical challenge for environmental chemical risk assessment is the characterization and reduction of uncertainties introduced when extrapolating inferences from one species to another. The purpose of this article is to explore the challenges, opportunities, and research needs surrounding the issue of how genomics data and computational and systems level approaches can be applied to inform differences in response to environmental chemical exposure across species. We propose that the data, tools, and evolutionary framework of comparative genomics be adapted to inform interspecies differences in chemical mechanisms of action. We compare and contrast existing approaches, from disciplines as varied as evolutionary biology, systems biology, mathematics, and computer science, that can be used, modified, and combined in new ways to discover and characterize interspecies differences in chemical mechanism of action which, in turn, can be explored for application to risk assessment. We consider how genetic, protein, pathway, and network information can be interrogated from an evolutionary biology perspective to effectively characterize variations in biological processes of toxicological relevance among organisms. We conclude that comparative genomics approaches show promise for characterizing interspecies differences in mechanisms of action, and further, for improving our understanding of the uncertainties inherent in extrapolating inferences across species in both ecological and human health risk assessment. To achieve long-term relevance and consistent use in environmental chemical risk assessment, improved bioinformatics tools, computational methods robust to data gaps, and quantitative approaches for conducting extrapolations across species are critically needed. Specific areas ripe for research to address these needs are recommended.

Collapse

Whitmore SE, Lamont RJ. The pathogenic persona of community-associated oral streptococci. Mol Microbiol 2011;81:305-14. [PMID: 21635580 DOI: 10.1111/j.1365-2958.2011.07707.x] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Lingner T, Mühlhausen S, Gabaldón T, Notredame C, Meinicke P. Predicting phenotypic traits of prokaryotes from protein domain frequencies. BMC Bioinformatics 2010;11:481. [PMID: 20868492 PMCID: PMC2955703 DOI: 10.1186/1471-2105-11-481] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2010] [Accepted: 09/24/2010] [Indexed: 12/03/2022] Open

MacDonald NJ, Beiko RG. Efficient learning of microbial genotype-phenotype association rules. ACTA ACUST UNITED AC 2010;26:1834-40. [PMID: 20529891 DOI: 10.1093/bioinformatics/btq305] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Kuboniwa M, Lamont RJ. Subgingival biofilm formation. Periodontol 2000 2010;52:38-52. [PMID: 20017794 DOI: 10.1111/j.1600-0757.2009.00311.x] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Heinemann M, Sauer U. Systems biology of microbial metabolism. Curr Opin Microbiol 2010;13:337-43. [PMID: 20219420 DOI: 10.1016/j.mib.2010.02.005] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2010] [Accepted: 02/13/2010] [Indexed: 12/20/2022]

Dale JM, Popescu L, Karp PD. Machine learning methods for metabolic pathway prediction. BMC Bioinformatics 2010;11:15. [PMID: 20064214 PMCID: PMC3146072 DOI: 10.1186/1471-2105-11-15] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2009] [Accepted: 01/08/2010] [Indexed: 12/29/2022] Open

Abstract

Background

A key challenge in systems biology is the reconstruction of an organism's metabolic network from its genome sequence. One strategy for addressing this problem is to predict which metabolic pathways, from a reference database of known pathways, are present in the organism, based on the annotated genome of the organism.

Results

To quantitatively validate methods for pathway prediction, we developed a large "gold standard" dataset of 5,610 pathway instances known to be present or absent in curated metabolic pathway databases for six organisms. We defined a collection of 123 pathway features, whose information content we evaluated with respect to the gold standard. Feature data were used as input to an extensive collection of machine learning (ML) methods, including naïve Bayes, decision trees, and logistic regression, together with feature selection and ensemble methods. We compared the ML methods to the previous PathoLogic algorithm for pathway prediction using the gold standard dataset. We found that ML-based prediction methods can match the performance of the PathoLogic algorithm. PathoLogic achieved an accuracy of 91% and an F-measure of 0.786. The ML-based prediction methods achieved accuracy as high as 91.2% and F-measure as high as 0.787. The ML-based methods output a probability for each predicted pathway, whereas PathoLogic does not, which provides more information to the user and facilitates filtering of predicted pathways.

Conclusions

ML methods for pathway prediction perform as well as existing methods, and have qualitative advantages in terms of extensibility, tunability, and explainability. More advanced prediction methods and/or more sophisticated input features may improve the performance of ML methods. However, pathway prediction performance appears to be limited largely by the ability to correctly match enzymes to the reactions they catalyze based on genome annotations.

Collapse