Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Brbić M, Piškorec M, Vidulin V, Kriško A, Šmuc T, Supek F. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 2016;44:10074-10090. [PMID: 27915291 PMCID: PMC5137458 DOI: 10.1093/nar/gkw964] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Revised: 09/21/2016] [Accepted: 10/11/2016] [Indexed: 12/31/2022] Open

For:	Brbić M, Piškorec M, Vidulin V, Kriško A, Šmuc T, Supek F. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 2016;44:10074-10090. [PMID: 27915291 PMCID: PMC5137458 DOI: 10.1093/nar/gkw964] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Revised: 09/21/2016] [Accepted: 10/11/2016] [Indexed: 12/31/2022] Open

Number

Cited by Other Article(s)

Sonnert ND, Rosen CE, Ghazi AR, Franzosa EA, Duncan-Lowey B, González-Hernández JA, Huck JD, Yang Y, Dai Y, Rice TA, Nguyen MT, Song D, Cao Y, Martin AL, Bielecka AA, Fischer S, Guan C, Oh J, Huttenhower C, Ring AM, Palm NW. A host-microbiota interactome reveals extensive transkingdom connectivity. Nature 2024;628:171-179. [PMID: 38509360 DOI: 10.1038/s41586-024-07162-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Accepted: 02/05/2024] [Indexed: 03/22/2024]

Abstract

The myriad microorganisms that live in close association with humans have diverse effects on physiology, yet the molecular bases for these impacts remain mostly unknown1-3. Classical pathogens often invade host tissues and modulate immune responses through interactions with human extracellular and secreted proteins (the 'exoproteome'). Commensal microorganisms may also facilitate niche colonization and shape host biology by engaging host exoproteins; however, direct exoproteome-microbiota interactions remain largely unexplored. Here we developed and validated a novel technology, BASEHIT, that enables proteome-scale assessment of human exoproteome-microbiome interactions. Using BASEHIT, we interrogated more than 1.7 million potential interactions between 519 human-associated bacterial strains from diverse phylogenies and tissues of origin and 3,324 human exoproteins. The resulting interactome revealed an extensive network of transkingdom connectivity consisting of thousands of previously undescribed host-microorganism interactions involving 383 strains and 651 host proteins. Specific binding patterns within this network implied underlying biological logic; for example, conspecific strains exhibited shared exoprotein-binding patterns, and individual tissue isolates uniquely bound tissue-specific exoproteins. Furthermore, we observed dozens of unique and often strain-specific interactions with potential roles in niche colonization, tissue remodelling and immunomodulation, and found that strains with differing host interaction profiles had divergent interactions with host cells in vitro and effects on the host immune system in vivo. Overall, these studies expose a previously unexplored landscape of molecular-level host-microbiota interactions that may underlie causal effects of indigenous microorganisms on human health and disease.

Collapse

Affiliation(s)

Nicole D Sonnert Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA Department of Microbial Pathogenesis, Yale School of Medicine, New Haven, CT, USA
Connor E Rosen Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Andrew R Ghazi Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Eric A Franzosa Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Brianna Duncan-Lowey Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Jaime A González-Hernández Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
John D Huck Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Yi Yang Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Yile Dai Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Tyler A Rice Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Mytien T Nguyen Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Deguang Song Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Yiyun Cao Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Anjelica L Martin Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Agata A Bielecka Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Suzanne Fischer Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Changhui Guan The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
Julia Oh The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
Curtis Huttenhower Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Aaron M Ring Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA. Department of Pharmacology, Yale School of Medicine, New Haven, CT, USA.
Noah W Palm Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA.

Collapse

Zhao H, Li Z, Chen W, Zheng Z, Xie S. Accelerated Partially Shared Dictionary Learning With Differentiable Scale-Invariant Sparsity for Multi-View Clustering. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023;34:8825-8839. [PMID: 35254997 DOI: 10.1109/tnnls.2022.3153310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Abstract

Multiview dictionary learning (DL) is attracting attention in multiview clustering due to the efficient feature learning ability. However, most existing multiview DL algorithms are facing problems in fully utilizing consistent and complementary information simultaneously in the multiview data and learning the most precise representation for multiview clustering because of gaps between views. This article proposes an efficient multiview DL algorithm for multiview clustering, which uses the partially shared DL model with a flexible ratio of shared sparse coefficients to excavate both consistency and complementarity in the multiview data. In particular, a differentiable scale-invariant function is used as the sparsity regularizer, which considers the absolute sparsity of coefficients as the l0 norm regularizer but is continuous and differentiable almost everywhere. The corresponding optimization problem is solved by the proximal splitting method with extrapolation technology; moreover, the proximal operator of the differentiable scale-invariant regularizer can be derived. The synthetic experiment results demonstrate that the proposed algorithm can recover the synthetic dictionary well with reasonable convergence time costs. Multiview clustering experiments include six real-world multiview datasets, and the performances show that the proposed algorithm is not sensitive to the regularizer parameter as the other algorithms. Furthermore, an appropriate coefficient sharing ratio can help to exploit consistent information while keeping complementary information from multiview data and thus enhance performances in multiview clustering. In addition, the convergence performances show that the proposed algorithm can obtain the best performances in multiview clustering among compared algorithms and can converge faster than compared multiview algorithms mostly.

Collapse

Karlsen ST, Rau MH, Sánchez BJ, Jensen K, Zeidan AA. From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry. FEMS Microbiol Rev 2023;47:fuad030. [PMID: 37286882 PMCID: PMC10337747 DOI: 10.1093/femsre/fuad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/09/2023] Open

Zhao J, Wang X, Zou Q, Kang F, Peng J, Wang F. On improvability of hash clustering data from different sources by bipartite graph. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01125-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Gemler BT, Mukherjee C, Howland CA, Huk D, Shank Z, Harbo LJ, Tabbaa OP, Bartling CM. Function-based classification of hazardous biological sequences: Demonstration of a new paradigm for biohazard assessments. Front Bioeng Biotechnol 2022;10:979497. [PMID: 36277394 PMCID: PMC9585941 DOI: 10.3389/fbioe.2022.979497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 08/31/2022] [Indexed: 12/04/2022] Open

Abstract

Bioengineering applies analytical and engineering principles to identify functional biological building blocks for biotechnology applications. While these building blocks are leveraged to improve the human condition, the lack of simplistic, machine-readable definition of biohazards at the function level is creating a gap for biosafety practices. More specifically, traditional safety practices focus on the biohazards of known pathogens at the organism-level and may not accurately consider novel biodesigns with engineered functionalities at the genetic component-level. This gap is motivating the need for a paradigm shift from organism-centric procedures to function-centric biohazard identification and classification practices. To address this challenge, we present a novel methodology for classifying biohazards at the individual sequence level, which we then compiled to distinguish the biohazardous property of pathogenicity at the whole genome level. Our methodology is rooted in compilation of hazardous functions, defined as a set of sequences and associated metadata that describe coarse-level functions associated with pathogens (e.g., adherence, immune subversion). We demonstrate that the resulting database can be used to develop hazardous “fingerprints” based on the functional metadata categories. We verified that these hazardous functions are found at higher levels in pathogens compared to non-pathogens, and hierarchical clustering of the fingerprints can distinguish between these two groups. The methodology presented here defines the hazardous functions associated with bioengineering functional building blocks at the sequence level, which provide a foundational framework for classifying biological hazards at the organism level, thus leading to the improvement and standardization of current biosecurity and biosafety practices.

Collapse

Karaoz U, Brodie EL. microTrait: A Toolset for a Trait-Based Representation of Microbial Genomes. FRONTIERS IN BIOINFORMATICS 2022;2:918853. [PMID: 36304272 PMCID: PMC9580909 DOI: 10.3389/fbinf.2022.918853] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 06/20/2022] [Indexed: 11/29/2023] Open

Abstract

Remote sensing approaches have revolutionized the study of macroorganisms, allowing theories of population and community ecology to be tested across increasingly larger scales without much compromise in resolution of biological complexity. In microbial ecology, our remote window into the ecology of microorganisms is through the lens of genome sequencing. For microbial organisms, recent evidence from genomes recovered from metagenomic samples corroborate a highly complex view of their metabolic diversity and other associated traits which map into high physiological complexity. Regardless, during the first decades of this omics era, microbial ecological research has primarily focused on taxa and functional genes as ecological units, favoring breadth of coverage over resolution of biological complexity manifested as physiological diversity. Recently, the rate at which provisional draft genomes are generated has increased substantially, giving new insights into ecological processes and interactions. From a genotype perspective, the wide availability of genome-centric data requires new data synthesis approaches that place organismal genomes center stage in the study of environmental roles and functional performance. Extraction of ecologically relevant traits from microbial genomes will be essential to the future of microbial ecological research. Here, we present microTrait, a computational pipeline that infers and distills ecologically relevant traits from microbial genome sequences. microTrait maps a genome sequence into a trait space, including discrete and continuous traits, as well as simple and composite. Traits are inferred from genes and pathways representing energetic, resource acquisition, and stress tolerance mechanisms, while genome-wide signatures are used to infer composite, or life history, traits of microorganisms. This approach is extensible to any microbial habitat, although we provide initial examples of this approach with reference to soil microbiomes.

Collapse

Lu Y, Li Q, Li T. PPA-GCN: A Efficient GCN Framework for Prokaryotic Pathways Assignment. Front Genet 2022;13:839453. [PMID: 35444686 PMCID: PMC9013948 DOI: 10.3389/fgene.2022.839453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 03/17/2022] [Indexed: 11/17/2022] Open

Hu EZ, Lan XR, Liu ZL, Gao J, Niu DK. A positive correlation between GC content and growth temperature in prokaryotes. BMC Genomics 2022;23:110. [PMID: 35139824 PMCID: PMC8827189 DOI: 10.1186/s12864-022-08353-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 01/31/2022] [Indexed: 01/27/2023] Open

Abstract

Background

GC pairs are generally more stable than AT pairs; GC-rich genomes were proposed to be more adapted to high temperatures than AT-rich genomes. Previous studies consistently showed positive correlations between growth temperature and the GC contents of structural RNA genes. However, for the whole genome sequences and the silent sites of the codons in protein-coding genes, the relationship between GC content and growth temperature is in a long-lasting debate.

Results

With a dataset much larger than previous studies (681 bacteria and 155 archaea with completely assembled genomes), our phylogenetic comparative analyses showed positive correlations between optimal growth temperature (Topt) and GC content both in bacterial and archaeal structural RNA genes and in bacterial whole genome sequences, chromosomal sequences, plasmid sequences, core genes, and accessory genes. However, in the 155 archaea, we did not observe a significant positive correlation of Topt with whole-genome GC content (GC_w) or GC content at four-fold degenerate sites. We randomly drew 155 samples from the 681 bacteria for 1000 rounds. In most cases (> 95%), the positive correlations between Topt and genomic GC contents became statistically nonsignificant (P > 0.05). This result suggested that the small sample sizes might account for the lack of positive correlations between growth temperature and genomic GC content in the 155 archaea and the bacterial samples of previous studies. Comparing the GC content among four categories (psychrophiles/psychrotrophiles, mesophiles, thermophiles, and hyperthermophiles) also revealed a positive correlation between GC_w and growth temperature in bacteria. By including the GC_w of incompletely assembled genomes, we expanded the sample size of archaea to 303. Positive correlations between GC_w and Topt appear especially after excluding the halophilic archaea whose GC contents might be strongly shaped by intense UV radiation.

Conclusions

This study explains the previous contradictory observations and ends a long debate. Prokaryotes growing in high temperatures have higher GC contents. Thermal adaptation is one possible explanation for the positive association. Meanwhile, we propose that the elevated efficiency of DNA repair in response to heat mutagenesis might have the by-product of increasing GC content like that happens in intracellular symbionts and marine bacterioplankton.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-022-08353-7.

Collapse

Ben Khedher M, Ghedira K, Rolain JM, Ruimy R, Croce O. Application and Challenge of 3rd Generation Sequencing for Clinical Bacterial Studies. Int J Mol Sci 2022;23:1395. [PMID: 35163319 PMCID: PMC8835973 DOI: 10.3390/ijms23031395] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 01/20/2022] [Accepted: 01/24/2022] [Indexed: 02/04/2023] Open

Barnett SE, Youngblut ND, Koechli CN, Buckley DH. Multisubstrate DNA stable isotope probing reveals guild structure of bacteria that mediate soil carbon cycling. Proc Natl Acad Sci U S A 2021;118:e2115292118. [PMID: 34799453 PMCID: PMC8617410 DOI: 10.1073/pnas.2115292118] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 10/10/2021] [Indexed: 11/18/2022] Open

Weissman JL, Dogra S, Javadi K, Bolten S, Flint R, Davati C, Beattie J, Dixit K, Peesay T, Awan S, Thielen P, Breitwieser F, Johnson PLF, Karig D, Fagan WF, Bewick S. Exploring the functional composition of the human microbiome using a hand-curated microbial trait database. BMC Bioinformatics 2021;22:306. [PMID: 34098872 PMCID: PMC8186035 DOI: 10.1186/s12859-021-04216-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 05/25/2021] [Indexed: 12/15/2022] Open

Zhang GY, Chen XW, Zhou YR, Wang CD, Huang D, He XY. Kernelized multi-view subspace clustering via auto-weighted graph learning. APPL INTELL 2021. [DOI: 10.1007/s10489-021-02365-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Zimmermann J, Kaleta C, Waschina S. gapseq: informed prediction of bacterial metabolic pathways and reconstruction of accurate metabolic models. Genome Biol 2021;22:81. [PMID: 33691770 PMCID: PMC7949252 DOI: 10.1186/s13059-021-02295-1] [Citation(s) in RCA: 86] [Impact Index Per Article: 28.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 02/10/2021] [Indexed: 12/21/2022] Open

Discovering microbe-disease associations from the literature using a hierarchical long short-term memory network and an ensemble parser model. Sci Rep 2021;11:4490. [PMID: 33627732 PMCID: PMC7904816 DOI: 10.1038/s41598-021-83966-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Accepted: 02/08/2021] [Indexed: 02/07/2023] Open

Cauchy loss induced block diagonal representation for robust multi-view subspace clustering. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.11.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

A synthesis of bacterial and archaeal phenotypic trait data. Sci Data 2020;7:170. [PMID: 32503990 PMCID: PMC7275036 DOI: 10.1038/s41597-020-0497-4] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 04/20/2020] [Indexed: 11/08/2022] Open

Hornischer K, Khaledi A, Pohl S, Schniederjans M, Pezoldt L, Casilag F, Muthukumarasamy U, Bruchmann S, Thöming J, Kordes A, Häussler S. BACTOME-a reference database to explore the sequence- and gene expression-variation landscape of Pseudomonas aeruginosa clinical isolates. Nucleic Acids Res 2020;47:D716-D720. [PMID: 30272193 PMCID: PMC6324029 DOI: 10.1093/nar/gky895] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 09/21/2018] [Indexed: 12/26/2022] Open

Affiliation(s)

Klaus Hornischer Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany.,Molecular Health GmbH, D-69115 Heidelberg, Germany
Ariane Khaledi Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Sarah Pohl Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Monika Schniederjans Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Lorena Pezoldt Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Fiordiligie Casilag Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Uthayakumar Muthukumarasamy Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Sebastian Bruchmann Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany.,Pathogen Genomics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK
Janne Thöming Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Adrian Kordes Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany
Susanne Häussler Institute of Molecular Bacteriology, Helmholtz Centre for Infection Research, D-38124 Braunschweig, Germany.,Institute of Molecular Bacteriology, TWINCORE GmbH, Center for Clinical and Experimental Infection Research, D-30625 Hannover, Germany

Collapse

San JE, Baichoo S, Kanzi A, Moosa Y, Lessells R, Fonseca V, Mogaka J, Power R, de Oliveira T. Current Affairs of Microbial Genome-Wide Association Studies: Approaches, Bottlenecks and Analytical Pitfalls. Front Microbiol 2020;10:3119. [PMID: 32082269 PMCID: PMC7002396 DOI: 10.3389/fmicb.2019.03119] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 12/24/2019] [Indexed: 12/12/2022] Open

Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype. Sci Rep 2019;9:19537. [PMID: 31863070 PMCID: PMC6925100 DOI: 10.1038/s41598-019-55984-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 12/02/2019] [Indexed: 01/01/2023] Open

Weissman JL, Fagan WF, Johnson PLF. Linking high GC content to the repair of double strand breaks in prokaryotic genomes. PLoS Genet 2019;15:e1008493. [PMID: 31703064 PMCID: PMC6867656 DOI: 10.1371/journal.pgen.1008493] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 11/20/2019] [Accepted: 10/25/2019] [Indexed: 01/21/2023] Open

Abstract

Genomic GC content varies widely among microbes for reasons unknown. While mutation bias partially explains this variation, prokaryotes near-universally have a higher GC content than predicted solely by this bias. Debate surrounds the relative importance of the remaining explanations of selection versus biased gene conversion favoring GC alleles. Some environments (e.g. soils) are associated with a high genomic GC content of their inhabitants, which implies that either high GC content is a selective adaptation to particular habitats, or that certain habitats favor increased rates of gene conversion. Here, we report a novel association between the presence of the non-homologous end joining DNA double-strand break repair pathway and GC content; this observation suggests that DNA damage may be a fundamental driver of GC content, leading in part to the many environmental patterns observed to-date. We discuss potential mechanisms accounting for the observed association, and provide preliminary evidence that sites experiencing higher rates of double-strand breaks are under selection for increased GC content relative to the genomic background.

The overall nucleotide composition of an organism’s genome varies greatly between species. Previous work has identified certain environmental factors (e.g., oxygen availability) associated with the relative number of GC bases as opposed to AT bases in the genomes of species. Many of these environments that are associated with high GC content are also associated with relatively high rates of DNA damage. We show that organisms possessing the non-homologous end-joining DNA repair pathway, which is one mechanism to repair DNA double-strand breaks, have an elevated GC content relative to expectation. We also show that certain sites on the genome that are particularly susceptible to double strand breaks have an elevated GC content. This leads us to suggest that an important underlying driver of variability in nucleotide composition across environments is the rate of DNA damage (specifically double-strand breaks) to which an organism living in each environment is exposed.

Collapse

Bewick S, Gurarie E, Weissman JL, Beattie J, Davati C, Flint R, Thielen P, Breitwieser F, Karig D, Fagan WF. Trait-based analysis of the human skin microbiome. MICROBIOME 2019;7:101. [PMID: 31277701 PMCID: PMC6612184 DOI: 10.1186/s40168-019-0698-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 05/19/2019] [Indexed: 05/04/2023]

Abstract

BACKGROUND

The past decade of microbiome research has concentrated on cataloging the diversity of taxa in different environments. The next decade is poised to focus on microbial traits and function. Most existing methods for doing this perform pathway analysis using reference databases. This has both benefits and drawbacks. Function can go undetected if reference databases are coarse-grained or incomplete. Likewise, detection of a pathway does not guarantee expression of the associated function. Finally, function cannot be connected to specific microbial constituents, making it difficult to ascertain the types of organisms exhibiting particular traits-something that is important for understanding microbial success in specific environments. A complementary approach to pathway analysis is to use the wealth of microbial trait information collected over years of lab-based, culture experiments.

METHODS

Here, we use journal articles and Bergey's Manual of Systematic Bacteriology to develop a trait-based database for 971 human skin bacterial taxa. We then use this database to examine functional traits that are over/underrepresented among skin taxa. Specifically, we focus on three trait classes-binary, categorical, and quantitative-and compare trait values among skin taxa and microbial taxa more broadly. We compare binary traits using a Chi-square test, categorical traits using randomization trials, and quantitative traits using a nonparametric relative effects test based on global rankings using Tukey contrasts.

RESULTS

We find a number of traits that are over/underrepresented within the human skin microbiome. For example, spore formation, acid phosphatase, alkaline phosphatase, pigment production, catalase, and oxidase are all less common among skin taxa. As well, skin bacteria are less likely to be aerobic, favoring, instead, a facultative strategy. They are also less likely to exhibit gliding motility, less likely to be spirillum or rod-shaped, and less likely to grow in chains. Finally, skin bacteria have more difficulty at high pH, prefer warmer temperatures, and are much less resilient to hypotonic conditions.

CONCLUSIONS

Our analysis shows how an approach that relies on information from culture experiments can both support findings from pathway analysis, and also generate new insights into the structuring principles of microbial communities.

Collapse

Weissman JL, Laljani RMR, Fagan WF, Johnson PLF. Visualization and prediction of CRISPR incidence in microbial trait-space to identify drivers of antiviral immune strategy. ISME JOURNAL 2019;13:2589-2602. [PMID: 31239539 PMCID: PMC6776019 DOI: 10.1038/s41396-019-0411-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 03/15/2019] [Accepted: 03/24/2019] [Indexed: 01/21/2023]

Schmutzer M, Barraclough TG. The role of recombination, niche-specific gene pools and flexible genomes in the ecological speciation of bacteria. Ecol Evol 2019;9:4544-4556. [PMID: 31031926 PMCID: PMC6476844 DOI: 10.1002/ece3.5052] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 02/16/2019] [Accepted: 02/18/2019] [Indexed: 12/21/2022] Open

Perz AI, Giles CB, Brown CA, Porter H, Roopnarinesingh X, Wren JD. MNEMONIC: MetageNomic Experiment Mining to create an OTU Network of Inhabitant Correlations. BMC Bioinformatics 2019;20:96. [PMID: 30871469 PMCID: PMC6419333 DOI: 10.1186/s12859-019-2623-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Uncovering carbohydrate metabolism through a genotype-phenotype association study of 56 lactic acid bacteria genomes. Appl Microbiol Biotechnol 2019;103:3135-3152. [PMID: 30830251 PMCID: PMC6447522 DOI: 10.1007/s00253-019-09701-6] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 02/14/2019] [Accepted: 02/14/2019] [Indexed: 11/09/2022]

Barnett SE, Youngblut ND, Buckley DH. Data Analysis for DNA Stable Isotope Probing Experiments Using Multiple Window High-Resolution SIP. Methods Mol Biol 2019;2046:109-128. [PMID: 31407300 DOI: 10.1007/978-1-4939-9721-3_9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]

Tedersoo L, Drenkhan R, Anslan S, Morales‐Rodriguez C, Cleary M. High-throughput identification and diagnostics of pathogens and pests: Overview and practical recommendations. Mol Ecol Resour 2019;19:47-76. [PMID: 30358140 PMCID: PMC7379260 DOI: 10.1111/1755-0998.12959] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Revised: 08/01/2018] [Accepted: 08/28/2018] [Indexed: 12/26/2022]

Engqvist MKM. Correlating enzyme annotations with a large set of microbial growth temperatures reveals metabolic adaptations to growth at diverse temperatures. BMC Microbiol 2018;18:177. [PMID: 30400856 PMCID: PMC6219164 DOI: 10.1186/s12866-018-1320-7] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Accepted: 10/16/2018] [Indexed: 12/15/2022] Open

Abstract

Background

The ambient temperature of all habitats is a key physical property that shapes the biology of microbes inhabiting them. The optimal growth temperature (OGT) of a microbe, is therefore a key piece of data needed to understand evolutionary adaptations manifested in their genome sequence. Unfortunately there is no growth temperature database or easily downloadable dataset encompassing the majority of cultured microorganisms. We are thus limited in interpreting genomic data to identify temperature adaptations in microbes.

Results

In this work I significantly contribute to closing this gap by mining data from major culture collection centres to obtain growth temperature data for a nonredundant set of 21,498 microbes. The dataset (10.5281/zenodo.1175608) contains mainly bacteria and archaea and spans psychrophiles, mesophiles, thermophiles and hyperthermophiles. Using this data a full 43% of all protein entries in the UniProt database can be annotated with the growth temperature of the species from which they originate. I validate the dataset by showing a Pearson correlation of up to 0.89 between growth temperature and mean enzyme optima, a physiological property directly influenced by the growth temperature. Using the temperature dataset I correlate the genomic occurance of enzyme functional annotations with growth temperature. I identify 319 enzyme functions that either increase or decrease in occurrence with temperature. Eight metabolic pathways were statistically enriched for these enzyme functions. Furthermore, I establish a correlation between 33 domains of unknown function (DUFs) with growth temperature in microbes, four of which (DUF438, DUF1524, DUF1957 and DUF3458_C) were significant in both archaea and bacteria.

Conclusions

The growth temperature dataset enables large-scale correlation analysis with enzyme function- and domain-level annotations. Growth-temperature dependent changes in their occurrence highlight potential evolutionary adaptations. A few of the identified changes are previously known, such as the preference for menaquinone biosynthesis through the futalosine pathway in bacteria growing at high temperatures. Others represent important starting points for future studies, such as DUFs where their occurrence change with temperature. The growth temperature dataset should become a valuable community resource and will find additional, important, uses in correlating genomic, transcriptomic, proteomic, metabolomic, phenotypic or taxonomic properties with temperature in future studies.

Electronic supplementary material

The online version of this article (10.1186/s12866-018-1320-7) contains supplementary material, which is available to authorized users.

Collapse

Vidulin V, Šmuc T, Džeroski S, Supek F. The evolutionary signal in metagenome phyletic profiles predicts many gene functions. MICROBIOME 2018;6:129. [PMID: 29991352 PMCID: PMC6040064 DOI: 10.1186/s40168-018-0506-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Accepted: 06/19/2018] [Indexed: 06/08/2023]

Abstract

BACKGROUND

The function of many genes is still not known even in model organisms. An increasing availability of microbiome DNA sequencing data provides an opportunity to infer gene function in a systematic manner.

RESULTS

We evaluated if the evolutionary signal contained in metagenome phyletic profiles (MPP) is predictive of a broad array of gene functions. The MPPs are an encoding of environmental DNA sequencing data that consists of relative abundances of gene families across metagenomes. We find that such MPPs can accurately predict 826 Gene Ontology functional categories, while drawing on human gut microbiomes, ocean metagenomes, and DNA sequences from various other engineered and natural environments. Overall, in this task, the MPPs are highly accurate, and moreover they provide coverage for a set of Gene Ontology terms largely complementary to standard phylogenetic profiles, derived from fully sequenced genomes. We also find that metagenomes approximated from taxon relative abundance obtained via 16S rRNA gene sequencing may provide surprisingly useful predictive models. Crucially, the MPPs derived from different types of environments can infer distinct, non-overlapping sets of gene functions and therefore complement each other. Consistently, simulations on > 5000 metagenomes indicate that the amount of data is not in itself critical for maximizing predictive accuracy, while the diversity of sampled environments appears to be the critical factor for obtaining robust models.

CONCLUSIONS

In past work, metagenomics has provided invaluable insight into ecology of various habitats, into diversity of microbial life and also into human health and disease mechanisms. We propose that environmental DNA sequencing additionally constitutes a useful tool to predict biological roles of genes, yielding inferences out of reach for existing comparative genomics approaches.

Collapse

Hockenberry AJ, Stern AJ, Amaral LAN, Jewett MC. Diversity of Translation Initiation Mechanisms across Bacterial Species Is Driven by Environmental Conditions and Growth Demands. Mol Biol Evol 2017;35:582-592. [PMID: 29220489 PMCID: PMC5850609 DOI: 10.1093/molbev/msx310] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open