Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pasolli E, Truong DT, Malik F, Waldron L, Segata N. Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights. PLoS Comput Biol 2016;12:e1004977. [PMID: 27400279 PMCID: PMC4939962 DOI: 10.1371/journal.pcbi.1004977] [Citation(s) in RCA: 294] [Impact Index Per Article: 36.8] [Received: 11/25/2015] [Accepted: 05/11/2016] [Indexed: 12/12/2022] Open

Number

Cited by Other Article(s)

151

Separation of Donor and Recipient Microbial Diversity Allows Determination of Taxonomic and Functional Features of Gut Microbiota Restructuring following Fecal Transplantation. mSystems 2021;6:e0081121. [PMID: 34402648 PMCID: PMC8407411 DOI: 10.1128/msystems.00811-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 12/11/2022] Open

Abstract

Fecal microbiota transplantation (FMT) is currently used in medicine to treat recurrent clostridial colitis and other intestinal diseases. However, neither the therapeutic mechanism of FMT nor the mechanism that allows the donor bacteria to colonize the intestine of the recipient has yet been clearly described. From a biological point of view, FMT can be considered a useful model for studying the ecology of host-associated microbial communities. FMT experiments can shed light on the relationship features between the host and its gut microbiota. This creates the need for experimentation with approaches to metagenomic data analysis which may be useful for the interpretation of observed biological phenomena. Here, the recipient intestine colonization analysis tool (RECAST) novel computational approach is presented, which is based on the metagenomic read sorting process per their origin in the recipient’s post-FMT stool metagenome. Using the RECAST algorithm, taxonomic/functional annotation, and machine learning approaches, the metagenomes from three FMT studies, including healthy volunteers, patients with clostridial colitis, and patients with metabolic syndrome, were analyzed. Using our computational pipeline, the donor-derived and recipient-derived microbes which formed the recipient post-FMT stool metagenomes (successful microbes) were identified. Their presence is well explained by a higher relative abundance in donor/pre-FMT recipient metagenomes or other metagenomes from the human population. In addition, successful microbes are enriched with gene groups potentially related to antibiotic resistance, including antimicrobial peptides. Interestingly, the observed reorganization features are universal and independent of the disease.

IMPORTANCE We assumed that the enrichment of successful gut microbes by lantibiotic/antibiotic resistance genes can be related to gut microbiota colonization resistance by third-party microbe phenomena and resistance to bacterium-derived or host-derived antimicrobial substances. According to this assumption, competition between the donor-derived and recipient-derived microbes as well as host immunity may play a key role in the FMT-related colonization and redistribution of recipient gut microbiota structure.

Author Video: An author video summary of this article is available.

152

Sahu A, Blätke MA, Szymański JJ, Töpfer N. Advances in flux balance analysis by integrating machine learning and mechanism-based models. Comput Struct Biotechnol J 2021;19:4626-4640. [PMID: 34471504 PMCID: PMC8382995 DOI: 10.1016/j.csbj.2021.08.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 05/15/2021] [Revised: 08/03/2021] [Accepted: 08/03/2021] [Indexed: 02/08/2023] Open

153

Zhou Z, Hu S, Zhang R, Ma Y, Du K, Sun M, Zhang H, Jiang X, Tu H, Wang X, Chen P. A simple and novel biomarker panel for serofluid dish rapid quality and safety assessment based on gray relational analysis. Food Biosci 2021. [DOI: 10.1016/j.fbio.2021.101188] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/14/2023]

154

Hassouneh SAD, Loftus M, Yooseph S. Linking Inflammatory Bowel Disease Symptoms to Changes in the Gut Microbiome Structure and Function. Front Microbiol 2021;12:673632. [PMID: 34349736 PMCID: PMC8326577 DOI: 10.3389/fmicb.2021.673632] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 02/28/2021] [Accepted: 06/25/2021] [Indexed: 12/12/2022] Open

Abstract

Inflammatory bowel disease (IBD) is a chronic disease of the gastrointestinal tract that is often characterized by abdominal pain, rectal bleeding, inflammation, and weight loss. Many studies have posited that the gut microbiome may play an integral role in the onset and exacerbation of IBD. Here, we present a novel computational analysis of a previously published IBD dataset. This dataset consists of shotgun sequence data generated from fecal samples collected from individuals with IBD and an internal control group. Utilizing multiple external controls, together with appropriate techniques to handle the compositionality aspect of sequence data, our computational framework can identify and corroborate differences in the taxonomic profiles, bacterial association networks, and functional capacity within the IBD gut microbiome. Our analysis identified 42 bacterial species that are differentially abundant between IBD and every control group (one internal control and two external controls) with at least a twofold difference. Of the 42 species, 34 were significantly elevated in IBD, relative to every other control. These 34 species were still present in the control groups and appear to play important roles, according to network centrality and degree, in all bacterial association networks. Many of the species elevated in IBD have been implicated in modulating the immune response, mucin degradation, antibiotic resistance, and inflammation. We also identified elevated relative abundances of protein families related to signal transduction, sporulation and germination, and polysaccharide degradation as well as decreased relative abundance of protein families related to menaquinone and ubiquinone biosynthesis. Finally, we identified differences in functional capacities between IBD and healthy controls, and subsequently linked the changes in the functional capacity to previously published clinical research and to symptoms that commonly occur in IBD.

155

Estaki M, Jiang L, Bokulich NA, McDonald D, González A, Kosciolek T, Martino C, Zhu Q, Birmingham A, Vázquez-Baeza Y, Dillon MR, Bolyen E, Caporaso JG, Knight R. QIIME 2 Enables Comprehensive End-to-End Analysis of Diverse Microbiome Data and Comparative Studies with Publicly Available Data. Curr Protoc Bioinformatics 2021;70:e100. [PMID: 32343490 PMCID: PMC9285460 DOI: 10.1002/cpbi.100] [Citation(s) in RCA: 185] [Impact Index Per Article: 61.7] [Indexed: 12/30/2022]

Affiliation(s)

Mehrbod Estaki: Department of Pediatrics, University of California San Diego, La Jolla, California
Lingjing Jiang: Division of Biostatistics, University of California San Diego, La Jolla, California
Nicholas A Bokulich: Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona; Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona
Daniel McDonald: Department of Pediatrics, University of California San Diego, La Jolla, California
Antonio González: Department of Pediatrics, University of California San Diego, La Jolla, California
Tomasz Kosciolek: Department of Pediatrics, University of California San Diego, La Jolla, California; Małopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland
Cameron Martino: Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, California; Center for Microbiome Innovation, University of California San Diego, La Jolla, California
Qiyun Zhu: Department of Pediatrics, University of California San Diego, La Jolla, California
Amanda Birmingham: Center for Computational Biology and Bioinformatics, University of California San Diego, La Jolla, California
Yoshiki Vázquez-Baeza: Center for Microbiome Innovation, University of California San Diego, La Jolla, California; Jacobs School of Engineering, University of California San Diego, La Jolla, California
Matthew R Dillon: Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona
Evan Bolyen: Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona
J Gregory Caporaso: Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona; Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona
Rob Knight: Department of Pediatrics, University of California San Diego, La Jolla, California; Center for Microbiome Innovation, University of California San Diego, La Jolla, California; Department of Computer Science and Engineering, University of California San Diego, La Jolla, California; Department of Bioengineering, University of California San Diego, La Jolla, California

156

Yang F, Zou Q. mAML: an automated machine learning pipeline with a microbiome repository for human disease classification. Database: The Journal of Biological Databases and Curation 2021;2020:5862399. [PMID: 32588040 PMCID: PMC7316531 DOI: 10.1093/database/baaa050] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 02/11/2020] [Revised: 05/27/2020] [Accepted: 06/03/2020] [Indexed: 12/20/2022]

157

García-Jiménez B, Muñoz J, Cabello S, Medina J, Wilkinson MD. Predicting microbiomes through a deep latent space. Bioinformatics 2021;37:1444-1451. [PMID: 33289510 PMCID: PMC8208755 DOI: 10.1093/bioinformatics/btaa971] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 07/17/2020] [Revised: 10/21/2020] [Accepted: 11/06/2020] [Indexed: 12/28/2022] Open

Abstract

Motivation

Microbial communities influence their environment by modifying the availability of compounds, such as nutrients or chemical elicitors. Knowing the microbial composition of a site is therefore relevant to improve productivity or health. However, sequencing facilities are not always available, or may be prohibitively expensive in some cases. Thus, it would be desirable to computationally predict the microbial composition from more accessible, easily-measured features.

Results

Integrating deep learning techniques with microbiome data, we propose an artificial neural network architecture based on heterogeneous autoencoders to condense the long vector of microbial abundance values into a deep latent space representation. Then, we design a model to predict the deep latent space and, consequently, to predict the complete microbial composition using environmental features as input. The performance of our system is examined using the rhizosphere microbiome of Maize. We reconstruct the microbial composition (717 taxa) from the deep latent space (10 values) with high fidelity (>0.9 Pearson correlation). We then successfully predict microbial composition from environmental variables, such as plant age, temperature or precipitation (0.73 Pearson correlation, 0.42 Bray–Curtis). We extend this to predict microbiome composition under hypothetical scenarios, such as future climate change conditions. Finally, via transfer learning, we predict microbial composition in a distinct scenario with only 100 sequences, and distinct environmental features. We propose that our deep latent space may assist microbiome-engineering strategies when technical or financial resources are limited, through predicting current or future microbiome compositions.

Availability and implementation

Software, results and data are available at https://github.com/jorgemf/DeepLatentMicrobiome

Supplementary information

Supplementary data are available at Bioinformatics online.
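
As an illustration of the two agreement measures quoted in the Results above (Pearson correlation and Bray–Curtis), the following is a minimal sketch in plain Python. It is not the authors' code: the function names and the toy abundance vectors are hypothetical, and it assumes the reported Bray–Curtis value is a dissimilarity (0 for identical profiles, 1 for profiles with no shared taxa).

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length abundance vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def bray_curtis(x, y):
    """Bray-Curtis dissimilarity: sum|x_i - y_i| / sum(x_i + y_i)."""
    numerator = sum(abs(a - b) for a, b in zip(x, y))
    denominator = sum(a + b for a, b in zip(x, y))
    return numerator / denominator


# Hypothetical relative abundances for four taxa (illustrative values only).
observed = [0.50, 0.30, 0.15, 0.05]
predicted = [0.45, 0.35, 0.10, 0.10]

r = pearson(observed, predicted)
bc = bray_curtis(observed, predicted)
print(f"Pearson r = {r:.3f}, Bray-Curtis = {bc:.3f}")
```

A higher Pearson value and a lower Bray–Curtis value both indicate that a predicted composition is closer to the observed one.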

158

Nishimura N, Kaji K, Kitagawa K, Sawada Y, Furukawa M, Ozutsumi T, Fujinaga Y, Tsuji Y, Takaya H, Kawaratani H, Moriya K, Namisaki T, Akahane T, Fukui H, Yoshiji H. Intestinal Permeability Is a Mechanical Rheostat in the Pathogenesis of Liver Cirrhosis. Int J Mol Sci 2021;22:ijms22136921. [PMID: 34203178 PMCID: PMC8267717 DOI: 10.3390/ijms22136921] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 05/26/2021] [Revised: 06/22/2021] [Accepted: 06/24/2021] [Indexed: 12/12/2022] Open

Abstract

Recent studies have suggested that an alteration in the gut microbiota and their products, particularly endotoxins derived from Gram-negative bacteria, may play a major role in the pathogenesis of liver diseases. Gut dysbiosis caused by a high-fat diet and alcohol consumption induces increased intestinal permeability, which means higher translocation of bacteria and their products and components, including endotoxins, the so-called "leaky gut". Clinical studies have found that plasma endotoxin levels are elevated in patients with chronic liver diseases, including alcoholic liver disease and nonalcoholic liver disease. A decrease in commensal nonpathogenic bacteria including Ruminococcaceae and Lactobacillus and an overgrowth of pathogenic bacteria such as Bacteroidaceae and Enterobacteriaceae are observed in cirrhotic patients. The decreased diversity of the gut microbiota in cirrhotic patients before liver transplantation is also related to a higher incidence of post-transplant infections and cognitive impairment. The exposure to endotoxins activates macrophages via Toll-like receptor 4 (TLR4), leading to a greater production of proinflammatory cytokines and chemokines including tumor necrosis factor-alpha, interleukin (IL)-6, and IL-8, which play key roles in the progression of liver diseases. TLR4 is a major receptor activated by the binding of endotoxins in macrophages, and its downstream signal induces proinflammatory cytokines. The expression of TLR4 is also observed in nonimmune cells in the liver, such as hepatic stellate cells, which play a crucial role in the progression of liver fibrosis that develops into hepatocarcinogenesis, suggesting the importance of the interaction between endotoxemia and TLR4 signaling as a target for preventing liver disease progression. In this review, we summarize the findings for the role of gut-derived endotoxemia underlying the progression of liver pathogenesis.

159

Chen X, Liu L, Zhang W, Yang J, Wong KC. Human host status inference from temporal microbiome changes via recurrent neural networks. Brief Bioinform 2021;22:6307015. [PMID: 34151933 DOI: 10.1093/bib/bbab223] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/24/2021] [Revised: 04/21/2021] [Accepted: 04/21/2021] [Indexed: 01/04/2023] Open

160

Jiao N, Loomba R, Yang ZH, Wu D, Fang S, Bettencourt R, Lan P, Zhu R, Zhu L. Alterations in bile acid metabolizing gut microbiota and specific bile acid genes as a precision medicine to subclassify NAFLD. Physiol Genomics 2021;53:336-348. [PMID: 34151600 DOI: 10.1152/physiolgenomics.00011.2021] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 02/06/2023] Open

Abstract

Multiple mechanisms for the gut microbiome contributing to the pathogenesis of nonalcoholic fatty liver disease (NAFLD) have been implicated. Here, we aim to investigate the contribution and potential application for altered bile acids (BA) metabolizing microbes in NAFLD by post hoc analysis of whole metagenome sequencing (WMS) data. The discovery cohort consisted of 86 well-characterized patients with biopsy-proven NAFLD and 38 healthy controls. Assembly-based analysis was performed to identify BA-metabolizing microbes. Statistical tests, feature selection, and microbial coabundance analysis were integrated to identify microbial alterations and markers in NAFLD. An independent validation cohort was subjected to similar analyses. NAFLD microbiota exhibited decreased diversity and microbial associations. We established a classifier model with 53 differential species exhibiting a robust diagnostic accuracy [area under the receiver-operator curve (AUC) = 0.97] for detecting NAFLD. Next, eight important differential pathway markers including secondary BA biosynthesis were identified. Specifically, increased abundance of 7α-hydroxysteroid dehydrogenase (7α-HSDH), 3α-hydroxysteroid dehydrogenase (baiA), and bile acid-coenzyme A ligase (baiB) was detected in NAFLD. Furthermore, 10 of 50 BA-metabolizing metagenome-assembled genomes (MAGs) from Bacteroides ovatus and Eubacterium biforme were dominant in NAFLD and interplayed as a synergetic ecological guild. Importantly, two subtypes of patients with NAFLD were observed according to secondary BA metabolism potentials. Elevated capability for secondary BA biosynthesis was also observed in the validation cohort. These bacterial BA-metabolizing genes and microbes identified in this study may serve as disease markers. Microbial differences in BA-metabolism and strain-specific differences among patients highlight the potential for precision medicine in NAFLD treatment.

Affiliation(s)

Na Jiao: Department of Colorectal Surgery, Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China; Department of Bioinformatics, Putuo People's Hospital, Tongji University, Shanghai, People's Republic of China
Rohit Loomba: Division of Gastroenterology and Epidemiology, Department of Medicine, NAFLD Research Center, University of California San Diego, La Jolla, California
Zi-Huan Yang: Department of Colorectal Surgery, Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
Dingfeng Wu: Department of Bioinformatics, Putuo People's Hospital, Tongji University, Shanghai, People's Republic of China
Sa Fang: Department of Bioinformatics, Putuo People's Hospital, Tongji University, Shanghai, People's Republic of China
Richele Bettencourt: Division of Gastroenterology and Epidemiology, Department of Medicine, NAFLD Research Center, University of California San Diego, La Jolla, California
Ping Lan: Department of Colorectal Surgery, Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
Ruixin Zhu: Department of Bioinformatics, Putuo People's Hospital, Tongji University, Shanghai, People's Republic of China
Lixin Zhu: Department of Colorectal Surgery, Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China; Department of Biochemistry, Genome, Environment and Microbiome Community of Excellence, The State University of New York at Buffalo, Buffalo, New York

161

Jasner Y, Belogolovski A, Ben-Itzhak M, Koren O, Louzoun Y. Microbiome Preprocessing Machine Learning Pipeline. Front Immunol 2021;12:677870. [PMID: 34220823 PMCID: PMC8250139 DOI: 10.3389/fimmu.2021.677870] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 03/08/2021] [Accepted: 05/07/2021] [Indexed: 11/13/2022] Open

162

Lima KM, Davis RR, Liu SY, Greenhalgh DG, Tran NK. Longitudinal profiling of the burn patient cutaneous and gastrointestinal microbiota: a pilot study. Sci Rep 2021;11:10667. [PMID: 34021204 PMCID: PMC8139985 DOI: 10.1038/s41598-021-89822-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2021] [Accepted: 04/15/2021] [Indexed: 11/09/2022] Open

163

Gene-level metagenomic architectures across diseases yield high-resolution microbiome diagnostic indicators. Nat Commun 2021;12:2907. [PMID: 34006865 PMCID: PMC8131609 DOI: 10.1038/s41467-021-23029-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Received: 10/16/2020] [Accepted: 04/13/2021] [Indexed: 02/06/2023] Open

Abstract

We propose microbiome disease “architectures”: linking >1 million microbial features (species, pathways, and genes) to 7 host phenotypes from 13 cohorts using a pipeline designed to identify associations that are robust to analytical model choice. Here, we quantify conservation and heterogeneity in microbiome-disease associations, using gene-level analysis to identify strain-specific, cross-disease, positive and negative associations. We find coronary artery disease, inflammatory bowel diseases, and liver cirrhosis to share gene-level signatures ascribed to the Streptococcus genus. Type 2 diabetes, by comparison, has a distinct metagenomic signature not linked to any one specific species or genus. We additionally find that at the species-level, the prior-reported connection between Solobacterium moorei and colorectal cancer is not consistently identified across models—however, our gene-level analysis unveils a group of robust, strain-specific gene associations. Finally, we validate our findings regarding colorectal cancer and inflammatory bowel diseases in independent cohorts and identify that features inversely associated with disease tend to be less reproducible than features enriched in disease. Overall, our work is not only a step towards gene-based, cross-disease microbiome diagnostic indicators, but it also illuminates the nuances of the genetic architecture of the human microbiome, including tension between gene- and species-level associations.

Here, combing the massive gene-universe of the gut microbiome to identify strain-specific, cross-disease, associations across seven human diseases, the authors introduce the concept of microbiome architecture, defined as the complete set of positive and negative associations between microbial genes and human host disease, highlighting microbiome architectures as potential diagnostic indicators.

164

Beghini F, McIver LJ, Blanco-Míguez A, Dubois L, Asnicar F, Maharjan S, Mailyan A, Manghi P, Scholz M, Thomas AM, Valles-Colomer M, Weingart G, Zhang Y, Zolfo M, Huttenhower C, Franzosa EA, Segata N. Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3. eLife 2021;10:65088. [PMID: 33944776 PMCID: PMC8096432 DOI: 10.7554/elife.65088] [Citation(s) in RCA: 723] [Impact Index Per Article: 241.0] [Received: 11/22/2020] [Accepted: 04/21/2021] [Indexed: 02/06/2023] Open

165

Simon TG, Chan AT, Huttenhower C. Microbiome Biomarkers: One Step Closer in NAFLD Cirrhosis. Hepatology 2021;73:2063-2066. [PMID: 33283299 DOI: 10.1002/hep.31660] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 08/28/2020] [Revised: 10/25/2020] [Accepted: 11/23/2020] [Indexed: 12/17/2022]

166

Wilkinson JE, Franzosa EA, Everett C, Li C, Hu FB, Wirth DF, Song M, Chan AT, Rimm E, Garrett WS, Huttenhower C. A framework for microbiome science in public health. Nat Med 2021;27:766-774. [PMID: 33820996 DOI: 10.1038/s41591-021-01258-0] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Received: 08/19/2020] [Accepted: 01/19/2021] [Indexed: 12/12/2022]

Affiliation(s)

Jeremy E Wilkinson: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Eric A Franzosa: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA, USA
Christine Everett: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
Chengchen Li: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Frank B Hu: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA; Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Dyann F Wirth: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA, USA; Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Mingyang Song: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA; Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Andrew T Chan: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA, USA; Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA; Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA; Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Eric Rimm: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA; Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Wendy S Garrett: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA; Department of Molecular Metabolism, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Curtis Huttenhower: Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA, USA; Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA

167

Wu S, Chen Y, Li Z, Li J, Zhao F, Su X. Towards multi-label classification: Next step of machine learning for microbiome research. Comput Struct Biotechnol J 2021;19:2742-2749. [PMID: 34093989 PMCID: PMC8131981 DOI: 10.1016/j.csbj.2021.04.054] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 02/24/2021] [Revised: 04/21/2021] [Accepted: 04/22/2021] [Indexed: 11/22/2022] Open

168

Anyaso-Samuel S, Sachdeva A, Guha S, Datta S. Metagenomic Geolocation Prediction Using an Adaptive Ensemble Classifier. Front Genet 2021;12:642282. [PMID: 33959149 PMCID: PMC8093763 DOI: 10.3389/fgene.2021.642282] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 12/15/2020] [Accepted: 03/18/2021] [Indexed: 11/13/2022] Open

169

Young C, Wood HM, Fuentes Balaguer A, Bottomley D, Gallop N, Wilkinson L, Benton SC, Brealey M, John C, Burtonwood C, Thompson KN, Yan Y, Barrett JH, Morris EJA, Huttenhower C, Quirke P. Microbiome Analysis of More Than 2,000 NHS Bowel Cancer Screening Programme Samples Shows the Potential to Improve Screening Accuracy. Clin Cancer Res 2021;27:2246-2254. [PMID: 33658300 PMCID: PMC7610626 DOI: 10.1158/1078-0432.ccr-20-3807] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 10/04/2020] [Revised: 12/05/2020] [Accepted: 02/12/2021] [Indexed: 02/03/2023]

Abstract

PURPOSE

There is potential for fecal microbiome profiling to improve colorectal cancer screening. This has been demonstrated by research studies, but it has not been quantified at scale using samples collected and processed routinely by a national screening program.

EXPERIMENTAL DESIGN

Between 2016 and 2019, the largest of the NHS Bowel Cancer Screening Programme hubs prospectively collected processed guaiac fecal occult blood test (gFOBT) samples with subsequent colonoscopy outcomes: blood-negative [n = 491 (22%)]; colorectal cancer [n = 430 (19%)]; adenoma [n = 665 (30%)]; colonoscopy-normal [n = 300 (13%)]; nonneoplastic [n = 366 (16%)]. Samples were transported and stored at room temperature. DNA underwent 16S rRNA gene V4 amplicon sequencing. Taxonomic profiling was performed to provide features for classification via random forests (RF).

RESULTS

Samples provided 16S amplicon-based microbial profiles, which confirmed previously described colorectal cancer-microbiome associations. Microbiome-based RF models showed potential as a first-tier screen, distinguishing colorectal cancer or neoplasm (colorectal cancer or adenoma) from blood-negative with AUC 0.86 (0.82-0.89) and AUC 0.78 (0.74-0.82), respectively. Microbiome-based models also showed potential as a second-tier screen, distinguishing from among gFOBT blood-positive samples, colorectal cancer or neoplasm from colonoscopy-normal with AUC 0.79 (0.74-0.83) and AUC 0.73 (0.68-0.77), respectively. Models remained robust when restricted to 15 taxa, and performed similarly during external validation with metagenomic datasets.

CONCLUSIONS

Microbiome features can be assessed using gFOBT samples collected and processed routinely by a national colorectal cancer screening program to improve accuracy as a first- or second-tier screen. The models required as few as 15 taxa, raising the potential of an inexpensive qPCR test. This could reduce the number of colonoscopies in countries that use fecal occult blood test screening.

Affiliation(s)

Caroline Young Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom.
Henry M Wood Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom
Alba Fuentes Balaguer Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom
Daniel Bottomley Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom
Niall Gallop Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom
Lyndsay Wilkinson Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom
Sally C Benton NHS Bowel Cancer Screening Programme - Southern Hub, Surrey Research Park, Guildford, United Kingdom
Martin Brealey NHS Bowel Cancer Screening Programme - Southern Hub, Surrey Research Park, Guildford, United Kingdom
Cerin John NHS Bowel Cancer Screening Programme - Southern Hub, Surrey Research Park, Guildford, United Kingdom
Carole Burtonwood NHS Bowel Cancer Screening Programme - Southern Hub, Surrey Research Park, Guildford, United Kingdom
Kelsey N Thompson Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
Yan Yan Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
Jennifer H Barrett Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom
Eva J A Morris Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom Big Data Institute, Nuffield Department of Population Health, Old Road Campus, University of Oxford, Oxford, United Kingdom
Curtis Huttenhower Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
Philip Quirke Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University Hospital, University of Leeds, Leeds, United Kingdom

170

Johns MS, Petrelli NJ. Microbiome and colorectal cancer: A review of the past, present, and future. Surg Oncol 2021;37:101560. [PMID: 33848761 DOI: 10.1016/j.suronc.2021.101560] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 07/19/2020] [Revised: 11/22/2020] [Accepted: 03/28/2021] [Indexed: 12/27/2022]

171

Zhang W, Chen X, Wong KC. Noninvasive early diagnosis of intestinal diseases based on artificial intelligence in genomics and microbiome. J Gastroenterol Hepatol 2021;36:823-831. [PMID: 33880763 DOI: 10.1111/jgh.15500] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 01/10/2021] [Revised: 03/15/2021] [Accepted: 03/17/2021] [Indexed: 12/15/2022]

172

Wirbel J, Zych K, Essex M, Karcher N, Kartal E, Salazar G, Bork P, Sunagawa S, Zeller G. Microbiome meta-analysis and cross-disease comparison enabled by the SIAMCAT machine learning toolbox. Genome Biol 2021;22:93. [PMID: 33785070 PMCID: PMC8008609 DOI: 10.1186/s13059-021-02306-1] [Citation(s) in RCA: 96] [Impact Index Per Article: 32.0] [Received: 07/28/2020] [Accepted: 02/24/2021] [Indexed: 02/08/2023] Open

173

Rahman MA, Rangwala H. IDMIL: an alignment-free Interpretable Deep Multiple Instance Learning (MIL) for predicting disease from whole-metagenomic data. Bioinformatics 2021;36:i39-i47. [PMID: 32657370 PMCID: PMC7355246 DOI: 10.1093/bioinformatics/btaa477] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 12/22/2022] Open

Abstract

Motivation

The human body hosts more microbial organisms than human cells. Analysis of this microbial diversity provides key insight into the role played by these microorganisms on human health. Metagenomics is the collective DNA sequencing of coexisting microbial organisms in an environmental sample or a host. This has several applications in precision medicine, agriculture, environmental science and forensics. State-of-the-art predictive models for phenotype predictions from metagenomic data rely on alignments, assembly, extensive pruning, taxonomic profiling and reference sequence databases. These processes are time consuming and they do not consider novel microbial sequences when aligned with the reference genome, limiting the potential of whole metagenomics. We formulate the problem of predicting human disease from whole-metagenomic data using Multiple Instance Learning (MIL), a popular supervised learning paradigm. Our proposed alignment-free approach provides higher accuracy in prediction by harnessing the capability of deep convolutional neural network (CNN) within a MIL framework and provides interpretability via neural attention mechanism.

Results

The MIL formulation combined with the hierarchical feature extraction capability of deep-CNN provides significantly better predictive performance compared to popular existing approaches. The attention mechanism allows for the identification of groups of sequences that are likely to be correlated to diseases providing the much-needed interpretation. Our proposed approach does not rely on alignment, assembly and reference sequence databases; making it fast and scalable for large-scale metagenomic data. We evaluate our method on well-known large-scale metagenomic studies and show that our proposed approach outperforms comparative state-of-the-art methods for disease prediction.

Availability and implementation

https://github.com/mrahma23/IDMIL.

174

Manandhar I, Alimadadi A, Aryal S, Munroe PB, Joe B, Cheng X. Gut microbiome-based supervised machine learning for clinical diagnosis of inflammatory bowel diseases. Am J Physiol Gastrointest Liver Physiol 2021;320:G328-G337. [PMID: 33439104 PMCID: PMC8828266 DOI: 10.1152/ajpgi.00360.2020] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Indexed: 02/08/2023]

Abstract

Despite the availability of various diagnostic tests for inflammatory bowel diseases (IBD), misdiagnosis of IBD occurs frequently, and thus, there is a clinical need to further improve the diagnosis of IBD. As gut dysbiosis is reported in patients with IBD, we hypothesized that supervised machine learning (ML) could be used to analyze gut microbiome data for predictive diagnostics of IBD. To test our hypothesis, fecal 16S metagenomic data of 729 subjects with IBD and 700 subjects without IBD from the American Gut Project were analyzed using five different ML algorithms. Fifty differential bacterial taxa were identified [linear discriminant analysis effect size (LEfSe): linear discriminant analysis (LDA) score > 3] between the IBD and non-IBD groups, and ML classifications trained with these taxonomic features using random forest (RF) achieved a testing area under the receiver operating characteristic curves (AUC) of ∼0.80. Next, we tested if operational taxonomic units (OTUs), instead of bacterial taxa, could be used as ML features for diagnostic classification of IBD. Top 500 high-variance OTUs were used for ML training, and an improved testing AUC of ∼0.82 (RF) was achieved. Lastly, we tested if supervised ML could be used for differentiating Crohn's disease (CD) and ulcerative colitis (UC). Using 331 CD and 141 UC samples, 117 differential bacterial taxa (LEfSe: LDA score > 3) were identified, and the RF model trained with differential taxonomic features or high-variance OTU features achieved a testing AUC > 0.90. In summary, our study demonstrates the promising potential of artificial intelligence via supervised ML modeling for predictive diagnostics of IBD using gut microbiome data. NEW & NOTEWORTHY Our study demonstrates the promising potential of artificial intelligence via supervised machine learning modeling for predictive diagnostics of different types of inflammatory bowel diseases using fecal gut microbiome data.

175

Shanahan ER, McMaster JJ, Staudacher HM. Conducting research on diet-microbiome interactions: A review of current challenges, essential methodological principles, and recommendations for best practice in study design. J Hum Nutr Diet 2021;34:631-644. [PMID: 33639033 DOI: 10.1111/jhn.12868] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 08/24/2020] [Revised: 01/07/2021] [Accepted: 01/19/2021] [Indexed: 12/21/2022]

Abstract

Diet is one of the strongest modulators of the gut microbiome. However, the complexity of the interactions between diet and the microbial community emphasises the need for a robust study design and continued methodological development. This review aims to summarise considerations for conducting high-quality diet-microbiome research, outline key challenges unique to the field, and provide advice for addressing these in a practical manner useful to dietitians, microbiologists, gastroenterologists and other diet-microbiome researchers. Searches of databases and references from relevant articles were conducted using the primary search terms 'diet', 'diet intervention', 'dietary analysis', 'microbiome' and 'microbiota', alone or in combination. Publications were considered relevant if they addressed methods for diet and/or microbiome research, or were a human study relevant to diet-microbiome interactions. Best-practice design in diet-microbiome research requires appropriate consideration of the study population and careful choice of trial design and data collection methodology. Ongoing challenges include the collection of dietary data that accurately reflects intake at a timescale relevant to microbial community structure and metabolism, measurement of nutrients in foods pertinent to microbes, improving ability to measure and understand microbial metabolic and functional properties, adequately powering studies, and the considered analysis of multivariate compositional datasets. Collaboration across the disciplines of nutrition science and microbiology is crucial for high-quality diet-microbiome research. Improvements in our understanding of the interaction between nutrient intake and microbial metabolism, as well as continued methodological innovation, will facilitate development of effective evidence-based personalised dietary treatments.

176

Carrieri AP, Haiminen N, Maudsley-Barton S, Gardiner LJ, Murphy B, Mayes AE, Paterson S, Grimshaw S, Winn M, Shand C, Hadjidoukas P, Rowe WPM, Hawkins S, MacGuire-Flanagan A, Tazzioli J, Kenny JG, Parida L, Hoptroff M, Pyzer-Knapp EO. Explainable AI reveals changes in skin microbiome composition linked to phenotypic differences. Sci Rep 2021;11:4565. [PMID: 33633172 PMCID: PMC7907326 DOI: 10.1038/s41598-021-83922-6] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 10/14/2020] [Accepted: 02/08/2021] [Indexed: 02/06/2023] Open

Abstract

Alterations in the human microbiome have been observed in a variety of conditions such as asthma, gingivitis, dermatitis and cancer, and much remains to be learned about the links between the microbiome and human health. The fusion of artificial intelligence with rich microbiome datasets can offer an improved understanding of the microbiome’s role in human health. To gain actionable insights it is essential to consider both the predictive power and the transparency of the models by providing explanations for the predictions. We combine the collection of leg skin microbiome samples from two healthy cohorts of women with the application of an explainable artificial intelligence (EAI) approach that provides accurate predictions of phenotypes with explanations. The explanations are expressed in terms of variations in the relative abundance of key microbes that drive the predictions. We predict skin hydration, subject's age, pre/post-menopausal status and smoking status from the leg skin microbiome. The changes in microbial composition linked to skin hydration can accelerate the development of personalized treatments for healthy skin, while those associated with age may offer insights into the skin aging process. The leg microbiome signatures associated with smoking and menopausal status are consistent with previous findings from oral/respiratory tract microbiomes and vaginal/gut microbiomes respectively. This suggests that easily accessible microbiome samples could be used to investigate health-related phenotypes, offering potential for non-invasive diagnosis and condition monitoring. Our EAI approach sets the stage for new work focused on understanding the complex relationships between microbial communities and phenotypes. Our approach can be applied to predict any condition from microbiome samples and has the potential to accelerate the development of microbiome-based personalized therapeutics and non-invasive diagnostics.

177

Moreno-Indias I, Lahti L, Nedyalkova M, Elbere I, Roshchupkin G, Adilovic M, Aydemir O, Bakir-Gungor B, Santa Pau ECD, D’Elia D, Desai MS, Falquet L, Gundogdu A, Hron K, Klammsteiner T, Lopes MB, Marcos-Zambrano LJ, Marques C, Mason M, May P, Pašić L, Pio G, Pongor S, Promponas VJ, Przymus P, Saez-Rodriguez J, Sampri A, Shigdel R, Stres B, Suharoschi R, Truu J, Truică CO, Vilne B, Vlachakis D, Yilmaz E, Zeller G, Zomer AL, Gómez-Cabrero D, Claesson MJ. Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions. Front Microbiol 2021;12:635781. [PMID: 33692771 PMCID: PMC7937616 DOI: 10.3389/fmicb.2021.635781] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Received: 11/30/2020] [Accepted: 01/28/2021] [Indexed: 12/23/2022] Open

Affiliation(s)

Isabel Moreno-Indias Instituto de Investigación Biomédica de Málaga (IBIMA), Unidad de Gestión Clínica de Endocrinología y Nutrición, Hospital Clínico Universitario Virgen de la Victoria, Universidad de Málaga, Málaga, Spain Centro de Investigación Biomédica en Red de Fisiopatología de la Obesidad y la Nutrición (CIBEROBN), Instituto de Salud Carlos III, Madrid, Spain
Leo Lahti Department of Computing, University of Turku, Turku, Finland
Miroslava Nedyalkova Human Genetics and Disease Mechanisms, Latvian Biomedical Research and Study Centre, Riga, Latvia
Ilze Elbere Latvian Biomedical Research and Study Centre, Riga, Latvia
Gennady Roshchupkin Department of Epidemiology, Erasmus Medical Center, Rotterdam, Netherlands
Muhamed Adilovic Department of Genetics and Bioengineering, International University of Sarajevo, Sarajevo, Bosnia and Herzegovina
Onder Aydemir Department of Electrical and Electronics Engineering, Karadeniz Technical University, Trabzon, Turkey
Burcu Bakir-Gungor Department of Computer Engineering, Abdullah Gul University, Kayseri, Turkey
Enrique Carrillo-de Santa Pau Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
Domenica D’Elia Department for Biomedical Sciences, Institute for Biomedical Technologies, National Research Council, Bari, Italy
Mahesh S. Desai Department of Infection and Immunity, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg Odense Research Center for Anaphylaxis, Department of Dermatology and Allergy Center, Odense University Hospital, University of Southern Denmark, Odense, Denmark
Laurent Falquet Department of Biology, University of Fribourg, Fribourg, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
Aycan Gundogdu Department of Microbiology and Clinical Microbiology, Faculty of Medicine, Erciyes University, Kayseri, Turkey Metagenomics Laboratory, Genome and Stem Cell Center (GenKök), Erciyes University, Kayseri, Turkey
Karel Hron Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czechia
Thomas Klammsteiner Department of Microbiology, University of Innsbruck, Innsbruck, Austria
Marta B. Lopes NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT, UNL, Caparica, Portugal Centro de Matemática e Aplicações (CMA), FCT, UNL, Caparica, Portugal
Laura Judith Marcos-Zambrano Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
Cláudia Marques CINTESIS, NOVA Medical School, NMS, Universidade Nova de Lisboa, Lisbon, Portugal
Michael Mason Computational Oncology, Sage Bionetworks, Seattle, WA, United States
Patrick May Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Lejla Pašić Sarajevo Medical School, University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
Gianvito Pio Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
Sándor Pongor Faculty of Information Technology and Bionics, Pázmány University, Budapest, Hungary
Vasilis J. Promponas Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
Piotr Przymus Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland
Julio Saez-Rodriguez Institute of Computational Biomedicine, Heidelberg University, Faculty of Medicine and Heidelberg University Hospital, Heidelberg, Germany
Alexia Sampri Division of Informatics, Imaging and Data Sciences, School of Health Sciences, University of Manchester, Manchester, United Kingdom
Rajesh Shigdel Department of Clinical Science, University of Bergen, Bergen, Norway
Blaz Stres Jozef Stefan Institute, Ljubljana, Slovenia Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia Faculty of Civil and Geodetic Engineering, University of Ljubljana, Ljubljana, Slovenia
Ramona Suharoschi Molecular Nutrition and Proteomics Lab, Faculty of the Food Science and Technology, Institute of Life Sciences, University of Agricultural Sciences and Veterinary Medicine of Cluj-Napoca, Cluj-Napoca, Romania
Jaak Truu Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
Ciprian-Octavian Truică Department of Computer Science and Engineering, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
Baiba Vilne Bioinformatics Research Unit, Riga Stradins University, Riga, Latvia
Dimitrios Vlachakis Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
Ercument Yilmaz Department of Computer Technologies, Karadeniz Technical University, Trabzon, Turkey
Georg Zeller European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
Aldert L. Zomer Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, Utrecht, Netherlands
David Gómez-Cabrero Navarrabiomed, Complejo Hospitalario de Navarra (CHN), IdiSNA, Universidad Pública de Navarra (UPNA), Pamplona, Spain
Marcus J. Claesson School of Microbiology and APC Microbiome Ireland, University College Cork, Cork, Ireland

178

Marcos-Zambrano LJ, Karaduzovic-Hadziabdic K, Loncar Turukalo T, Przymus P, Trajkovik V, Aasmets O, Berland M, Gruca A, Hasic J, Hron K, Klammsteiner T, Kolev M, Lahti L, Lopes MB, Moreno V, Naskinova I, Org E, Paciência I, Papoutsoglou G, Shigdel R, Stres B, Vilne B, Yousef M, Zdravevski E, Tsamardinos I, Carrillo de Santa Pau E, Claesson MJ, Moreno-Indias I, Truu J. Applications of Machine Learning in Human Microbiome Studies: A Review on Feature Selection, Biomarker Identification, Disease Prediction and Treatment. Front Microbiol 2021;12:634511. [PMID: 33737920 PMCID: PMC7962872 DOI: 10.3389/fmicb.2021.634511] [Citation(s) in RCA: 113] [Impact Index Per Article: 37.7] [Received: 11/27/2020] [Accepted: 02/01/2021] [Indexed: 12/19/2022] Open

Abstract

The number of microbiome-related studies has notably increased the availability of data on human microbiome composition and function. These studies provide the essential material to deeply explore host-microbiome associations and their relation to the development and progression of various complex diseases. Improved data-analytical tools are needed to exploit all information from these biological datasets, taking into account the peculiarities of microbiome data, i.e., compositional, heterogeneous and sparse nature of these datasets. The possibility of predicting host-phenotypes based on taxonomy-informed feature selection to establish an association between microbiome and predict disease states is beneficial for personalized medicine. In this regard, machine learning (ML) provides new insights into the development of models that can be used to predict outputs, such as classification and prediction in microbiology, infer host phenotypes to predict diseases and use microbial communities to stratify patients by their characterization of state-specific microbial signatures. Here we review the state-of-the-art ML methods and respective software applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on the application of ML in microbiome studies related to association and clinical use for diagnostics, prognostics, and therapeutics. Although the data presented here is more related to the bacterial community, many algorithms could be applied in general, regardless of the feature type. This literature and software review covering this broad topic is aligned with the scoping review methodology. 
The manual identification of data sources has been complemented with: (1) automated publication search through digital libraries of the three major publishers using natural language processing (NLP) Toolkit, and (2) an automated identification of relevant software repositories on GitHub and ranking of the related research papers relying on learning to rank approach.

Affiliation(s)

Laura Judith Marcos-Zambrano Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
Kanita Karaduzovic-Hadziabdic Faculty of Engineering and Natural Sciences, International University of Sarajevo, Sarajevo, Bosnia and Herzegovina
Tatjana Loncar Turukalo Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
Piotr Przymus Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland
Vladimir Trajkovik Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North Macedonia
Oliver Aasmets Institute of Genomics, Estonian Genome Centre, University of Tartu, Tartu, Estonia Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
Magali Berland Université Paris-Saclay, INRAE, MGP, Jouy-en-Josas, France
Aleksandra Gruca Department of Computer Networks and Systems, Silesian University of Technology, Gliwice, Poland
Jasminka Hasic University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
Karel Hron Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czechia
Thomas Klammsteiner Department of Microbiology, University of Innsbruck, Innsbruck, Austria
Mikhail Kolev South West University “Neofit Rilski”, Blagoevgrad, Bulgaria
Leo Lahti Department of Computing, University of Turku, Turku, Finland
Marta B. Lopes NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT, UNL, Caparica, Portugal Centro de Matemática e Aplicações (CMA), FCT, UNL, Caparica, Portugal
Victor Moreno Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), Barcelona, Spain Colorectal Cancer Group, Institut de Recerca Biomedica de Bellvitge (IDIBELL), Barcelona, Spain Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Barcelona, Spain Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain
Irina Naskinova South West University “Neofit Rilski”, Blagoevgrad, Bulgaria
Elin Org Institute of Genomics, Estonian Genome Centre, University of Tartu, Tartu, Estonia
Inês Paciência EPIUnit – Instituto de Saúde Pública da Universidade do Porto, Porto, Portugal
Georgios Papoutsoglou Department of Computer Science, University of Crete, Heraklion, Greece
Rajesh Shigdel Department of Clinical Science, University of Bergen, Bergen, Norway
Blaz Stres Group for Microbiology and Microbial Biotechnology, Department of Animal Science, University of Ljubljana, Ljubljana, Slovenia
Baiba Vilne Bioinformatics Research Unit, Riga Stradins University, Riga, Latvia
Malik Yousef Department of Information Systems, Zefat Academic College, Zefat, Israel Galilee Digital Health Research Center (GDH), Zefat Academic College, Zefat, Israel
Eftim Zdravevski Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North Macedonia
Ioannis Tsamardinos Department of Computer Science, University of Crete, Heraklion, Greece
Enrique Carrillo de Santa Pau Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
Marcus J. Claesson School of Microbiology & APC Microbiome Ireland, University College Cork, Cork, Ireland
Isabel Moreno-Indias Unidad de Gestión Clínica de Endocrinología y Nutrición, Instituto de Investigación Biomédica de Málaga (IBIMA), Hospital Clínico Universitario Virgen de la Victoria, Universidad de Málaga, Málaga, Spain Centro de Investigación Biomédica en Red de Fisiopatología de la Obesidad y la Nutrición (CIBEROBN), Instituto de Salud Carlos III, Madrid, Spain
Jaak Truu Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia

179

Shi K, Zhang L, Yu J, Chen Z, Lai S, Zhao X, Li WG, Luo Q, Lin W, Feng J, Bork P, Zhao XM, Li F. A 12-genus bacterial signature identifies a group of severe autistic children with differential sensory behavior and brain structures. Clin Transl Med 2021;11:e314. [PMID: 33634969 PMCID: PMC7893807 DOI: 10.1002/ctm2.314] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/09/2020] [Revised: 01/20/2021] [Accepted: 01/21/2021] [Indexed: 01/01/2023] Open

Affiliation(s)

Kai Shi Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China; School of Mathematical Sciences, SCMS, and SCAM, Fudan University, Shanghai, China; College of Information Science and Engineering, Guilin University of Technology, Guilin, China
Lingli Zhang Department of Developmental and Behavioural Pediatric & Child Primary Care, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Brain and Behavioural Research Unit of Shanghai Institute for Pediatric Research and MOE Shanghai Key Laboratory for Children's Environmental Health, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Juehua Yu Department of Developmental and Behavioural Pediatric & Child Primary Care, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; NHC Key Laboratory of Drug Addiction Medicine (Kunming Medical University), First Affiliated Hospital of Kunming Medical University, Kunming, Yunnan, China
Zilin Chen Department of Developmental and Behavioural Pediatric & Child Primary Care, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Brain and Behavioural Research Unit of Shanghai Institute for Pediatric Research and MOE Shanghai Key Laboratory for Children's Environmental Health, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Senying Lai Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China
Xingzhong Zhao Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China
Wei-Guang Li Collaborative Innovation Center for Brain Science, Department of Anatomy and Physiology, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Qiang Luo Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China
Wei Lin Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China; School of Mathematical Sciences, SCMS, and SCAM, Fudan University, Shanghai, China; Research Institute of Intelligent Complex Systems, Fudan University, Shanghai, China
Jianfeng Feng Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China
Peer Bork European Molecular Biology Laboratory, Meyerhofstraße 1, Heidelberg, Germany
Xing-Ming Zhao Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, China; MOE Key Laboratory of Computational Neuroscience and Brain-inspired Intelligence, and Frontiers Center for Brain Science, Shanghai, China
Fei Li Department of Developmental and Behavioural Pediatric & Child Primary Care, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Brain and Behavioural Research Unit of Shanghai Institute for Pediatric Research and MOE Shanghai Key Laboratory for Children's Environmental Health, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China

180

Sharma D, Paterson AD, Xu W. TaxoNN: ensemble of neural networks on stratified microbiome data for disease prediction. Bioinformatics 2021;36:4544-4550. [PMID: 32449747 PMCID: PMC7750934 DOI: 10.1093/bioinformatics/btaa542] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 03/10/2020] [Revised: 05/08/2020] [Accepted: 05/19/2020] [Indexed: 11/13/2022] Open

181

Aasmets O, Lüll K, Lang JM, Pan C, Kuusisto J, Fischer K, Laakso M, Lusis AJ, Org E. Machine Learning Reveals Time-Varying Microbial Predictors with Complex Effects on Glucose Regulation. mSystems 2021;6:e01191-20. [PMID: 33594006 PMCID: PMC8573957 DOI: 10.1128/msystems.01191-20] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 11/13/2020] [Accepted: 01/22/2021] [Indexed: 12/11/2022] Open

Abstract

The incidence of type 2 diabetes (T2D) has been increasing globally, and a growing body of evidence links type 2 diabetes with altered microbiota composition. Type 2 diabetes is preceded by a long prediabetic state characterized by changes in various metabolic parameters. We tested whether the gut microbiome could have predictive potential for T2D development during the healthy and prediabetic disease stages. We used prospective data of 608 well-phenotyped Finnish men collected from the population-based Metabolic Syndrome in Men (METSIM) study to build machine learning models for predicting continuous glucose and insulin measures in a shorter (1.5 year) and longer (4 year) period. Our results show that the inclusion of the gut microbiome improves prediction accuracy for modeling T2D-associated parameters such as glycosylated hemoglobin and insulin measures. We identified novel microbial biomarkers and described their effects on the predictions using interpretable machine learning techniques, which revealed complex linear and nonlinear associations. Additionally, the modeling strategy carried out allowed us to compare the stability of model performance and biomarker selection, also revealing differences in short-term and long-term predictions. The identified microbiome biomarkers provide a predictive measure for various metabolic traits related to T2D, thus providing an additional parameter for personal risk assessment. Our work also highlights the need for robust modeling strategies and the value of interpretable machine learning.

IMPORTANCE

Recent studies have shown a clear link between gut microbiota and type 2 diabetes. However, current results are based on cross-sectional studies that aim to determine the microbial dysbiosis when the disease is already prevalent. In order to consider the microbiome as a factor in disease risk assessment, prospective studies are needed. Our study is the first study that assesses the gut microbiome as a predictive measure for several type 2 diabetes-associated parameters in a longitudinal study setting. Our results revealed a number of novel microbial biomarkers that can improve the prediction accuracy for continuous insulin measures and glycosylated hemoglobin levels. These results make the prospect of using the microbiome in personalized medicine promising.

182

PM2RA: A Framework for Detecting and Quantifying Relationship Alterations in Microbial Community. Genomics Proteomics Bioinformatics 2021;19:154-167. [PMID: 33581337 PMCID: PMC8498968 DOI: 10.1016/j.gpb.2020.07.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/21/2020] [Revised: 06/28/2020] [Accepted: 08/09/2020] [Indexed: 11/21/2022]

183

Microbial source tracking using metagenomics and other new technologies. J Microbiol 2021;59:259-269. [DOI: 10.1007/s12275-021-0668-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 12/18/2020] [Revised: 01/08/2021] [Accepted: 01/08/2021] [Indexed: 12/12/2022]

184

Microbiome connections with host metabolism and habitual diet from 1,098 deeply phenotyped individuals. Nat Med 2021;27:321-332. [PMID: 33432175 PMCID: PMC8353542 DOI: 10.1038/s41591-020-01183-8] [Citation(s) in RCA: 416] [Impact Index Per Article: 138.7] [Received: 03/10/2020] [Accepted: 11/16/2020] [Indexed: 02/07/2023]

185

Ghannam RB, Techtmann SM. Machine learning applications in microbial ecology, human microbiome studies, and environmental monitoring. Comput Struct Biotechnol J 2021;19:1092-1107. [PMID: 33680353 PMCID: PMC7892807 DOI: 10.1016/j.csbj.2021.01.028] [Citation(s) in RCA: 76] [Impact Index Per Article: 25.3] [Received: 10/02/2020] [Revised: 01/16/2021] [Accepted: 01/18/2021] [Indexed: 01/04/2023] Open

186

Dhungel E, Mreyoud Y, Gwak HJ, Rajeh A, Rho M, Ahn TH. MegaR: an interactive R package for rapid sample classification and phenotype prediction using metagenome profiles and machine learning. BMC Bioinformatics 2021;22:25. [PMID: 33461494 PMCID: PMC7814621 DOI: 10.1186/s12859-020-03933-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/28/2020] [Accepted: 12/11/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Diverse microbiome communities drive biogeochemical processes and evolution of animals in their ecosystems. Many microbiome projects have demonstrated the power of using metagenomics to understand the structures and factors influencing the function of the microbiomes in their environments. In order to characterize the effects of microbiome composition on human health, diseases, and even ecosystems, one must first understand the relationship between microbes and their environment in different samples. Running machine learning models on metagenomic sequencing data is encouraged for this purpose, but it is not easy to build an appropriate machine learning model for all diverse metagenomic datasets.

RESULTS

We introduce MegaR, an R Shiny package and web application, to build an unbiased machine learning model effortlessly with interactive visual analysis. MegaR employs taxonomic profiles from either whole metagenome sequencing or 16S rRNA sequencing data to develop machine learning models and classify the samples into two or more categories. It provides various options for model fine tuning throughout the analysis pipeline, such as data processing, multiple machine learning techniques, model validation, and unknown sample prediction, that can be used to achieve the highest prediction accuracy possible for any given dataset while still maintaining a user-friendly experience.

CONCLUSIONS

Metagenomic sample classification and phenotype prediction are important, particularly when applied as a diagnostic method for identifying and predicting microbe-related human diseases. MegaR provides various interactive visualizations for users to build an accurate machine learning model without difficulty. Unknown sample prediction with a properly trained model using MegaR will help researchers identify sample properties with a fast turnaround time.

187

Reiman D, Farhat AM, Dai Y. Predicting Host Phenotype Based on Gut Microbiome Using a Convolutional Neural Network Approach. Methods Mol Biol 2021;2190:249-266. [PMID: 32804370 DOI: 10.1007/978-1-0716-0826-5_12] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 06/11/2023]

188

Mancin L, Rollo I, Mota JF, Piccini F, Carletti M, Susto GA, Valle G, Paoli A. Optimizing Microbiota Profiles for Athletes. Exerc Sport Sci Rev 2021;49:42-49. [PMID: 33044333 DOI: 10.1249/jes.0000000000000236] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Indexed: 12/16/2022]

189

Fouladi F, Carroll IM, Sharpton TJ, Bulik-Sullivan E, Heinberg L, Steffen KJ, Fodor AA. A microbial signature following bariatric surgery is robustly consistent across multiple cohorts. Gut Microbes 2021;13:1930872. [PMID: 34159880 PMCID: PMC8224199 DOI: 10.1080/19490976.2021.1930872] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 12/10/2020] [Revised: 04/28/2021] [Accepted: 05/05/2021] [Indexed: 02/07/2023] Open

Abstract

Bariatric surgery induces significant shifts in the gut microbiota which could potentially contribute to weight loss and metabolic benefits. The aim of this study was to characterize a microbial signature following Roux-en-Y Gastric bypass (RYGB) surgery using novel and existing gut microbiota sequence data. We generated 16S rRNA gene and metagenomic sequences from fecal samples from patients undergoing RYGB surgery (n = 61 for 16S rRNA gene and n = 135 for metagenomics) at pre-surgical baseline and one, six, and twelve months post-surgery. We compared these data with three smaller publicly available 16S rRNA gene datasets and one metagenomic dataset from patients who also underwent RYGB surgery. Linear mixed models and machine learning approaches were used to examine the presence of a common microbial signature across studies. Comparison of our new sequences with previous longitudinal studies revealed strikingly similar profiles in both fecal microbiota composition (r = 0.41 ± 0.10; p < .05) and metabolic pathways (r = 0.70 ± 0.05; p < .001) early after surgery across multiple datasets. Notably, Veillonella, Streptococcus, Gemella, Fusobacterium, Escherichia/Shigella, and Akkermansia increased after surgery, while Blautia decreased. Machine learning approaches revealed that the replicable gut microbiota signature associated with RYGB surgery could be used to discriminate pre- and post-surgical samples. Opportunistic pathogen abundance also increased post-surgery in a consistent manner across cohorts. Our study reveals a robust microbial signature involving many commensal and pathogenic taxa and metabolic pathways early after RYGB surgery across different studies and sites. Characterization of the effects of this robust microbial signature on outcomes of bariatric surgery could provide insights into the development of microbiome-based interventions for predicting or improving outcomes following surgery.

190

Elgart M, Redline S, Sofer T. Machine and Deep Learning in Molecular and Genetic Aspects of Sleep Research. Neurotherapeutics 2021;18:228-243. [PMID: 33829409 PMCID: PMC8116376 DOI: 10.1007/s13311-021-01014-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Accepted: 01/18/2021] [Indexed: 12/11/2022] Open

191

McCoubrey LE, Elbadawi M, Orlu M, Gaisford S, Basit AW. Harnessing machine learning for development of microbiome therapeutics. Gut Microbes 2021;13:1-20. [PMID: 33522391 PMCID: PMC7872042 DOI: 10.1080/19490976.2021.1872323] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Received: 11/27/2020] [Accepted: 12/20/2020] [Indexed: 02/06/2023] Open

192

Hsu CK, Su SC, Chang LC, Shao SC, Yang KJ, Chen CY, Chen YT, Wu IW. Effects of Low Protein Diet on Modulating Gut Microbiota in Patients with Chronic Kidney Disease: A Systematic Review and Meta-analysis of International Studies. Int J Med Sci 2021;18:3839-3850. [PMID: 34790060 PMCID: PMC8579282 DOI: 10.7150/ijms.66451] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/24/2021] [Accepted: 10/09/2021] [Indexed: 12/11/2022] Open

Abstract

Background: Although associations between low protein diet (LPD) and changes of gut microbiota have been reported, systematic discernment of the effects of LPD on diet-microbiome-host interaction in patients with chronic kidney disease (CKD) is lacking. Methods: We searched PUBMED and EMBASE for articles published until July 2021 on changes of gut microbiota associated with implementation of LPD in CKD patients. Independent researchers extracted data and assessed risks of bias. We conducted meta-analyses of combined p-values, mean differences and random effects for gut microbiota and related metabolites. Study heterogeneity was measured by the Tau² and I² statistics. This study followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Results: Five articles met inclusion criteria. The meta-analyses of gut microbiota exhibited enrichments of Lactobacillaceae (meta-p = 0.010), Bacteroidaceae (meta-p = 0.048) and Streptococcus anginosus (meta-p < 0.001), but revealed depletion of Bacteroides eggerthii (p = 0.017) and Roseburia faecis (meta-p = 0.019) in LPD patients compared to patients undergoing normal protein diet. The serum IS levels (mean difference: 0.68 μg/mL, 95% CI: -8.38 to 9.68, p = 0.89) and pCS levels (mean difference: -3.85 μg/mL, 95% CI: -15.49 to 7.78, p < 0.52) did not differ between groups. We did not find significant differences in renal function associated with change of microbiota between groups (eGFR, mean difference: -7.21 mL/min/1.73 m², 95% CI: -33.2 to 18.79, p = 0.59; blood urea nitrogen, mean difference: -6.8 mg/dL, 95% CI: -46.42 to 32.82, p = 0.74). Other clinical (sodium, potassium, phosphate, albumin, fasting sugar, uric acid, total cholesterol, triglycerides, C-reactive protein and hemoglobin) and anthropometric estimates (body mass index, systolic blood pressure and diastolic blood pressure) did not differ between the two groups.
Conclusions: This systematic review and meta-analysis suggests that the effects of LPD on the microbiota were observed predominantly at the family and species levels, with minimal effects on microbial diversity or richness. In the absence of global compositional microbiota shifts, the species-level changes appear insufficient to alter metabolic or clinical outputs.
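
The heterogeneity statistics and combined p-values named in the Methods of this abstract can be illustrated with a minimal sketch. It assumes inverse-variance weights, the DerSimonian-Laird estimator for Tau², and Fisher's method for combining p-values; the abstract does not say which estimators the authors used, and the function names `heterogeneity` and `fisher_combined_p` are hypothetical.

```python
import math

def heterogeneity(effects, variances):
    """Return (Cochran's Q, I^2 in percent, DerSimonian-Laird tau^2)."""
    weights = [1.0 / v for v in variances]          # inverse-variance weights
    sw = sum(weights)
    pooled = sum(w * y for w, y in zip(weights, effects)) / sw
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, effects))
    df = len(effects) - 1
    # I^2: share of total variation attributed to between-study differences
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    c = sw - sum(w * w for w in weights) / sw
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0
    return q, i2, tau2

def fisher_combined_p(pvalues):
    """Fisher's method (an assumption here): X = -2 * sum(ln p), chi-square with 2k df."""
    k = len(pvalues)
    half = -sum(math.log(p) for p in pvalues)       # X / 2
    # Closed-form chi-square survival function for even degrees of freedom 2k
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return math.exp(-half) * total

# Hypothetical per-study mean differences and variances, not data from the review
q, i2, tau2 = heterogeneity([0.0, 2.0], [1.0, 1.0])
```

With two studies of equal variance whose effects differ by 2, this gives Q = 2, I² = 50% and Tau² = 1, which makes the relationship between the three statistics easy to check by hand.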

193

Trivieri N, Pracella R, Cariglia MG, Panebianco C, Parrella P, Visioli A, Giani F, Soriano AA, Barile C, Canistro G, Latiano TP, Dimitri L, Bazzocchi F, Cassano D, Vescovi AL, Pazienza V, Binda E. BRAF^V600E mutation impinges on gut microbial markers defining novel biomarkers for serrated colorectal cancer effective therapies. J Exp Clin Cancer Res 2020;39:285. [PMID: 33317591 PMCID: PMC7737386 DOI: 10.1186/s13046-020-01801-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 10/12/2020] [Accepted: 12/04/2020] [Indexed: 12/12/2022]

Abstract

BACKGROUND

Colorectal cancer (CRC) harboring the BRAF^V600E mutation exhibits a low response to conventional therapy and the poorest prognosis. Due to the emerging correlation between gut microbiota and CRC carcinogenesis, we investigated in serrated BRAF^V600E cases the existence of a peculiar fecal microbial fingerprint and specific bacterial markers, which might represent a tool for the development of more effective clinical strategies.

METHODS

By injecting human CRC stem-like cells isolated from BRAF^V600E patients in immunocompromised mice, we described a new xenogeneic model of this subtype of CRC. By performing bacterial 16S rRNA sequencing, the fecal microbiota profile was then investigated either in CRC-carrying mice or in a cohort of human CRC subjects. The microbial communities' functional profile was also predicted. Data were compared with Mann-Whitney U, Welch's t-test for unequal variances and Kruskal-Wallis test with Benjamini-Hochberg false discovery rate (FDR) correction, extracted as potential BRAF class biomarkers and selected as model features. The obtained mean test prediction scores were subjected to Receiver Operating characteristic (ROC) analysis. To discriminate the BRAF status, a Random Forest classifier (RF) was employed.

RESULTS

A specific microbial signature distinctive for BRAF status emerged, with the BRAF-mutated cases closer to healthy controls than their BRAF wild-type counterparts. In agreement, a considerable correlation was also found between bacterial abundance in BRAF-mutated cases and the level of markers distinctive of the BRAF^V600E pathway, including those involved in inflammation, innate immune response and epithelial-mesenchymal transition. We provide evidence that two candidate bacterial markers, Prevotella enoeca and Ruthenibacterium lactatiformans, more abundant in BRAF^V600E and BRAF wild-type subjects respectively, emerged as single factors with the best performance in distinguishing BRAF status (AUROC = 0.72 and 0.74, respectively, 95% confidence interval). Furthermore, the combination of the 10 differentially represented microorganisms between the two groups improved performance in discriminating serrated CRC driven by BRAF mutation from BRAF wild-type CRC cases (AUROC = 0.85, 95% confidence interval, 0.69-1.01).

CONCLUSION

Overall, our results suggest that BRAF^V600E mutation itself drives a distinctive gut microbiota signature and provide predictive CRC-associated bacterial biomarkers able to discriminate BRAF status in CRC patients and, thus, useful to devise non-invasive patient-selective diagnostic strategies and patient-tailored optimized therapies.
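
The Benjamini-Hochberg false discovery rate correction mentioned in the Methods of this abstract can be sketched as follows. This is a generic step-up implementation under the usual definition of BH-adjusted p-values, not the authors' code; the function name `benjamini_hochberg` and the example p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return BH-adjusted p-values in the same order as the input."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical per-taxon p-values, not values from the study
adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
significant = [q <= 0.05 for q in adjusted]
```

Each raw p-value is scaled by the number of tests divided by its rank, and the running minimum keeps the adjusted values non-decreasing in the raw p-values, so a feature is never ranked as more significant after correction than one with a smaller raw p-value.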

Affiliation(s)

Nadia Trivieri Cancer Stem Cells Unit, ISBReMIT, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Riccardo Pracella Cancer Stem Cells Unit, ISBReMIT, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Maria Grazia Cariglia Cancer Stem Cells Unit, ISBReMIT, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Concetta Panebianco Gastroenterology Unit, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Paola Parrella Oncology Laboratory, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Alberto Visioli StemGen SpA, Milan, Italy
Fabrizio Giani StemGen SpA, Milan, Italy
Amata Amy Soriano Cancer Stem Cells Unit, ISBReMIT, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Chiara Barile Cancer Stem Cells Unit, ISBReMIT, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Giuseppe Canistro Abdominal Surgery Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, FG, Italy
Tiziana Pia Latiano Division of Medical Oncology, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, FG, Italy
Lucia Dimitri Anatomical Pathology Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, FG, Italy
Francesca Bazzocchi Abdominal Surgery Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, FG, Italy
Dario Cassano Abdominal Surgery Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, FG, Italy
Angelo L Vescovi StemGen SpA, Milan, Italy; Science Directorate, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, FG, Italy
Valerio Pazienza Gastroenterology Unit, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy
Elena Binda Cancer Stem Cells Unit, ISBReMIT, IRCCS Casa Sollievo della Sofferenza, Opera di San Pio da Pietrelcina, San Giovanni Rotondo, FG, Italy; Cancer Stem Cells Unit, Fondazione IRCCS Casa Sollievo della Sofferenza, Institute for Stem Cell Biology, Regenerative Medicine and Innovative Therapeutics (ISBReMIT), 71013, San Giovanni Rotondo, FG, Italy

194

Chen JCY, Tyler AD. Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data. Biol Direct 2020;15:29. [PMID: 33302990 PMCID: PMC7731568 DOI: 10.1186/s13062-020-00287-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 02/17/2020] [Accepted: 12/01/2020] [Indexed: 02/07/2023] Open

Abstract

Background

The advent of metagenomic sequencing provides microbial abundance patterns that can be leveraged for sample origin prediction. Supervised machine learning classification approaches have been reported to predict sample origin accurately when the origin has been previously sampled. Using metagenomic datasets provided by the 2019 CAMDA challenge, we evaluated the influence of variable technical, analytical and machine learning approaches for result interpretation and novel source prediction.

Results

Comparison between 16S rRNA amplicon and shotgun sequencing approaches as well as metagenomic analytical tools showed differences in normalized microbial abundance, especially for organisms present at low abundance. Shotgun sequence data analyzed using Kraken2 and Bracken for taxonomic annotation had higher detection sensitivity. As classification models are limited to labeling pre-trained origins, we took an alternative approach using Lasso-regularized multivariate regression to predict geographic coordinates for comparison. In both models, the prediction errors were much higher in Leave-1-city-out than in 10-fold cross validation; the former realistically forecasted the increased difficulty in accurately predicting samples from new origins. This challenge was further confirmed when applying the model to a set of samples obtained from new origins. Overall, the prediction performance of the regression and classification models, as measured by mean squared error, was comparable on mystery samples. Due to higher prediction error rates for samples from new origins, we provided an additional strategy based on prediction ambiguity to infer whether a sample is from a new origin. Lastly, we report increased prediction error when data from different sequencing protocols were included as training data.

Conclusions

Herein, we highlight the capacity of predicting sample origin accurately with pre-trained origins and the challenge of predicting new origins through both regression and classification models. Overall, this work provides a summary of the impact of sequencing technique, protocol, taxonomic analytical approaches, and machine learning approaches on the use of metagenomics for prediction of sample origin.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13062-020-00287-y.
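
The contrast this abstract draws between 10-fold and Leave-1-city-out cross validation comes down to how samples are split. The sketch below is a minimal illustration under assumed toy data: the helper names `leave_one_group_out` and `k_fold` and the city labels are hypothetical, and the k-fold variant uses contiguous folds without shuffling for simplicity.

```python
from collections import defaultdict

def leave_one_group_out(groups):
    """Yield (held_out_group, train_idx, test_idx), one split per distinct group."""
    by_group = defaultdict(list)
    for i, g in enumerate(groups):
        by_group[g].append(i)
    for g in sorted(by_group):
        test = by_group[g]
        held = set(test)
        train = [i for i in range(len(groups)) if i not in held]
        yield g, train, test

def k_fold(n, k):
    """Yield (train_idx, test_idx) for k contiguous folds over n samples."""
    for f in range(k):
        test = list(range(f * n // k, (f + 1) * n // k))
        held = set(test)
        train = [i for i in range(n) if i not in held]
        yield train, test

# Hypothetical toy labels: six samples drawn from three cities
cities = ["A", "A", "B", "B", "C", "C"]

# Group-wise splitting: the held-out city is never seen during training
for city, train, test in leave_one_group_out(cities):
    assert city not in {cities[i] for i in train}

# Plain k-fold: a city can appear on both sides of the split
leaky = [
    {cities[i] for i in train} & {cities[i] for i in test}
    for train, test in k_fold(len(cities), 2)
]
```

In the 2-fold split of the toy data, city B contributes one sample to each fold, so the model is always tested on a city it has already seen. That overlap is why error estimates from plain k-fold tend to be more optimistic than group-wise estimates when the goal is predicting new origins.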

195

Bokulich NA, Ziemski M, Robeson MS, Kaehler BD. Measuring the microbiome: Best practices for developing and benchmarking microbiomics methods. Comput Struct Biotechnol J 2020;18:4048-4062. [PMID: 33363701 PMCID: PMC7744638 DOI: 10.1016/j.csbj.2020.11.049] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Received: 09/25/2020] [Revised: 11/27/2020] [Accepted: 11/28/2020] [Indexed: 12/12/2022] Open

196

Kohli A, Holzwanger EA, Levy AN. Emerging use of artificial intelligence in inflammatory bowel disease. World J Gastroenterol 2020;26:6923-6928. [PMID: 33311940 PMCID: PMC7701951 DOI: 10.3748/wjg.v26.i44.6923] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 08/31/2020] [Revised: 10/24/2020] [Accepted: 11/12/2020] [Indexed: 02/06/2023] Open

197

Fermented food products in the era of globalization: tradition meets biotechnology innovations. Curr Opin Biotechnol 2020;70:36-41. [PMID: 33232845 DOI: 10.1016/j.copbio.2020.10.006] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Received: 08/04/2020] [Revised: 09/18/2020] [Accepted: 10/19/2020] [Indexed: 02/06/2023]

198

Alvarez-Pitti J, de Blas A, Lurbe E. Innovations in Infant Feeding: Future Challenges and Opportunities in Obesity and Cardiometabolic Disease. Nutrients 2020;12:nu12113508. [PMID: 33202614 PMCID: PMC7697724 DOI: 10.3390/nu12113508] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 10/16/2020] [Revised: 11/11/2020] [Accepted: 11/11/2020] [Indexed: 12/15/2022] Open

199

Popa O, Oldenburg E, Ebenhöh O. From sequence to information. Philos Trans R Soc Lond B Biol Sci 2020;375:20190448. [PMID: 33131436 PMCID: PMC7662195 DOI: 10.1098/rstb.2019.0448] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 12/12/2022] Open

200

Microbiome of the first stool after birth and infantile colic. Pediatr Res 2020;88:776-783. [PMID: 32053826 DOI: 10.1038/s41390-020-0804-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Received: 09/23/2019] [Revised: 12/16/2019] [Accepted: 01/28/2020] [Indexed: 11/08/2022]