1
|
Macadangdang BR, Wang Y, Woodward C, Revilla JI, Shaw BM, Sasaninia K, Makanani SK, Berruto C, Ahuja U, Miller JF. Targeted protein evolution in the gut microbiome by diversity-generating retroelements. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.15.621889. [PMID: 39605476 PMCID: PMC11601372 DOI: 10.1101/2024.11.15.621889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Diversity-generating retroelements (DGRs) accelerate evolution by rapidly diversifying variable proteins. The human gastrointestinal microbiota harbors the greatest density of DGRs known in nature, suggesting they play adaptive roles in this environment. We identified >1,100 unique DGRs among human-associated Bacteroides species and discovered a subset that diversify adhesive components of Type V pili and related proteins. We show that Bacteroides DGRs are horizontally transferred across species, that some are highly active while others are tightly controlled, and that they preferentially alter the functional characteristics of ligand-binding residues on adhesive organelles. Specific variable protein sequences are enriched when Bacteroides strains compete with other commensal bacteria in gnotobiotic mice. Analysis of >2,700 DGRs from diverse phyla in mother-infant pairs shows that Bacteroides DGRs are preferentially transferred to vaginally delivered infants where they actively diversify. Our observations provide a foundation for understanding the roles of stochastic, targeted genome plasticity in shaping host-associated microbial communities.
Collapse
Affiliation(s)
- Benjamin R. Macadangdang
- Division of Neonatology and Developmental Biology, Department of Pediatrics, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, CA, United States
- California NanoSystems Institute, Los Angeles, CA, United States
| | - Yanling Wang
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California, United States
| | - Cora Woodward
- California NanoSystems Institute, Los Angeles, CA, United States
| | - Jessica I. Revilla
- Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, Los Angeles, CA, United States
| | - Bennett M. Shaw
- David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, United States
| | - Kayvan Sasaninia
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California, United States
| | - Sara K. Makanani
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California, United States
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, United States
| | - Chiara Berruto
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California, United States
| | - Umesh Ahuja
- California NanoSystems Institute, Los Angeles, CA, United States
| | - Jeff F. Miller
- California NanoSystems Institute, Los Angeles, CA, United States
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California, United States
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, United States
- Lead contact
| |
Collapse
|
2
|
Dindhoria K, Manyapu V, Ali A, Kumar R. Unveiling the role of emerging metagenomics for the examination of hypersaline environments. Biotechnol Genet Eng Rev 2024; 40:2090-2128. [PMID: 37017219 DOI: 10.1080/02648725.2023.2197717] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 03/28/2023] [Indexed: 04/06/2023]
Abstract
Hypersaline ecosystems are distributed all over the globe. They are subjected to poly-extreme stresses and are inhabited by halophilic microorganisms possessing multiple adaptations. The halophiles have many biotechnological applications such as nutrient supplements, antioxidant synthesis, salt tolerant enzyme production, osmolyte synthesis, biofuel production, electricity generation etc. However, halophiles are still underexplored in terms of complex ecological interactions and functions as compared to other niches. The advent of metagenomics and the recent advancement of next-generation sequencing tools have made it feasible to investigate the microflora of an ecosystem, its interactions and functions. Both target gene and shotgun metagenomic approaches are commonly employed for the taxonomic, phylogenetic, and functional analyses of the hypersaline microbial communities. This review discusses different types of hypersaline niches, their residential microflora, and an overview of the metagenomic approaches used to investigate them. Various applications, hurdles and the recent advancements in metagenomic approaches have also been focused on here for their better understanding and utilization in the study of hypersaline microbiome.
Collapse
Affiliation(s)
- Kiran Dindhoria
- Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology Palampur, Palampur, Himachal Pradesh, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Vivek Manyapu
- Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology Palampur, Palampur, Himachal Pradesh, India
| | - Ashif Ali
- Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology Palampur, Palampur, Himachal Pradesh, India
| | - Rakshak Kumar
- Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology Palampur, Palampur, Himachal Pradesh, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
3
|
Chaudhari JK, Pant S, Jha R, Pathak RK, Singh DB. Biological big-data sources, problems of storage, computational issues, and applications: a comprehensive review. Knowl Inf Syst 2024; 66:3159-3209. [DOI: 10.1007/s10115-023-02049-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 09/12/2023] [Accepted: 12/11/2023] [Indexed: 01/03/2025]
|
4
|
Samantray D, Tanwar AS, Murali TS, Brand A, Satyamoorthy K, Paul B. A Comprehensive Bioinformatics Resource Guide for Genome-Based Antimicrobial Resistance Studies. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2023; 27:445-460. [PMID: 37861712 DOI: 10.1089/omi.2023.0140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2023]
Abstract
The use of high-throughput sequencing technologies and bioinformatic tools has greatly transformed microbial genome research. With the help of sophisticated computational tools, it has become easier to perform whole genome assembly, identify and compare different species based on their genomes, and predict the presence of genes responsible for proteins, antimicrobial resistance, and toxins. These bioinformatics resources are likely to continuously improve in quality, become more user-friendly to analyze the multiple genomic data, efficient in generating information and translating it into meaningful knowledge, and enhance our understanding of the genetic mechanism of AMR. In this manuscript, we provide an essential guide for selecting the popular resources for microbial research, such as genome assembly and annotation, antibiotic resistance gene profiling, identification of virulence factors, and drug interaction studies. In addition, we discuss the best practices in computer-oriented microbial genome research, emerging trends in microbial genomic data analysis, integration of multi-omics data, the appropriate use of machine-learning algorithms, and open-source bioinformatics resources for genome data analytics.
Collapse
Affiliation(s)
- Debyani Samantray
- Department of Bioinformatics, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, India
| | - Ankit Singh Tanwar
- United Nations University-Maastricht Economic and Social Research Institute on Innovation and Technology (UNU-MERIT), Maastricht, The Netherlands
- Faculty of Health, Medicine and Life Sciences (FHML), Maastricht University, Maastricht, The Netherlands
| | - Thokur Sreepathy Murali
- Department of Biotechnology, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, India
| | - Angela Brand
- United Nations University-Maastricht Economic and Social Research Institute on Innovation and Technology (UNU-MERIT), Maastricht, The Netherlands
- Faculty of Health, Medicine and Life Sciences (FHML), Maastricht University, Maastricht, The Netherlands
- Department of Health Information, Prasanna School of Public Health (PSPH), Manipal Academy of Higher Education, Manipal, India
| | - Kapaettu Satyamoorthy
- SDM College of Medical Sciences and Hospital, Shri Dharmasthala Manjunatheshwara (SDM) University, Dharwad, India
| | - Bobby Paul
- Department of Bioinformatics, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, India
| |
Collapse
|
5
|
Lema NK, Gemeda MT, Woldesemayat AA. Recent Advances in Metagenomic Approaches, Applications, and Challenge. Curr Microbiol 2023; 80:347. [PMID: 37733134 DOI: 10.1007/s00284-023-03451-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 08/20/2023] [Indexed: 09/22/2023]
Abstract
Advances in metagenomics analysis with the advent of next-generation sequencing have extended our knowledge of microbial communities as compared to conventional techniques providing advanced approach to identify novel and uncultivable microorganisms based on their genetic information derived from a particular environment. Shotgun metagenomics involves investigating the DNA of the entire community without the requirement of PCR amplification. It provides access to study all genes present in the sample. On the other hand, amplicon sequencing targets taxonomically important marker genes, the analysis of which is restricted to previously known DNA sequences. While sequence-based metagenomics is used to analyze DNA sequences directly from the environment without the requirement of library construction and with limited identification of novel genes and products that can be complemented by functional genomics, function-based metagenomics requires fragmentation and cloning of extracted metagenome DNA in a suitable host with subsequent functional screening and sequencing clone for detection of a novel gene. Although advances were made in metagenomics, different challenges arise. This review provides insight into advances in the metagenomic approaches combined with next-generation sequencing, their recent applications highlighting the emerging ones, such as in astrobiology, forensic sciences, and SARS-CoV-2 infection diagnosis, and the challenges associated. This review further discusses the different types of metagenomics and outlines advancements in bioinformatics tools and their significance in the analysis of metagenomic datasets.
Collapse
Affiliation(s)
- Niguse K Lema
- Department of Biotechnology, College of Biological and Chemical Engineering, Addis Ababa Science and Technology University, Addis Ababa, Ethiopia
- Biotechnology and Bioprocess Center of Excellence, Addis Ababa Science and Technology University, Addis Ababa, Ethiopia
- Department of Biotechnology, Arba Minch University, Arba Minch, Ethiopia
| | - Mesfin T Gemeda
- Department of Biotechnology, College of Biological and Chemical Engineering, Addis Ababa Science and Technology University, Addis Ababa, Ethiopia
- Biotechnology and Bioprocess Center of Excellence, Addis Ababa Science and Technology University, Addis Ababa, Ethiopia
| | - Adugna A Woldesemayat
- Department of Biotechnology, College of Biological and Chemical Engineering, Addis Ababa Science and Technology University, Addis Ababa, Ethiopia.
- Biotechnology and Bioprocess Center of Excellence, Addis Ababa Science and Technology University, Addis Ababa, Ethiopia.
| |
Collapse
|
6
|
Yang J, He Q, Lu F, Chen K, Ni Z, Wang H, Zhou C, Zhang Y, Chen B, Bo Z, Li J, Yu H, Wang Y, Chen G. A distinct microbiota signature precedes the clinical diagnosis of hepatocellular carcinoma. Gut Microbes 2023; 15:2201159. [PMID: 37089022 PMCID: PMC10128432 DOI: 10.1080/19490976.2023.2201159] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Accepted: 04/05/2023] [Indexed: 04/25/2023] Open
Abstract
Oral, gut, and tumor microbiota have been implicated as important regulators in the carcinogenesis and progression of gastrointestinal malignancies. However, few studies focused on the existence and association of resident microbes within different body regions. Herein, we aim to reveal the durability of the oral-gut-tumor microbiome and its diagnostic performance in hepatocellular carcinoma (HCC). Our study included two cohorts: a retrospective discovery cohort of 364 HBV-HCC patients and 160 controls with oral or fecal samples, a prospective validation cohort of 91 cases, and 124 controls for matching samples, as well as 48 HBV, and 39 HBV-cirrhosis patients for gut microbial patterns examined by 16S rRNA gene sequencing. With the random forest analysis, 10 oral and 9 gut genera that could distinguish HCC from controls in the retrospective cohort were validated among the prospective matching participants, with area under the curve (AUC) values of 0.7971 and 0.8084, respectively. When influential taxa were merged, the AUC of the consistent classifier increased to 0.9405. The performance continued to improve to 0.9811 when combined with serum levels of alpha-fetoprotein (AFP). Specifically, microbial biomarkers represented by Streptococcus displayed a constantly increasing trend during the disease transition. Furthermore, the presence of several dominant microbiota species was confirmed in hepatic tumor and non-tumor tissues with fluorescence in situ hybridization (FISH) and 5 R 16S rRNA gene sequencing. Overall, our findings based on the oral-gut-tumor microbiota provide a reliable approach for the early detection of HCC.
Collapse
Affiliation(s)
- Jinhuan Yang
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Qikuan He
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Fei Lu
- The First School of Medicine, School of Information and Engineering, Wenzhou Medical University, Wenzhou, China
| | - Kaiwen Chen
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - ZhiHao Ni
- School of Nursing, Wenzhou Medical University, Wenzhou, China
| | - Haoyue Wang
- School of Mental Health, Wenzhou Medical University, Wenzhou, China
| | - Chen Zhou
- The First School of Medicine, School of Information and Engineering, Wenzhou Medical University, Wenzhou, China
| | - Yaosheng Zhang
- The First School of Medicine, School of Information and Engineering, Wenzhou Medical University, Wenzhou, China
| | - Bo Chen
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Zhiyuan Bo
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Jialiang Li
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Haitao Yu
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Yi Wang
- Department of Epidemiology and Biostatistics, School of Public Health and Management, Wenzhou Medical University; Chashan High Education Zone, Wenzhou, China
| | - Gang Chen
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
- Key Laboratory of Diagnosis and Treatment of Severe Hepato-Pancreatic Diseases of Zhejiang Province, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| |
Collapse
|
7
|
Liu J, Sun J, Liu Y. Effective Identification of Bacterial Genomes From Short and Long Read Sequencing Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2806-2816. [PMID: 34232887 DOI: 10.1109/tcbb.2021.3095164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
With the development of sequencing technology, microbiological genome sequencing analysis has attracted extensive attention. For inexperienced users without sufficient bioinformatics skills, making sense of sequencing data for microbial identification, especially for bacterial identification, through reads analysis is still challenging. In order to address the challenge of effectively analyzing genomic information, in this paper, we develop an effective approach and automatic bioinformatics pipeline called PBGI for bacterial genome identification, performing automatedly and customized bioinformatics analysis using short-reads or long-reads sequencing data produced by multiple platforms such as Illumina, PacBio and Oxford Nanopore. An evaluation of the proposed approach on the practical data set is presented, showing that PBGI provides a user-friendly way to perform bacterial identification through short or long reads analysis, and could provide accurate analyzing results. The source code of the PBGI is freely available at https://github.com/lyotvincent/PBGI.
Collapse
|
8
|
Lewin GR, Davis NM, McDonald BR, Book AJ, Chevrette MG, Suh S, Boll A, Currie CR. Long-Term Cellulose Enrichment Selects for Highly Cellulolytic Consortia and Competition for Public Goods. mSystems 2022; 7:e0151921. [PMID: 35258341 PMCID: PMC9040578 DOI: 10.1128/msystems.01519-21] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 02/10/2022] [Indexed: 11/23/2022] Open
Abstract
The complexity of microbial communities hinders our understanding of how microbial diversity and microbe-microbe interactions impact community functions. Here, using six independent communities originating from the refuse dumps of leaf-cutter ants and enriched using the plant polymer cellulose as the sole source of carbon, we examine how changes in bacterial diversity and interactions impact plant biomass decomposition. Over up to 60 serial transfers (∼8 months) using Whatman cellulose filter paper, cellulolytic ability increased and then stabilized in four enrichment lines and was variable in two lines. Bacterial community characterization using 16S rRNA gene amplicon sequencing showed community succession differed between the highly cellulolytic enrichment lines and those that had slower and more variable cellulose degradation rates. Metagenomic and metatranscriptomic analyses revealed that Cellvibrio and/or Cellulomonas dominated each enrichment line and produced the majority of cellulase enzymes, while diverse taxa were retained within these communities over the duration of transfers. Interestingly, the less cellulolytic communities had a higher diversity of organisms competing for the cellulose breakdown product cellobiose, suggesting that cheating slowed cellulose degradation. In addition, we found competitive exclusion as an important factor shaping all of the communities, with a negative correlation of Cellvibrio and Cellulomonas abundance within individual enrichment lines and the expression of genes associated with the production of secondary metabolites, toxins, and other antagonistic compounds. Our results provide insights into how microbial diversity and competition affect the stability and function of cellulose-degrading communities. IMPORTANCE Microbial communities are a key driver of the carbon cycle through the breakdown of complex polysaccharides in diverse environments including soil, marine systems, and the mammalian gut. However, due to the complexity of these communities, the species-species interactions that impact community structure and ultimately shape the rate of decomposition are difficult to define. Here, we performed serial enrichment on cellulose using communities inoculated from leaf-cutter ant refuse dumps, a cellulose-rich environment. By concurrently tracking cellulolytic ability and community composition and through metagenomic and metatranscriptomic sequencing, we analyzed the ecological dynamics of the enrichment lines. Our data suggest that antagonism is prevalent in these communities and that competition for soluble sugars may slow degradation and lead to community instability. Together, these results help reveal the relationships between competition and polysaccharide decomposition, with implications in diverse areas ranging from microbial community ecology to cellulosic biofuels production.
Collapse
Affiliation(s)
- Gina R. Lewin
- Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Nicole M. Davis
- Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Bradon R. McDonald
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Adam J. Book
- Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Marc G. Chevrette
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Wisconsin Institute for Discovery and Department of Plant Pathology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Steven Suh
- Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Ardina Boll
- Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Cameron R. Currie
- Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| |
Collapse
|
9
|
Kesh K, Mendez R, Mateo-Victoriano B, Garrido VT, Durden B, Gupta VK, Oliveras Reyes A, Merchant N, Datta J, Banerjee S, Banerjee S. Obesity enriches for tumor protective microbial metabolites and treatment refractory cells to confer therapy resistance in PDAC. Gut Microbes 2022; 14:2096328. [PMID: 35816618 PMCID: PMC9275504 DOI: 10.1080/19490976.2022.2096328] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 06/20/2022] [Indexed: 02/04/2023] Open
Abstract
Obesity causes chronic inflammation and changes in gut microbiome. However, how this contributes to poor survival and therapy resistance in patients with pancreatic cancer remain undetermined. Our current study shows that high fat diet-fed obese pancreatic tumor bearing mice do not respond to standard of care therapy with gemcitabine and paclitaxel when compared to corresponding control diet-fed mice. C57BL6 mice were put on control and high fat diet for 1 month following with pancreatic tumors were implanted in both groups. Microbiome of lean (control) and obese (high fat diet fed) mice was analyzed. Fecal matter transplant from control mice to obese mice sensitized tumors to chemotherapy and demonstrated extensive cell death. Analysis of gut microbiome showed an enrichment of queuosine (Q) producing bacteria in obese mice and an enrichment of S-adenosyl methionine (SAM) producing bacteria in control diet-fed mice. Further, supplementation of obese animals with SAM sensitized pancreatic tumors to chemotherapy. Treatment of pancreatic cancer cells with Q increased PRDX1 involved in oxidative stress protection. In parallel, tumors in obese mice showed increase in CD133+ treatment refractory tumor populations compared to control animals. These observations indicated that microbial metabolite Q accumulation in high fat diet-fed mice protected tumors from chemotherapy induced oxidative stress by upregulating PRDX1. This protection could be reversed by treatment with SAM. We conclude that relative concentration of SAM and queuosine in fecal samples of pancreatic cancer patients can be developed as a potential biomarker and therapeutic target in chemotherapy refractory pancreatic cancer.
Collapse
Affiliation(s)
- Kousik Kesh
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Roberto Mendez
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | | | - Vanessa T Garrido
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Brittany Durden
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
| | - Vineet K Gupta
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | | | - Nipun Merchant
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Jashodeep Datta
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Santanu Banerjee
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
- Miami Integrative Metabolomics Research Center, University of Miami, Miami, FL, USA
| | - Sulagna Banerjee
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| |
Collapse
|
10
|
Choudhari J, Choubey J, Verma M, Chatterjee T, Sahariah B. Metagenomics: the boon for microbial world knowledge and current challenges. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00022-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
11
|
Raj A, Kumar A, Dames JF. Tapping the Role of Microbial Biosurfactants in Pesticide Remediation: An Eco-Friendly Approach for Environmental Sustainability. Front Microbiol 2021; 12:791723. [PMID: 35003022 PMCID: PMC8733403 DOI: 10.3389/fmicb.2021.791723] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 11/22/2021] [Indexed: 11/15/2022] Open
Abstract
Pesticides are used indiscriminately all over the world to protect crops from pests and pathogens. If they are used in excess, they contaminate the soil and water bodies and negatively affect human health and the environment. However, bioremediation is the most viable option to deal with these pollutants, but it has certain limitations. Therefore, harnessing the role of microbial biosurfactants in pesticide remediation is a promising approach. Biosurfactants are the amphiphilic compounds that can help to increase the bioavailability of pesticides, and speeds up the bioremediation process. Biosurfactants lower the surface area and interfacial tension of immiscible fluids and boost the solubility and sorption of hydrophobic pesticide contaminants. They have the property of biodegradability, low toxicity, high selectivity, and broad action spectrum under extreme pH, temperature, and salinity conditions, as well as a low critical micelle concentration (CMC). All these factors can augment the process of pesticide remediation. Application of metagenomic and in-silico tools would help by rapidly characterizing pesticide degrading microorganisms at a taxonomic and functional level. A comprehensive review of the literature shows that the role of biosurfactants in the biological remediation of pesticides has received limited attention. Therefore, this article is intended to provide a detailed overview of the role of various biosurfactants in improving pesticide remediation as well as different methods used for the detection of microbial biosurfactants. Additionally, this article covers the role of advanced metagenomics tools in characterizing the biosurfactant producing pesticide degrading microbes from different environments.
Collapse
Affiliation(s)
- Aman Raj
- Metagenomics and Secretomics Research Laboratory, Department of Botany, Dr. Harisingh Gour University (Central University), Sagar, India
| | - Ashwani Kumar
- Metagenomics and Secretomics Research Laboratory, Department of Botany, Dr. Harisingh Gour University (Central University), Sagar, India
- Mycorrhizal Research Laboratory, Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa
| | - Joanna Felicity Dames
- Mycorrhizal Research Laboratory, Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa
| |
Collapse
|
12
|
Curry KD, Nute MG, Treangen TJ. It takes guts to learn: machine learning techniques for disease detection from the gut microbiome. Emerg Top Life Sci 2021; 5:815-827. [PMID: 34779841 PMCID: PMC8786294 DOI: 10.1042/etls20210213] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 09/29/2021] [Accepted: 10/06/2021] [Indexed: 02/01/2023]
Abstract
Associations between the human gut microbiome and expression of host illness have been noted in a variety of conditions ranging from gastrointestinal dysfunctions to neurological deficits. Machine learning (ML) methods have generated promising results for disease prediction from gut metagenomic information for diseases including liver cirrhosis and irritable bowel disease, but have lacked efficacy when predicting other illnesses. Here, we review current ML methods designed for disease classification from microbiome data. We highlight the computational challenges these methods have effectively overcome and discuss the biological components that have been overlooked to offer perspectives on future work in this area.
Collapse
Affiliation(s)
- Kristen D. Curry
- Department of Computer Science, Rice University, Houston, TX 77005, USA
| | - Michael G. Nute
- Department of Computer Science, Rice University, Houston, TX 77005, USA
| | - Todd J. Treangen
- Department of Computer Science, Rice University, Houston, TX 77005, USA
| |
Collapse
|
13
|
MiDSystem: A comprehensive online system for de novo assembly and analysis of microbial genomes. N Biotechnol 2021; 65:42-52. [PMID: 34411700 DOI: 10.1016/j.nbt.2021.08.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 08/13/2021] [Accepted: 08/14/2021] [Indexed: 12/12/2022]
Abstract
The substantial reduction in experimental cost of next-generation sequencing techniques makes it feasible to assemble a bacterial genome of unknown species de novo and acquire substantial genetic information from environmental samples. Many bioinformatics tools and algorithms have also been developed for prokaryotes, but complex parameter settings and command line-based user interfaces cause a significant entry barrier for novices. Efficient construction of pipelines that integrate all the available genomic data poses a major challenge to the understanding of unknown pathogens. MiDSystem is a comprehensive online system for analyzing genomic data from microbiomes. With a user-friendly interface, MiDSystem supports both de novo assembly and metagenomic analysis pipelines. It is designed to automatically analyze whole genome shotgun sequencing data of bacteria submitted by users. Multiple analytical steps can be performed directly on the system, and the results generated from the embedded tools are visualized in an online summary report to make it more interpretable. Constructing a genome de novo has gradually become the foundation of bacterial studies. Taking both single species and metagenomic samples into consideration, MiDSystem can greatly reduce the time and effort for analysis of bacterial genomic data. Use of MiDSystem will enable more focus to be placed on understanding the etiology of bacterial infections and microorganism ecologies.
Collapse
|
14
|
Music of metagenomics-a review of its applications, analysis pipeline, and associated tools. Funct Integr Genomics 2021; 22:3-26. [PMID: 34657989 DOI: 10.1007/s10142-021-00810-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 09/25/2021] [Accepted: 10/03/2021] [Indexed: 10/20/2022]
Abstract
This humble effort highlights the intricate details of metagenomics in a simple, poetic, and rhythmic way. The paper enforces the significance of the research area, provides details about major analytical methods, examines the taxonomy and assembly of genomes, emphasizes some tools, and concludes by celebrating the richness of the ecosystem populated by the "metagenome."
Collapse
|
15
|
Kayani MUR, Huang W, Feng R, Chen L. Genome-resolved metagenomics using environmental and clinical samples. Brief Bioinform 2021; 22:bbab030. [PMID: 33758906 PMCID: PMC8425419 DOI: 10.1093/bib/bbab030] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 11/29/2020] [Accepted: 01/20/2021] [Indexed: 12/25/2022] Open
Abstract
Recent advances in high-throughput sequencing technologies and computational methods have added a new dimension to metagenomic data analysis i.e. genome-resolved metagenomics. In general terms, it refers to the recovery of draft or high-quality microbial genomes and their taxonomic classification and functional annotation. In recent years, several studies have utilized the genome-resolved metagenome analysis approach and identified previously unknown microbial species from human and environmental metagenomes. In this review, we describe genome-resolved metagenome analysis as a series of four necessary steps: (i) preprocessing of the sequencing reads, (ii) de novo metagenome assembly, (iii) genome binning and (iv) taxonomic and functional analysis of the recovered genomes. For each of these four steps, we discuss the most commonly used tools and the currently available pipelines to guide the scientific community in the recovery and subsequent analyses of genomes from any metagenome sample. Furthermore, we also discuss the tools required for validation of assembly quality as well as for improving quality of the recovered genomes. We also highlight the currently available pipelines that can be used to automate the whole analysis without having advanced bioinformatics knowledge. Finally, we will highlight the most widely adapted and actively maintained tools and pipelines that can be helpful to the scientific community in decision making before they commence the analysis.
Collapse
Affiliation(s)
- Masood ur Rehman Kayani
- Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China
| | - Wanqiu Huang
- Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 200,000, China
| | - Ru Feng
- Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China
| | - Lei Chen
- Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China
| |
Collapse
|
16
|
Ayling M, Clark MD, Leggett RM. New approaches for metagenome assembly with short reads. Brief Bioinform 2021; 21:584-594. [PMID: 30815668 PMCID: PMC7299287 DOI: 10.1093/bib/bbz020] [Citation(s) in RCA: 110] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 01/31/2019] [Accepted: 02/01/2019] [Indexed: 02/07/2023] Open
Abstract
In recent years, the use of longer range read data combined with advances in assembly algorithms has stimulated big improvements in the contiguity and quality of genome assemblies. However, these advances have not directly transferred to metagenomic data sets, as assumptions made by the single genome assembly algorithms do not apply when assembling multiple genomes at varying levels of abundance. The development of dedicated assemblers for metagenomic data was a relatively late innovation and for many years, researchers had to make do using tools designed for single genomes. This has changed in the last few years and we have seen the emergence of a new type of tool built using different principles. In this review, we describe the challenges inherent in metagenomic assemblies and compare the different approaches taken by these novel assembly tools.
Collapse
Affiliation(s)
- Martin Ayling
- Earlham Institute, Norwich Research Park, Norwich, UK
| | | | | |
Collapse
|
17
|
Bisht V, Acharjee A, Gkoutos GV. NFnetFu: A novel workflow for microbiome data fusion. Comput Biol Med 2021; 135:104556. [PMID: 34216888 PMCID: PMC8404037 DOI: 10.1016/j.compbiomed.2021.104556] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Revised: 06/04/2021] [Accepted: 06/04/2021] [Indexed: 12/18/2022]
Abstract
Microbiome data analysis and its interpretation into meaningful biological insights remain very challenging for numerous reasons, perhaps most prominently, due to the need to account for multiple factors, including collinearity, sparsity (excessive zeros) and effect size, that the complex experimental workflow and subsequent downstream data analysis require. Moreover, a meaningful microbiome data analysis necessitates the development of interpretable models that incorporate inferences across available data as well as background biomedical knowledge. We developed a multimodal framework that considers sparsity (excessive zeros), lower effect size, intrinsically microbial correlations, i.e., collinearity, as well as background biomedical knowledge in the form of a cluster-infused enriched network architecture. Finally, our framework also provides a candidate taxa/Operational Taxonomic Unit (OTU) that can be targeted for future validation experiments. We have developed a tool, the term NFnetFU (Neuro Fuzzy network Fusion), that encompasses our framework and have made it freely available at https://github.com/VartikaBisht6197/NFnetFu.
Collapse
Affiliation(s)
- Vartika Bisht
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, B15 2TT, UK
| | - Animesh Acharjee
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, B15 2TT, UK; Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, B15 2TT, UK; NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, B15 2WB, UK; MRC Health Data Research UK HDR, UK.
| | - Georgios V Gkoutos
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, B15 2TT, UK; Institute of Translational Medicine, University Hospitals Birmingham NHS, Foundation Trust, B15 2TT, UK; NIHR Surgical Reconstruction and Microbiology Research Centre, University Hospital Birmingham, Birmingham, B15 2WB, UK; MRC Health Data Research UK HDR, UK; NIHR Experimental Cancer Medicine Centre, B15 2TT, Birmingham, UK; NIHR Biomedical Research Centre, University Hospital Birmingham, Birmingham, B15 2TT, UK
| |
Collapse
|
18
|
Martínez Arbas S, Busi SB, Queirós P, de Nies L, Herold M, May P, Wilmes P, Muller EEL, Narayanasamy S. Challenges, Strategies, and Perspectives for Reference-Independent Longitudinal Multi-Omic Microbiome Studies. Front Genet 2021; 12:666244. [PMID: 34194470 PMCID: PMC8236828 DOI: 10.3389/fgene.2021.666244] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/30/2021] [Indexed: 12/21/2022] Open
Abstract
In recent years, multi-omic studies have enabled resolving community structure and interrogating community function of microbial communities. Simultaneous generation of metagenomic, metatranscriptomic, metaproteomic, and (meta) metabolomic data is more feasible than ever before, thus enabling in-depth assessment of community structure, function, and phenotype, thus resulting in a multitude of multi-omic microbiome datasets and the development of innovative methods to integrate and interrogate those multi-omic datasets. Specifically, the application of reference-independent approaches provides opportunities in identifying novel organisms and functions. At present, most of these large-scale multi-omic datasets stem from spatial sampling (e.g., water/soil microbiomes at several depths, microbiomes in/on different parts of the human anatomy) or case-control studies (e.g., cohorts of human microbiomes). We believe that longitudinal multi-omic microbiome datasets are the logical next step in microbiome studies due to their characteristic advantages in providing a better understanding of community dynamics, including: observation of trends, inference of causality, and ultimately, prediction of community behavior. Furthermore, the acquisition of complementary host-derived omics, environmental measurements, and suitable metadata will further enhance the aforementioned advantages of longitudinal data, which will serve as the basis to resolve drivers of community structure and function to understand the biotic and abiotic factors governing communities and specific populations. Carefully setup future experiments hold great potential to further unveil ecological mechanisms to evolution, microbe-microbe interactions, or microbe-host interactions. In this article, we discuss the challenges, emerging strategies, and best-practices applicable to longitudinal microbiome studies ranging from sampling, biomolecular extraction, systematic multi-omic measurements, reference-independent data integration, modeling, and validation.
Collapse
Affiliation(s)
- Susana Martínez Arbas
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Susheel Bhanu Busi
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Pedro Queirós
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Laura de Nies
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Malte Herold
- Department of Environmental Research and Innovation, Luxembourg Institute of Science and Technology, Belvaux, Luxembourg
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Emilie E. L. Muller
- Université de Strasbourg, UMR 7156 CNRS, Génétique Moléculaire, Génomique, Microbiologie, Strasbourg, France
| | - Shaman Narayanasamy
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
19
|
Graw S, Chappell K, Washam CL, Gies A, Bird J, Robeson MS, Byrum SD. Multi-omics data integration considerations and study design for biological systems and disease. Mol Omics 2021; 17:170-185. [PMID: 33347526 PMCID: PMC8058243 DOI: 10.1039/d0mo00041h] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
With the advancement of next-generation sequencing and mass spectrometry, there is a growing need for the ability to merge biological features in order to study a system as a whole. Features such as the transcriptome, methylome, proteome, histone post-translational modifications and the microbiome all influence the host response to various diseases and cancers. Each of these platforms have technological limitations due to sample preparation steps, amount of material needed for sequencing, and sequencing depth requirements. These features provide a snapshot of one level of regulation in a system. The obvious next step is to integrate this information and learn how genes, proteins, and/or epigenetic factors influence the phenotype of a disease in context of the system. In recent years, there has been a push for the development of data integration methods. Each method specifically integrates a subset of omics data using approaches such as conceptual integration, statistical integration, model-based integration, networks, and pathway data integration. In this review, we discuss considerations of the study design for each data feature, the limitations in gene and protein abundance and their rate of expression, the current data integration methods, and microbiome influences on gene and protein expression. The considerations discussed in this review should be regarded when developing new algorithms for integrating multi-omics data.
Collapse
Affiliation(s)
- Stefan Graw
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Kevin Chappell
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Charity L Washam
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA. and Arkansas Children's Research Institute, 13 Children's Way, Little Rock, AR 72202, USA
| | - Allen Gies
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Jordan Bird
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Michael S Robeson
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA.
| | - Stephanie D Byrum
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA. and Arkansas Children's Research Institute, 13 Children's Way, Little Rock, AR 72202, USA
| |
Collapse
|
20
|
Lapidus AL, Korobeynikov AI. Metagenomic Data Assembly - The Way of Decoding Unknown Microorganisms. Front Microbiol 2021; 12:613791. [PMID: 33833738 PMCID: PMC8021871 DOI: 10.3389/fmicb.2021.613791] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Accepted: 03/03/2021] [Indexed: 01/08/2023] Open
Abstract
Metagenomics is a segment of conventional microbial genomics dedicated to the sequencing and analysis of combined genomic DNA of entire environmental samples. The most critical step of the metagenomic data analysis is the reconstruction of individual genes and genomes of the microorganisms in the communities using metagenomic assemblers - computational programs that put together small fragments of sequenced DNA generated by sequencing instruments. Here, we describe the challenges of metagenomic assembly, a wide spectrum of applications in which metagenomic assemblies were used to better understand the ecology and evolution of microbial ecosystems, and present one of the most efficient microbial assemblers, SPAdes that was upgraded to become applicable for metagenomics.
Collapse
Affiliation(s)
- Alla L. Lapidus
- Center for Algorithmic Biotechnology, St. Petersburg State University, Saint Petersburg, Russia
| | | |
Collapse
|
21
|
Moreno-Indias I, Lahti L, Nedyalkova M, Elbere I, Roshchupkin G, Adilovic M, Aydemir O, Bakir-Gungor B, Santa Pau ECD, D’Elia D, Desai MS, Falquet L, Gundogdu A, Hron K, Klammsteiner T, Lopes MB, Marcos-Zambrano LJ, Marques C, Mason M, May P, Pašić L, Pio G, Pongor S, Promponas VJ, Przymus P, Saez-Rodriguez J, Sampri A, Shigdel R, Stres B, Suharoschi R, Truu J, Truică CO, Vilne B, Vlachakis D, Yilmaz E, Zeller G, Zomer AL, Gómez-Cabrero D, Claesson MJ. Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions. Front Microbiol 2021; 12:635781. [PMID: 33692771 PMCID: PMC7937616 DOI: 10.3389/fmicb.2021.635781] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 01/28/2021] [Indexed: 12/23/2022] Open
Abstract
The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies, the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome" that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies.
Collapse
Affiliation(s)
- Isabel Moreno-Indias
- Instituto de Investigación Biomédica de Málaga (IBIMA), Unidad de Gestión Clìnica de Endocrinologìa y Nutrición, Hospital Clìnico Universitario Virgen de la Victoria, Universidad de Málaga, Málaga, Spain
- Centro de Investigación Biomeìdica en Red de Fisiopatologtìa de la Obesidad y la Nutrición (CIBEROBN), Instituto de Salud Carlos III, Madrid, Spain
| | - Leo Lahti
- Department of Computing, University of Turku, Turku, Finland
| | - Miroslava Nedyalkova
- Human Genetics and Disease Mechanisms, Latvian Biomedical Research and Study Centre, Riga, Latvia
| | - Ilze Elbere
- Latvian Biomedical Research and Study Centre, Riga, Latvia
| | | | - Muhamed Adilovic
- Department of Genetics and Bioengineering, International University of Sarajevo, Sarajevo, Bosnia and Herzegovina
| | - Onder Aydemir
- Department of Electrical and Electronics Engineering, Karadeniz Technical University, Trabzon, Turkey
| | - Burcu Bakir-Gungor
- Department of Computer Engineering, Abdullah Gul University, Kayseri, Turkey
| | | | - Domenica D’Elia
- Department for Biomedical Sciences, Institute for Biomedical Technologies, National Research Council, Bari, Italy
| | - Mahesh S. Desai
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg
- Odense Research Center for Anaphylaxis, Department of Dermatology and Allergy Center, Odense University Hospital, University of Southern Denmark, Odense, Denmark
| | - Laurent Falquet
- Department of Biology, University of Fribourg, Fribourg, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Aycan Gundogdu
- Department of Microbiology and Clinical Microbiology, Faculty of Medicine, Erciyes University, Kayseri, Turkey
- Metagenomics Laboratory, Genome and Stem Cell Center (GenKök), Erciyes University, Kayseri, Turkey
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czechia
| | | | - Marta B. Lopes
- NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT, UNL, Caparica, Portugal
- Centro de Matemática e Aplicações (CMA), FCT, UNL, Caparica, Portugal
| | - Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
| | - Cláudia Marques
- CINTESIS, NOVA Medical School, NMS, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Michael Mason
- Computational Oncology, Sage Bionetworks, Seattle, WA, United States
| | - Patrick May
- Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Lejla Pašić
- Sarajevo Medical School, University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
| | - Gianvito Pio
- Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
| | - Sándor Pongor
- Faculty of Information Tehnology and Bionics, Pázmány University, Budapest, Hungary
| | - Vasilis J. Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | - Piotr Przymus
- Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruñ, Poland
| | - Julio Saez-Rodriguez
- Institute of Computational Biomedicine, Heidelberg University, Faculty of Medicine and Heidelberg University Hospital, Heidelberg, Germany
| | - Alexia Sampri
- Division of Informatics, Imaging and Data Sciences, School of Health Sciences, University of Manchester, Manchester, United Kingdom
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Blaz Stres
- Jozef Stefan Institute, Ljubljana, Slovenia
- Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
- Faculty of Civil and Geodetic Engineering, University of Ljubljana, Ljubljana, Slovenia
| | - Ramona Suharoschi
- Molecular Nutrition and Proteomics Lab, Faculty of the Food Science and Technology, Institute of Life Sciences, University of Agricultural Sciences and Veterinary Medicine of Cluj-Napoca, Cluj-Napoca, Romania
| | - Jaak Truu
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Ciprian-Octavian Truică
- Department of Computer Science and Engineering, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Baiba Vilne
- Bioinformatics Research Unit, Riga Stradins University, Riga, Latvia
| | - Dimitrios Vlachakis
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Ercument Yilmaz
- Department of Computer Technologies, Karadeniz Technical University, Trabzon, Turkey
| | - Georg Zeller
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
| | - Aldert L. Zomer
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, Utrecht, Netherlands
| | - David Gómez-Cabrero
- Navarrabiomed, Complejo Hospitalario de Navarra (CHN), IdiSNA, Universidad Pública de Navarra (UPNA), Pamplona, Spain
| | - Marcus J. Claesson
- School of Microbiology and APC Microbiome Ireland, University College Cork, Cork, Ireland
| |
Collapse
|
22
|
Ghosh A, Firdous S, Saha S. Bioinformatics for Human Microbiome. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
23
|
Sala C, Mordhorst H, Grützke J, Brinkmann A, Petersen TN, Poulsen C, Cotter PD, Crispie F, Ellis RJ, Castellani G, Amid C, Hakhverdyan M, Guyader SL, Manfreda G, Mossong J, Nitsche A, Ragimbeau C, Schaeffer J, Schlundt J, Tay MYF, Aarestrup FM, Hendriksen RS, Pamp SJ, De Cesare A. Metagenomics-Based Proficiency Test of Smoked Salmon Spiked with a Mock Community. Microorganisms 2020; 8:microorganisms8121861. [PMID: 33255715 PMCID: PMC7760972 DOI: 10.3390/microorganisms8121861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 11/17/2020] [Accepted: 11/23/2020] [Indexed: 12/13/2022] Open
Abstract
An inter-laboratory proficiency test was organized to assess the ability of participants to perform shotgun metagenomic sequencing of cold smoked salmon, experimentally spiked with a mock community composed of six bacteria, one parasite, one yeast, one DNA, and two RNA viruses. Each participant applied its in-house wet-lab workflow(s) to obtain the metagenomic dataset(s), which were then collected and analyzed using MG-RAST. A total of 27 datasets were analyzed. Sample pre-processing, DNA extraction protocol, library preparation kit, and sequencing platform, influenced the abundance of specific microorganisms of the mock community. Our results highlight that despite differences in wet-lab protocols, the reads corresponding to the mock community members spiked in the cold smoked salmon, were both detected and quantified in terms of relative abundance, in the metagenomic datasets, proving the suitability of shotgun metagenomic sequencing as a genomic tool to detect microorganisms belonging to different domains in the same food matrix. The implementation of standardized wet-lab protocols would highly facilitate the comparability of shotgun metagenomic sequencing dataset across laboratories and sectors. Moreover, there is a need for clearly defining a sequencing reads threshold, to consider pathogens as detected or undetected in a food sample.
Collapse
Affiliation(s)
- Claudia Sala
- Department of Physics and Astronomy, University of Bologna, 40127 Bologna, Italy;
| | - Hanne Mordhorst
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Josephine Grützke
- German Federal Institute for Risk Assessment, Department of Biological Safety, 12277 Berlin, Germany;
| | - Annika Brinkmann
- Highly Pathogenic Viruses, ZBS 1, Centre for Biological Threats and Special Pathogens, Robert Koch Institute, 13353 Berlin, Germany; (A.B.); (A.N.)
| | - Thomas N. Petersen
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Casper Poulsen
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Paul D. Cotter
- Teagasc Food Research Centre, Moorepark, APC Microbiome Ireland and Vistamilk, T12 YN60 Co. Cork, Ireland; (P.D.C.); (F.C.)
| | - Fiona Crispie
- Teagasc Food Research Centre, Moorepark, APC Microbiome Ireland and Vistamilk, T12 YN60 Co. Cork, Ireland; (P.D.C.); (F.C.)
| | - Richard J. Ellis
- Surveillance and Laboratory Services Department, Animal and Plant Health Agency, APHA Weybridge, Addlestone, Surrey, KT15 3NB, UK;
| | - Gastone Castellani
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, 40127 Bologna, Italy;
| | - Clara Amid
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK;
| | | | - Soizick Le Guyader
- Laboratoire de Microbiologie, CEDEX 03, 44311 Nantes, France; (S.L.G.); (J.S.)
| | - Gerardo Manfreda
- Department of Agricultural and Food Sciences, University of Bologna, 40064 Ozzano dell’Emilia, Italy;
| | - Joël Mossong
- Epidemiology and Microbial Genomics, Laboratoire National de Santé, L-3555 Dudelange, Luxembourg; (J.M.); (C.R.)
| | - Andreas Nitsche
- Highly Pathogenic Viruses, ZBS 1, Centre for Biological Threats and Special Pathogens, Robert Koch Institute, 13353 Berlin, Germany; (A.B.); (A.N.)
| | - Catherine Ragimbeau
- Epidemiology and Microbial Genomics, Laboratoire National de Santé, L-3555 Dudelange, Luxembourg; (J.M.); (C.R.)
| | - Julien Schaeffer
- Laboratoire de Microbiologie, CEDEX 03, 44311 Nantes, France; (S.L.G.); (J.S.)
| | - Joergen Schlundt
- Nanyang Technological University Food Technology Centre (NAFTEC), Nanyang Technological University (NTU), 62 Nanyang Dr, Singapore 637459, Singapore; (J.S.); (M.Y.F.T.)
| | - Moon Y. F. Tay
- Nanyang Technological University Food Technology Centre (NAFTEC), Nanyang Technological University (NTU), 62 Nanyang Dr, Singapore 637459, Singapore; (J.S.); (M.Y.F.T.)
| | - Frank M. Aarestrup
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Rene S. Hendriksen
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Sünje Johanna Pamp
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Alessandra De Cesare
- Department of Veterinary Medical Sciences, University of Bologna, Via Tolara di Sopra 50, 40064 Ozzano dell’Emilia, Italy
- Correspondence:
| |
Collapse
|
24
|
Mendez R, Kesh K, Arora N, Di Martino L, McAllister F, Merchant N, Banerjee S, Banerjee S. Microbial dysbiosis and polyamine metabolism as predictive markers for early detection of pancreatic cancer. Carcinogenesis 2020; 41:561-570. [PMID: 31369062 DOI: 10.1093/carcin/bgz116] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 05/29/2019] [Accepted: 06/17/2019] [Indexed: 12/15/2022] Open
Abstract
The lack of tools for early detection of pancreatic ductal adenocarcinoma (PDAC) is directly correlated with the abysmal survival rates in patients. In addition to several potential detection tools under active investigation, we tested the gut microbiome and its metabolic complement as one of the earliest detection tools that could be useful in patients at high risk for PDAC. We used a combination of 16s rRNA pyrosequencing and whole-genome sequencing of gut fecal microbiota in a genetically engineered PDAC murine model (KRASG12DTP53R172HPdxCre or KPC). Metabolic reconstruction of microbiome was done using the HUMAnN2 pipeline. Serum polyamine levels were measured from murine and patient samples using chromogenic assay. Our results showed a Proteobacterial and Firmicutes dominance in gut microbiota in early stages of PDAC development. Upon in silico reconstruction of active metabolic pathways within the altered microbial flora, polyamine and nucleotide biosynthetic pathways were significantly elevated. These metabolic products are known to be actively assimilated by the host and eventually utilized by rapidly dividing cells for proliferation validating their importance in the context of tumorigenesis. In KPC mice, as well as PDAC patients, we show significantly elevated serum polyamine concentrations. Therefore, at the early stages of tumorigenesis, there is a strong correlation between microbial changes and release of metabolites that foster host tumorigenesis, thereby fulfilling the 'vicious cycle hypothesis' of the role of microbiome in health and disease states. Our results provide a potential, precise, noninvasive tool for early detection of PDAC, which may result in improved outcomes.
Collapse
Affiliation(s)
- Roberto Mendez
- Department of Surgery, University of Miami, Miami, FL, USA
| | - Kousik Kesh
- Department of Surgery, University of Miami, Miami, FL, USA
| | - Nivedita Arora
- Department of Surgery, University of Minnesota, Minneapolis, MN, USA
| | - Leá Di Martino
- Department of Surgery, University of Miami, Miami, FL, USA.,Université Grenoble Alpes, Isère, France
| | - Florencia McAllister
- Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Miami, FL, USA
| | - Nipun Merchant
- Department of Surgery, University of Miami, Miami, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Sulagna Banerjee
- Department of Surgery, University of Miami, Miami, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Santanu Banerjee
- Department of Surgery, University of Miami, Miami, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| |
Collapse
|
25
|
Benavides A, Sanchez F, Alzate JF, Cabarcas F. DATMA: Distributed AuTomatic Metagenomic Assembly and annotation framework. PeerJ 2020; 8:e9762. [PMID: 32953263 PMCID: PMC7474881 DOI: 10.7717/peerj.9762] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Accepted: 07/28/2020] [Indexed: 11/20/2022] Open
Abstract
Background A prime objective in metagenomics is to classify DNA sequence fragments into taxonomic units. It usually requires several stages: read’s quality control, de novo assembly, contig annotation, gene prediction, etc. These stages need very efficient programs because of the number of reads from the projects. Furthermore, the complexity of metagenomes requires efficient and automatic tools that orchestrate the different stages. Method DATMA is a pipeline for fast metagenomic analysis that orchestrates the following: sequencing quality control, 16S rRNA-identification, reads binning, de novo assembly and evaluation, gene prediction, and taxonomic annotation. Its distributed computing model can use multiple computing resources to reduce the analysis time. Results We used a controlled experiment to show DATMA functionality. Two pre-annotated metagenomes to compare its accuracy and speed against other metagenomic frameworks. Then, with DATMA we recovered a draft genome of a novel Anaerolineaceae from a biosolid metagenome. Conclusions DATMA is a bioinformatics tool that automatically analyzes complex metagenomes. It is faster than similar tools and, in some cases, it can extract genomes that the other tools do not. DATMA is freely available at https://github.com/andvides/DATMA.
Collapse
Affiliation(s)
- Andres Benavides
- Grupo GICEI, Facultad de Ingeniería Electrónica, Institución Universitaria Pascual Bravo, Medellín, Antioquia, Colombia
- Grupo SISTEMIC, Ingeniería Electrónica, Facultad de Ingeniería, Universidad de Antioquia UdeA, Medellín, Colombia
| | - Friman Sanchez
- Barcelona Supercomputing Center, currently at Smart Variable S.L., Barcelona, Spain
| | - Juan F. Alzate
- Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia UdeA, Medellín, Colombia
- Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad de Antioquia UdeA, Medellín, Colombia
| | - Felipe Cabarcas
- Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia UdeA, Medellín, Colombia
- Grupo SISTEMIC, Ingeniería Electrónica, Facultad de Ingeniería, Universidad de Antioquia UdeA, Medellín, Colombia
| |
Collapse
|
26
|
Pérez-Cobas AE, Gomez-Valero L, Buchrieser C. Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses. Microb Genom 2020; 6:mgen000409. [PMID: 32706331 PMCID: PMC7641418 DOI: 10.1099/mgen.0.000409] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 06/30/2020] [Indexed: 12/23/2022] Open
Abstract
Metagenomics and marker gene approaches, coupled with high-throughput sequencing technologies, have revolutionized the field of microbial ecology. Metagenomics is a culture-independent method that allows the identification and characterization of organisms from all kinds of samples. Whole-genome shotgun sequencing analyses the total DNA of a chosen sample to determine the presence of micro-organisms from all domains of life and their genomic content. Importantly, the whole-genome shotgun sequencing approach reveals the genomic diversity present, but can also give insights into the functional potential of the micro-organisms identified. The marker gene approach is based on the sequencing of a specific gene region. It allows one to describe the microbial composition based on the taxonomic groups present in the sample. It is frequently used to analyse the biodiversity of microbial ecosystems. Despite its importance, the analysis of metagenomic sequencing and marker gene data is quite a challenge. Here we review the primary workflows and software used for both approaches and discuss the current challenges in the field.
Collapse
Affiliation(s)
- Ana Elena Pérez-Cobas
- Institut Pasteur, Biologie des Bactéries Intracellulaires, Paris, France and CNRS UMR 3525, 675724, Paris, France
| | - Laura Gomez-Valero
- Institut Pasteur, Biologie des Bactéries Intracellulaires, Paris, France and CNRS UMR 3525, 675724, Paris, France
| | - Carmen Buchrieser
- Institut Pasteur, Biologie des Bactéries Intracellulaires, Paris, France and CNRS UMR 3525, 675724, Paris, France
| |
Collapse
|
27
|
Dovrolis N, Kolios G, Spyrou GM, Maroulakou I. Computational profiling of the gut-brain axis: microflora dysbiosis insights to neurological disorders. Brief Bioinform 2020; 20:825-841. [PMID: 29186317 DOI: 10.1093/bib/bbx154] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 10/17/2017] [Indexed: 12/14/2022] Open
Abstract
Almost 2500 years after Hippocrates' observations on health and its direct association to the gastrointestinal tract, a paradigm shift has recently occurred, making the gut and its symbionts (bacteria, fungi, archaea and viruses) a point of convergence for studies. It is nowadays well established that the gut microflora's compositional diversity regulates via its genes (the microbiome) the host's health and provides preliminary insights into disease progression and regulation. The microbiome's involvement is evident in immunological and physiological studies that link changes in its biodiversity to its contributions to the host's phenotype but also in neurological investigations, substantiating the aptly named gut-brain axis. The definitive mechanisms of this last bidirectional interaction will be our main focus because it presents researchers with a new conundrum. In this review, we prospect current literature for computational analysis methodologies that accommodate the need for better understanding of the microbiome-gut-brain interactions and neurological disorder onset and progression, through cross-disciplinary systems biology applications. We will present bioinformatics tools used in exploring these synergies that help build and interpret microbial 16S ribosomal RNA data sets, produced by shotgun and high-throughput sequencing of healthy and neurological disorder samples stored in biological databases. These approaches provide alternative means for researchers to form hypotheses to their inquests faster, cheaper and swith precision. The goal of these studies relies on the integration of combined metagenomics and metabolomics assessments. An accurate characterization of the microbiome and its functionality can support new diagnostic, prognostic and therapeutic strategies for neurological disorders, customized for each individual host.
Collapse
|
28
|
Blanco-Míguez A, Fdez-Riverola F, Sánchez B, Lourenço A. Resources and tools for the high-throughput, multi-omic study of intestinal microbiota. Brief Bioinform 2020; 20:1032-1056. [PMID: 29186315 DOI: 10.1093/bib/bbx156] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Revised: 10/23/2017] [Indexed: 12/18/2022] Open
Abstract
The human gut microbiome impacts several aspects of human health and disease, including digestion, drug metabolism and the propensity to develop various inflammatory, autoimmune and metabolic diseases. Many of the molecular processes that play a role in the activity and dynamics of the microbiota go beyond species and genic composition and thus, their understanding requires advanced bioinformatics support. This article aims to provide an up-to-date view of the resources and software tools that are being developed and used in human gut microbiome research, in particular data integration and systems-level analysis efforts. These efforts demonstrate the power of standardized and reproducible computational workflows for integrating and analysing varied omics data and gaining deeper insights into microbe community structure and function as well as host-microbe interactions.
Collapse
Affiliation(s)
| | | | | | - Anália Lourenço
- Dpto. de Informática - Universidade de Vigo, ESEI - Escuela Superior de Ingeniería Informática, Edificio politécnico, Campus Universitario As Lagoas s/n, 32004 Ourense, Spain
| |
Collapse
|
29
|
Kesh K, Mendez R, Abdelrahman L, Banerjee S, Banerjee S. Type 2 diabetes induced microbiome dysbiosis is associated with therapy resistance in pancreatic adenocarcinoma. Microb Cell Fact 2020; 19:75. [PMID: 32204699 PMCID: PMC7092523 DOI: 10.1186/s12934-020-01330-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Accepted: 03/12/2020] [Indexed: 12/14/2022] Open
Abstract
Resistance to therapy is one of the major factors that contribute to dismal survival statistics in pancreatic cancer. While there are many tumor intrinsic and tumor microenvironment driven factors that contribute to therapy resistance, whether pre-existing metabolic diseases like type 2 diabetes (T2D) contribute to this has remained understudied. It is well accepted that hyperglycemia associated with type 2 diabetes changes the gut microbiome. Further, hyperglycemia also enriches for a "stem-like" population within the tumor. In the current study, we observed that in a T2D mouse model, the microbiome changed significantly as the hyperglycemia developed in these animals. Our results further showed that, tumors implanted in the T2D mice responded poorly to gemcitabine/paclitaxel (Gem/Pac) standard of care compared to those in the control group. A metabolomic reconstruction of the WGS of the gut microbiota further revealed that an enrichment of bacterial population involved in drug metabolism in the T2D group. Additionally, we also observed an increase in the CD133+ tumor cells population in the T2D model. These observations indicated that in an animal model for T2D, microbial dysbiosis is associated with increased resistance to chemotherapeutic compounds.
Collapse
Affiliation(s)
- Kousik Kesh
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, Miami, FL, USA
| | - Roberto Mendez
- Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, Miami, FL, USA
- Miami Integrative Metabolomics Research Center, University of Miami, Miami, FL, USA
| | - Leila Abdelrahman
- Miami Integrative Metabolomics Research Center, University of Miami, Miami, FL, USA
| | - Santanu Banerjee
- Sylvester Comprehensive Cancer Center, Miami, FL, USA.
- Miami Integrative Metabolomics Research Center, University of Miami, Miami, FL, USA.
- Department of Surgery, Miller School of Medicine, University of Miami, Biomedical Research Building Suite 516, 1501, NW 10th Ave, Miami, FL, 33156, USA.
| | - Sulagna Banerjee
- Sylvester Comprehensive Cancer Center, Miami, FL, USA.
- Department of Surgery, Miller School of Medicine, University of Miami, Biomedical Research Building, Suite 508, 1501, NW 10th Ave, Miami, FL, 33156, USA.
| |
Collapse
|
30
|
Abstract
Colorectal cancer (CRC) is a leading cause of cancer-related deaths in both the USA and the world. Recent research has demonstrated the involvement of the gut microbiota in CRC development and progression. Microbial biomarkers of disease have focused primarily on the bacterial component of the microbiome; however, the viral portion of the microbiome, consisting of both bacteriophages and eukaryotic viruses, together known as the virome, has been lesser studied. Here we review the recent advancements in high-throughput sequencing (HTS) technologies and bioinformatics, which have enabled scientists to better understand how viruses might influence the development of colorectal cancer. We discuss the contemporary findings revealing modulations in the virome and their correlation with CRC development and progression. While a variety of challenges still face viral HTS detection in clinical specimens, we consider herein numerous next steps for future basic and clinical research. Clinicians need to move away from a single infectious agent model for disease etiology by grasping new, more encompassing etiological paradigms, in which communities of various microbial components interact with each other and the host. The reporting and indexing of patient health information, socioeconomic data, and other relevant metadata will enable identification of predictive variables and covariates of viral presence and CRC development. Altogether, the virome has a more profound role in carcinogenesis and cancer progression than once thought, and viruses, specific for either human cells or bacteria, are clinically relevant in understanding CRC pathology, patient prognosis, and treatment development.
Collapse
|
31
|
Jo J, Oh J, Park C. Microbial community analysis using high-throughput sequencing technology: a beginner's guide for microbiologists. J Microbiol 2020; 58:176-192. [PMID: 32108314 DOI: 10.1007/s12275-020-9525-5] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 12/11/2019] [Accepted: 12/16/2019] [Indexed: 12/19/2022]
Abstract
Microbial communities present in diverse environments from deep seas to human body niches play significant roles in the complex ecosystem and human health. Characterizing their structural and functional diversities is indispensable, and many approaches, such as microscopic observation, DNA fingerprinting, and PCR-based marker gene analysis, have been successfully applied to identify microorganisms. Since the revolutionary improvement of DNA sequencing technologies, direct and high-throughput analysis of genomic DNA from a whole environmental community without prior cultivation has become the mainstream approach, overcoming the constraints of the classical approaches. Here, we first briefly review the history of environmental DNA analysis applications with a focus on profiling the taxonomic composition and functional potentials of microbial communities. To this end, we aim to introduce the shotgun metagenomic sequencing (SMS) approach, which is used for the untargeted ("shotgun") sequencing of all ("meta") microbial genomes ("genomic") present in a sample. SMS data analyses are performed in silico using various software programs; however, in silico analysis is typically regarded as a burden on wet-lab experimental microbiologists. Therefore, in this review, we present microbiologists who are unfamiliar with in silico analyses with a basic and practical SMS data analysis protocol. This protocol covers all the bioinformatics processes of the SMS analysis in terms of data preprocessing, taxonomic profiling, functional annotation, and visualization.
Collapse
Affiliation(s)
- Jihoon Jo
- School of Biological Sciences and Technology, Chonnam National University, Gwangju, 61186, Republic of Korea
| | - Jooseong Oh
- School of Biological Sciences and Technology, Chonnam National University, Gwangju, 61186, Republic of Korea
| | - Chungoo Park
- School of Biological Sciences and Technology, Chonnam National University, Gwangju, 61186, Republic of Korea.
| |
Collapse
|
32
|
Chen J, Zhao Y, Sun Y. De novo haplotype reconstruction in viral quasispecies using paired-end read guided path finding. Bioinformatics 2019; 34:2927-2935. [PMID: 29617936 DOI: 10.1093/bioinformatics/bty202] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 04/02/2018] [Indexed: 12/29/2022] Open
Abstract
Motivation RNA virus populations contain different but genetically related strains, all infecting an individual host. Reconstruction of the viral haplotypes is a fundamental step to characterize the virus population, predict their viral phenotypes and finally provide important information for clinical treatment and prevention. Advances of the next-generation sequencing technologies open up new opportunities to assemble full-length haplotypes. However, error-prone short reads, high similarities between related strains, an unknown number of haplotypes pose computational challenges for reference-free haplotype reconstruction. There is still much room to improve the performance of existing haplotype assembly tools. Results In this work, we developed a de novo haplotype reconstruction tool named PEHaplo, which employs paired-end reads to distinguish highly similar strains for viral quasispecies data. It was applied on both simulated and real quasispecies data, and the results were benchmarked against several recently published de novo haplotype reconstruction tools. The comparison shows that PEHaplo outperforms the benchmarked tools in a comprehensive set of metrics. Availability and implementation The source code and the documentation of PEHaplo are available at https://github.com/chjiao/PEHaplo. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jiao Chen
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Yingchao Zhao
- School of Computing and Information Sciences, Caritas Institute of Higher Education, Hong Kong, China
| | - Yanni Sun
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
33
|
Rajan SK, Lindqvist M, Brummer RJ, Schoultz I, Repsilber D. Phylogenetic microbiota profiling in fecal samples depends on combination of sequencing depth and choice of NGS analysis method. PLoS One 2019; 14:e0222171. [PMID: 31527871 PMCID: PMC6748435 DOI: 10.1371/journal.pone.0222171] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Accepted: 08/22/2019] [Indexed: 12/12/2022] Open
Abstract
The human gut microbiota is well established as an important factor in health and disease. Fecal sample microbiota are often analyzed as a proxy for gut microbiota, and characterized with respect to their composition profiles. Modern approaches employ whole genome shotgun next-generation sequencing as the basis for these analyses. Sequencing depth as well as choice of next-generation sequencing data analysis method constitute two main interacting methodological factors for such an approach. In this study, we used 200 million sequence read pairs from one fecal sample for comparing different taxonomy classification methods, using default and custom-made reference databases, at different sequencing depths. A mock community data set with known composition was used for validating the classification methods. Results suggest that sequencing beyond 60 million read pairs does not seem to improve classification. The phylogeny prediction pattern, when using the default databases and the consensus database, appeared to be similar for all three methods. Moreover, these methods predicted rather different species. We conclude that the choice of sequencing depth and classification method has important implications for taxonomy composition prediction. A multi-method-consensus approach for robust gut microbiota NGS analysis is recommended.
Collapse
Affiliation(s)
| | | | | | - Ida Schoultz
- School of Medical Sciences, Örebro University, Örebro, Sweden
| | - Dirk Repsilber
- School of Medical Sciences, Örebro University, Örebro, Sweden
- * E-mail:
| |
Collapse
|
34
|
Breitwieser FP, Lu J, Salzberg SL. A review of methods and databases for metagenomic classification and assembly. Brief Bioinform 2019; 20:1125-1136. [PMID: 29028872 PMCID: PMC6781581 DOI: 10.1093/bib/bbx120] [Citation(s) in RCA: 281] [Impact Index Per Article: 46.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 08/22/2017] [Indexed: 12/13/2022] Open
Abstract
Microbiome research has grown rapidly over the past decade, with a proliferation of new methods that seek to make sense of large, complex data sets. Here, we survey two of the primary types of methods for analyzing microbiome data: read classification and metagenomic assembly, and we review some of the challenges facing these methods. All of the methods rely on public genome databases, and we also discuss the content of these databases and how their quality has a direct impact on our ability to interpret a microbiome sample.
Collapse
Affiliation(s)
| | | | - Steven L Salzberg
- Corresponding author: Steven L. Salzberg, Center for Computational Biology, Johns Hopkins University, 1900 E. Monument St., Baltimore, MD, 21205, USA. E-mail:
| |
Collapse
|
35
|
Abstract
Microbiome research has grown rapidly over the past decade, with a proliferation of new methods that seek to make sense of large, complex data sets. Here, we survey two of the primary types of methods for analyzing microbiome data: read classification and metagenomic assembly, and we review some of the challenges facing these methods. All of the methods rely on public genome databases, and we also discuss the content of these databases and how their quality has a direct impact on our ability to interpret a microbiome sample.
Collapse
|
36
|
Coelho LP, Alves R, Monteiro P, Huerta-Cepas J, Freitas AT, Bork P. NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language. MICROBIOME 2019; 7:84. [PMID: 31159881 PMCID: PMC6547473 DOI: 10.1186/s40168-019-0684-8] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 04/10/2019] [Indexed: 05/10/2023]
Abstract
BACKGROUND Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data. Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline. RESULTS We present NGLess and NG-meta-profiler. NGLess is a domain specific language for describing next-generation sequence processing pipelines. It was developed with the goal of enabling user-friendly computational reproducibility. It provides built-in support for many common operations on sequencing data and is extensible with external tools with configuration files. Using this framework, we developed NG-meta-profiler, a fast profiler for metagenomes which performs sequence preprocessing, mapping to bundled databases, filtering of the mapping results, and profiling (taxonomic and functional). It is significantly faster than either MOCAT2 or htseq-count and (as it builds on NGLess) its results are perfectly reproducible. CONCLUSIONS NG-meta-profiler is a high-performance solution for metagenomics processing built on NGLess. It can be used as-is to execute standard analyses or serve as the starting point for customization in a perfectly reproducible fashion. NGLess and NG-meta-profiler are open source software (under the liberal MIT license) and can be downloaded from https://ngless.embl.de or installed through bioconda.
Collapse
Affiliation(s)
- Luis Pedro Coelho
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai, China
| | - Renato Alves
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
| | - Paulo Monteiro
- INESC-ID, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
| | - Jaime Huerta-Cepas
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Madrid, Spain
| | - Ana Teresa Freitas
- INESC-ID, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
| | - Peer Bork
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Max Delbrück Centre for Molecular Medicine, Berlin, Germany
- Molecular Medicine Partnership Unit, University of Heidelberg and European Molecular Biology Laboratory, Heidelberg, Germany
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
| |
Collapse
|
37
|
Jagadeesan B, Gerner-Smidt P, Allard MW, Leuillet S, Winkler A, Xiao Y, Chaffron S, Van Der Vossen J, Tang S, Katase M, McClure P, Kimura B, Ching Chai L, Chapman J, Grant K. The use of next generation sequencing for improving food safety: Translation into practice. Food Microbiol 2019; 79:96-115. [PMID: 30621881 PMCID: PMC6492263 DOI: 10.1016/j.fm.2018.11.005] [Citation(s) in RCA: 160] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 10/27/2018] [Accepted: 11/13/2018] [Indexed: 01/06/2023]
Abstract
Next Generation Sequencing (NGS) combined with powerful bioinformatic approaches are revolutionising food microbiology. Whole genome sequencing (WGS) of single isolates allows the most detailed comparison possible hitherto of individual strains. The two principle approaches for strain discrimination, single nucleotide polymorphism (SNP) analysis and genomic multi-locus sequence typing (MLST) are showing concordant results for phylogenetic clustering and are complementary to each other. Metabarcoding and metagenomics, applied to total DNA isolated from either food materials or the production environment, allows the identification of complete microbial populations. Metagenomics identifies the entire gene content and when coupled to transcriptomics or proteomics, allows the identification of functional capacity and biochemical activity of microbial populations. The focus of this review is on the recent use and future potential of NGS in food microbiology and on current challenges. Guidance is provided for new users, such as public health departments and the food industry, on the implementation of NGS and how to critically interpret results and place them in a broader context. The review aims to promote the broader application of NGS technologies within the food industry as well as highlight knowledge gaps and novel applications of NGS with the aim of driving future research and increasing food safety outputs from its wider use.
Collapse
Affiliation(s)
- Balamurugan Jagadeesan
- Nestlé Research, Nestec Ltd, Route du Jorat 57, Vers-chez-les-Blanc, CH-1000, Lausanne 26, Switzerland.
| | - Peter Gerner-Smidt
- Centers for Disease Control and Prevention, MS-CO-3, 1600 Clifton Road, 30329-4027, Atlanta, USA
| | - Marc W Allard
- US Food and Drug Administration, 5001 Campus Drive, College Park, MD, 02740, USA
| | - Sébastien Leuillet
- Institut Mérieux, Mérieux NutriSciences, 3 route de la Chatterie, 44800, Saint Herblain, France
| | - Anett Winkler
- Cargill Deutschland GmbH, Cerestarstr. 2, 47809, Krefeld, Germany
| | - Yinghua Xiao
- Arla Innovation Center, Agro Food Park 19, 8200, Aarhus, Denmark
| | - Samuel Chaffron
- Laboratoire des Sciences du Numérique de Nantes (LS2N), CNRS UMR 6004 - Université de Nantes, 2 rue de la Houssinière, 44322, Nantes, France
| | - Jos Van Der Vossen
- The Netherlands Organisation for Applied Scientific Research, TNO, Utrechtseweg 48, 3704 HE, Zeist, NL, the Netherlands
| | - Silin Tang
- Mars Global Food Safety Center, Yanqi Economic Development Zone, 101407, Beijing, China
| | - Mitsuru Katase
- Fuji Oil Co., Ltd., Sumiyoshi-cho 1, Izumisano Osaka, 598-8540, Japan
| | - Peter McClure
- Mondelēz International, Linden 3, Bournville Lane, B30 2LU, Birmingham, United Kingdom
| | - Bon Kimura
- Tokyo University of Marine Science & Technology, Konan 4-5-7, Minato-ku, Tokyo, 108-8477, Japan
| | - Lay Ching Chai
- Institute of Biological Sciences, Faculty of Science, University of Malaya, 50603, Kuala Lumpur, Malaysia
| | - John Chapman
- Unilever Research & Development, Postbus, 114, 3130 AC, Vlaardingen, the Netherlands
| | - Kathie Grant
- Gastrointestinal Bacteria Reference Unit, National Infection Service, Public Health England, 61 Colindale Avenue, London, NW9 5EQ, United Kingdom.
| |
Collapse
|
38
|
Koutsandreas T, Ladoukakis E, Pilalis E, Zarafeta D, Kolisis FN, Skretas G, Chatziioannou AA. ANASTASIA: An Automated Metagenomic Analysis Pipeline for Novel Enzyme Discovery Exploiting Next Generation Sequencing Data. Front Genet 2019; 10:469. [PMID: 31178894 PMCID: PMC6543708 DOI: 10.3389/fgene.2019.00469] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 05/01/2019] [Indexed: 01/27/2023] Open
Abstract
Metagenomic analysis of environmental samples provides deep insight into the enzymatic mixture of the corresponding niches, capable of revealing peptide sequences with novel functional properties exploiting the high performance of next-generation sequencing (NGS) technologies. At the same time due to their ever increasing complexity, there is a compelling need for ever larger computational configurations to ensure proper bioinformatic analysis, and fine annotation. With the aiming to address the challenges of such an endeavor, we have developed a novel web-based application named ANASTASIA (automated nucleotide aminoacid sequences translational plAtform for systemic interpretation and analysis). ANASTASIA provides a rich environment of bioinformatic tools, either publicly available or novel, proprietary algorithms, integrated within numerous automated algorithmic workflows, and which enables versatile data processing tasks for (meta)genomic sequence datasets. ANASTASIA was initially developed in the framework of the European FP7 project HotZyme, whose aim was to perform exhaustive analysis of metagenomes derived from thermal springs around the globe and to discover new enzymes of industrial interest. ANASTASIA has evolved to become a stable and extensible environment for diversified, metagenomic, functional analyses for a range of applications overarching industrial biotechnology to biomedicine, within the frames of the ELIXIR-GR project. As a showcase, we report the successful in silico mining of a novel thermostable esterase termed “EstDZ4” from a metagenomic sample collected from a hot spring located in Krisuvik, Iceland.
Collapse
Affiliation(s)
- Theodoros Koutsandreas
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,e-NIOS Applications PC, Athens, Greece
| | - Efthymios Ladoukakis
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,Laboratory of Biotechnology, School of Chemical Engineering, National Technical University of Athens, Athens, Greece
| | - Eleftherios Pilalis
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,e-NIOS Applications PC, Athens, Greece
| | - Dimitra Zarafeta
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | - Fragiskos N Kolisis
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,Laboratory of Biotechnology, School of Chemical Engineering, National Technical University of Athens, Athens, Greece
| | - Georgios Skretas
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | - Aristotelis A Chatziioannou
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,e-NIOS Applications PC, Athens, Greece
| |
Collapse
|
39
|
Abstract
Stable isotope probing (SIP) provides researchers a culture-independent method to retrieve nucleic acids from active microbial populations performing a specific metabolic activity in complex ecosystems. In recent years, the use of the SIP method in microbial ecology studies has been accelerated. This is partly due to the advances in sequencing and bioinformatics tools, which enable fast and reliable analysis of DNA and RNA from the SIP experiments. One of these sequencing tools, metagenomics, has contributed significantly to the body of knowledge by providing data not only on taxonomy but also on the key functional genes in specific metabolic pathways and their relative abundances. In this chapter, we provide a general background on the application of the SIP-metagenomics approach in microbial ecology and a workflow for the analysis of metagenomic datasets using the most up-to-date bioinformatics tools.
Collapse
Affiliation(s)
- Eileen Kröber
- Microbial Biogeochemistry, RA Landscape Functioning, ZALF Leibniz Centre for Landscape Research, Müncheberg, Germany
| | - Özge Eyice
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK.
| |
Collapse
|
40
|
Benavides A, Isaza JP, Niño-García JP, Alzate JF, Cabarcas F. CLAME: a new alignment-based binning algorithm allows the genomic description of a novel Xanthomonadaceae from the Colombian Andes. BMC Genomics 2018; 19:858. [PMID: 30537931 PMCID: PMC6288851 DOI: 10.1186/s12864-018-5191-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Hot spring bacteria have unique biological adaptations to survive the extreme conditions of these environments; these bacteria produce thermostable enzymes that can be used in biotechnological and industrial applications. However, sequencing these bacteria is complex, since it is not possible to culture them. As an alternative, genome shotgun sequencing of whole microbial communities can be used. The problem is that the classification of sequences within a metagenomic dataset is very challenging particularly when they include unknown microorganisms since they lack genomic reference. We failed to recover a bacterium genome from a hot spring metagenome using the available software tools, so we develop a new tool that allowed us to recover most of this genome. Results We present a proteobacteria draft genome reconstructed from a Colombian’s Andes hot spring metagenome. The genome seems to be from a new lineage within the family Rhodanobacteraceae of the class Gammaproteobacteria, closely related to the genus Dokdonella. We were able to generate this genome thanks to CLAME. CLAME, from Spanish “CLAsificador MEtagenomico”, is a tool to group reads in bins. We show that most reads from each bin belong to a single chromosome. CLAME is very effective recovering most of the reads belonging to the predominant species within a metagenome. Conclusions We developed a tool that can be used to extract genomes (or parts of them) from a complex metagenome. Electronic supplementary material The online version of this article (10.1186/s12864-018-5191-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andres Benavides
- Grupo SISTEMIC, Ingeniería Electrónica, Facultad de Ingeniería, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia.
| | - Juan Pablo Isaza
- Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia.,Grupo de Parasitología, Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia
| | - Juan Pablo Niño-García
- Escuela de Microbiología, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia
| | - Juan Fernando Alzate
- Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia.,Grupo de Parasitología, Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia
| | - Felipe Cabarcas
- Grupo SISTEMIC, Ingeniería Electrónica, Facultad de Ingeniería, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia.,Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia UdeA, Calle 70 No, 52-21, Medellín, Colombia
| |
Collapse
|
41
|
Almeida OGG, De Martinis ECP. Bioinformatics tools to assess metagenomic data for applied microbiology. Appl Microbiol Biotechnol 2018; 103:69-82. [DOI: 10.1007/s00253-018-9464-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 10/15/2018] [Accepted: 10/16/2018] [Indexed: 12/14/2022]
|
42
|
Souvorov A, Agarwala R, Lipman DJ. SKESA: strategic k-mer extension for scrupulous assemblies. Genome Biol 2018; 19:153. [PMID: 30286803 PMCID: PMC6172800 DOI: 10.1186/s13059-018-1540-z] [Citation(s) in RCA: 375] [Impact Index Per Article: 53.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 09/12/2018] [Indexed: 01/20/2023] Open
Abstract
SKESA is a DeBruijn graph-based de-novo assembler designed for assembling reads of microbial genomes sequenced using Illumina. Comparison with SPAdes and MegaHit shows that SKESA produces assemblies that have high sequence quality and contiguity, handles low-level contamination in reads, is fast, and produces an identical assembly for the same input when assembled multiple times with the same or different compute resources. SKESA has been used for assembling over 272,000 read sets in the Sequence Read Archive at NCBI and for real-time pathogen detection. Source code for SKESA is freely available at https://github.com/ncbi/SKESA/releases .
Collapse
Affiliation(s)
| | - Richa Agarwala
- NCBI/NLM/NIH/DHHS, 8600 Rockville Pike, Bethesda, 20894 MD USA
| | - David J. Lipman
- NCBI/NLM/NIH/DHHS, 8600 Rockville Pike, Bethesda, 20894 MD USA
- Impossible Foods, impossiblefoods.com, Redwood City, 94063 CA USA
| |
Collapse
|
43
|
Visconti A, Martin TC, Falchi M. YAMP: a containerized workflow enabling reproducibility in metagenomics research. Gigascience 2018; 7:5039705. [PMID: 29917068 PMCID: PMC6047416 DOI: 10.1093/gigascience/giy072] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2018] [Revised: 05/01/2018] [Accepted: 06/11/2018] [Indexed: 01/12/2023] Open
Abstract
YAMP ("Yet Another Metagenomics Pipeline") is a user-friendly workflow that enables the analysis of whole shotgun metagenomic data while using containerization to ensure computational reproducibility and facilitate collaborative research. YAMP can be executed on any UNIX-like system and offers seamless support for multiple job schedulers as well as for the Amazon AWS cloud. Although YAMP was developed to be ready to use by nonexperts, bioinformaticians will appreciate its flexibility, modularization, and simple customization.
Collapse
Affiliation(s)
- Alessia Visconti
- Department of Twin Research and Genetic Epidemiology, King’s College London, Westminster Bridge Road, SE1 7EH, London, UK
| | - Tiphaine C Martin
- Department of Twin Research and Genetic Epidemiology, King’s College London, Westminster Bridge Road, SE1 7EH, London, UK
| | - Mario Falchi
- Department of Twin Research and Genetic Epidemiology, King’s College London, Westminster Bridge Road, SE1 7EH, London, UK
| |
Collapse
|
44
|
|
45
|
Nooij S, Schmitz D, Vennema H, Kroneman A, Koopmans MPG. Overview of Virus Metagenomic Classification Methods and Their Biological Applications. Front Microbiol 2018; 9:749. [PMID: 29740407 PMCID: PMC5924777 DOI: 10.3389/fmicb.2018.00749] [Citation(s) in RCA: 83] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 04/03/2018] [Indexed: 12/20/2022] Open
Abstract
Metagenomics poses opportunities for clinical and public health virology applications by offering a way to assess complete taxonomic composition of a clinical sample in an unbiased way. However, the techniques required are complicated and analysis standards have yet to develop. This, together with the wealth of different tools and workflows that have been proposed, poses a barrier for new users. We evaluated 49 published computational classification workflows for virus metagenomics in a literature review. To this end, we described the methods of existing workflows by breaking them up into five general steps and assessed their ease-of-use and validation experiments. Performance scores of previous benchmarks were summarized and correlations between methods and performance were investigated. We indicate the potential suitability of the different workflows for (1) time-constrained diagnostics, (2) surveillance and outbreak source tracing, (3) detection of remote homologies (discovery), and (4) biodiversity studies. We provide two decision trees for virologists to help select a workflow for medical or biodiversity studies, as well as directions for future developments in clinical viral metagenomics.
Collapse
Affiliation(s)
- Sam Nooij
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands.,Viroscience Laboratory, Erasmus University Medical Centre, Rotterdam, Netherlands
| | - Dennis Schmitz
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands.,Viroscience Laboratory, Erasmus University Medical Centre, Rotterdam, Netherlands
| | - Harry Vennema
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Annelies Kroneman
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | - Marion P G Koopmans
- Emerging and Endemic Viruses, Centre for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands.,Viroscience Laboratory, Erasmus University Medical Centre, Rotterdam, Netherlands
| |
Collapse
|
46
|
Peñalver Bernabé B, Cralle L, Gilbert JA. Systems biology of the human microbiome. Curr Opin Biotechnol 2018; 51:146-153. [PMID: 29453029 DOI: 10.1016/j.copbio.2018.01.018] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Accepted: 01/22/2018] [Indexed: 12/15/2022]
Abstract
Recent research has shown that the microbiome-a collection of microorganisms, including bacteria, fungi, and viruses, living on and in a host-are of extraordinary importance in human health, even from conception and development in the uterus. Therefore, to further our ability to diagnose disease, to predict treatment outcomes, and to identify novel therapeutics, it is essential to include microbiome and microbial metabolic biomarkers in Systems Biology investigations. In clinical studies or, more precisely, Systems Medicine approaches, we can use the diversity and individual characteristics of the personal microbiome to enhance our resolution for patient stratification. In this review, we explore several Systems Medicine approaches, including Microbiome Wide Association Studies to understand the role of the human microbiome in health and disease, with a focus on 'preventive medicine' or P4 (i.e., personalized, predictive, preventive, participatory) medicine.
Collapse
Affiliation(s)
| | - Lauren Cralle
- The Microbiome Center, Department of Surgery, University of Chicago, Chicago, USA; Biosciences Division, Argonne National Laboratory, Lemont, IL, USA
| | - Jack A Gilbert
- The Microbiome Center, Department of Surgery, University of Chicago, Chicago, USA; Biosciences Division, Argonne National Laboratory, Lemont, IL, USA; Marine Biology Laboratory, Woods Hole, MA, USA.
| |
Collapse
|
47
|
Evaluation of nine popular de novo assemblers in microbial genome assembly. J Microbiol Methods 2017; 143:32-37. [DOI: 10.1016/j.mimet.2017.09.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2017] [Revised: 09/09/2017] [Accepted: 09/10/2017] [Indexed: 11/17/2022]
|
48
|
Salaheen S, Kim SW, Haley BJ, Van Kessel JAS, Biswas D. Alternative Growth Promoters Modulate Broiler Gut Microbiome and Enhance Body Weight Gain. Front Microbiol 2017; 8:2088. [PMID: 29123512 PMCID: PMC5662582 DOI: 10.3389/fmicb.2017.02088] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2017] [Accepted: 10/11/2017] [Indexed: 02/01/2023] Open
Abstract
Antibiotic growth promoters (AGPs) are frequently used to enhance weight-gain in poultry production. However, there has been increasing concern over the impact of AGP on the emergence of antibiotic resistance in zoonotic bacterial pathogens in the microbial community of the poultry gut. In this study, we adopted mass-spectrophotometric, phylogenetic, and shotgun-metagenomic approaches to evaluate bioactive phenolic extracts (BPE) from blueberry (Vaccinium corymbosum) and blackberry (Rubus fruticosus) pomaces as AGP alternatives in broilers. We conducted two trials with 100 Cobb-500 broiler chicks (in each trial) in four equal groups that were provided water with no supplementation, supplemented with AGP (tylosin, neomycin sulfate, bacitracin, erythromycin, and oxytetracycline), or supplemented with 0.1 g Gallic acid equivalent (GAE)/L or 1.0 g GAE/L (during the last 72 h before euthanasia) of BPE for 6 weeks. When compared with the control group (water only), the chickens supplemented with AGP and 0.1 g GAE/L of BPE gained 9.5 and 5.8% more body weight, respectively. The microbiomes of both the AGP- and BPE-treated chickens had higher Firmicutes to Bacteroidetes ratios. AGP supplementation appeared to be associated with higher relative abundance of bacteriophages and unique cecal resistomes compared with BPE supplementation or control. Functional characterization of cecal microbiomes revealed significant animal-to-animal variation in the relative abundance of genes involved in energy and carbohydrate metabolism. These findings established a baseline upon which mechanisms of plant-based performance enhancers in regulation of animal growth can be investigated. In addition, the data will aid in designing alternate strategies to improve animal growth performance and consequently production.
Collapse
Affiliation(s)
- Serajus Salaheen
- Environmental Microbial and Food Safety Laboratory, Beltsville Agricultural Research Center, Agricultural Research Services, United States Department of Agriculture, Beltsville, MD, United States
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, United States
| | - Seon-Woo Kim
- Environmental Microbial and Food Safety Laboratory, Beltsville Agricultural Research Center, Agricultural Research Services, United States Department of Agriculture, Beltsville, MD, United States
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, United States
| | - Bradd J. Haley
- Environmental Microbial and Food Safety Laboratory, Beltsville Agricultural Research Center, Agricultural Research Services, United States Department of Agriculture, Beltsville, MD, United States
| | - Jo Ann S. Van Kessel
- Environmental Microbial and Food Safety Laboratory, Beltsville Agricultural Research Center, Agricultural Research Services, United States Department of Agriculture, Beltsville, MD, United States
| | - Debabrata Biswas
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, United States
- Center for Food Safety and Security Systems, University of Maryland, College Park, MD, United States
| |
Collapse
|
49
|
Abstract
Background Homology search is still a significant step in functional analysis for genomic data. Profile Hidden Markov Model-based homology search has been widely used in protein domain analysis in many different species. In particular, with the fast accumulation of transcriptomic data of non-model species and metagenomic data, profile homology search is widely adopted in integrated pipelines for functional analysis. While the state-of-the-art tool HMMER has achieved high sensitivity and accuracy in domain annotation, the sensitivity of HMMER on short reads declines rapidly. The low sensitivity on short read homology search can lead to inaccurate domain composition and abundance computation. Our experimental results showed that half of the reads were missed by HMMER for a RNA-Seq dataset. Thus, there is a need for better methods to improve the homology search performance for short reads. Results We introduce a profile homology search tool named Short-Pair that is designed for short paired-end reads. By using an approximate Bayesian approach employing distribution of fragment lengths and alignment scores, Short-Pair can retrieve the missing end and determine true domains. In particular, Short-Pair increases the accuracy in aligning short reads that are part of remote homologs. We applied Short-Pair to a RNA-Seq dataset and a metagenomic dataset and quantified its sensitivity and accuracy on homology search. The experimental results show that Short-Pair can achieve better overall performance than the state-of-the-art methodology of profile homology search. Conclusions Short-Pair is best used for next-generation sequencing (NGS) data that lack reference genomes. It provides a complementary paired-end read homology search tool to HMMER. The source code is freely available at https://sourceforge.net/projects/short-pair/.
Collapse
|
50
|
Abstract
Microbiome analysis involves determining the composition and function of a community of microorganisms in a particular location. For the gastroenterologist, this technology opens up a rapidly evolving set of challenges and opportunities for generating novel insights into the health of patients on the basis of microbiota characterizations from intestinal, hepatic or extraintestinal samples. Alterations in gut microbiota composition correlate with intestinal and extraintestinal disease and, although only a few mechanisms are known, the microbiota are still an attractive target for developing biomarkers for disease detection and management as well as potential therapeutic applications. In this Review, we summarize the major decision points confronting new entrants to the field or for those designing new projects in microbiome research. We provide recommendations based on current technology options and our experience of sequencing platform choices. We also offer perspectives on future applications of microbiome research, which we hope convey the promise of this technology for clinical applications.
Collapse
|