1
|
Chanderraj R, Brown CA, Hinkle K, Falkowski N, Woods RJ, Dickson RP. The bacterial density of clinical rectal swabs is highly variable, correlates with sequencing contamination, and predicts patient risk of extraintestinal infection. MICROBIOME 2022; 10:2. [PMID: 34991717 PMCID: PMC8734160 DOI: 10.1186/s40168-021-01190-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 10/18/2021] [Indexed: 05/02/2023]
Abstract
BACKGROUND In ecology, population density is a key feature of community analysis. Yet in studies of the gut microbiome, bacterial density is rarely reported. Studies of hospitalized patients commonly use rectal swabs for microbiome analysis, yet variation in their bacterial density-and the clinical and methodologic significance of this variation-remains undetermined. We used an ultra-sensitive quantification approach-droplet digital PCR (ddPCR)-to quantify bacterial density in rectal swabs from 118 hospitalized patients. We compared bacterial density with bacterial community composition (via 16S rRNA amplicon sequencing) and clinical data to determine if variation in bacterial density has methodological, clinical, and prognostic significance. RESULTS Bacterial density in rectal swab specimens was highly variable, spanning five orders of magnitude (1.2 × 104-3.2 × 109 16S rRNA gene copies/sample). Low bacterial density was strongly correlated with the detection of sequencing contamination (Spearman ρ = - 0.95, p < 10-16). Low-density rectal swab communities were dominated by peri-rectal skin bacteria and sequencing contaminants (p < 0.01), suggesting that some variation in bacterial density is explained by sampling variation. Yet bacterial density was also associated with important clinical exposures, conditions, and outcomes. Bacterial density was lower among patients who had received piperacillin-tazobactam (p = 0.017) and increased among patients with multiple medical comorbidities (Charlson score, p = 0.0040) and advanced age (p = 0.043). Bacterial density at the time of hospital admission was independently associated with subsequent extraintestinal infection (p = 0.0028), even when controlled for severity of illness and comorbidities. CONCLUSIONS The bacterial density of rectal swabs is highly variable, and this variability is of methodological, clinical, and prognostic significance. Microbiome studies using rectal swabs are vulnerable to sequencing contamination and should include appropriate negative sequencing controls. Among hospitalized patients, gut bacterial density is associated with clinical exposures (antibiotics, comorbidities) and independently predicts infection risk. Bacterial density is an important and under-studied feature of gut microbiome community analysis. Video abstract.
Collapse
Affiliation(s)
- Rishi Chanderraj
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Christopher A Brown
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
- Institute for Research on Innovation and Science, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Kevin Hinkle
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Nicole Falkowski
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Robert J Woods
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
- Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Robert P Dickson
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA.
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, MI, USA.
- Weil Institute for Critical Care Research & Innovation, MI, Ann Arbor, USA.
- Pulmonary and Critical Care Medicine, University of Michigan Health System, 6220 MSRB III / SPC 5642, 1150 W. Medical Center Dr, Ann Arbor, MI, 48109-5642, USA.
| |
Collapse
|
2
|
The intratumoral microbiome: Characterization methods and functional impact. Cancer Lett 2021; 522:63-79. [PMID: 34517085 DOI: 10.1016/j.canlet.2021.09.009] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 09/01/2021] [Accepted: 09/06/2021] [Indexed: 12/24/2022]
Abstract
Live-pathogenic bacteria, which were identified inside tumors hundreds year ago, are key elements in modern cancer research. As they have a relatively accessible genome, they offer a multitude of metabolic engineering opportunities, useful in several clinical fields. Better understanding of the tumor microenvironment and its associated microbiome would help conceptualize new metabolically engineered species, triggering efficient therapeutic responses against cancer. Unfortunately, given the low microbial biomass nature of tumors, characterizing the tumor microbiome remains a challenge. Tumors have a high host versus bacterial DNA ratio, making it extremely complex to identify tumor-associated bacteria. Nevertheless, with the improvements in next-generation analytic tools, recent studies demonstrated the existence of intratumor bacteria inside defined tumors. It is now proven that each cancer subtype has a unique microbiome, characterized by bacterial communities with specific metabolic functions. This review provides a brief overview of the main approaches used to characterize the tumor microbiome, and of the recently proposed functions of intracellular bacteria identified in oncological entities. The therapeutic aspects of live-pathogenic microbes are also discussed, regarding the tumor microenvironment of each cancer type.
Collapse
|
3
|
Kong HG, Ham H, Lee MH, Park DS, Lee YH. Microbial Community Dysbiosis and Functional Gene Content Changes in Apple Flowers due to Fire Blight. THE PLANT PATHOLOGY JOURNAL 2021; 37:404-412. [PMID: 34365752 PMCID: PMC8357563 DOI: 10.5423/ppj.nt.05.2021.0072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 06/02/2021] [Accepted: 06/06/2021] [Indexed: 06/13/2023]
Abstract
Despite the plant microbiota plays an important role in plant health, little is known about the potential interactions of the flower microbiota with pathogens. In this study, we investigated the microbial community of apple blossoms when infected with Erwinia amylovora. The long-read sequencing technology, which significantly increased the genome sequence resolution, thus enabling the characterization of fire blight-induced changes in the flower microbial community. Each sample showed a unique microbial community at the species level. Pantoea agglomerans and P. allii were the most predominant bacteria in healthy flowers, whereas E. amylovora comprised more than 90% of the microbial population in diseased flowers. Furthermore, gene function analysis revealed that glucose and xylose metabolism were enriched in diseased flowers. Overall, our results showed that the microbiome of apple blossoms is rich in specific bacteria, and the nutritional composition of flowers is important for the incidence and spread of bacterial disease.
Collapse
Affiliation(s)
- Hyun Gi Kong
- Corresponding author. Phone) +82-63-238-3279, FAX) +82-63-238-3838, E-mail)
| | | | | | | | | |
Collapse
|
4
|
Wei ZG, Zhang XD, Cao M, Liu F, Qian Y, Zhang SW. Comparison of Methods for Picking the Operational Taxonomic Units From Amplicon Sequences. Front Microbiol 2021; 12:644012. [PMID: 33841367 PMCID: PMC8024490 DOI: 10.3389/fmicb.2021.644012] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 02/17/2021] [Indexed: 12/31/2022] Open
Abstract
With the advent of next-generation sequencing technology, it has become convenient and cost efficient to thoroughly characterize the microbial diversity and taxonomic composition in various environmental samples. Millions of sequencing data can be generated, and how to utilize this enormous sequence resource has become a critical concern for microbial ecologists. One particular challenge is the OTUs (operational taxonomic units) picking in 16S rRNA sequence analysis. Lucky, this challenge can be directly addressed by sequence clustering that attempts to group similar sequences. Therefore, numerous clustering methods have been proposed to help to cluster 16S rRNA sequences into OTUs. However, each method has its clustering mechanism, and different methods produce diverse outputs. Even a slight parameter change for the same method can also generate distinct results, and how to choose an appropriate method has become a challenge for inexperienced users. A lot of time and resources can be wasted in selecting clustering tools and analyzing the clustering results. In this study, we introduced the recent advance of clustering methods for OTUs picking, which mainly focus on three aspects: (i) the principles of existing clustering algorithms, (ii) benchmark dataset construction for OTU picking and evaluation metrics, and (iii) the performance of different methods with various distance thresholds on benchmark datasets. This paper aims to assist biological researchers to select the reasonable clustering methods for analyzing their collected sequences and help algorithm developers to design more efficient sequences clustering methods.
Collapse
Affiliation(s)
- Ze-Gang Wei
- Institute of Physics and Optoelectronics Technology, Baoji University of Arts and Sciences, Baoji, China
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi’an, China
| | - Xiao-Dan Zhang
- Institute of Physics and Optoelectronics Technology, Baoji University of Arts and Sciences, Baoji, China
| | - Ming Cao
- Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- School of Mathematics and Statistics, Shaanxi Xueqian Normal University, Xi’an, China
| | - Fei Liu
- Institute of Physics and Optoelectronics Technology, Baoji University of Arts and Sciences, Baoji, China
| | - Yu Qian
- Institute of Physics and Optoelectronics Technology, Baoji University of Arts and Sciences, Baoji, China
| | - Shao-Wu Zhang
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi’an, China
| |
Collapse
|
5
|
Gignoux-Wolfsohn SA, Precht WF, Peters EC, Gintert BE, Kaufman LS. Ecology, histopathology, and microbial ecology of a white-band disease outbreak in the threatened staghorn coral Acropora cervicornis. DISEASES OF AQUATIC ORGANISMS 2020; 137:217-237. [PMID: 32132275 DOI: 10.3354/dao03441] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
This study is a multi-pronged description of a temperature-induced outbreak of white-band disease (WBD) that occurred in Acropora cervicornis off northern Miami Beach, Florida (USA), from July to October 2014. We describe the ecology of the disease and examine diseased corals using both histopathology and next-generation bacterial 16S gene sequencing, making it possible to better understand the effect this disease has on the coral holobiont, and to address some of the seeming contradictions among previous studies of WBD that employed either a purely histological or molecular approach. The outbreak began in July 2014, as sea surface temperatures reached 29°C, and peaked in mid-September, a month after the sea surface temperature maximum. The microscopic anatomy of apparently healthy portions of colonies displaying active disease signs appeared normal except for some tissue atrophy and dissociation of mesenterial filaments deep within the branch. Structural changes were more pronounced in visibly diseased fragments, with atrophy, necrosis, and lysing of surface and basal body wall and polyp structures at the tissue-loss margin. The only bacteria evident microscopically in both diseased and apparently healthy tissues with Giemsa staining was a Rickettsiales-like organism (RLO) occupying mucocytes. Sequencing also identified bacteria belonging to the order Rickettsiales in all fragments. When compared to apparently healthy fragments, diseased fragments had more diverse bacterial communities made up of many previously suggested potential primary pathogens and secondary (opportunistic) colonizers. Interactions between elevated seawater temperatures, the coral host, and pathogenic members of the diseased microbiome all contribute to the coral displaying signs of WBD.
Collapse
|
6
|
Yadav D, Dutta A, Mande SS. OTUX: V-region specific OTU database for improved 16S rRNA OTU picking and efficient cross-study taxonomic comparison of microbiomes. DNA Res 2019; 26:147-156. [PMID: 30624596 PMCID: PMC6476724 DOI: 10.1093/dnares/dsy045] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 12/13/2018] [Indexed: 02/01/2023] Open
Abstract
Many microbiome studies employ reference-based operational taxonomic unit (OTU)-picking methods, which in general, rely on databases cataloguing reference OTUs identified through clustering full-length 16S rRNA genes. Given that the rate of accumulation of mutations are not uniform throughout the length of a 16S rRNA gene across different taxonomic clades, results of OTU identification or taxonomic classification obtained using ‘short-read’ sequence queries (as generated by next-generation sequencing platforms) can be inconsistent and of suboptimal accuracy. De novo OTU clustering results too can significantly vary depending upon the hypervariable region (V-region) targeted for sequencing. As a consequence, comparison of microbiomes profiled in different scientific studies becomes difficult and often poses a challenge in analysing new findings in context of prior knowledge. The OTUX approach of reference-based OTU-picking proposes to overcome these limitations by using ‘customized’ OTU reference databases, which can cater to different sets of short-read sequences corresponding to different 16S V-regions. The results obtained with OTUX-approach (which are in terms of OTUX-OTU identifiers) can also be ‘mapped back’ or represented in terms of other OTU database identifiers/taxonomy, e.g. Greengenes, thus allowing for easy cross-study comparisons. Validation with simulated datasets indicates more efficient, accurate, and consistent taxonomic classifications obtained using OTUX-approach, as compared with conventional methods.
Collapse
Affiliation(s)
- Deepak Yadav
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Limited, Pune, Maharashtra, India
| | - Anirban Dutta
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Limited, Pune, Maharashtra, India
| | - Sharmila S Mande
- Bio-Sciences R&D Division, TCS Research, Tata Consultancy Services Limited, Pune, Maharashtra, India
| |
Collapse
|
7
|
Wei ZG, Zhang SW. DMSC: A Dynamic Multi-Seeds Method for Clustering 16S rRNA Sequences Into OTUs. Front Microbiol 2019; 10:428. [PMID: 30915052 PMCID: PMC6422886 DOI: 10.3389/fmicb.2019.00428] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2019] [Accepted: 02/19/2019] [Indexed: 12/30/2022] Open
Abstract
Next-generation sequencing (NGS)-based 16S rRNA sequencing by jointly using the PCR amplification and NGS technology is a cost-effective technique, which has been successfully used to study the phylogeny and taxonomy of samples from complex microbiomes or environments. Clustering 16S rRNA sequences into operational taxonomic units (OTUs) is often the first step for many downstream analyses. Heuristic clustering is one of the most widely employed approaches for generating OTUs. However, most heuristic OTUs clustering methods just select one single seed sequence to represent each cluster, resulting in their outcomes suffer from either overestimation of OTUs number or sensitivity to sequencing errors. In this paper, we present a novel dynamic multi-seeds clustering method (namely DMSC) to pick OTUs. DMSC first heuristically generates clusters according to the distance threshold. When the size of a cluster reaches the pre-defined minimum size, then DMSC selects the multi-core sequences (MCS) as the seeds that are defined as the n-core sequences (n ≥ 3), in which the distance between any two sequences is less than the distance threshold. A new sequence is assigned to the corresponding cluster depending on the average distance to MCS and the distance standard deviation within the MCS. If a new sequence is added to the cluster, dynamically update the MCS until no sequence is merged into the cluster. The new method DMSC was tested on several simulated and real-life sequence datasets and also compared with the traditional heuristic methods such as CD-HIT, UCLUST, and DBH. Experimental results in terms of the inferred OTUs number, normalized mutual information (NMI) and Matthew correlation coefficient (MCC) metrics demonstrate that DMSC can produce higher quality clusters with low memory usage and reduce OTU overestimation. Additionally, DMSC is also robust to the sequencing errors. The DMSC software can be freely downloaded from https://github.com/NWPU-903PR/DMSC.
Collapse
Affiliation(s)
- Ze-Gang Wei
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi'an, China.,Institute of Physics and Optoelectronics Technology, Baoji University of Arts and Science, Baoji, China
| | - Shao-Wu Zhang
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi'an, China
| |
Collapse
|
8
|
Ozkan J, Willcox MD. The Ocular Microbiome: Molecular Characterisation of a Unique and Low Microbial Environment. Curr Eye Res 2019; 44:685-694. [PMID: 30640553 DOI: 10.1080/02713683.2019.1570526] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Aim: The ocular surface is continually exposed to bacteria from the environment and traditional culture-based microbiological studies have isolated a low diversity of microorganisms from this region. The use of culture-independent methods to define the ocular microbiome, primarily involving 16S ribosomal RNA gene sequencing studies, have shown that the microbial communities present on the ocular surface have a greater diversity than previously reported. Method: A review of the literature on ocular microbiome research in health and disease. Results: Molecular techniques have been used to investigate the effect of contact lens wear and disease on the microbiota of the ocular surface and eyelids and the immunoregulatory role of the ocular surface microbiota. Studies have shown that compositional changes in the microbiota occur in ocular surface disorders such as blepharitis, trachoma and dry eye and also suggest a role of the ocular and non-ocular microbiome in retinal disease including age-related macular degeneration, glaucoma, uveitis and diabetic retinopathy. However, ocular microbiome studies need to recognise the potential for contamination to impact findings and carefully control each stage of the experimental procedure and to utilise statistical methods to identify contamination signals. Conclusion: The healthy ocular surface is characterised by a relatively stable, comparatively low diversity microbiome with recent findings that the bacteria of the ocular surface appear to have a role in maintaining homeostasis by modulating immune function.
Collapse
Affiliation(s)
- Jerome Ozkan
- a School of Optometry and Vision Science , University of New South Wales , Sydney , Australia.,b School of Biological, Earth and Environmental Sciences , University of New South Wales , Sydney , Australia
| | - Mark D Willcox
- a School of Optometry and Vision Science , University of New South Wales , Sydney , Australia
| |
Collapse
|
9
|
Earl JP, Adappa ND, Krol J, Bhat AS, Balashov S, Ehrlich RL, Palmer JN, Workman AD, Blasetti M, Sen B, Hammond J, Cohen NA, Ehrlich GD, Mell JC. Species-level bacterial community profiling of the healthy sinonasal microbiome using Pacific Biosciences sequencing of full-length 16S rRNA genes. MICROBIOME 2018; 6:190. [PMID: 30352611 PMCID: PMC6199724 DOI: 10.1186/s40168-018-0569-2] [Citation(s) in RCA: 92] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Accepted: 10/02/2018] [Indexed: 05/03/2023]
Abstract
BACKGROUND Pan-bacterial 16S rRNA microbiome surveys performed with massively parallel DNA sequencing technologies have transformed community microbiological studies. Current 16S profiling methods, however, fail to provide sufficient taxonomic resolution and accuracy to adequately perform species-level associative studies for specific conditions. This is due to the amplification and sequencing of only short 16S rRNA gene regions, typically providing for only family- or genus-level taxonomy. Moreover, sequencing errors often inflate the number of taxa present. Pacific Biosciences' (PacBio's) long-read technology in particular suffers from high error rates per base. Herein, we present a microbiome analysis pipeline that takes advantage of PacBio circular consensus sequencing (CCS) technology to sequence and error correct full-length bacterial 16S rRNA genes, which provides high-fidelity species-level microbiome data. RESULTS Analysis of a mock community with 20 bacterial species demonstrated 100% specificity and sensitivity with regard to taxonomic classification. Examination of a 250-plus species mock community demonstrated correct species-level classification of > 90% of taxa, and relative abundances were accurately captured. The majority of the remaining taxa were demonstrated to be multiply, incorrectly, or incompletely classified. Using this methodology, we examined the microgeographic variation present among the microbiomes of six sinonasal sites, by both swab and biopsy, from the anterior nasal cavity to the sphenoid sinus from 12 subjects undergoing trans-sphenoidal hypophysectomy. We found greater variation among subjects than among sites within a subject, although significant within-individual differences were also observed. Propiniobacterium acnes (recently renamed Cutibacterium acnes) was the predominant species throughout, but was found at distinct relative abundances by site. CONCLUSIONS Our microbial composition analysis pipeline for single-molecule real-time 16S rRNA gene sequencing (MCSMRT, https://github.com/jpearl01/mcsmrt ) overcomes deficits of standard marker gene-based microbiome analyses by using CCS of entire 16S rRNA genes to provide increased taxonomic and phylogenetic resolution. Extensions of this approach to other marker genes could help refine taxonomic assignments of microbial species and improve reference databases, as well as strengthen the specificity of associations between microbial communities and dysbiotic states.
Collapse
Affiliation(s)
- Joshua P. Earl
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Nithin D. Adappa
- Veteran’s Administration Medical Center, Department of Otorhinolaryngology: Head and Neck Surgery, University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5 Ravdin, Philadelphia, PA 19104-4283 USA
| | - Jaroslaw Krol
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Archana S. Bhat
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Sergey Balashov
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Rachel L. Ehrlich
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - James N. Palmer
- Veteran’s Administration Medical Center, Department of Otorhinolaryngology: Head and Neck Surgery, University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5 Ravdin, Philadelphia, PA 19104-4283 USA
| | - Alan D. Workman
- Veteran’s Administration Medical Center, Department of Otorhinolaryngology: Head and Neck Surgery, University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5 Ravdin, Philadelphia, PA 19104-4283 USA
| | - Mariel Blasetti
- Veteran’s Administration Medical Center, Department of Otorhinolaryngology: Head and Neck Surgery, University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5 Ravdin, Philadelphia, PA 19104-4283 USA
| | - Bhaswati Sen
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Jocelyn Hammond
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Noam A. Cohen
- Veteran’s Administration Medical Center, Department of Otorhinolaryngology: Head and Neck Surgery, University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5 Ravdin, Philadelphia, PA 19104-4283 USA
| | - Garth D. Ehrlich
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| | - Joshua Chang Mell
- Department of Microbiology & Immunology, Centers for Genomic Sciences and Advanced Microbial Processing, Drexel University College of Medicine, 245 N 15th Street, Philadelphia, PA 19102 USA
| |
Collapse
|
10
|
Barriuso J, Martínez MJ. In Silico Analysis of the Quorum Sensing Metagenome in Environmental Biofilm Samples. Front Microbiol 2018; 9:1243. [PMID: 29930547 PMCID: PMC6000730 DOI: 10.3389/fmicb.2018.01243] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2018] [Accepted: 05/23/2018] [Indexed: 01/09/2023] Open
Abstract
Quorum sensing (QS) is a sophisticated cell to cell signaling mechanism mediated by small diffusible molecules called “autoinducers.” This phenomenon is well studied in bacteria, where different QS systems are described that differ between Gram-negative and Gram-positive bacteria. However, a common system to these groups was discovered, the autoinducer 2. QS has implications in biofilm formation, where the application of metagenomic techniques to study these phenomena may be useful to understand the communication networks established by the different components of the community, and to discover new targets for microbial control. Here we present an in silico screening of QS proteins in all publicly available biofilm metagenomes from the JGI database. We performed sequence, conserved motifs, phylogenetic, and three-dimensional structure analyses of the candidates, resulting in an effective strategy to search QS proteins in metagenomes sequences. The number of QS proteins present in each sample, and its phylogenetic affiliation, was clearly related to the bacterial diversity and the origin of the biofilm. The samples isolated from natural habitats presented clear differences with those from artificial habitats. Interesting findings have been made in the abundance of LuxR-like proteins finding an unbalanced ratio between the synthases and the receptor proteins in Bacteroidetes bacteria, pointing out the existence of “cheaters” in this group. Moreover, we have shown the presence of the LuxI/R QS system in bacteria from the Nitrospira taxonomic group. Finally, some undescribed proteins from the HdtS family have been found in Gamma-proteobacteria.
Collapse
Affiliation(s)
- Jorge Barriuso
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas, Madrid, Spain
| | - María J Martínez
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas, Madrid, Spain
| |
Collapse
|
11
|
Cabral BCA, Hoffmann L, Budowle B, Ürményi TP, Moura-Neto RS, Azevedo SMFO, Silva R. Planktonic microbial profiling in water samples from a Brazilian Amazonian reservoir. Microbiologyopen 2018; 7:e00523. [PMID: 29380948 PMCID: PMC5911997 DOI: 10.1002/mbo3.523] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Revised: 06/18/2017] [Accepted: 06/27/2017] [Indexed: 11/18/2022] Open
Abstract
Our comprehension of the dynamics and diversity of freshwater planktonic bacterial communities is far from complete concerning the Brazilian Amazonian region. Therefore, reference studies are urgently needed. We mapped bacterial communities present in the planktonic communities of a freshwater artificial reservoir located in the western Amazonian basin. Two samples were obtained from rainy and dry seasons, the periods during which water quality and plankton diversity undergo the most significant changes. Hypervariable 16S rRNA and shotgun sequencing were performed to describe the first reference of a microbial community in an Amazonian lentic system. Microbial composition consisted mainly of Betaproteobacteria, Cyanobacteria, Alphaproteobacteria, and Actinobacteria in the dry period. The bacteria distribution in the rainy period was notably absent of Cyanobacteria. Microcystis was observed in the dry period in which the gene cluster for cyanotoxins was found. Iron acquisition gene group was higher in the sample from the rainy season. This work mapped the first inventory of the planktonic microbial community of a large water reservoir in the Amazon, providing a reference for future functional studies and determining other communities and how they interact.
Collapse
Affiliation(s)
- Bianca C A Cabral
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Luísa Hoffmann
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, TX, USA.,Center of Excellence in Genomic Medicine Research (CEGMR) King Abdulaziz University, Jeddah, Saudi Arabia
| | - Turán P Ürményi
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rodrigo S Moura-Neto
- Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Sandra M F O Azevedo
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rosane Silva
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
12
|
Gignoux-Wolfsohn SA, Aronson FM, Vollmer SV. Complex interactions between potentially pathogenic, opportunistic, and resident bacteria emerge during infection on a reef-building coral. FEMS Microbiol Ecol 2017. [PMID: 28637338 DOI: 10.1093/femsec/fix080] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Increased bacterial diversity on diseased corals can obscure disease etiology and complicate our understanding of pathogenesis. To untangle microbes that may cause white band disease signs from microbes responding to disease, we inoculated healthy Acropora cervicornis corals with an infectious dose from visibly diseased corals. We sampled these dosed corals and healthy controls over time for sequencing of the bacterial 16S region. Endozoicomonas were associated with healthy fragments from 4/10 colonies, dominating microbiomes before dosing and decreasing over time only in corals that displayed disease signs, suggesting a role in disease resistance. We grouped disease-associated bacteria by when they increased in abundance (primary vs secondary) and whether they originated in the dose (colonizers) or the previously healthy corals (responders). We found that all primary responders increased in all dosed corals regardless of final disease state and are therefore unlikely to cause disease signs. In contrast, primary colonizers in the families Pasteurellaceae and Francisellaceae increased solely in dosed corals that ultimately displayed disease signs, and may be infectious foreign bacteria involved in the development of disease signs. Moving away from a static comparison of diseased and healthy bacterial communities, we provide a framework to identify key players in other coral diseases.
Collapse
Affiliation(s)
- Sarah A Gignoux-Wolfsohn
- Department of Ecology, Evolution, & Natural Resources School of Environmental and Biological Sciences, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901-8525, USA
| | - Felicia M Aronson
- Marine Science Center, Northeastern University, Nahant, MA 01908, USA
| | - Steven V Vollmer
- Marine Science Center, Northeastern University, Nahant, MA 01908, USA
| |
Collapse
|
13
|
Guo H, Rischer M, Sperfeld M, Weigel C, Menzel KD, Clardy J, Beemelmanns C. Natural products and morphogenic activity of γ-Proteobacteria associated with the marine hydroid polyp Hydractinia echinata. Bioorg Med Chem 2017; 25:6088-6097. [PMID: 28893599 PMCID: PMC5675742 DOI: 10.1016/j.bmc.2017.06.053] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Revised: 06/21/2017] [Accepted: 06/30/2017] [Indexed: 02/07/2023]
Abstract
Illumina 16S rRNA gene sequencing was used to profile the associated bacterial community of the marine hydroid Hydractinia echinata, a long-standing model system in developmental biology. 56 associated bacteria were isolated and evaluated for their antimicrobial activity. Three strains were selected for further in-depth chemical analysis leading to the identification of 17 natural products. Several γ-Proteobacteria were found to induce settlement of the motile larvae, but only six isolates induced the metamorphosis to the primary polyp stage within 24h. Our study paves the way to better understand how bacterial partners contribute to protection, homeostasis and propagation of the hydroid polyp.
Collapse
Affiliation(s)
- Huijuan Guo
- Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Beutenbergstraβe 11a, D-07745 Jena, Germany
| | - Maja Rischer
- Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Beutenbergstraβe 11a, D-07745 Jena, Germany
| | - Martin Sperfeld
- Department of Applied and Ecological Microbiology, Institute for Microbiology, Friedrich Schiller University Jena, Philosophenweg 12, D-07743 Jena, Germany
| | - Christiane Weigel
- Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Beutenbergstraβe 11a, D-07745 Jena, Germany
| | - Klaus Dieter Menzel
- Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Beutenbergstraβe 11a, D-07745 Jena, Germany
| | - Jon Clardy
- Department of Biological Chemistry & Molecular Pharmacology, Harvard Medical School, Harvard University, 240 Longwood Ave., Boston, MA 02115, USA
| | - Christine Beemelmanns
- Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Beutenbergstraβe 11a, D-07745 Jena, Germany.
| |
Collapse
|
14
|
Whelan FJ, Surette MG. A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis. MICROBIOME 2017; 5:100. [PMID: 28807046 PMCID: PMC5557527 DOI: 10.1186/s40168-017-0314-2] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Accepted: 07/21/2017] [Indexed: 05/11/2023]
Abstract
BACKGROUND Advances in next-generation sequencing technologies have allowed for detailed, molecular-based studies of microbial communities such as the human gut, soil, and ocean waters. Sequencing of the 16S rRNA gene, specific to prokaryotes, using universal PCR primers has become a common approach to studying the composition of these microbiota. However, the bioinformatic processing of the resulting millions of DNA sequences can be challenging, and a standardized protocol would aid in reproducible analyses. METHODS The short-read library 16S rRNA gene sequencing pipeline (sl1p, pronounced "slip") was designed with the purpose of mitigating this lack of reproducibility by combining pre-existing tools into a computational pipeline. This pipeline automates the processing of raw 16S rRNA gene sequencing data to create human-readable tables, graphs, and figures to make the collected data more readily accessible. RESULTS Data generated from mock communities were compared using eight OTU clustering algorithms, two taxon assignment approaches, and three 16S rRNA gene reference databases. While all of these algorithms and options are available to sl1p users, through testing with human-associated mock communities, AbundantOTU+, the RDP Classifier, and the Greengenes 2011 reference database were chosen as sl1p's defaults based on their ability to best represent the known input communities. CONCLUSIONS sl1p promotes reproducible research by providing a comprehensive log file, and reduces the computational knowledge needed by the user to process next-generation sequencing data. sl1p is freely available at https://bitbucket.org/fwhelan/sl1p .
Collapse
Affiliation(s)
- Fiona J. Whelan
- Department of Biochemistry and Biomedical Sciences, McMaster University, 1280 Main St. W, Hamilton, Canada
| | - Michael G. Surette
- Department of Biochemistry and Biomedical Sciences, McMaster University, 1280 Main St. W, Hamilton, Canada
- Department of Medicine, McMaster University, 1280 Main St. W, Hamilton, Canada
| |
Collapse
|
15
|
Wei ZG, Zhang SW, Zhang YZ. DMclust, a Density-based Modularity Method for Accurate OTU Picking of 16S rRNA Sequences. Mol Inform 2017; 36. [PMID: 28586119 DOI: 10.1002/minf.201600059] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Accepted: 04/25/2017] [Indexed: 11/08/2022]
Abstract
Clustering 16S rRNA sequences into operational taxonomic units (OTUs) is a crucial step in analyzing metagenomic data. Although many methods have been developed, how to obtain an appropriate balance between clustering accuracy and computational efficiency is still a major challenge. A novel density-based modularity clustering method, called DMclust, is proposed in this paper to bin 16S rRNA sequences into OTUs with high clustering accuracy. The DMclust algorithm consists of four main phases. It first searches for the sequence dense group defined as n-sequence community, in which the distance between any two sequences is less than a threshold. Then these dense groups are used to construct a weighted network, where dense groups are viewed as nodes, each pair of dense groups is connected by an edge, and the distance of pairwise groups represents the weight of the edge. Then, a modularity-based community detection method is employed to generate the preclusters. Finally, the remaining sequences are assigned to their nearest preclusters to form OTUs. Compared with existing widely used methods, the experimental results on several metagenomic datasets show that DMclust has higher accurate clustering performance with acceptable memory usage.
Collapse
Affiliation(s)
- Ze-Gang Wei
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Shao-Wu Zhang
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Yi-Zhai Zhang
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| |
Collapse
|
16
|
Cai Y, Zheng W, Yao J, Yang Y, Mai V, Mao Q, Sun Y. ESPRIT-Forest: Parallel clustering of massive amplicon sequence data in subquadratic time. PLoS Comput Biol 2017; 13:e1005518. [PMID: 28437450 PMCID: PMC5421816 DOI: 10.1371/journal.pcbi.1005518] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2016] [Revised: 05/08/2017] [Accepted: 04/13/2017] [Indexed: 12/30/2022] Open
Abstract
The rapid development of sequencing technology has led to an explosive accumulation of genomic sequence data. Clustering is often the first step to perform in sequence analysis, and hierarchical clustering is one of the most commonly used approaches for this purpose. However, it is currently computationally expensive to perform hierarchical clustering of extremely large sequence datasets due to its quadratic time and space complexities. In this paper we developed a new algorithm called ESPRIT-Forest for parallel hierarchical clustering of sequences. The algorithm achieves subquadratic time and space complexity and maintains a high clustering accuracy comparable to the standard method. The basic idea is to organize sequences into a pseudo-metric based partitioning tree for sub-linear time searching of nearest neighbors, and then use a new multiple-pair merging criterion to construct clusters in parallel using multiple threads. The new algorithm was tested on the human microbiome project (HMP) dataset, currently one of the largest published microbial 16S rRNA sequence dataset. Our experiment demonstrated that with the power of parallel computing it is now compu- tationally feasible to perform hierarchical clustering analysis of tens of millions of sequences. The software is available at http://www.acsu.buffalo.edu/∼yijunsun/lab/ESPRIT-Forest.html.
Collapse
Affiliation(s)
- Yunpeng Cai
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- * E-mail: (YC); (YS)
| | - Wei Zheng
- Department of Computer Science and Engineering, The State University of New York at Buffalo, Buffalo, New York, United States of America
| | - Jin Yao
- Department of Microbiology and Immunology, The State University of New York at Buffalo, Buffalo, New York, United States of America
| | - Yujie Yang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Volker Mai
- Department of Epidemiology, University of Florida, Gainesville, Florida, United States of America
| | - Qi Mao
- Department of Microbiology and Immunology, The State University of New York at Buffalo, Buffalo, New York, United States of America
| | - Yijun Sun
- Department of Computer Science and Engineering, The State University of New York at Buffalo, Buffalo, New York, United States of America
- Department of Microbiology and Immunology, The State University of New York at Buffalo, Buffalo, New York, United States of America
- Department of Biostatistics, The State University of New York at Buffalo, Buffalo, New York, United States of America
- * E-mail: (YC); (YS)
| |
Collapse
|
17
|
Nascimento MM, Zaura E, Mira A, Takahashi N, Ten Cate JM. Second Era of OMICS in Caries Research: Moving Past the Phase of Disillusionment. J Dent Res 2017; 96:733-740. [PMID: 28384412 DOI: 10.1177/0022034517701902] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Novel approaches using OMICS techniques enable a collective assessment of multiple related biological units, including genes, gene expression, proteins, and metabolites. In the past decade, next-generation sequencing ( NGS) technologies were improved by longer sequence reads and the development of genome databases and user-friendly pipelines for data analysis, all accessible at lower cost. This has generated an outburst of high-throughput data. The application of OMICS has provided more depth to existing hypotheses as well as new insights in the etiology of dental caries. For example, the determination of complete bacterial microbiomes of oral samples rather than selected species, together with oral metatranscriptome and metabolome analyses, supports the viewpoint of dysbiosis of the supragingival biofilms. In addition, metabolome studies have been instrumental in disclosing the contributions of major pathways for central carbon and amino acid metabolisms to biofilm pH homeostasis. New, often noncultured, oral streptococci have been identified, and their phenotypic characterization has revealed candidates for probiotic therapy. Although findings from OMICS research have been greatly informative, problems related to study design, data quality, integration, and reproducibility still need to be addressed. Also, the emergence and continuous updates of these computationally demanding technologies require expertise in advanced bioinformatics for reliable interpretation of data. Despite the obstacles cited above, OMICS research is expected to encourage the discovery of novel caries biomarkers and the development of next-generation diagnostics and therapies for caries control. These observations apply equally to the study of other oral diseases.
Collapse
Affiliation(s)
- M M Nascimento
- 1 Department of Restorative Dental Sciences, Division of Operative Dentistry, College of Dentistry, University of Florida, Gainesville, FL, USA
| | - E Zaura
- 2 Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam, University of Amsterdam and Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - A Mira
- 3 Department of Health & Genomics, Center for Advanced Research in Public Health, FISABIO Foundation, Valencia, Spain
| | - N Takahashi
- 4 Department of Oral Biology, Division of Oral Ecology and Biochemistry, Tohoku University Graduate School of Dentistry, Sendai, Japan
| | - J M Ten Cate
- 5 Royal Netherlands Academy of Arts and Sciences (KNAW), Amsterdam, the Netherlands
| |
Collapse
|
18
|
OptiClust, an Improved Method for Assigning Amplicon-Based Sequence Data to Operational Taxonomic Units. mSphere 2017; 2:mSphere00073-17. [PMID: 28289728 PMCID: PMC5343174 DOI: 10.1128/mspheredirect.00073-17] [Citation(s) in RCA: 232] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 02/15/2017] [Indexed: 11/20/2022] Open
Abstract
Assignment of 16S rRNA gene sequences to operational taxonomic units (OTUs) is a computational bottleneck in the process of analyzing microbial communities. Although this has been an active area of research, it has been difficult to overcome the time and memory demands while improving the quality of the OTU assignments. Here, we developed a new OTU assignment algorithm that iteratively reassigns sequences to new OTUs to optimize the Matthews correlation coefficient (MCC), a measure of the quality of OTU assignments. To assess the new algorithm, OptiClust, we compared it to 10 other algorithms using 16S rRNA gene sequences from two simulated and four natural communities. Using the OptiClust algorithm, the MCC values averaged 15.2 and 16.5% higher than the OTUs generated when we used the average neighbor and distance-based greedy clustering with VSEARCH, respectively. Furthermore, on average, OptiClust was 94.6 times faster than the average neighbor algorithm and just as fast as distance-based greedy clustering with VSEARCH. An empirical analysis of the efficiency of the algorithms showed that the time and memory required to perform the algorithm scaled quadratically with the number of unique sequences in the data set. The significant improvement in the quality of the OTU assignments over previously existing methods will significantly enhance downstream analysis by limiting the splitting of similar sequences into separate OTUs and merging of dissimilar sequences into the same OTU. The development of the OptiClust algorithm represents a significant advance that is likely to have numerous other applications. IMPORTANCE The analysis of microbial communities from diverse environments using 16S rRNA gene sequencing has expanded our knowledge of the biogeography of microorganisms. An important step in this analysis is the assignment of sequences into taxonomic groups based on their similarity to sequences in a database or based on their similarity to each other, irrespective of a database. In this study, we present a new algorithm for the latter approach. The algorithm, OptiClust, seeks to optimize a metric of assignment quality by shuffling sequences between taxonomic groups. We found that OptiClust produces more robust assignments and does so in a rapid and memory-efficient manner. This advance will allow for a more robust analysis of microbial communities and the factors that shape them.
Collapse
|
19
|
Martirosyan V, Unc A, Miller G, Doniger T, Wachtel C, Steinberger Y. Desert Perennial Shrubs Shape the Microbial-Community Miscellany in Laimosphere and Phyllosphere Space. MICROBIAL ECOLOGY 2016; 72:659-668. [PMID: 27450478 DOI: 10.1007/s00248-016-0822-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2016] [Accepted: 07/14/2016] [Indexed: 06/06/2023]
Abstract
Microbial function, composition, and distribution play a fundamental role in ecosystem ecology. The interaction between desert plants and their associated microbes is expected to greatly affect their response to changes in this harsh environment. Using comparative analyses, we studied the impact of three desert shrubs, Atriplex halimus (A), Artemisia herba-alba (AHA), and Hammada scoparia (HS), on soil- and leaf-associated microbial communities. DNA extracted from the leaf surface and soil samples collected beneath the shrubs were used to study associated microbial diversity using a sequencing survey of variable regions of bacterial 16S rRNA and fungal ribosomal internal transcribed spacer (ITS1). We found that the composition of bacterial and fungal orders is plant-type-specific, indicating that each plant type provides a suitable and unique microenvironment. The different adaptive ecophysiological properties of the three plant species and the differential effect on their associated microbial composition point to the role of adaptation in the shaping of microbial diversity. Overall, our findings suggest a link between plant ecophysiological adaptation as a "temporary host" and the biotic-community parameters in extreme xeric environments.
Collapse
Affiliation(s)
- Varsik Martirosyan
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 5290002, Israel
- Life Sciences International Postgraduate Educational Center, Acharyan 31 Str., Yerevan, 0040, Armenia
| | - Adrian Unc
- Boreal Ecosystems Research Initiative, Memorial University of Newfoundland, Corner Brook, Newfoundland and Labrador, A2H 6P9, Canada
| | - Gad Miller
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 5290002, Israel
| | - Tirza Doniger
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 5290002, Israel
| | - Chaim Wachtel
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 5290002, Israel
| | - Yosef Steinberger
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 5290002, Israel.
| |
Collapse
|
20
|
de la Cuesta-Zuluaga J, Escobar JS. Considerations For Optimizing Microbiome Analysis Using a Marker Gene. Front Nutr 2016; 3:26. [PMID: 27551678 PMCID: PMC4976105 DOI: 10.3389/fnut.2016.00026] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 07/26/2016] [Indexed: 12/22/2022] Open
Abstract
Next-generation sequencing technologies have found a widespread use in the study of host–microbe interactions due to the increase in their throughput and their ever-decreasing costs. The analysis of human-associated microbial communities using a marker gene, particularly the 16S rRNA, has been greatly benefited from these technologies – the human gut microbiome research being a remarkable example of such analysis that has greatly expanded our understanding of microbe-mediated human health and disease, metabolism, and food absorption. 16S studies go through a series of in vitro and in silico steps that can greatly influence their outcomes. However, the lack of a standardized workflow has led to uncertainties regarding the transparency and reproducibility of gut microbiome studies. We, here, discuss the most common challenges in the archetypical 16S rRNA workflow, including the extraction of total DNA, its use as template in PCR with primers that amplify specific hypervariable regions of the gene, amplicon sequencing, the denoising and removal of low-quality reads, the detection and removal of chimeric sequences, the clustering of high-quality sequences into operational taxonomic units, and their taxonomic classification. We recommend the essential technical information that should be conveyed in publications for reproducibility of results and encourage non-experts to include procedures and available tools that mitigate most of the problems encountered in microbiome analysis.
Collapse
Affiliation(s)
| | - Juan S Escobar
- Vidarium - Nutrition, Health and Wellness Research Center, Grupo Empresarial Nutresa , Medellín , Colombia
| |
Collapse
|
21
|
Application of a Database-Independent Approach To Assess the Quality of Operational Taxonomic Unit Picking Methods. mSystems 2016; 1:mSystems00027-16. [PMID: 27832214 PMCID: PMC5069744 DOI: 10.1128/msystems.00027-16] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Assignment of 16S rRNA gene sequences to operational taxonomic units (OTUs) allows microbial ecologists to overcome the inconsistencies and biases within bacterial taxonomy and provides a strategy for clustering similar sequences that do not have representatives in a reference database. I have applied the Matthews correlation coefficient to assess the ability of 15 reference-independent and -dependent clustering algorithms to assign sequences to OTUs. This metric quantifies the ability of an algorithm to reflect the relationships between sequences without the use of a reference and can be applied to any data set or method. The most consistently robust method was the average neighbor algorithm; however, for some data sets, other algorithms matched its performance.
Collapse
|
22
|
Wei ZG, Zhang SW. MtHc: a motif-based hierarchical method for clustering massive 16S rRNA sequences into OTUs. MOLECULAR BIOSYSTEMS 2016; 11:1907-13. [PMID: 25912934 DOI: 10.1039/c5mb00089k] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The recent sequencing revolution driven by high-throughput technologies has led to rapid accumulation of 16S rRNA sequences for microbial communities. Clustering short sequences into operational taxonomic units (OTUs) is an initial crucial process in analyzing metagenomic data. Although many methods have been proposed for OTU inferences, a major challenge is the balance between inference accuracy and computational efficiency. To address these challenges, we present a novel motif-based hierarchical method (namely MtHc) for clustering massive 16S rRNA sequences into OTUs with high clustering accuracy and low memory usage. Suppose all the 16S rRNA sequences can be used to construct a complete weighted network, where sequences are viewed as nodes, each pair of sequences is connected by an imaginary edge, and the distance of a pair of sequences represents the weight of the edge. MtHc consists of three main phrases. First, heuristically search the motif that is defined as n-node sub-graph (in the present study, n = 3, 4, 5), in which the distance between any two nodes is less than a threshold. Second, use the motif as a seed to form candidate clusters by computing the distances of other sequences with the motif. Finally, hierarchically merge the candidate clusters to generate the OTUs by only calculating the distances of motifs between two clusters. Compared with the existing methods on several simulated and real-life metagenomic datasets, we demonstrate that MtHc has higher clustering performance, less memory usage and robustness for setting parameters, and that it is more effective to handle the large-scale metagenomic datasets. The MtHC software can be freely download from for academic users.
Collapse
Affiliation(s)
- Ze-Gang Wei
- Key Laboratory of Information Fusion Technology of Ministry of Education, College of Automation, Northwestern Polytechnical University, Xi'an, 710072, China.
| | | |
Collapse
|
23
|
Aloisio I, Quagliariello A, De Fanti S, Luiselli D, De Filippo C, Albanese D, Corvaglia LT, Faldella G, Di Gioia D. Evaluation of the effects of intrapartum antibiotic prophylaxis on newborn intestinal microbiota using a sequencing approach targeted to multi hypervariable 16S rDNA regions. Appl Microbiol Biotechnol 2016; 100:5537-46. [PMID: 26971496 DOI: 10.1007/s00253-016-7410-2] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2015] [Revised: 02/18/2016] [Accepted: 02/22/2016] [Indexed: 02/04/2023]
Abstract
Different factors are known to influence the early gut colonization in newborns, among them the perinatal use of antibiotics. On the other hand, the effect on the baby of the administration of antibiotics to the mother during labor, referred to as intrapartum antibiotic prophylaxis (IAP), has received less attention, although routinely used in group B Streptococcus positive women to prevent the infection in newborns. In this work, the fecal microbiota of neonates born to mothers receiving IAP and of control subjects were compared taking advantage for the first time of high-throughput DNA sequencing technology. Seven different 16S rDNA hypervariable regions (V2, V3, V4, V6 + V7, V8, and V9) were amplified and sequenced using the Ion Torrent Personal Genome Machine. The results obtained showed significant differences in the microbial composition of newborns born to mothers who had received IAP, with a lower abundance of Actinobacteria and Bacteroidetes as well as an overrepresentation of Proteobacteria. Considering that the seven hypervariable regions showed different discriminant ability in the taxonomic identification, further analyses were performed on the V4 region evidencing in IAP infants a reduced microbial richness and biodiversity, as well as a lower number of bacterial families with a predominance of Enterobacteriaceae members. In addition, this analysis pointed out a significant reduction in Bifidobacterium spp. strains. The reduced abundance of these beneficial microorganisms, together with the increased amount of potentially pathogenic bacteria, may suggest that IAP infants are more exposed to gastrointestinal or generally health disorders later in age.
Collapse
Affiliation(s)
- Irene Aloisio
- Department of Agricultural Sciences, University of Bologna, viale Fanin 42, 40127, Bologna, Italy
| | - Andrea Quagliariello
- Laboratory of Molecular Anthropology, Centre for Genome Biology Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, via Selmi 3, 40126, Bologna, Italy
| | - Sara De Fanti
- Laboratory of Molecular Anthropology, Centre for Genome Biology Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, via Selmi 3, 40126, Bologna, Italy
| | - Donata Luiselli
- Laboratory of Molecular Anthropology, Centre for Genome Biology Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, via Selmi 3, 40126, Bologna, Italy
| | - Carlotta De Filippo
- Institute of Biometeorology (IBIMET), National Research Council (CNR), Via G. Caproni 8, 50145, Florence, Italy
| | - Davide Albanese
- Department of Computational Biology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S. Michele all'Adige, Trento, Italy
| | - Luigi Tommaso Corvaglia
- Neonatal Intensive Care Unit, S. Orsola Malpighi Hospital, Via Massarenti 11, 40138, Bologna, Italy
| | - Giacomo Faldella
- Neonatal Intensive Care Unit, S. Orsola Malpighi Hospital, Via Massarenti 11, 40138, Bologna, Italy
| | - Diana Di Gioia
- Department of Agricultural Sciences, University of Bologna, viale Fanin 42, 40127, Bologna, Italy.
| |
Collapse
|
24
|
Garnica S, Schön ME, Abarenkov K, Riess K, Liimatainen K, Niskanen T, Dima B, Soop K, Frøslev TG, Jeppesen TS, Peintner U, Kuhnert-Finkernagel R, Brandrud TE, Saar G, Oertel B, Ammirati JF. Determining threshold values for barcoding fungi: lessons from Cortinarius (Basidiomycota), a highly diverse and widespread ectomycorrhizal genus. FEMS Microbiol Ecol 2016; 92:fiw045. [PMID: 26929438 DOI: 10.1093/femsec/fiw045] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Indexed: 11/14/2022] Open
Abstract
Different distance-based threshold selection approaches were used to assess and compare use of the internal transcribed spacer (ITS) region to distinguish among 901 Cortinarius species represented by >3000 collections. Sources of error associated with genetic markers and selection approaches were explored and evaluated using MOTUs from genus and lineage based-alignments. Our study indicates that 1%-2% more species can be distinguished by using the full-length ITS barcode as compared to either the ITS1 or ITS2 regions alone. Optimal threshold values for different picking approaches and genetic marker lengths inferred from a subset of species containing major lineages ranged from 97.0% to 99.5% sequence similarity using clustering optimization and UNITE SH, and from 1% to 2% sequence dissimilarity with CROP. Errors for the optimal cutoff ranged from 0% to 70%, and these can be reduced to a maximum of 22% when excluding species lacking a barcode gap. A threshold value of 99% is suitable for distinguishing species in the majority of lineages in the genus using the entire ITS region but only 90% of the species could be identified using just the ITS1 or ITS2 region. Prior identification of species, lacking barcode gaps and their subsequent separate analyses, maximized the accuracy of threshold approaches.
Collapse
Affiliation(s)
- Sigisfredo Garnica
- Institute of Evolution and Ecology, Plant Evolutionary Ecology, University of Tübingen, Auf der Morgenstelle 5, D-72076 Tübingen, Germany
| | - Max Emil Schön
- Institute of Evolution and Ecology, Plant Evolutionary Ecology, University of Tübingen, Auf der Morgenstelle 5, D-72076 Tübingen, Germany
| | - Kessy Abarenkov
- Institute of Ecology and Earth Sciences, University of Tartu, 51005 Tartu, Estonia
| | - Kai Riess
- Institute of Evolution and Ecology, Plant Evolutionary Ecology, University of Tübingen, Auf der Morgenstelle 5, D-72076 Tübingen, Germany
| | - Kare Liimatainen
- Department of Biosciences, Plant Biology, University of Helsinki, PO Box 65, 00014 Lahti, Finland
| | - Tuula Niskanen
- Jodrell Laboratory, Royal Botanic Gardens, Kew, TW9 3DS, UK
| | - Bálint Dima
- Department of Biosciences, Plant Biology, University of Helsinki, PO Box 65, 00014 Lahti, Finland
| | - Karl Soop
- Honorary Research Associate, Swedish Museum of Natural History, Department of Cryptogamic Botany, Naturhistoriska riksmuseet, 104 05 Stockholm, Sweden
| | - Tobias Guldberg Frøslev
- Natural History Museum of Denmark, Center for Geogenetics, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen K, Denmark
| | - Thomas Stjernegaard Jeppesen
- Natural History Museum of Denmark, Collections, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| | - Ursula Peintner
- Institute of Microbiology, University Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | | | - Tor Erik Brandrud
- Department of Landscape Ecology (Oslo), Norwegian Institute for Nature Research, N-Oslo 5, Norway
| | | | - Bernhard Oertel
- INRES, University of Bonn, Auf dem Hügel 6, D-53121 Bonn, Germany
| | - Joseph F Ammirati
- Department of Biology, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
25
|
Westcott SL, Schloss PD. De novo clustering methods outperform reference-based methods for assigning 16S rRNA gene sequences to operational taxonomic units. PeerJ 2015; 3:e1487. [PMID: 26664811 PMCID: PMC4675110 DOI: 10.7717/peerj.1487] [Citation(s) in RCA: 169] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Accepted: 11/19/2015] [Indexed: 12/13/2022] Open
Abstract
Background. 16S rRNA gene sequences are routinely assigned to operational taxonomic units (OTUs) that are then used to analyze complex microbial communities. A number of methods have been employed to carry out the assignment of 16S rRNA gene sequences to OTUs leading to confusion over which method is optimal. A recent study suggested that a clustering method should be selected based on its ability to generate stable OTU assignments that do not change as additional sequences are added to the dataset. In contrast, we contend that the quality of the OTU assignments, the ability of the method to properly represent the distances between the sequences, is more important. Methods. Our analysis implemented six de novo clustering algorithms including the single linkage, complete linkage, average linkage, abundance-based greedy clustering, distance-based greedy clustering, and Swarm and the open and closed-reference methods. Using two previously published datasets we used the Matthew's Correlation Coefficient (MCC) to assess the stability and quality of OTU assignments. Results. The stability of OTU assignments did not reflect the quality of the assignments. Depending on the dataset being analyzed, the average linkage and the distance and abundance-based greedy clustering methods generated OTUs that were more likely to represent the actual distances between sequences than the open and closed-reference methods. We also demonstrated that for the greedy algorithms VSEARCH produced assignments that were comparable to those produced by USEARCH making VSEARCH a viable free and open source alternative to USEARCH. Further interrogation of the reference-based methods indicated that when USEARCH or VSEARCH were used to identify the closest reference, the OTU assignments were sensitive to the order of the reference sequences because the reference sequences can be identical over the region being considered. More troubling was the observation that while both USEARCH and VSEARCH have a high level of sensitivity to detect reference sequences, the specificity of those matches was poor relative to the true best match. Discussion. Our analysis calls into question the quality and stability of OTU assignments generated by the open and closed-reference methods as implemented in current version of QIIME. This study demonstrates that de novo methods are the optimal method of assigning sequences into OTUs and that the quality of these assignments needs to be assessed for multiple methods to identify the optimal clustering method for a particular dataset.
Collapse
Affiliation(s)
- Sarah L. Westcott
- Department of Microbiology and Immunology, University of Michigan—Ann Arbor, Ann Arbor, MI, United States
| | - Patrick D. Schloss
- Department of Microbiology and Immunology, University of Michigan—Ann Arbor, Ann Arbor, MI, United States
| |
Collapse
|
26
|
Gignoux-Wolfsohn SA, Vollmer SV. Identification of Candidate Coral Pathogens on White Band Disease-Infected Staghorn Coral. PLoS One 2015; 10:e0134416. [PMID: 26241853 PMCID: PMC4524643 DOI: 10.1371/journal.pone.0134416] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2015] [Accepted: 07/08/2015] [Indexed: 02/01/2023] Open
Abstract
Bacterial diseases affecting scleractinian corals pose an enormous threat to the health of coral reefs, yet we still have a limited understanding of the bacteria associated with coral diseases. White band disease is a bacterial disease that affects the two Caribbean acroporid corals, the staghorn coral Acropora cervicornis and the elkhorn coral A. palmate. Species of Vibrio and Rickettsia have both been identified as putative WBD pathogens. Here we used Illumina 16S rRNA gene sequencing to profile the bacterial communities associated with healthy and diseased A. cervicornis collected from four field sites during two different years. We also exposed corals in tanks to diseased and healthy (control) homogenates to reduce some of the natural variation of field-collected coral bacterial communities. Using a combination of multivariate analyses, we identified community-level changes between diseased and healthy corals in both the field-collected and tank-exposed datasets. We then identified changes in the abundances of individual operational taxonomic units (OTUs) between diseased and healthy corals. By comparing the diseased and healthy-associated bacteria in field-collected and tank-exposed corals, we were able to identify 16 healthy-associated OTUs and 106 consistently disease-associated OTUs, which are good candidates for putative WBD pathogens. A large percentage of these disease-associated OTUs belonged to the order Flavobacteriales. In addition, two of the putative pathogens identified here belong to orders previously suggested as WBD pathogens: Vibronales and Rickettsiales.
Collapse
Affiliation(s)
- Sarah A. Gignoux-Wolfsohn
- Marine Science Center, Northeastern University, Nahant, Massachusetts, United States of America
- * E-mail:
| | - Steven V. Vollmer
- Marine Science Center, Northeastern University, Nahant, Massachusetts, United States of America
| |
Collapse
|
27
|
Flynn JM, Brown EA, Chain FJJ, MacIsaac HJ, Cristescu ME. Toward accurate molecular identification of species in complex environmental samples: testing the performance of sequence filtering and clustering methods. Ecol Evol 2015; 5:2252-66. [PMID: 26078860 PMCID: PMC4461425 DOI: 10.1002/ece3.1497] [Citation(s) in RCA: 89] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Revised: 03/05/2015] [Accepted: 03/10/2015] [Indexed: 11/05/2022] Open
Abstract
Metabarcoding has the potential to become a rapid, sensitive, and effective approach for identifying species in complex environmental samples. Accurate molecular identification of species depends on the ability to generate operational taxonomic units (OTUs) that correspond to biological species. Due to the sometimes enormous estimates of biodiversity using this method, there is a great need to test the efficacy of data analysis methods used to derive OTUs. Here, we evaluate the performance of various methods for clustering length variable 18S amplicons from complex samples into OTUs using a mock community and a natural community of zooplankton species. We compare analytic procedures consisting of a combination of (1) stringent and relaxed data filtering, (2) singleton sequences included and removed, (3) three commonly used clustering algorithms (mothur, UCLUST, and UPARSE), and (4) three methods of treating alignment gaps when calculating sequence divergence. Depending on the combination of methods used, the number of OTUs varied by nearly two orders of magnitude for the mock community (60–5068 OTUs) and three orders of magnitude for the natural community (22–22191 OTUs). The use of relaxed filtering and the inclusion of singletons greatly inflated OTU numbers without increasing the ability to recover species. Our results also suggest that the method used to treat gaps when calculating sequence divergence can have a great impact on the number of OTUs. Our findings are particularly relevant to studies that cover taxonomically diverse species and employ markers such as rRNA genes in which length variation is extensive.
Collapse
Affiliation(s)
- Jullien M Flynn
- Department of Biology, McGill University 1205 Docteur Penfield, Stewart Biology Building, Montreal, Quebec, Canada, H3A 1B1
| | - Emily A Brown
- Department of Biology, McGill University 1205 Docteur Penfield, Stewart Biology Building, Montreal, Quebec, Canada, H3A 1B1 ; Great Lakes Institute for Environmental Research, University of Windsor Windsor, Ontario, Canada
| | - Frédéric J J Chain
- Department of Biology, McGill University 1205 Docteur Penfield, Stewart Biology Building, Montreal, Quebec, Canada, H3A 1B1
| | - Hugh J MacIsaac
- Great Lakes Institute for Environmental Research, University of Windsor Windsor, Ontario, Canada
| | - Melania E Cristescu
- Department of Biology, McGill University 1205 Docteur Penfield, Stewart Biology Building, Montreal, Quebec, Canada, H3A 1B1
| |
Collapse
|
28
|
Morrison SS, Pyzh R, Jeon MS, Amaro C, Roig FJ, Baker-Austin C, Oliver JD, Gibas CJ. Impact of analytic provenance in genome analysis. BMC Genomics 2014; 15 Suppl 8:S1. [PMID: 25435180 PMCID: PMC4248810 DOI: 10.1186/1471-2164-15-s8-s1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Background Many computational methods are available for assembly and annotation of newly sequenced microbial genomes. However, when new genomes are reported in the literature, there is frequently very little critical analysis of choices made during the sequence assembly and gene annotation stages. These choices have a direct impact on the biologically relevant products of a genomic analysis - for instance identification of common and differentiating regions among genomes in a comparison, or identification of enriched gene functional categories in a specific strain. Here, we examine the outcomes of different assembly and analysis steps in typical workflows in a comparison among strains of Vibrio vulnificus. Results Using six recently sequenced strains of V. vulnificus, we demonstrate the "alternate realities" of comparative genomics, and how they depend on the choice of a robust assembly method and accurate ab initio annotation. We apply several popular assemblers for paired-end Illumina data, and three well-regarded ab initio genefinders. We demonstrate significant differences in detected gene overlap among comparative genomics workflows that depend on these two steps. The divergence between workflows, even those using widely adopted methods, is obvious both at the single genome level and when a comparison is performed. In a typical example where multiple workflows are applied to the strain V. vulnificus CECT 4606, a workflow that uses the Velvet assembler and Glimmer gene finder identifies 3275 gene features, while a workflow that uses the Velvet assembler and the RAST annotation system identifies 5011 gene features. Only 3171 genes are identical between both workflows. When we examine 9 assembly/ annotation workflow scenarios as input to a three-way genome comparison, differentiating genes and even differentially represented functional categories change significantly from scenario to scenario. Conclusions Inconsistencies in genomic analysis can arise depending on the choices that are made during the assembly and annotation stages. These inconsistencies can have a significant impact on the interpretation of an individual genome's content. The impact is multiplied when comparison of content and function among multiple genomes is the goal. Tracking the analysis history of the data - its analytic provenance - is critical for reproducible analysis of genome data.
Collapse
|
29
|
Schmidt TSB, Matias Rodrigues JF, von Mering C. Limits to robustness and reproducibility in the demarcation of operational taxonomic units. Environ Microbiol 2014; 17:1689-706. [PMID: 25156547 DOI: 10.1111/1462-2920.12610] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Accepted: 08/21/2014] [Indexed: 11/27/2022]
Abstract
The demarcation of operational taxonomic units (OTUs) from complex sequence data sets is a key step in contemporary studies of microbial ecology. However, as biologically motivated 'optimal' OTU-binning algorithms remain elusive, many conceptually distinct approaches continue to be used. Using a global data set of 887 870 bacterial 16S rRNA gene sequences, we objectively quantified biases introduced by several widely employed sequence clustering algorithms. We found that OTU-binning methods often provided surprisingly non-equivalent partitions of identical data sets, notably when clustering to the same nominal similarity thresholds; and we quantified the resulting impact on ecological data description for a well-defined human skin microbiome data set. We observed that some methods were very robust to varying clustering thresholds, while others were found to be highly susceptible even to slight threshold variations. Moreover, we comprehensively quantified the impact of the choice of 16S rRNA gene subregion, as well as of data set scope and context on algorithm performance. Our findings may contribute to an enhanced comparability of results across sequence-processing pipelines, and we arrive at recommendations towards higher levels of standardization in established workflows.
Collapse
Affiliation(s)
- Thomas S B Schmidt
- Institute for Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Winterthurerstrasse 190, Zürich, 8057, Switzerland
| | | | | |
Collapse
|
30
|
Mondy S, Lenglet A, Beury-Cirou A, Libanga C, Ratet P, Faure D, Dessaux Y. An increasing opine carbon bias in artificial exudation systems and genetically modified plant rhizospheres leads to an increasing reshaping of bacterial populations. Mol Ecol 2014; 23:4846-61. [DOI: 10.1111/mec.12890] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Revised: 08/12/2014] [Accepted: 08/15/2014] [Indexed: 11/29/2022]
Affiliation(s)
- Samuel Mondy
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| | - Aurore Lenglet
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| | - Amelie Beury-Cirou
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| | - Celestin Libanga
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| | - Pascal Ratet
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| | - Denis Faure
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| | - Yves Dessaux
- Institut des Sciences du Végétal (ISV); UPR2355, CNRS, Saclay Plant Sciences; Avenue de la Terrasse, Gif-sur-Yvette 91198 France
| |
Collapse
|
31
|
Kim M, Wang L, Morrison M, Yu Z. Development of a phylogenetic microarray for comprehensive analysis of ruminal bacterial communities. J Appl Microbiol 2014; 117:949-60. [DOI: 10.1111/jam.12598] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Revised: 06/22/2014] [Accepted: 07/12/2014] [Indexed: 12/14/2022]
Affiliation(s)
- M. Kim
- Department of Animal Sciences; The Ohio State University; Columbus OH USA
| | - L. Wang
- Department of Animal Sciences; The Ohio State University; Columbus OH USA
| | - M. Morrison
- Department of Animal Sciences; The Ohio State University; Columbus OH USA
- University of Queensland Diamantina Institute; Woolloongabba Qld Australia
| | - Z. Yu
- Department of Animal Sciences; The Ohio State University; Columbus OH USA
| |
Collapse
|
32
|
Kim M, Yu Z. Variations in 16S rRNA-based microbiome profiling between pyrosequencing runs and between pyrosequencing facilities. J Microbiol 2014; 52:355-65. [PMID: 24723104 DOI: 10.1007/s12275-014-3443-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Revised: 11/15/2013] [Accepted: 11/25/2013] [Indexed: 12/13/2022]
Abstract
Pyrosequencing of 16S rRNA gene amplicons on the 454 FLX Titanium platform has been widely used to analyze microbiomes in various environments. However, different results may stem from variations among sequencing runs or among sequencing facilities. This study aimed to evaluate these variations between different pyrosequencing runs by sequencing 16S rRNA gene amplicon libraries generated from three sets of rumen samples twice each on the 454 FLX Titanium system at two independent sequencing facilities. Similar relative abundances were found for predominant taxa represented by large numbers of sequence reads but not for minor taxa represented by small numbers of sequence reads. The two sequencing facilities revealed different bacterial profiles with respect to both predominant taxa and minor taxa, including the most predominant genus Prevotella, the family Lachnospiraceae, and the phylum Proteobacteria. Differences in primers used to generate amplicon libraries may be a major source of variations in microbiome profiling. Because different primers and regions of 16S rRNA genes are often used by different researchers, significant variations likely exist among studies. Quantitative interpretation for relative abundance of taxa, especially minor taxa, from prevalence of sequence reads and comparisons of results from different studies should be done with caution.
Collapse
Affiliation(s)
- Minseok Kim
- Department of Animal Sciences, The Ohio State University, Columbus, OH, USA
| | | |
Collapse
|
33
|
Comparative analysis of microbiome between accurately identified 16S rDNA and quantified bacteria in simulated samples. J Med Microbiol 2014; 63:433-440. [DOI: 10.1099/jmm.0.060616-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although 16S rRNA gene (rDNA) sequencing is the gold standard for categorizing bacteria or characterizing microbial communities its clinical utility is limited by bias in metagenomic studies, in either the experiments or the data analyses. To evaluate the efficiency of current metagenomic methods, we sequenced seven simulated samples of ten bacterial species mixed at different concentrations. The V3 region of 16S rDNA was targeted and used to determine the distribution of bacterial species. The number of target sequences in individual simulated samples was in the range 1–1000 to provide a better reflection of natural microbial communities. However, for a given bacterial species present in the same proportion but at different concentrations, the observed percentage of 16S rDNAs was similar, except at very low concentrations that cannot be detected by real-time PCR. These results confirmed that the comparative microbiome in a sample characterized by 16S rDNA sequencing is sufficient to detect not only potential infectious pathogens, but also the relative proportion of 16S rDNA in the sample.
Collapse
|
34
|
Yang I, Nell S, Suerbaum S. Survival in hostile territory: the microbiota of the stomach. FEMS Microbiol Rev 2013; 37:736-61. [DOI: 10.1111/1574-6976.12027] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Revised: 05/28/2013] [Accepted: 06/07/2013] [Indexed: 02/06/2023] Open
|
35
|
MSClust: A Multi-Seeds based Clustering algorithm for microbiome profiling using 16S rRNA sequence. J Microbiol Methods 2013; 94:347-55. [PMID: 23899776 DOI: 10.1016/j.mimet.2013.07.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Revised: 07/06/2013] [Accepted: 07/07/2013] [Indexed: 11/21/2022]
Abstract
Recent developments of next generation sequencing technologies have led to rapid accumulation of 16S rRNA sequences for microbiome profiling. One key step in data processing is to cluster short sequences into operational taxonomic units (OTUs). Although many methods have been proposed for OTU inferences, a major challenge is the balance between inference accuracy and computational efficiency, where inference accuracy is often sacrificed to accommodate the need to analyze large numbers of sequences. Inspired by the hierarchical clustering method and a modified greedy network clustering algorithm, we propose a novel multi-seeds based heuristic clustering method, named MSClust, for OTU inference. MSClust first adaptively selects multi-seeds instead of one seed for each candidate cluster, and the reads are then processed using a greedy clustering strategy. Through many numerical examples, we demonstrate that MSClust enjoys less memory usage, and better biological accuracy compared to existing heuristic clustering methods while preserving efficiency and scalability.
Collapse
|
36
|
Valverde JR, Mellado RP. Analysis of metagenomic data containing high biodiversity levels. PLoS One 2013; 8:e58118. [PMID: 23505458 PMCID: PMC3591453 DOI: 10.1371/journal.pone.0058118] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2012] [Accepted: 02/03/2013] [Indexed: 11/18/2022] Open
Abstract
In this paper we have addressed the problem of analysing Next Generation Sequencing samples with an expected large biodiversity content. We analysed several well-known 16S rRNA datasets from experimental samples, including both large and short sequences, in numbers of tens of thousands, in addition to carefully crafted synthetic datasets containing more than 7000 OTUs. From this data analysis several patterns were identified and used to develop new guidelines for experimentation in conditions of high biodiversity. We analysed the suitability of different clustering packages for these type of situations, the problem of even sampling, the relative effectiveness of Chao1 and ACE estimators as well as their effect on sampling size for a variety of population distributions. As regards practical analysis procedures, we advocated an approach that retains as much high-quality experimental data as possible. By carefully applying selection rules combining the taxonomic assignment with clustering strategies, we derived a set of recommendations for ultra-sequencing data analysis at high biodiversity levels.
Collapse
Affiliation(s)
- José R. Valverde
- Scientific Computing Service, Centro Nacional de Biotecnología (CSIC), c/Darwin, 3, Madrid, Spain
| | - Rafael P. Mellado
- Department of Microbial Biotechnology, Centro Nacional de Biotecnología (CSIC), c/Darwin, 3, Madrid, Spain
- * E-mail:
| |
Collapse
|
37
|
Can abundance of protists be inferred from sequence data: a case study of foraminifera. PLoS One 2013; 8:e56739. [PMID: 23431390 PMCID: PMC3576339 DOI: 10.1371/journal.pone.0056739] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2012] [Accepted: 01/14/2013] [Indexed: 12/03/2022] Open
Abstract
Protists are key players in microbial communities, yet our understanding of their role in ecosystem functioning is seriously impeded by difficulties in identification of protistan species and their quantification. Current microscopy-based methods used for determining the abundance of protists are tedious and often show a low taxonomic resolution. Recent development of next-generation sequencing technologies offered a very powerful tool for studying the richness of protistan communities. Still, the relationship between abundance of species and number of sequences remains subjected to various technical and biological biases. Here, we test the impact of some of these biological biases on sequence abundance of SSU rRNA gene in foraminifera. First, we quantified the rDNA copy number and rRNA expression level of three species of foraminifera by qPCR. Then, we prepared five mock communities with these species, two in equal proportions and three with one species ten times more abundant. The libraries of rDNA and cDNA of the mock communities were constructed, Sanger sequenced and the sequence abundance was calculated. The initial species proportions were compared to the raw sequence proportions as well as to the sequence abundance normalized by rDNA copy number and rRNA expression level per species. Our results showed that without normalization, all sequence data differed significantly from the initial proportions. After normalization, the congruence between the number of sequences and number of specimens was much better. We conclude that without normalization, species abundance determination based on sequence data was not possible because of the effect of biological biases. Nevertheless, by taking into account the variation of rDNA copy number and rRNA expression level we were able to infer species abundance, suggesting that our approach can be successful in controlled conditions.
Collapse
|
38
|
Dave M, Higgins PD, Middha S, Rioux KP. The human gut microbiome: current knowledge, challenges, and future directions. Transl Res 2012; 160:246-57. [PMID: 22683238 DOI: 10.1016/j.trsl.2012.05.003] [Citation(s) in RCA: 176] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Revised: 05/08/2012] [Accepted: 05/08/2012] [Indexed: 12/14/2022]
Abstract
The Human Genome Project was completed a decade ago, leaving a legacy of process, tools, and infrastructure now being turned to the study of the microbes that reside in and on the human body as determinants of health and disease, and has been branded "The Human Microbiome Project." Of the various niches under investigation, the human gut houses the most complex and abundant microbial community and is an arena for important host-microbial interactions that have both local and systemic impact. Initial studies of the human microbiome have been largely descriptive, a testing ground for innovative molecular techniques and new hypotheses. Methods for studying the microbiome have quickly evolved from low-resolution surveys of microbial community structure to high-definition description of composition, function, and ecology. Next-generation sequencing technologies combined with advanced bioinformatics place us at the doorstep of revolutionary insight into the composition, capability, and activity of the human intestinal microbiome. Renewed efforts to cultivate previously "uncultivable" microbes will be important to the overall understanding of gut ecology. There remain numerous methodological challenges to the effective study and understanding of the gut microbiome, largely relating to study design, sample collection, and the number of predictor variables. Strategic collaboration of clinicians, microbiologists, molecular biologists, computational scientists, and bioinformaticians is the ideal paradigm for success in this field. Meaningful interpretation of the gut microbiome requires that host genetic and environmental influences be controlled or accounted for. Understanding the gut microbiome in healthy humans is a foundation for discovering its influence in various important gastrointestinal and nutritional diseases (eg, inflammatory bowel disease, diabetes, and obesity), and for rational translation to human health gains.
Collapse
Affiliation(s)
- Maneesh Dave
- Division of Gastroenterology and Hepatology, Mayo Clinic, Rochester, MN, USA
| | | | | | | |
Collapse
|
39
|
Barriuso J, Valverde JR, Mellado RP. Effect of Cry1Ab protein on rhizobacterial communities of Bt-maize over a four-year cultivation period. PLoS One 2012; 7:e35481. [PMID: 22558158 PMCID: PMC3340378 DOI: 10.1371/journal.pone.0035481] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2012] [Accepted: 03/16/2012] [Indexed: 11/29/2022] Open
Abstract
Background Bt-maize is a transgenic variety of maize expressing the Cry toxin from Bacillus turingiensis. The potential accumulation of the relative effect of the transgenic modification and the cry toxin on the rhizobacterial communities of Bt-maize has been monitored over a period of four years. Methodology/Principal Findings The accumulative effects of the cultivation of this transgenic plant have been monitored by means of high throughput DNA pyrosequencing of the bacterial DNA coding for the 16S rRNA hypervariable V6 region from rhizobacterial communities. The obtained sequences were subjected to taxonomic, phylogenetic and taxonomic-independent diversity studies. The results obtained were consistent, indicating that variations detected in the rhizobacterial community structure were possibly due to climatic factors rather than to the presence of the Bt-gene. No variations were observed in the diversity estimates between non-Bt and Bt-maize. Conclusions/Significance The cultivation of Bt-maize during the four-year period did not change the maize rhizobacterial communities when compared to those of the non-Bt maize. This is the first study to be conducted with Bt-maize during such a long cultivation period and the first evaluation of rhizobacterial communities to be performed in this transgenic plant using Next Generation Sequencing.
Collapse
Affiliation(s)
- Jorge Barriuso
- Centro Nacional de Biotecnología (CSIC), Campus de la Universidad Autónoma, Cantoblanco, Madrid, Spain
| | - José R. Valverde
- Centro Nacional de Biotecnología (CSIC), Campus de la Universidad Autónoma, Cantoblanco, Madrid, Spain
| | - Rafael P. Mellado
- Centro Nacional de Biotecnología (CSIC), Campus de la Universidad Autónoma, Cantoblanco, Madrid, Spain
- * E-mail:
| |
Collapse
|