1
|
Nikodemova M, Holzhausen EA, Deblois CL, Barnet JH, Peppard PE, Suen G, Malecki KM. The effect of low-abundance OTU filtering methods on the reliability and variability of microbial composition assessed by 16S rRNA amplicon sequencing. Front Cell Infect Microbiol 2023; 13:1165295. [PMID: 37377642 PMCID: PMC10291178 DOI: 10.3389/fcimb.2023.1165295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 05/18/2023] [Indexed: 06/29/2023] Open
Abstract
PCR amplicon sequencing may lead to detection of spurious operational taxonomic units (OTUs), inflating estimates of gut microbial diversity. There is no consensus in the analytical approach as to what filtering methods should be applied to remove low-abundance OTUs; moreover, few studies have investigated the reliability of OTU detection within replicates. Here, we investigated the reliability of OTU detection (% agreement in detecting OTU in triplicates) and accuracy of their quantification (assessed by coefficient of variation (CV)) in human stool specimens. Stool samples were collected from 12 participants 22-55 years old. We applied several methods for filtering low-abundance OTUs and determined their impact on alpha-diversity and beta-diversity metrics. The reliability of OTU detection without any filtering was only 44.1% (SE=0.9) but increased after filtering low-abundance OTUs. After filtering OTUs with <0.1% abundance in the dataset, the reliability increased to 87.7% (SE=0.6) but at the expense of removing 6.97% reads from the dataset. When filtering was based on individual sample, the reliability increased to 73.1% after filtering OTUs with <10 copies while removing only 1.12% of reads. High abundance OTUs (>10 copies in sample) had lower CV, indicating better accuracy of quantification than low-abundance OTUs. Excluding very low-abundance OTUs had a significant impact on alpha-diversity metrics sensitive to the presence of rare species (observed OTUs, Chao1) but had little impact on relative abundance of major phyla and families and alpha-diversity metrics accounting for both richness and evenness (Shannon, Inverse Simpson). To increase the reliability of microbial composition, we advise removing OTUs with <10 copies in individual samples, particularly in studies where only one subsample per specimen is available for analysis.
Collapse
Affiliation(s)
- Maria Nikodemova
- Population Health Sciences, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States
- Department of Physical Therapy, University of Florida, Gainesville, FL, United States
| | - Elizabeth A. Holzhausen
- Population Health Sciences, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States
- Department of Integrative Physiology, University of Colorado Boulder, Boulder, CO, United States
| | - Courtney L. Deblois
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, United States
- Microbiology Doctoral Training Program, University of Wisconsin-Madison, Madison, WI, United States
| | - Jodi H. Barnet
- Population Health Sciences, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States
| | - Paul E. Peppard
- Population Health Sciences, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States
| | - Garret Suen
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, United States
| | - Kristen M. Malecki
- Population Health Sciences, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States
- Division of Environmental and Occupational Health Sciences, School of Public Health, University of Illinois at Chicago, Chicago, IL, United States
| |
Collapse
|
2
|
Yorki S, Shea T, Cuomo CA, Walker BJ, LaRocque RC, Manson AL, Earl AM, Worby CJ. Comparison of long- and short-read metagenomic assembly for low-abundance species and resistance genes. Brief Bioinform 2023; 24:bbad050. [PMID: 36804804 PMCID: PMC10025444 DOI: 10.1093/bib/bbad050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/13/2023] [Accepted: 01/26/2023] [Indexed: 02/23/2023] Open
Abstract
Recent technological and computational advances have made metagenomic assembly a viable approach to achieving high-resolution views of complex microbial communities. In previous benchmarking, short-read (SR) metagenomic assemblers had the highest accuracy, long-read (LR) assemblers generated the most contiguous sequences and hybrid (HY) assemblers balanced length and accuracy. However, no assessments have specifically compared the performance of these assemblers on low-abundance species, which include clinically relevant organisms in the gut. We generated semi-synthetic LR and SR datasets by spiking small and increasing amounts of Escherichia coli isolate reads into fecal metagenomes and, using different assemblers, examined E. coli contigs and the presence of antibiotic resistance genes (ARGs). For ARG assembly, although SR assemblers recovered more ARGs with high accuracy, even at low coverages, LR assemblies allowed for the placement of ARGs within longer, E. coli-specific contigs, thus pinpointing their taxonomic origin. HY assemblies identified resistance genes with high accuracy and had lower contiguity than LR assemblies. Each assembler type's strengths were maintained even when our isolate was spiked in with a competing strain, which fragmented and reduced the accuracy of all assemblies. For strain characterization and determining gene context, LR assembly is optimal, while for base-accurate gene identification, SR assemblers outperform other options. HY assembly offers contiguity and base accuracy, but requires generating data on multiple platforms, and may suffer high misassembly rates when strain diversity exists. Our results highlight the trade-offs associated with each approach for recovering low-abundance taxa, and that the optimal approach is goal-dependent.
Collapse
Affiliation(s)
- Sosie Yorki
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Terrance Shea
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Christina A Cuomo
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bruce J Walker
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Applied Invention, LLC, Cambridge, MA, USA
| | - Regina C LaRocque
- Division of Infectious Diseases, Massachusetts General Hospital, Boston, MA, USA
| | - Abigail L Manson
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ashlee M Earl
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Colin J Worby
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
3
|
Nyffeler M, Bonte D. Where Have All the Spiders Gone? Observations of a Dramatic Population Density Decline in the Once Very Abundant Garden Spider, Araneus diadematus (Araneae: Araneidae), in the Swiss Midland. Insects 2020; 11:insects11040248. [PMID: 32326490 PMCID: PMC7240396 DOI: 10.3390/insects11040248] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Revised: 04/09/2020] [Accepted: 04/10/2020] [Indexed: 12/02/2022]
Abstract
Aerial web-spinning spiders (including large orb-weavers), as a group, depend almost entirely on flying insects as a food source. The recent widespread loss of flying insects across large parts of western Europe, in terms of both diversity and biomass, can therefore be anticipated to have a drastic negative impact on the survival and abundance of this type of spider. To test the putative importance of such a hitherto neglected trophic cascade, a survey of population densities of the European garden spider Araneus diadematus—a large orb-weaving species—was conducted in the late summer of 2019 at twenty sites in the Swiss midland. The data from this survey were compared with published population densities for this species from the previous century. The study verified the above-mentioned hypothesis that this spider’s present-day overall mean population density has declined alarmingly to densities much lower than can be expected from normal population fluctuations (0.7% of the historical values). Review of other available records suggested that this pattern is widespread and not restricted to this region. In conclusion, the decline of this once so abundant spider in the Swiss midland is evidently revealing a bottom-up trophic cascade in response to the widespread loss of flying insect prey in recent decades.
Collapse
Affiliation(s)
- Martin Nyffeler
- Department of Environmental Sciences, Section of Conservation Biology, University of Basel, CH–4056 Basel, Switzerland
- Correspondence:
| | - Dries Bonte
- Department of Biology, Terrestrial Ecology Unit, Ghent University, 9000 Ghent, Belgium;
| |
Collapse
|
4
|
Abstract
INTRODUCTION Mass spectrometry (MS) is the premier tool for discovering novel disease-associated protein biomarkers. Unfortunately, when applied to complex body fluid samples, MS has poor sensitivity for the detection of low abundance biomarkers (≪10 ng/mL), derived directly from the diseased tissue cells or pathogens. Areas covered: Herein we discuss the strengths and drawbacks of technologies used to concentrate low abundance analytes in body fluids, with the aim to improve the effective sensitivity for MS discovery. Solvent removal by dry-down or dialysis, and immune-depletion of high abundance serum or plasma proteins, is shown to have disadvantages compared to positive selection of the candidate biomarkers by affinity enrichment. A theoretical analysis of affinity enrichment reveals that the yield for low abundance biomarkers is a direct function of the binding affinity (Association/Dissociation rates) used for biomarker capture. In addition, a high affinity capture pre processing step can effectively dissociate the candidate biomarker from partitioning with high abundance proteins such as albumin. Expert commentary: Properly designed high affinity capture materials can enrich the yield of low abundance (0.1-10 picograms/mL) candidate biomarkers for MS detection. Affinity capture and concentration, as an upfront step in sample preparation for MS, combined with MS advances in software and hardware that improve the resolution of the chromatographic separation can yield a transformative new class of low abundance biomarkers predicting disease risk or disease latency.
Collapse
Affiliation(s)
| | - Robyn Araujo
- b School of Mathematical Sciences , Queensland University of Technology , Brisbane , Australia
| | - Marissa Howard
- c Center for Applied Proteomics and Molecular Medicine , George Mason University , Manassas , VA , USA
| | - Ruben Magni
- c Center for Applied Proteomics and Molecular Medicine , George Mason University , Manassas , VA , USA
| | - Lance A Liotta
- c Center for Applied Proteomics and Molecular Medicine , George Mason University , Manassas , VA , USA
| | - Alessandra Luchini
- c Center for Applied Proteomics and Molecular Medicine , George Mason University , Manassas , VA , USA
| |
Collapse
|
5
|
Li H, Li T, Yao M, Li J, Zhang S, Wirth S, Cao W, Lin Q, Li X. Pika Gut May Select for Rare but Diverse Environmental Bacteria. Front Microbiol 2016; 7:1269. [PMID: 27582734 PMCID: PMC4987353 DOI: 10.3389/fmicb.2016.01269] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Accepted: 08/02/2016] [Indexed: 02/01/2023] Open
Abstract
The composition of the mammalian gut bacterial communities can be influenced by the introduction of environmental bacteria in their respective habitats. However, there are no extensive studies examining the interactions between environmental bacteriome and gut bacteriome in wild mammals. Here, we explored the relationship between the gut bacterial communities of pika (Ochotona spp.) and the related environmental bacteria across host species and altitudinal sites using 16S rRNA gene sequencing. Plateau pikas (O. curzoniae) and Daurian pikas (O. daurica) were sampled at five different sites, and plant and soil samples were collected at each site as well. Our data indicated that Plateau pikas and Daurian pikas had distinct bacterial communities. The pika, plant and soil bacterial communities were also distinct. Very little overlap occurred in the pika core bacteria and the most abundant environmental bacteria. The shared OTUs between pikas and environments were present in the environment at relatively low abundance, whereas they were affiliated with diverse bacterial taxa. These results suggested that the pika gut may mainly select for low-abundance but diverse environmental bacteria in a host species-specific manner.
Collapse
Affiliation(s)
- Huan Li
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of SciencesSichuan, China; University of Chinese Academy of SciencesBeijing, China
| | - Tongtong Li
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences Sichuan, China
| | - Minjie Yao
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences Sichuan, China
| | - Jiabao Li
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences Sichuan, China
| | - Shiheng Zhang
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences Sichuan, China
| | - Stephan Wirth
- Leibniz-Center for Agricultural Landscape Research (ZALF), Institute of Landscape Biogeochemistry Müncheberg, Germany
| | - Weidong Cao
- Key Laboratory of Plant Nutrition and Fertilizer, Ministry of Agriculture/Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural SciencesBeijing, China; Soil and Fertilizer Institute, Qinghai Academy of Agriculture and Forestry Sciences, Qinghai UniversityXining, China
| | - Qiang Lin
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences Sichuan, China
| | - Xiangzhen Li
- Key Laboratory of Environmental and Applied Microbiology, Environmental Microbiology Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences Sichuan, China
| |
Collapse
|
6
|
Liu T, Qian WJ, Chen WNU, Jacobs JM, Moore RJ, Anderson DJ, Gritsenko MA, Monroe ME, Thrall BD, Camp DG, Smith RD. Improved proteome coverage by using high efficiency cysteinyl peptide enrichment: the human mammary epithelial cell proteome. Proteomics 2005; 5:1263-73. [PMID: 15742320 PMCID: PMC1769322 DOI: 10.1002/pmic.200401055] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Automated multidimensional capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS) has been increasingly applied in various large scale proteome profiling efforts. However, comprehensive global proteome analysis remains technically challenging due to issues associated with sample complexity and dynamic range of protein abundances, which is particularly apparent in mammalian biological systems. We report here the application of a high efficiency cysteinyl peptide enrichment (CPE) approach to the global proteome analysis of human mammary epithelial cells (HMECs) which significantly improved both sequence coverage of protein identifications and the overall proteome coverage. The cysteinyl peptides were specifically enriched by using a thiol-specific covalent resin, fractionated by strong cation exchange chromatography, and subsequently analyzed by reversed-phase capillary LC-MS/MS. An HMEC tryptic digest without CPE was also fractionated and analyzed under the same conditions for comparison. The combined analyses of HMEC tryptic digests with and without CPE resulted in a total of 14 416 confidently identified peptides covering 4294 different proteins with an estimated 10% gene coverage of the human genome. By using the high efficiency CPE, an additional 1096 relatively low abundance proteins were identified, resulting in 34.3% increase in proteome coverage; 1390 proteins were observed with increased sequence coverage. Comparative protein distribution analyses revealed that the CPE method is not biased with regard to protein M(r) , pI, cellular location, or biological functions. These results demonstrate that the use of the CPE approach provides improved efficiency in comprehensive proteome-wide analyses of highly complex mammalian biological systems.
Collapse
Affiliation(s)
- Tao Liu
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Wei-Jun Qian
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Wan-Nan U. Chen
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Jon M. Jacobs
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Ronald J. Moore
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - David J. Anderson
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Marina A. Gritsenko
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Matthew E. Monroe
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Brian D. Thrall
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - David G. Camp
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Richard D. Smith
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| |
Collapse
|