51
|
Mullis MM, Rambo IM, Baker BJ, Reese BK. Diversity, Ecology, and Prevalence of Antimicrobials in Nature. Front Microbiol 2019; 10:2518. [PMID: 31803148 PMCID: PMC6869823 DOI: 10.3389/fmicb.2019.02518] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 10/18/2019] [Indexed: 12/15/2022] Open
Abstract
Microorganisms possess a variety of survival mechanisms, including the production of antimicrobials that function to kill and/or inhibit the growth of competing microorganisms. Studies of antimicrobial production have largely been driven by the medical community in response to the rise in antibiotic-resistant microorganisms and have involved isolated pure cultures under artificial laboratory conditions neglecting the important ecological roles of these compounds. The search for new natural products has extended to biofilms, soil, oceans, coral reefs, and shallow coastal sediments; however, the marine deep subsurface biosphere may be an untapped repository for novel antimicrobial discovery. Uniquely, prokaryotic survival in energy-limited extreme environments force microbial populations to either adapt their metabolism to outcompete or produce novel antimicrobials that inhibit competition. For example, subsurface sediments could yield novel antimicrobial genes, while at the same time answering important ecological questions about the microbial community.
Collapse
Affiliation(s)
- Megan M. Mullis
- Department of Life Sciences, Texas A&M University Corpus Christi, Corpus Christi, TX, United States
| | - Ian M. Rambo
- Department of Marine Science, University of Texas Marine Science Institute, Port Aransas, TX, United States
| | - Brett J. Baker
- Department of Marine Science, University of Texas Marine Science Institute, Port Aransas, TX, United States
| | - Brandi Kiel Reese
- Department of Life Sciences, Texas A&M University Corpus Christi, Corpus Christi, TX, United States
| |
Collapse
|
52
|
|
53
|
GAAP: A Genome Assembly + Annotation Pipeline. BIOMED RESEARCH INTERNATIONAL 2019; 2019:4767354. [PMID: 31346518 PMCID: PMC6617929 DOI: 10.1155/2019/4767354] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 05/20/2019] [Accepted: 05/26/2019] [Indexed: 12/24/2022]
Abstract
Genomic analysis begins with de novo assembly of short-read fragments in order to reconstruct full-length base sequences without exploiting a reference genome sequence. Then, in the annotation step, gene locations are identified within the base sequences, and the structures and functions of these genes are determined. Recently, a wide range of powerful tools have been developed and published for whole-genome analysis, enabling even individual researchers in small laboratories to perform whole-genome analyses on their objects of interest. However, these analytical tools are generally complex and use diverse algorithms, parameter setting methods, and input formats; thus, it remains difficult for individual researchers to select, utilize, and combine these tools to obtain their final results. To resolve these issues, we have developed a genome analysis pipeline (GAAP) for semiautomated, iterative, and high-throughput analysis of whole-genome data. This pipeline is designed to perform read correction, de novo genome (transcriptome) assembly, gene prediction, and functional annotation using a range of proven tools and databases. We aim to assist non-IT researchers by describing each stage of analysis in detail and discussing current approaches. We also provide practical advice on how to access and use the bioinformatics tools and databases and how to implement the provided suggestions. Whole-genome analysis of Toxocara canis is used as case study to show intermediate results at each stage, demonstrating the practicality of the proposed method.
Collapse
|
54
|
Carleton HA, Besser J, Williams-Newkirk AJ, Huang A, Trees E, Gerner-Smidt P. Metagenomic Approaches for Public Health Surveillance of Foodborne Infections: Opportunities and Challenges. Foodborne Pathog Dis 2019; 16:474-479. [PMID: 31170005 PMCID: PMC6653786 DOI: 10.1089/fpd.2019.2636] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Foodborne disease surveillance in the United States is at a critical point. Clinical and diagnostic laboratories are using culture-independent diagnostic tests (CIDTs) to identify the pathogen causing foodborne illness from patient specimens. CIDTs are molecular tests that allow doctors to rapidly identify the bacteria causing illness within hours. CIDTs, unlike previous gold standard methods such as bacterial culture, do not produce an isolate that can be subtyped as part of the national molecular subtyping network for foodborne disease surveillance, PulseNet. Without subtype information, cases can no longer be linked using molecular data to identify potentially related cases that are part of an outbreak. In this review, we discuss the public health needs for a molecular subtyping approach directly from patient specimen and highlight different approaches, including amplicon and shotgun metagenomic sequencing.
Collapse
Affiliation(s)
- Heather A Carleton
- Enteric Diseases Laboratory Branch, Division of Foodborne, Waterborne and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
| | - John Besser
- Enteric Diseases Laboratory Branch, Division of Foodborne, Waterborne and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
| | - Amanda J Williams-Newkirk
- Enteric Diseases Laboratory Branch, Division of Foodborne, Waterborne and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
| | - Andrew Huang
- Enteric Diseases Laboratory Branch, Division of Foodborne, Waterborne and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
| | - Eija Trees
- Enteric Diseases Laboratory Branch, Division of Foodborne, Waterborne and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
| | - Peter Gerner-Smidt
- Enteric Diseases Laboratory Branch, Division of Foodborne, Waterborne and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
| |
Collapse
|
55
|
Nelkner J, Henke C, Lin TW, Pätzold W, Hassa J, Jaenicke S, Grosch R, Pühler A, Sczyrba A, Schlüter A. Effect of Long-Term Farming Practices on Agricultural Soil Microbiome Members Represented by Metagenomically Assembled Genomes (MAGs) and Their Predicted Plant-Beneficial Genes. Genes (Basel) 2019; 10:E424. [PMID: 31163637 PMCID: PMC6627896 DOI: 10.3390/genes10060424] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/23/2019] [Accepted: 05/24/2019] [Indexed: 11/24/2022] Open
Abstract
To follow the hypothesis that agricultural management practices affect structure and function of the soil microbiome regarding soil health and plant-beneficial traits, high-throughput (HT) metagenome analyses were performed on Chernozem soil samples from a long-term field experiment designated LTE-1 carried out at Bernburg-Strenzfeld (Saxony-Anhalt, Germany). Metagenomic DNA was extracted from soil samples representing the following treatments: (i) plough tillage with standard nitrogen fertilization and use of fungicides and growth regulators, (ii) plough tillage with reduced nitrogen fertilization (50%), (iii) cultivator tillage with standard nitrogen fertilization and use of fungicides and growth regulators, and (iv) cultivator tillage with reduced nitrogen fertilization (50%). Bulk soil (BS), as well as root-affected soil (RS), were considered for all treatments in replicates. HT-sequencing of metagenomic DNA yielded approx. 100 Giga bases (Gb) of sequence information. Taxonomic profiling of soil communities revealed the presence of 70 phyla, whereby Proteobacteria, Actinobacteria, Bacteroidetes, Planctomycetes, Acidobacteria, Thaumarchaeota, Firmicutes, Verrucomicrobia and Chloroflexi feature abundances of more than 1%. Functional microbiome profiling uncovered, i.a., numerous potential plant-beneficial, plant-growth-promoting and biocontrol traits predicted to be involved in nutrient provision, phytohormone synthesis, antagonism against pathogens and signal molecule synthesis relevant in microbe-plant interaction. Neither taxonomic nor functional microbiome profiling based on single-read analyses revealed pronounced differences regarding the farming practices applied. Soil metagenome sequences were assembled and taxonomically binned. The ten most reliable and abundant Metagenomically Assembled Genomes (MAGs) were taxonomically classified and metabolically reconstructed. Importance of the phylum Thaumarchaeota for the analyzed microbiome is corroborated by the fact that the four corresponding MAGs were predicted to oxidize ammonia (nitrification), thus contributing to the cycling of nitrogen, and in addition are most probably able to fix carbon dioxide. Moreover, Thaumarchaeota and several bacterial MAGs also possess genes with predicted functions in plant-growth-promotion. Abundances of certain MAGs (species resolution level) responded to the tillage practice, whereas the factors compartment (BS vs. RS) and nitrogen fertilization only marginally shaped MAG abundance profiles. Hence, soil management regimes promoting plant-beneficial microbiome members are very likely advantageous for the respective agrosystem, its health and carbon sequestration and accordingly may enhance plant productivity. Since Chernozem soils are highly fertile, corresponding microbiome data represent a valuable reference resource for agronomy in general.
Collapse
Affiliation(s)
- Johanna Nelkner
- Center for Biotechnology (CeBiTec), Bielefeld University, Genome Research of Industrial Microorganisms, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Christian Henke
- Center for Biotechnology (CeBiTec), Bielefeld University, Computational Metagenomics Group, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Timo Wentong Lin
- Center for Biotechnology (CeBiTec), Bielefeld University, Genome Research of Industrial Microorganisms, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Wiebke Pätzold
- Center for Biotechnology (CeBiTec), Bielefeld University, Computational Metagenomics Group, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Julia Hassa
- Center for Biotechnology (CeBiTec), Bielefeld University, Genome Research of Industrial Microorganisms, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Sebastian Jaenicke
- Justus-Liebig-University Gießen, Bioinformatics & Systems Biology, Heinrich-Buff-Ring 58, 35392 Gießen, Germany.
| | - Rita Grosch
- Leibniz-Institute of Vegetable and Ornamental Crops (IGZ) Großbeeren/Erfurt eV, Theodor-Echtermeyer-Weg 1, 14979 Großbeeren, Germany.
| | - Alfred Pühler
- Center for Biotechnology (CeBiTec), Bielefeld University, Genome Research of Industrial Microorganisms, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Alexander Sczyrba
- Center for Biotechnology (CeBiTec), Bielefeld University, Computational Metagenomics Group, Universitätsstraße 27, 33615 Bielefeld, Germany.
| | - Andreas Schlüter
- Center for Biotechnology (CeBiTec), Bielefeld University, Genome Research of Industrial Microorganisms, Universitätsstraße 27, 33615 Bielefeld, Germany.
| |
Collapse
|
56
|
Koutsandreas T, Ladoukakis E, Pilalis E, Zarafeta D, Kolisis FN, Skretas G, Chatziioannou AA. ANASTASIA: An Automated Metagenomic Analysis Pipeline for Novel Enzyme Discovery Exploiting Next Generation Sequencing Data. Front Genet 2019; 10:469. [PMID: 31178894 PMCID: PMC6543708 DOI: 10.3389/fgene.2019.00469] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 05/01/2019] [Indexed: 01/27/2023] Open
Abstract
Metagenomic analysis of environmental samples provides deep insight into the enzymatic mixture of the corresponding niches, capable of revealing peptide sequences with novel functional properties exploiting the high performance of next-generation sequencing (NGS) technologies. At the same time due to their ever increasing complexity, there is a compelling need for ever larger computational configurations to ensure proper bioinformatic analysis, and fine annotation. With the aiming to address the challenges of such an endeavor, we have developed a novel web-based application named ANASTASIA (automated nucleotide aminoacid sequences translational plAtform for systemic interpretation and analysis). ANASTASIA provides a rich environment of bioinformatic tools, either publicly available or novel, proprietary algorithms, integrated within numerous automated algorithmic workflows, and which enables versatile data processing tasks for (meta)genomic sequence datasets. ANASTASIA was initially developed in the framework of the European FP7 project HotZyme, whose aim was to perform exhaustive analysis of metagenomes derived from thermal springs around the globe and to discover new enzymes of industrial interest. ANASTASIA has evolved to become a stable and extensible environment for diversified, metagenomic, functional analyses for a range of applications overarching industrial biotechnology to biomedicine, within the frames of the ELIXIR-GR project. As a showcase, we report the successful in silico mining of a novel thermostable esterase termed “EstDZ4” from a metagenomic sample collected from a hot spring located in Krisuvik, Iceland.
Collapse
Affiliation(s)
- Theodoros Koutsandreas
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,e-NIOS Applications PC, Athens, Greece
| | - Efthymios Ladoukakis
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,Laboratory of Biotechnology, School of Chemical Engineering, National Technical University of Athens, Athens, Greece
| | - Eleftherios Pilalis
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,e-NIOS Applications PC, Athens, Greece
| | - Dimitra Zarafeta
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | - Fragiskos N Kolisis
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,Laboratory of Biotechnology, School of Chemical Engineering, National Technical University of Athens, Athens, Greece
| | - Georgios Skretas
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | - Aristotelis A Chatziioannou
- Institute of Chemical Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece.,e-NIOS Applications PC, Athens, Greece
| |
Collapse
|
57
|
Pholwat S, Liu J, Taniuchi M, Chinli R, Pongpan T, Thaipisutikul I, Ratanakorn P, Platts-Mills JA, Fleece M, Stroup S, Gratz J, Mduma E, Mujaga B, Walongo T, Nshama R, Kimathi C, Foongladda S, Houpt ER. Genotypic antimicrobial resistance assays for use on E. coli isolates and stool specimens. PLoS One 2019; 14:e0216747. [PMID: 31075137 PMCID: PMC6510447 DOI: 10.1371/journal.pone.0216747] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 04/27/2019] [Indexed: 01/31/2023] Open
Abstract
Antimicrobial resistance (AMR) is an emerging public health problem and methods for surveillance are needed. We designed 85 sequence-specific PCR reactions to detect 79 genes or mutations associated with resistance across 10 major antimicrobial classes, with a focus on E. coli. The 85 qPCR assays demonstrated >99.9% concordance with sequencing. We evaluated the correlation between genotypic resistance markers and phenotypic susceptibility results on 239 E. coli isolates. Both sensitivity and specificity exceeded 90% for ampicillin, ceftriaxone, cefepime, imipenem, ciprofloxacin, azithromycin, gentamicin, amikacin, trimethoprim/sulfamethoxazole, tetracycline, and chloramphenicol phenotypic susceptibility results. We then evaluated the assays on direct stool specimens and observed a sensitivity of 97% ± 5 but, as expected, a lower specificity of 75% ± 31 versus the genotype of the E. coli cultured from stool. Finally, the assays were incorporated into a convenient TaqMan Array Card (TAC) format. These assays may be useful for tracking AMR in E. coli isolates or directly in stool for targeted testing of the fecal antibiotic resistome.
Collapse
Affiliation(s)
- Suporn Pholwat
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
- Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Jie Liu
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
| | - Mami Taniuchi
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
| | - Rattapha Chinli
- Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Tawat Pongpan
- Swine Veterinarian Service, Charoen Pokphand Foods PCL, Bangkok, Thailand
| | - Iyarit Thaipisutikul
- Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Parntep Ratanakorn
- Faculty of Veterinary Science, Mahidol University, Nakhonpathom, Thailand
| | - James A. Platts-Mills
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
| | - Molly Fleece
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
| | - Suzanne Stroup
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
| | - Jean Gratz
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
- Kilimanjaro Clinical Research Institute, Moshi, Tanzania
| | - Esto Mduma
- Haydom Lutheran Hospital, Haydom, Tanzania
| | - Buliga Mujaga
- Kilimanjaro Clinical Research Institute, Moshi, Tanzania
| | | | | | | | - Suporn Foongladda
- Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Eric R. Houpt
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| |
Collapse
|
58
|
Functioning of the Intestinal Ecosystem: From New Technologies in Microbial Research to Practical Poultry Feeding – A Review. ANNALS OF ANIMAL SCIENCE 2019. [DOI: 10.2478/aoas-2019-0007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Abstract
Unlike classical microbiology which focuses on bacteria capable of growing in vitro, metagenomics is a study of genetic information originating from microflora which aims to characterise the microbiome, namely the common genome of bacteria, archaea, fungi, protozoa and viruses living in the host. Metagenomics relies on next-generation sequencing (NGS), a large-scale sequencing technique which allows millions of sequential reactions to be carried out in parallel to decode entire communities of microorganisms. Metagenomic analyses support taxonomic analyses (involving gene fragments encoding ribosomal RNAs 5S and 16S in bacteria) or functional analyses for identifying genes encoding proteins that participate in the regulation of metabolic pathways in the body. New metagenomics technologies expand our knowledge of the phylogenetic structure of microflora in the gastrointestinal tract of poultry, and they support the identification of previously unknown groups of microbiota, mainly those occurring in small numbers. Next-generation sequencing also provides indirect information about the quantitative structure of the genes of gut microorganisms, but microbial activity and changes in the proportions of microbial metabolites that affect the host’s intestinal integrity and metabolism remain insufficiently investigated. Therefore, research studies are undertaken to investigate the proportions of the key microbial metabolites in the intestinal contents of poultry relative to changes in the population size of the most important bacterial groups, including those determined by cheaper techniques.
Collapse
|
59
|
Karim MR, Michel A, Zappa A, Baranov P, Sahay R, Rebholz-Schuhmann D. Improving data workflow systems with cloud services and use of open data for bioinformatics research. Brief Bioinform 2019; 19:1035-1050. [PMID: 28419324 PMCID: PMC6169675 DOI: 10.1093/bib/bbx039] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Indexed: 11/22/2022] Open
Abstract
Data workflow systems (DWFSs) enable bioinformatics researchers to combine components for data access and data analytics, and to share the final data analytics approach with their collaborators. Increasingly, such systems have to cope with large-scale data, such as full genomes (about 200 GB each), public fact repositories (about 100 TB of data) and 3D imaging data at even larger scales. As moving the data becomes cumbersome, the DWFS needs to embed its processes into a cloud infrastructure, where the data are already hosted. As the standardized public data play an increasingly important role, the DWFS needs to comply with Semantic Web technologies. This advancement to DWFS would reduce overhead costs and accelerate the progress in bioinformatics research based on large-scale data and public resources, as researchers would require less specialized IT knowledge for the implementation. Furthermore, the high data growth rates in bioinformatics research drive the demand for parallel and distributed computing, which then imposes a need for scalability and high-throughput capabilities onto the DWFS. As a result, requirements for data sharing and access to public knowledge bases suggest that compliance of the DWFS with Semantic Web standards is necessary. In this article, we will analyze the existing DWFS with regard to their capabilities toward public open data use as well as large-scale computational and human interface requirements. We untangle the parameters for selecting a preferable solution for bioinformatics research with particular consideration to using cloud services and Semantic Web technologies. Our analysis leads to research guidelines and recommendations toward the development of future DWFS for the bioinformatics research community.
Collapse
Affiliation(s)
- Md Rezaul Karim
- Semantics in eHealth and Life Sciences (SeLS), Insight Centre for Data Analytics, National University of Ireland, Galway, Ireland
| | - Audrey Michel
- School of Biochemistry and Cell Biology, University College Cork, Ireland
| | - Achille Zappa
- Insight Centre for Data Analytics, National University of Ireland Galway, Dangan, Galway, Ireland
| | - Pavel Baranov
- School of Biochemistry and Cell Biology, University College Cork, Ireland
| | - Ratnesh Sahay
- Semantics in eHealth and Life Sciences (SeLS), Insight Centre for Data Analytics, National University of Ireland, Galway, Ireland
| | | |
Collapse
|
60
|
Calderon D, Peña L, Suarez A, Villamil C, Ramirez-Rojas A, Anzola JM, García-Betancur JC, Cepeda ML, Uribe D, Del Portillo P, Mongui A. Recovery and functional validation of hidden soil enzymes in metagenomic libraries. Microbiologyopen 2019; 8:e00572. [PMID: 30851083 PMCID: PMC6460280 DOI: 10.1002/mbo3.572] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Revised: 11/01/2017] [Accepted: 11/09/2017] [Indexed: 11/10/2022] Open
Abstract
The vast microbial diversity on the planet represents an invaluable source for identifying novel activities with potential industrial and therapeutic application. In this regard, metagenomics has emerged as a group of strategies that have significantly facilitated the analysis of DNA from multiple environments and has expanded the limits of known microbial diversity. However, the functional characterization of enzymes, metabolites, and products encoded by diverse microbial genomes is limited by the inefficient heterologous expression of foreign genes. We have implemented a pipeline that combines NGS and Sanger sequencing as a way to identify fosmids within metagenomic libraries. This strategy facilitated the identification of putative proteins, subcloning of targeted genes and preliminary characterization of selected proteins. Overall, the in silico approach followed by the experimental validation allowed us to efficiently recover the activity of previously hidden enzymes derived from agricultural soil samples. Therefore, the methodology workflow described herein can be applied to recover activities encoded by environmental DNA from multiple sources.
Collapse
Affiliation(s)
- Dayana Calderon
- Molecular Biotechnology Research Group, Corporación CorpoGen, Bogotá, Colombia
| | - Luis Peña
- Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Friedrich-Schiller Universität, Jena, Germany
| | - Angélica Suarez
- Molecular Biotechnology Research Group, Corporación CorpoGen, Bogotá, Colombia
| | - Carolina Villamil
- Molecular Biotechnology Research Group, Corporación CorpoGen, Bogotá, Colombia
| | - Adan Ramirez-Rojas
- Molecular Biotechnology Research Group, Corporación CorpoGen, Bogotá, Colombia
| | - Juan M Anzola
- Computational Biology, Corporación CorpoGen, Bogotá, Colombia
| | | | - Martha L Cepeda
- Molecular Biotechnology Research Group, Corporación CorpoGen, Bogotá, Colombia
| | - Daniel Uribe
- Biotechnology Institute, Universidad Nacional de Colombia, Bogotá, Colombia
| | | | - Alvaro Mongui
- Molecular Biotechnology Research Group, Corporación CorpoGen, Bogotá, Colombia.,Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| |
Collapse
|
61
|
Genome-wide profiling of human papillomavirus DNA integration in liquid-based cytology specimens from a Gabonese female population using HPV capture technology. Sci Rep 2019; 9:1504. [PMID: 30728408 PMCID: PMC6365579 DOI: 10.1038/s41598-018-37871-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 12/10/2018] [Indexed: 01/02/2023] Open
Abstract
Human papillomavirus (HPV) is recognised as the cause of precancerous and cancerous cervical lesions. Furthermore, in high-grade lesions, HPV is frequently integrated in the host cell genome and associated with the partial or complete loss of the E1 and E2 genes, which regulate the activity of viral oncoproteins E6 and E7. In this study, using a double-capture system followed by high-throughput sequencing, we determined the HPV integration status present in liquid-based cervical smears in an urban Gabonese population. The main inclusion criteria were based on cytological grade and the detection of the HPV16 genotype using molecular assays. The rate of HPV integration in the host genome varied with cytological grade: 85.7% (6/7), 71.4% (5/7), 66.7% (2/3) 60% (3/5) and 30.8% (4/13) for carcinomas, HSIL, ASCH, LSIL and ASCUS, respectively. For high cytological grades (carcinomas and HSIL), genotypes HPV16 and 18 represented 92.9% of the samples (13/14). The integrated form of HPV16 genotype was mainly found in high-grade lesions in 71.4% of samples regardless of cytological grade. Minority genotypes (HPV33, 51, 58 and 59) were found in LSIL samples, except HPV59, which was identified in one HSIL sample. Among all the HPV genotypes identified after double capture, 10 genotypes (HPV30, 35, 39, 44, 45, 53, 56, 59, 74 and 82) were detected only in episomal form. Our study revealed that the degree of HPV integration varies with cervical cytological grade. The integration event might be a potential clinical prognostic biomarker for the prediction of the progression of neoplastic lesions.
Collapse
|
62
|
|
63
|
Bilal T, Malik B, Hakeem KR. Metagenomic analysis of uncultured microorganisms and their enzymatic attributes. J Microbiol Methods 2018; 155:65-69. [PMID: 30452938 DOI: 10.1016/j.mimet.2018.11.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2018] [Revised: 11/13/2018] [Accepted: 11/16/2018] [Indexed: 12/13/2022]
Abstract
Although second generation biofuel technology is a sustainable route for bioethanol production it is not currently a robust technology because of certain hindrances viz., unavailability of potential enzyme resources, low efficiency of enzymes and restricted availability of potent enzymes that work under harsh conditions in industrial processes. Therefore, bioprospecting of extremophilic microorganisms using metagenomics is a promising alternative to discover novel microbes and enzymes with efficient tolerance to unfavourable conditions and thus could revolutionize the energy sector. Metagenomics a recent field in "omics" technology enables the genomic study of uncultured microorganisms with the goal of better understanding microbial dynamics. Metagenomics in conjunction with NextGen Sequencing technology facilitates the sequencing of microbial DNA directly from environmental samples and has expanded, and transformed our knowledge of the microbial world. However, filtering the meaningful information from the millions of genomic sequences offers a serious challenge to bioinformaticians. The current review holds the opinion tool 'know- how' to unravel the secrets of nature while expediting the bio-industrial world. We also discuss the novel biocatalytic agents discovered through metagenomics and how bioengineering plays a pivotal role to enhance their efficiency.
Collapse
Affiliation(s)
- Tanveer Bilal
- Department of Bioresources, University of Kashmir, Srinagar 190006, India; Department of Bioresources, Amar Singh College, Cluster University of Kashmir, Srinagar 190001, India
| | - Bisma Malik
- Department of Bioresources, University of Kashmir, Srinagar 190006, India
| | - Khalid Rehman Hakeem
- Department of Biological Sciences, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.
| |
Collapse
|
64
|
Almeida OGG, De Martinis ECP. Bioinformatics tools to assess metagenomic data for applied microbiology. Appl Microbiol Biotechnol 2018; 103:69-82. [DOI: 10.1007/s00253-018-9464-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 10/15/2018] [Accepted: 10/16/2018] [Indexed: 12/14/2022]
|
65
|
Characterizing Small RNAs in Filamentous Fungi Using the Rice Blast Fungus, Magnaporthe oryzae, as an Example. Methods Mol Biol 2018. [PMID: 30182228 DOI: 10.1007/978-1-4939-8724-5_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
The goal of this chapter is to provide a framework of sequential steps for small RNA (sRNA) analysis in filamentous fungi. Here, we present protocols for (1) comparative analysis of sRNAs in different conditions, (2) comparisons of sRNA libraries to RNAseq data and (3) identification and analysis of methylguanosine-capped and polyadenylated sRNAs (CPA-sRNAs). This species of small RNA is particularly interesting in Magnaporthe oryzae, as they map to transcription start and end sites of protein-coding genes. While we do not provide specific command lines for scripts, we provide a general framework for steps needed to carry out all three types of analyses, including relevant references, websites and free online tools. Screenshots are provided from our own customized interface using M. oryzae as an example, to assist the reader in visualizing many of the steps.
Collapse
|
66
|
Ma ZS, Li L. Measuring metagenome diversity and similarity with Hill numbers. Mol Ecol Resour 2018; 18:1339-1355. [PMID: 29985552 DOI: 10.1111/1755-0998.12923] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 01/31/2018] [Accepted: 02/17/2018] [Indexed: 11/27/2022]
Abstract
The first step of any metagenome sequencing project is to get the inventory of OTU abundances (operational taxonomic units) and/or metagenomic gene abundances. The former is generated with 16S-rRNA-tagged amplicon sequencing technology, and the latter can be generated from either gene-targeted or whole-sample shotgun metagenomics technologies. With 16S-rRNA data sets, measuring community diversity with diversity indexes such as species richness and Shannon's index has been a de facto standard analysis; nevertheless, similarly comprehensive approaches to metagenomic gene abundances are still largely missing, despite that both OTU and gene abundances are DNA reads. Here, we adapt the Hill numbers, which were reintroduced to macrocommunity ecology recently and are now widely regarded as a most appropriate measure system for ecological diversity, for measuring metagenome alpha-, beta- and gamma-diversities, and similarity. Our proposal includes the following: (a) Metagenomic gene (MG) diversity measures the single-gene-level metagenome diversity; (b) Type-I metagenome functional gene cluster (MFGC) diversity measures the diversity of functional gene clusters but ignoring within-cluster gene abundance information; (c) Type-II MFGC diversity considers within-cluster gene abundances information and integrates gene-cluster-level metagenome diversity and functional gene redundancy information; and (d) Four classes of Hill-numbers-based similarity metrics, including local gene overlap, regional gene overlap, gene homogeneity measure and gene turnover complement, were introduced in terms of MG and MFGC, respectively. We demonstrate the proposal with the gut metagenomes from healthy and IBD (inflammatory bowel disease) cohorts. The Hill numbers offer a unified approach to cohesively and comprehensively measuring the ecological and metagenome diversities of microbiomes.
Collapse
Affiliation(s)
- Zhanshan Sam Ma
- Computational Biology and Medical Ecology Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Lianwei Li
- Computational Biology and Medical Ecology Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
67
|
Abstract
Metagenomic datasets contain billions of protein sequences that could greatly enhance large-scale functional annotation and structure prediction. Utilizing this enormous resource would require reducing its redundancy by similarity clustering. However, clustering hundreds of millions of sequences is impractical using current algorithms because their runtimes scale as the input set size N times the number of clusters K, which is typically of similar order as N, resulting in runtimes that increase almost quadratically with N. We developed Linclust, the first clustering algorithm whose runtime scales as N, independent of K. It can also cluster datasets several times larger than the available main memory. We cluster 1.6 billion metagenomic sequence fragments in 10 h on a single server to 50% sequence identity, >1000 times faster than has been possible before. Linclust will help to unlock the great wealth contained in metagenomic and genomic sequence databases. Billions of metagenomic and genomic sequences fill up public datasets, which makes similarity clustering an important and time-critical analysis step. Here, the authors develop Linclust, an algorithm with linear time complexity that can cluster over a billion sequences within hours on a single server.
Collapse
|
68
|
Bengtsson-Palme J, Larsson DGJ, Kristiansson E. Using metagenomics to investigate human and environmental resistomes. J Antimicrob Chemother 2018; 72:2690-2703. [PMID: 28673041 DOI: 10.1093/jac/dkx199] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Antibiotic resistance is a global health concern declared by the WHO as one of the largest threats to modern healthcare. In recent years, metagenomic DNA sequencing has started to be applied as a tool to study antibiotic resistance in different environments, including the human microbiota. However, a multitude of methods exist for metagenomic data analysis, and not all methods are suitable for the investigation of resistance genes, particularly if the desired outcome is an assessment of risks to human health. In this review, we outline the current state of methods for sequence handling, mapping to databases of resistance genes, statistical analysis and metagenomic assembly. In addition, we provide an overview of important considerations related to the analysis of resistance genes, and recommend some of the currently used tools and methods that are best equipped to inform research and clinical practice related to antibiotic resistance.
Collapse
Affiliation(s)
- Johan Bengtsson-Palme
- Department of Infectious Diseases, Institute of Biomedicine, The Sahlgrenska Academy, University of Gothenburg, Guldhedsgatan 10, SE-41346, Gothenburg, Sweden.,Centre for Antibiotic Resistance Research (CARe) at University of Gothenburg, Box 440, SE-40530, Gothenburg, Sweden
| | - D G Joakim Larsson
- Department of Infectious Diseases, Institute of Biomedicine, The Sahlgrenska Academy, University of Gothenburg, Guldhedsgatan 10, SE-41346, Gothenburg, Sweden.,Centre for Antibiotic Resistance Research (CARe) at University of Gothenburg, Box 440, SE-40530, Gothenburg, Sweden
| | - Erik Kristiansson
- Centre for Antibiotic Resistance Research (CARe) at University of Gothenburg, Box 440, SE-40530, Gothenburg, Sweden.,Department of Mathematical Sciences, Chalmers University of Technology, SE-41296, Gothenburg, Sweden
| |
Collapse
|
69
|
Wu IC, Liu WC, Chang TT. Applications of next-generation sequencing analysis for the detection of hepatocellular carcinoma-associated hepatitis B virus mutations. J Biomed Sci 2018; 25:51. [PMID: 29859540 PMCID: PMC5984823 DOI: 10.1186/s12929-018-0442-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 04/30/2018] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Next-generation sequencing (NGS) is a powerful and high-throughput method for the detection of viral mutations. This article provides a brief overview about optimization of NGS analysis for hepatocellular carcinoma (HCC)-associated hepatitis B virus (HBV) mutations, and hepatocarcinogenesis of relevant mutations. MAIN BODY For the application of NGS analysis in the genome of HBV, four noteworthy steps were discovered in testing. First, a sample-specific reference sequence was the most effective mapping reference for NGS. Second, elongating the end of reference sequence improved mapping performance at the end of the genome. Third, resetting the origin of mapping reference sequence could probed deletion mutations and variants at a certain location with common mutations. Fourth, using a platform-specific cut-off value to distinguish authentic minority variants from technical artifacts was found to be highly effective. One hundred and sixty-seven HBV single nucleotide variants (SNVs) were found to be studied previously through a systematic literature review, and 12 SNVs were determined to be associated with HCC by meta-analysis. From comprehensive research using a HBV genome-wide NGS analysis, 60 NGS-defined HCC-associated SNVs with their pathogenic frequencies were identified, with 19 reported previously. All the 12 HCC-associated SNVs proved by meta-analysis were confirmed by NGS analysis, except for C1766T and T1768A which were mainly expressed in genotypes A and D, but including the subgroup analysis of A1762T. In the 41 novel NGS-defined HCC-associated SNVs, 31.7% (13/41) had cut-off values of SNV frequency lower than 20%. This showed that NGS could be used to detect HCC-associated SNVs with low SNV frequency. Most SNV II (the minor strains in the majority of non-HCC patients) had either low (< 20%) or high (> 80%) SNV frequencies in HCC patients, a characteristic U-shaped distribution pattern. The cut-off values of SNV frequency for HCC-associated SNVs represent their pathogenic frequencies. The pathogenic frequencies of HCC-associated SNV II also showed a U-shaped distribution. Hepatocarcinogenesis induced by HBV mutated proteins through cellular pathways was reviewed. CONCLUSION NGS analysis is useful to discover novel HCC-associated HBV SNVs, especially those with low SNV frequency. The hepatocarcinogenetic mechanisms of novel HCC-associated HBV SNVs defined by NGS analysis deserve further investigation.
Collapse
Affiliation(s)
- I-Chin Wu
- Department of Internal Medicine, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, 138 Sheng-Li Road, Tainan, 70403, Taiwan, Republic of China.,Infectious Disease and Signaling Research Center, National Cheng Kung University, Tainan, Taiwan, Republic of China
| | - Wen-Chun Liu
- Department of Internal Medicine, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, 138 Sheng-Li Road, Tainan, 70403, Taiwan, Republic of China.,Infectious Disease and Signaling Research Center, National Cheng Kung University, Tainan, Taiwan, Republic of China
| | - Ting-Tsung Chang
- Department of Internal Medicine, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, 138 Sheng-Li Road, Tainan, 70403, Taiwan, Republic of China.
| |
Collapse
|
70
|
Probiotic Lactobacillus Paracasei Expressing a Nucleic Acid-Hydrolyzing Minibody (3D8 Scfv) Enhances Probiotic Activities in Mice Intestine as Revealed by Metagenomic Analyses. Genes (Basel) 2018; 9:genes9060276. [PMID: 29844265 PMCID: PMC6027128 DOI: 10.3390/genes9060276] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Revised: 05/17/2018] [Accepted: 05/17/2018] [Indexed: 01/13/2023] Open
Abstract
Probiotics are well known for their beneficial effects for animals, including humans and livestock. Here, we tested the probiotic activity of Lactobacillus paracasei expressing 3D8 scFv, a nucleic acid-hydrolyzing mini-antibody, in mice intestine. A total of 18 fecal samples derived from three different conditions at two different time points were subjected to high-throughput 16S ribosomal RNA (rRNA) metagenomic analyses. Bioinformatic analyses identified an average of 290 operational taxonomic units. After administration of L. paracasei, populations of the probiotics L. paracasei, Lactobacillus reuteri, and Pediococcus acidilactici increased, whereas the population of harmful bacteria such as Helicobacter species decreased. Furthermore, continuous administration of L. paracasei resulted in L. paracasei emerging as the dominant probiotic after competition with other existing probiotics. Expression of 3D8 scFv protein specifically increased the population of P. acidilactici, which is another probiotic. In summary, our results showed that L. paracasei expressing 3D8 scFv protein enhanced probiotic activity in mice intestine with no observable side effects. Thus, the system developed in this study may be a good tool for the expression of recombinant protein using probiotics.
Collapse
|
71
|
Mataragas M, Alessandria V, Ferrocino I, Rantsiou K, Cocolin L. A bioinformatics pipeline integrating predictive metagenomics profiling for the analysis of 16S rDNA/rRNA sequencing data originated from foods. Food Microbiol 2018; 76:279-286. [PMID: 30166151 DOI: 10.1016/j.fm.2018.05.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2017] [Revised: 05/18/2018] [Accepted: 05/23/2018] [Indexed: 11/29/2022]
Abstract
The recent advances in molecular biology, such as the advent of next-generation sequencing (NGS) platforms, have paved the way to new exciting tools which rapidly transform food microbiology. Nowadays, NGS methods such as 16S rDNA/rRNA metagenomics or amplicon sequencing are used for the taxonomic profiling of the food microbial communities. Although 16S rDNA/rRNA NGS-based microbial data are not suited for the investigation of the functional potential of the identified operational taxonomic units as compared to shotgun metagenomics, advances in the bioinformatics discipline allow now the performance of such studies. In this paper, a bioinformatics workflow is described integrating predictive metagenomics profiling with specific application to food microbiology data. Bioinformatics tools pertinent to each sub-module of the pipeline are suggested as well. The published 16S rDNA/rRNA amplicon data originated from an Italian Grana-type cheese, using an NGS platform, was employed to demonstrate the predictive metagenomics profiling approach. The pipeline identified the microbial community and the changes that occurred in the microbial profile during manufacture of the food product studied (taxonomic profiling). The workflow also indicated significant changes in the functional profiling of the community. The tool may help to investigate the functional potential, alterations, and interactions of a microbial community. The proposed workflow may also find an application in the investigation of the ecology of foodborne pathogens encountered in various food products.
Collapse
Affiliation(s)
- Marios Mataragas
- Hellenic Agricultural Organization "DEMETER", Institute of Technology of Agricultural Products, Department of Dairy Research, Ethnikis Antistaseos 3, 45221, Ioannina, Greece.
| | - Valentina Alessandria
- University of Turin, Department of Agricultural, Forest and Food Sciences, Laboratory of Food Microbiology, Largo P. Braccini 2, 10095, Grugliasco, Turin, Italy
| | - Ilario Ferrocino
- University of Turin, Department of Agricultural, Forest and Food Sciences, Laboratory of Food Microbiology, Largo P. Braccini 2, 10095, Grugliasco, Turin, Italy
| | - Kalliopi Rantsiou
- University of Turin, Department of Agricultural, Forest and Food Sciences, Laboratory of Food Microbiology, Largo P. Braccini 2, 10095, Grugliasco, Turin, Italy
| | - Luca Cocolin
- University of Turin, Department of Agricultural, Forest and Food Sciences, Laboratory of Food Microbiology, Largo P. Braccini 2, 10095, Grugliasco, Turin, Italy
| |
Collapse
|
72
|
Schaeffer L, Pimentel H, Bray N, Melsted P, Pachter L. Pseudoalignment for metagenomic read assignment. Bioinformatics 2018; 33:2082-2088. [PMID: 28334086 DOI: 10.1093/bioinformatics/btx106] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 02/17/2017] [Indexed: 12/13/2022] Open
Abstract
Motivation Read assignment is an important first step in many metagenomic analysis workflows, providing the basis for identification and quantification of species. However ambiguity among the sequences of many strains makes it difficult to assign reads at the lowest level of taxonomy, and reads are typically assigned to taxonomic levels where they are unambiguous. We explore connections between metagenomic read assignment and the quantification of transcripts from RNA-Seq data in order to develop novel methods for rapid and accurate quantification of metagenomic strains. Results We find that the recent idea of pseudoalignment introduced in the RNA-Seq context is highly applicable in the metagenomics setting. When coupled with the Expectation-Maximization (EM) algorithm, reads can be assigned far more accurately and quickly than is currently possible with state of the art software, making it possible and practical for the first time to analyze abundances of individual genomes in metagenomics projects. Availability and Implementation Pipeline and analysis code can be downloaded from http://github.com/pachterlab/metakallisto. Contact lpachter@math.berkeley.edu.
Collapse
Affiliation(s)
- L Schaeffer
- Department of Molecular and Cell Biology, UC Berkeley, Berkeley, CA, USA
| | - H Pimentel
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - N Bray
- Department of Molecular and Cell Biology and Innovative Genomics Institute, UC Berkeley, Berkeley, CA, USA
| | - P Melsted
- Faculty of Industrial Engineering, Mechanical Engineering and Computer Science, University of Iceland, Reykjavik, Iceland
| | - L Pachter
- Department of Molecular and Cell Biology, UC Berkeley, Berkeley, CA, USA.,Departments of Mathematics and Computer Science, UC Berkeley, Berkeley, CA, USA
| |
Collapse
|
73
|
Savio D, Stadler P, Reischer GH, Kirschner AK, Demeter K, Linke R, Blaschke AP, Sommer R, Szewzyk U, Wilhartitz IC, Mach RL, Stadler H, Farnleitner AH. Opening the black box of spring water microbiology from alpine karst aquifers to support proactive drinking water resource management. WIRES. WATER 2018; 5:e1282. [PMID: 29780584 PMCID: PMC5947618 DOI: 10.1002/wat2.1282] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Revised: 01/25/2018] [Accepted: 01/26/2018] [Indexed: 06/08/2023]
Abstract
Over the past 15 years, pioneering interdisciplinary research has been performed on the microbiology of hydrogeologically well-defined alpine karst springs located in the Northern Calcareous Alps (NCA) of Austria. This article gives an overview on these activities and links them to other relevant research. Results from the NCA springs and comparable sites revealed that spring water harbors abundant natural microbial communities even in aquifers with high water residence times and the absence of immediate surface influence. Apparently, hydrogeology has a strong impact on the concentration and size of the observed microbes, and total cell counts (TCC) were suggested as a useful means for spring type classification. Measurement of microbial activities at the NCA springs revealed extremely low microbial growth rates in the base flow component of the studied spring waters and indicated the importance of biofilm-associated microbial activities in sediments and on rock surfaces. Based on genetic analysis, the autochthonous microbial endokarst community (AMEC) versus transient microbial endokarst community (TMEC) concept was proposed for the NCA springs, and further details within this overview article are given to prompt its future evaluation. In this regard, it is well known that during high-discharge situations, surface-associated microbes and nutrients such as from soil habitats or human settlements-potentially containing fecal-associated pathogens as the most critical water-quality hazard-may be rapidly flushed into vulnerable karst aquifers. In this context, a framework for the comprehensive analysis of microbial pollution has been proposed for the NCA springs to support the sustainable management of drinking water safety in accordance with recent World Health Organization guidelines. Near-real-time online water quality monitoring, microbial source tracking (MST) and MST-guided quantitative microbial-risk assessment (QMRA) are examples of the proposed analytical tools. In this context, this overview article also provides a short introduction to recently emerging methodologies in microbiological diagnostics to support reading for the practitioner. Finally, the article highlights future research and development needs. This article is categorized under: 1Engineering Water > Water, Health, and Sanitation2Science of Water > Water Extremes3Water and Life > Nature of Freshwater Ecosystems.
Collapse
Affiliation(s)
- Domenico Savio
- Division Water Quality and HealthDepartment Pharmacology, Physiology and Microbiology, Karl Landsteiner University of Health SciencesKrems a. d. DonauAustria
- Centre for Water Resource SystemsTechnische Universität WienViennaAustria
| | - Philipp Stadler
- Centre for Water Resource SystemsTechnische Universität WienViennaAustria
- Institute for Water Quality, Resource and Waste ManagementTechnische Universität WienViennaAustria
| | - Georg H. Reischer
- Institute of Chemical, Environmental & Bioscience Engineering, Research Group Environmental Microbiology and Molecular Diagnostics166/5/3, Technische Universität WienViennaAustria
- Interuniversity Cooperation Centre for Water and Health, www.waterandhealth.at
| | - Alexander K.T. Kirschner
- Interuniversity Cooperation Centre for Water and Health, www.waterandhealth.at
- Unit Water Hygiene, Institute for Hygiene and Applied ImmunologyMedical University of ViennaViennaAustria
| | - Katalin Demeter
- Centre for Water Resource SystemsTechnische Universität WienViennaAustria
- Institute of Chemical, Environmental & Bioscience Engineering, Research Group Environmental Microbiology and Molecular Diagnostics166/5/3, Technische Universität WienViennaAustria
| | - Rita Linke
- Institute of Chemical, Environmental & Bioscience Engineering, Research Group Environmental Microbiology and Molecular Diagnostics166/5/3, Technische Universität WienViennaAustria
- Interuniversity Cooperation Centre for Water and Health, www.waterandhealth.at
| | - Alfred P. Blaschke
- Centre for Water Resource SystemsTechnische Universität WienViennaAustria
- Interuniversity Cooperation Centre for Water and Health, www.waterandhealth.at
- Institute of Hydraulic Engineering and Water Resources ManagementTechnische Universität WienViennaAustria
| | - Regina Sommer
- Interuniversity Cooperation Centre for Water and Health, www.waterandhealth.at
- Unit Water Hygiene, Institute for Hygiene and Applied ImmunologyMedical University of ViennaViennaAustria
| | - Ulrich Szewzyk
- Department of Environmental TechnologyTechnical University of BerlinBerlinGermany
| | - Inés C. Wilhartitz
- Department of Environmental MicrobiologyEawag, Swiss Federal Institute of Aquatic Science and TechnologyDübendorfSwitzerland
| | - Robert L. Mach
- Institute of Chemical, Environmental & Bioscience Engineering, Research Group Environmental Microbiology and Molecular Diagnostics166/5/3, Technische Universität WienViennaAustria
| | - Hermann Stadler
- Department for Water Resources Management and Environmental AnalyticsInstitute for Water, Energy and Sustainability, Joanneum Research, GrazAustria
| | - Andreas H. Farnleitner
- Division Water Quality and HealthDepartment Pharmacology, Physiology and Microbiology, Karl Landsteiner University of Health SciencesKrems a. d. DonauAustria
- Institute of Chemical, Environmental & Bioscience Engineering, Research Group Environmental Microbiology and Molecular Diagnostics166/5/3, Technische Universität WienViennaAustria
- Interuniversity Cooperation Centre for Water and Health, www.waterandhealth.at
| |
Collapse
|
74
|
Yauy K, Baux D, Pegeot H, Van Goethem C, Mathieu C, Guignard T, Juntas Morales R, Lacourt D, Krahn M, Lehtokari VL, Bonne G, Tuffery-Giraud S, Koenig M, Cossée M. MoBiDiC Prioritization Algorithm, a Free, Accessible, and Efficient Pipeline for Single-Nucleotide Variant Annotation and Prioritization for Next-Generation Sequencing Routine Molecular Diagnosis. J Mol Diagn 2018; 20:465-473. [PMID: 29689380 DOI: 10.1016/j.jmoldx.2018.03.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Revised: 02/16/2018] [Accepted: 03/09/2018] [Indexed: 12/22/2022] Open
Abstract
Interpretation of next-generation sequencing constitutes the main limitation of molecular diagnostics. In diagnosing myopathies and muscular dystrophies, another issue is efficiency in predicting the pathogenicity of variants identified in large genes, especially TTN; current in silico prediction tools show limitations in predicting and ranking the numerous variants of such genes. We propose a variant-prioritization tool, the MoBiDiCprioritization algorithm (MPA). MPA is based on curated interpretation of data on previously reported variants, biological assumptions, and splice and missense predictors, and is used to prioritize all types of single-nucleotide variants. MPA was validated by comparing its sensitivity and specificity to those of dbNSFP database prediction tools, using a data set composed of DYSF, DMD, LMNA, NEB, and TTN variants extracted from expert-reviewed and ExAC databases. MPA obtained the best annotation rates for missense and splice variants. As MPA aggregates the results from several predictors, individual predictor errors are counterweighted, improving the sensitivity and specificity of missense and splice variant predictions. We propose a sequential use of MPA, beginning with the selection of variants with higher scores and followed by, in the absence of candidate pathologic variants, consideration of variants with lower scores. We provide scripts and documentation for free academic use and a validated annotation pipeline scaled for panel and exome sequencing to prioritize single-nucleotide variants from a VCF file.
Collapse
Affiliation(s)
- Kevin Yauy
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France.
| | - David Baux
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Henri Pegeot
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Charles Van Goethem
- Laboratoire de Biopathologie Cellulaire et Tissulaire des Tumeurs, Hôpital Arnaud de Villeneuve, CHU Montpellier, Montpellier, France
| | - Charly Mathieu
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Thomas Guignard
- Plateforme Recherche de Microremaniements Chromosomiques-Centre de Référence Anomalies du Développement et Syndromes Malformatifs, Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France; Faculté de Médecine Montpellier-Nîmes, Université de Montpellier, Montpellier, France
| | | | - Delphine Lacourt
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Martin Krahn
- Unité de Génétique Médicale et Génomique Fonctionnelle INSERM UMRS910, Université d'Aix Marseille, Marseille, France; Département de Génétique Médicale, Hôpital Timone Enfants, Assistance Publique Hôpitaux de Marseille, Marseille, France
| | - Vilma-Lotta Lehtokari
- The Folkhalsan Institute of Genetics and the Department of Medical Genetics, Haartman Institute, University of Helsinki, Helsinki, Finland
| | - Gisele Bonne
- Unité INSERM U974-Thérapie des Maladies du Muscle Striée, Center of Research in Myology, Institut de Myologie, Université Pierre et Marie Curie, Sorbonne Universités, Paris, France
| | - Sylvie Tuffery-Giraud
- Laboratoire de Génétique des Maladies Rares EA7402, Université de Montpellier, Montpellier, France
| | - Michel Koenig
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France; Laboratoire de Génétique des Maladies Rares EA7402, Université de Montpellier, Montpellier, France
| | - Mireille Cossée
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France; Laboratoire de Génétique des Maladies Rares EA7402, Université de Montpellier, Montpellier, France
| |
Collapse
|
75
|
Cesare AD, Palma F, Lucchi A, Pasquali F, Manfreda G. Microbiological profile of chicken carcasses: A comparative analysis using shotgun metagenomic sequencing. Ital J Food Saf 2018; 7:6923. [PMID: 29732327 PMCID: PMC5913701 DOI: 10.4081/ijfs.2018.6923] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2017] [Revised: 12/05/2017] [Accepted: 12/05/2017] [Indexed: 01/14/2023] Open
Abstract
In the last few years metagenomic and 16S rRNA sequencing have completly changed the microbiological investigations of food products. In this preliminary study, the microbiological profile of chicken carcasses collected from animals fed with different diets were tested by using shotgun metagenomic sequencing. A total of 15 carcasses have been collected at the slaughetrhouse at the end of the refrigeration tunnel from chickens reared for 35 days and fed with a control diet (n=5), a diet supplemented with 1500 FTU/kg of commercial phytase (n=5) and a diet supplemented with 1500 FTU/kg of commercial phytase and 3g/kg of inositol (n=5). Ten grams of neck and breast skin were obtained from each carcass and submited to total DNA extraction by using the DNeasy Blood & Tissue Kit (Qiagen). Sequencing libraries have been prepared by using the Nextera XT DNA Library Preparation Kit (Illumina) and sequenced in a HiScanSQ (Illumina) at 100 bp in paired ends. A number of sequences ranging between 5 and 9 million was obtained for each sample. Sequence analysis showed that Proteobacteria and Firmicutes represented more than 98% of whole bacterial populations associated to carcass skin in all groups but their abundances were different between groups. Moraxellaceae and other degradative bacteria showed a significantly higher abundance in the control compared to the treated groups. Furthermore, Clostridium perfringens showed a relative frequency of abundance significantly higher in the group fed with phytase and Salmonella enterica in the group fed with phytase plus inositol. The results of this preliminary study showed that metagenome sequencing is suitable to investigate and monitor carcass microbiota in order to detect specific pathogenic and/or degradative populations.
Collapse
Affiliation(s)
- Alessandra De Cesare
- Department of Agriculture and Food Sciences, Alma Mater Studiorum University of Bologna, Italy
| | - Federica Palma
- Department of Agriculture and Food Sciences, Alma Mater Studiorum University of Bologna, Italy
| | - Alex Lucchi
- Department of Agriculture and Food Sciences, Alma Mater Studiorum University of Bologna, Italy
| | - Frederique Pasquali
- Department of Agriculture and Food Sciences, Alma Mater Studiorum University of Bologna, Italy
| | - Gerardo Manfreda
- Department of Agriculture and Food Sciences, Alma Mater Studiorum University of Bologna, Italy
| |
Collapse
|
76
|
Burczynska A, Dziewit L, Decewicz P, Struzycka I, Wroblewska M. Application of Metagenomic Analyses in Dentistry as a Novel Strategy Enabling Complex Insight into Microbial Diversity of the Oral Cavity. Pol J Microbiol 2018; 66:9-15. [PMID: 29359689 DOI: 10.5604/17331331.1234988] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The composition of the oral microbiome in healthy individuals is complex and dynamic, and depends on many factors, such as anatomical location in the oral cavity, diet, oral hygiene habits or host immune responses. It is estimated at present that worldwide about 2 billion people suffer from diseases of the oral cavity, mainly periodontal disease and dental caries. Importantly, the oral microflora involved in local infections may spread and cause systemic, even life-threatening infections. In search for etiological agents of infections in dentistry, traditional approaches are not sufficient, as about 50% of oral bacteria are not cultivable. Instead, metagenomic analyses are particularly useful for studies of the complex oral microbiome - both in healthy individuals, and in patients with oral and dental diseases. In this paper we review the current and future applications of metagenomic studies in evaluation of both the composition of the oral microbiome as well as its potential pathogenic role in infections in dentistry.
Collapse
Affiliation(s)
| | - Lukasz Dziewit
- Department of Bacterial Genetics, Institute of Microbiology, Faculty of Biology, University of Warsaw, Poland
| | - Przemysław Decewicz
- Department of Bacterial Genetics, Institute of Microbiology, Faculty of Biology, University of Warsaw, Poland; Research and Development for Life Sciences Ltd., Poland
| | - Izabela Struzycka
- Department of Comprehensive Dental Care, Medical University of Warsaw, Poland
| | - Marta Wroblewska
- Department of Dental Microbiology, Medical University of Warsaw, Poland; Department of Microbiology, Central Clinical Hospital in Warsaw, Poland
| |
Collapse
|
77
|
Alonso A, Lasseigne BN, Williams K, Nielsen J, Ramaker RC, Hardigan AA, Johnston B, Roberts BS, Cooper SJ, Marsal S, Myers RM. aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments. Bioinformatics 2018; 33:1727-1729. [PMID: 28108448 PMCID: PMC5447234 DOI: 10.1093/bioinformatics/btx023] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Accepted: 01/18/2017] [Indexed: 11/14/2022] Open
Abstract
Summary The wide range of RNA-seq applications and their high-computational needs require the development of pipelines orchestrating the entire workflow and optimizing usage of available computational resources. We present aRNApipe, a project-oriented pipeline for processing of RNA-seq data in high-performance cluster environments. aRNApipe is highly modular and can be easily migrated to any high-performance computing (HPC) environment. The current applications included in aRNApipe combine the essential RNA-seq primary analyses, including quality control metrics, transcript alignment, count generation, transcript fusion identification, alternative splicing and sequence variant calling. aRNApipe is project-oriented and dynamic so users can easily update analyses to include or exclude samples or enable additional processing modules. Workflow parameters are easily set using a single configuration file that provides centralized tracking of all analytical processes. Finally, aRNApipe incorporates interactive web reports for sample tracking and a tool for managing the genome assemblies available to perform an analysis. Availability and documentation https://github.com/HudsonAlpha/aRNAPipe ; DOI: 10.5281/zenodo.202950. Contact rmyers@hudsonalpha.org. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Arnald Alonso
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Rheumatology Research Group, Vall d'Hebron Hospital Research Institute, Barcelona, Spain
| | | | - Kelly Williams
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Josh Nielsen
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Ryne C Ramaker
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Genetics, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Andrew A Hardigan
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Genetics, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Bobbi Johnston
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Brian S Roberts
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Sara J Cooper
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Sara Marsal
- Rheumatology Research Group, Vall d'Hebron Hospital Research Institute, Barcelona, Spain
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| |
Collapse
|
78
|
Westbrook A, Ramsdell J, Schuelke T, Normington L, Bergeron RD, Thomas WK, MacManes MD. PALADIN: protein alignment for functional profiling whole metagenome shotgun data. Bioinformatics 2018; 33:1473-1478. [PMID: 28158639 PMCID: PMC5423455 DOI: 10.1093/bioinformatics/btx021] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2016] [Accepted: 01/24/2017] [Indexed: 11/14/2022] Open
Abstract
Motivation Whole metagenome shotgun sequencing is a powerful approach for assaying the functional potential of microbial communities. We currently lack tools that efficiently and accurately align DNA reads against protein references, the technique necessary for constructing a functional profile. Here, we present PALADIN-a novel modification of the Burrows-Wheeler Aligner that provides accurate alignment, robust reporting capabilities and orders-of-magnitude improved efficiency by directly mapping in protein space. Results We compared the accuracy and efficiency of PALADIN against existing tools that employ nucleotide or protein alignment algorithms. Using simulated reads, PALADIN consistently outperformed the popular DNA read mappers BWA and NovoAlign in detected proteins, percentage of reads mapped and ontological similarity. We also compared PALADIN against four existing protein alignment tools: BLASTX, RAPSearch2, DIAMOND and Lambda, using empirically obtained reads. PALADIN yielded results seven times faster than the best performing alternative, DIAMOND and nearly 8000 times faster than BLASTX. PALADIN's accuracy was comparable to all tested solutions. Availability and Implementation PALADIN was implemented in C, and its source code and documentation are available at https://github.com/twestbrookunh/paladin. Contact anthonyw@wildcats.unh.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Anthony Westbrook
- Department of Computer Science, University of New Hampshire, Durham, NH, USA
| | - Jordan Ramsdell
- Department of Computer Science, University of New Hampshire, Durham, NH, USA.,Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, USA
| | - Taruna Schuelke
- Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH, USA
| | - Louisa Normington
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, USA
| | - R Daniel Bergeron
- Department of Computer Science, University of New Hampshire, Durham, NH, USA.,Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, USA
| | - W Kelley Thomas
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, USA.,Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH, USA
| | - Matthew D MacManes
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, USA.,Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH, USA
| |
Collapse
|
79
|
Zhang S, Hu Z, Wang H. A Retrospective Review of Microbiological Methods Applied in Studies Following the Deepwater Horizon Oil Spill. Front Microbiol 2018; 9:520. [PMID: 29628913 PMCID: PMC5876298 DOI: 10.3389/fmicb.2018.00520] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 03/08/2018] [Indexed: 12/19/2022] Open
Abstract
The Deepwater Horizon (DWH) oil spill in the Gulf of Mexico in 2010 resulted in serious damage to local marine and coastal environments. In addition to the physical removal and chemical dispersion of spilled oil, biodegradation by indigenous microorganisms was regarded as the most effective way for cleaning up residual oil. Different microbiological methods were applied to investigate the changes and responses of bacterial communities after the DWH oil spills. By summarizing and analyzing these microbiological methods, giving recommendations and proposing some methods that have not been used, this review aims to provide constructive guidelines for microbiological studies after environmental disasters, especially those involving organic pollutants.
Collapse
Affiliation(s)
| | - Zhong Hu
- Biology Department, College of Science, Shantou University, Shantou, China
| | - Hui Wang
- Biology Department, College of Science, Shantou University, Shantou, China
| |
Collapse
|
80
|
A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection. mSphere 2018; 3:mSphere00069-18. [PMID: 29564396 PMCID: PMC5853486 DOI: 10.1128/mspheredirect.00069-18] [Citation(s) in RCA: 112] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Accepted: 02/16/2018] [Indexed: 12/20/2022] Open
Abstract
To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection. Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection.
Collapse
|
81
|
Morales‐Cruz A, Allenbeck G, Figueroa‐Balderas R, Ashworth VE, Lawrence DP, Travadon R, Smith RJ, Baumgartner K, Rolshausen PE, Cantu D. Closed-reference metatranscriptomics enables in planta profiling of putative virulence activities in the grapevine trunk disease complex. MOLECULAR PLANT PATHOLOGY 2018; 19:490-503. [PMID: 28218463 PMCID: PMC6638111 DOI: 10.1111/mpp.12544] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 02/13/2017] [Indexed: 05/20/2023]
Abstract
Grapevines, like other perennial crops, are affected by so-called 'trunk diseases', which damage the trunk and other woody tissues. Mature grapevines typically contract more than one trunk disease and often multiple grapevine trunk pathogens (GTPs) are recovered from infected tissues. The co-existence of different GTP species in complex and dynamic microbial communities complicates the study of the molecular mechanisms underlying disease development, especially under vineyard conditions. The objective of this study was to develop and optimize a community-level transcriptomics (i.e. metatranscriptomics) approach that could monitor simultaneously the virulence activities of multiple GTPs in planta. The availability of annotated genomes for the most relevant co-infecting GTPs in diseased grapevine wood provided the unprecedented opportunity to generate a multi-species reference for the mapping and quantification of DNA and RNA sequencing reads. We first evaluated popular sequence read mappers using permutations of multiple simulated datasets. Alignment parameters of the selected mapper were optimized to increase the specificity and sensitivity for its application to metagenomics and metatranscriptomics analyses. Initial testing on grapevine wood experimentally inoculated with individual GTPs confirmed the validity of the method. Using naturally infected field samples expressing a variety of trunk disease symptoms, we show that our approach provides quantitative assessments of species composition, as well as genome-wide transcriptional profiling of potential virulence factors, namely cell wall degradation, secondary metabolism and nutrient uptake for all co-infecting GTPs.
Collapse
Affiliation(s)
- Abraham Morales‐Cruz
- Department of Viticulture and EnologyUniversity of California DavisDavisCA95616USA
| | - Gabrielle Allenbeck
- Department of Viticulture and EnologyUniversity of California DavisDavisCA95616USA
| | | | - Vanessa E. Ashworth
- Department of Botany and Plant SciencesUniversity of California RiversideRiversideCA92521USA
| | - Daniel P. Lawrence
- Department of Plant PathologyUniversity of California DavisDavisCA95616USA
| | - Renaud Travadon
- Department of Plant PathologyUniversity of California DavisDavisCA95616USA
| | - Rhonda J. Smith
- University of California Cooperative Extension, Sonoma CountySanta RosaCA95403USA
| | - Kendra Baumgartner
- United States Department of Agriculture ‐ Agricultural Research ServiceCrops Pathology and Genetics Research UnitDavisCA95616USA
| | - Philippe E. Rolshausen
- Department of Botany and Plant SciencesUniversity of California RiversideRiversideCA92521USA
| | - Dario Cantu
- Department of Viticulture and EnologyUniversity of California DavisDavisCA95616USA
| |
Collapse
|
82
|
Sergaki C, Lagunas B, Lidbury I, Gifford ML, Schäfer P. Challenges and Approaches in Microbiome Research: From Fundamental to Applied. FRONTIERS IN PLANT SCIENCE 2018; 9:1205. [PMID: 30174681 PMCID: PMC6107787 DOI: 10.3389/fpls.2018.01205] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 07/26/2018] [Indexed: 05/07/2023]
Abstract
We face major agricultural challenges that remain a threat for global food security. Soil microbes harbor enormous potentials to provide sustainable and economically favorable solutions that could introduce novel approaches to improve agricultural practices and, hence, crop productivity. In this review we give an overview regarding the current state-of-the-art of microbiome research by discussing new technologies and approaches. We also provide insights into fundamental microbiome research that aim to provide a deeper understanding of the dynamics within microbial communities, as well as their interactions with different plant hosts and the environment. We aim to connect all these approaches with potential applications and reflect how we can use microbial communities in modern agricultural systems to realize a more customized and sustainable use of valuable resources (e.g., soil).
Collapse
Affiliation(s)
- Chrysi Sergaki
- School of Life Sciences, University of Warwick, Coventry, United Kingdom
- *Correspondence: Chrysi Sergaki,
| | - Beatriz Lagunas
- School of Life Sciences, University of Warwick, Coventry, United Kingdom
| | - Ian Lidbury
- School of Life Sciences, University of Warwick, Coventry, United Kingdom
| | - Miriam L. Gifford
- School of Life Sciences, University of Warwick, Coventry, United Kingdom
- Warwick Integrative Synthetic Biology Centre, University of Warwick, Coventry, United Kingdom
| | - Patrick Schäfer
- School of Life Sciences, University of Warwick, Coventry, United Kingdom
- Warwick Integrative Synthetic Biology Centre, University of Warwick, Coventry, United Kingdom
| |
Collapse
|
83
|
Foong CP, Lakshmanan M, Abe H, Taylor TD, Foong SY, Sudesh K. A novel and wide substrate specific polyhydroxyalkanoate (PHA) synthase from unculturable bacteria found in mangrove soil. JOURNAL OF POLYMER RESEARCH 2017. [DOI: 10.1007/s10965-017-1403-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
84
|
Williamson KE, Fuhrmann JJ, Wommack KE, Radosevich M. Viruses in Soil Ecosystems: An Unknown Quantity Within an Unexplored Territory. Annu Rev Virol 2017; 4:201-219. [PMID: 28961409 DOI: 10.1146/annurev-virology-101416-041639] [Citation(s) in RCA: 158] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Viral abundance in soils can range from below detection limits in hot deserts to over 1 billion per gram in wetlands. Abundance appears to be strongly influenced by water availability and temperature, but a lack of informational standards creates difficulties for cross-study analysis. Soil viral diversity is severely underestimated and undersampled, although current measures of viral richness are higher for soils than for aquatic ecosystems. Both morphometric and metagenomic analyses have raised questions about the prevalence of nontailed, ssDNA viruses in soils. Soil is complex and critically important to terrestrial biodiversity and human civilization, but impacts of viral activities on soil ecosystem services are poorly understood. While information from aquatic systems and medical microbiology suggests the potential for viral influences on nutrient cycles, food web interactions, gene transfer, and other key processes in soils, very few empirical data are available. To understand the soil virome, much work remains.
Collapse
Affiliation(s)
- Kurt E Williamson
- Biology Department, College of William and Mary, Williamsburg, Virginia 23185;
| | - Jeffry J Fuhrmann
- Department of Plant and Soil Sciences, University of Delaware, Newark, Delaware 19716
| | - K Eric Wommack
- Department of Plant and Soil Sciences, University of Delaware, Newark, Delaware 19716.,Department Biological Sciences, University of Delaware, Newark, Delaware 19716.,College of Earth, Ocean, and Environment, University of Delaware, Newark, Delaware 19716
| | - Mark Radosevich
- Biosystems Engineering and Soil Science Department, University of Tennessee, Knoxville, Tennessee 37996
| |
Collapse
|
85
|
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol 2017; 35:1026-1028. [PMID: 29035372 DOI: 10.1038/nbt.3988] [Citation(s) in RCA: 1486] [Impact Index Per Article: 212.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
86
|
Cornman RS. Relative abundance of deformed wing virus, Varroa destructor virus 1, and their recombinants in honey bees (Apis mellifera) assessed by kmer analysis of public RNA-Seq data. J Invertebr Pathol 2017; 149:44-50. [PMID: 28743669 DOI: 10.1016/j.jip.2017.07.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 07/06/2017] [Accepted: 07/20/2017] [Indexed: 11/17/2022]
Abstract
Deformed wing virus (DWV) is a major pathogen of concern to apiculture, and recent reports have indicated the local predominance and potential virulence of recombinants between DWV and a related virus, Varroa destructor virus 1 (VDV). However, little is known about the frequency and titer of VDV and recombinants relative to DWV generally. In this study, I assessed the relative occurrence and titer of DWV and VDV in public RNA-seq accessions of honey bee using a rapid, kmer-based approach. Three recombinant types were detectable graphically and corroborated by de novo assembly. Recombination breakpoints did not disrupt the capsid-encoding region, consistent with previous reports, and both VDV- and DWV-derived capsids were observed in recombinant backgrounds. High abundance of VDV kmers was largely restricted to recombinant forms. Non-metric multidimensional scaling identified genotypic clusters among DWV isolates, which was corroborated by read mapping and consensus generation. The recently described DWV-C lineage was not detected in the searched accessions. The data further highlight the utility of high-throughput sequencing to monitor viral polymorphisms and statistically test biological predictors of titer, and point to the need for consistent methodologies and sampling schemes.
Collapse
|
87
|
Ramos VMC, Castelo-Branco R, Leão PN, Martins J, Carvalhal-Gomes S, Sobrinho da Silva F, Mendonça Filho JG, Vasconcelos VM. Cyanobacterial Diversity in Microbial Mats from the Hypersaline Lagoon System of Araruama, Brazil: An In-depth Polyphasic Study. Front Microbiol 2017; 8:1233. [PMID: 28713360 PMCID: PMC5492833 DOI: 10.3389/fmicb.2017.01233] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Accepted: 06/19/2017] [Indexed: 11/24/2022] Open
Abstract
Microbial mats are complex, micro-scale ecosystems that can be found in a wide range of environments. In the top layer of photosynthetic mats from hypersaline environments, a large diversity of cyanobacteria typically predominates. With the aim of strengthening the knowledge on the cyanobacterial diversity present in the coastal lagoon system of Araruama (state of Rio de Janeiro, Brazil), we have characterized three mat samples by means of a polyphasic approach. We have used morphological and molecular data obtained by culture-dependent and -independent methods. Moreover, we have compared different classification methodologies and discussed the outcomes, challenges, and pitfalls of these methods. Overall, we show that Araruama's lagoons harbor a high cyanobacterial diversity. Thirty-six unique morphospecies could be differentiated, which increases by more than 15% the number of morphospecies and genera already reported for the entire Araruama system. Morphology-based data were compared with the 16S rRNA gene phylogeny derived from isolate sequences and environmental sequences obtained by PCR-DGGE and pyrosequencing. Most of the 48 phylotypes could be associated with the observed morphospecies at the order level. More than one third of the sequences demonstrated to be closely affiliated (best BLAST hit results of ≥99%) with cyanobacteria from ecologically similar habitats. Some sequences had no close relatives in the public databases, including one from an isolate, being placed as “loner” sequences within different orders. This hints at hidden cyanobacterial diversity in the mats of the Araruama system, while reinforcing the relevance of using complementary approaches to study cyanobacterial diversity.
Collapse
Affiliation(s)
- Vitor M C Ramos
- Faculty of Sciences, University of PortoPorto, Portugal.,Interdisciplinary Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of PortoMatosinhos, Portugal
| | - Raquel Castelo-Branco
- Interdisciplinary Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of PortoMatosinhos, Portugal
| | - Pedro N Leão
- Interdisciplinary Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of PortoMatosinhos, Portugal
| | - Joana Martins
- Faculty of Sciences, University of PortoPorto, Portugal.,Interdisciplinary Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of PortoMatosinhos, Portugal
| | - Sinda Carvalhal-Gomes
- Palynofacies and Organic Facies Laboratory, Department of Geology, Federal University of Rio de JaneiroRio de Janeiro, Brazil
| | - Frederico Sobrinho da Silva
- Palynofacies and Organic Facies Laboratory, Department of Geology, Federal University of Rio de JaneiroRio de Janeiro, Brazil
| | - João G Mendonça Filho
- Palynofacies and Organic Facies Laboratory, Department of Geology, Federal University of Rio de JaneiroRio de Janeiro, Brazil
| | - Vitor M Vasconcelos
- Faculty of Sciences, University of PortoPorto, Portugal.,Interdisciplinary Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of PortoMatosinhos, Portugal
| |
Collapse
|
88
|
van der Walt AJ, van Goethem MW, Ramond JB, Makhalanyane TP, Reva O, Cowan DA. Assembling metagenomes, one community at a time. BMC Genomics 2017; 18:521. [PMID: 28693474 PMCID: PMC5502489 DOI: 10.1186/s12864-017-3918-9] [Citation(s) in RCA: 75] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 07/02/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Metagenomics allows unprecedented access to uncultured environmental microorganisms. The analysis of metagenomic sequences facilitates gene prediction and annotation, and enables the assembly of draft genomes, including uncultured members of a community. However, while several platforms have been developed for this critical step, there is currently no clear framework for the assembly of metagenomic sequence data. RESULTS To assist with selection of an appropriate metagenome assembler we evaluated the capabilities of nine prominent assembly tools on nine publicly-available environmental metagenomes, as well as three simulated datasets. Overall, we found that SPAdes provided the largest contigs and highest N50 values across 6 of the 9 environmental datasets, followed by MEGAHIT and metaSPAdes. MEGAHIT emerged as a computationally inexpensive alternative to SPAdes, assembling the most complex dataset using less than 500 GB of RAM and within 10 hours. CONCLUSIONS We found that assembler choice ultimately depends on the scientific question, the available resources and the bioinformatic competence of the researcher. We provide a concise workflow for the selection of the best assembly tool.
Collapse
Affiliation(s)
- Andries Johannes van der Walt
- Centre for Microbial Ecology and Genomics (CMEG), Department of Genetics, University of Pretoria, Natural Sciences Building 2, Lynnwood Road, Pretoria, 0028, South Africa.,Centre for Bioinformatics and Computational Biology, Department of Biochemistry, University of Pretoria, Pretoria, South Africa
| | - Marc Warwick van Goethem
- Centre for Microbial Ecology and Genomics (CMEG), Department of Genetics, University of Pretoria, Natural Sciences Building 2, Lynnwood Road, Pretoria, 0028, South Africa
| | - Jean-Baptiste Ramond
- Centre for Microbial Ecology and Genomics (CMEG), Department of Genetics, University of Pretoria, Natural Sciences Building 2, Lynnwood Road, Pretoria, 0028, South Africa
| | - Thulani Peter Makhalanyane
- Centre for Microbial Ecology and Genomics (CMEG), Department of Genetics, University of Pretoria, Natural Sciences Building 2, Lynnwood Road, Pretoria, 0028, South Africa
| | - Oleg Reva
- Centre for Bioinformatics and Computational Biology, Department of Biochemistry, University of Pretoria, Pretoria, South Africa
| | - Don Arthur Cowan
- Centre for Microbial Ecology and Genomics (CMEG), Department of Genetics, University of Pretoria, Natural Sciences Building 2, Lynnwood Road, Pretoria, 0028, South Africa.
| |
Collapse
|
89
|
Tiwari R, Nain L, Labrou NE, Shukla P. Bioprospecting of functional cellulases from metagenome for second generation biofuel production: a review. Crit Rev Microbiol 2017; 44:244-257. [DOI: 10.1080/1040841x.2017.1337713] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Rameshwar Tiwari
- Department of Microbiology, Laboratory of Enzyme Technology and Protein Bioinformatics, Maharshi Dayanand University, Rohtak, India
- Division of Microbiology, Indian Agricultural Research Institute, New Delhi, India
| | - Lata Nain
- Division of Microbiology, Indian Agricultural Research Institute, New Delhi, India
| | - Nikolaos E. Labrou
- Department of Biotechnology, School of Food, Biotechnology and Development, Laboratory of Enzyme Technology, Agricultural University of Athens, Athens, Greece
| | - Pratyoosh Shukla
- Department of Microbiology, Laboratory of Enzyme Technology and Protein Bioinformatics, Maharshi Dayanand University, Rohtak, India
| |
Collapse
|
90
|
Morgan HH, du Toit M, Setati ME. The Grapevine and Wine Microbiome: Insights from High-Throughput Amplicon Sequencing. Front Microbiol 2017; 8:820. [PMID: 28553266 PMCID: PMC5425579 DOI: 10.3389/fmicb.2017.00820] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2016] [Accepted: 04/21/2017] [Indexed: 12/21/2022] Open
Abstract
From the time when microbial activity in wine fermentation was first demonstrated, the microbial ecology of the vineyard, grape, and wine has been extensively investigated using culture-based methods. However, the last 2 decades have been characterized by an important change in the approaches used for microbial examination, due to the introduction of DNA-based community fingerprinting methods such as DGGE, SSCP, T-RFLP, and ARISA. These approaches allowed for the exploration of microbial community structures without the need to cultivate, and have been extensively applied to decipher the microbial populations associated with the grapevine as well as the microbial dynamics throughout grape berry ripening and wine fermentation. These techniques are well-established for the rapid more sensitive profiling of microbial communities; however, they often do not provide direct taxonomic information and possess limited ability to detect the presence of rare taxa and taxa with low abundance. Consequently, the past 5 years have seen an upsurge in the application of high-throughput sequencing methods for the in-depth assessment of the grapevine and wine microbiome. Although a relatively new approach in wine sciences, these methods reveal a considerably greater diversity than previously reported, and identified several species that had not yet been reported. The aim of the current review is to highlight the contribution of high-throughput next generation sequencing and metagenomics approaches to vineyard microbial ecology especially unraveling the influence of vineyard management practices on microbial diversity.
Collapse
Affiliation(s)
- Horatio H Morgan
- Department of Viticulture and Oenology, Institute for Wine Biotechnology, Stellenbosch UniversityStellenbosch, South Africa
| | - Maret du Toit
- Department of Viticulture and Oenology, Institute for Wine Biotechnology, Stellenbosch UniversityStellenbosch, South Africa
| | - Mathabatha E Setati
- Department of Viticulture and Oenology, Institute for Wine Biotechnology, Stellenbosch UniversityStellenbosch, South Africa
| |
Collapse
|
91
|
Joyce BL, Haug-Baltzell AK, Hulvey JP, McCarthy F, Devisetty UK, Lyons E. Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms. J Vis Exp 2017. [PMID: 28518075 PMCID: PMC5607918 DOI: 10.3791/55009] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
This workflow allows novice researchers to leverage advanced computational resources such as cloud computing to carry out pairwise comparative transcriptomics. It also serves as a primer for biologists to develop data scientist computational skills, e.g. executing bash commands, visualization and management of large data sets. All command line code and further explanations of each command or step can be found on the wiki (https://wiki.cyverse.org/wiki/x/dgGtAQ). The Discovery Environment and Atmosphere platforms are connected together through the CyVerse Data Store. As such, once the initial raw sequencing data has been uploaded there is no more need to transfer large data files over an Internet connection, minimizing the amount of time needed to conduct analyses. This protocol is designed to analyze only two experimental treatments or conditions. Differential gene expression analysis is conducted through pairwise comparisons, and will not be suitable to test multiple factors. This workflow is also designed to be manual rather than automated. Each step must be executed and investigated by the user, yielding a better understanding of data and analytical outputs, and therefore better results for the user. Once complete, this protocol will yield de novo assembled transcriptome(s) for underserved (non-model) organisms without the need to map to previously assembled reference genomes (which are usually not available in underserved organism). These de novo transcriptomes are further used in pairwise differential gene expression analysis to investigate genes differing between two experimental conditions. Differentially expressed genes are then functionally annotated to understand the genetic response organisms have to experimental conditions. In total, the data derived from this protocol is used to test hypotheses about biological responses of underserved organisms.
Collapse
Affiliation(s)
- Blake L Joyce
- BIO5 Institute, University of Arizona; The School of Plant Sciences, University of Arizona;
| | | | | | - Fiona McCarthy
- School of Animal and Comparative Biomedical Sciences, University of Arizona
| | | | - Eric Lyons
- BIO5 Institute, University of Arizona; The School of Plant Sciences, University of Arizona; Genetics GIDP, University of Arizona
| |
Collapse
|
92
|
Abstract
High-throughput technologies have revolutionized medical research. The advent of genotyping arrays enabled large-scale genome-wide association studies and methods for examining global transcript levels, which gave rise to the field of “integrative genetics”. Other omics technologies, such as proteomics and metabolomics, are now often incorporated into the everyday methodology of biological researchers. In this review, we provide an overview of such omics technologies and focus on methods for their integration across multiple omics layers. As compared to studies of a single omics type, multi-omics offers the opportunity to understand the flow of information that underlies disease.
Collapse
Affiliation(s)
- Yehudit Hasin
- Department of Medicine, University of California, 10833 Le Conte Avenue, A2-237 CHS, Los Angeles, CA, 90095, USA.,Department of Human Genetics, University of California, 10833 Le Conte Avenue, A2-237 CHS, Los Angeles, CA, 90095, USA
| | - Marcus Seldin
- Department of Medicine, University of California, 10833 Le Conte Avenue, A2-237 CHS, Los Angeles, CA, 90095, USA
| | - Aldons Lusis
- Department of Medicine, University of California, 10833 Le Conte Avenue, A2-237 CHS, Los Angeles, CA, 90095, USA. .,Department of Microbiology, Immunology and Molecular Genetics, 10833 Le Conte Avenue, A2-237 CHS, Los Angeles, CA, 90095, USA. .,Department of Human Genetics, University of California, 10833 Le Conte Avenue, A2-237 CHS, Los Angeles, CA, 90095, USA.
| |
Collapse
|
93
|
Lin YY, Hsieh CH, Chen JH, Lu X, Kao JH, Chen PJ, Chen DS, Wang HY. De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline. BMC Bioinformatics 2017; 18:223. [PMID: 28446139 PMCID: PMC5406902 DOI: 10.1186/s12859-017-1630-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 04/12/2017] [Indexed: 12/18/2022] Open
Abstract
Background The accuracy of metagenomic assembly is usually compromised by high levels of polymorphism due to divergent reads from the same genomic region recognized as different loci when sequenced and assembled together. A viral quasispecies is a group of abundant and diversified genetically related viruses found in a single carrier. Current mainstream assembly methods, such as Velvet and SOAPdenovo, were not originally intended for the assembly of such metagenomics data, and therefore demands for new methods to provide accurate and informative assembly results for metagenomic data. Results In this study, we present a hybrid method for assembling highly polymorphic data combining the partial de novo-reference assembly (PDR) strategy and the BLAST-based assembly pipeline (BBAP). The PDR strategy generates in situ reference sequences through de novo assembly of a randomly extracted partial data set which is subsequently used for the reference assembly for the full data set. BBAP employs a greedy algorithm to assemble polymorphic reads. We used 12 hepatitis B virus quasispecies NGS data sets from a previous study to assess and compare the performance of both PDR and BBAP. Analyses suggest the high polymorphism of a full metagenomic data set leads to fragmentized de novo assembly results, whereas the biased or limited representation of external reference sequences included fewer reads into the assembly with lower assembly accuracy and variation sensitivity. In comparison, the PDR generated in situ reference sequence incorporated more reads into the final PDR assembly of the full metagenomics data set along with greater accuracy and higher variation sensitivity. BBAP assembly results also suggest higher assembly efficiency and accuracy compared to other assembly methods. Additionally, BBAP assembly recovered HBV structural variants that were not observed amongst assembly results of other methods. Together, PDR/BBAP assembly results were significantly better than other compared methods. Conclusions Both PDR and BBAP independently increased the assembly efficiency and accuracy of highly polymorphic data, and assembly performances were further improved when used together. BBAP also provides nucleotide frequency information. Together, PDR and BBAP provide powerful tools for metagenomic data studies. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1630-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- You-Yu Lin
- Department of Life Science, National Taiwan University, Taipei, 106, Taiwan. .,Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, 100, Taiwan.
| | - Chia-Hung Hsieh
- Department of Forestry and Nature Conservation, Chinese Culture University, Taipei, 111, Taiwan
| | - Jiun-Hong Chen
- Department of Life Science, National Taiwan University, Taipei, 106, Taiwan
| | - Xuemei Lu
- Laboratory of Disease Genomics and Individualized Medicine, Beijing Institute of Genomics, the Chinese Academy of Sciences, Beijing, 100101, China
| | - Jia-Horng Kao
- Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, 100, Taiwan
| | - Pei-Jer Chen
- Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, 100, Taiwan
| | - Ding-Shinn Chen
- Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, 100, Taiwan.,Genomics Research Center, Academia Sinica, Taipei, 115, Taiwan
| | - Hurng-Yi Wang
- Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, 100, Taiwan. .,Institute of Ecology and Evolutionary Biology, National Taiwan University, Taipei, 106, Taiwan. .,Research Center for Developmental Biology and Regenerative Medicine, National Taiwan University, Taipei, 100, Taiwan.
| |
Collapse
|
94
|
Garrido-Cardenas JA, Manzano-Agugliaro F. The metagenomics worldwide research. Curr Genet 2017; 63:819-829. [PMID: 28401295 DOI: 10.1007/s00294-017-0693-8] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2017] [Revised: 04/05/2017] [Accepted: 04/06/2017] [Indexed: 12/20/2022]
Abstract
Metagenomics is the technique, or set of techniques, whose main objective is to determine the microbial population that can be found in a determined environment, studied in the context of its community. For this, it uses the techniques of massive sequencing, or next generation sequencing, due to the difficulties presented by traditional techniques when trying to transfer all the microorganisms present in a given environment to the laboratory. Metagenomics is a newly created technique, which was born at the beginning of the twenty-first century, and since then the interest of the world scientific community in fields as diverse as medicine, biotechnology, agriculture or genetics has not left to grow. In this article, the authors make a historical review of the metagenomics, analyze and evaluate the different massive sequencing platforms used for metagenomic assays, review the current literature on this subject and advance future problems with which researchers who decide to go deeper in this field could find. In this way, the prior knowledge of the researcher will facilitate the approach of his research.
Collapse
|
95
|
Rubino F, Carberry C, M Waters S, Kenny D, McCabe MS, Creevey CJ. Divergent functional isoforms drive niche specialisation for nutrient acquisition and use in rumen microbiome. ISME JOURNAL 2017; 11:932-944. [PMID: 28085156 PMCID: PMC5364355 DOI: 10.1038/ismej.2016.172] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2016] [Revised: 07/28/2016] [Accepted: 09/21/2016] [Indexed: 01/16/2023]
Abstract
Many microbes in complex competitive environments share genes for acquiring and utilising nutrients, questioning whether niche specialisation exists and if so, how it is maintained. We investigated the genomic signatures of niche specialisation in the rumen microbiome, a highly competitive, anaerobic environment, with limited nutrient availability determined by the biomass consumed by the host. We generated individual metagenomic libraries from 14 cows fed an ad libitum diet of grass silage and calculated functional isoform diversity for each microbial gene identified. The animal replicates were used to calculate confidence intervals to test for differences in diversity of functional isoforms between microbes that may drive niche specialisation. We identified 153 genes with significant differences in functional isoform diversity between the two most abundant bacterial genera in the rumen (Prevotella and Clostridium). We found Prevotella possesses a more diverse range of isoforms capable of degrading hemicellulose, whereas Clostridium for cellulose. Furthermore, significant differences were observed in key metabolic processes indicating that isoform diversity plays an important role in maintaining their niche specialisation. The methods presented represent a novel approach for untangling complex interactions between microorganisms in natural environments and have resulted in an expanded catalogue of gene targets central to rumen cellulosic biomass degradation.
Collapse
Affiliation(s)
- Francesco Rubino
- Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, UK.,Animal and Bioscience Research Department, Teagasc, Grange, Dunsany, Co., Meath, Ireland
| | - Ciara Carberry
- Animal and Bioscience Research Department, Teagasc, Grange, Dunsany, Co., Meath, Ireland.,School of Agriculture, University College Dublin, Dublin, Ireland
| | - Sinéad M Waters
- Animal and Bioscience Research Department, Teagasc, Grange, Dunsany, Co., Meath, Ireland
| | - David Kenny
- Animal and Bioscience Research Department, Teagasc, Grange, Dunsany, Co., Meath, Ireland
| | - Matthew S McCabe
- Animal and Bioscience Research Department, Teagasc, Grange, Dunsany, Co., Meath, Ireland
| | - Christopher J Creevey
- Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, UK
| |
Collapse
|
96
|
Abstract
Gene finding is the process of identifying genome sequence regions representing stretches of DNA that encode biologically active products, such as proteins or functional noncoding RNAs. As this is usually the first step in the analysis of any novel genomic sequence or resequenced sample of well-known organisms, it is a very important issue, as all downstream analyses depend on the results. This chapter describes the biological basis for gene finding, and the programs and computational approaches that are available for the automated identification of protein-coding genes. For bacterial, archaeal, and eukaryotic genomes, as well as for multi-species sequence data originating from environmental community studies, the state of the art in automated gene finding is described.
Collapse
Affiliation(s)
- Alice Carolyn McHardy
- Department for Algorithmic Bioinformatics, Heinrich Heine University, Düsseldorf, Germany.
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany.
| | - Andreas Kloetgen
- Department for Algorithmic Bioinformatics, Heinrich Heine University, Düsseldorf, Germany
- Department of Pediatric Oncology, Hematology and Clinical Immunology, Heinrich Heine University, Düsseldorf, Germany
| |
Collapse
|
97
|
Schee genannt Halfmann S, Evangelatos N, Schröder-Bäck P, Brand A. European healthcare systems readiness to shift from ‘one-size fits all’ to personalized medicine. Per Med 2017; 14:63-74. [DOI: 10.2217/pme-2016-0061] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Personalized medicine (PM) is no longer an abstract healthcare approach. It has become a reality over the last years and is already successfully applied in the various medical fields. Although there are success stories of implementing PM, there are still many more opportunities to further implement and make full use of the potential of PM. We assessed the system readiness of healthcare systems in Europe to shift from the predominant ‘one size fits all’ healthcare approach to PM. We conclude that European healthcare systems are only partially ready for PM. Key challenges such as integration of big data, health literacy, reimbursement and regulatory issues need to be overcome in order to strengthen the implementation and uptake of PM.
Collapse
Affiliation(s)
- Sebastian Schee genannt Halfmann
- Maastricht Economic & Social Research Institute on Innovation & Technology (MERIT), Maastricht University, Boschstraat 24, 6211AX Maastricht, The Netherlands
| | - Nikolaos Evangelatos
- Maastricht Economic & Social Research Institute on Innovation & Technology (MERIT), Maastricht University, Boschstraat 24, 6211AX Maastricht, The Netherlands
- University Clinic for Emergency & Intensive Care Medicine, Paracelsus Medical University (PMU), Prof. Ernst-Nathan-Strasse 1, 90419 Nuremberg, Germany
| | - Peter Schröder-Bäck
- Department of International Health, School CAPHRI, Maastricht University, Duboisdomein 30, 6229 GT Maastricht, The Netherlands
- Faculty for Health & Human Sciences, University of Bremen, Grazer Strasse 2, 28359 Bremen, Germany
| | - Angela Brand
- Maastricht Economic & Social Research Institute on Innovation & Technology (MERIT), Maastricht University, Boschstraat 24, 6211AX Maastricht, The Netherlands
- Faculty of Health, Medicine & Life Sciences, Maastricht University, P.O. Box 616, 6200 MD Maastricht, The Netherlands
| |
Collapse
|
98
|
Sedlar K, Kupkova K, Provaznik I. Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics. Comput Struct Biotechnol J 2016; 15:48-55. [PMID: 27980708 PMCID: PMC5148923 DOI: 10.1016/j.csbj.2016.11.005] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Revised: 11/24/2016] [Accepted: 11/26/2016] [Indexed: 12/11/2022] Open
Abstract
One of main steps in a study of microbial communities is resolving their composition, diversity and function. In the past, these issues were mostly addressed by the use of amplicon sequencing of a target gene because of reasonable price and easier computational postprocessing of the bioinformatic data. With the advancement of sequencing techniques, the main focus shifted to the whole metagenome shotgun sequencing, which allows much more detailed analysis of the metagenomic data, including reconstruction of novel microbial genomes and to gain knowledge about genetic potential and metabolic capacities of whole environments. On the other hand, the output of whole metagenomic shotgun sequencing is mixture of short DNA fragments belonging to various genomes, therefore this approach requires more sophisticated computational algorithms for clustering of related sequences, commonly referred to as sequence binning. There are currently two types of binning methods: taxonomy dependent and taxonomy independent. The first type classifies the DNA fragments by performing a standard homology inference against a reference database, while the latter performs the reference-free binning by applying clustering techniques on features extracted from the sequences. In this review, we describe the strategies within the second approach. Although these strategies do not require prior knowledge, they have higher demands on the length of sequences. Besides their basic principle, an overview of particular methods and tools is provided. Furthermore, the review covers the utilization of the methods in context with the length of sequences and discusses the needs for metagenomic data preprocessing in form of initial assembly prior to binning.
Collapse
Affiliation(s)
- Karel Sedlar
- Department of Biomedical Engineering, Brno University of Technology, Technicka 12, Brno, Czech Republic
| | | | | |
Collapse
|
99
|
Ambardar S, Gupta R, Trakroo D, Lal R, Vakhlu J. High Throughput Sequencing: An Overview of Sequencing Chemistry. Indian J Microbiol 2016; 56:394-404. [PMID: 27784934 PMCID: PMC5061697 DOI: 10.1007/s12088-016-0606-4] [Citation(s) in RCA: 123] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Accepted: 06/22/2016] [Indexed: 01/19/2023] Open
Abstract
In the present century sequencing is to the DNA science, what gel electrophoresis was to it in the last century. From 1977 to 2016 three generation of the sequencing technologies of various types have been developed. Second and third generation sequencing technologies referred commonly to as next generation sequencing technology, has evolved significantly with increase in sequencing speed, decrease in sequencing cost, since its inception in 2004. GS FLX by 454 Life Sciences/Roche diagnostics, Genome Analyzer, HiSeq, MiSeq and NextSeq by Illumina, Inc., SOLiD by ABI, Ion Torrent by Life Technologies are various type of the sequencing platforms available for second generation sequencing. The platforms available for the third generation sequencing are Helicos™ Genetic Analysis System by SeqLL, LLC, SMRT Sequencing by Pacific Biosciences, Nanopore sequencing by Oxford Nanopore's, Complete Genomics by Beijing Genomics Institute and GnuBIO by BioRad, to name few. The present article is an overview of the principle and the sequencing chemistry of these high throughput sequencing technologies along with brief comparison of various types of sequencing platforms available.
Collapse
Affiliation(s)
- Sheetal Ambardar
- Metagenomics Laboratory, School of Biotechnology, University of Jammu, Jammu, J&K India
- Centre for Cellular and Molecular Platform, National Centre for Biological Sciences, TIFR Bangalore, Bangalore, India
- Institute of Trans-Disciplinary Health Sciences and Technology, Trans-Disciplinary University, Bangalore, 560064 India
| | - Rikita Gupta
- Metagenomics Laboratory, School of Biotechnology, University of Jammu, Jammu, J&K India
| | - Deepika Trakroo
- Metagenomics Laboratory, School of Biotechnology, University of Jammu, Jammu, J&K India
| | - Rup Lal
- Molecular Biology Laboratory, Department of Zoology, South Campus, University of Delhi, Delhi, India
| | - Jyoti Vakhlu
- Metagenomics Laboratory, School of Biotechnology, University of Jammu, Jammu, J&K India
| |
Collapse
|
100
|
Kim H, Rha E, Seong W, Yeom SJ, Lee DH, Lee SG. A Cell-Cell Communication-Based Screening System for Novel Microbes with Target Enzyme Activities. ACS Synth Biol 2016; 5:1231-1238. [PMID: 27452868 DOI: 10.1021/acssynbio.5b00287] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The development of synthetic biological devices has increased rapidly in recent years and the practical benefits of such biological devices are becoming increasingly clear. Here, we further improved the design of a previously reported high-throughput genetic enzyme screening system by investigating device-compatible biological components and phenol-mediated cell-cell communication, both of which increased the efficiency and practicality of the screening device without requiring the use of flow cytometry analysis. A sensor cell was designed to detect novel microbes with target enzyme activities on solid media by forming clear, circular colonies with fluorescence around the unknown microbes producing target enzymes. This mechanism of detection was enabled by the combination of pre-effector phenolic substrate treatment in the presence of target enzyme-producing microbes and control of the growth and fluorescence of remote sensor cells via phenol-mediated cell-cell communication. The sensor cells were applied to screen soil bacteria with phosphatase activity using phenyl phosphate as phenolic substrates. The sensor cells facilitated successful visualization of phosphatase activity in unknown microbes, which were identified by 16S rRNA analysis. Enzyme activity assays confirmed that the proposed screening technique was able to find 23 positive clones out of 33 selected colonies. Since many natural enzymatic reactions produce phenolic compounds from phenol-derived substrates, we anticipate that the proposed technique may have broad applications in the assessment and screening of novel microbes with target enzymes of interest. This method also can provide insights into the identification of novel enzymes for which screening assays are not yet available.
Collapse
Affiliation(s)
- Haseong Kim
- Synthetic Biology & Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology, 125 Gwahak-ro, Yuseong-gu, Daejeon, South Korea
| | - Eugene Rha
- Synthetic Biology & Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology, 125 Gwahak-ro, Yuseong-gu, Daejeon, South Korea
| | - Wonjae Seong
- Synthetic Biology & Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology, 125 Gwahak-ro, Yuseong-gu, Daejeon, South Korea
- Biosystems
and Bioengineering Program, University of Science and Technology, 217 Gajung-ro, Yuseong-gu, Daejeon, South Korea
| | - Soo-Jin Yeom
- Synthetic Biology & Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology, 125 Gwahak-ro, Yuseong-gu, Daejeon, South Korea
| | - Dae-Hee Lee
- Synthetic Biology & Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology, 125 Gwahak-ro, Yuseong-gu, Daejeon, South Korea
- Biosystems
and Bioengineering Program, University of Science and Technology, 217 Gajung-ro, Yuseong-gu, Daejeon, South Korea
| | - Seung-Goo Lee
- Synthetic Biology & Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology, 125 Gwahak-ro, Yuseong-gu, Daejeon, South Korea
- Biosystems
and Bioengineering Program, University of Science and Technology, 217 Gajung-ro, Yuseong-gu, Daejeon, South Korea
| |
Collapse
|