1
|
Sequeira JC, Pereira V, Alves MM, Pereira MA, Rocha M, Salvador AF. MOSCA 2.0: A bioinformatics framework for metagenomics, metatranscriptomics and metaproteomics data analysis and visualization. Mol Ecol Resour 2024; 24:e13996. [PMID: 39099161 DOI: 10.1111/1755-0998.13996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 06/14/2024] [Accepted: 07/15/2024] [Indexed: 08/06/2024]
Abstract
The analysis of meta-omics data requires the utilization of several bioinformatics tools and proficiency in informatics. The integration of multiple meta-omics data is even more challenging, and the outputs of existing bioinformatics solutions are not always easy to interpret. Here, we present a meta-omics bioinformatics pipeline, Meta-Omics Software for Community Analysis (MOSCA), which aims to overcome these limitations. MOSCA was initially developed for analysing metagenomics (MG) and metatranscriptomics (MT) data. Now, it also performs MG and metaproteomics (MP) integrated analysis, and MG/MT analysis was upgraded with an additional iterative binning step, metabolic pathways mapping, and several improvements regarding functional annotation and data visualization. MOSCA handles raw sequencing data and mass spectra and performs pre-processing, assembly, annotation, binning and differential gene/protein expression analysis. MOSCA shows taxonomic and functional analysis in large tables, performs metabolic pathways mapping, generates Krona plots and shows gene/protein expression results in heatmaps, improving omics data visualization. MOSCA is easily run from a single command while also providing a web interface (MOSGUITO). Relevant features include an extensive set of customization options, allowing tailored analyses to suit specific research objectives, and the ability to restart the pipeline from intermediary checkpoints using alternative configurations. Two case studies showcased MOSCA results, giving a complete view of the anaerobic microbial communities from anaerobic digesters and insights on the role of specific microorganisms. MOSCA represents a pivotal advancement in meta-omics research, offering an intuitive, comprehensive, and versatile solution for researchers seeking to unravel the intricate tapestry of microbial communities.
Collapse
Affiliation(s)
- João C Sequeira
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Vítor Pereira
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - M Madalena Alves
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - M Alcina Pereira
- Centre of Biological Engineering, University of Minho, Braga, Portugal
- LABBELS - Associate Laboratory, Braga/Guimarães, Portugal
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
- LABBELS - Associate Laboratory, Braga/Guimarães, Portugal
| | - Andreia F Salvador
- Centre of Biological Engineering, University of Minho, Braga, Portugal
- LABBELS - Associate Laboratory, Braga/Guimarães, Portugal
| |
Collapse
|
2
|
Ma ZS. Metagenome comparison (MC): A new framework for detecting unique/enriched OMUs (operational metagenomic units) derived from whole-genome sequencing reads. Comput Biol Med 2024; 180:108852. [PMID: 39137667 DOI: 10.1016/j.compbiomed.2024.108852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 07/04/2024] [Accepted: 07/04/2024] [Indexed: 08/15/2024]
Abstract
BACKGROUND Current methods for comparing metagenomes, derived from whole-genome sequencing reads, include top-down metrics or parametric models such as metagenome-diversity, and bottom-up, non-parametric, model-free machine learning approaches like Naïve Bayes for k-mer-profiling. However, both types are limited in their ability to effectively and comprehensively identify and catalogue unique or enriched metagenomic genes, a critical task in comparative metagenomics. This challenge is significant and complex due to its NP-hard nature, which means computational time grows exponentially, or even faster, with the problem size, rendering it impractical for even the fastest supercomputers without heuristic approximation algorithms. METHOD In this study, we introduce a new framework, MC (Metagenome-Comparison), designed to exhaustively detect and catalogue unique or enriched metagenomic genes (MGs) and their derivatives, including metagenome functional gene clusters (MFGC), or more generally, the operational metagenomic unit (OMU) that can be considered the counterpart of the OTU (operational taxonomic unit) from amplicon sequencing reads. The MC is essentially a heuristic search algorithm guided by pairs of new metrics (termed MG-specificity or OMU-specificity, MG-specificity diversity or OMU-specificity diversity). It is further constrained by statistical significance (P-value) implemented as a pair of statistical tests. RESULTS We evaluated the MC using large metagenomic datasets related to obesity, diabetes, and IBD, and found that the proportions of unique and enriched metagenomic genes ranged from 0.001% to 0.08 % and 0.08%-0.82 % respectively, and less than 10 % for the MFGC. CONCLUSION The MC provides a robust method for comparing metagenomes at various scales, from baseline MGs to various function/pathway clusters of metagenomes, collectively termed OMUs.
Collapse
Affiliation(s)
- Zhanshan Sam Ma
- Faculty of Arts and Sciences Harvard University Cambridge, MA, 02138, USA; Microbiome Medicine and Advanced AI Lab, Cambridge, MA, 02138, USA; Computational Biology and Medical Ecology Lab Kunming Institute of Zoology Chinese Academy of Sciences, China.
| |
Collapse
|
3
|
Nakamura K, Asano S, Nambu M, Ishiguro N, Tanikawa A, Naganuma N. Metagenome-assembled genome sequences of two bacterial species from polyvinyl alcohol-degrading co-colonies. Microbiol Resour Announc 2024; 13:e0036324. [PMID: 38860814 PMCID: PMC11256862 DOI: 10.1128/mra.00363-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 05/17/2024] [Indexed: 06/12/2024] Open
Abstract
We present the metagenome-assembled genome sequences of two polyvinyl alcohol-degrading co-colony-derived bacterial species relative to Rhodanobacter sp. DHB23 and Priestia megaterium ATCC 14581. We estimated the genomes of these species to be 3,476,996- and 5,169,587-bp long (for Rhodanobacter sp. DHB23 and Priestia megaterium ATCC 14581, respectively).
Collapse
Affiliation(s)
- Kohei Nakamura
- Faculty of Applied Biological Sciences, Gifu University, Gifu, Japan
| | - Sota Asano
- Faculty of Applied Biological Sciences, Gifu University, Gifu, Japan
| | - Masashi Nambu
- Basic Technology Research Division R&D Department, AICELLO CORPORATION, Aichi, Japan
| | - Nanami Ishiguro
- Basic Technology Research Division R&D Department, AICELLO CORPORATION, Aichi, Japan
| | - Atsushi Tanikawa
- Basic Technology Research Division R&D Department, AICELLO CORPORATION, Aichi, Japan
| | - Nobuaki Naganuma
- Basic Technology Research Division R&D Department, AICELLO CORPORATION, Aichi, Japan
| |
Collapse
|
4
|
Becker D, Popp D, Bonk F, Kleinsteuber S, Harms H, Centler F. Metagenomic Analysis of Anaerobic Microbial Communities Degrading Short-Chain Fatty Acids as Sole Carbon Sources. Microorganisms 2023; 11:microorganisms11020420. [PMID: 36838385 PMCID: PMC9959488 DOI: 10.3390/microorganisms11020420] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 02/04/2023] [Indexed: 02/11/2023] Open
Abstract
Analyzing microbial communities using metagenomes is a powerful approach to understand compositional structures and functional connections in anaerobic digestion (AD) microbiomes. Whereas short-read sequencing approaches based on the Illumina platform result in highly fragmented metagenomes, long-read sequencing leads to more contiguous assemblies. To evaluate the performance of a hybrid approach of these two sequencing approaches we compared the metagenome-assembled genomes (MAGs) resulting from five AD microbiome samples. The samples were taken from reactors fed with short-chain fatty acids at different feeding regimes (continuous and discontinuous) and organic loading rates (OLR). Methanothrix showed a high relative abundance at all feeding regimes but was strongly reduced in abundance at higher OLR, when Methanosarcina took over. The bacterial community composition differed strongly between reactors of different feeding regimes and OLRs. However, the functional potential was similar regardless of feeding regime and OLR. The hybrid sequencing approach using Nanopore long-reads and Illumina MiSeq reads improved assembly statistics, including an increase of the N50 value (on average from 32 to 1740 kbp) and an increased length of the longest contig (on average from 94 to 1898 kbp). The hybrid approach also led to a higher share of high-quality MAGs and generated five potentially circular genomes while none were generated using MiSeq-based contigs only. Finally, 27 hybrid MAGs were reconstructed of which 18 represent potentially new species-15 of them bacterial species. During pathway analysis, selected MAGs revealed similar gene patterns of butyrate degradation and might represent new butyrate-degrading bacteria. The demonstrated advantages of adding long reads to metagenomic analyses make the hybrid approach the preferable option when dealing with complex microbiomes.
Collapse
Affiliation(s)
- Daniela Becker
- UFZ—Helmholtz Centre for Environmental Research, Department of Environmental Microbiology, Permoserstr 15, 04318 Leipzig, Germany
- IAV GmbH, Kauffahrtei 23-25, 09120 Chemnitz, Germany
| | - Denny Popp
- UFZ—Helmholtz Centre for Environmental Research, Department of Environmental Microbiology, Permoserstr 15, 04318 Leipzig, Germany
- Institute of Human Genetics, University of Leipzig Medical Center, Philipp-Rosenthal-Str. 55, 04103 Leipzig, Germany
| | - Fabian Bonk
- UFZ—Helmholtz Centre for Environmental Research, Department of Environmental Microbiology, Permoserstr 15, 04318 Leipzig, Germany
- VERBIO Vereinigte Bioenergie AG, Thura Mark 18, 06780 Zörbig, Germany
| | - Sabine Kleinsteuber
- UFZ—Helmholtz Centre for Environmental Research, Department of Environmental Microbiology, Permoserstr 15, 04318 Leipzig, Germany
| | - Hauke Harms
- UFZ—Helmholtz Centre for Environmental Research, Department of Environmental Microbiology, Permoserstr 15, 04318 Leipzig, Germany
| | - Florian Centler
- UFZ—Helmholtz Centre for Environmental Research, Department of Environmental Microbiology, Permoserstr 15, 04318 Leipzig, Germany
- School of Life Sciences, University of Siegen, 57076 Siegen, Germany
- Correspondence:
| |
Collapse
|
5
|
Viehweger A, Marquet M, Hölzer M, Dietze N, Pletz MW, Brandt C. Nanopore-based enrichment of antimicrobial resistance genes - a case-based study. GIGABYTE 2023; 2023:gigabyte75. [PMID: 36949817 PMCID: PMC10027057 DOI: 10.46471/gigabyte.75] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 01/11/2023] [Indexed: 01/26/2023] Open
Abstract
Rapid screening of hospital admissions to detect asymptomatic carriers of resistant bacteria can prevent pathogen outbreaks. However, the resulting isolates rarely have their genome sequenced due to cost constraints and long turn-around times to get and process the data, limiting their usefulness to the practitioner. Here we used real-time, on-device target enrichment ("adaptive") sequencing as a highly multiplexed assay covering 1,147 antimicrobial resistance genes. We compared its utility against standard and metagenomic sequencing, focusing on an isolate of Raoultella ornithinolytica harbouring three carbapenemases (NDM, KPC, VIM). Based on this experimental data, we then modelled the influence of several variables on the enrichment results and predicted the large effect of nucleotide identity (higher is better) and read length (shorter is better). Lastly, we showed how all relevant resistance genes are detected using adaptive sequencing on a miniature ("Flongle") flow cell, motivating its use in a clinical setting to monitor similar cases and their surroundings.
Collapse
Affiliation(s)
- Adrian Viehweger
- Institute of Medical Microbiology and Virology, University Hospital Leipzig, Leipzig, Germany
| | - Mike Marquet
- Institute for Infectious Diseases and Infection Control, Jena University Hospital, Jena, Germany
| | - Martin Hölzer
- MF1 Bioinformatics, Robert Koch Institute, Berlin, Germany
| | - Nadine Dietze
- Institute of Medical Microbiology and Virology, University Hospital Leipzig, Leipzig, Germany
| | - Mathias W. Pletz
- Institute for Infectious Diseases and Infection Control, Jena University Hospital, Jena, Germany
| | - Christian Brandt
- Institute for Infectious Diseases and Infection Control, Jena University Hospital, Jena, Germany
| |
Collapse
|
6
|
Chen JW, Shrestha L, Green G, Leier A, Marquez-Lago TT. The hitchhikers' guide to RNA sequencing and functional analysis. Brief Bioinform 2023; 24:bbac529. [PMID: 36617463 PMCID: PMC9851315 DOI: 10.1093/bib/bbac529] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 10/18/2022] [Accepted: 11/07/2022] [Indexed: 01/10/2023] Open
Abstract
DNA and RNA sequencing technologies have revolutionized biology and biomedical sciences, sequencing full genomes and transcriptomes at very high speeds and reasonably low costs. RNA sequencing (RNA-Seq) enables transcript identification and quantification, but once sequencing has concluded researchers can be easily overwhelmed with questions such as how to go from raw data to differential expression (DE), pathway analysis and interpretation. Several pipelines and procedures have been developed to this effect. Even though there is no unique way to perform RNA-Seq analysis, it usually follows these steps: 1) raw reads quality check, 2) alignment of reads to a reference genome, 3) aligned reads' summarization according to an annotation file, 4) DE analysis and 5) gene set analysis and/or functional enrichment analysis. Each step requires researchers to make decisions, and the wide variety of options and resulting large volumes of data often lead to interpretation challenges. There also seems to be insufficient guidance on how best to obtain relevant information and derive actionable knowledge from transcription experiments. In this paper, we explain RNA-Seq steps in detail and outline differences and similarities of different popular options, as well as advantages and disadvantages. We also discuss non-coding RNA analysis, multi-omics, meta-transcriptomics and the use of artificial intelligence methods complementing the arsenal of tools available to researchers. Lastly, we perform a complete analysis from raw reads to DE and functional enrichment analysis, visually illustrating how results are not absolute truths and how algorithmic decisions can greatly impact results and interpretation.
Collapse
Affiliation(s)
- Jiung-Wen Chen
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Lisa Shrestha
- Department of Genetics, University of Alabama at Birmingham, School of Medicine, Birmingham, AL, USA
| | - George Green
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - André Leier
- Department of Genetics, University of Alabama at Birmingham, School of Medicine, Birmingham, AL, USA
- Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham, School of Medicine, Birmingham, AL, USA
| | - Tatiana T Marquez-Lago
- Department of Genetics, University of Alabama at Birmingham, School of Medicine, Birmingham, AL, USA
- Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham, School of Medicine, Birmingham, AL, USA
- Department of Microbiology, University of Alabama at Birmingham, School of Medicine, Birmingham, AL, USA
| |
Collapse
|
7
|
Goussarov G, Mysara M, Vandamme P, Van Houdt R. Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data. Microbiologyopen 2022; 11:e1298. [PMID: 35765182 PMCID: PMC9179125 DOI: 10.1002/mbo3.1298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 11/18/2022] Open
Abstract
The rise of metagenomics offers a leap forward for understanding the genetic diversity of microorganisms in many different complex environments by providing a platform that can identify potentially unlimited numbers of known and novel microorganisms. As such, it is impossible to imagine new major initiatives without metagenomics. Nevertheless, it represents a relatively new discipline with various levels of complexity and demands on bioinformatics. The underlying principles and methods used in metagenomics are often seen as common knowledge and often not detailed or fragmented. Therefore, we reviewed these to guide microbiologists in taking the first steps into metagenomics. We specifically focus on a workflow aimed at reconstructing individual genomes, that is, metagenome-assembled genomes, integrating DNA sequencing, assembly, binning, identification and annotation.
Collapse
Affiliation(s)
- Gleb Goussarov
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of SciencesGhent UniversityGhentBelgium
| | - Mohamed Mysara
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
| | - Peter Vandamme
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of SciencesGhent UniversityGhentBelgium
| | - Rob Van Houdt
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
| |
Collapse
|
8
|
Saenz C, Nigro E, Gunalan V, Arumugam M. MIntO: A Modular and Scalable Pipeline For Microbiome Metagenomic and Metatranscriptomic Data Integration. FRONTIERS IN BIOINFORMATICS 2022; 2:846922. [PMID: 36304282 PMCID: PMC9580859 DOI: 10.3389/fbinf.2022.846922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 04/11/2022] [Indexed: 11/24/2022] Open
Abstract
Omics technologies have revolutionized microbiome research allowing the characterization of complex microbial communities in different biomes without requiring their cultivation. As a consequence, there has been a great increase in the generation of omics data from metagenomes and metatranscriptomes. However, pre-processing and analysis of these data have been limited by the availability of computational resources, bioinformatics expertise and standardized computational workflows to obtain consistent results that are comparable across different studies. Here, we introduce MIntO (Microbiome Integrated meta-Omics), a highly versatile pipeline that integrates metagenomic and metatranscriptomic data in a scalable way. The distinctive feature of this pipeline is the computation of gene expression profile through integrating metagenomic and metatranscriptomic data taking into account the community turnover and gene expression variations to disentangle the mechanisms that shape the metatranscriptome across time and between conditions. The modular design of MIntO enables users to run the pipeline using three available modes based on the input data and the experimental design, including de novo assembly leading to metagenome-assembled genomes. The integrated pipeline will be relevant to provide unique biochemical insights into microbial ecology by linking functions to retrieved genomes and to examine gene expression variation. Functional characterization of community members will be crucial to increase our knowledge of the microbiome’s contribution to human health and environment. MIntO v1.0.1 is available at https://github.com/arumugamlab/MIntO.
Collapse
|
9
|
Krakau S, Straub D, Gourlé H, Gabernet G, Nahnsen S. nf-core/mag: a best-practice pipeline for metagenome hybrid assembly and binning. NAR Genom Bioinform 2022; 4:lqac007. [PMID: 35118380 PMCID: PMC8808542 DOI: 10.1093/nargab/lqac007] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 11/19/2021] [Accepted: 01/25/2022] [Indexed: 12/18/2022] Open
Abstract
The analysis of shotgun metagenomic data provides valuable insights into microbial communities, while allowing resolution at individual genome level. In absence of complete reference genomes, this requires the reconstruction of metagenome assembled genomes (MAGs) from sequencing reads. We present the nf-core/mag pipeline for metagenome assembly, binning and taxonomic classification. It can optionally combine short and long reads to increase assembly continuity and utilize sample-wise group-information for co-assembly and genome binning. The pipeline is easy to install-all dependencies are provided within containers-portable and reproducible. It is written in Nextflow and developed as part of the nf-core initiative for best-practice pipeline development. All codes are hosted on GitHub under the nf-core organization https://github.com/nf-core/mag and released under the MIT license.
Collapse
Affiliation(s)
- Sabrina Krakau
- Quantitative Biology Center (QBiC), University of Tübingen, 72076 Tübingen, Germany
| | - Daniel Straub
- Quantitative Biology Center (QBiC), University of Tübingen, 72076 Tübingen, Germany
| | - Hadrien Gourlé
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, S-75007 Uppsala, Sweden
| | - Gisela Gabernet
- Quantitative Biology Center (QBiC), University of Tübingen, 72076 Tübingen, Germany
| | - Sven Nahnsen
- Quantitative Biology Center (QBiC), University of Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
10
|
Chen YH, Chiang PW, Rogozin DY, Degermendzhy AG, Chiu HH, Tang SL. Salvaging high-quality genomes of microbial species from a meromictic lake using a hybrid sequencing approach. Commun Biol 2021; 4:996. [PMID: 34426638 PMCID: PMC8382752 DOI: 10.1038/s42003-021-02510-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 08/01/2021] [Indexed: 11/08/2022] Open
Abstract
Most of Earth's bacteria have yet to be cultivated. The metabolic and functional potentials of these uncultivated microorganisms thus remain mysterious, and the metagenome-assembled genome (MAG) approach is the most robust method for uncovering these potentials. However, MAGs discovered by conventional metagenomic assembly and binning are usually highly fragmented genomes with heterogeneous sequence contamination. In this study, we combined Illumina and Nanopore data to develop a new workflow to reconstruct 233 MAGs-six novel bacterial orders, 20 families, 66 genera, and 154 species-from Lake Shunet, a secluded meromictic lake in Siberia. With our workflow, the average N50 of reconstructed MAGs greatly increased 10-40-fold compared to when the conventional Illumina assembly and binning method were used. More importantly, six complete MAGs were recovered from our datasets. The recovery of 154 novel species MAGs from a rarely explored lake greatly expands the current bacterial genome encyclopedia.
Collapse
Affiliation(s)
- Yu-Hsiang Chen
- Bioinformatics Program, Taiwan International Graduate Program, National Taiwan University, Taipei, Taiwan
- Bioinformatics Program, Institute of Information Science, Taiwan International Graduate Program, Academia Sinica, Taipei, Taiwan
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Pei-Wen Chiang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Denis Yu Rogozin
- Institute of Biophysics, Siberian Branch of Russian Academy of Sciences, Krasnoyarsk, Russia
- Siberian Federal University, Krasnoyarsk, Russia
| | - Andrey G Degermendzhy
- Institute of Biophysics, Siberian Branch of Russian Academy of Sciences, Krasnoyarsk, Russia
| | - Hsiu-Hui Chiu
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Sen-Lin Tang
- Bioinformatics Program, Institute of Information Science, Taiwan International Graduate Program, Academia Sinica, Taipei, Taiwan.
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan.
| |
Collapse
|
11
|
Melendrez MC, Shaw S, Brown CT, Goodner BW, Kvaal C. Editorial: Curriculum Applications in Microbiology: Bioinformatics in the Classroom. Front Microbiol 2021; 12:705233. [PMID: 34276638 PMCID: PMC8281245 DOI: 10.3389/fmicb.2021.705233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 06/07/2021] [Indexed: 11/18/2022] Open
Affiliation(s)
| | - Sophie Shaw
- Centre for Genome Enabled Biology and Medicine, University of Aberdeen, Aberdeen, United Kingdom
| | - C Titus Brown
- Department of Population Health and Reproduction, University of California, Davis, Davis, CA, United States
| | | | - Christopher Kvaal
- Department of Biology, St. Cloud State University, St. Cloud, MN, United States
| |
Collapse
|
12
|
Ciuffreda L, Rodríguez-Pérez H, Flores C. Nanopore sequencing and its application to the study of microbial communities. Comput Struct Biotechnol J 2021; 19:1497-1511. [PMID: 33815688 PMCID: PMC7985215 DOI: 10.1016/j.csbj.2021.02.020] [Citation(s) in RCA: 89] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Revised: 02/24/2021] [Accepted: 02/27/2021] [Indexed: 12/14/2022] Open
Abstract
Since its introduction, nanopore sequencing has enhanced our ability to study complex microbial samples through the possibility to sequence long reads in real time using inexpensive and portable technologies. The use of long reads has allowed to address several previously unsolved issues in the field, such as the resolution of complex genomic structures, and facilitated the access to metagenome assembled genomes (MAGs). Furthermore, the low cost and portability of platforms together with the development of rapid protocols and analysis pipelines have featured nanopore technology as an attractive and ever-growing tool for real-time in-field sequencing for environmental microbial analysis. This review provides an up-to-date summary of the experimental protocols and bioinformatic tools for the study of microbial communities using nanopore sequencing, highlighting the most important and recent research in the field with a major focus on infectious diseases. An overview of the main approaches including targeted and shotgun approaches, metatranscriptomics, epigenomics, and epitranscriptomics is provided, together with an outlook to the major challenges and perspectives over the use of this technology for microbial studies.
Collapse
Affiliation(s)
- Laura Ciuffreda
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, 38010 Santa Cruz de Tenerife, Spain
| | - Héctor Rodríguez-Pérez
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, 38010 Santa Cruz de Tenerife, Spain
| | - Carlos Flores
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, 38010 Santa Cruz de Tenerife, Spain
- CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, 28029 Madrid, Spain
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), 38600 Santa Cruz de Tenerife, Spain
- Instituto de Tecnologías Biomédicas (ITB), Universidad de La Laguna, 38200 Santa Cruz de Tenerife, Spain
| |
Collapse
|