1
|
Vital JS, Tanoeiro L, Lopes-Oliveira R, Vale FF. Biomarker Characterization and Prediction of Virulence and Antibiotic Resistance from Helicobacter pylori Next Generation Sequencing Data. Biomolecules 2022; 12:691. [PMID: 35625618 PMCID: PMC9138241 DOI: 10.3390/biom12050691] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 05/02/2022] [Accepted: 05/07/2022] [Indexed: 02/06/2023] Open
Abstract
The Gram-negative bacterium Helicobacter pylori colonizes c.a. 50% of human stomachs worldwide and is the major risk factor for gastric adenocarcinoma. Its high genetic variability makes it difficult to identify biomarkers of early stages of infection that can reliably predict its outcome. Moreover, the increasing antibiotic resistance found in H. pylori defies therapy, constituting a major human health problem. Here, we review H. pylori virulence factors and genes involved in antibiotic resistance, as well as the technologies currently used for their detection. Furthermore, we show that next generation sequencing may lead to faster characterization of virulence factors and prediction of the antibiotic resistance profile, thus contributing to personalized treatment and management of H. pylori-associated infections. With this new approach, more and permanent data will be generated at a lower cost, opening the future to new applications for H. pylori biomarker identification and antibiotic resistance prediction.
Collapse
Affiliation(s)
- Joana S. Vital
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal; (J.S.V.); (L.T.); (R.L.-O.)
| | - Luís Tanoeiro
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal; (J.S.V.); (L.T.); (R.L.-O.)
| | - Ricardo Lopes-Oliveira
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal; (J.S.V.); (L.T.); (R.L.-O.)
| | - Filipa F. Vale
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal; (J.S.V.); (L.T.); (R.L.-O.)
| |
Collapse
|
2
|
Characterization of East-Asian Helicobacter pylori encoding Western EPIYA-ABC CagA. J Microbiol 2021; 60:207-214. [PMID: 34757586 DOI: 10.1007/s12275-022-1483-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/20/2021] [Accepted: 10/21/2021] [Indexed: 12/24/2022]
Abstract
The polymorphic bacterial oncoprotein, CagA shows geography-dependent variation in the C-terminal Glu-Pro-Ile-Tyr-Ala (EPIYA) motifs; East-Asian H. pylori isolates carry the ABD type while Western isolates carry the ABC type. In Western isolates, the EPIYA-C motif is sometimes found in multi-copy and this genotype is associated with disease severity. Interestingly, a small number of East-Asian H. pylori isolates have been found to carry Western ABC-type CagA. To gain a better understanding of these unusual isolates, the genomes of four Korean H. pylori clinical isolates carrying ABC-type CagA were sequenced via third generation (Pac-Bio SMRT) sequencing technology. The obtained data were utilized for phylogenetic analysis as well as comparison of additional virulence factors that are known to show geographic-dependent polymorphisms. Three of four isolates indeed belonged to the hpEastAsia group and showed typical East-Asian polymorphism in virulence factors such as homA/B/C, babA/B/C, and oipA. One isolate grouped to HpAfrica and showed typical Western polymorphism of virulence factors such as cagA, homA/B/C, and oipA. To understand the occurrence of the multi-copy EPIYA-C motif genotype in an East-Asian H. pylori background, the Korean clinical isolate, K154 was analyzed; this strain belonged to hpEastAsia but encoded CagA EPIYA-ABCCCC. Based on DNA sequence homology within the CagA multimerization (CM) sequence that flanked the EPIYA-C motifs, we predicted that the number of C motifs might change via homologous recombination. To test this hypothesis, K154 was cultured for one generation and 287 single colonies were analyzed for the number of EPIYA-C motifs using PCR-based screening and DNA sequencing verification. Three out of 284 (1%) single colony isolates showed changes in the number of EPIYA-C motifs in vitro; one isolate increased to five EPIYA-C motifs, one decreased to three EPIYA-C motifs, and one completely deleted the EPIYA-C motifs. The capacity for dynamic changes in the number of EPIYA-C repeats of CagA may play a role in generating important intraspecies diversity in East-Asian H. pylori.
Collapse
|
3
|
Tedersoo L, Albertsen M, Anslan S, Callahan B. Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology. Appl Environ Microbiol 2021; 87:e0062621. [PMID: 34132589 PMCID: PMC8357291 DOI: 10.1128/aem.00626-21] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Short-read, high-throughput sequencing (HTS) methods have yielded numerous important insights into microbial ecology and function. Yet, in many instances short-read HTS techniques are suboptimal, for example, by providing insufficient phylogenetic resolution or low integrity of assembled genomes. Single-molecule and synthetic long-read (SLR) HTS methods have successfully ameliorated these limitations. In addition, nanopore sequencing has generated a number of unique analysis opportunities, such as rapid molecular diagnostics and direct RNA sequencing, and both Pacific Biosciences (PacBio) and nanopore sequencing support detection of epigenetic modifications. Although initially suffering from relatively low sequence quality, recent advances have greatly improved the accuracy of long-read sequencing technologies. In spite of great technological progress in recent years, the long-read HTS methods (PacBio and nanopore sequencing) are still relatively costly, require large amounts of high-quality starting material, and commonly need specific solutions in various analysis steps. Despite these challenges, long-read sequencing technologies offer high-quality, cutting-edge alternatives for testing hypotheses about microbiome structure and functioning as well as assembly of eukaryote genomes from complex environmental DNA samples.
Collapse
Affiliation(s)
- Leho Tedersoo
- Mycology and Microbiology Center, University of Tartu, Tartu, Estonia
| | - Mads Albertsen
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| | - Sten Anslan
- Mycology and Microbiology Center, University of Tartu, Tartu, Estonia
- Braunschweig University of Technology, Zoological Institute, Braunschweig, Germany
| | - Benjamin Callahan
- Department of Population Health and Pathobiology, College of Veterinary Medicine and Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
4
|
Kubota-Aizawa S, Matsubara Y, Kanemoto H, Mimuro H, Uchida K, Chambers J, Tsuboi M, Ohno K, Fukushima K, Kato N, Yotsuyanagi H, Tsujimoto H. Transmission of Helicobacter pylori between a human and two dogs: A case report. Helicobacter 2021; 26:e12798. [PMID: 33818862 DOI: 10.1111/hel.12798] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2020] [Accepted: 02/24/2021] [Indexed: 12/12/2022]
Abstract
BACKGROUND Whereas non-Helicobacter pylori helicobacters, which are frequently detected in the stomachs of dogs and cats as a source of zoonoses, have attracted considerable attention, the role of pets in H. pylori epidemiology is unclear. In our previous study, an H. pylori infection was detected in the stomach of a dog (Dog 1). Here, we investigated the H. pylori infection status in the female offspring of Dog 1 (Dog 2) and its owner within the same household. MATERIALS AND METHODS Biopsy specimens were obtained from the dog's owner and tested for H. pylori. DNA from gastric biopsy samples of Dog 1, gastric fluid sediment of Dog 2, and bacteria from the stomach of the owner was obtained, and Helicobacter genus- and species-specific PCRs were performed. Then, sequence analyses of the partial region of the ureAB gene were conducted. RESULTS Samples from both dogs and the owner reacted positively in the genus-specific PCR and negative in the Helicobacter felis-, Helicobacter bizzozeronii-, and Helicobacter heilmannii sensu stricto-specific PCRs. All three samples also reacted positively in the H. pylori-specific PCR. Sequences of the partial ureAB gene from all subjects were identical. CONCLUSIONS The results suggested that the two dogs and their owner were infected with an identical H. pylori strain. This report is the first to demonstrate that H. pylori can be transmitted between humans and dogs. Further studies are required to investigate the risk factors for the transmission of H. pylori between humans and dogs from the perspective of preventive epidemiology.
Collapse
Affiliation(s)
- Sanae Kubota-Aizawa
- Department of Veterinary Internal Medicine, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Yasuo Matsubara
- Department of General Medicine, IMSUT Hospital, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Hideyuki Kanemoto
- Department of Veterinary Internal Medicine, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Hitomi Mimuro
- Department of Infection Microbiology, Research Institute for Microbial Diseases, Osaka University, Osaka, Japan.,Department of Infectious Diseases Control, International Research Center for Infectious Diseases, The Institute of Medical Science, University of Tokyo, Tokyo, Japan
| | - Kazuyuki Uchida
- Department of Veterinary Pathology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - James Chambers
- Department of Veterinary Pathology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Masaya Tsuboi
- Department of Veterinary Pathology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Koichi Ohno
- Department of Veterinary Internal Medicine, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Kenjiro Fukushima
- Department of Veterinary Internal Medicine, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Naoya Kato
- Department of Gastroenterology, Graduate School of Medicine, Chiba University, Chiba, Japan
| | - Hiroshi Yotsuyanagi
- Department of General Medicine, IMSUT Hospital, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Hajime Tsujimoto
- Department of Veterinary Internal Medicine, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
5
|
Juraschek K, Borowiak M, Tausch SH, Malorny B, Käsbohrer A, Otani S, Schwarz S, Meemken D, Deneke C, Hammerl JA. Outcome of Different Sequencing and Assembly Approaches on the Detection of Plasmids and Localization of Antimicrobial Resistance Genes in Commensal Escherichia coli. Microorganisms 2021; 9:microorganisms9030598. [PMID: 33799479 PMCID: PMC8000739 DOI: 10.3390/microorganisms9030598] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2021] [Revised: 03/11/2021] [Accepted: 03/12/2021] [Indexed: 01/01/2023] Open
Abstract
Antimicrobial resistance (AMR) is a major threat to public health worldwide. Currently, AMR typing changes from phenotypic testing to whole-genome sequence (WGS)-based detection of resistance determinants for a better understanding of the isolate diversity and elements involved in gene transmission (e.g., plasmids, bacteriophages, transposons). However, the use of WGS data in monitoring purposes requires suitable techniques, standardized parameters and approved guidelines for reliable AMR gene detection and prediction of their association with mobile genetic elements (plasmids). In this study, different sequencing and assembly strategies were tested for their suitability in AMR monitoring in Escherichia coli in the routines of the German National Reference Laboratory for Antimicrobial Resistances. To assess the outcomes of the different approaches, results from in silico predictions were compared with conventional phenotypic- and genotypic-typing data. With the focus on (fluoro)quinolone-resistant E.coli, five qnrS-positive isolates with multiple extrachromosomal elements were subjected to WGS with NextSeq (Illumina), PacBio (Pacific BioSciences) and ONT (Oxford Nanopore) for in depth characterization of the qnrS1-carrying plasmids. Raw reads from short- and long-read sequencing were assembled individually by Unicycler or Flye or a combination of both (hybrid assembly). The generated contigs were subjected to bioinformatics analysis. Based on the generated data, assembly of long-read sequences are error prone and can yield in a loss of small plasmid genomes. In contrast, short-read sequencing was shown to be insufficient for the prediction of a linkage of AMR genes (e.g., qnrS1) to specific plasmid sequences. Furthermore, short-read sequencing failed to detect certain duplications and was unsuitable for genome finishing. Overall, the hybrid assembly led to the most comprehensive typing results, especially in predicting associations of AMR genes and mobile genetic elements. Thus, the use of different sequencing technologies and hybrid assemblies currently represents the best approach for reliable AMR typing and risk assessment.
Collapse
Affiliation(s)
- Katharina Juraschek
- Epidemiology, Zoonoses and Antimicrobial Resistance, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany;
- Correspondence: (K.J.); (J.A.H.)
| | - Maria Borowiak
- Study Centre for Genome Sequencing and Analysis, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany; (M.B.); (S.H.T.); (B.M.); (C.D.)
| | - Simon H. Tausch
- Study Centre for Genome Sequencing and Analysis, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany; (M.B.); (S.H.T.); (B.M.); (C.D.)
| | - Burkhard Malorny
- Study Centre for Genome Sequencing and Analysis, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany; (M.B.); (S.H.T.); (B.M.); (C.D.)
| | - Annemarie Käsbohrer
- Epidemiology, Zoonoses and Antimicrobial Resistance, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany;
- Unit for Veterinary Public Health and Epidemiology, University of Veterinary Medicine, Veterinaerplatz 1, 1210 Vienna, Austria
| | - Saria Otani
- DTU Food, National Food Institute, Technical University of Denmark, Kemitorvet, Building 204, 2800 Kgs Lyngby, Denmark;
| | - Stefan Schwarz
- Institute of Microbiology and Epizootics, Department of Veterinary Medicine, Freie Universität Berlin, 14163 Berlin, Germany;
| | - Diana Meemken
- Institute of Food Safety and Food Hygiene, Working Group Meat Hygiene, Department of Veterinary Medicine, Freie Universität Berlin, 14163 Berlin, Germany;
| | - Carlus Deneke
- Study Centre for Genome Sequencing and Analysis, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany; (M.B.); (S.H.T.); (B.M.); (C.D.)
| | - Jens Andre Hammerl
- Epidemiology, Zoonoses and Antimicrobial Resistance, German Federal Institute for Risk Assessment (BfR), Max-Dohrn Str. 8-10, 10589 Berlin, Germany;
- Correspondence: (K.J.); (J.A.H.)
| |
Collapse
|
6
|
Watanabe Y, Oikawa R, Kodaka Y, Sato Y, Ono S, Kenmochi T, Suzuki H, Futagami S, Kato M, Yamamoto H, Itoh F. Cancer-related genetic variants of Helicobacter pylori strains determined using gastric wash-based whole-genome analysis with single-molecule real-time technology. Int J Cancer 2020; 148:178-192. [PMID: 32803883 DOI: 10.1002/ijc.33257] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 06/22/2020] [Accepted: 08/04/2020] [Indexed: 12/17/2022]
Abstract
Helicobacter pylori (H. pylori) are a primary factor in the pathogenesis of gastric cancer (GC); GC ranks third among cancer-related mortality. A clear understanding of the H. pylori genome factors underlying GC is necessary to develop more effective methods to prevent GC. A single-molecule real-time DNA sequencing-based H. pylori genome-wide association study analysis was performed using the H. pylori genome present in five early-stage GC (EGC) and five non-GC clinical DNA samples recovered from gastric washes. A total of 275 genes with 702 nucleotide variants (NVs) were found to be common to three or more patients with EGC but no non-GC patients (single-NV: 654/702, 93.2%; multi-NV: 40/702, 5.7%; deletion: 3/702, 0.4%; insertion: 3/702, 0.7%). Gene ontology analysis of H. pylori revealed that genes involved in the mitochondrial electron transport system, glycolytic processes and the TCA cycle were highly enriched. Cancer-related NVs were most frequently found in a member of the Helicobacter outer membrane protein family, hopL. In particular, one of the NVs in hopL was a novel six-nucleotide insertion (1159095̂1159096, TACTTC); this mutant was detected more frequently in a validation set of 50 additional EGC samples (22/50, 44.0%) than in 18 non-GC samples (3/18, 16.7%, P = .04). These results suggest that the hopL variant is associated with the development of GC and may serve as a genetic biomarker of H. pylori virulence and GC risk. Our assay can serve as a potent tool to expand our understanding of bacteria-associated tumorigenesis.
Collapse
Affiliation(s)
- Yoshiyuki Watanabe
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, St. Marianna University School of Medicine, Kawasaki, Japan.,Department of Internal Medicine, Kawasaki Rinko General Hospital, Kawasaki, Japan
| | - Ritsuko Oikawa
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, St. Marianna University School of Medicine, Kawasaki, Japan
| | - Yasuhiro Kodaka
- Division of Gastroenterology, Musashikosugi Hospital, Nippon Medical School, Kawasaki, Japan
| | - Yoshinori Sato
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, St. Marianna University School of Medicine, Kawasaki, Japan
| | - Shoko Ono
- Department of Gastroenterology, Hokkaido University Hospital, Sapporo, Japan
| | - Takeshi Kenmochi
- Department of Gastroenterology, Saiseikai Kanagawa Hospital, Yokohama, Japan
| | - Hideo Suzuki
- Department of Gastroenterology, Institute of Clinical Medicine, University of Tsukuba, Tsukuba, Japan
| | - Seiji Futagami
- Division of Gastroenterology, Musashikosugi Hospital, Nippon Medical School, Kawasaki, Japan
| | - Mototsugu Kato
- Department of Gastroenterology, National Hospital Organization Hakodate Hospital, Hakodate, Japan
| | - Hiroyuki Yamamoto
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, St. Marianna University School of Medicine, Kawasaki, Japan
| | - Fumio Itoh
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, St. Marianna University School of Medicine, Kawasaki, Japan
| |
Collapse
|
7
|
Ardui S, Ameur A, Vermeesch JR, Hestand MS. Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics. Nucleic Acids Res 2019; 46:2159-2168. [PMID: 29401301 PMCID: PMC5861413 DOI: 10.1093/nar/gky066] [Citation(s) in RCA: 402] [Impact Index Per Article: 80.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 01/23/2018] [Indexed: 12/30/2022] Open
Abstract
Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio's single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing.
Collapse
Affiliation(s)
- Simon Ardui
- Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
| | - Adam Ameur
- Department of Immunology, Genetics and Pathology, Uppsala University, Science for Life Laboratory, Uppsala 75108, Sweden.,School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | | | - Matthew S Hestand
- Department of Human Genetics, KU Leuven, Leuven 3000, Belgium.,Department of Clinical Genetics, VU University Medical Center, Amsterdam 1081 BT, The Netherlands
| |
Collapse
|
8
|
Utturkar SM, Klingeman DM, Hurt RA, Brown SD. A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies. Front Microbiol 2017; 8:1272. [PMID: 28769883 PMCID: PMC5513972 DOI: 10.3389/fmicb.2017.01272] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 06/26/2017] [Indexed: 11/20/2022] Open
Abstract
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Collapse
Affiliation(s)
- Sagar M Utturkar
- Graduate School of Genome Science and Technology, University of TennesseeKnoxville, TN, United States
| | - Dawn M Klingeman
- Biosciences Division, Oak Ridge National LaboratoryOak Ridge, TN, United States.,BioEnergy Science CenterOak Ridge, TN, United States
| | - Richard A Hurt
- Biosciences Division, Oak Ridge National LaboratoryOak Ridge, TN, United States
| | - Steven D Brown
- Graduate School of Genome Science and Technology, University of TennesseeKnoxville, TN, United States.,Biosciences Division, Oak Ridge National LaboratoryOak Ridge, TN, United States.,BioEnergy Science CenterOak Ridge, TN, United States
| |
Collapse
|
9
|
Nakano K, Shiroma A, Shimoji M, Tamotsu H, Ashimine N, Ohki S, Shinzato M, Minami M, Nakanishi T, Teruya K, Satou K, Hirano T. Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area. Hum Cell 2017; 30:149-161. [PMID: 28364362 PMCID: PMC5486853 DOI: 10.1007/s13577-017-0168-8] [Citation(s) in RCA: 103] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 03/22/2017] [Indexed: 12/30/2022]
Abstract
PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
Collapse
Affiliation(s)
- Kazuma Nakano
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan.
| | - Akino Shiroma
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Makiko Shimoji
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Hinako Tamotsu
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Noriko Ashimine
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Shun Ohki
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Misuzu Shinzato
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Maiko Minami
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | | | - Kuniko Teruya
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Kazuhito Satou
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| | - Takashi Hirano
- Okinawa Institute of Advanced Sciences, Uruma, Okinawa, Japan
| |
Collapse
|
10
|
Rhoads A, Au KF. PacBio Sequencing and Its Applications. GENOMICS PROTEOMICS & BIOINFORMATICS 2015; 13:278-89. [PMID: 26542840 PMCID: PMC4678779 DOI: 10.1016/j.gpb.2015.08.002] [Citation(s) in RCA: 1196] [Impact Index Per Article: 132.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Revised: 08/06/2015] [Accepted: 08/11/2015] [Indexed: 12/15/2022]
Abstract
Single-molecule, real-time sequencing developed by Pacific BioSciences offers longer read lengths than the second-generation sequencing (SGS) technologies, making it well-suited for unsolved problems in genome, transcriptome, and epigenetics research. The highly-contiguous de novo assemblies using PacBio sequencing can close gaps in current reference assemblies and characterize structural variation (SV) in personal genomes. With longer reads, we can sequence through extended repetitive regions and detect mutations, many of which are associated with diseases. Moreover, PacBio transcriptome sequencing is advantageous for the identification of gene isoforms and facilitates reliable discoveries of novel genes and novel isoforms of annotated genes, due to its ability to sequence full-length transcripts or fragments with significant lengths. Additionally, PacBio’s sequencing technique provides information that is useful for the direct detection of base modifications, such as methylation. In addition to using PacBio sequencing alone, many hybrid sequencing strategies have been developed to make use of more accurate short reads in conjunction with PacBio long reads. In general, hybrid sequencing strategies are more affordable and scalable especially for small-size laboratories than using PacBio Sequencing alone. The advent of PacBio sequencing has made available much information that could not be obtained via SGS alone.
Collapse
Affiliation(s)
- Anthony Rhoads
- Department of Biostatistics, University of Iowa, Iowa City, IA 52242, USA
| | - Kin Fai Au
- Department of Biostatistics, University of Iowa, Iowa City, IA 52242, USA; Department of Internal Medicine, University of Iowa, Iowa City, IA 52242, USA.
| |
Collapse
|
11
|
First Complete Genome Sequence of Clostridium sporogenes DSM 795T, a Nontoxigenic Surrogate for Clostridium botulinum, Determined Using PacBio Single-Molecule Real-Time Technology. GENOME ANNOUNCEMENTS 2015; 3:3/4/e00832-15. [PMID: 26227598 PMCID: PMC4520896 DOI: 10.1128/genomea.00832-15] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The first complete genome sequence of Clostridium sporogenes DSM 795T, a nontoxigenic surrogate for Clostridium botulinum, was determined in a single contig using the PacBio single-molecule real-time technology. The genome (4,142,990 bp; G+C content, 27.98%) included 86 sets of >1,000-bp identical sequence pairs and 380 tandem repeats.
Collapse
|
12
|
Utturkar SM, Klingeman DM, Bruno-Barcena JM, Chinn MS, Grunden AM, Köpke M, Brown SD. Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies. Sci Data 2015; 2:150014. [PMID: 25977818 PMCID: PMC4409012 DOI: 10.1038/sdata.2015.14] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 03/12/2015] [Indexed: 01/07/2023] Open
Abstract
During the past decade, DNA sequencing output has been mostly dominated by the second generation sequencing platforms which are characterized by low cost, high throughput and shorter read lengths for example, Illumina. The emergence and development of so called third generation sequencing platforms such as PacBio has permitted exceptionally long reads (over 20 kb) to be generated. Due to read length increases, algorithm improvements and hybrid assembly approaches, the concept of one chromosome, one contig and automated finishing of microbial genomes is now a realistic and achievable task for many microbial laboratories. In this paper, we describe high quality sequence datasets which span three generations of sequencing technologies, containing six types of data from four NGS platforms and originating from a single microorganism, Clostridium autoethanogenum. The dataset reported here will be useful for the scientific community to evaluate upcoming NGS platforms, enabling comparison of existing and novel bioinformatics approaches and will encourage interest in the development of innovative experimental and computational methods for NGS data.
Collapse
Affiliation(s)
- Sagar M Utturkar
- Graduate School of Genome Science and Technology, University of Tennessee , Knoxville, Tennessee 37919, USA
| | - Dawn M Klingeman
- Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, Tennessee 37831, USA
| | - José M Bruno-Barcena
- Department of Plant and Microbial Biology, North Carolina State University , Raleigh, North Carolina 27695, USA
| | - Mari S Chinn
- Department of Biological and Agricultural Engineering, North Carolina State University , Raleigh, North Carolina 27695, USA
| | - Amy M Grunden
- Department of Plant and Microbial Biology, North Carolina State University , Raleigh, North Carolina 27695, USA
| | | | - Steven D Brown
- Graduate School of Genome Science and Technology, University of Tennessee , Knoxville, Tennessee 37919, USA ; Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, Tennessee 37831, USA
| |
Collapse
|
13
|
Laver T, Harrison J, O'Neill PA, Moore K, Farbos A, Paszkiewicz K, Studholme DJ. Assessing the performance of the Oxford Nanopore Technologies MinION. BIOMOLECULAR DETECTION AND QUANTIFICATION 2015; 3:1-8. [PMID: 26753127 PMCID: PMC4691839 DOI: 10.1016/j.bdq.2015.02.001] [Citation(s) in RCA: 291] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Revised: 02/05/2015] [Accepted: 02/18/2015] [Indexed: 01/15/2023]
Abstract
The Oxford Nanopore Technologies (ONT) MinION is a new sequencing technology that potentially offers read lengths of tens of kilobases (kb) limited only by the length of DNA molecules presented to it. The device has a low capital cost, is by far the most portable DNA sequencer available, and can produce data in real-time. It has numerous prospective applications including improving genome sequence assemblies and resolution of repeat-rich regions. Before such a technology is widely adopted, it is important to assess its performance and limitations in respect of throughput and accuracy. In this study we assessed the performance of the MinION by re-sequencing three bacterial genomes, with very different nucleotide compositions ranging from 28.6% to 70.7%; the high G + C strain was underrepresented in the sequencing reads. We estimate the error rate of the MinION (after base calling) to be 38.2%. Mean and median read lengths were 2 kb and 1 kb respectively, while the longest single read was 98 kb. The whole length of a 5 kb rRNA operon was covered by a single read. As the first nanopore-based single molecule sequencer available to researchers, the MinION is an exciting prospect; however, the current error rate limits its ability to compete with existing sequencing technologies, though we do show that MinION sequence reads can enhance contiguity of de novo assembly when used in conjunction with Illumina MiSeq data.
Collapse
Affiliation(s)
- T Laver
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK
| | - J Harrison
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK
| | - P A O'Neill
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK; Wellcome Trust Biomedical Informatics Hub, Geoffrey Pope Building, Stocker Road, University of Exeter, Exeter EX4 4QD, UK
| | - K Moore
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK; Wellcome Trust Biomedical Informatics Hub, Geoffrey Pope Building, Stocker Road, University of Exeter, Exeter EX4 4QD, UK
| | - A Farbos
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK; Wellcome Trust Biomedical Informatics Hub, Geoffrey Pope Building, Stocker Road, University of Exeter, Exeter EX4 4QD, UK
| | - K Paszkiewicz
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK; Wellcome Trust Biomedical Informatics Hub, Geoffrey Pope Building, Stocker Road, University of Exeter, Exeter EX4 4QD, UK
| | - D J Studholme
- Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter EX4 4QD, UK
| |
Collapse
|
14
|
One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly. Curr Opin Microbiol 2014; 23:110-20. [PMID: 25461581 DOI: 10.1016/j.mib.2014.11.014] [Citation(s) in RCA: 265] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Revised: 11/17/2014] [Accepted: 11/18/2014] [Indexed: 11/20/2022]
Abstract
Like a jigsaw puzzle with large pieces, a genome sequenced with long reads is easier to assemble. However, recent sequencing technologies have favored lowering per-base cost at the expense of read length. This has dramatically reduced sequencing cost, but resulted in fragmented assemblies, which negatively affect downstream analyses and hinder the creation of finished (gapless, high-quality) genomes. In contrast, emerging long-read sequencing technologies can now produce reads tens of kilobases in length, enabling the automated finishing of microbial genomes for under $1000. This promises to improve the quality of reference databases and facilitate new studies of chromosomal structure and variation. We present an overview of these new technologies and the methods used to assemble long reads into complete genomes.
Collapse
|
15
|
First Complete Genome Sequence of Salmonella enterica subsp. enterica Serovar Typhimurium Strain ATCC 13311 (NCTC 74), a Reference Strain of Multidrug Resistance, as Achieved by Use of PacBio Single-Molecule Real-Time Technology. GENOME ANNOUNCEMENTS 2014; 2:2/5/e00986-14. [PMID: 25278532 PMCID: PMC4183876 DOI: 10.1128/genomea.00986-14] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We report the first complete genomic sequence of Salmonella enterica subsp. enterica serovar Typhimurium strain ATCC 13311, the leading food-borne pathogen and a reference strain used in drug resistance studies. De novo assembly with PacBio sequencing completed its chromosome and one plasmid. They will accelerate the investigation into multidrug resistance in Salmonella Typhimurium.
Collapse
|
16
|
Complete Genome Sequence of Pelosinus sp. Strain UFO1 Assembled Using Single-Molecule Real-Time DNA Sequencing Technology. GENOME ANNOUNCEMENTS 2014; 2:2/5/e00881-14. [PMID: 25189589 PMCID: PMC4155594 DOI: 10.1128/genomea.00881-14] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Pelosinus species can reduce metals such as Fe(III), U(VI), and Cr(VI) and have been isolated from diverse geographical regions. Five draft genome sequences have been published. We report the complete genome sequence for Pelosinus sp. strain UFO1 using only PacBio DNA sequence data and without manual finishing.
Collapse
|