Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Powers JG, Weigman VJ, Shu J, Pufky JM, Cox D, Hurban P. Efficient and accurate whole genome assembly and methylome profiling of E. coli. BMC Genomics 2013;14:675. [PMID: 24090403 DOI: 10.1186/1471-2164-14-675] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Received: 05/21/2013] [Accepted: 09/26/2013] [Indexed: 11/10/2022] Open

Cited by Other Article(s)

Liu L, Peng S, Song W, Zhao H, Li H, Wang H. Genomic Analysis of an Excellent Wine-Making Strain Oenococcus oeni SD-2a. Pol J Microbiol 2022;71:279-292. [PMID: 35716166 PMCID: PMC9252139 DOI: 10.33073/pjm-2022-026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/07/2022] [Accepted: 05/14/2022] [Indexed: 12/27/2022] Open

Neubert K, Zuchantke E, Leidenfrost RM, Wünschiers R, Grützke J, Malorny B, Brendebach H, Al Dahouk S, Homeier T, Hotzel H, Reinert K, Tomaso H, Busch A. Testing assembly strategies of Francisella tularensis genomes to infer an evolutionary conservation analysis of genomic structures. BMC Genomics 2021;22:822. [PMID: 34773979 PMCID: PMC8590783 DOI: 10.1186/s12864-021-08115-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 01/04/2021] [Accepted: 10/12/2021] [Indexed: 02/08/2023] Open

Affiliation(s)

Kerstin Neubert Department of Mathematics and Computer Science, Algorithmic Bioinformatics, Freie Universität Berlin, Institute of Computer Science, Takustr. 9, 14195, Berlin, Germany; German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany
Eric Zuchantke Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Naumburger Str. 96a, 07749, Jena, Germany
Robert Maximilian Leidenfrost Department of Biotechnology and Chemistry, Mittweida University of Applied Sciences, Technikumplatz 17a, 09648, Mittweida, Germany
Röbbe Wünschiers Department of Biotechnology and Chemistry, Mittweida University of Applied Sciences, Technikumplatz 17a, 09648, Mittweida, Germany
Josephine Grützke German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany
Burkhard Malorny German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany
Holger Brendebach German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany
Sascha Al Dahouk German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany
Timo Homeier Friedrich-Loeffler-Institut, Institute of Epidemiology, Südufer 10, 17493, Greifswald, Insel Riems, Germany
Helmut Hotzel Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Naumburger Str. 96a, 07749, Jena, Germany
Knut Reinert Department of Mathematics and Computer Science, Algorithmic Bioinformatics, Freie Universität Berlin, Institute of Computer Science, Takustr. 9, 14195, Berlin, Germany
Herbert Tomaso Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Naumburger Str. 96a, 07749, Jena, Germany
Anne Busch Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Naumburger Str. 96a, 07749, Jena, Germany; Department of Anaesthesiology and Intensive Care Medicine, University Hospital Jena, Jena, Germany.

Ardui S, Ameur A, Vermeesch JR, Hestand MS. Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics. Nucleic Acids Res 2019;46:2159-2168. [PMID: 29401301 PMCID: PMC5861413 DOI: 10.1093/nar/gky066] [Citation(s) in RCA: 400] [Impact Index Per Article: 80.0] [Received: 10/02/2017] [Accepted: 01/23/2018] [Indexed: 12/30/2022] Open

Forde BM, McAllister LJ, Paton JC, Paton AW, Beatson SA. SMRT sequencing reveals differential patterns of methylation in two O111:H- STEC isolates from a hemolytic uremic syndrome outbreak in Australia. Sci Rep 2019;9:9436. [PMID: 31263188 PMCID: PMC6602927 DOI: 10.1038/s41598-019-45760-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 06/15/2018] [Accepted: 05/23/2019] [Indexed: 11/21/2022] Open

Denancé N, Briand M, Gaborieau R, Gaillard S, Jacques MA. Identification of genetic relationships and subspecies signatures in Xylella fastidiosa. BMC Genomics 2019;20:239. [PMID: 30909861 PMCID: PMC6434890 DOI: 10.1186/s12864-019-5565-9] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Received: 08/22/2018] [Accepted: 02/25/2019] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

The phytopathogenic bacterium Xylella fastidiosa was thought to be restricted to the Americas where it infects and kills numerous hosts. Its detection worldwide has been blooming since 2013 in Europe and Asia. Genetically diverse, this species is divided into six subspecies but genetic traits governing this classification are poorly understood.

RESULTS

SkIf (Specific k-mers Identification) was designed and exploited for comparative genomics on a dataset of 46 X. fastidiosa genomes, including seven newly sequenced individuals. It was helpful to quickly check the synonymy between strains from different collections. SkIf identified specific SNPs within 16S rRNA sequences that can be employed for predicting the distribution of Xylella through data mining. Applied to inter- and intra-subspecies analyses, it identified specific k-mers in genes affiliated to differential gene ontologies. Chemotaxis-related genes more prevalently possess specific k-mers in genomes from subspecies fastidiosa, morus and sandyi taken as a whole group. In the subspecies pauca increased abundance of specific k-mers was found in genes associated with the bacterial cell wall/envelope/plasma membrane. Most often, the k-mer specificity occurred in core genes with non-synonymous SNPs in their sequences in genomes of the other subspecies, suggesting putative impact in the protein functions. The presence of two integrative and conjugative elements (ICEs) was identified, one chromosomic and an entire plasmid in a single strain of X. fastidiosa subsp. pauca. Finally, a revised taxonomy of X. fastidiosa into three major clades defined by the subspecies pauca (clade I), multiplex (clade II) and the combination of fastidiosa, morus and sandyi (clade III) was strongly supported by k-mers specifically associated with these subspecies.

CONCLUSIONS

SkIf is a robust and rapid software, freely available, that can be dedicated to the comparison of sequence datasets and is applicable to any field of research. Applied to X. fastidiosa, an emerging pathogen in Europe, it provided an important resource to mine for identifying genetic markers of subspecies to optimize the strategies attempted to limit the pathogen dissemination in novel areas.

Pinning down the role of common luminal intestinal parasitic protists in human health and disease - status and challenges. Parasitology 2019;146:695-701. [PMID: 30732665 DOI: 10.1017/s0031182019000039] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Indexed: 12/17/2022]

Efficiency of PacBio long read correction by 2nd generation Illumina sequencing. Genomics 2019;111:43-49. [DOI: 10.1016/j.ygeno.2017.12.011] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Received: 07/28/2017] [Revised: 12/11/2017] [Accepted: 12/17/2017] [Indexed: 12/17/2022]

Payelleville A, Legrand L, Ogier JC, Roques C, Roulet A, Bouchez O, Mouammine A, Givaudan A, Brillard J. The complete methylome of an entomopathogenic bacterium reveals the existence of loci with unmethylated Adenines. Sci Rep 2018;8:12091. [PMID: 30108278 PMCID: PMC6092372 DOI: 10.1038/s41598-018-30620-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Received: 03/14/2018] [Accepted: 08/03/2018] [Indexed: 01/01/2023] Open

A Whole Genome Assembly of the Horn Fly, Haematobia irritans, and Prediction of Genes with Roles in Metabolism and Sex Determination. G3-GENES GENOMES GENETICS 2018;8:1675-1686. [PMID: 29602812 PMCID: PMC5940159 DOI: 10.1534/g3.118.200154] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Indexed: 02/04/2023]

Besser J, Carleton HA, Gerner-Smidt P, Lindsey RL, Trees E. Next-generation sequencing technologies and their application to the study and control of bacterial infections. Clin Microbiol Infect 2017;24:335-341. [PMID: 29074157 DOI: 10.1016/j.cmi.2017.10.013] [Citation(s) in RCA: 238] [Impact Index Per Article: 34.0] [Received: 04/12/2017] [Revised: 10/05/2017] [Accepted: 10/17/2017] [Indexed: 12/21/2022]

Quainoo S, Coolen JPM, van Hijum SAFT, Huynen MA, Melchers WJG, van Schaik W, Wertheim HFL. Whole-Genome Sequencing of Bacterial Pathogens: the Future of Nosocomial Outbreak Analysis. Clin Microbiol Rev 2017;30:1015-1063. [PMID: 28855266 PMCID: PMC5608882 DOI: 10.1128/cmr.00016-17] [Citation(s) in RCA: 228] [Impact Index Per Article: 32.6] [Indexed: 12/14/2022] Open

Abstract

Outbreaks of multidrug-resistant bacteria present a frequent threat to vulnerable patient populations in hospitals around the world. Intensive care unit (ICU) patients are particularly susceptible to nosocomial infections due to indwelling devices such as intravascular catheters, drains, and intratracheal tubes for mechanical ventilation. The increased vulnerability of infected ICU patients demonstrates the importance of effective outbreak management protocols to be in place. Understanding the transmission of pathogens via genotyping methods is an important tool for outbreak management. Recently, whole-genome sequencing (WGS) of pathogens has become more accessible and affordable as a tool for genotyping. Analysis of the entire pathogen genome via WGS could provide unprecedented resolution in discriminating even highly related lineages of bacteria and revolutionize outbreak analysis in hospitals. Nevertheless, clinicians have long been hesitant to implement WGS in outbreak analyses due to the expensive and cumbersome nature of early sequencing platforms. Recent improvements in sequencing technologies and analysis tools have rapidly increased the output and analysis speed as well as reduced the overall costs of WGS. In this review, we assess the feasibility of WGS technologies and bioinformatics analysis tools for nosocomial outbreak analyses and provide a comparison to conventional outbreak analysis workflows. Moreover, we review advantages and limitations of sequencing technologies and analysis tools and present a real-world example of the implementation of WGS for antimicrobial resistance analysis. We aimed to provide health care professionals with a guide to WGS outbreak analysis that highlights its benefits for hospitals and assists in the transition from conventional to WGS-based outbreak analysis.

Abrams AJ, Trees DL. Genomic sequencing of Neisseria gonorrhoeae to respond to the urgent threat of antimicrobial-resistant gonorrhea. Pathog Dis 2017;75:3106325. [PMID: 28387837 PMCID: PMC6956991 DOI: 10.1093/femspd/ftx041] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 12/15/2016] [Accepted: 04/04/2017] [Indexed: 01/02/2023] Open

Seong HJ, Park HJ, Hong E, Lee SC, Sul WJ, Han SW. Methylome Analysis of Two Xanthomonas spp. Using Single-Molecule Real-Time Sequencing. THE PLANT PATHOLOGY JOURNAL 2016;32:500-507. [PMID: 27904456 PMCID: PMC5117858 DOI: 10.5423/ppj.ft.10.2016.0216] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 10/12/2016] [Revised: 10/24/2016] [Accepted: 10/24/2016] [Indexed: 05/24/2023]

Gibriel HAY, Thomma BPHJ, Seidl MF. The Age of Effectors: Genome-Based Discovery and Applications. PHYTOPATHOLOGY 2016;106:1206-1212. [PMID: 27050568 DOI: 10.1094/phyto-02-16-0110-fi] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Indexed: 05/03/2023]

Velandia-Huerto CA, Berkemer SJ, Hoffmann A, Retzlaff N, Romero Marroquín LC, Hernández-Rosales M, Stadler PF, Bermúdez-Santana CI. Orthologs, turn-over, and remolding of tRNAs in primates and fruit flies. BMC Genomics 2016;17:617. [PMID: 27515907 PMCID: PMC4981973 DOI: 10.1186/s12864-016-2927-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Received: 02/10/2016] [Accepted: 07/11/2016] [Indexed: 12/26/2022] Open

Abstract

Background

Transfer RNAs (tRNAs) are ubiquitous in all living organisms. They implement the genetic code so that most genomes contain distinct tRNAs for almost all 61 codons. They behave similarly to mobile elements and proliferate in genomes, spawning both local and non-local copies. Most tRNA families are therefore typically present as multicopy genes. The members of the individual tRNA families evolve under concerted or rapid birth-death evolution, so that paralogous copies maintain almost identical sequences over long evolutionary time-scales. To a good approximation these are functionally equivalent. Individual tRNA copies thus are evolutionarily unstable and easily turn into pseudogenes and disappear. This leads to a rapid turnover of tRNAs and often large differences in the tRNA complements of closely related species. Since tRNA paralogs are not distinguished by sequence, common methods cannot be used to establish orthology between tRNA genes.

Results

In this contribution we introduce a general framework to distinguish orthologs and paralogs in gene families that are subject to concerted evolution. It is based on the use of uniquely aligned adjacent sequence elements as anchors to establish syntenic conservation of sequence intervals. In practice, anchors and intervals can be extracted from genome-wide multiple sequence alignments. Syntenic clusters of concertedly evolving genes of different families can then be subdivided by list alignments, leading to usually small clusters of candidate co-orthologs. On the basis of recent advances in phylogenetic combinatorics, these candidate clusters can be further processed by cograph editing to recover their duplication histories. We developed a workflow that can be conceptualized as stepwise refinement of a graph of homologous genes. We apply this analysis strategy with different types of synteny anchors to investigate the evolution of tRNAs in primates and fruit flies. We identified a large number of tRNA remolding events concentrated at the tips of the phylogeny. With one notable exception all phylogenetically old tRNA remoldings do not change the isoacceptor class.

Conclusions

Gene families evolving under concerted evolution are not amenable to classical phylogenetic analyses since paralogs maintain identical, species-specific sequences, precluding the estimation of correct gene trees from sequence differences. This leaves conservation of syntenic arrangements with respect to “anchor elements” that are not subject to concerted evolution as the only viable source of phylogenetic information. We have demonstrated here that a purely synteny-based analysis of tRNA gene histories is indeed feasible. Although the choice of synteny anchors influences the resolution in particular when tight gene clusters are present, and the quality of sequence alignments, genome assemblies, and genome rearrangements limits the scope of the analysis, largely coherent results can be obtained for tRNAs. In particular, we conclude that a large fraction of the tRNAs are recent copies. This proliferation is compensated by rapid pseudogenization as exemplified by many very recent alloacceptor remoldings.

Affiliation(s)

Cristian A Velandia-Huerto Biology Department, Universidad Nacional de Colombia, Carrera 45 # 26-85, Edif. Uriel Gutiérrez, Bogotá, D.C, Colombia
Sarah J Berkemer Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany; Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
Anne Hoffmann Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
Nancy Retzlaff Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany; Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
Liliana C Romero Marroquín Biology Department, Universidad Nacional de Colombia, Carrera 45 # 26-85, Edif. Uriel Gutiérrez, Bogotá, D.C, Colombia
Maribel Hernández-Rosales CONACYT - Instituto de Matemáticas, UNAM Juriquilla, Av. Juriquilla #3001, Santiago de Querétaro, MX-76230, QRO, México
Peter F Stadler Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany; Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany; Fraunhofer Institut for Cell Therapy and Immunology, Perlickstraße 1, Leipzig, D-04103, Germany; Department of Theoretical Chemistry, University of Vienna, Währinger Straße 17, Vienna, A-1090, Austria; Center for non-coding RNA in Technology and Health, Grønegårdsvej 3, Frederiksberg C, DK-1870, Denmark; Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM 87501, USA.
Clara I Bermúdez-Santana Biology Department, Universidad Nacional de Colombia, Carrera 45 # 26-85, Edif. Uriel Gutiérrez, Bogotá, D.C, Colombia

Lavezzo E, Barzon L, Toppo S, Palù G. Third generation sequencing technologies applied to diagnostic microbiology: benefits and challenges in applications and data analysis. Expert Rev Mol Diagn 2016;16:1011-23. [PMID: 27453996 DOI: 10.1080/14737159.2016.1217158] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Indexed: 01/07/2023]

Xie B, Liu B, Yi Y, Yang L, Liang D, Zhu Y, Liu H. Microbiological mechanism of the improved nitrogen and phosphorus removal by embedding microbial fuel cell in Anaerobic-Anoxic-Oxic wastewater treatment process. BIORESOURCE TECHNOLOGY 2016;207:109-17. [PMID: 26874439 DOI: 10.1016/j.biortech.2016.01.090] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Received: 12/14/2015] [Revised: 01/21/2016] [Accepted: 01/23/2016] [Indexed: 05/27/2023]

Affiliation(s)

Beizhen Xie School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China; Institution of Environmental Biology and Life Support Technology, Beihang University, Beijing 100191, China; International Joint Research Center of Aerospace Biotechnology & Medical Engineering, Beihang University, Beijing 100191, China
Bojie Liu School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China; Institution of Environmental Biology and Life Support Technology, Beihang University, Beijing 100191, China; International Joint Research Center of Aerospace Biotechnology & Medical Engineering, Beihang University, Beijing 100191, China
Yue Yi School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China; Institution of Environmental Biology and Life Support Technology, Beihang University, Beijing 100191, China; International Joint Research Center of Aerospace Biotechnology & Medical Engineering, Beihang University, Beijing 100191, China
Lige Yang School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China; Institution of Environmental Biology and Life Support Technology, Beihang University, Beijing 100191, China; International Joint Research Center of Aerospace Biotechnology & Medical Engineering, Beihang University, Beijing 100191, China
Dawei Liang Beijing Key Laboratory of Bio-inspired Energy Materials and Devices, Beihang University, Beijing 100191, China
Ying Zhu Key Laboratory of Bio-inspired Smart Interfacial Science and Technology of Ministry of Education, School of Chemistry and Environment, Beihang University, Beijing 100191, China
Hong Liu School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China; Institution of Environmental Biology and Life Support Technology, Beihang University, Beijing 100191, China; International Joint Research Center of Aerospace Biotechnology & Medical Engineering, Beihang University, Beijing 100191, China.

Rao C, Guyard C, Pelaz C, Wasserscheid J, Bondy-Denomy J, Dewar K, Ensminger AW. Active and adaptive Legionella CRISPR-Cas reveals a recurrent challenge to the pathogen. Cell Microbiol 2016;18:1319-38. [PMID: 26936325 PMCID: PMC5071653 DOI: 10.1111/cmi.12586] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Received: 02/18/2016] [Accepted: 02/25/2016] [Indexed: 01/04/2023]

Abstract

Clustered regularly interspaced short palindromic repeats with CRISPR‐associated gene (CRISPR‐Cas) systems are widely recognized as critical genome defense systems that protect microbes from external threats such as bacteriophage infection. Several isolates of the intracellular pathogen Legionella pneumophila possess multiple CRISPR‐Cas systems (type I‐C, type I‐F and type II‐B), yet the targets of these systems remain unknown. With the recent observation that at least one of these systems (II‐B) plays a non‐canonical role in supporting intracellular replication, the possibility remained that these systems are vestigial genome defense systems co‐opted for other purposes. Our data indicate that this is not the case. Using an established plasmid transformation assay, we demonstrate that type I‐C, I‐F and II‐B CRISPR‐Cas provide protection against spacer targets. We observe efficient laboratory acquisition of new spacers under ‘priming’ conditions, in which initially incomplete target elimination leads to the generation of new spacers and ultimate loss of the invasive DNA. Critically, we identify the first known target of L. pneumophila CRISPR‐Cas: a 30 kb episome of unknown function whose interbacterial transfer is guarded against by CRISPR‐Cas. We provide evidence that the element can subvert CRISPR‐Cas by mutating its targeted sequences – but that primed spacer acquisition may limit this mechanism of escape. Rather than generally impinging on bacterial fitness, this element drives a host specialization event – with improved fitness in Acanthamoeba but a reduced ability to replicate in other hosts and conditions. These observations add to a growing body of evidence that host range restriction can serve as an existential threat to L. pneumophila in the wild.

Fadeev E, De Pascale F, Vezzi A, Hübner S, Aharonovich D, Sher D. Why Close a Bacterial Genome? The Plasmid of Alteromonas Macleodii HOT1A3 is a Vector for Inter-Specific Transfer of a Flexible Genomic Island. Front Microbiol 2016;7:248. [PMID: 27014193 PMCID: PMC4781885 DOI: 10.3389/fmicb.2016.00248] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Received: 11/15/2015] [Accepted: 02/15/2016] [Indexed: 12/20/2022] Open

Devall M, Roubroeks J, Mill J, Weedon M, Lunnon K. Epigenetic regulation of mitochondrial function in neurodegenerative disease: New insights from advances in genomic technologies. Neurosci Lett 2016;625:47-55. [PMID: 26876477 DOI: 10.1016/j.neulet.2016.02.013] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Received: 10/13/2015] [Revised: 02/04/2016] [Accepted: 02/05/2016] [Indexed: 10/22/2022]

Wang B, Shao Y, Chen T, Chen W, Chen F. Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics. Sci Rep 2015;5:18330. [PMID: 26691589 PMCID: PMC4686929 DOI: 10.1038/srep18330] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Received: 06/19/2015] [Accepted: 11/16/2015] [Indexed: 12/11/2022] Open

DNA Methylation Assessed by SMRT Sequencing Is Linked to Mutations in Neisseria meningitidis Isolates. PLoS One 2015;10:e0144612. [PMID: 26656597 PMCID: PMC4676702 DOI: 10.1371/journal.pone.0144612] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 08/07/2015] [Accepted: 11/20/2015] [Indexed: 11/20/2022] Open

Abstract

The Gram-negative bacterium Neisseria meningitidis features extensive genetic variability. To present, proposed virulence genotypes are also detected in isolates from asymptomatic carriers, indicating more complex mechanisms underlying variable colonization modes of N. meningitidis. We applied the Single Molecule, Real-Time (SMRT) sequencing method from Pacific Biosciences to assess the genome-wide DNA modification profiles of two genetically related N. meningitidis strains, both of serogroup A. The resulting DNA methylomes revealed clear divergences, represented by the detection of shared and of strain-specific DNA methylation target motifs. The positional distribution of these methylated target sites within the genomic sequences displayed clear biases, which suggest a functional role of DNA methylation related to the regulation of genes. DNA methylation in N. meningitidis has a likely underestimated potential for variability, as evidenced by a careful analysis of the ORF status of a panel of confirmed and predicted DNA methyltransferase genes in an extended collection of N. meningitidis strains of serogroup A. Based on high coverage short sequence reads, we find phase variability as a major contributor to the variability in DNA methylation. Taking into account the phase variable loci, the inferred functional status of DNA methyltransferase genes matched the observed methylation profiles. Towards an elucidation of presently incompletely characterized functional consequences of DNA methylation in N. meningitidis, we reveal a prominent colocalization of methylated bases with Single Nucleotide Polymorphisms (SNPs) detected within our genomic sequence collection. As a novel observation we report increased mutability also at 6mA methylated nucleotides, complementing mutational hotspots previously described at 5mC methylated nucleotides. 
These findings suggest a more diverse role of DNA methylation and Restriction-Modification (RM) systems in the evolution of prokaryotic genomes.

Lin HH, Liao YC. Evaluation and Validation of Assembling Corrected PacBio Long Reads for Microbial Genome Completion via Hybrid Approaches. PLoS One 2015;10:e0144305. [PMID: 26641475 PMCID: PMC4671558 DOI: 10.1371/journal.pone.0144305] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Received: 08/14/2015] [Accepted: 11/16/2015] [Indexed: 11/23/2022] Open

Abstract

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pedobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome.
Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.

Lineage-Specific Methyltransferases Define the Methylome of the Globally Disseminated Escherichia coli ST131 Clone. mBio 2015;6:e01602-15. [PMID: 26578678 PMCID: PMC4659465 DOI: 10.1128/mbio.01602-15] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Indexed: 01/12/2023] Open

Abstract

Escherichia coli sequence type 131 (ST131) is a clone of uropathogenic E. coli that has emerged rapidly and disseminated globally in both clinical and community settings. Members of the ST131 lineage from across the globe have been comprehensively characterized in terms of antibiotic resistance, virulence potential, and pathogenicity, but to date nothing is known about the methylome of these important human pathogens. Here we used single-molecule real-time (SMRT) PacBio sequencing to determine the methylome of E. coli EC958, the most-well-characterized completely sequenced ST131 strain. Our analysis of 52,081 methylated adenines in the genome of EC958 discovered three (m6)A methylation motifs that have not been described previously. Subsequent SMRT sequencing of isogenic knockout mutants identified the two type I methyltransferases (MTases) and one type IIG MTase responsible for (m6)A methylation of novel recognition sites. Although both type I sites were rare, the type IIG sites accounted for more than 12% of all methylated adenines in EC958. Analysis of the distribution of MTase genes across 95 ST131 genomes revealed their prevalence is highly conserved within the ST131 lineage, with most variation due to the presence or absence of mobile genetic elements on which individual MTase genes are located.

IMPORTANCE

DNA modification plays a crucial role in bacterial regulation. Despite several examples demonstrating the role of methyltransferase (MTase) enzymes in bacterial virulence, investigation of this phenomenon on a whole-genome scale has remained elusive until now. Here we used single-molecule real-time (SMRT) sequencing to determine the first complete methylome of a strain from the multidrug-resistant E. coli sequence type 131 (ST131) lineage. By interrogating the methylome computationally and with further SMRT sequencing of isogenic mutants representing previously uncharacterized MTase genes, we defined the target sequences of three novel ST131-specific MTases and determined the genomic distribution of all MTase target sequences. Using a large collection of 95 previously sequenced ST131 genomes, we identified mobile genetic elements as a major factor driving diversity in DNA methylation patterns. Overall, our analysis highlights the potential for DNA methylation to dramatically influence gene regulation at the transcriptional level within a well-defined E. coli clone.

Mind the gap; seven reasons to close fragmented genome assemblies. Fungal Genet Biol 2015;90:24-30. [PMID: 26342853 DOI: 10.1016/j.fgb.2015.08.010] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Received: 07/22/2015] [Revised: 08/27/2015] [Accepted: 08/28/2015] [Indexed: 10/23/2022]

Single-Molecule Real-Time Sequencing Combined with Optical Mapping Yields Completely Finished Fungal Genome. mBio 2015;6:mBio.00936-15. [PMID: 26286689 PMCID: PMC4542186 DOI: 10.1128/mbio.00936-15] [Citation(s) in RCA: 120] [Impact Index Per Article: 13.3] [Indexed: 11/20/2022] Open

Abstract

Next-generation sequencing (NGS) technologies have increased the scalability, speed, and resolution of genomic sequencing and, thus, have revolutionized genomic studies. However, eukaryotic genome sequencing initiatives typically yield considerably fragmented genome assemblies. Here, we assessed various state-of-the-art sequencing and assembly strategies in order to produce a contiguous and complete eukaryotic genome assembly, focusing on the filamentous fungus Verticillium dahliae. Compared with Illumina-based assemblies of the V. dahliae genome, hybrid assemblies that also include PacBio-generated long reads establish superior contiguity. Intriguingly, provided that sufficient sequence depth is reached, assemblies solely based on PacBio reads outperform hybrid assemblies and even result in fully assembled chromosomes. Furthermore, the addition of optical map data allowed us to produce a gapless and complete V. dahliae genome assembly of the expected eight chromosomes from telomere to telomere. Consequently, we can now study genomic regions that were previously not assembled or poorly assembled, including regions that are populated by repetitive sequences, such as transposons, allowing us to fully appreciate an organism’s biological complexity. Our data show that a combination of PacBio-generated long reads and optical mapping can be used to generate complete and gapless assemblies of fungal genomes.

Studying whole-genome sequences has become an important aspect of biological research. The advent of next-generation sequencing (NGS) technologies has nowadays brought genomic science within reach of most research laboratories, including those that study nonmodel organisms. However, most genome sequencing initiatives typically yield (highly) fragmented genome assemblies. Nevertheless, considerable relevant information related to genome structure and evolution is likely hidden in those nonassembled regions. Here, we investigated a diverse set of strategies to obtain gapless genome assemblies, using the genome of a typical ascomycete fungus as the template. Eventually, we were able to show that a combination of PacBio-generated long reads and optical mapping yields a gapless telomere-to-telomere genome assembly, allowing in-depth genome analyses to facilitate functional studies into an organism’s biology.

Faino L, Seidl MF, Datema E, van den Berg GCM, Janssen A, Wittenberg AHJ, Thomma BPHJ. Single-Molecule Real-Time Sequencing Combined with Optical Mapping Yields Completely Finished Fungal Genome. mBio 2015. [PMID: 26286689 DOI: 10.1128/mbio.00936-15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 04/30/2023] Open

Orkunoglu-Suer F, Harralson AF, Frankfurter D, Gindoff P, O'Brien TJ. Targeted single molecule sequencing methodology for ovarian hyperstimulation syndrome. BMC Genomics 2015;16:264. [PMID: 25888426 PMCID: PMC4397691 DOI: 10.1186/s12864-015-1451-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Received: 05/14/2014] [Accepted: 03/09/2015] [Indexed: 01/27/2023] Open

Abstract

Background

Two of the most significant issues surrounding next generation sequencing are its cost and the difficulty of assembling short reads. Targeted capture enrichment of longer fragments using single molecule sequencing (SMS) is expected to improve both sequence assembly and base-call accuracy but, at present, there are very few examples of successful application of these technological advances in translational research and clinical testing. We developed a targeted single molecule sequencing (T-SMS) panel for genes implicated in ovarian response to controlled ovarian hyperstimulation (COH) for infertility.

Results

Target enrichment was carried out using droplet-based multiplex polymerase chain reaction (PCR) technology (RainDance®) designed to yield amplicons averaging 1 kb in size from 44 candidate loci (99.8% unique base-pair coverage). The total targeted sequence was 3.18 Mb per sample. SMS was carried out using single molecule, real-time DNA sequencing (SMRT®, Pacific Biosciences®; average raw read length = 1178 nucleotides, 5% of the amplicons >6000 nucleotides). After filtering with circular consensus (CCS) reads, the mean read length was 3200 nucleotides (97% CCS accuracy). Primary data analyses, alignment and filtering utilized the Pacific Biosciences® SMRT portal. Secondary analysis was conducted using the Genome Analysis Toolkit for SNP discovery and wANNOVAR for functional analysis of variants. Of the filtered functional variants, 18 of 19 (94.7%) were further confirmed using conventional Sanger sequencing. CCS reads were able to accurately detect zygosity. Coverage within GC-rich regions (i.e., VEGFR; 72% GC rich) was achieved by capturing long genomic DNA (gDNA) fragments and reading into regions that flank the capture regions. As proof of concept, a non-synonymous LHCGR variant was captured in two severe OHSS cases and verified by conventional sequencing.

Conclusions

Combining emulsion PCR-generated 1 kb amplicons and SMRT DNA sequencing permitted greater depth of coverage for T-SMS and facilitated easier sequence assembly. To the best of our knowledge, this is the first report combining emulsion PCR and T-SMS for long reads using human DNA samples, and the first NGS panel designed for biomarker discovery in OHSS.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1451-2) contains supplementary material, which is available to authorized users.

Molecular analysis of asymptomatic bacteriuria Escherichia coli strain VR50 reveals adaptation to the urinary tract by gene acquisition. Infect Immun 2015;83:1749-64. [PMID: 25667270 DOI: 10.1128/iai.02810-14] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Received: 10/20/2014] [Accepted: 01/09/2015] [Indexed: 12/21/2022] Open

Del Chierico F, Ancora M, Marcacci M, Cammà C, Putignani L, Conti S. Choice of next-generation sequencing pipelines. Methods Mol Biol 2015;1231:31-47. [PMID: 25343857 DOI: 10.1007/978-1-4939-1720-4_3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Indexed: 06/04/2023]

One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly. Curr Opin Microbiol 2014;23:110-20. [PMID: 25461581 DOI: 10.1016/j.mib.2014.11.014] [Citation(s) in RCA: 265] [Impact Index Per Article: 26.5] [Received: 07/07/2014] [Revised: 11/17/2014] [Accepted: 11/18/2014] [Indexed: 11/20/2022]

Lu S, Le S, Tan Y, Li M, Liu C, Zhang K, Huang J, Chen H, Rao X, Zhu J, Zou L, Ni Q, Li S, Wang J, Jin X, Hu Q, Yao X, Zhao X, Zhang L, Huang G, Hu F. Unlocking the mystery of the hard-to-sequence phage genome: PaP1 methylome and bacterial immunity. BMC Genomics 2014;15:803. [PMID: 25233860 PMCID: PMC4177049 DOI: 10.1186/1471-2164-15-803] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Received: 01/26/2014] [Accepted: 09/16/2014] [Indexed: 12/02/2022] Open

Abstract

Background

Whole-genome sequencing is an important method for understanding the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome, that of Pseudomonas aeruginosa phage PaP1. The shotgun sequencing method failed to complete the sequence of this genome.

Results

After persevering for 10 years and going through three generations of sequencing techniques, we successfully completed the sequence of the PaP1 genome, which has a length of 91,715 bp. Single-molecule real-time sequencing results revealed that this genome contains 51 N-6-methyladenines and 152 N-4-methylcytosines. Three significant modified sequence motifs were predicted, but not all occurrences of these motifs in the genome were methylated. Further investigations revealed a novel immune mechanism of bacteria, in which host bacteria can recognise and repel inserts containing modified bases on a large scale. This mechanism could account for the failure of the shotgun method in PaP1 genome sequencing. This problem was resolved using the nfi^- mutant of Escherichia coli DH5α as a host bacterium to construct a shotgun library.

Conclusions

This work provided insights into the hard-to-sequence phage PaP1 genome and discovered a new mechanism of bacterial immunity. The methylome of phage PaP1 is responsible for the failure of shotgun sequencing and for bacterial immunity mediated by enzyme Endo V activity; this methylome also provides a valuable resource for future studies on PaP1 genome replication and modification, as well as on gene regulation and host interaction.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-803) contains supplementary material, which is available to authorized users.

Miyamoto M, Motooka D, Gotoh K, Imai T, Yoshitake K, Goto N, Iida T, Yasunaga T, Horii T, Arakawa K, Kasahara M, Nakamura S. Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes. BMC Genomics 2014;15:699. [PMID: 25142801 PMCID: PMC4159541 DOI: 10.1186/1471-2164-15-699] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Received: 05/15/2014] [Accepted: 08/15/2014] [Indexed: 11/18/2022] Open

Abstract

Background

The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challenge. We evaluated the abilities of the following three second-generation sequencers: Roche 454 GS Junior (GS Jr), Life Technologies Ion PGM (Ion PGM), and Illumina MiSeq (MiSeq) and a third-generation sequencer, the Pacific Biosciences RS sequencer (PacBio), by sequencing and assembling the genome of Vibrio parahaemolyticus, which consists of a 5-Mb genome comprising two circular chromosomes.

Results

We sequenced the genome of V. parahaemolyticus with GS Jr, Ion PGM, MiSeq, and PacBio and performed de novo assembly with several genome assemblers. Although GS Jr generated the longest mean read length of 418 bp among the second-generation sequencers, the maximum contig length of the best assembly from GS Jr was 165 kbp, and the number of contigs was 309. Single runs of Ion PGM and MiSeq produced data of considerably greater sequencing coverage, 279× and 1,927×, respectively. The optimized result for Ion PGM contained 61 contigs assembled from reads of 77× coverage, and the longest contig was 895 kbp in size. Those for MiSeq were 34 contigs, 58× coverage, and 733 kbp, respectively. These results suggest that higher coverage depth is unnecessary for a better assembly result. We observed that multiple rRNA coding regions were fragmented in the assemblies from the second-generation sequencers, whereas PacBio generated two exceptionally long contigs of 3,288,561 and 1,875,537 bps, each of which was from a single chromosome, with 73× coverage and mean read length 3,119 bp, allowing us to determine the absolute positions of all rRNA operons.

Conclusions

PacBio outperformed the other sequencers in terms of the length of contigs and reconstructed the greatest portion of the genome, achieving a genome assembly of “finished grade” because of its long reads. It showed the potential to assemble more complex genomes with multiple chromosomes containing more repetitive sequences.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-699) contains supplementary material, which is available to authorized users.

Utturkar SM, Klingeman DM, Land ML, Schadt CW, Doktycz MJ, Pelletier DA, Brown SD. Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences. Bioinformatics 2014;30:2709-16. [PMID: 24930142 PMCID: PMC4173024 DOI: 10.1093/bioinformatics/btu391] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Indexed: 11/14/2022]

Affiliation(s)

Sagar M Utturkar Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Dawn M Klingeman Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Miriam L Land Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Christopher W Schadt Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Mitchel J Doktycz Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Dale A Pelletier Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Steven D Brown Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37919, USA and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA

Korlach J. Returning to more finished genomes. Genomics Data 2014;2:46-8. [PMID: 26484068 PMCID: PMC4535613 DOI: 10.1016/j.gdata.2014.02.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/24/2022]

Brown SD, Nagaraju S, Utturkar S, De Tissera S, Segovia S, Mitchell W, Land ML, Dassanayake A, Köpke M. Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia. Biotechnology for Biofuels 2014;7:40. [PMID: 24655715 PMCID: PMC4022347 DOI: 10.1186/1754-6834-7-40] [Citation(s) in RCA: 100] [Impact Index Per Article: 10.0] [Received: 11/11/2013] [Accepted: 02/19/2014] [Indexed: 05/04/2023]

Abstract

BACKGROUND

Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published.

RESULTS

A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, and nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, and Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, the CGAL, QUAST and REAPR bioinformatics tools, and comparative genomic approaches. Assemblies based upon shorter-read DNA technologies were confounded by the large number of repeats and their size, which in the case of the rRNA gene operons was ~5 kb. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial-scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large-scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important genes for central metabolism (such as an additional hydrogenase and the absence of a phosphoenolpyruvate synthase) and substrate utilization pathways (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii.

CONCLUSIONS

Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems.
