Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Clark SC, Egan R, Frazier PI, Wang Z. ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies. ACTA ACUST UNITED AC 2013;29:435-43. [PMID: 23303509 DOI: 10.1093/bioinformatics/bts723] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

For:	Clark SC, Egan R, Frazier PI, Wang Z. ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies. ACTA ACUST UNITED AC 2013;29:435-43. [PMID: 23303509 DOI: 10.1093/bioinformatics/bts723] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Number

Cited by Other Article(s)

Siderius NL, Sapula SA, Hart BJ, Hutchings JL, Venter H. Enterobacter adelaidei sp. nov. Isolation of an extensively drug resistant strain from hospital wastewater in Australia and the global distribution of the species. Microbiol Res 2024;288:127867. [PMID: 39163716 DOI: 10.1016/j.micres.2024.127867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 08/02/2024] [Accepted: 08/03/2024] [Indexed: 08/22/2024]

Abstract

BACKGROUND

Enterobacter species are included among the normal human gut microflora and persist in a diverse range of other environmental niches. They have become important opportunistic nosocomial pathogens known to harbour plasmid-mediated multi-class antimicrobial resistance (AMR) determinants. Global AMR surveillance of Enterobacterales isolates shows the genus is second to Klebsiella in terms of frequency of carbapenem resistance. Enterobacter taxonomy is confusing and standard species identification methods are largely inaccurate or insufficient. There are currently 27 named species and a total of 46 taxa in the genus distinguishable via average nucleotide identity (ANI) calculation between pairs of genomic sequences. Here we describe an Enterobacter strain, ECC3473, isolated from the wastewater of an Australian hospital whose species could not be determined by standard methods nor by ribosomal RNA gene multi-locus typing.

AIM

To characterise ECC3473 in terms of phenotypic and genotypic antimicrobial resistance, biochemical characteristics and taxonomy as well as to determine the global distribution of the novel species to which it belongs.

METHODS

Standard broth dilution and disk diffusion were used to determine phenotypic AMR. The strain's complete genome, including plasmids, was obtained following long- and short read sequencing and a novel long/short read hybrid assembly and polishing, and the genomic basis of AMR was determined. Phylogenomic analysis and quantitative measures of relatedness (ANI, digital DNA-DNA hybridisation, and difference in G+C content) were used to study the taxonomic relationship between ECC3473 and Enterobacter type-strains. NCBI and PubMLST databases and the literature were searched for additional members of the novel species to determine its global distribution.

RESULTS

ECC3473 is one of 21 strains isolated globally belonging to a novel Enterobacter species for which the name, Enterobacter adelaidei sp. nov. is proposed. The novel species was found to be resilient in its capacity to persist in contaminated water and adaptable in its ability to accumulate multiple transmissible AMR determinants.

CONCLUSION

E. adelaidei sp. nov. may become increasingly important to the dissemination of AMR.

Collapse

Williams SK, Jerlström Hultqvist J, Eglit Y, Salas-Leiva DE, Curtis B, Orr RJS, Stairs CW, Atalay TN, MacMillan N, Simpson AGB, Roger AJ. Extreme mitochondrial reduction in a novel group of free-living metamonads. Nat Commun 2024;15:6805. [PMID: 39122691 PMCID: PMC11316075 DOI: 10.1038/s41467-024-50991-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 07/15/2024] [Indexed: 08/12/2024] Open

Bouras G, Judd LM, Edwards RA, Vreugde S, Stinear TP, Wick RR. How low can you go? Short-read polishing of Oxford Nanopore bacterial genome assemblies. Microb Genom 2024;10:001254. [PMID: 38833287 PMCID: PMC11261834 DOI: 10.1099/mgen.0.001254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 04/30/2024] [Indexed: 06/06/2024] Open

Bouras G, Houtak G, Wick RR, Mallawaarachchi V, Roach MJ, Papudeshi B, Judd LM, Sheppard AE, Edwards RA, Vreugde S. Hybracter: enabling scalable, automated, complete and accurate bacterial genome assemblies. Microb Genom 2024;10:001244. [PMID: 38717808 PMCID: PMC11165638 DOI: 10.1099/mgen.0.001244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 04/16/2024] [Indexed: 05/21/2024] Open

Affiliation(s)

George Bouras Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia The Department of Surgery – Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia
Ghais Houtak Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia The Department of Surgery – Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia
Ryan R. Wick Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
Vijini Mallawaarachchi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
Michael J. Roach Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia Adelaide Centre for Epigenetics and South Australian Immunogenomics Cancer Institute, The University of Adelaide, Adelaide, Australia
Bhavya Papudeshi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
Lousie M. Judd Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
Anna E. Sheppard School of Biological Sciences, The University of Adelaide, Adelaide, Australia
Robert A. Edwards Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
Sarah Vreugde Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia The Department of Surgery – Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia

Collapse

Bouras G, Houtak G, Wick RR, Mallawaarachchi V, Roach MJ, Papudeshi B, Judd LM, Sheppard AE, Edwards RA, Vreugde S. Hybracter: Enabling Scalable, Automated, Complete and Accurate Bacterial Genome Assemblies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.12.571215. [PMID: 38168369 PMCID: PMC10760025 DOI: 10.1101/2023.12.12.571215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]

Affiliation(s)

George Bouras Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, South Australia, Australia
Ghais Houtak Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, South Australia, Australia
Ryan R. Wick Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
Vijini Mallawaarachchi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
Michael J. Roach Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia Adelaide Centre for Epigenetics and South Australian Immunogenomics Cancer Institute, The University of Adelaide, Adelaide, Australia
Bhavya Papudeshi Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
Lousie M. Judd Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
Anna E. Sheppard School of Biological Sciences, The University of Adelaide, Adelaide, Australia
Robert A. Edwards Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, Australia
Sarah Vreugde Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, South Australia, Australia

Collapse

Li K, Xu P, Wang J, Yi X, Jiao Y. Identification of errors in draft genome assemblies at single-nucleotide resolution for quality assessment and improvement. Nat Commun 2023;14:6556. [PMID: 37848433 PMCID: PMC10582259 DOI: 10.1038/s41467-023-42336-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 10/05/2023] [Indexed: 10/19/2023] Open

Rafique Q, Rehman A, Afghan MS, Ahmad HM, Zafar I, Fayyaz K, Ain Q, Rayan RA, Al-Aidarous KM, Rashid S, Mushtaq G, Sharma R. Reviewing methods of deep learning for diagnosing COVID-19, its variants and synergistic medicine combinations. Comput Biol Med 2023;163:107191. [PMID: 37354819 PMCID: PMC10281043 DOI: 10.1016/j.compbiomed.2023.107191] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 05/28/2023] [Accepted: 06/19/2023] [Indexed: 06/26/2023]

Medvedev P. Theoretical Analysis of Sequencing Bioinformatics Algorithms and Beyond. COMMUNICATIONS OF THE ACM 2023;66:118-125. [PMID: 38736702 PMCID: PMC11087067 DOI: 10.1145/3571723] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/14/2024]

Žárský V, Karnkowska A, Boscaro V, Trznadel M, Whelan TA, Hiltunen-Thorén M, Onut-Brännström I, Abbott CL, Fast NM, Burki F, Keeling PJ. Contrasting outcomes of genome reduction in mikrocytids and microsporidians. BMC Biol 2023;21:137. [PMID: 37280585 DOI: 10.1186/s12915-023-01635-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 05/26/2023] [Indexed: 06/08/2023] Open

Abstract

BACKGROUND

Intracellular symbionts often undergo genome reduction, losing both coding and non-coding DNA in a process that ultimately produces small, gene-dense genomes with few genes. Among eukaryotes, an extreme example is found in microsporidians, which are anaerobic, obligate intracellular parasites related to fungi that have the smallest nuclear genomes known (except for the relic nucleomorphs of some secondary plastids). Mikrocytids are superficially similar to microsporidians: they are also small, reduced, obligate parasites; however, as they belong to a very different branch of the tree of eukaryotes, the rhizarians, such similarities must have evolved in parallel. Since little genomic data are available from mikrocytids, we assembled a draft genome of the type species, Mikrocytos mackini, and compared the genomic architecture and content of microsporidians and mikrocytids to identify common characteristics of reduction and possible convergent evolution.

RESULTS

At the coarsest level, the genome of M. mackini does not exhibit signs of extreme genome reduction; at 49.7 Mbp with 14,372 genes, the assembly is much larger and gene-rich than those of microsporidians. However, much of the genomic sequence and most (8075) of the protein-coding genes code for transposons, and may not contribute much of functional relevance to the parasite. Indeed, the energy and carbon metabolism of M. mackini share several similarities with those of microsporidians. Overall, the predicted proteome involved in cellular functions is quite reduced and gene sequences are extremely divergent. Microsporidians and mikrocytids also share highly reduced spliceosomes that have retained a strikingly similar subset of proteins despite having reduced independently. In contrast, the spliceosomal introns in mikrocytids are very different from those of microsporidians in that they are numerous, conserved in sequence, and constrained to an exceptionally narrow size range (all 16 or 17 nucleotides long) at the shortest extreme of known intron lengths.

CONCLUSIONS

Nuclear genome reduction has taken place many times and has proceeded along different routes in different lineages. Mikrocytids show a mix of similarities and differences with other extreme cases, including uncoupling the actual size of a genome with its functional reduction.

Collapse

Mineeva O, Danciu D, Schölkopf B, Ley RE, Rätsch G, Youngblut ND. ResMiCo: Increasing the quality of metagenome-assembled genomes with deep learning. PLoS Comput Biol 2023;19:e1011001. [PMID: 37126495 PMCID: PMC10174551 DOI: 10.1371/journal.pcbi.1011001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 05/11/2023] [Accepted: 03/06/2023] [Indexed: 05/02/2023] Open

Jia L, Wu Y, Dong Y, Chen J, Chen WH, Zhao XM. A survey on computational strategies for genome-resolved gut metagenomics. Brief Bioinform 2023;24:7145904. [PMID: 37114640 DOI: 10.1093/bib/bbad162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 03/20/2023] [Accepted: 04/04/2023] [Indexed: 04/29/2023] Open

Affiliation(s)

Longhao Jia Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
Yingjian Wu Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
Yanqi Dong Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
Jingchao Chen Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
Wei-Hua Chen Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai 264003, China
Xing-Ming Zhao Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Ministry of Education, Shanghai 200433, China MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China

Collapse

Bizic M, Brad T, Ionescu D, Barbu-Tudoran L, Zoccarato L, Aerts JW, Contarini PE, Gros O, Volland JM, Popa R, Ody J, Vellone D, Flot JF, Tighe S, Sarbu SM. Cave Thiovulum (Candidatus Thiovulum stygium) differs metabolically and genomically from marine species. THE ISME JOURNAL 2023;17:340-353. [PMID: 36528730 PMCID: PMC9938260 DOI: 10.1038/s41396-022-01350-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 11/29/2022] [Accepted: 12/02/2022] [Indexed: 12/23/2022]

Affiliation(s)

Mina Bizic Leibniz Institute for Freshwater Ecology and Inland Fisheries, IGB, Dep 3, Plankton and Microbial Ecology, Zur Alte Fischerhütte 2, OT Neuglobsow, 16775, Stechlin, Germany. .,Berlin-Brandenburg Institute of Advanced Biodiversity Research (BBIB), Berlin, Germany.
Traian Brad "Emil Racoviţă" Institute of Speleology, Clinicilor 5-7, 400006, Cluj-Napoca Romania, Romania.
Danny Ionescu Leibniz Institute for Freshwater Ecology and Inland Fisheries, IGB, Dep 3, Plankton and Microbial Ecology, Zur Alte Fischerhütte 2, OT Neuglobsow, 16775, Stechlin, Germany. .,Berlin-Brandenburg Institute of Advanced Biodiversity Research (BBIB), Berlin, Germany.
Lucian Barbu-Tudoran grid.7399.40000 0004 1937 1397Center for Electron Microscopy, “Babeș-Bolyai” University, Clinicilor 5, 400006 Cluj-Napoca, Romania
Luca Zoccarato Leibniz Institute for Freshwater Ecology and Inland Fisheries, IGB, Dep 3, Plankton and Microbial Ecology, Zur Alte Fischerhütte 2, OT Neuglobsow, 16775 Stechlin, Germany ,5grid.5173.00000 0001 2298 5320Institute of Computational Biology, University of Natural Resources and Life Sciences, Gregor-Mendel-Straße 3, 31180 Vienna, Austria
Joost W. Aerts grid.12380.380000 0004 1754 9227Department of Molecular Cell Physiology, Faculty of Earth and Life sciences, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands
Paul-Emile Contarini Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum National d’Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, 97110 Pointe-à-Pitre, France ,8Laboratory for Research in Complex Systems, Menlo Park, CA USA
Olivier Gros Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum National d’Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, 97110 Pointe-à-Pitre, France
Jean-Marie Volland Laboratory for Research in Complex Systems, Menlo Park, CA USA ,9grid.184769.50000 0001 2231 4551Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 94720 Berkeley, CA USA
Radu Popa River Road Research, 62 Leslie St, Buffalo, NY 1421 USA
Jessica Ody grid.4989.c0000 0001 2348 0746Evolutionary Biology and Ecology, Université libre de Bruxelles (ULB), C.P. 160/12, Avenue F.D. Roosevelt 50, 1050 Brussels, Belgium
Daniel Vellone grid.59062.380000 0004 1936 7689Vermont Integrative Genomics Lab, University of Vermont Cancer Center, Health Science Research Facility, Burlington, Vermont, VT 05405 USA
Jean-François Flot grid.4989.c0000 0001 2348 0746Evolutionary Biology and Ecology, Université libre de Bruxelles (ULB), C.P. 160/12, Avenue F.D. Roosevelt 50, 1050 Brussels, Belgium ,13Interuniversity Institute of Bioinformatics in Brussels—(IB)², Brussels, Belgium
Scott Tighe grid.59062.380000 0004 1936 7689Vermont Integrative Genomics Lab, University of Vermont Cancer Center, Health Science Research Facility, Burlington, Vermont, VT 05405 USA
Serban M. Sarbu grid.501624.40000 0001 2260 1489“Emil Racoviţă” Institute of Speleology, Frumoasă 31-B, 010986 Bucureşti, Romania ,15grid.253555.10000 0001 2297 1981Department of Biological Sciences, California State University, Chico, CA 95929 USA

Collapse

Wick RR, Judd LM, Holt KE. Assembling the perfect bacterial genome using Oxford Nanopore and Illumina sequencing. PLoS Comput Biol 2023;19:e1010905. [PMID: 36862631 PMCID: PMC9980784 DOI: 10.1371/journal.pcbi.1010905] [Citation(s) in RCA: 36] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2023] Open

Lai S, Pan S, Sun C, Coelho LP, Chen WH, Zhao XM. metaMIC: reference-free misassembly identification and correction of de novo metagenomic assemblies. Genome Biol 2022;23:242. [PMID: 36376928 PMCID: PMC9661791 DOI: 10.1186/s13059-022-02810-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 11/01/2022] [Indexed: 11/16/2022] Open

Affiliation(s)

Senying Lai Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
Shaojun Pan Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
Chuqing Sun Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei China
Luis Pedro Coelho Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
Wei-Hua Chen Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei China College of Life Science, Henan Normal University, Xinxiang, Henan China
Xing-Ming Zhao Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China Research Institute of Intelligent Complex Systems, Fudan University, Shanghai, China International Human Phenome Institutes (Shanghai), Shanghai, China Zhangjiang Fudan International Innovation Center, Shanghai, China

Collapse

Fan J, Chan S, Patro R. Perplexity: evaluating transcript abundance estimation in the absence of ground truth. Algorithms Mol Biol 2022;17:6. [PMID: 35331283 PMCID: PMC8951746 DOI: 10.1186/s13015-022-00214-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 03/01/2022] [Indexed: 11/20/2022] Open

Wick RR, Holt KE. Polypolish: Short-read polishing of long-read bacterial genome assemblies. PLoS Comput Biol 2022;18:e1009802. [PMID: 35073327 PMCID: PMC8812927 DOI: 10.1371/journal.pcbi.1009802] [Citation(s) in RCA: 209] [Impact Index Per Article: 104.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 02/03/2022] [Accepted: 01/03/2022] [Indexed: 12/12/2022] Open

Abstract

Long-read-only bacterial genome assemblies usually contain residual errors, most commonly homopolymer-length errors. Short-read polishing tools can use short reads to fix these errors, but most rely on short-read alignment which is unreliable in repeat regions. Errors in such regions are therefore challenging to fix and often remain after short-read polishing. Here we introduce Polypolish, a new short-read polisher which uses all-per-read alignments to repair errors in repeat sequences that other polishers cannot. Polypolish performed well in benchmarking tests using both simulated and real reads, and it almost never introduced errors during polishing. The best results were achieved by using Polypolish in combination with other short-read polishers.

Recent improvements in Oxford Nanopore Technologies sequencing platforms and assembly algorithms have made it easier than ever to generate complete bacterial genome sequences. However, Oxford Nanopore genome sequences suffer from errors that limit their utility in downstream analyses. To fix these errors, one can ‘polish’ the genome with Illumina sequencing, exploiting the fact that Oxford Nanopore and Illumina sequencing have different error profiles. There are several polishing tools which can fix most errors in an Oxford Nanopore genome, but they struggle with errors in repetitive regions of the genome. With this in mind, we have developed a polisher, Polypolish, which uses a novel approach that allows it to fix more errors in genomic repeats. Our results show that Polypolish is both effective at repairing sequence errors and very unlikely to introduce new errors. Polypolish can often fix errors that other polishers cannot and vice versa, so the best results come from using a combination of tools. Polypolish therefore has an important role in bacterial genome assembly methods that aim for the highest possible sequence accuracy.

Collapse

Genome assembly and annotation. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00013-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Poláková E, Albanaz ATS, Zakharova A, Novozhilova TS, Gerasimov ES, Yurchenko V. Ku80 is involved in telomere maintenance but dispensable for genomic stability in Leishmania mexicana. PLoS Negl Trop Dis 2021;15:e0010041. [PMID: 34965251 PMCID: PMC8716037 DOI: 10.1371/journal.pntd.0010041] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 11/30/2021] [Indexed: 01/09/2023] Open

MacDonald ML, Lee KH. EvalDNA: a machine learning-based tool for the comprehensive evaluation of mammalian genome assembly quality. BMC Bioinformatics 2021;22:570. [PMID: 34837948 PMCID: PMC8627028 DOI: 10.1186/s12859-021-04480-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 11/15/2021] [Indexed: 11/16/2022] Open

Abstract

Background

To select the most complete, continuous, and accurate assembly for an organism of interest, comprehensive quality assessment of assemblies is necessary. We present a novel tool, called Evaluation of De Novo Assemblies (EvalDNA), which uses supervised machine learning for the quality scoring of genome assemblies and does not require an existing reference genome for accuracy assessment.

Results

EvalDNA calculates a list of quality metrics from an assembled sequence and applies a model created from supervised machine learning methods to integrate various metrics into a comprehensive quality score. A well-tested, accurate model for scoring mammalian genome sequences is provided as part of EvalDNA. This random forest regression model evaluates an assembled sequence based on continuity, completeness, and accuracy, and was able to explain 86% of the variation in reference-based quality scores within the testing data. EvalDNA was applied to human chromosome 14 assemblies from the GAGE study to rank genome assemblers and to compare EvalDNA to two other quality evaluation tools. In addition, EvalDNA was used to evaluate several genome assemblies of the Chinese hamster genome to help establish a better reference genome for the biopharmaceutical manufacturing community. EvalDNA was also used to assess more recent human assemblies from the QUAST-LG study completed in 2018, and its ability to score bacterial genomes was examined through application on bacterial assemblies from the GAGE-B study.

Conclusions

EvalDNA enables scientists to easily identify the best available genome assembly for their organism of interest without requiring a reference assembly. EvalDNA sets itself apart from other quality assessment tools by producing a quality score that enables direct comparison among assemblies from different species.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-021-04480-2.

Collapse

D’aes J, Fraiture MA, Bogaerts B, De Keersmaecker SCJ, Roosens NHC, Vanneste K. Characterization of Genetically Modified Microorganisms Using Short- and Long-Read Whole-Genome Sequencing Reveals Contaminations of Related Origin in Multiple Commercial Food Enzyme Products. Foods 2021;10:2637. [PMID: 34828918 PMCID: PMC8624754 DOI: 10.3390/foods10112637] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 10/22/2021] [Accepted: 10/28/2021] [Indexed: 12/02/2022] Open

Music of metagenomics-a review of its applications, analysis pipeline, and associated tools. Funct Integr Genomics 2021;22:3-26. [PMID: 34657989 DOI: 10.1007/s10142-021-00810-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 09/25/2021] [Accepted: 10/03/2021] [Indexed: 10/20/2022]

Tihelka E, Cai C, Giacomelli M, Lozano-Fernandez J, Rota-Stabelli O, Huang D, Engel MS, Donoghue PCJ, Pisani D. The evolution of insect biodiversity. Curr Biol 2021;31:R1299-R1311. [PMID: 34637741 DOI: 10.1016/j.cub.2021.08.057] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Wick RR, Judd LM, Cerdeira LT, Hawkey J, Méric G, Vezina B, Wyres KL, Holt KE. Trycycler: consensus long-read assemblies for bacterial genomes. Genome Biol 2021;22:266. [PMID: 34521459 PMCID: PMC8442456 DOI: 10.1186/s13059-021-02483-z] [Citation(s) in RCA: 166] [Impact Index Per Article: 55.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 08/31/2021] [Indexed: 01/23/2023] Open

Urban JM, Foulk MS, Bliss JE, Coleman CM, Lu N, Mazloom R, Brown SJ, Spradling AC, Gerbi SA. High contiguity de novo genome assembly and DNA modification analyses for the fungus fly, Sciara coprophila, using single-molecule sequencing. BMC Genomics 2021;22:643. [PMID: 34488624 PMCID: PMC8419958 DOI: 10.1186/s12864-021-07926-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Accepted: 08/08/2021] [Indexed: 12/26/2022] Open

Abstract

BACKGROUND

The lower Dipteran fungus fly, Sciara coprophila, has many unique biological features that challenge the rule of genome DNA constancy. For example, Sciara undergoes paternal chromosome elimination and maternal X chromosome nondisjunction during spermatogenesis, paternal X elimination during embryogenesis, intrachromosomal DNA amplification of DNA puff loci during larval development, and germline-limited chromosome elimination from all somatic cells. Paternal chromosome elimination in Sciara was the first observation of imprinting, though the mechanism remains a mystery. Here, we present the first draft genome sequence for Sciara coprophila to take a large step forward in addressing these features.

RESULTS

We assembled the Sciara genome using PacBio, Nanopore, and Illumina sequencing. To find an optimal assembly using these datasets, we generated 44 short-read and 50 long-read assemblies. We ranked assemblies using 27 metrics assessing contiguity, gene content, and dataset concordance. The highest-ranking assemblies were scaffolded using BioNano optical maps. RNA-seq datasets from multiple life stages and both sexes facilitated genome annotation. A set of 66 metrics was used to select the first draft assembly for Sciara. Nearly half of the Sciara genome sequence was anchored into chromosomes, and all scaffolds were classified as X-linked or autosomal by coverage.

CONCLUSIONS

We determined that X-linked genes in Sciara males undergo dosage compensation. An entire bacterial genome from the Rickettsia genus, a group known to be endosymbionts in insects, was co-assembled with the Sciara genome, opening the possibility that Rickettsia may function in sex determination in Sciara. Finally, the signal level of the PacBio and Nanopore data support the presence of cytosine and adenine modifications in the Sciara genome, consistent with a possible role in imprinting.

Collapse

Kayani MUR, Huang W, Feng R, Chen L. Genome-resolved metagenomics using environmental and clinical samples. Brief Bioinform 2021;22:bbab030. [PMID: 33758906 PMCID: PMC8425419 DOI: 10.1093/bib/bbab030] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 11/29/2020] [Accepted: 01/20/2021] [Indexed: 12/25/2022] Open

Ayling M, Clark MD, Leggett RM. New approaches for metagenome assembly with short reads. Brief Bioinform 2021;21:584-594. [PMID: 30815668 PMCID: PMC7299287 DOI: 10.1093/bib/bbz020] [Citation(s) in RCA: 100] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 01/31/2019] [Accepted: 02/01/2019] [Indexed: 02/07/2023] Open

Re-examination of two diatom reference genomes using long-read sequencing. BMC Genomics 2021;22:379. [PMID: 34030633 PMCID: PMC8147415 DOI: 10.1186/s12864-021-07666-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 04/26/2021] [Indexed: 12/03/2022] Open

Abstract

Background

The marine diatoms Thalassiosira pseudonana and Phaeodactylum tricornutum are valuable model organisms for exploring the evolution, diversity and ecology of this important algal group. Their reference genomes, published in 2004 and 2008, respectively, were the product of traditional Sanger sequencing. In the case of T. pseudonana, optical restriction site mapping was employed to further clarify and contextualize chromosome-level scaffolds. While both genomes are considered highly accurate and reasonably contiguous, they still contain many unresolved regions and unordered/unlinked scaffolds.

Results

We have used Oxford Nanopore Technologies long-read sequencing to update and validate the quality and contiguity of the T. pseudonana and P. tricornutum genomes. Fine-scale assessment of our long-read derived genome assemblies allowed us to resolve previously uncertain genomic regions, further characterize complex structural variation, and re-evaluate the repetitive DNA content of both genomes. We also identified 1862 previously undescribed genes in T. pseudonana. In P. tricornutum, we used transposable element detection software to identify 33 novel copia-type LTR-RT insertions, indicating ongoing activity and rapid expansion of this superfamily as the organism continues to be maintained in culture. Finally, Bionano optical mapping of P. tricornutum chromosomes was combined with long-read sequence data to explore the potential of long-read sequencing and optical mapping for resolving haplotypes.

Conclusion

Despite its potential to yield highly contiguous scaffolds, long-read sequencing is not a panacea. Even for relatively small nuclear genomes such as those investigated herein, repetitive DNA sequences cause problems for current genome assembly algorithms. Determining whether a long-read derived genomic assembly is ‘better’ than one produced using traditional sequence data is not straightforward. Our revised reference genomes for P. tricornutum and T. pseudonana nevertheless provide additional insight into the structure and evolution of both genomes, thereby providing a more robust foundation for future diatom research.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-021-07666-3.

Collapse

Meyer F, Lesker TR, Koslicki D, Fritz A, Gurevich A, Darling AE, Sczyrba A, Bremges A, McHardy AC. Tutorial: assessing metagenomics software with the CAMI benchmarking toolkit. Nat Protoc 2021;16:1785-1801. [PMID: 33649565 DOI: 10.1038/s41596-020-00480-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Accepted: 11/26/2020] [Indexed: 01/31/2023]

Complete and Circularized Bacterial Genome Sequence of Gordonia sp. Strain X0973. Microbiol Resour Announc 2021;10:10/9/e01479-20. [PMID: 33664146 PMCID: PMC7936644 DOI: 10.1128/mra.01479-20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Lipworth S, Pickford H, Sanderson N, Chau KK, Kavanagh J, Barker L, Vaughan A, Swann J, Andersson M, Jeffery K, Morgan M, Peto TEA, Crook DW, Stoesser N, Walker AS. Optimized use of Oxford Nanopore flowcells for hybrid assemblies. Microb Genom 2020;6:mgen000453. [PMID: 33174830 PMCID: PMC7725331 DOI: 10.1099/mgen.0.000453] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 09/25/2020] [Indexed: 01/16/2023] Open

Affiliation(s)

Samuel Lipworth Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK
Hayleah Pickford Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK
Nicholas Sanderson Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK NIHR Oxford Biomedical Research Centre, Oxford, UK
Kevin K. Chau Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK
James Kavanagh Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK
Leanne Barker Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK
Alison Vaughan Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK NIHR Oxford Biomedical Research Centre, Oxford, UK
Jeremy Swann Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at the University of Oxford in partnership with Public Health England, Oxford, UK
Monique Andersson Department of Clinical Microbiology, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
Katie Jeffery Department of Clinical Microbiology, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
Marcus Morgan Department of Clinical Microbiology, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
Timothy E. A. Peto Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK NIHR Oxford Biomedical Research Centre, Oxford, UK
Derrick W. Crook Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK NIHR Oxford Biomedical Research Centre, Oxford, UK Department of Clinical Microbiology, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
Nicole Stoesser Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK Department of Clinical Microbiology, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
A. Sarah Walker Modernising Medical Microbiology, Nuffield Department of Medicine, University of Oxford, UK NIHR Oxford Biomedical Research Centre, Oxford, UK

Collapse

Mineeva O, Rojas-Carulla M, Ley RE, Schölkopf B, Youngblut ND. DeepMAsED: evaluating the quality of metagenomic assemblies. Bioinformatics 2020;36:3011-3017. [PMID: 32096824 DOI: 10.1093/bioinformatics/btaa124] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 01/19/2020] [Accepted: 02/18/2020] [Indexed: 11/13/2022] Open

Complete and Circularized Genome Assemblies of the Kroppenstedtia eburnea Genus Type Strain and the Kroppenstedtia pulmonis Species Type Strain with MiSeq and MinION Sequence Data. Microbiol Resour Announc 2020;9:9/44/e00650-20. [PMID: 33122418 PMCID: PMC7595940 DOI: 10.1128/mra.00650-20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Padovani de Souza K, Setubal JC, Ponce de Leon F de Carvalho AC, Oliveira G, Chateau A, Alves R. Machine learning meets genome assembly. Brief Bioinform 2020;20:2116-2129. [PMID: 30137230 DOI: 10.1093/bib/bby072] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 07/11/2018] [Accepted: 07/22/2018] [Indexed: 12/23/2022] Open

Prussing C, Snavely EA, Singh N, Lapierre P, Lasek-Nesselquist E, Mitchell K, Haas W, Owsiak R, Nazarian E, Musser KA. Nanopore MinION Sequencing Reveals Possible Transfer of bla _KPC-2 Plasmid Across Bacterial Species in Two Healthcare Facilities. Front Microbiol 2020;11:2007. [PMID: 32973725 PMCID: PMC7466660 DOI: 10.3389/fmicb.2020.02007] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 07/29/2020] [Indexed: 11/13/2022] Open

Jung H, Jeon MS, Hodgett M, Waterhouse P, Eyun SI. Comparative Evaluation of Genome Assemblers from Long-Read Sequencing for Plants and Crops. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2020;68:7670-7677. [PMID: 32530283 DOI: 10.1021/acs.jafc.0c01647] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]

Mikheenko A, Bzikadze AV, Gurevich A, Miga KH, Pevzner PA. TandemTools: mapping long reads and assessing/improving assembly quality in extra-long tandem repeats. Bioinformatics 2020;36:i75-i83. [PMID: 32657355 PMCID: PMC7355294 DOI: 10.1093/bioinformatics/btaa440] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open

Bohr LL, Mortimer TD, Pepperell CS. Lateral Gene Transfer Shapes Diversity of Gardnerella spp. Front Cell Infect Microbiol 2020;10:293. [PMID: 32656099 PMCID: PMC7324480 DOI: 10.3389/fcimb.2020.00293] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 05/18/2020] [Indexed: 12/13/2022] Open

Abstract

Gardnerella spp. are pathognomonic for bacterial vaginosis, which increases the risk of preterm birth and the transmission of sexually transmitted infections. Gardnerella spp. are genetically diverse, comprising what have recently been defined as distinct species with differing functional capacities. Disease associations with Gardnerella spp. are not straightforward: patients with BV are usually infected with multiple species, and Gardnerella spp. are also found in the vaginal microbiome of healthy women. Genome comparisons of Gardnerella spp. show evidence of lateral gene transfer (LGT), but patterns of LGT have not been characterized in detail. Here we sought to define the role of LGT in shaping the genetic structure of Gardnerella spp. We analyzed whole genome sequencing data for 106 Gardnerella strains and used these data for pan genome analysis and to characterize LGT in the core and accessory genomes, over recent and remote timescales. In our diverse sample of Gardnerella strains, we found that both the core and accessory genomes are clearly differentiated in accordance with newly defined species designations. We identified putative competence and pilus assembly genes across most species; we also found them to be differentiated between species. Competence machinery has diverged in parallel with the core genome, with selection against deleterious mutations as a predominant influence on their evolution. By contrast, the virulence factor vaginolysin, which encodes a toxin, appears to be readily exchanged among species. We identified five distinct prophage clusters in Gardnerella genomes, two of which appear to be exchanged between Gardnerella species. Differences among species are apparent in their patterns of LGT, including their exchange with diverse gene pools. Despite frequent LGT and co-localization in the same niche, our results show that Gardnerella spp. are clearly genetically differentiated and yet capable of exchanging specific genetic material. This likely reflects complex interactions within bacterial communities associated with the vaginal microbiome. Our results provide insight into how such interactions evolve and are maintained, allowing these multi-species communities to colonize and invade human tissues and adapt to antibiotics and other stressors.

Collapse

Olson ND, Treangen TJ, Hill CM, Cepeda-Espinoza V, Ghurye J, Koren S, Pop M. Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes. Brief Bioinform 2020;20:1140-1150. [PMID: 28968737 DOI: 10.1093/bib/bbx098] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Revised: 07/13/2017] [Indexed: 01/09/2023] Open

Alqahtani F, Măndoiu II. Statistical Mitogenome Assembly with RepeaTs. J Comput Biol 2020;27:1407-1421. [PMID: 32048871 DOI: 10.1089/cmb.2019.0505] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open

Abstract

By using next-generation sequencing technologies, it is possible to quickly and inexpensively generate large numbers of relatively short reads from both the nuclear and mitochondrial DNA (mtDNA) contained in a biological sample. Unfortunately, assembling such whole-genome sequencing (WGS) data with standard de novo assemblers often fails to generate high-quality mitochondrial genome sequences due to the large difference in copy number (and hence sequencing depth) between the mitochondrial and nuclear genomes. Assembly of complete mitochondrial genome sequences is further complicated by the fact that many de novo assemblers are not designed for circular genomes and by the presence of repeats in the mitochondrial genomes of some species. In this article, we describe the Statistical Mitogenome Assembly with RepeaTs (SMART) pipeline for automated assembly of mitochondrial genomes from WGS data. SMART uses an efficient coverage-based filter to first select a subset of reads enriched in mtDNA sequences. Contigs produced by an initial assembly step are filtered using the Basic Local Alignment Search Tool searches against a comprehensive mitochondrial genome database and are used as "baits" for an alignment-based filter that produces the set of reads used in a second de novo assembly and scaffolding step. In the presence of repeats, the possible paths through the assembly graph are evaluated using a maximum likelihood model. Additionally, the assembly process is repeated for a user-specified number of times on resampled subsets of reads to select for annotation of the reconstructed sequences with highest bootstrap support. Experiments on WGS data sets from a variety of species show that the SMART pipeline produces complete circular mitochondrial genome sequences with a higher success rate than current state-of-the-art tools, particularly for low-coverage WGS data sets.

Collapse

Luo Y, Liao X, Wu FX, Wang J. Computational Approaches for Transcriptome Assembly Based on Sequencing Technologies. Curr Bioinform 2020. [DOI: 10.2174/1574893614666190410155603] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Nethery MA, Henriksen ED, Daughtry KV, Johanningsmeier SD, Barrangou R. Comparative genomics of eight Lactobacillus buchneri strains isolated from food spoilage. BMC Genomics 2019;20:902. [PMID: 31775607 PMCID: PMC6881996 DOI: 10.1186/s12864-019-6274-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 11/12/2019] [Indexed: 12/22/2022] Open

Royo-Llonch M, Sánchez P, González JM, Pedrós-Alió C, Acinas SG. Ecological and functional capabilities of an uncultured Kordia sp. Syst Appl Microbiol 2019;43:126045. [PMID: 31831198 DOI: 10.1016/j.syapm.2019.126045] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 10/28/2019] [Accepted: 11/12/2019] [Indexed: 01/07/2023]

Athena: Automated Tuning of k-mer based Genomic Error Correction Algorithms using Language Models. Sci Rep 2019;9:16157. [PMID: 31695060 PMCID: PMC6834855 DOI: 10.1038/s41598-019-52196-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 10/07/2019] [Indexed: 01/30/2023] Open

Abstract

The performance of most error-correction (EC) algorithms that operate on genomics reads is dependent on the proper choice of its configuration parameters, such as the value of k in k-mer based techniques. In this work, we target the problem of finding the best values of these configuration parameters to optimize error correction and consequently improve genome assembly. We perform this in an adaptive manner, adapted to different datasets and to EC tools, due to the observation that different configuration parameters are optimal for different datasets, i.e., from different platforms and species, and vary with the EC algorithm being applied. We use language modeling techniques from the Natural Language Processing (NLP) domain in our algorithmic suite, Athena, to automatically tune the performance-sensitive configuration parameters. Through the use of N-Gram and Recurrent Neural Network (RNN) language modeling, we validate the intuition that the EC performance can be computed quantitatively and efficiently using the “perplexity” metric, repurposed from NLP. After training the language model, we show that the perplexity metric calculated from a sample of the test (or production) data has a strong negative correlation with the quality of error correction of erroneous NGS reads. Therefore, we use the perplexity metric to guide a hill climbing-based search, converging toward the best configuration parameter value. Our approach is suitable for both de novo and comparative sequencing (resequencing), eliminating the need for a reference genome to serve as the ground truth. We find that Athena can automatically find the optimal value of k with a very high accuracy for 7 real datasets and using 3 different k-mer based EC algorithms, Lighter, Blue, and Racer. The inverse relation between the perplexity metric and alignment rate exists under all our tested conditions—for real and synthetic datasets, for all kinds of sequencing errors (insertion, deletion, and substitution), and for high and low error rates. The absolute value of that correlation is at least 73%. In our experiments, the best value of k found by Athena achieves an alignment rate within 0.53% of the oracle best value of k found through brute force searching (i.e., scanning through the entire range of k values). Athena’s selected value of k lies within the top-3 best k values using N-Gram models and the top-5 best k values using RNN models With best parameter selection by Athena, the assembly quality (NG50) is improved by a Geometric Mean of 4.72X across the 7 real datasets.

Collapse

Sydenham TV, Overballe-Petersen S, Hasman H, Wexler H, Kemp M, Justesen US. Complete hybrid genome assembly of clinical multidrug-resistant Bacteroides fragilis isolates enables comprehensive identification of antimicrobial-resistance genes and plasmids. Microb Genom 2019;5:e000312. [PMID: 31697231 PMCID: PMC6927303 DOI: 10.1099/mgen.0.000312] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Accepted: 10/17/2019] [Indexed: 02/06/2023] Open

Abstract

Bacteroides fragilis constitutes a significant part of the normal human gut microbiota and can also act as an opportunistic pathogen. Antimicrobial resistance (AMR) and the prevalence of AMR genes are increasing, and prediction of antimicrobial susceptibility based on sequence information could support targeted antimicrobial therapy in a clinical setting. Complete identification of insertion sequence (IS) elements carrying promoter sequences upstream of resistance genes is necessary for prediction of AMR. However, de novo assemblies from short reads alone are often fractured due to repeat regions and the presence of multiple copies of identical IS elements. Identification of plasmids in clinical isolates can aid in the surveillance of the dissemination of AMR, and comprehensive sequence databases support microbiome and metagenomic studies. We tested several short-read, hybrid and long-lead assembly pipelines by assembling the type strain B. fragilis CCUG4856T (=ATCC25285=NCTC9343) with Illumina short reads and long reads generated by Oxford Nanopore Technologies (ONT) MinION sequencing. Hybrid assembly with Unicycler, using quality filtered Illumina reads and Filtlong filtered and Canu-corrected ONT reads, produced the assembly of highest quality. This approach was then applied to six clinical multidrug-resistant B. fragilis isolates and, with minimal manual finishing of chromosomal assemblies of three isolates, complete, circular assemblies of all isolates were produced. Eleven circular, putative plasmids were identified in the six assemblies, of which only three corresponded to a known cultured Bacteroides plasmid. Complete IS elements could be identified upstream of AMR genes; however, there was not complete correlation between the absence of IS elements and antimicrobial susceptibility. As our knowledge on factors that increase expression of resistance genes in the absence of IS elements is limited, further research is needed prior to implementing AMR prediction for B. fragilis from whole-genome sequencing.

Collapse

De Maio N, Shaw LP, Hubbard A, George S, Sanderson ND, Swann J, Wick R, AbuOun M, Stubberfield E, Hoosdally SJ, Crook DW, Peto TEA, Sheppard AE, Bailey MJ, Read DS, Anjum MF, Walker AS, Stoesser N. Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes. Microb Genom 2019;5:e000294. [PMID: 31483244 PMCID: PMC6807382 DOI: 10.1099/mgen.0.000294] [Citation(s) in RCA: 121] [Impact Index Per Article: 24.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 08/19/2019] [Indexed: 01/23/2023] Open

Abstract

Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly.

Collapse

Affiliation(s)

Nicola De Maio Nuffield Department of Medicine, University of Oxford, Oxford, UK
Liam P. Shaw Nuffield Department of Medicine, University of Oxford, Oxford, UK
Alasdair Hubbard Department of Tropical Disease Biology, Liverpool School of Tropical Medicine, Liverpool, L3 5QA, UK
Sophie George Nuffield Department of Medicine, University of Oxford, Oxford, UK NIHR HPRU Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
Nicholas D. Sanderson Nuffield Department of Medicine, University of Oxford, Oxford, UK
Jeremy Swann Nuffield Department of Medicine, University of Oxford, Oxford, UK
Ryan Wick Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Melbourne, Australia
Manal AbuOun Department of Bacteriology, Animal and Plant Health Agency, Addlestone, Surrey, KT15 3NB, UK
Emma Stubberfield Department of Bacteriology, Animal and Plant Health Agency, Addlestone, Surrey, KT15 3NB, UK
Sarah J. Hoosdally Nuffield Department of Medicine, University of Oxford, Oxford, UK
Derrick W. Crook Nuffield Department of Medicine, University of Oxford, Oxford, UK NIHR HPRU Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
Timothy E. A. Peto Nuffield Department of Medicine, University of Oxford, Oxford, UK NIHR HPRU Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
Anna E. Sheppard Nuffield Department of Medicine, University of Oxford, Oxford, UK NIHR HPRU Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
Mark J. Bailey Centre for Ecology & Hydrology, Benson Lane, Crowmarsh Gifford, Wallingford, OX10 8BB, UK
Daniel S. Read Centre for Ecology & Hydrology, Benson Lane, Crowmarsh Gifford, Wallingford, OX10 8BB, UK
Muna F. Anjum Department of Bacteriology, Animal and Plant Health Agency, Addlestone, Surrey, KT15 3NB, UK
A. Sarah Walker Nuffield Department of Medicine, University of Oxford, Oxford, UK NIHR HPRU Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with Public Health England, Oxford, UK
Nicole Stoesser Nuffield Department of Medicine, University of Oxford, Oxford, UK

Collapse

Grosmaire M, Launay C, Siegwald M, Brugière T, Estrada-Virrueta L, Berger D, Burny C, Modolo L, Blaxter M, Meister P, Félix MA, Gouyon PH, Delattre M. Males as somatic investment in a parthenogenetic nematode. Science 2019;363:1210-1213. [PMID: 30872523 DOI: 10.1126/science.aau0099] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 02/13/2019] [Indexed: 12/20/2022]

Affiliation(s)

Manon Grosmaire Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France
Caroline Launay Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France
Marion Siegwald Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, SU, EPHE, UA, CP 39, 57 rue Cuvier, 75005 Paris, France
Thibault Brugière Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France
Lilia Estrada-Virrueta Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France
Duncan Berger The Ashworth Laboratories, Institute of Evolutionary Biology, The University of Edinburgh, Edinburgh EH9 3FL, UK
Claire Burny Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France.,Present address: Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Vienna A-1210, Austria
Laurent Modolo Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France
Mark Blaxter The Ashworth Laboratories, Institute of Evolutionary Biology, The University of Edinburgh, Edinburgh EH9 3FL, UK
Peter Meister Cell Fate and Nuclear Organization, Institute of Cell Biology, University of Bern, 3012 Bern, Switzerland
Marie-Anne Félix Département de Biologie, Ecole Normale Supérieure, IBENS, CNRS, Inserm, PSL Research University, 75005 Paris, France
Pierre-Henri Gouyon Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, SU, EPHE, UA, CP 39, 57 rue Cuvier, 75005 Paris, France
Marie Delattre Laboratoire de Biologie et Modélisation de la Cellule, Université de Lyon, ENS, UCBL, CNRS, INSERM, UMR 5239, U 1210, F-69364 Lyon, France.

Collapse

Plastome based phylogenetics and younger crown node age in Pelargonium. Mol Phylogenet Evol 2019;137:33-43. [DOI: 10.1016/j.ympev.2019.03.021] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2018] [Revised: 03/23/2019] [Accepted: 03/25/2019] [Indexed: 11/20/2022]

Nieuwenhuis M, van de Peppel LJJ, Bakker FT, Zwaan BJ, Aanen DK. Enrichment of G4DNA and a Large Inverted Repeat Coincide in the Mitochondrial Genomes of Termitomyces. Genome Biol Evol 2019;11:1857-1869. [PMID: 31209489 PMCID: PMC6609731 DOI: 10.1093/gbe/evz122] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/11/2019] [Indexed: 12/20/2022] Open

Marijon P, Chikhi R, Varré JS. Graph analysis of fragmented long-read bacterial genome assemblies. Bioinformatics 2019;35:4239-4246. [DOI: 10.1093/bioinformatics/btz219] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Revised: 02/19/2019] [Accepted: 03/26/2019] [Indexed: 11/14/2022] Open

Complete Genome Sequence of Nocardia farcinica W6977^T Obtained by Combining Illumina and PacBio Reads. Microbiol Resour Announc 2019;8:MRA01373-18. [PMID: 30687825 PMCID: PMC6346157 DOI: 10.1128/mra.01373-18] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Accepted: 12/03/2018] [Indexed: 12/11/2022] Open