1
|
Tsai IJ. Detecting Assembly Errors With Klumpy: Building Confidence in Your Daily Genomic Analysis. Mol Ecol Resour 2024:e14037. [PMID: 39440672 DOI: 10.1111/1755-0998.14037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Accepted: 10/02/2024] [Indexed: 10/25/2024]
Abstract
In the realm of genome assembly, even minor errors can send researchers down to rabbit holes of unintended misinterpretation. Enter Klumpy-a tool designed to help detecting these elusive mistakes before they cause significant problems. By providing detailed, region-specific assessments and an intuitive visualisation platform, Klumpy (Madrigal, et al. 2024) empowers researchers to pinpoint and resolve potential issues with precision, paving the way for more reliable downstream analyses and discoveries.
Collapse
|
2
|
Naithani S, Deng CH, Sahu SK, Jaiswal P. Exploring Pan-Genomes: An Overview of Resources and Tools for Unraveling Structure, Function, and Evolution of Crop Genes and Genomes. Biomolecules 2023; 13:1403. [PMID: 37759803 PMCID: PMC10527062 DOI: 10.3390/biom13091403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/29/2023] [Accepted: 09/12/2023] [Indexed: 09/29/2023] Open
Abstract
The availability of multiple sequenced genomes from a single species made it possible to explore intra- and inter-specific genomic comparisons at higher resolution and build clade-specific pan-genomes of several crops. The pan-genomes of crops constructed from various cultivars, accessions, landraces, and wild ancestral species represent a compendium of genes and structural variations and allow researchers to search for the novel genes and alleles that were inadvertently lost in domesticated crops during the historical process of crop domestication or in the process of extensive plant breeding. Fortunately, many valuable genes and alleles associated with desirable traits like disease resistance, abiotic stress tolerance, plant architecture, and nutrition qualities exist in landraces, ancestral species, and crop wild relatives. The novel genes from the wild ancestors and landraces can be introduced back to high-yielding varieties of modern crops by implementing classical plant breeding, genomic selection, and transgenic/gene editing approaches. Thus, pan-genomic represents a great leap in plant research and offers new avenues for targeted breeding to mitigate the impact of global climate change. Here, we summarize the tools used for pan-genome assembly and annotations, web-portals hosting plant pan-genomes, etc. Furthermore, we highlight a few discoveries made in crops using the pan-genomic approach and future potential of this emerging field of study.
Collapse
Affiliation(s)
- Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| | - Cecilia H. Deng
- Molecular & Digital Breeing Group, New Cultivar Innovation, The New Zealand Institute for Plant and Food Research Limited, Private Bag 92169, Auckland 1142, New Zealand;
| | - Sunil Kumar Sahu
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China;
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| |
Collapse
|
3
|
Ruiz JL, Reimering S, Escobar-Prieto JD, Brancucci NMB, Echeverry DF, Abdi AI, Marti M, Gómez-Díaz E, Otto TD. From contigs towards chromosomes: automatic improvement of long read assemblies (ILRA). Brief Bioinform 2023; 24:bbad248. [PMID: 37406192 PMCID: PMC10359078 DOI: 10.1093/bib/bbad248] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 05/24/2023] [Accepted: 06/16/2023] [Indexed: 07/07/2023] Open
Abstract
Recent advances in long read technologies not only enable large consortia to aim to sequence all eukaryotes on Earth, but they also allow individual laboratories to sequence their species of interest with relatively low investment. Long read technologies embody the promise of overcoming scaffolding problems associated with repeats and low complexity sequences, but the number of contigs often far exceeds the number of chromosomes and they may contain many insertion and deletion errors around homopolymer tracts. To overcome these issues, we have implemented the ILRA pipeline to correct long read-based assemblies. Contigs are first reordered, renamed, merged, circularized, or filtered if erroneous or contaminated. Illumina short reads are used subsequently to correct homopolymer errors. We successfully tested our approach by improving the genome sequences of Homo sapiens, Trypanosoma brucei, and Leptosphaeria spp., and by generating four novel Plasmodium falciparum assemblies from field samples. We found that correcting homopolymer tracts reduced the number of genes incorrectly annotated as pseudogenes, but an iterative approach seems to be required to correct more sequencing errors. In summary, we describe and benchmark the performance of our new tool, which improved the quality of novel long read assemblies up to 1 Gbp. The pipeline is available at GitHub: https://github.com/ThomasDOtto/ILRA.
Collapse
Affiliation(s)
- José Luis Ruiz
- Instituto de Parasitología y Biomedicina López-Neyra (IPBLN), Consejo Superior de Investigaciones Científicas, 18016, Granada, Spain
| | - Susanne Reimering
- Department for Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | | | - Nicolas M B Brancucci
- School of Infection & Immunity, MVLS, University of Glasgow, Glasgow, UK
- Department of Medical Parasitology and Infection Biology, Swiss Tropical and Public Health Institute, 4123 Allschwil, Switzerland
- University of Basel, 4001 Basel, Switzerland
| | - Diego F Echeverry
- Centro Internacional de Entrenamiento e Investigaciones Médicas (CIDEIM), Cali, Colombia
- Departamento de Microbiología, Facultad de Salud, Universidad del Valle, Cali, Colombia
| | | | - Matthias Marti
- School of Infection & Immunity, MVLS, University of Glasgow, Glasgow, UK
| | - Elena Gómez-Díaz
- Instituto de Parasitología y Biomedicina López-Neyra (IPBLN), Consejo Superior de Investigaciones Científicas, 18016, Granada, Spain
| | - Thomas D Otto
- School of Infection & Immunity, MVLS, University of Glasgow, Glasgow, UK
| |
Collapse
|
4
|
Huston WM, Lawrence A, Wee BA, Thomas M, Timms P, Vodstrcil LA, McNulty A, McIvor R, Worthington K, Donovan B, Phillips S, Chen MY, Fairley CK, Hocking JS. Repeat infections with chlamydia in women may be more transcriptionally active with lower responses from some immune genes. Front Public Health 2022; 10:1012835. [PMID: 36299763 PMCID: PMC9589431 DOI: 10.3389/fpubh.2022.1012835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 09/23/2022] [Indexed: 01/27/2023] Open
Abstract
Chlamydia trachomatis, the most common bacterial sexually transmitted infection worldwide, is responsible for considerable health burden due to its significant sequelae. There are growing concerns about chlamydial treatment and management due to widely documented increasing burden of repeat infections. In the current study, a cohort study design of 305 women with urogenital chlamydial infections demonstrated that 11.8% of women experienced repeat infections after treatment with azithromycin. The chlamydial DNA load measured by quantitative PCR was higher in women who experienced a repeat infection (p = 0.0097) and repeat infection was associated with sexual contact. There was no genomic or phenotypic evidence of azithromycin resistance within the chlamydial isolates. During repeat infection, or repeat positive tests during follow up, vaginal chlamydial gene expression (ompA, euo, omcB, htrA, trpAB) was markedly higher compared to baseline, and two of the selected immune genes analyzed had significantly lower expression at the time of repeat infection. Overall, there are two implications of these results. The results could be generalized to all recent infections, or repeat positive events, and indicate that chlamydial infections are have higher transcriptional activity of select genes early in the infection in women. Alternatively, after azithromycin treatment, repeat infections of Chlamydia may be more transcriptionally active at certain genes, and there may be post-treatment immunological alterations that interplay into repeat exposures establishing an active infection. The potential that recent infections may involve a higher level of activity from the organism may have implications for management by more regular testing of the most at risk women to reduce the risk of sequelae.
Collapse
Affiliation(s)
- Wilhelmina M. Huston
- Faculty of Science, University of Technology Sydney, Ultimo, NSW, Australia,*Correspondence: Wilhelmina M. Huston
| | - Amba Lawrence
- Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, QLD, Australia
| | - Bryan A. Wee
- Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, QLD, Australia
| | - Mark Thomas
- Faculty of Science, University of Technology Sydney, Ultimo, NSW, Australia
| | - Peter Timms
- Bioinnovation Centre, University of the Sunshine Coast, Sippy Downs, QLD, Australia
| | - Lenka A. Vodstrcil
- Melbourne Sexual Health Centre, Central Clinical School, Monash University, Carlton, VIC, Australia,Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, University of Melbourne, Carlton, VIC, Australia
| | - Anna McNulty
- Sydney Sexual Health Centre, Sydney, NSW, Australia
| | - Ruthy McIvor
- Sydney Sexual Health Centre, Sydney, NSW, Australia
| | - Karen Worthington
- Melbourne Sexual Health Centre, Alfred Health, Carlton, VIC, Australia
| | - Basil Donovan
- Kirby Institute, University of New South Wales, Kensington, NSW, Australia
| | - Samuel Phillips
- Bioinnovation Centre, University of the Sunshine Coast, Sippy Downs, QLD, Australia
| | - Marcus Y. Chen
- Melbourne School of Population and Global Health, University of Melbourne, Carlton, VIC, Australia,Australia and Melbourne Sexual Health Centre, Carlton, VIC, Australia
| | | | - Jane S. Hocking
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, University of Melbourne, Carlton, VIC, Australia,Jane S. Hocking
| |
Collapse
|
5
|
Zhou X, Arslan M, Liu Z, Li D, Xi H, Feng Y, Li S, Wei J, Rong X, Liang Z, Wang X, Wu Z, Gamal El-Din M. Low carbon-to-nitrogen ratio digestate from high-rate anaerobic baffled reactor facilitates heterotrophic/autotrophic nitrifiers involved in nitrogen removal. BIORESOURCE TECHNOLOGY 2022; 359:127346. [PMID: 35605776 DOI: 10.1016/j.biortech.2022.127346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 05/16/2022] [Accepted: 05/18/2022] [Indexed: 06/15/2023]
Abstract
In this study, baffled anaerobic-aerobic reactors (AOBRs) with modified basalt fiber (MBF) carriers and felt were used to treat domestic wastewater (DWW). The influent was first treated in anaerobic compartments, with the NH4+-N containing digestate refluxed into aerobic compartment for nitrification. The nitrified liquid was channeled to the anaerobic compartments for further denitrification. Under optimal conditions, AOBR with MBF carriers could remove 91% chemical oxygen demand (COD) and 81% total nitrogen (TN), with biomass production increased by 7.6%, 4.5% and 8.7% in three successive anaerobic compartments compared to the control. Biological viability analysis showed that live cells outnumbered dead cells in bio-nests. Metagenomics analysis showed that multiple metabolic pathways accounted for nitrogen conversion in anaerobic and aerobic compartments. More importantly, low COD/TN ratio digestate facilitated heterotrophic nitrification-aerobic denitrification (HN-AD) species growth in aerobic compartment. This study provides a promising strategy to source treatment of DWW from urban communities.
Collapse
Affiliation(s)
- Xiangtong Zhou
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Muhammad Arslan
- Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada
| | - Zhigang Liu
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Da Li
- State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, No. 73 Huanghe Road, Nangang District, Harbin 150090, China
| | - Haipeng Xi
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Yujie Feng
- State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, No. 73 Huanghe Road, Nangang District, Harbin 150090, China.
| | - Shanwei Li
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Jing Wei
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Xinshan Rong
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Zhishui Liang
- School of Civil Engineering, Southeast University, No. 2 Sipailou, Nanjing, Jiangsu 210096, China
| | - Xiaochun Wang
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Zhiren Wu
- Institute of Environmental Health and Ecological Safety, Jiangsu University, Zhenjiang 212013, China
| | - Mohamed Gamal El-Din
- Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada
| |
Collapse
|
6
|
Baptista RP, Li Y, Sateriale A, Sanders MJ, Brooks KL, Tracey A, Ansell BRE, Jex AR, Cooper GW, Smith ED, Xiao R, Dumaine JE, Georgeson P, Pope BJ, Berriman M, Striepen B, Cotton JA, Kissinger JC. Long-read assembly and comparative evidence-based reanalysis of Cryptosporidium genome sequences reveal expanded transporter repertoire and duplication of entire chromosome ends including subtelomeric regions. Genome Res 2022; 32:203-213. [PMID: 34764149 PMCID: PMC8744675 DOI: 10.1101/gr.275325.121] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 11/10/2021] [Indexed: 11/25/2022]
Abstract
Cryptosporidiosis is a leading cause of waterborne diarrheal disease globally and an important contributor to mortality in infants and the immunosuppressed. Despite its importance, the Cryptosporidium community has only had access to a good, but incomplete, Cryptosporidium parvum IOWA reference genome sequence. Incomplete reference sequences hamper annotation, experimental design, and interpretation. We have generated a new C. parvum IOWA genome assembly supported by Pacific Biosciences (PacBio) and Oxford Nanopore long-read technologies and a new comparative and consistent genome annotation for three closely related species: C. parvum, Cryptosporidium hominis, and Cryptosporidium tyzzeri We made 1926 C. parvum annotation updates based on experimental evidence. They include new transporters, ncRNAs, introns, and altered gene structures. The new assembly and annotation revealed a complete Dnmt2 methylase ortholog. Comparative annotation between C. parvum, C. hominis, and C. tyzzeri revealed that most "missing" orthologs are found, suggesting that the biological differences between the species must result from gene copy number variation, differences in gene regulation, and single-nucleotide variants (SNVs). Using the new assembly and annotation as reference, 190 genes are identified as evolving under positive selection, including many not detected previously. The new C. parvum IOWA reference genome assembly is larger, gap free, and lacks ambiguous bases. This chromosomal assembly recovers all 16 chromosome ends, 13 of which are contiguously assembled. The three remaining chromosome ends are provisionally placed. These ends represent duplication of entire chromosome ends including subtelomeric regions revealing a new level of genome plasticity that will both inform and impact future research.
Collapse
Affiliation(s)
- Rodrigo P Baptista
- Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, Georgia 30602, USA
- Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, USA
| | - Yiran Li
- Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, USA
| | - Adam Sateriale
- Department of Pathology, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Mandy J Sanders
- The Wellcome Sanger Institute, Hinxton, CB10 1SA, United Kingdom
| | - Karen L Brooks
- The Wellcome Sanger Institute, Hinxton, CB10 1SA, United Kingdom
| | - Alan Tracey
- The Wellcome Sanger Institute, Hinxton, CB10 1SA, United Kingdom
| | - Brendan R E Ansell
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne and Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville 3052, Australia
| | - Aaron R Jex
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne and Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville 3052, Australia
| | - Garrett W Cooper
- Department of Clinical Pathology, The University of Melbourne, Victorian Comprehensive Cancer Centre, Melbourne VIC 3000, Australia
| | - Ethan D Smith
- Department of Clinical Pathology, The University of Melbourne, Victorian Comprehensive Cancer Centre, Melbourne VIC 3000, Australia
| | - Rui Xiao
- Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, USA
| | - Jennifer E Dumaine
- Department of Pathology, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Peter Georgeson
- Department of Clinical Pathology, The University of Melbourne, Victorian Comprehensive Cancer Centre, Melbourne VIC 3000, Australia
- Melbourne Bioinformatics, The University of Melbourne, Parkville VIC 3010, Australia
- University of Melbourne Centre for Cancer Research, Victorian Comprehensive Cancer Centre, Melbourne VIC 3000, Australia
| | - Bernard J Pope
- Department of Clinical Pathology, The University of Melbourne, Victorian Comprehensive Cancer Centre, Melbourne VIC 3000, Australia
- Melbourne Bioinformatics, The University of Melbourne, Parkville VIC 3010, Australia
- Department of Surgery (Royal Melbourne Hospital), Melbourne Medical School, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne 3010, Australia
- Department of Medicine, Central Clinical School, Faculty of Medicine Nursing and Health Sciences, Monash University, Melbourne 3004, Australia
| | - Matthew Berriman
- The Wellcome Sanger Institute, Hinxton, CB10 1SA, United Kingdom
| | - Boris Striepen
- Department of Pathology, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - James A Cotton
- The Wellcome Sanger Institute, Hinxton, CB10 1SA, United Kingdom
| | - Jessica C Kissinger
- Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, Georgia 30602, USA
- Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, USA
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| |
Collapse
|
7
|
Genome assembly and annotation. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00013-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
8
|
Kaur C, Gupta M, Garai S, Mishra SK, Chauhan PS, Sopory S, Singla-Pareek SL, Adlakha N, Pareek A. Microbial methylglyoxal metabolism contributes towards growth promotion and stress tolerance in plants. Environ Microbiol 2021; 24:2817-2836. [PMID: 34435423 DOI: 10.1111/1462-2920.15743] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 08/16/2021] [Accepted: 08/23/2021] [Indexed: 12/13/2022]
Abstract
Plant growth promotion by microbes is a cumulative phenomenon involving multiple traits, many of which are not explored yet. Hence, to unravel microbial mechanisms underlying growth promotion, we have analysed the genomes of two potential growth-promoting microbes, viz., Pseudomonas sp. CK-NBRI-02 (P2) and Bacillus marisflavi CK-NBRI-03 (P3) for the presence of plant-beneficial traits. Besides known traits, we found that microbes differ in their ability to metabolize methylglyoxal (MG), a ubiquitous cytotoxin regarded as general consequence of stress in plants. P2 exhibited greater tolerance to MG and possessed better ability to sustain plant growth under dicarbonyl stress. However, under salinity, only P3 showed a dose-dependent induction in MG detoxification activity in accordance with concomitant increase in MG levels, contributing to enhanced salt tolerance. Furthermore, salt-stressed transcriptomes of both the strains showed differences with respect to MG, ion and osmolyte homeostasis, with P3 being more responsive to stress. Importantly, application of either strain altered MG levels and subsequently MG detoxification machinery in Arabidopsis, probably to strengthen plant defence response and growth. We therefore, suggest a crucial role of microbial MG resistance in plant growth promotion and that it should be considered as a beneficial trait while screening microbes for stress mitigation in plants.
Collapse
Affiliation(s)
- Charanpreet Kaur
- School of Life Sciences, Jawaharlal Nehru University, New Delhi, 110067, India.,International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Mayank Gupta
- International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Sampurna Garai
- International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Shashank K Mishra
- Microbial Technologies Division, CSIR-National Botanical Research Institute, Lucknow, Uttar Pradesh, India
| | - Puneet Singh Chauhan
- Microbial Technologies Division, CSIR-National Botanical Research Institute, Lucknow, Uttar Pradesh, India
| | - Sudhir Sopory
- International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Sneh L Singla-Pareek
- International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Nidhi Adlakha
- Regional Centre for Biotechnology, NCR Biotech Science Cluster, Faridabad, Haryana, 121001, India
| | - Ashwani Pareek
- School of Life Sciences, Jawaharlal Nehru University, New Delhi, 110067, India
| |
Collapse
|
9
|
Islam R, Raju RS, Tasnim N, Shihab IH, Bhuiyan MA, Araf Y, Islam T. Choice of assemblers has a critical impact on de novo assembly of SARS-CoV-2 genome and characterizing variants. Brief Bioinform 2021; 22:6210065. [PMID: 33822878 PMCID: PMC8083570 DOI: 10.1093/bib/bbab102] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 03/06/2021] [Accepted: 03/08/2021] [Indexed: 12/18/2022] Open
Abstract
Background Coronavirus Disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has become a global pandemic following its initial emergence in China. SARS-CoV-2 has a positive-sense single-stranded RNA virus genome of around 30Kb. Using next-generation sequencing technologies, a large number of SARS-CoV-2 genomes are being sequenced at an unprecedented rate and being deposited in public repositories. For the de novo assembly of the SARS-CoV-2 genomes, a myriad of assemblers is being used, although their impact on the assembly quality has not been characterized for this virus. In this study, we aim to understand the variabilities on assembly qualities due to the choice of the assemblers. Results We performed 6648 de novo assemblies of 416 SARS-CoV-2 samples using eight different assemblers with different k-mer lengths. We used Illumina paired-end sequencing reads and compared the assembly quality of those assemblers. We showed that the choice of assembler plays a significant role in reconstructing the SARS-CoV-2 genome. Two metagenomic assemblers, e.g. MEGAHIT and metaSPAdes, performed better compared with others in most of the assembly quality metrics including, recovery of a larger fraction of the genome, constructing larger contigs and higher N50, NA50 values, etc. We showed that at least 09% (259/2873) of the variants present in the assemblies between MEGAHIT and metaSPAdes are unique to one of the assembly methods. Conclusion Our analyses indicate the critical role of assembly methods for assembling SARS-CoV-2 genome using short reads and their impact on variant characterization. This study could help guide future studies to determine the best-suited assembler for the de novo assembly of virus genomes.
Collapse
Affiliation(s)
| | - Rajan Saha Raju
- Computer Science and Engineering from the Shahjalal University of Science and Technology
| | - Nazia Tasnim
- Computer Science and Engineering at the Shahjalal University of Science and Technology
| | - Istiak Hossain Shihab
- Department of Computer Science and Engineering, Shahjalal University of Science and Technology
| | - Maruf Ahmed Bhuiyan
- Doctor of Medicine (MD) in Virology at Bangabandhu Sheikh Mujib Medical University
| | - Yusha Araf
- Genetic Engineering and Biotechnology at the Shahjalal University of Science and Technology
| | - Tofazzal Islam
- Institute of Biotechnology and Genetic Engineering, Bangabandhu Sheikh Mujibur Rahman Agricultural University, Bangladesh
| |
Collapse
|
10
|
Tahir M, Sardaraz M, Mehmood Z, Khan MS. ESREEM: Efficient Short Reads Error Estimation Computational Model for Next-generation Genome Sequencing. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200614171832] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Aims:
To assess the error profile in NGS data, generated from high throughput
sequencing machines.
Background:
Short-read sequencing data from Next Generation Sequencing (NGS) are currently
being generated by a number of research projects. Depicting the errors produced by NGS
platforms and expressing accurate genetic variation from reads are two inter-dependent phases. It
has high significance in various analyses, such as genome sequence assembly, SNPs calling,
evolutionary studies, and haplotype inference. The systematic and random errors show incidence
profile for each of the sequencing platforms i.e. Illumina sequencing, Pacific Biosciences, 454
pyrosequencing, Complete Genomics DNA nanoball sequencing, Ion Torrent sequencing, and
Oxford Nanopore sequencing. Advances in NGS deliver galactic data with the addition of errors.
Some ratio of these errors may emulate genuine true biological signals i.e., mutation, and may
subsequently negate the results. Various independent applications have been proposed to correct
the sequencing errors. Systematic analysis of these algorithms shows that state-of-the-art models
are missing.
Objective:
In this paper, an effcient error estimation computational model called ESREEM is
proposed to assess the error rates in NGS data.
Methods:
The proposed model prospects the analysis that there exists a true linear regression
association between the number of reads containing errors and the number of reads sequenced. The
model is based on a probabilistic error model integrated with the Hidden Markov Model (HMM).
Result:
The proposed model is evaluated on several benchmark datasets and the results obtained are
compared with state-of-the-art algorithms.
Conclusions:
Experimental results analyses show that the proposed model efficiently estimates errors
and runs in less time as compared to others.
Collapse
Affiliation(s)
- Muhammad Tahir
- Department of Computer Science, COMSATS University Islamabad, Attock Campus, Attock,Pakistan
| | - Muhammad Sardaraz
- Department of Computer Science, COMSATS University Islamabad, Attock Campus, Attock,Pakistan
| | - Zahid Mehmood
- Department of Software Engineering, University of Engineering and Technology, Taxila,Pakistan
| | - Muhammad Saud Khan
- Department of Computer Science, COMSATS University Islamabad, Attock Campus, Attock,Pakistan
| |
Collapse
|
11
|
Darragh K, Orteu A, Black D, Byers KJRP, Szczerbowski D, Warren IA, Rastas P, Pinharanda A, Davey JW, Fernanda Garza S, Abondano Almeida D, Merrill RM, McMillan WO, Schulz S, Jiggins CD. A novel terpene synthase controls differences in anti-aphrodisiac pheromone production between closely related Heliconius butterflies. PLoS Biol 2021; 19:e3001022. [PMID: 33465061 PMCID: PMC7815096 DOI: 10.1371/journal.pbio.3001022] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 11/30/2020] [Indexed: 02/07/2023] Open
Abstract
Plants and insects often use the same compounds for chemical communication, but not much is known about the genetics of convergent evolution of chemical signals. The terpene (E)-β-ocimene is a common component of floral scent and is also used by the butterfly Heliconius melpomene as an anti-aphrodisiac pheromone. While the biosynthesis of terpenes has been described in plants and microorganisms, few terpene synthases (TPSs) have been identified in insects. Here, we study the recent divergence of 2 species, H. melpomene and Heliconius cydno, which differ in the presence of (E)-β-ocimene; combining linkage mapping, gene expression, and functional analyses, we identify 2 novel TPSs. Furthermore, we demonstrate that one, HmelOS, is able to synthesise (E)-β-ocimene in vitro. We find no evidence for TPS activity in HcydOS (HmelOS ortholog of H. cydno), suggesting that the loss of (E)-β-ocimene in this species is the result of coding, not regulatory, differences. The TPS enzymes we discovered are unrelated to previously described plant and insect TPSs, demonstrating that chemical convergence has independent evolutionary origins. Plants and insects often use the same compounds for chemical communication, but little is known about the convergent evolution of such chemical signals. This study identifies a novel terpene synthase involved in production of an anti-aphrodisiac pheromone by the butterfly Heliconius melpomene. This enzyme is unrelated to other insect terpene synthases, providing evidence that the ability to synthesise terpenes has arisen multiple times independently within the insects.
Collapse
Affiliation(s)
- Kathy Darragh
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- Smithsonian Tropical Research Institute, Panamá, Panamá
- * E-mail:
| | - Anna Orteu
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - Daniella Black
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- School of Biology, University of St Andrews, St Andrews, United Kingdom
| | - Kelsey J. R. P. Byers
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- Smithsonian Tropical Research Institute, Panamá, Panamá
| | - Daiane Szczerbowski
- Institute of Organic Chemistry, Department of Life Sciences, Technische Universität Braunschweig, Braunschweig, Germany
| | - Ian A. Warren
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - Pasi Rastas
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Ana Pinharanda
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - John W. Davey
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | | | | | - Richard M. Merrill
- Smithsonian Tropical Research Institute, Panamá, Panamá
- Division of Evolutionary Biology, Ludwig-Maximilians-Universität München, Munich, Germany
| | | | - Stefan Schulz
- Institute of Organic Chemistry, Department of Life Sciences, Technische Universität Braunschweig, Braunschweig, Germany
| | - Chris D. Jiggins
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- Smithsonian Tropical Research Institute, Panamá, Panamá
| |
Collapse
|
12
|
Mangat PK, Gannaban RB, Singleton JJ, Angeles-Shim RB. Development of a PCR-based, genetic marker resource for the tomato-like nightshade relative, Solanum lycopersicoides using whole genome sequence analysis. PLoS One 2020; 15:e0242882. [PMID: 33227039 PMCID: PMC7682897 DOI: 10.1371/journal.pone.0242882] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 11/10/2020] [Indexed: 11/23/2022] Open
Abstract
Solanum lycopersicoides is a wild nightshade relative of tomato with known resistance to a wide range of pests and pathogens, as well as tolerance to cold, drought and salt stress. To effectively utilize S. lycopersicoides as a genetic resource in breeding for tomato improvement, the underlying basis of observable traits in the species needs to be understood. Molecular markers are important tools that can unlock the genetic underpinnings of phenotypic variation in wild crop relatives. Unfortunately, DNA markers that are specific to S. lycopersicoides are limited in number, distribution and polymorphism rate. In this study, we developed a suite of S. lycopersicoides-specific SSR and indel markers by sequencing, building and analyzing a draft assembly of the wild nightshade genome. Mapping of a total of 1.45 Gb of S. lycopersicoides contigs against the tomato reference genome assembled a moderate number of contiguous reads into longer scaffolds. Interrogation of the obtained draft yielded SSR information for more than 55,000 loci in S. lycopersicoides for which more than 35,000 primers pairs were designed. Additionally, indel markers were developed based on sequence alignments between S. lycopersicoides and tomato. Synthesis and experimental validation of 345 primer sets resulted in the amplification of single and multilocus targets in S. lycopersicoides and polymorphic loci between S. lycopersicoides and tomato. Cross-species amplification of the 345 markers in tomato, eggplant, silverleaf nightshade and pepper resulted in varying degrees of transferability that ranged from 55 to 83%. The markers reported in this study significantly expands the genetic marker resource for S. lycopersicoides, as well as for related Solanum spp. for applications in genetics and breeding studies.
Collapse
Affiliation(s)
- Puneet Kaur Mangat
- Department of Plant and Soil Science, College of Agricultural Sciences and Natural Resources, Texas Tech University, Lubbock, Texas, United States of America
| | - Ritchel B. Gannaban
- Department of Plant and Soil Science, College of Agricultural Sciences and Natural Resources, Texas Tech University, Lubbock, Texas, United States of America
| | - Joshua J. Singleton
- Department of Plant and Soil Science, College of Agricultural Sciences and Natural Resources, Texas Tech University, Lubbock, Texas, United States of America
| | - Rosalyn B. Angeles-Shim
- Department of Plant and Soil Science, College of Agricultural Sciences and Natural Resources, Texas Tech University, Lubbock, Texas, United States of America
- * E-mail:
| |
Collapse
|
13
|
Epstein-Barr Virus Genomes Reveal Population Structure and Type 1 Association with Endemic Burkitt Lymphoma. J Virol 2020; 94:JVI.02007-19. [PMID: 32581102 DOI: 10.1128/jvi.02007-19] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Accepted: 06/16/2020] [Indexed: 12/14/2022] Open
Abstract
Endemic Burkitt lymphoma (eBL), the most prevalent pediatric cancer in sub-Saharan Africa, is distinguished by its inclusion of Epstein-Barr virus (EBV). In order to better understand the impact of EBV variation in eBL tumorigenesis, we improved viral DNA enrichment methods and generated a total of 98 new EBV genomes from both eBL cases (n = 58) and healthy controls (n = 40) residing in the same geographic region in Kenya. Using our unbiased methods, we found that EBV type 1 was significantly more prevalent in eBL patients (74.5%) than in healthy children (47.5%) (odds ratio = 3.24, 95% confidence interval = 1.36 to 7.71, P = 0.007), as opposed to similar proportions in both groups. Controlling for EBV type, we also performed a genome-wide association study identifying six nonsynonymous variants in the genes EBNA1, EBNA2, BcLF1, and BARF1 that were enriched in eBL patients. In addition, viruses isolated from plasma of eBL patients were identical to their tumor counterparts consistent with circulating viral DNA originating from the tumor. We also detected three intertypic recombinants carrying type 1 EBNA2 and type 2 EBNA3 regions, as well as one novel genome with a 20-kb deletion, resulting in the loss of multiple lytic and virion genes. Comparing EBV types, viral genes displayed differential variation rates as type 1 appeared to be more divergent, while type 2 demonstrated novel substructures. Overall, our findings highlight the complexities of the EBV population structure and provide new insight into viral variation, potentially deepening our understanding of eBL oncogenesis.IMPORTANCE Improved viral enrichment methods conclusively demonstrate EBV type 1 to be more prevalent in eBL patients than in geographically matched healthy controls, which previously underrepresented the prevalence of EBV type 2. Genome-wide association analysis between cases and controls identifies six eBL-associated nonsynonymous variants in EBNA1, EBNA2, BcLF1, and BARF1 genes. Analysis of population structure reveals that EBV type 2 exists as two genomic subgroups and was more commonly found in female than in male eBL patients.
Collapse
|
14
|
Draft Genome Sequence of Bacillus marisflavi CK-NBRI-03, Isolated from Agricultural Soil. Microbiol Resour Announc 2020; 9:9/7/e00044-20. [PMID: 32054702 PMCID: PMC7019057 DOI: 10.1128/mra.00044-20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Here, we report the 4.34-Mb draft genome assembly of Bacillus marisflavi CK-NBRI-03 (or P3), a Gram-positive bacterium, with an average G+C content of 48.66%. P3 was isolated from agricultural soil from the Badaun (midwestern plain zone) region of Uttar Pradesh, India. Here, we report the 4.34-Mb draft genome assembly of Bacillus marisflavi CK-NBRI-03 (or P3), a Gram-positive bacterium, with an average G+C content of 48.66%. P3 was isolated from agricultural soil from the Badaun (midwestern plain zone) region of Uttar Pradesh, India.
Collapse
|
15
|
Draft Genome Sequences of Two Textile Azo Dye-Degrading Shewanella sp. Strains Isolated from a Textile Effluent in Peru. Microbiol Resour Announc 2019; 8:8/49/e00836-19. [PMID: 31806739 PMCID: PMC6895299 DOI: 10.1128/mra.00836-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Here, we report the annotated genome sequences of two Shewanella sp. strains isolated from textile industry wastewater effluent in Peru. Potential genes for encoding enzymes that enable the strain to decolorize and degrade textile azo dyes were detected in both genomes. Here, we report the annotated genome sequences of two Shewanella sp. strains isolated from textile industry wastewater effluent in Peru. Potential genes for encoding enzymes that enable the strain to decolorize and degrade textile azo dyes were detected in both genomes.
Collapse
|
16
|
Song B, Sang Q, Wang H, Pei H, Gan X, Wang F. Complement Genome Annotation Lift Over Using a Weighted Sequence Alignment Strategy. Front Genet 2019; 10:1046. [PMID: 31850053 PMCID: PMC6902276 DOI: 10.3389/fgene.2019.01046] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Accepted: 09/30/2019] [Indexed: 12/14/2022] Open
Abstract
With the broad application of high-throughput sequencing, more whole-genome resequencing data and de novo assemblies of natural populations are becoming available. For a particular species, in general, only the reference genome is well established and annotated. Computational tools based on sequence alignment have been developed to investigate the gene models of individuals belonging to the same or closely related species. During this process, inconsistent alignment often obscures genome annotation lift over and leads to improper functional impact prediction for a genomic variant, especially in plant species. Here, we proposed the zebraic striped dynamic programming algorithm, which provides different weights to genetic features to refine genome annotation lift over. Testing of our zebraic striped dynamic programming algorithm on both plant and animal genomic data showed complementation to standard sequence approach for highly diverse individuals. Using the lift over genome annotation as anchors, a base-pair resolution genome-wide sequence alignment and variant calling pipeline for de novo assembly has been implemented in the GEAN software. GEAN could be used to compare haplotype diversity, refine the genetic variant functional annotation, annotate de novo assembly genome sequence, detect homologous syntenic blocks, improve the quantification of gene expression levels using RNA-seq data, and unify genomic variants for population genetic analysis. We expect that GEAN will be a standard tool for the coming of age of de novo assembly population genetics.
Collapse
Affiliation(s)
- Baoxing Song
- The Department of Life Science, Qiannan Normal College for Nationalities, Duyun, China
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Köln, Germany
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, United States
| | - Qing Sang
- Department of Plant Developmental Biology, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Hai Wang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huimin Pei
- The Department of Life Science, Qiannan Normal College for Nationalities, Duyun, China
| | - XiangChao Gan
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Fen Wang
- The Department of Life Science, Qiannan Normal College for Nationalities, Duyun, China
| |
Collapse
|
17
|
Draft Genome Sequence of a Potential Plant Growth-Promoting Rhizobacterium, Pseudomonas sp. Strain CK-NBRI-02. Microbiol Resour Announc 2019; 8:8/43/e01113-19. [PMID: 31649082 PMCID: PMC6813394 DOI: 10.1128/mra.01113-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Pseudomonas sp. strain CK-NBRI-02 is a potential plant growth-promoting Gram-negative rhizobacterium isolated from the rhizosphere of maize plants growing in fields in Srinagar, Jammu, and Kashmir, India. Here, we report a 5.25-Mb draft assembly of the genome sequence of Pseudomonas sp. strain CK-NBRI-02 with an average G+C content of 62.47%. Pseudomonas sp. strain CK-NBRI-02 is a potential plant growth-promoting Gram-negative rhizobacterium isolated from the rhizosphere of maize plants growing in fields in Srinagar, Jammu, and Kashmir, India. Here, we report a 5.25-Mb draft assembly of the genome sequence of Pseudomonas sp. strain CK-NBRI-02 with an average G+C content of 62.47%.
Collapse
|
18
|
Genomic analysis provides insights into the transmission and pathogenicity of Talaromyces marneffei. Fungal Genet Biol 2019; 130:54-61. [DOI: 10.1016/j.fgb.2019.05.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 05/05/2019] [Accepted: 05/06/2019] [Indexed: 11/20/2022]
|
19
|
Wang A, Wang Z, Li Z, Li LM. BAUM: improving genome assembly by adaptive unique mapping and local overlap-layout-consensus approach. Bioinformatics 2019; 34:2019-2028. [PMID: 29346504 DOI: 10.1093/bioinformatics/bty020] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 01/12/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation It is highly desirable to assemble genomes of high continuity and consistency at low cost. The current bottleneck of draft genome continuity using the second generation sequencing (SGS) reads is primarily caused by uncertainty among repetitive sequences. Even though the single-molecule real-time sequencing technology is very promising to overcome the uncertainty issue, its relatively high cost and error rate add burden on budget or computation. Many long-read assemblers take the overlap-layout-consensus (OLC) paradigm, which is less sensitive to sequencing errors, heterozygosity and variability of coverage. However, current assemblers of SGS data do not sufficiently take advantage of the OLC approach. Results Aiming at minimizing uncertainty, the proposed method BAUM, breaks the whole genome into regions by adaptive unique mapping; then the local OLC is used to assemble each region in parallel. BAUM can (i) perform reference-assisted assembly based on the genome of a close species (ii) or improve the results of existing assemblies that are obtained based on short or long sequencing reads. The tests on two eukaryote genomes, a wild rice Oryza longistaminata and a parrot Melopsittacus undulatus, show that BAUM achieved substantial improvement on genome size and continuity. Besides, BAUM reconstructed a considerable amount of repetitive regions that failed to be assembled by existing short read assemblers. We also propose statistical approaches to control the uncertainty in different steps of BAUM. Availability and implementation http://www.zhanyuwang.xin/wordpress/index.php/2017/07/21/baum. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Anqi Wang
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Zhanyu Wang
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Zheng Li
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Lei M Li
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
20
|
Whole-Genome Sequence of Brucella melitensis Strain B7, Isolated from a Blood Sample of a Brucellosis Patient from Hulunbuir, Inner Mongolia, China. Microbiol Resour Announc 2019; 8:8/24/e00119-19. [PMID: 31196914 PMCID: PMC6587674 DOI: 10.1128/mra.00119-19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
We report here the draft genome sequence of Brucella melitensis strain B7, isolated from a blood sample of a brucellosis patient from Hulunbuir, Inner Mongolia, China. This report describes the genome sequence of a human B. melitensis isolate, which is endemic in this area of China. We report here the draft genome sequence of Brucella melitensis strain B7, isolated from a blood sample of a brucellosis patient from Hulunbuir, Inner Mongolia, China. This report describes the genome sequence of a human B. melitensis isolate, which is endemic in this area of China.
Collapse
|
21
|
Böhme U, Otto TD, Sanders M, Newbold CI, Berriman M. Progression of the canonical reference malaria parasite genome from 2002-2019. Wellcome Open Res 2019; 4:58. [PMID: 31080894 PMCID: PMC6484455 DOI: 10.12688/wellcomeopenres.15194.2] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/24/2019] [Indexed: 01/15/2023] Open
Abstract
Here we describe the ways in which the sequence and annotation of the
Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources for the
P. falciparum research community as well as the basis for interpreting the genomes of subsequently sequenced species. At the time of publication in 2002 over 60% of predicted genes had unknown functions. As of March 2019, this number has been significantly decreased to 33%. The reduction is due to the inclusion of genes that were subsequently characterised experimentally and genes with significant similarity to others with known functions. In addition, the structural annotation of genes has been significantly refined; 27% of gene structures have been changed since 2002, comprising changes in exon-intron boundaries, addition or deletion of exons and the addition or deletion of genes. The sequence has also undergone significant improvements. In addition to the correction of a large number of single-base and insertion or deletion errors, a major miss-assembly between the subtelomeres of chromosome 7 and 8 has been corrected. As the number of sequenced isolates continues to grow rapidly, a single reference genome will not be an adequate basis for interpreting intra-species sequence diversity. We therefore describe in this publication a population reference genome of
P. falciparum, called Pfref1. This reference will enable the community to map to regions that are not present in the current assembly.
P. falciparum 3D7 will continue to be maintained, with ongoing curation ensuring continual improvements in annotation quality.
Collapse
Affiliation(s)
- Ulrike Böhme
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Thomas D Otto
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK.,Institute of Infection, Immunity and Inflammation, MVLS, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Mandy Sanders
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Chris I Newbold
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK.,Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Matthew Berriman
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
22
|
Böhme U, Otto TD, Sanders M, Newbold CI, Berriman M. Progression of the canonical reference malaria parasite genome from 2002-2019. Wellcome Open Res 2019; 4:58. [PMID: 31080894 DOI: 10.12688/wellcomeopenres.15194.1] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/21/2019] [Indexed: 11/20/2022] Open
Abstract
Here we describe the ways in which the sequence and annotation of the Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources for the P. falciparum research community as well as the basis for interpreting the genomes of subsequently sequenced species. At the time of publication in 2002 over 60% of predicted genes had unknown functions. As of March 2019, this number has been significantly decreased to 33%. The reduction is due to the inclusion of genes that were subsequently characterised experimentally and genes with significant similarity to others with known functions. In addition, the structural annotation of genes has been significantly refined; 27% of gene structures have been changed since 2002, comprising changes in exon-intron boundaries, addition or deletion of exons and the addition or deletion of genes. The sequence has also undergone significant improvements. In addition to the correction of a large number of single-base and insertion or deletion errors, a major miss-assembly between the subtelomeres of chromosome 7 and 8 has been corrected. As the number of sequenced isolates continues to grow rapidly, a single reference genome will not be an adequate basis for interpreting intra-species sequence diversity. We therefore describe in this publication a population reference genome of P. falciparum, called Pfref1. This reference will enable the community to map to regions that are not present in the current assembly. P. falciparum 3D7 will continue to be maintained, with ongoing curation ensuring continual improvements in annotation quality.
Collapse
Affiliation(s)
- Ulrike Böhme
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Thomas D Otto
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK.,Institute of Infection, Immunity and Inflammation, MVLS, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Mandy Sanders
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Chris I Newbold
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK.,Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Matthew Berriman
- Parasite Genomics, Wellcome Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
23
|
Ershov NI, Mordvinov VA, Prokhortchouk EB, Pakharukova MY, Gunbin KV, Ustyantsev K, Genaev MA, Blinov AG, Mazur A, Boulygina E, Tsygankova S, Khrameeva E, Chekanov N, Fan G, Xiao A, Zhang H, Xu X, Yang H, Solovyev V, Lee SMY, Liu X, Afonnikov DA, Skryabin KG. New insights from Opisthorchis felineus genome: update on genomics of the epidemiologically important liver flukes. BMC Genomics 2019; 20:399. [PMID: 31117933 PMCID: PMC6530080 DOI: 10.1186/s12864-019-5752-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 04/29/2019] [Indexed: 01/25/2023] Open
Abstract
Background The three epidemiologically important Opisthorchiidae liver flukes Opisthorchis felineus, O. viverrini, and Clonorchis sinensis, are believed to harbour similar potencies to provoke hepatobiliary diseases in their definitive hosts, although their populations have substantially different ecogeographical aspects including habitat, preferred hosts, population structure. Lack of O. felineus genomic data is an obstacle to the development of comparative molecular biological approaches necessary to obtain new knowledge about the biology of Opisthorchiidae trematodes, to identify essential pathways linked to parasite-host interaction, to predict genes that contribute to liver fluke pathogenesis and for the effective prevention and control of the disease. Results Here we present the first draft genome assembly of O. felineus and its gene repertoire accompanied by a comparative analysis with that of O. viverrini and Clonorchis sinensis. We observed both noticeably high heterozygosity of the sequenced individual and substantial genetic diversity in a pooled sample. This indicates that potency of O. felineus population for rapid adaptive response to control and preventive measures of opisthorchiasis is higher than in O. viverrini and C. sinensis. We also have found that all three species are characterized by more intensive involvement of trans-splicing in RNA processing compared to other trematodes. Conclusion All revealed peculiarities of structural organization of genomes are of extreme importance for a proper description of genes and their products in these parasitic species. This should be taken into account both in academic and applied research of epidemiologically important liver flukes. Further comparative genomics studies of liver flukes and non-carcinogenic flatworms allow for generation of well-grounded hypotheses on the mechanisms underlying development of cholangiocarcinoma associated with opisthorchiasis and clonorchiasis as well as species-specific mechanisms of these diseases. Electronic supplementary material The online version of this article (10.1186/s12864-019-5752-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nikita I Ershov
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia.
| | | | - Egor B Prokhortchouk
- Russian Federal Research Center for Biotechnology, 33/2 Leninsky prospect, Moscow, 119071, Russia. .,ZAO Genoanalytica, 1 Leninskie Gory street, Moscow, 119234, Russia.
| | - Mariya Y Pakharukova
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia.,Novosibirsk State University, 2 Pirogova Str, Novosibirsk, 630090, Russia
| | - Konstantin V Gunbin
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia
| | - Kirill Ustyantsev
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia
| | - Mikhail A Genaev
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia
| | - Alexander G Blinov
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia
| | - Alexander Mazur
- Russian Federal Research Center for Biotechnology, 33/2 Leninsky prospect, Moscow, 119071, Russia
| | | | | | | | - Nikolay Chekanov
- Russian Federal Research Center for Biotechnology, 33/2 Leninsky prospect, Moscow, 119071, Russia
| | - Guangyi Fan
- BGI-Shenzhen, 11 Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao, China
| | - An Xiao
- BGI-Shenzhen, 11 Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - He Zhang
- BGI-Shenzhen, 11 Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Xun Xu
- BGI-Shenzhen, 11 Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Huanming Yang
- BGI-Shenzhen, 11 Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Victor Solovyev
- Softberry Inc., 116 Radio Circle, Suite 400, Mount Kisco, NY, 10549, USA
| | - Simon Ming-Yuen Lee
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao, China
| | - Xin Liu
- BGI-Shenzhen, 11 Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Dmitry A Afonnikov
- Institute of Cytology and Genetics SB RAS, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia.,Novosibirsk State University, 2 Pirogova Str, Novosibirsk, 630090, Russia
| | - Konstantin G Skryabin
- Russian Federal Research Center for Biotechnology, 33/2 Leninsky prospect, Moscow, 119071, Russia.,Federal Research Center Kurchatov Institute, Moscow, Russia
| |
Collapse
|
24
|
Kurilung A, Keeratipusana C, Suriyaphol P, Hampson DJ, Prapasarakul N. Genomic analysis of Leptospira interrogans serovar Paidjan and Dadas isolates from carrier dogs and comparative genomic analysis to detect genes under positive selection. BMC Genomics 2019; 20:168. [PMID: 30832578 PMCID: PMC6399948 DOI: 10.1186/s12864-019-5562-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2018] [Accepted: 02/25/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Leptospirosis is an emerging infectious disease worldwide that can cause high morbidity and mortality rates in humans and animals. The causative spirochetes have reservoirs in mammalian hosts, but there has been limited analysis of the genomes of isolates recovered from animals. The aims of this study were to characterize genomic features of two Leptospira interrogans strains recently isolated from asymptomatic dogs in Thailand (strains CUDO5 and CDUO8), and to perform comparative genome analyses with other strains. Molecular adaptive evolution in L. interrogans as signaled by positive selection also was analyzed. RESULTS Whole genome sequence analysis revealed that strains CUDO5 and CUDO8 had genome sizes of approximately 4.9 Mbp with 35.1% GC contents. Using monoclonal antibodies, strains CUDO5 and CUDO8 were identified as serovars Paidjan and Dadas, respectively. These strains harbored genes known to be associated with acute and chronic infections. Using Single Nucleotide Polymorphisms phylogeny (SNPs) with 97 L. interrogans strains, CUDO5 and CUDO8 had closest genetic relatedness with each other. Nevertheless, the serovar determinant region (rfb locus) showed variations in the genes encoding sugar biosynthesis. Amongst 13 representative L. interrogans strains examined for molecular adaptive evolution through positive selection under the site-model of Phylogenetic Analysis of Maximum Likelihood, genes responsible for iron acquisition (tlyA and hbpA), motility (fliN2, flgK, and flhB) and thermal adaptation (lpxD1) were under increased selective pressure. CONCLUSIONS L. interrogans serovar Paidjan strain CUDO5 and serovar Dadas strain CUDO8 had close genetic relatedness as analyzed by SNPs phylogeny. They contained genes with established roles in acute and chronic leptospirosis. The rfb locus in both serovars showed gene variation associated with sugar biosynthesis. Positive selection analysis indicated that genes encoding factors involved in motility, temperature adaptation, and iron acquisition were under strong positive selection in L. interrogans. These may be associated with adaptation in the early stages of infection.
Collapse
Affiliation(s)
- Alongkorn Kurilung
- Department of Microbiology, Faculty of Veterinary Science, Chulalongkorn University, Bangkok, Thailand
| | - Chantisa Keeratipusana
- Bioinformatics and Data Management for Research Unit, Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Prapat Suriyaphol
- Bioinformatics and Data Management for Research Unit, Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - David J. Hampson
- Department of Infectious Diseases and Public Health, College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon Tong, Hong Kong SAR
| | - Nuvee Prapasarakul
- Department of Microbiology, Faculty of Veterinary Science, Chulalongkorn University, Bangkok, Thailand
- Diagnosis and Monitoring of Animal Pathogens Research Unit, Department of Microbiology, Faculty of Veterinary Science, Chulalongkorn University, Bangkok, Thailand
| |
Collapse
|
25
|
Su HJ, Barkman TJ, Hao W, Jones SS, Naumann J, Skippington E, Wafula EK, Hu JM, Palmer JD, dePamphilis CW. Novel genetic code and record-setting AT-richness in the highly reduced plastid genome of the holoparasitic plant Balanophora. Proc Natl Acad Sci U S A 2019; 116:934-943. [PMID: 30598433 PMCID: PMC6338844 DOI: 10.1073/pnas.1816822116] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Plastid genomes (plastomes) vary enormously in size and gene content among the many lineages of nonphotosynthetic plants, but key lineages remain unexplored. We therefore investigated plastome sequence and expression in the holoparasitic and morphologically bizarre Balanophoraceae. The two Balanophora plastomes examined are remarkable, exhibiting features rarely if ever seen before in plastomes or in any other genomes. At 15.5 kb in size and with only 19 genes, they are among the most reduced plastomes known. They have no tRNA genes for protein synthesis, a trait found in only three other plastid lineages, and thus Balanophora plastids must import all tRNAs needed for translation. Balanophora plastomes are exceptionally compact, with numerous overlapping genes, highly reduced spacers, loss of all cis-spliced introns, and shrunken protein genes. With A+T contents of 87.8% and 88.4%, the Balanophora genomes are the most AT-rich genomes known save for a single mitochondrial genome that is merely bloated with AT-rich spacer DNA. Most plastid protein genes in Balanophora consist of ≥90% AT, with several between 95% and 98% AT, resulting in the most biased codon usage in any genome described to date. A potential consequence of its radical compositional evolution is the novel genetic code used by Balanophora plastids, in which TAG has been reassigned from stop to tryptophan. Despite its many exceptional properties, the Balanophora plastome must be functional because all examined genes are transcribed, its only intron is correctly trans-spliced, and its protein genes, although highly divergent, are evolving under various degrees of selective constraint.
Collapse
Affiliation(s)
- Huei-Jiun Su
- Department of Earth and Life Sciences, University of Taipei, 100 Taipei, Taiwan
- Department of Biology, Pennsylvania State University, University Park, PA 16802
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
| | - Todd J Barkman
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI 49008
| | - Weilong Hao
- Department of Biological Sciences, Wayne State University, Detroit, MI 48202
| | - Samuel S Jones
- Graduate Program in Plant Biology, Pennsylvania State University, University Park, PA 16802
| | - Julia Naumann
- Department of Biology, Pennsylvania State University, University Park, PA 16802
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
| | | | - Eric K Wafula
- Department of Biology, Pennsylvania State University, University Park, PA 16802
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
| | - Jer-Ming Hu
- Institute of Ecology and Evolutionary Biology, National Taiwan University, 106 Taipei, Taiwan
| | - Jeffrey D Palmer
- Department of Biology, Indiana University, Bloomington, IN 47405;
| | - Claude W dePamphilis
- Department of Biology, Pennsylvania State University, University Park, PA 16802;
- Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802
- Graduate Program in Plant Biology, Pennsylvania State University, University Park, PA 16802
| |
Collapse
|
26
|
Bardy JJ, Sarovich DS, Price EP, Steinig E, Tong S, Drilling A, Ou J, Vreugde S, Wormald PJ, Psaltis AJ. Staphylococcus aureus from patients with chronic rhinosinusitis show minimal genetic association between polyp and non-polyp phenotypes. BMC EAR, NOSE, AND THROAT DISORDERS 2018; 18:16. [PMID: 30349419 PMCID: PMC6192324 DOI: 10.1186/s12901-018-0064-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Accepted: 09/25/2018] [Indexed: 12/12/2022]
Abstract
Background Staphylococcus aureus has a high prevalence in chronic rhinosinusitis (CRS) patients and is suggested to play a more etiopathogenic role in CRS patients with nasal polyps (CRSwNP), a severe form of the CRS spectrum with poorer surgical outcomes. We performed a microbial genome-wide association study (mGWAS) to investigate whether S. aureus isolates from CRS patients have particular genetic markers associated with CRS with nasal polyps (CRSwNP) or CRS without nasal polyps (CRSsNP). Methods Whole genome sequencing was performed on S. aureus isolates collected from 28 CRSsNP and 30 CRSwNP patients. A mGWAS approach was employed using large-scale comparative genomics to identify genetic variation within our dataset. Results Considerable genetic variation was observed, with > 90,000 single nucleotide polymorphisms (SNPs) sites identified. There was little correlation with CRS subtype based on SNPs and Insertion/Delection (Indels). One indel was found to significantly correlate with CRSwNP and occurred in the promoter region of a bacitracin transport system ATP-binding protein. Additionally, two variants of the highly variable superantigen-like (SSL) proteins were found to significantly correlate with each CRS phenotype. No significant association with other virulence or antibiotic resistance genes were observed, consistent with previous studies. Conclusion To our knowledge this study is the first to use mGWAS to investigate the contribution of microbial genetic variation to CRS presentations. Utilising the most comprehensive genome-wide analysis methods available, our results suggest that CRS phenotype may be influenced by genetic factors other than specific virulence mechanisms within the S. aureus genome. Electronic supplementary material The online version of this article (10.1186/s12901-018-0064-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jake Jervis Bardy
- 1Department of Otolaryngology Head and Neck Surgery, The Queen Elizabeth Hospital and the University of Adelaide, Woodville South, South Australia SA 5011 Australia
| | - Derek S Sarovich
- 3Present address: Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, QLD, Sippy Downs, Australia
| | - Erin P Price
- 3Present address: Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, QLD, Sippy Downs, Australia
| | - Eike Steinig
- 4Present address: Australian Institute of Tropical Health and Medicine, James Cook University, QLD, Townsville, Australia
| | - Steven Tong
- 2Global and Tropical Health Division, Menzies School of Health Research, Charles Darwin University, Darwin, NT Australia
| | - Amanda Drilling
- 1Department of Otolaryngology Head and Neck Surgery, The Queen Elizabeth Hospital and the University of Adelaide, Woodville South, South Australia SA 5011 Australia
| | - Judy Ou
- 1Department of Otolaryngology Head and Neck Surgery, The Queen Elizabeth Hospital and the University of Adelaide, Woodville South, South Australia SA 5011 Australia
| | - Sarah Vreugde
- 1Department of Otolaryngology Head and Neck Surgery, The Queen Elizabeth Hospital and the University of Adelaide, Woodville South, South Australia SA 5011 Australia
| | - Peter-John Wormald
- 1Department of Otolaryngology Head and Neck Surgery, The Queen Elizabeth Hospital and the University of Adelaide, Woodville South, South Australia SA 5011 Australia
| | - Alkis J Psaltis
- 1Department of Otolaryngology Head and Neck Surgery, The Queen Elizabeth Hospital and the University of Adelaide, Woodville South, South Australia SA 5011 Australia
| |
Collapse
|
27
|
Shapiro LR, Paulson JN, Arnold BJ, Scully ED, Zhaxybayeva O, Pierce NE, Rocha J, Klepac-Ceraj V, Holton K, Kolter R. An Introduced Crop Plant Is Driving Diversification of the Virulent Bacterial Pathogen Erwinia tracheiphila. mBio 2018; 9:e01307-18. [PMID: 30279283 PMCID: PMC6168856 DOI: 10.1128/mbio.01307-18] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Accepted: 08/17/2018] [Indexed: 12/14/2022] Open
Abstract
Erwinia tracheiphila is the causal agent of bacterial wilt of cucurbits, an economically important phytopathogen affecting an economically important phytopathogen affecting few cultivated Cucurbitaceae few cultivated Cucurbitaceae host plant species in temperate eastern North America. However, essentially nothing is known about E. tracheiphila population structure or genetic diversity. To address this shortcoming, a representative collection of 88 E. tracheiphila isolates was gathered from throughout its geographic range, and their genomes were sequenced. Phylogenomic analysis revealed three genetic clusters with distinct hrpT3SS virulence gene repertoires, host plant association patterns, and geographic distributions. Low genetic heterogeneity within each cluster suggests a recent population bottleneck followed by population expansion. We showed that in the field and greenhouse, cucumber (Cucumis sativus), which was introduced to North America by early Spanish conquistadors, is the most susceptible host plant species and the only species susceptible to isolates from all three lineages. The establishment of large agricultural populations of highly susceptible C. sativus in temperate eastern North America may have facilitated the original emergence of E. tracheiphila into cucurbit agroecosystems, and this introduced plant species may now be acting as a highly susceptible reservoir host. Our findings have broad implications for agricultural sustainability by drawing attention to how worldwide crop plant movement, agricultural intensification, and locally unique environments may affect the emergence, evolution, and epidemic persistence of virulent microbial pathogens.IMPORTANCEErwinia tracheiphila is a virulent phytopathogen that infects two genera of cucurbit crop plants, Cucurbita spp. (pumpkin and squash) and Cucumis spp. (muskmelon and cucumber). One of the unusual ecological traits of this pathogen is that it is limited to temperate eastern North America. Here, we complete the first large-scale sequencing of an E. tracheiphila isolate collection. From phylogenomic, comparative genomic, and empirical analyses, we find that introduced Cucumis spp. crop plants are driving the diversification of E. tracheiphila into multiple lineages. Together, the results from this study show that locally unique biotic (plant population) and abiotic (climate) conditions can drive the evolutionary trajectories of locally endemic pathogens in unexpected ways.
Collapse
Affiliation(s)
- Lori R Shapiro
- Department of Microbiology and Immunology, Harvard Medical School, Boston, Massachusetts, USA
- Department of Organismal and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
- Department of Applied Ecology, North Carolina State University, Raleigh, North Carolina, USA
| | - Joseph N Paulson
- Department of Biostatistics, Product Development, Genentech Inc., San Francisco, California, USA
| | - Brian J Arnold
- Center for Communicable Disease Dynamics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Erin D Scully
- Stored Product Insect and Engineering Research Unit, USDA-ARS Center for Grain and Animal Health Research, Manhattan, Kansas, USA
| | - Olga Zhaxybayeva
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, USA
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire, USA
| | - Naomi E Pierce
- Department of Organismal and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Jorge Rocha
- Department of Microbiology and Immunology, Harvard Medical School, Boston, Massachusetts, USA
- CIDEA Consortium Conacyt-Centro de Investigación en Alimentación y Desarrollo, Hermosillo, Mexico
| | - Vanja Klepac-Ceraj
- Department of Biological Sciences, Wellesley College, Wellesley, Massachusetts, USA
| | - Kristina Holton
- Department of Biostatistics, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Roberto Kolter
- Department of Microbiology and Immunology, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
28
|
López-Fernández L, Sanchis M, Navarro-Rodríguez P, Nicolás FE, Silva-Franco F, Guarro J, Garre V, Navarro-Mendoza MI, Pérez-Arques C, Capilla J. Understanding Mucor circinelloides pathogenesis by comparative genomics and phenotypical studies. Virulence 2018; 9:707-720. [PMID: 29436903 PMCID: PMC5955452 DOI: 10.1080/21505594.2018.1435249] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The increasing number of infections by species of Mucorales and their high mortality constitute an important concern for public health. This study aims to decipher the genetic basis of Mucor circinelloides pathogenicity, which displays virulence in a strain dependent manner. Assuming that genetic differences between strains may be linked to different pathotypes, we have conducted a study to explore genes responsible for virulence in M. circinelloides by whole genome sequencing of the avirulent strain NRRL3631 and comparison with the virulent strain CBS277.49. This genome analysis revealed 773 truncated, discontiguous and absent genes in the NRRL3631 strain. We also examined phenotypic traits resulting in reduced heat stress tolerance, chitosan content and lower susceptibility to toxic compounds (calcofluor white and sodium dodecyl sulphate) in the virulent strain, suggesting the influence of cell wall on pathogenesis. Based on these results, we focused on studying extracellular protein-coding genes by gene deletion and further pathotype characterization of mutants in murine models of pulmonary and systemic infection. Deletion of gene ID112092, which codes for a hypothetical extracellular protein of unknown function, resulted in significant reduction of virulence. Although pathogenesis is a multifactorial process, these findings highlight the crucial role of surface and secreted proteins in M. circinelloides virulence and should promote further studies of other differential genes.
Collapse
Affiliation(s)
- Loida López-Fernández
- a Unitat de Microbiologia, Facultat de Medicina i Ciències de la Salut, Universitat Rovira i Virgili and Institut d'Investigació Sanitària Pere Virgili (IISPV) , Reus , Spain
| | - Marta Sanchis
- a Unitat de Microbiologia, Facultat de Medicina i Ciències de la Salut, Universitat Rovira i Virgili and Institut d'Investigació Sanitària Pere Virgili (IISPV) , Reus , Spain
| | - Patricia Navarro-Rodríguez
- a Unitat de Microbiologia, Facultat de Medicina i Ciències de la Salut, Universitat Rovira i Virgili and Institut d'Investigació Sanitària Pere Virgili (IISPV) , Reus , Spain
| | - Francisco E Nicolás
- b Departamento de Genética y Microbiología , Facultad de Biología, Universidad de Murcia , Murcia , Spain
| | | | - Josep Guarro
- a Unitat de Microbiologia, Facultat de Medicina i Ciències de la Salut, Universitat Rovira i Virgili and Institut d'Investigació Sanitària Pere Virgili (IISPV) , Reus , Spain
| | - Victoriano Garre
- b Departamento de Genética y Microbiología , Facultad de Biología, Universidad de Murcia , Murcia , Spain
| | | | - Carlos Pérez-Arques
- b Departamento de Genética y Microbiología , Facultad de Biología, Universidad de Murcia , Murcia , Spain
| | - Javier Capilla
- a Unitat de Microbiologia, Facultat de Medicina i Ciències de la Salut, Universitat Rovira i Virgili and Institut d'Investigació Sanitària Pere Virgili (IISPV) , Reus , Spain
| |
Collapse
|
29
|
Abd El Ghany M, Sharaf H, Al-agamy MH, Shibl A, Hill-Cawthorne GA, Hong PY. Genomic characterization of NDM-1 and 5, and OXA-181 carbapenemases in uropathogenic Escherichia coli isolates from Riyadh, Saudi Arabia. PLoS One 2018; 13:e0201613. [PMID: 30110357 PMCID: PMC6093660 DOI: 10.1371/journal.pone.0201613] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2018] [Accepted: 07/18/2018] [Indexed: 12/18/2022] Open
Abstract
Urinary tract infections (UTIs) associated with Escherichia coli are a growing threat with an increase in the prevalence of multidrug resistant (MDR) strains, particularly ß-lactamase producers, occurring globally. We investigated the presence of carbapenem-resistant uropathogenic E. coli clones in community-acquired UTIs in Riyadh, Kingdom of Saudi Arabia (KSA) to identify the virulence and resistance structures of the resistant clones and relate the isolates to those circulating globally. A combination of comparative genomics and phenotypic approaches were used to characterize ten MDR-uropathogenic Escherichia coli isolates recovered from UTI patients in Riyadh between November 2014 and January 2015. We report the presence of NDM-1 and 5, and OXA-181 in carbapenem-resistant UPEC strains from Riyadh, KSA. Single nucleotide polymorphism analyses demonstrated that these ten isolates fell into four phylogenetically distinct clades within the UPEC phylogeny. Comparative genomic analyses indicate that these diverse clones could be distinguished according to their multilocus sequencing type (MLST), serology, and virulence and antimicrobial gene architectures. These clones include the blaNDM-1 carrying isolates of the globally predominant MDR ST131 and ST69 types, previously identified as one of the most common UPEC strains in KSA. This is in addition to clones of ST23Cplx (ST410) and ST448Cplx (ST448) that have likely evolved from common intestinal strains, carrying copies of ß-lactamase genes including blaNDM-5, blaCTX-M-15, blaTEM-1, blaCMY-42, blaOXA-1 and blaOXA-181. These data have identified an emerging public health concern and highlight the need to use comprehensive approaches to detect the structure of MDR E. coli populations associated with community-acquired UTIs in KSA.
Collapse
Affiliation(s)
- Moataz Abd El Ghany
- Westmead Institute for Medical Research, The University of Sydney, Sydney, Australia
- Marie Bashir Institute for Infectious Diseases and Biosecurity, The University of Sydney, Sydney, Australia
| | - Hazem Sharaf
- Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States of America
| | - Mohamed H. Al-agamy
- Department of Pharmaceutics and Microbiology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
- Department of Microbiology and Immunology, Faculty of Pharmacy, Al-Azhar University, Cairo, Egypt
| | - Atef Shibl
- Microbiology and Immunology Department, College of Medicine, Alfaisal University, Riyadh, Saudi Arabia
| | - Grant A. Hill-Cawthorne
- Marie Bashir Institute for Infectious Diseases and Biosecurity, The University of Sydney, Sydney, Australia
- School of Public Health, The University of Sydney, Sydney, Australia
| | - Pei-Ying Hong
- Water Desalination and Reuse Center, Environmental Science and Engineering, Division of Biological and Environmental Science and Engineering, King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia
| |
Collapse
|
30
|
Duan SF, Han PJ, Wang QM, Liu WQ, Shi JY, Li K, Zhang XL, Bai FY. The origin and adaptive evolution of domesticated populations of yeast from Far East Asia. Nat Commun 2018; 9:2690. [PMID: 30002370 PMCID: PMC6043522 DOI: 10.1038/s41467-018-05106-7] [Citation(s) in RCA: 126] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2017] [Accepted: 04/24/2018] [Indexed: 11/08/2022] Open
Abstract
The yeast Saccharomyces cerevisiae has been an essential component of human civilization because of its long global history of use in food and beverage fermentation. However, the diversity and evolutionary history of the domesticated populations of the yeast remain elusive. We show here that China/Far East Asia is likely the center of origin of the domesticated populations of the species. The domesticated populations form two major groups associated with solid- and liquid-state fermentation and appear to have originated from heterozygous ancestors, which were likely formed by outcrossing between diverse wild isolates primitively for adaptation to maltose-rich niches. We found consistent gene expansion and contraction in the whole domesticated population, as well as lineage-specific genome variations leading to adaptation to different environments. We show a nearly panoramic view of the diversity and life history of S. cerevisiae and provide new insights into the origin and evolution of the species.
Collapse
Affiliation(s)
- Shou-Fu Duan
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, No.19(A) Yuquan Road, Shijingshan District, Beijing, 100049, China
| | - Pei-Jie Han
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Qi-Ming Wang
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Wan-Qiu Liu
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Jun-Yan Shi
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, No.19(A) Yuquan Road, Shijingshan District, Beijing, 100049, China
| | - Kuan Li
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xiao-Ling Zhang
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Feng-Yan Bai
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China.
- College of Life Sciences, University of Chinese Academy of Sciences, No.19(A) Yuquan Road, Shijingshan District, Beijing, 100049, China.
| |
Collapse
|
31
|
Network-guided genomic and metagenomic analysis of the faecal microbiota of the critically endangered kakapo. Sci Rep 2018; 8:8128. [PMID: 29802288 PMCID: PMC5970201 DOI: 10.1038/s41598-018-26484-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Accepted: 05/11/2018] [Indexed: 12/25/2022] Open
Abstract
The kakapo is a critically endangered, herbivorous parrot endemic to New Zealand. The kakapo hindgut hosts a dense microbial community of low taxonomic diversity, typically dominated by Escherichia fergusonii, and has proven to be a remarkably stable ecosystem, displaying little variation in core membership over years of study. To elucidate mechanisms underlying this robustness, we performed 16S rRNA gene-based co-occurrence network analysis to identify potential interactions between E. fergusonii and the wider bacterial community. Genomic and metagenomic sequencing were employed to facilitate interpretation of potential interactions observed in the network. E. fergusonii maintained very few correlations with other members of the microbiota, and isolates possessed genes for the generation of energy from a wide range of carbohydrate sources, including plant fibres such as cellulose. We surmise that this dominant microorganism is abundant not due to ecological interaction with other members of the microbiota, but its ability to metabolise a wide range of nutrients in the gut. This research represents the first concerted effort to understand the functional roles of the kakapo microbiota, and leverages metagenomic data to contextualise co-occurrence patterns. By combining these two techniques we provide a means for studying the diversity-stability hypothesis in the context of bacterial ecosystems.
Collapse
|
32
|
Draft Genome Sequence of Staphylococcus microti DSM 22147, Isolated from the Common Vole. GENOME ANNOUNCEMENTS 2018; 6:6/20/e00420-18. [PMID: 29773631 PMCID: PMC5958262 DOI: 10.1128/genomea.00420-18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Staphylococcus microti DSM 22147 was isolated from viscera of common voles (Microtus arvalis Pallas) with generalized Brucella microti infection in the Czech Republic. To the best of our knowledge, the genome sequence of the species S. microti has not been previously studied. The complete genome sequence of strain DSM 22147 includes a genome of 2,381,859 bp (38.0% GC content) without any plasmids.
Collapse
|
33
|
Tsai YM, Lo WS, Wu PS, Cho ST, Kuo CH. Complete Genome Sequence of Spiroplasma monobiae MQ-1 T (ATCC 33825), a Bacterium Isolated from the Vespid Wasp (Monobia quadridens). GENOME ANNOUNCEMENTS 2018; 6:e00347-18. [PMID: 29724836 PMCID: PMC5940939 DOI: 10.1128/genomea.00347-18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Accepted: 03/24/2018] [Indexed: 11/23/2022]
Abstract
Spiroplasma monobiae MQ-1T (ATCC 33825) was isolated from the hemolymph of an adult vespid wasp (Monobia quadridens) collected in Maryland. Here, we report the complete genome sequence of this bacterium to facilitate the investigation of its biology and the comparative genomics among Spiroplasma species.
Collapse
Affiliation(s)
- Yi-Ming Tsai
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| | - Wen-Sui Lo
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| | - Pei-Shan Wu
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| | - Shu-Ting Cho
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| | - Chih-Horng Kuo
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
34
|
Schwab S, Terra LA, Baldani JI. Genomic characterization of Nitrospirillum amazonense strain CBAmC, a nitrogen-fixing bacterium isolated from surface-sterilized sugarcane stems. Mol Genet Genomics 2018; 293:997-1016. [PMID: 29696375 DOI: 10.1007/s00438-018-1439-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 04/18/2018] [Indexed: 02/07/2023]
Abstract
Nitrospirillum amazonense is a nitrogen-fixing bacterium that shows potential to promote plant growth when inoculated into sugarcane and rice plants. This microorganism has been the subject of biochemical and genetic characterization to elucidate important functions related to host plant interaction and growth promotion, including the determination of draft genome sequences of two strains, Y2 and CBAmC, the second of which is the aim of the present study. CBAmC has been isolated from sugarcane (Saccharum spp.), and is currently used in a sugarcane consortium inoculant with four other nitrogen-fixing bacterial strains. The present paper describes a significant improvement in the genome sequence and assembly for the N. amazonense strain CBAmC, and determination for the first time of a complete genome sequence for this bacterial species, using PacBio technology. The analysis of the genomic data obtained allowed the discovery of genes coding for metabolic pathways and cellular structures that may be determinant for the success of the bacterial establishment and colonization into the host sugarcane plant, besides conferring important characteristics to the inoculant. These include genes for the use of sucrose and N-glycans, biosynthesis of autoinducer molecules, siderophore production and acquisition, auxin and polyamine biosynthesis, flagellum, σ-fimbriae, a variety of secretion systems, and a complete denitrification system. Concerning genes for nitrogenase and auxiliary proteins, it was possible to corroborate literature data that in N. amazonense these probably had originated from horizontal gene transfer, from bacteria of the Rhizobiales order. The complete genomic sequence of the CBAmC strain of N. amazonense revealed that the bacterium harbors four replicons, including three chromosomes and one chromid, a profile that coincides with that of other two strains, according to literature data, suggesting that as a replicon pattern for the species. Finally, results of phylogenomic analyses in this work support the recent reclassification of the species, separating it from the Azospirillum genus. More importantly, results of the present work shall guide subsequent studies on strain CBAmC as well as the development of a sugarcane inoculant.
Collapse
Affiliation(s)
- Stefan Schwab
- Instituto Nacional de Ciência e Tecnologia da Fixação Biológica de Nitrogênio, CNPq, Curitiba, Brazil.
- Embrapa Agrobiologia, Rodovia BR 465, km 7, Seropédica, RJ, 23891-000, Brazil.
| | - Leonardo Araujo Terra
- Instituto Nacional de Ciência e Tecnologia da Fixação Biológica de Nitrogênio, CNPq, Curitiba, Brazil
- Embrapa Agrobiologia, Rodovia BR 465, km 7, Seropédica, RJ, 23891-000, Brazil
- Pró-Reitoria de Pesquisa e Pós-Graduação, Universidade Federal Rural do Rio de Janeiro, Rodovia BR 465, km 7, Seropédica, RJ, 23890-000, Brazil
| | - José Ivo Baldani
- Instituto Nacional de Ciência e Tecnologia da Fixação Biológica de Nitrogênio, CNPq, Curitiba, Brazil
- Embrapa Agrobiologia, Rodovia BR 465, km 7, Seropédica, RJ, 23891-000, Brazil
| |
Collapse
|
35
|
Abstract
Agrobacterium tumefaciens 1D1609 is a highly virulent strain isolated from a crown gall tumor of alfalfa (Medicago sativa L.). Compared to other well-characterized A. tumefaciens strains, such as C58 and Ach5, 1D1609 has a distinctive host range. Here, we report its complete genome sequence to facilitate future studies.
Collapse
|
36
|
Complete Genome Sequence of Spiroplasma floricola 23-6
T
(ATCC 29989), a Bacterium Isolated from a Tulip Tree (Liriodendron tulipifera L.). GENOME ANNOUNCEMENTS 2018; 6:6/16/e00302-18. [PMID: 29674553 PMCID: PMC5908944 DOI: 10.1128/genomea.00302-18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Spiroplasma floricola 23-6T (ATCC 29989) was isolated from the flower surface of a tulip tree (Liriodendron tulipifera L.). Here, we report the complete genome sequence of this bacterium to facilitate the investigation of its biology and the comparative genomics among Spiroplasma species.
Collapse
|
37
|
Böhme U, Otto TD, Cotton JA, Steinbiss S, Sanders M, Oyola SO, Nicot A, Gandon S, Patra KP, Herd C, Bushell E, Modrzynska KK, Billker O, Vinetz JM, Rivero A, Newbold CI, Berriman M. Complete avian malaria parasite genomes reveal features associated with lineage-specific evolution in birds and mammals. Genome Res 2018; 28:547-560. [PMID: 29500236 PMCID: PMC5880244 DOI: 10.1101/gr.218123.116] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Accepted: 02/28/2018] [Indexed: 01/08/2023]
Abstract
Avian malaria parasites are prevalent around the world and infect a wide diversity of bird species. Here, we report the sequencing and analysis of high-quality draft genome sequences for two avian malaria species, Plasmodium relictum and Plasmodium gallinaceum We identify 50 genes that are specific to avian malaria, located in an otherwise conserved core of the genome that shares gene synteny with all other sequenced malaria genomes. Phylogenetic analysis suggests that the avian malaria species form an outgroup to the mammalian Plasmodium species, and using amino acid divergence between species, we estimate the avian- and mammalian-infective lineages diverged in the order of 10 million years ago. Consistent with their phylogenetic position, we identify orthologs of genes that had previously appeared to be restricted to the clades of parasites containing Plasmodium falciparum and Plasmodium vivax, the species with the greatest impact on human health. From these orthologs, we explore differential diversifying selection across the genus and show that the avian lineage is remarkable in the extent to which invasion-related genes are evolving. The subtelomeres of the P. relictum and P. gallinaceum genomes contain several novel gene families, including an expanded surf multigene family. We also identify an expansion of reticulocyte binding protein homologs in P. relictum, and within these proteins, we detect distinct regions that are specific to nonhuman primate, humans, rodent, and avian hosts. For the first time in the Plasmodium lineage, we find evidence of transposable elements, including several hundred fragments of LTR-retrotransposons in both species and an apparently complete LTR-retrotransposon in the genome of P. gallinaceum.
Collapse
Affiliation(s)
- Ulrike Böhme
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Thomas D Otto
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - James A Cotton
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Sascha Steinbiss
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Mandy Sanders
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Samuel O Oyola
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
- International Livestock Research Institute, Nairobi 00100, Kenya
| | - Antoine Nicot
- CEFE UMR 5175, CNRS-Université de Montpellier-Université Paul-Valéry Montpellier-EPHE, 34293 Montpellier Cedex 5, France
| | - Sylvain Gandon
- CEFE UMR 5175, CNRS-Université de Montpellier-Université Paul-Valéry Montpellier-EPHE, 34293 Montpellier Cedex 5, France
| | - Kailash P Patra
- Department of Medicine, Division of Infectious Diseases, University of California San Diego, School of Medicine, La Jolla, California 92093, USA
| | - Colin Herd
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Ellen Bushell
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Katarzyna K Modrzynska
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Oliver Billker
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Joseph M Vinetz
- Department of Medicine, Division of Infectious Diseases, University of California San Diego, School of Medicine, La Jolla, California 92093, USA
| | - Ana Rivero
- MIVEGEC (CNRS UMR 5290), 34394 Montpellier Cedex 5, France
| | - Chris I Newbold
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, United Kingdom
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| |
Collapse
|
38
|
Comparative Genomics and Identification of an Enterotoxin-Bearing Pathogenicity Island, SEPI-1/SECI-1, in Staphylococcus epidermidis Pathogenic Strains. Toxins (Basel) 2018; 10:toxins10030093. [PMID: 29495323 PMCID: PMC5869381 DOI: 10.3390/toxins10030093] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Revised: 02/22/2018] [Accepted: 02/22/2018] [Indexed: 02/08/2023] Open
Abstract
Staphylococcus epidermidis is a leading cause of nosocomial infections, majorly resistant to beta-lactam antibiotics, and may transfer several mobile genetic elements among the members of its own species, as well as to Staphylococcus aureus; however, a genetic exchange from S. aureus to S. epidermidis remains controversial. We recently identified two pathogenic clinical strains of S. epidermidis that produce a staphylococcal enterotoxin C3-like (SEC) similar to that by S. aureus pathogenicity islands. This study aimed to determine the genetic environment of the SEC-coding sequence and to identify the mobile genetic elements. Whole-genome sequencing and annotation of the S. epidermidis strains were performed using Illumina technology and a bioinformatics pipeline for assembly, which provided evidence that the SEC-coding sequences were located in a composite pathogenicity island that was previously described in the S. epidermidis strain FRI909, called SePI-1/SeCI-1, with 83.8–89.7% nucleotide similarity. Various other plasmids were identified, particularly p_3_95 and p_4_95, which carry antibiotic resistance genes (hsrA and dfrG, respectively), and share homologies with SAP085A and pUSA04-2-SUR11, two plasmids described in S. aureus. Eventually, one complete prophage was identified, ΦSE90, sharing 30 out of 52 coding sequences with the Acinetobacter phage vB_AbaM_IME200. Thus, the SePI-1/SeCI-1 pathogenicity island was identified in two pathogenic strains of S. epidermidis that produced a SEC enterotoxin causing septic shock. These findings suggest the existence of in vivo genetic exchange from S. aureus to S. epidermidis.
Collapse
|
39
|
Lasserre M, Fresia P, Greif G, Iraola G, Castro-Ramos M, Juambeltz A, Nuñez Á, Naya H, Robello C, Berná L. Whole genome sequencing of the monomorphic pathogen Mycobacterium bovis reveals local differentiation of cattle clinical isolates. BMC Genomics 2018; 19:2. [PMID: 29291727 PMCID: PMC5748942 DOI: 10.1186/s12864-017-4249-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Accepted: 10/31/2017] [Indexed: 11/10/2022] Open
Abstract
Background Bovine tuberculosis (bTB) poses serious risks to animal welfare and economy, as well as to public health as a zoonosis. Its etiological agent, Mycobacterium bovis, belongs to the Mycobacterium tuberculosis complex (MTBC), a group of genetically monomorphic organisms featured by a remarkably high overall nucleotide identity (99.9%). Indeed, this characteristic is of major concern for correct typing and determination of strain-specific traits based on sequence diversity. Due to its historical economic dependence on cattle production, Uruguay is deeply affected by the prevailing incidence of Mycobacterium bovis. With the world’s highest number of cattle per human, and its intensive cattle production, Uruguay represents a particularly suited setting to evaluate genomic variability among isolates, and the diversity traits associated to this pathogen. Results We compared 186 genomes from MTBC strains isolated worldwide, and found a highly structured population in M. bovis. The analysis of 23 new M. bovis genomes, belonging to strains isolated in Uruguay evidenced three groups present in the country. Despite presenting an expected highly conserved genomic structure and sequence, these strains segregate into a clustered manner within the worldwide phylogeny. Analysis of the non-pe/ppe differential areas against a reference genome defined four main sources of variability, namely: regions of difference (RD), variable genes, duplications and novel genes. RDs and variant analysis segregated the strains into clusters that are concordant with their spoligotype identities. Due to its high homoplasy rate, spoligotyping failed to reflect the true genomic diversity among worldwide representative strains, however, it remains a good indicator for closely related populations. Conclusions This study introduces a comprehensive population structure analysis of worldwide M. bovis isolates. The incorporation and analysis of 23 novel Uruguayan M. bovis genomes, sheds light onto the genomic diversity of this pathogen, evidencing the existence of greater genetic variability among strains than previously contemplated. Electronic supplementary material The online version of this article (10.1186/s12864-017-4249-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Moira Lasserre
- Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Pablo Fresia
- Unidad de Bioinformática, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Gonzalo Greif
- Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Gregorio Iraola
- Unidad de Bioinformática, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Miguel Castro-Ramos
- Departamento de Bacteriología, División de Laboratorios Veterinarios (DI.LA.VE.) "Miguel C. Rubino", Montevideo, Uruguay
| | - Arturo Juambeltz
- Departamento de Bacteriología, División de Laboratorios Veterinarios (DI.LA.VE.) "Miguel C. Rubino", Montevideo, Uruguay
| | - Álvaro Nuñez
- Departamento de Bacteriología, División de Laboratorios Veterinarios (DI.LA.VE.) "Miguel C. Rubino", Montevideo, Uruguay
| | - Hugo Naya
- Unidad de Bioinformática, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Carlos Robello
- Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay. .,Departamento de Bioquímica, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay.
| | - Luisa Berná
- Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay.
| |
Collapse
|
40
|
Jaén-Luchoro D, Aliaga-Lozano F, Gomila RM, Gomila M, Salvà-Serra F, Lalucat J, Bennasar-Figueras A. First insights into a type II toxin-antitoxin system from the clinical isolate Mycobacterium sp. MHSD3, similar to epsilon/zeta systems. PLoS One 2017; 12:e0189459. [PMID: 29236773 PMCID: PMC5728571 DOI: 10.1371/journal.pone.0189459] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 11/28/2017] [Indexed: 01/21/2023] Open
Abstract
A putative type II toxin-antitoxin (TA) system was found in the clinical isolate Mycobacterium sp. MHSD3, a strain closely related to Mycobacterium chelonae. Further analyses of the protein sequences of the two genes revealed the presence of domains related to a TA system. BLAST analyses indicated the presence of closely related proteins in the genomes of other recently published M. chelonae strains. The functionality of both elements of the TA system was demonstrated when expressed in Escherichia coli cells, and the predicted structure of the toxin is very similar to those of well-known zeta-toxins, leading to the definition of a type II TA system similar to epsilon/zeta TA systems in strains that are closely related to M. chelonae.
Collapse
Affiliation(s)
- Daniel Jaén-Luchoro
- Microbiologia, Departament de Biologia, Universitat de les Illes Balears, Palma de Mallorca, Spain
- Department of Infectious Diseases, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Centre for Antibiotic Resistance Research (CARe) at University of Gothenburg, Gothenburg, Sweden
- * E-mail:
| | - Francisco Aliaga-Lozano
- Microbiologia, Departament de Biologia, Universitat de les Illes Balears, Palma de Mallorca, Spain
- Laboratorio de Biología Molecular, Clínica Rotger, Palma de Mallorca, Spain
| | - Rosa Maria Gomila
- Serveis Cientifico-Tècnics, Universitat de les Illes Balears, Palma de Mallorca, Spain
| | - Margarita Gomila
- Microbiologia, Departament de Biologia, Universitat de les Illes Balears, Palma de Mallorca, Spain
| | - Francisco Salvà-Serra
- Microbiologia, Departament de Biologia, Universitat de les Illes Balears, Palma de Mallorca, Spain
- Department of Infectious Diseases, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Centre for Antibiotic Resistance Research (CARe) at University of Gothenburg, Gothenburg, Sweden
| | - Jorge Lalucat
- Microbiologia, Departament de Biologia, Universitat de les Illes Balears, Palma de Mallorca, Spain
- Institut Mediterrani d’Estudis Avançats (IMEDEA, CSIC-UIB), Esporles, Spain
| | - Antoni Bennasar-Figueras
- Microbiologia, Departament de Biologia, Universitat de les Illes Balears, Palma de Mallorca, Spain
- Area de Enfermedades Infecciosas, Instituto Universitario de Investigaciones en Ciencias de la Salud (IUNICS-UIB), Universitat de les Illes Balears, Palma de Mallorca, Spain
| |
Collapse
|
41
|
Argemi X, Martin V, Loux V, Dahyot S, Lebeurre J, Guffroy A, Martin M, Velay A, Keller D, Riegel P, Hansmann Y, Paul N, Prévost G. Whole-Genome Sequencing of Seven Strains of Staphylococcus lugdunensis Allows Identification of Mobile Genetic Elements. Genome Biol Evol 2017; 9:3746526. [PMID: 28444231 PMCID: PMC5425232 DOI: 10.1093/gbe/evx077] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/20/2017] [Indexed: 12/17/2022] Open
Abstract
Coagulase negative staphylococci are normal inhabitant of the human skin flora that account for an increasing number of infections, particularly hospital-acquired infections. Staphylococcus lugdunensis has emerged as a most virulent species causing various infections with clinical characteristics close to what clinicians usually observe with Staphylococcus aureus and both bacteria share more than 70% of their genome. Virulence of S. aureus relies on a large repertoire of virulence factors, many of which are encoded on mobile genetic elements. S. lugdunensis also bears various putative virulence genes but only one complete genome with extensive analysis has been published with one prophage sequence (φSL2) and a unique plasmid was previously described. In this study, we performed de novo sequencing, whole genome assembly and annotation of seven strains of S. lugdunensis from VISLISI clinical trial. We searched for the presence of virulence genes and mobile genetics elements using bioinformatics tools. We identified four new prophages, named φSL2 to φSL4, belonging to the Siphoviridae class and five plasmids, named pVISLISI_1 to pVISLISI_5. Three plasmids are homologous to known plasmids that include, amongst others, one S. aureus plasmid. The two other plasmids were not described previously. This study provides a new context for the study of S. lugdunensis virulence suggesting the occurrence of several genetic recombination’ with other staphylococci.
Collapse
Affiliation(s)
- Xavier Argemi
- Hôpitaux Universitaires, Maladies Infectieuses et Tropicales, Strasbourg, France.,Université de Strasbourg, CHRU de Strasbourg, VBP EA7290, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut de bactériologie, Hôpitaux Universitaires de Strasbourg, France
| | - Véronique Martin
- INRA - Unité Mathématiques et Informatique Appliquées, du Genome à l'Environnement (MaIAGE), Jouy-en Josas, France
| | - Valentin Loux
- INRA - Unité Mathématiques et Informatique Appliquées, du Genome à l'Environnement (MaIAGE), Jouy-en Josas, France
| | - Sandrine Dahyot
- Laboratoire GRAM EA2656, Université de Rouen - IRIB UFR Médecine-Pharmacie Batiment Recherche, Rouen, France
| | - Jérémie Lebeurre
- Laboratoire GRAM EA2656, Université de Rouen - IRIB UFR Médecine-Pharmacie Batiment Recherche, Rouen, France
| | - Aurélien Guffroy
- Service d'Immunologie Clinique et de Médecine Interne, Centre National de Référence des Maladies Auto-immunes Rares, Hôpitaux Universitaires de Strasbourg, Strasbourg, France.,CNRS UPR 3572, Immunopathologie et Chimie Thérapeutique/Equipe, Tolérance Cellulaire B et Auto-immunité, Laboratoire d'excellence Medalis, Institute of Molecular and Cellular Biology (IBMC), Strasbourg, France
| | - Mickael Martin
- Service d'Immunologie Clinique et de Médecine Interne, Centre National de Référence des Maladies Auto-immunes Rares, Hôpitaux Universitaires de Strasbourg, Strasbourg, France.,CNRS UPR 3572, Immunopathologie et Chimie Thérapeutique/Equipe, Tolérance Cellulaire B et Auto-immunité, Laboratoire d'excellence Medalis, Institute of Molecular and Cellular Biology (IBMC), Strasbourg, France
| | - Aurélie Velay
- Virology Laboratory, University Hospital of Strasbourg, Strasbourg, France.,2-INSERM, UMR_S1109, LabEx Transplantex, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Université de Strasbourg, Strasbourg, France
| | - Daniel Keller
- Université de Strasbourg, CHRU de Strasbourg, VBP EA7290, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut de bactériologie, Hôpitaux Universitaires de Strasbourg, France
| | - Philippe Riegel
- Université de Strasbourg, CHRU de Strasbourg, VBP EA7290, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut de bactériologie, Hôpitaux Universitaires de Strasbourg, France
| | - Yves Hansmann
- Hôpitaux Universitaires, Maladies Infectieuses et Tropicales, Strasbourg, France.,Université de Strasbourg, CHRU de Strasbourg, VBP EA7290, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut de bactériologie, Hôpitaux Universitaires de Strasbourg, France
| | - Nicodème Paul
- Laboratoire d'ImmunoRhumatologie Moléculaire, INSERM UMR_S 1109, Plateforme GENOMAX, Faculté de Médecine, Fédération Hospitalo-Universitaire OMICARE, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Université de Strasbourg, Strasbourg, France
| | - Gilles Prévost
- Université de Strasbourg, CHRU de Strasbourg, VBP EA7290, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut de bactériologie, Hôpitaux Universitaires de Strasbourg, France
| |
Collapse
|
42
|
Complete Genome Sequence of Spiroplasma corruscae EC-1
T
(DSM 19793), a Bacterium Isolated from a Lampyrid Beetle (Ellychnia corrusca). GENOME ANNOUNCEMENTS 2017; 5:5/37/e00964-17. [PMID: 28912320 PMCID: PMC5597761 DOI: 10.1128/genomea.00964-17] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Spiroplasma corruscae EC-1T (DSM 19793) was isolated from the gut of a lampryid beetle (Ellychnia corrusca) collected in Beltsville, MD, USA, in 1983. Here, we report the complete genome sequence of this bacterium to facilitate the investigation of its biology and the comparative genomics among Spiroplasma species.
Collapse
|
43
|
Price EP, Sarovich DS, Webb JR, Hall CM, Jaramillo SA, Sahl JW, Kaestli M, Mayo M, Harrington G, Baker AL, Sidak-Loftis LC, Settles EW, Lummis M, Schupp JM, Gillece JD, Tuanyok A, Warner J, Busch JD, Keim P, Currie BJ, Wagner DM. Phylogeographic, genomic, and meropenem susceptibility analysis of Burkholderia ubonensis. PLoS Negl Trop Dis 2017; 11:e0005928. [PMID: 28910350 PMCID: PMC5614643 DOI: 10.1371/journal.pntd.0005928] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2017] [Revised: 09/26/2017] [Accepted: 09/03/2017] [Indexed: 01/02/2023] Open
Abstract
The bacterium Burkholderia ubonensis is commonly co-isolated from environmental specimens harbouring the melioidosis pathogen, Burkholderia pseudomallei. B. ubonensis has been reported in northern Australia and Thailand but not North America, suggesting similar geographic distribution to B. pseudomallei. Unlike most other Burkholderia cepacia complex (Bcc) species, B. ubonensis is considered non-pathogenic, although its virulence potential has not been tested. Antibiotic resistance in B. ubonensis, particularly towards drugs used to treat the most severe B. pseudomallei infections, has also been poorly characterised. This study examined the population biology of B. ubonensis, and includes the first reported isolates from the Caribbean. Phylogenomic analysis of 264 B. ubonensis genomes identified distinct clades that corresponded with geographic origin, similar to B. pseudomallei. A small proportion (4%) of strains lacked the 920kb chromosome III replicon, with discordance of presence/absence amongst genetically highly related strains, demonstrating that the third chromosome of B. ubonensis, like other Bcc species, probably encodes for a nonessential pC3 megaplasmid. Multilocus sequence typing using the B. pseudomallei scheme revealed that one-third of strains lack the "housekeeping" narK locus. In comparison, all strains could be genotyped using the Bcc scheme. Several strains possessed high-level meropenem resistance (≥32 μg/mL), a concern due to potential transmission of this phenotype to B. pseudomallei. In silico analysis uncovered a high degree of heterogeneity among the lipopolysaccharide O-antigen cluster loci, with at least 35 different variants identified. Finally, we show that Asian B. ubonensis isolate RF23-BP41 is avirulent in the BALB/c mouse model via a subcutaneous route of infection. Our results provide several new insights into the biology of this understudied species.
Collapse
Affiliation(s)
- Erin P. Price
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
- Centre for Animal Health Innovation, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sippy Downs, Queensland, Australia
| | - Derek S. Sarovich
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
- Centre for Animal Health Innovation, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sippy Downs, Queensland, Australia
| | - Jessica R. Webb
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
| | - Carina M. Hall
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Sierra A. Jaramillo
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Jason W. Sahl
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Mirjam Kaestli
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
| | - Mark Mayo
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
| | - Glenda Harrington
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
| | - Anthony L. Baker
- Environmental and Public Health Microbiology Research Group, Microbiology and Immunology, James Cook University, Townsville, Queensland, Australia
- Tasmanian Institute of Agriculture, University of Tasmania, Hobart, Tasmania, Australia
| | - Lindsay C. Sidak-Loftis
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Erik W. Settles
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Madeline Lummis
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - James M. Schupp
- Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
| | - John D. Gillece
- Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
| | - Apichai Tuanyok
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Jeffrey Warner
- Environmental and Public Health Microbiology Research Group, Microbiology and Immunology, James Cook University, Townsville, Queensland, Australia
| | - Joseph D. Busch
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Paul Keim
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
- Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
| | - Bart J. Currie
- Global and Tropical Health Division, Menzies School of Health Research, Darwin, Northern Territory, Australia
| | - David M. Wagner
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| |
Collapse
|
44
|
Utturkar SM, Klingeman DM, Hurt RA, Brown SD. A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies. Front Microbiol 2017; 8:1272. [PMID: 28769883 PMCID: PMC5513972 DOI: 10.3389/fmicb.2017.01272] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 06/26/2017] [Indexed: 11/20/2022] Open
Abstract
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Collapse
Affiliation(s)
- Sagar M Utturkar
- Graduate School of Genome Science and Technology, University of TennesseeKnoxville, TN, United States
| | - Dawn M Klingeman
- Biosciences Division, Oak Ridge National LaboratoryOak Ridge, TN, United States.,BioEnergy Science CenterOak Ridge, TN, United States
| | - Richard A Hurt
- Biosciences Division, Oak Ridge National LaboratoryOak Ridge, TN, United States
| | - Steven D Brown
- Graduate School of Genome Science and Technology, University of TennesseeKnoxville, TN, United States.,Biosciences Division, Oak Ridge National LaboratoryOak Ridge, TN, United States.,BioEnergy Science CenterOak Ridge, TN, United States
| |
Collapse
|
45
|
Draft Genome Sequence of Klebsiella michiganensis 3T412C, Harboring an Arsenic Resistance Genomic Island, Isolated from Mine Tailings in Peru. GENOME ANNOUNCEMENTS 2017; 5:5/28/e00611-17. [PMID: 28705974 PMCID: PMC5511913 DOI: 10.1128/genomea.00611-17] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
An arsenic resistance genomic island in the bacterium Klebsiella michiganensis 3T412C was isolated from mine tailings from Peru. This genomic island confers adaptation to extreme environments with high concentrations of arsenic. Isolate 3T412C contained a complete set of genes involved in resistance to arsenic. This operon is surrounded by putative genes for resistance to other heavy metals.
Collapse
|
46
|
Kremer FS, McBride AJA, Pinto LDS. Approaches for in silico finishing of microbial genome sequences. Genet Mol Biol 2017; 40:553-576. [PMID: 28898352 PMCID: PMC5596377 DOI: 10.1590/1678-4685-gmb-2016-0230] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2016] [Accepted: 03/13/2017] [Indexed: 12/15/2022] Open
Abstract
The introduction of next-generation sequencing (NGS) had a significant effect on the availability of genomic information, leading to an increase in the number of sequenced genomes from a large spectrum of organisms. Unfortunately, due to the limitations implied by the short-read sequencing platforms, most of these newly sequenced genomes remained as "drafts", incomplete representations of the whole genetic content. The previous genome sequencing studies indicated that finishing a genome sequenced by NGS, even bacteria, may require additional sequencing to fill the gaps, making the entire process very expensive. As such, several in silico approaches have been developed to optimize the genome assemblies and facilitate the finishing process. The present review aims to explore some free (open source, in many cases) tools that are available to facilitate genome finishing.
Collapse
Affiliation(s)
- Frederico Schmitt Kremer
- Programa de Pós-Graduação em Biotecnologia (PPGB), Centro de
Desenvolvimento Tecnológico, Universidade Federal de Pelotas, Pelotas, Brazil
| | - Alan John Alexander McBride
- Programa de Pós-Graduação em Biotecnologia (PPGB), Centro de
Desenvolvimento Tecnológico, Universidade Federal de Pelotas, Pelotas, Brazil
| | - Luciano da Silva Pinto
- Programa de Pós-Graduação em Biotecnologia (PPGB), Centro de
Desenvolvimento Tecnológico, Universidade Federal de Pelotas, Pelotas, Brazil
| |
Collapse
|
47
|
Pasini EM, Böhme U, Rutledge GG, Voorberg-Van der Wel A, Sanders M, Berriman M, Kocken CH, Otto TD. An improved Plasmodium cynomolgi genome assembly reveals an unexpected methyltransferase gene expansion. Wellcome Open Res 2017; 2:42. [PMID: 28748222 PMCID: PMC5500898 DOI: 10.12688/wellcomeopenres.11864.1] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/15/2017] [Indexed: 11/20/2022] Open
Abstract
Background:
Plasmodium cynomolgi, a non-human primate malaria parasite species, has been an important model parasite since its discovery in 1907. Similarities in the biology of
P. cynomolgi to the closely related, but less tractable, human malaria parasite
P. vivax make it the model parasite of choice for liver biology and vaccine studies pertinent to
P. vivax malaria. Molecular and genome-scale studies of
P. cynomolgi have relied on the current reference genome sequence, which remains highly fragmented with 1,649 unassigned scaffolds and little representation of the subtelomeres. Methods: Using long-read sequence data (Pacific Biosciences SMRT technology), we assembled and annotated a new reference genome sequence, PcyM, sourced from an Indian rhesus monkey. We compare the newly assembled genome sequence with those of several other
Plasmodium species, including a re-annotated
P. coatneyi assembly. Results: The new PcyM genome assembly is of significantly higher quality than the existing reference, comprising only 56 pieces, no gaps and an improved average gene length. Detailed manual curation has ensured a comprehensive annotation of the genome with 6,632 genes, nearly 1,000 more than previously attributed to
P. cynomolgi. The new assembly also has an improved representation of the subtelomeric regions, which account for nearly 40% of the sequence. Within the subtelomeres, we identified more than 1300
Plasmodium interspersed repeat (
pir) genes, as well as a striking expansion of 36 methyltransferase pseudogenes that originated from a single copy on chromosome 9. Conclusions: The manually curated PcyM reference genome sequence is an important new resource for the malaria research community. The high quality and contiguity of the data have enabled the discovery of a novel expansion of methyltransferase in the subtelomeres, and illustrates the new comparative genomics capabilities that are being unlocked by complete reference genomes.
Collapse
Affiliation(s)
- Erica M Pasini
- Biomedical Primate Research Centre, Rijswijk, Lange Kleiweg 161, 2288GJ Rijswijk, Netherlands
| | - Ulrike Böhme
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Gavin G Rutledge
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | | | - Mandy Sanders
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Matt Berriman
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Clemens Hm Kocken
- Biomedical Primate Research Centre, Rijswijk, Lange Kleiweg 161, 2288GJ Rijswijk, Netherlands
| | - Thomas Dan Otto
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
48
|
Ccorahua-Santo R, Eca A, Abanto M, Guerra G, Ramírez P. Physiological and comparative genomic analysis of Acidithiobacillus ferrivorans PQ33 provides psychrotolerant fitness evidence for oxidation at low temperature. Res Microbiol 2017; 168:482-492. [DOI: 10.1016/j.resmic.2017.01.007] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Revised: 01/31/2017] [Accepted: 01/31/2017] [Indexed: 11/24/2022]
|
49
|
Marin MA, Fonseca E, Encinas F, Freitas F, Camargo DA, Coimbra RS, de Filippis I, Vicente AC. The invasive Neisseria meningitidis MenC CC103 from Brazil is characterized by an accessory gene repertoire. Sci Rep 2017; 7:1617. [PMID: 28487566 PMCID: PMC5431661 DOI: 10.1038/s41598-017-01671-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 03/30/2017] [Indexed: 12/21/2022] Open
Abstract
Neisseria meningitidis infections are a major issue for global health. The invasive MenC ST-103 clonal complex (CC103) has been the most prevalent in meningococcal outbreaks in Brazil, occurring also in several countries worldwide. Here we have analysed the population structure and accessory genome of MenC CC103 strains from a global perspective. An in-depth phylogenomic analysis revealed a lineage of N. meningitidis causing meningitis in Brazil and the United Kingdom. This lineage was also characterized as harbouring a particular accessory genome composed of CRISPR/Cas and restriction modification systems. This lineage was also characterized by a genomic island resembling an integrative and conjugative element. This island carried genes potentially associated with virulence and fitness. We propose this accessory gene repertoire could be contributing to the spatial-temporal persistence of the invasive MenC CC103 lineage.
Collapse
Affiliation(s)
- Michel Abanto Marin
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz (IOC) - Fundação Oswaldo Cruz (FIOCRUZ), Av. Brasil, 4365, Rio de Janeiro, Brazil.
| | - Erica Fonseca
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz (IOC) - Fundação Oswaldo Cruz (FIOCRUZ), Av. Brasil, 4365, Rio de Janeiro, Brazil
| | - Fernando Encinas
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz (IOC) - Fundação Oswaldo Cruz (FIOCRUZ), Av. Brasil, 4365, Rio de Janeiro, Brazil
| | - Fernanda Freitas
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz (IOC) - Fundação Oswaldo Cruz (FIOCRUZ), Av. Brasil, 4365, Rio de Janeiro, Brazil
| | | | - Roney Santos Coimbra
- Neurogenômica, Centro de Pesquisas René Rachou (CPqRR), Fundação Oswaldo Cruz (FIOCRUZ), Belo Horizonte, MG, Brazil
| | - Ivano de Filippis
- Instituto Nacional de Controle de Qualidade em Saúde (INCQS), Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, RJ, Brazil
| | - Ana Carolina Vicente
- Laboratório de Genética Molecular de Microrganismos, Instituto Oswaldo Cruz (IOC) - Fundação Oswaldo Cruz (FIOCRUZ), Av. Brasil, 4365, Rio de Janeiro, Brazil
| |
Collapse
|
50
|
Abstract
The target of rapamycin complex 1 (TORC1) pathway is a highly conserved signaling pathway across eukaryotes that integrates nutrient and stress signals to regulate the cellular growth rate and the transition into and maintenance of dormancy. The majority of the pathway's components, including the central TOR kinase, have been lost in the apicomplexan lineage, and it is unknown how these organisms detect and respond to nutrient starvation in its absence. Plasmodium falciparum encodes a putative ortholog of the RNA polymerase (Pol) III repressor Maf1, which has been demonstrated to modulate Pol III transcription in a TOR-dependent manner in a number of organisms. Here, we investigate the role of P. falciparum Maf1 (PfMaf1) in regulating RNA Pol III expression under conditions of nutrient starvation and other stresses. Using a transposon insertion mutant with an altered Maf1 expression profile, we demonstrated that proper Maf1 expression is necessary for survival of the dormancy-like state induced by prolonged amino acid starvation and is needed for full recovery from other stresses that slow or stall the parasite cell cycle. This Maf1 mutant is defective in the downregulation of pre-tRNA synthesis under nutrient-limiting conditions, indicating that the function of Maf1 as a stress-responsive regulator of structural RNA transcription is conserved in P. falciparum Recent work has demonstrated that parasites carrying artemisinin-resistant K13 alleles display an enhanced ability to recover from drug-induced growth retardation. We show that one such artemisinin-resistant line displays greater regulation of pre-tRNA expression and higher survival upon prolonged amino acid starvation, suggesting that overlapping, PfMaf1-associated pathways may regulate growth recovery from both artemisinin treatment and amino acid starvation.IMPORTANCE Eukaryote organisms sense changes in their environment and integrate this information through signaling pathways to activate response programs to ensure survival. The TOR pathway is a well-studied signaling pathway found throughout eukaryotes that is known to integrate a variety of signals to regulate organismal growth in response to starvation and other stresses. The human malaria parasite Plasmodium falciparum appears to have lost the TOR pathway over the course of evolution, and it is unclear how the parasite modulates its growth in response to starvation and drug treatment. Here, we show that Maf1, a protein regulated by TOR in other eukaryotes, plays an important role in maintaining the parasite's viability in the face of starvation and other forms of stress. This suggests that PfMaf1 is a component of a yet-to-be-described nutrient and stress response pathway.
Collapse
|