1
|
Cheney L, Payne M, Kaur S, Lan R. SaLTy: a novel Staphylococcus aureus Lineage Typer. Microb Genom 2024; 10:001250. [PMID: 38739116 PMCID: PMC11165655 DOI: 10.1099/mgen.0.001250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Accepted: 04/19/2024] [Indexed: 05/14/2024] Open
Abstract
Staphylococcus aureus asymptomatically colonises 30 % of humans but can also cause a range of diseases, which can be fatal. In 2017 S. aureus was associated with 20 000 deaths in the USA alone. Dividing S. aureus isolates into smaller sub-groups can reveal the emergence of distinct sub-populations with varying potential to cause infections. Despite multiple molecular typing methods categorising such sub-groups, they do not take full advantage of S. aureus genome sequences when describing the fundamental population structure of the species. In this study, we developed Staphylococcus aureus Lineage Typing (SaLTy), which rapidly divides the species into 61 phylogenetically congruent lineages. Alleles of three core genes were identified that uniquely define the 61 lineages and were used for SaLTy typing. SaLTy was validated on 5000 genomes and 99.12 % (4956/5000) of isolates were assigned the correct lineage. We compared SaLTy lineages to previously calculated clonal complexes (CCs) from BIGSdb (n=21 173). SALTy improves on CCs by grouping isolates congruently with phylogenetic structure. SaLTy lineages were further used to describe the carriage of Staphylococcal chromosomal cassette containing mecA (SCCmec) which is carried by methicillin-resistant S. aureus (MRSA). Most lineages had isolates lacking SCCmec and the four largest lineages varied in SCCmec over time. Classifying isolates into SaLTy lineages, which were further SCCmec typed, allowed SaLTy to describe high-level MRSA epidemiology. We provide SaLTy as a simple typing method that defines phylogenetic lineages (https://github.com/LanLab/SaLTy). SaLTy is highly accurate and can quickly analyse large amounts of S. aureus genome data. SaLTy will aid the characterisation of S. aureus populations and ongoing surveillance of sub-groups that threaten human health.
Collapse
Affiliation(s)
- Liam Cheney
- School of Biotechnology and Biomolecular Science, University of New South Wales, Sydney, Australia
| | - Michael Payne
- School of Biotechnology and Biomolecular Science, University of New South Wales, Sydney, Australia
| | - Sandeep Kaur
- School of Biotechnology and Biomolecular Science, University of New South Wales, Sydney, Australia
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Science, University of New South Wales, Sydney, Australia
| |
Collapse
|
2
|
Luo Y, Payne M, Kaur S, Octavia S, Jiang J, Lan R. Emergence and genomic insights of non-pandemic O1 Vibrio cholerae in Zhejiang, China. Microbiol Spectr 2023; 11:e0261523. [PMID: 37819129 PMCID: PMC10871787 DOI: 10.1128/spectrum.02615-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Accepted: 09/06/2023] [Indexed: 10/13/2023] Open
Abstract
IMPORTANCE It is well recognized that only Vibrio cholerae O1 causes cholera pandemics. However, not all O1 strains cause pandemic-level disease. In this study, we analyzed non-pandemic O1 V. cholerae isolates from the 1960s to the 1990s from China and found that they fell into three lineages, one of which shared the most recent common ancestor with pandemic O1 strains. Each of these non-pandemic O1 lineages has unique properties that contribute to their capacity to cause cholera. The findings of this study enhanced our understanding of the emergence and evolution of both pandemic and non-pandemic O1 V. cholerae.
Collapse
Affiliation(s)
- Yun Luo
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Michael Payne
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Sandeep Kaur
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Sophie Octavia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Jianmin Jiang
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
- Key Lab of Vaccine, Prevention and Control of Infectious Disease of Zhejiang Province, Hangzhou, Zhejiang, China
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
3
|
Liu Y, Li M, Hong X, Li H, Huang R, Han S, Hou J, Pan C. Screening and identification of high yield tetramethylpyrazine strains in Nongxiangxing liquor Daqu and study on the mechanism of tetramethylpyrazine production. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE 2023; 103:6849-6860. [PMID: 37293782 DOI: 10.1002/jsfa.12773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 05/30/2023] [Accepted: 06/09/2023] [Indexed: 06/10/2023]
Abstract
BACKGROUND There are few reports on the breeding of high-yielding tetramethylpyrazine (TTMP) strains in strong-flavor Daqu. In addition, studies on the mechanism of TTMP production in strains are mostly based on common physiological and biochemical indicators, and there is no report on RNA level. Therefore, in this study, a strain with high production of TTMP was screened out from strong-flavor liquor, and transcriptome sequencing analysis was performed to analyze its key metabolic pathways and key genes, and to infer the mechanism of TTMP production in the strain. RESULTS In this study, a strain with a high yield of tetramethylpyrazine (TTMP) was screened out, and the yield was 29.83 μg mL-1 . The identified strain was Bacillus velezensis, which could increase the content of TTMP in liquor by about 88%. After transcriptome sequencing, a total of 1851 differentially expressed genes were screened, including 1055 up-regulated genes and 796 down-regulated genes. Three pathways related to the production of TTMP were identified by gene ontology (GO) annotation and COG annotation, including carbohydrate metabolism, cell movement and amino acid metabolism. The key genes of TTMP were analyzed, and the factors that might regulate the production of TTMP, such as the transfer of uracil phosphate ribose and glycosyltransferase, were obtained. CONCLUSIONS A strain of B. velezensis with high TTMP production was screened and identified in strong-flavor Daqu for the first time. The yield of TTMP was 29.83 μg mL-1 , which increased the TTMP content in liquor by 88%. The key metabolic pathways of TTMP production in the strain were obtained: carbohydrate metabolism, cell movement and amino acid metabolism, and the key regulatory genes of each pathway were found, which complemented the gap in gene level in the production regulation of the strain, and provided a theoretical basis for the subsequent study of TTMP in liquor. © 2023 Society of Chemical Industry.
Collapse
Affiliation(s)
- Yanbo Liu
- College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
| | - Mengke Li
- College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
| | - Xinfeng Hong
- College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
| | - Haideng Li
- College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China
- College of Biological Engineering, Henan University of Technology, Zhengzhou, China
| | - Runna Huang
- Henan Yangshao Distillery Co., Ltd., Mianchi, China
| | - Suna Han
- Henan Yangshao Distillery Co., Ltd., Mianchi, China
| | | | - Chunmei Pan
- College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China
- Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
| |
Collapse
|
4
|
Genomic Epidemiology and Multilevel Genome Typing of Australian Salmonella enterica Serovar Enteritidis. Microbiol Spectr 2023; 11:e0301422. [PMID: 36625638 PMCID: PMC9927265 DOI: 10.1128/spectrum.03014-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Salmonella enterica serovar Enteritidis is one of the leading causes of salmonellosis in Australia. In this study, a total of 568 S. Enteritidis isolates from two Australian states across two consecutive years were analyzed and compared to international strains, using the S. Enteritidis multilevel genome typing (MGT) database, which contained 40,390 publicly available genomes from 99 countries. The Australian S. Enteritidis isolates were divided into three phylogenetic clades (A, B, and C). Clades A and C represented 16.4% and 3.5% of the total isolates, respectively, and were of local origin. Clade B accounted for 80.1% of the isolates which belonged to seven previously defined lineages but was dominated by the global epidemic lineage. At the MGT5 level, three out of five top sequence types (STs) in Australia were also top STs in Asia, suggesting that a fair proportion of Australian S. Enteritidis cases may be epidemiologically linked with Asian strains. In 2018, a large egg-associated local outbreak was caused by a recently defined clade B lineage prevalent in Europe and was closely related, but not directly linked, to three European isolates. Additionally, over half (54.8%) of predicted multidrug resistance (MDR) isolates belonged to 10 MDR-associated MGT-STs, which were also frequent in Asian S. Enteritidis . Overall, this study investigated the genomic epidemiology of S. Enteritidis in Australia, including the first large local outbreak, using MGT. The open MGT platform enables a standardized and sharable nomenclature that can be effectively applied to public health for unified surveillance of S. Enteritidis nationally and globally. IMPORTANCE Salmonella enterica serovar Enteritidis is a leading cause of foodborne infections. We previously developed a genomic typing database (MGTdb) for S. Enteritidis to facilitate global surveillance of this pathogen. In this study, we examined the genomic features of Australian S. Enteritidis using the MGTdb and found that Australian S. Enteritidis is mainly epidemiologically linked with Asian strains (especially strains carrying antimicrobial resistance genes), followed by European strains. The first large-scale egg-associated local outbreak in Australia was caused by a recently defined lineage prevalent in Europe, and three European isolates in the MGTdb were closely related but not directly linked to this outbreak. In summary, the S. Enteritidis MGTdb open platform is shown to be a potentially powerful tool for national and global public health surveillance of this pathogen.
Collapse
|
5
|
DeSantis TZ, Cardona C, Narayan NR, Viswanatham S, Ravichandar D, Wee B, Chow CE, Iwai S. StrainSelect: A novel microbiome reference database that disambiguates all bacterial strains, genome assemblies and extant cultures worldwide. Heliyon 2023; 9:e13314. [PMID: 36814618 PMCID: PMC9939595 DOI: 10.1016/j.heliyon.2023.e13314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 12/01/2022] [Accepted: 01/26/2023] [Indexed: 02/05/2023] Open
Abstract
Motivation: Microbial metagenomic profiling software and databases are advancing rapidly for development of novel disease biomarkers and therapeutics yet three problems impede analyses: 1) the conflation of "genome assembly" and "strain" in reference databases; 2) difficulty connecting DNA biomarkers to a procurable strain for laboratory experimentation; and 3) absence of a comprehensive and unified strain-resolved reference database for integrating both shotgun metagenomics and 16S rRNA gene data. Results: We demarcated 681,087 strains, the largest collection of its kind, by filtering public data into a knowledge graph of vertices representing contiguous DNA sequences, genome assemblies, strain monikers and bio-resource center (BRC) catalog numbers then adding inter-vertex edges only for synonyms or direct derivatives. Surprisingly, for 10,043 important strains, we found replicate RefSeq genome assemblies obstructing interpretation of database searches. We organized each strain into eight taxonomic ranks with bootstrap confidence inversely correlated with genome assembly contamination. The StrainSelect database is suited for applications where a taxonomic, functional or procurement reference is needed for shotgun or amplicon metagenomics since 636,568 strains have at least one 16S rRNA gene, 245,005 have at least one annotated genome assembly, and 36,671 are procurable from at least one BRC. The database overcomes all three aforementioned problems since it disambiguates strains from assemblies, locates strains at BRCs, and unifies a taxonomic reference for both 16S rRNA and shotgun metagenomics. Availability: The StrainSelect database is available in igraph and tabular vertex-edge formats compatible with Neo4J. Dereplicated MinHash and fasta databases are distributed for sourmash and usearch pipelines at http://strainselect.secondgenome.com. Contact:todd.desantis@gmail.com. Supplementary information: Supplementary data are available online.
Collapse
Affiliation(s)
- Todd Z. DeSantis
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA,Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, Germany,Corresponding author at: Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA.
| | - Cesar Cardona
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Nicole R. Narayan
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Satish Viswanatham
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Divya Ravichandar
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Brendan Wee
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | | | - Shoko Iwai
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| |
Collapse
|
6
|
Xu Z, Hu D, Luu LDW, Octavia S, Keil AD, Sintchenko V, Tanaka MM, Mooi FR, Robson J, Lan R. Genomic dissection of the microevolution of Australian epidemic Bordetella pertussis. Emerg Microbes Infect 2022; 11:1460-1473. [PMID: 35543519 PMCID: PMC9176669 DOI: 10.1080/22221751.2022.2077129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Whooping cough (pertussis) is a highly contagious respiratory disease caused by the bacterium Bordetella pertussis. Despite high vaccine coverage, pertussis has re-emerged in many countries including Australia and caused two large epidemics in Australia since 2007. Here, we undertook a genomic and phylogeographic study of 385 Australian B. pertussis isolates collected from 2008 to 2017. The Australian B. pertussis population was found to be composed of mostly ptxP3 strains carrying different fim3 alleles, with ptxP3-fim3A genotype expanding far more than ptxP3-fim3B. Within the former, there were six co-circulating epidemic lineages (EL1 to EL6). The multiple ELs emerged, expanded, and then declined at different time points over the two epidemics. In population genetics terms, both hard and soft selective sweeps through vaccine selection pressures have determined the population dynamics of Australian B. pertussis. Relative risk estimation suggests that once a new B. pertussis lineage emerged, it was more likely to spread locally within the first 1.5 years. However, after 1.5 years, any new lineage was likely to expand to a wider region. Phylogenetic analysis revealed the expansion of ptxP3 strains was also associated with replacement of the type III secretion system allele bscI1 with bscI3. bscI3 is associated with decreased T3SS secretion and may allow B. pertussis to reduce immune recognition. This study advanced our understanding of the epidemic population structure and spatial and temporal dynamics of B. pertussis in a highly immunized population.
Collapse
Affiliation(s)
- Zheng Xu
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Dalong Hu
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Laurence Don Wai Luu
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Sophie Octavia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Anthony D Keil
- Department of Microbiology, PathWest Laboratory Medicine WA, Perth Children's Hospital, Perth, Australia
| | - Vitali Sintchenko
- Centre for Infectious Diseases and Microbiology-Public Health, Institute of Clinical Pathology and Medical Research, NSW Health Pathology and Westmead Hospital, Sydney, Australia.,Sydney Institute for Infectious Diseases, The University of Sydney, Sydney, Australia
| | - Mark M Tanaka
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Frits R Mooi
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Jenny Robson
- Sullivan Nicolaides Pathology, Queensland, Australia
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| |
Collapse
|
7
|
Luo Y, Ye J, Payne M, Hu D, Jiang J, Lan R. Genomic Epidemiology of Vibrio cholerae O139, Zhejiang Province, China, 1994-2018. Emerg Infect Dis 2022; 28:2253-2260. [PMID: 36285907 PMCID: PMC9622232 DOI: 10.3201/eid2811.212066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/30/2023] Open
Abstract
Cholera caused by Vibrio cholerae O139 was first reported in Bangladesh and India in 1992. To determine the genomic epidemiology and origins of O139 in China, we sequenced 104 O139 isolates collected from Zhejiang Province, China, during 1994-2018 and compared them with 57 O139 genomes from other countries in Asia. Most Zhejiang isolates fell into 3 clusters (C1-C3), which probably originated in India (C1) and Thailand (C2 and C3) during the early 1990s. Different clusters harbored different antimicrobial resistance genes and IncA/C plasmids. The integrative and conjugative elements carried by Zhejiang isolates were of a new type, differing from ICEVchInd4 and SXTMO10 by single-nucleotide polymorphisms and presence of genes. Quinolone resistance-conferring mutations S85L in parC and S83I in gyrA occurred in 71.2% of the Zhejiang isolates. The ctxB copy number differed among the 3 clusters. Our findings provided new insights for prevention and control of O139 cholera .
Collapse
Affiliation(s)
- Yun Luo
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China (Y. Luo, J. Ye, J. Jiang)
- University of New South Wales, Sydney, New South Wales, Australia (Y. Luo, M. Payne, D. Hu, R. Lan)
- Key Laboratory of Vaccine, Hangzhou (J. Jiang)
| | - Julian Ye
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China (Y. Luo, J. Ye, J. Jiang)
- University of New South Wales, Sydney, New South Wales, Australia (Y. Luo, M. Payne, D. Hu, R. Lan)
- Key Laboratory of Vaccine, Hangzhou (J. Jiang)
| | - Michael Payne
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China (Y. Luo, J. Ye, J. Jiang)
- University of New South Wales, Sydney, New South Wales, Australia (Y. Luo, M. Payne, D. Hu, R. Lan)
- Key Laboratory of Vaccine, Hangzhou (J. Jiang)
| | - Dalong Hu
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China (Y. Luo, J. Ye, J. Jiang)
- University of New South Wales, Sydney, New South Wales, Australia (Y. Luo, M. Payne, D. Hu, R. Lan)
- Key Laboratory of Vaccine, Hangzhou (J. Jiang)
| | | | | |
Collapse
|
8
|
Zhang X, Payne M, Kaur S, Lan R. Improved Genomic Identification, Clustering, and Serotyping of Shiga Toxin-Producing Escherichia coli Using Cluster/Serotype-Specific Gene Markers. Front Cell Infect Microbiol 2022; 11:772574. [PMID: 35083165 PMCID: PMC8785982 DOI: 10.3389/fcimb.2021.772574] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 12/03/2021] [Indexed: 11/16/2022] Open
Abstract
Shiga toxin-producing Escherichia coli (STEC) have more than 470 serotypes. The well-known STEC O157:H7 serotype is a leading cause of STEC infections in humans. However, the incidence of non-O157:H7 STEC serotypes associated with foodborne outbreaks and human infections has increased in recent years. Current detection and serotyping assays are focusing on O157 and top six (“Big six”) non-O157 STEC serogroups. In this study, we performed phylogenetic analysis of nearly 41,000 publicly available STEC genomes representing 460 different STEC serotypes and identified 19 major and 229 minor STEC clusters. STEC cluster-specific gene markers were then identified through comparative genomic analysis. We further identified serotype-specific gene markers for the top 10 most frequent non-O157:H7 STEC serotypes. The cluster or serotype specific gene markers had 99.54% accuracy and more than 97.25% specificity when tested using 38,534 STEC and 14,216 non-STEC E. coli genomes, respectively. In addition, we developed a freely available in silico serotyping pipeline named STECFinder that combined these robust gene markers with established E. coli serotype specific O and H antigen genes and stx genes for accurate identification, cluster determination and serotyping of STEC. STECFinder can assign 99.85% and 99.83% of 38,534 STEC isolates to STEC clusters using assembled genomes and Illumina reads respectively and can simultaneously predict stx subtypes and STEC serotypes. Using shotgun metagenomic sequencing reads of STEC spiked food samples from a published study, we demonstrated that STECFinder can detect the spiked STEC serotypes, accurately. The cluster/serotype-specific gene markers could also be adapted for culture independent typing, facilitating rapid STEC typing. STECFinder is available as an installable package (https://github.com/LanLab/STECFinder) and will be useful for in silico STEC cluster identification and serotyping using genome data.
Collapse
Affiliation(s)
- Xiaomei Zhang
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Michael Payne
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Sandeep Kaur
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|
9
|
Hu D, Fuller NR, Caterson ID, Holmes AJ, Reeves PR. Single-gene long-read sequencing illuminates Escherichia coli strain dynamics in the human intestinal microbiome. Cell Rep 2022; 38:110239. [PMID: 35021078 DOI: 10.1016/j.celrep.2021.110239] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 10/17/2021] [Accepted: 12/17/2021] [Indexed: 02/01/2023] Open
Abstract
Gut microbiome is of major interest due to its close relationship to health and disease. Bacteria usually vary in gene content, leading to functional variations within species, so resolution higher than species-level methods is needed for ecological and clinical relevance. We design a protocol to identify strains in selected species with high discrimination and in high numbers by amplicon sequencing of the flagellin gene. We apply the protocol to fecal samples from a human diet trial, targeting Escherichia coli. Across the 119 samples from 16 individuals, there are 1,532 amplicon sequence variants (ASVs), but only 32 ASVs are dominant in one or more fecal samples, despite frequent dominant strain turnover. Major strains in an intestine are found to be commonly accompanied by a large number of satellite cells, and many are identified as potential extraintestinal pathogens. The protocol could be used to track epidemics or investigate the intra- or inter-host diversity of pathogens.
Collapse
Affiliation(s)
- Dalong Hu
- School of Life and Environmental Sciences, Charles Perkins Centre, University of Sydney, Camperdown, NSW 2006, Australia
| | - Nicholas R Fuller
- The Boden Institute, Charles Perkins Centre, University of Sydney, Camperdown, NSW 2006, Australia; Metabolism and Obesity Services, Royal Prince Alfred Hospital, Camperdown, NSW 2050, Australia
| | - Ian D Caterson
- School of Life and Environmental Sciences, Charles Perkins Centre, University of Sydney, Camperdown, NSW 2006, Australia; The Boden Institute, Charles Perkins Centre, University of Sydney, Camperdown, NSW 2006, Australia; Metabolism and Obesity Services, Royal Prince Alfred Hospital, Camperdown, NSW 2050, Australia
| | - Andrew J Holmes
- School of Life and Environmental Sciences, Charles Perkins Centre, University of Sydney, Camperdown, NSW 2006, Australia
| | - Peter R Reeves
- School of Life and Environmental Sciences, Charles Perkins Centre, University of Sydney, Camperdown, NSW 2006, Australia.
| |
Collapse
|
10
|
Zhang X, Payne M, Nguyen T, Kaur S, Lan R. Cluster-specific gene markers enhance Shigella and enteroinvasive Escherichia coli in silico serotyping. Microb Genom 2021; 7. [PMID: 34889728 PMCID: PMC8767346 DOI: 10.1099/mgen.0.000704] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Shigella and enteroinvasive Escherichia coli (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. Differentiation of Shigella from EIEC is important for clinical diagnostic and epidemiological investigations. However, phylogenetically, Shigella and EIEC strains are composed of multiple clusters and are different forms of E. coli, making it difficult to find genetic markers to discriminate between Shigella and EIEC. In this study, we identified 10 Shigella clusters, seven EIEC clusters and 53 sporadic types of EIEC by examining over 17000 publicly available Shigella and EIEC genomes. We compared Shigella and EIEC accessory genomes to identify cluster-specific gene markers for the 17 clusters and 53 sporadic types. The cluster-specific gene markers showed 99.64% accuracy and more than 97.02% specificity. In addition, we developed a freely available in silico serotyping pipeline named Shigella EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established Shigella and EIEC serotype-specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired-end Illumina sequencing reads or assembled genomes and almost perfectly differentiated Shigella from EIEC with 99.70 and 99.74% cluster assignment accuracy for the assembled genomes and read mapping respectively. ShigEiFinder was able to serotype over 59 Shigella serotypes and 22 EIEC serotypes and provided a high specificity of 99.40% for assembled genomes and 99.38% for read mapping for serotyping. The cluster-specific gene markers and our new serotyping tool, ShigEiFinder (installable package: https://github.com/LanLab/ShigEiFinder, online tool: https://mgtdb.unsw.edu.au/ShigEiFinder/), will be useful for epidemiological and diagnostic investigations.
Collapse
Affiliation(s)
- Xiaomei Zhang
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Michael Payne
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Thanh Nguyen
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Sandeep Kaur
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
11
|
Luo L, Wang H, Payne MJ, Liang C, Bai L, Zheng H, Zhang Z, Zhang L, Zhang X, Yan G, Zou N, Chen X, Wan Z, Xiong Y, Lan R, Li Q. Comparative genomics of Chinese and international isolates of Escherichia albertii: population structure and evolution of virulence and antimicrobial resistance. Microb Genom 2021; 7. [PMID: 34882085 PMCID: PMC8767325 DOI: 10.1099/mgen.0.000710] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Escherichia albertii is a recently recognized species in the genus Escherichia that causes diarrhoea. The population structure, genetic diversity and genomic features have not been fully examined. Here, 169 E. albertii isolates from different sources and regions in China were sequenced and combined with 312 publicly available genomes (from additional 14 countries) for genomic analyses. The E. albertii population was divided into two clades and eight lineages, with lineage 3 (L3), L5 and L8 more common in China. Clinical isolates were observed in all clades/lineages. Virulence genes were found to be distributed differently among lineages: subtypes of the intimin encoding gene eae and the cytolethal distending toxin gene cdtB were lineage associated, and the second type three secretion system (ETT2) island was truncated in L3 and L6. Seven new eae subtypes and one new cdtB subtype (cdtB-VI) were identified. Alarmingly, 85.9 % of the Chinese E. albertii isolates were predicted to be multidrug-resistant (MDR) with 35.9 % harbouring genes capable of conferring resistance to 10 to 14 different drug classes. The majority of the MDR isolates were of poultry source from China and belonged to four sequence types (STs) [ST4638, ST4479, ST4633 and ST4488]. Thirty-four plasmids with some carrying MDR and virulence genes, and 130 prophages were identified from 17 complete E. albertii genomes. The 130 intact prophages were clustered into five groups, with group five prophages harbouring more virulence genes. We further identified three E. albertii specific genes as markers for the identification of this species. Our findings provided fundamental insights into the population structure, virulence variation and drug resistance of E. albertii.
Collapse
Affiliation(s)
- Lijuan Luo
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Hong Wang
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Michael J Payne
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Chelsea Liang
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Li Bai
- Division I of Risk Assessment, National Health Commission Key Laboratory of Food Safety Risk Assessment, Food Safety Research Unit (2019RU014) of Chinese Academy of Medical Science, China National Center for Food Safety Risk Assessment, Beijing, PR China
| | - Han Zheng
- State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, PR China
| | - Zhengdong Zhang
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Ling Zhang
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Xiaomei Zhang
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Guodong Yan
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Nianli Zou
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Xi Chen
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Ziting Wan
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| | - Yanwen Xiong
- State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, PR China
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Qun Li
- Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
| |
Collapse
|
12
|
Luo L, Payne M, Kaur S, Hu D, Cheney L, Octavia S, Wang Q, Tanaka MM, Sintchenko V, Lan R. Elucidation of global and national genomic epidemiology of Salmonella enterica serovar Enteritidis through multilevel genome typing. Microb Genom 2021; 7. [PMID: 34292145 PMCID: PMC8477392 DOI: 10.1099/mgen.0.000605] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Salmonella enterica serovar Enteritidis is a major cause of foodborne Salmonella infections and outbreaks in humans. Effective surveillance and timely outbreak detection are essential for public health control. Multilevel genome typing (MGT) with multiple levels of resolution has been previously demonstrated as a promising tool for this purpose. In this study, we developed MGT with nine levels for S. Enteritidis and characterised the genomic epidemiology of S. Enteritidis in detail. We examined 26 670 publicly available S. Enteritidis genome sequences from isolates spanning 101 years from 86 countries to reveal their spatial and temporal distributions. Using the lower resolution MGT levels, globally prevalent and regionally restricted sequence types (STs) were identified; avian associated MGT4-STs were found that were common in human cases in the USA; temporal trends were observed in the UK with MGT5-STs from 2014 to 2018 revealing both long lived endemic STs and the rapid expansion of new STs. Using MGT3 to MGT6, we identified multidrug resistance (MDR) associated STs at various MGT levels, which improves precision of detection and global tracking of MDR clones. We also found that the majority of the global S. Enteritidis population fell within two predominant lineages, which had significantly different propensity of causing large scale outbreaks. An online open MGT database has been established for unified international surveillance of S. Enteritidis. We demonstrated that MGT provides a flexible and high-resolution genome typing tool for S. Enteritidis surveillance and outbreak detection.
Collapse
Affiliation(s)
- Lijuan Luo
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Michael Payne
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Sandeep Kaur
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Dalong Hu
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Liam Cheney
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Sophie Octavia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Qinning Wang
- Centre for Infectious Diseases and Microbiology-Public Health, Institute of Clinical Pathology and Medical Research - NSW Health Pathology, Westmead Hospital, New South Wales, Australia
| | - Mark M Tanaka
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Vitali Sintchenko
- Centre for Infectious Diseases and Microbiology-Public Health, Institute of Clinical Pathology and Medical Research - NSW Health Pathology, Westmead Hospital, New South Wales, Australia.,Marie Bashir Institute for Infectious Diseases and Biosecurity, Sydney Medical School, University of Sydney, New South Wales, Australia
| | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
13
|
Abstract
Bacterial and archaeal flagellins are remarkable in having a shared region with variation in housekeeping proteins and a region with extreme diversity, perhaps greater than for any other protein. Analysis of the 113,285 available full-gene sequences of flagellin genes from published bacterial and archaeal sequences revealed the nature and enormous extent of flagellin diversity. There were 35,898 unique amino acid sequences that were resolved into 187 clusters. Analysis of the Escherichia coli and Salmonella enterica flagellins revealed that the variation occurs at two levels. The first is the division of the variable regions into sequence forms that are so divergent that there is no meaningful alignment even within species, and these corresponded to the E. coli or S. enterica H-antigen groups. The second level is variation within these groups, which is extensive in both species. Shared sequence would allow PCR of the variable regions and thus strain-level analysis of microbiome DNA. Flagellin, the agent of prokaryotic flagellar motion, is very widely distributed and is the H antigen of serology. Flagellin molecules have a variable region that confers serotype specificity, encoded by the middle of the gene, and also conserved regions encoded by the two ends of the gene. We collected all available prokaryotic flagellin protein sequences and found the variable region diversity to be at two levels. In each species investigated, there are hypervariable region (HVR) forms without detectable homology in protein sequences between them. There is also considerable variation within HVR forms, indicating that some have been diverging for thousands of years and that interphylum horizontal gene transfers make a major contribution to the evolution of such atypical diversity. IMPORTANCE Bacterial and archaeal flagellins are remarkable in having a shared region with variation in housekeeping proteins and a region with extreme diversity, perhaps greater than for any other protein. Analysis of the 113,285 available full-gene sequences of flagellin genes from published bacterial and archaeal sequences revealed the nature and enormous extent of flagellin diversity. There were 35,898 unique amino acid sequences that were resolved into 187 clusters. Analysis of the Escherichia coli and Salmonella enterica flagellins revealed that the variation occurs at two levels. The first is the division of the variable regions into sequence forms that are so divergent that there is no meaningful alignment even within species, and these corresponded to the E. coli or S. enterica H-antigen groups. The second level is variation within these groups, which is extensive in both species. Shared sequence would allow PCR of the variable regions and thus strain-level analysis of microbiome DNA.
Collapse
|
14
|
Abstract
The 7th cholera pandemic began in 1961 in Sulawesi, Indonesia, and then spread around the world in at least three waves. However, the lack of genome sequences for Vibrio cholerae strains under long-term surveillance in East Asia, especially in China, has restricted our understanding of the dynamics of the intracountry and intercountry evolution and transmission of the 7th-pandemic clones. In this study, we obtained the genome sequences of 60 V. cholerae strains isolated in Shanghai, the largest port in the world and the largest city in China, from 1961 to 2011. Our whole-genome-based phylogeny of 7th-pandemic strains revealed that all but one fell into five "stages," most of which are single clades and share independent ancestors. Each stage dominated in succession for a period, with little overlap between them. In addition, two near-identical Shanghai strains belonging to a pre-7th-pandemic precursor and 4 nontoxigenic O1/O139 strains attributed to independent recombination events at the O-antigen loci were present. The major lineages of the 7th pandemic in Shanghai appeared to be closely related to V. cholerae strains isolated from South or Southeast Asia. Stage succession was consistently related to changes in society and human activity, implying that human-caused niche change may play a vital role in the cholera dynamics in Shanghai.IMPORTANCE V. cholerae is the causative agent of cholera, a life-threatening disease characterized by severe, watery diarrhea. The 7th pandemic started in Indonesia in 1961 and spread globally, currently infecting 1.3 million to 4 million people annually. Here, we applied whole-genome sequencing to analyze a long-term collection of V. cholerae clinical strains to reveal the phylogenetic background and evolutionary dynamics of the 7th pandemic in Shanghai, which had undergone breathtakingly rapid development in the last half-century. All but one of the Shanghai 7th-pandemic strains fell into five "stages" that were dominant in Shanghai and appeared to be closely related to 7th-pandemic strains of South or Southeast Asia. Our findings extended the understanding of the dynamics of the evolution and transmission of the 7th-pandemic clones in East Asia and the relationship between social changes and cholera epidemiology.
Collapse
|