Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hu D, Liu B, Wang L, Reeves PR. Living Trees: High-Quality Reproducible and Reusable Construction of Bacterial Phylogenetic Trees. Mol Biol Evol 2020;37:563-575. [PMID: 31633785 DOI: 10.1093/molbev/msz241] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

For:	Hu D, Liu B, Wang L, Reeves PR. Living Trees: High-Quality Reproducible and Reusable Construction of Bacterial Phylogenetic Trees. Mol Biol Evol 2020;37:563-575. [PMID: 31633785 DOI: 10.1093/molbev/msz241] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Number

Cited by Other Article(s)

Cheney L, Payne M, Kaur S, Lan R. SaLTy: a novel Staphylococcus aureus Lineage Typer. Microb Genom 2024;10:001250. [PMID: 38739116 PMCID: PMC11165655 DOI: 10.1099/mgen.0.001250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Accepted: 04/19/2024] [Indexed: 05/14/2024] Open

Abstract

Staphylococcus aureus asymptomatically colonises 30 % of humans but can also cause a range of diseases, which can be fatal. In 2017 S. aureus was associated with 20 000 deaths in the USA alone. Dividing S. aureus isolates into smaller sub-groups can reveal the emergence of distinct sub-populations with varying potential to cause infections. Despite multiple molecular typing methods categorising such sub-groups, they do not take full advantage of S. aureus genome sequences when describing the fundamental population structure of the species. In this study, we developed Staphylococcus aureus Lineage Typing (SaLTy), which rapidly divides the species into 61 phylogenetically congruent lineages. Alleles of three core genes were identified that uniquely define the 61 lineages and were used for SaLTy typing. SaLTy was validated on 5000 genomes and 99.12 % (4956/5000) of isolates were assigned the correct lineage. We compared SaLTy lineages to previously calculated clonal complexes (CCs) from BIGSdb (n=21 173). SALTy improves on CCs by grouping isolates congruently with phylogenetic structure. SaLTy lineages were further used to describe the carriage of Staphylococcal chromosomal cassette containing mecA (SCCmec) which is carried by methicillin-resistant S. aureus (MRSA). Most lineages had isolates lacking SCCmec and the four largest lineages varied in SCCmec over time. Classifying isolates into SaLTy lineages, which were further SCCmec typed, allowed SaLTy to describe high-level MRSA epidemiology. We provide SaLTy as a simple typing method that defines phylogenetic lineages (https://github.com/LanLab/SaLTy). SaLTy is highly accurate and can quickly analyse large amounts of S. aureus genome data. SaLTy will aid the characterisation of S. aureus populations and ongoing surveillance of sub-groups that threaten human health.

Collapse

Luo Y, Payne M, Kaur S, Octavia S, Jiang J, Lan R. Emergence and genomic insights of non-pandemic O1 Vibrio cholerae in Zhejiang, China. Microbiol Spectr 2023;11:e0261523. [PMID: 37819129 PMCID: PMC10871787 DOI: 10.1128/spectrum.02615-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Accepted: 09/06/2023] [Indexed: 10/13/2023] Open

Liu Y, Li M, Hong X, Li H, Huang R, Han S, Hou J, Pan C. Screening and identification of high yield tetramethylpyrazine strains in Nongxiangxing liquor Daqu and study on the mechanism of tetramethylpyrazine production. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE 2023;103:6849-6860. [PMID: 37293782 DOI: 10.1002/jsfa.12773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 05/30/2023] [Accepted: 06/09/2023] [Indexed: 06/10/2023]

Abstract

BACKGROUND

There are few reports on the breeding of high-yielding tetramethylpyrazine (TTMP) strains in strong-flavor Daqu. In addition, studies on the mechanism of TTMP production in strains are mostly based on common physiological and biochemical indicators, and there is no report on RNA level. Therefore, in this study, a strain with high production of TTMP was screened out from strong-flavor liquor, and transcriptome sequencing analysis was performed to analyze its key metabolic pathways and key genes, and to infer the mechanism of TTMP production in the strain.

RESULTS

In this study, a strain with a high yield of tetramethylpyrazine (TTMP) was screened out, and the yield was 29.83 μg mL-1 . The identified strain was Bacillus velezensis, which could increase the content of TTMP in liquor by about 88%. After transcriptome sequencing, a total of 1851 differentially expressed genes were screened, including 1055 up-regulated genes and 796 down-regulated genes. Three pathways related to the production of TTMP were identified by gene ontology (GO) annotation and COG annotation, including carbohydrate metabolism, cell movement and amino acid metabolism. The key genes of TTMP were analyzed, and the factors that might regulate the production of TTMP, such as the transfer of uracil phosphate ribose and glycosyltransferase, were obtained.

CONCLUSIONS

A strain of B. velezensis with high TTMP production was screened and identified in strong-flavor Daqu for the first time. The yield of TTMP was 29.83 μg mL-1 , which increased the TTMP content in liquor by 88%. The key metabolic pathways of TTMP production in the strain were obtained: carbohydrate metabolism, cell movement and amino acid metabolism, and the key regulatory genes of each pathway were found, which complemented the gap in gene level in the production regulation of the strain, and provided a theoretical basis for the subsequent study of TTMP in liquor. © 2023 Society of Chemical Industry.

Collapse

Affiliation(s)

Yanbo Liu College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
Mengke Li College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
Xinfeng Hong College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China
Haideng Li College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China College of Biological Engineering, Henan University of Technology, Zhengzhou, China
Runna Huang Henan Yangshao Distillery Co., Ltd., Mianchi, China
Suna Han Henan Yangshao Distillery Co., Ltd., Mianchi, China
Jianguang Hou Henan Yangshao Distillery Co., Ltd., Mianchi, China
Chunmei Pan College of Food and Biological Engineering (Liquor College), Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Liquor Style Engineering Technology Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Henan Province Brewing Special Grain Development and Application Engineering Research Center, Henan University of Animal Husbandry and Economy, Zhengzhou, China Zhengzhou Key Laboratory of Liquor Brewing Microbial Technology, Henan University of Animal Husbandry and Economy, Zhengzhou, China

Collapse

Genomic Epidemiology and Multilevel Genome Typing of Australian Salmonella enterica Serovar Enteritidis. Microbiol Spectr 2023;11:e0301422. [PMID: 36625638 PMCID: PMC9927265 DOI: 10.1128/spectrum.03014-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open

Abstract

Salmonella enterica serovar Enteritidis is one of the leading causes of salmonellosis in Australia. In this study, a total of 568 S. Enteritidis isolates from two Australian states across two consecutive years were analyzed and compared to international strains, using the S. Enteritidis multilevel genome typing (MGT) database, which contained 40,390 publicly available genomes from 99 countries. The Australian S. Enteritidis isolates were divided into three phylogenetic clades (A, B, and C). Clades A and C represented 16.4% and 3.5% of the total isolates, respectively, and were of local origin. Clade B accounted for 80.1% of the isolates which belonged to seven previously defined lineages but was dominated by the global epidemic lineage. At the MGT5 level, three out of five top sequence types (STs) in Australia were also top STs in Asia, suggesting that a fair proportion of Australian S. Enteritidis cases may be epidemiologically linked with Asian strains. In 2018, a large egg-associated local outbreak was caused by a recently defined clade B lineage prevalent in Europe and was closely related, but not directly linked, to three European isolates. Additionally, over half (54.8%) of predicted multidrug resistance (MDR) isolates belonged to 10 MDR-associated MGT-STs, which were also frequent in Asian S. Enteritidis . Overall, this study investigated the genomic epidemiology of S. Enteritidis in Australia, including the first large local outbreak, using MGT. The open MGT platform enables a standardized and sharable nomenclature that can be effectively applied to public health for unified surveillance of S. Enteritidis nationally and globally. IMPORTANCE Salmonella enterica serovar Enteritidis is a leading cause of foodborne infections. We previously developed a genomic typing database (MGTdb) for S. Enteritidis to facilitate global surveillance of this pathogen. In this study, we examined the genomic features of Australian S. Enteritidis using the MGTdb and found that Australian S. Enteritidis is mainly epidemiologically linked with Asian strains (especially strains carrying antimicrobial resistance genes), followed by European strains. The first large-scale egg-associated local outbreak in Australia was caused by a recently defined lineage prevalent in Europe, and three European isolates in the MGTdb were closely related but not directly linked to this outbreak. In summary, the S. Enteritidis MGTdb open platform is shown to be a potentially powerful tool for national and global public health surveillance of this pathogen.

Collapse

DeSantis TZ, Cardona C, Narayan NR, Viswanatham S, Ravichandar D, Wee B, Chow CE, Iwai S. StrainSelect: A novel microbiome reference database that disambiguates all bacterial strains, genome assemblies and extant cultures worldwide. Heliyon 2023;9:e13314. [PMID: 36814618 PMCID: PMC9939595 DOI: 10.1016/j.heliyon.2023.e13314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 12/01/2022] [Accepted: 01/26/2023] [Indexed: 02/05/2023] Open

Abstract

Motivation: Microbial metagenomic profiling software and databases are advancing rapidly for development of novel disease biomarkers and therapeutics yet three problems impede analyses: 1) the conflation of "genome assembly" and "strain" in reference databases; 2) difficulty connecting DNA biomarkers to a procurable strain for laboratory experimentation; and 3) absence of a comprehensive and unified strain-resolved reference database for integrating both shotgun metagenomics and 16S rRNA gene data. Results: We demarcated 681,087 strains, the largest collection of its kind, by filtering public data into a knowledge graph of vertices representing contiguous DNA sequences, genome assemblies, strain monikers and bio-resource center (BRC) catalog numbers then adding inter-vertex edges only for synonyms or direct derivatives. Surprisingly, for 10,043 important strains, we found replicate RefSeq genome assemblies obstructing interpretation of database searches. We organized each strain into eight taxonomic ranks with bootstrap confidence inversely correlated with genome assembly contamination. The StrainSelect database is suited for applications where a taxonomic, functional or procurement reference is needed for shotgun or amplicon metagenomics since 636,568 strains have at least one 16S rRNA gene, 245,005 have at least one annotated genome assembly, and 36,671 are procurable from at least one BRC. The database overcomes all three aforementioned problems since it disambiguates strains from assemblies, locates strains at BRCs, and unifies a taxonomic reference for both 16S rRNA and shotgun metagenomics. Availability: The StrainSelect database is available in igraph and tabular vertex-edge formats compatible with Neo4J. Dereplicated MinHash and fasta databases are distributed for sourmash and usearch pipelines at http://strainselect.secondgenome.com. Contact:todd.desantis@gmail.com. Supplementary information: Supplementary data are available online.

Collapse

Xu Z, Hu D, Luu LDW, Octavia S, Keil AD, Sintchenko V, Tanaka MM, Mooi FR, Robson J, Lan R. Genomic dissection of the microevolution of Australian epidemic Bordetella pertussis. Emerg Microbes Infect 2022;11:1460-1473. [PMID: 35543519 PMCID: PMC9176669 DOI: 10.1080/22221751.2022.2077129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Luo Y, Ye J, Payne M, Hu D, Jiang J, Lan R. Genomic Epidemiology of Vibrio cholerae O139, Zhejiang Province, China, 1994-2018. Emerg Infect Dis 2022;28:2253-2260. [PMID: 36285907 PMCID: PMC9622232 DOI: 10.3201/eid2811.212066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/30/2023] Open

Zhang X, Payne M, Kaur S, Lan R. Improved Genomic Identification, Clustering, and Serotyping of Shiga Toxin-Producing Escherichia coli Using Cluster/Serotype-Specific Gene Markers. Front Cell Infect Microbiol 2022;11:772574. [PMID: 35083165 PMCID: PMC8785982 DOI: 10.3389/fcimb.2021.772574] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 12/03/2021] [Indexed: 11/16/2022] Open

Abstract

Shiga toxin-producing Escherichia coli (STEC) have more than 470 serotypes. The well-known STEC O157:H7 serotype is a leading cause of STEC infections in humans. However, the incidence of non-O157:H7 STEC serotypes associated with foodborne outbreaks and human infections has increased in recent years. Current detection and serotyping assays are focusing on O157 and top six (“Big six”) non-O157 STEC serogroups. In this study, we performed phylogenetic analysis of nearly 41,000 publicly available STEC genomes representing 460 different STEC serotypes and identified 19 major and 229 minor STEC clusters. STEC cluster-specific gene markers were then identified through comparative genomic analysis. We further identified serotype-specific gene markers for the top 10 most frequent non-O157:H7 STEC serotypes. The cluster or serotype specific gene markers had 99.54% accuracy and more than 97.25% specificity when tested using 38,534 STEC and 14,216 non-STEC E. coli genomes, respectively. In addition, we developed a freely available in silico serotyping pipeline named STECFinder that combined these robust gene markers with established E. coli serotype specific O and H antigen genes and stx genes for accurate identification, cluster determination and serotyping of STEC. STECFinder can assign 99.85% and 99.83% of 38,534 STEC isolates to STEC clusters using assembled genomes and Illumina reads respectively and can simultaneously predict stx subtypes and STEC serotypes. Using shotgun metagenomic sequencing reads of STEC spiked food samples from a published study, we demonstrated that STECFinder can detect the spiked STEC serotypes, accurately. The cluster/serotype-specific gene markers could also be adapted for culture independent typing, facilitating rapid STEC typing. STECFinder is available as an installable package (https://github.com/LanLab/STECFinder) and will be useful for in silico STEC cluster identification and serotyping using genome data.

Collapse

Hu D, Fuller NR, Caterson ID, Holmes AJ, Reeves PR. Single-gene long-read sequencing illuminates Escherichia coli strain dynamics in the human intestinal microbiome. Cell Rep 2022;38:110239. [PMID: 35021078 DOI: 10.1016/j.celrep.2021.110239] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 10/17/2021] [Accepted: 12/17/2021] [Indexed: 02/01/2023] Open

Zhang X, Payne M, Nguyen T, Kaur S, Lan R. Cluster-specific gene markers enhance Shigella and enteroinvasive Escherichia coli in silico serotyping. Microb Genom 2021;7. [PMID: 34889728 PMCID: PMC8767346 DOI: 10.1099/mgen.0.000704] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Abstract

Shigella and enteroinvasive Escherichia coli (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. Differentiation of Shigella from EIEC is important for clinical diagnostic and epidemiological investigations. However, phylogenetically, Shigella and EIEC strains are composed of multiple clusters and are different forms of E. coli, making it difficult to find genetic markers to discriminate between Shigella and EIEC. In this study, we identified 10 Shigella clusters, seven EIEC clusters and 53 sporadic types of EIEC by examining over 17000 publicly available Shigella and EIEC genomes. We compared Shigella and EIEC accessory genomes to identify cluster-specific gene markers for the 17 clusters and 53 sporadic types. The cluster-specific gene markers showed 99.64% accuracy and more than 97.02% specificity. In addition, we developed a freely available in silico serotyping pipeline named Shigella EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established Shigella and EIEC serotype-specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired-end Illumina sequencing reads or assembled genomes and almost perfectly differentiated Shigella from EIEC with 99.70 and 99.74% cluster assignment accuracy for the assembled genomes and read mapping respectively. ShigEiFinder was able to serotype over 59 Shigella serotypes and 22 EIEC serotypes and provided a high specificity of 99.40% for assembled genomes and 99.38% for read mapping for serotyping. The cluster-specific gene markers and our new serotyping tool, ShigEiFinder (installable package: https://github.com/LanLab/ShigEiFinder, online tool: https://mgtdb.unsw.edu.au/ShigEiFinder/), will be useful for epidemiological and diagnostic investigations.

Collapse

Luo L, Wang H, Payne MJ, Liang C, Bai L, Zheng H, Zhang Z, Zhang L, Zhang X, Yan G, Zou N, Chen X, Wan Z, Xiong Y, Lan R, Li Q. Comparative genomics of Chinese and international isolates of Escherichia albertii: population structure and evolution of virulence and antimicrobial resistance. Microb Genom 2021;7. [PMID: 34882085 PMCID: PMC8767325 DOI: 10.1099/mgen.0.000710] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Affiliation(s)

Lijuan Luo School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Hong Wang Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Michael J Payne School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Chelsea Liang School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Li Bai Division I of Risk Assessment, National Health Commission Key Laboratory of Food Safety Risk Assessment, Food Safety Research Unit (2019RU014) of Chinese Academy of Medical Science, China National Center for Food Safety Risk Assessment, Beijing, PR China
Han Zheng State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, PR China
Zhengdong Zhang Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Ling Zhang Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Xiaomei Zhang School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Guodong Yan Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Nianli Zou Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Xi Chen Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Ziting Wan Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China
Yanwen Xiong State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, PR China
Ruiting Lan School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Qun Li Zigong Center for Disease Control and Prevention, Zigong, Sichuan, PR China

Collapse

Luo L, Payne M, Kaur S, Hu D, Cheney L, Octavia S, Wang Q, Tanaka MM, Sintchenko V, Lan R. Elucidation of global and national genomic epidemiology of Salmonella enterica serovar Enteritidis through multilevel genome typing. Microb Genom 2021;7. [PMID: 34292145 PMCID: PMC8477392 DOI: 10.1099/mgen.0.000605] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

The Remarkable Dual-Level Diversity of Prokaryotic Flagellins. mSystems 2020;5:5/1/e00705-19. [PMID: 32047063 PMCID: PMC7018530 DOI: 10.1128/msystems.00705-19] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Abstract

Bacterial and archaeal flagellins are remarkable in having a shared region with variation in housekeeping proteins and a region with extreme diversity, perhaps greater than for any other protein. Analysis of the 113,285 available full-gene sequences of flagellin genes from published bacterial and archaeal sequences revealed the nature and enormous extent of flagellin diversity. There were 35,898 unique amino acid sequences that were resolved into 187 clusters. Analysis of the Escherichia coli and Salmonella enterica flagellins revealed that the variation occurs at two levels. The first is the division of the variable regions into sequence forms that are so divergent that there is no meaningful alignment even within species, and these corresponded to the E. coli or S. enterica H-antigen groups. The second level is variation within these groups, which is extensive in both species. Shared sequence would allow PCR of the variable regions and thus strain-level analysis of microbiome DNA.

Flagellin, the agent of prokaryotic flagellar motion, is very widely distributed and is the H antigen of serology. Flagellin molecules have a variable region that confers serotype specificity, encoded by the middle of the gene, and also conserved regions encoded by the two ends of the gene. We collected all available prokaryotic flagellin protein sequences and found the variable region diversity to be at two levels. In each species investigated, there are hypervariable region (HVR) forms without detectable homology in protein sequences between them. There is also considerable variation within HVR forms, indicating that some have been diverging for thousands of years and that interphylum horizontal gene transfers make a major contribution to the evolution of such atypical diversity.

IMPORTANCE Bacterial and archaeal flagellins are remarkable in having a shared region with variation in housekeeping proteins and a region with extreme diversity, perhaps greater than for any other protein. Analysis of the 113,285 available full-gene sequences of flagellin genes from published bacterial and archaeal sequences revealed the nature and enormous extent of flagellin diversity. There were 35,898 unique amino acid sequences that were resolved into 187 clusters. Analysis of the Escherichia coli and Salmonella enterica flagellins revealed that the variation occurs at two levels. The first is the division of the variable regions into sequence forms that are so divergent that there is no meaningful alignment even within species, and these corresponded to the E. coli or S. enterica H-antigen groups. The second level is variation within these groups, which is extensive in both species. Shared sequence would allow PCR of the variable regions and thus strain-level analysis of microbiome DNA.

Collapse

Changing Molecular Epidemiology of Vibrio cholerae Outbreaks in Shanghai, China. mSystems 2019;4:4/6/e00561-19. [PMID: 31771974 PMCID: PMC6880041 DOI: 10.1128/msystems.00561-19] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

The 7th cholera pandemic began in 1961 in Sulawesi, Indonesia, and then spread around the world in at least three waves. However, the lack of genome sequences for Vibrio cholerae strains under long-term surveillance in East Asia, especially in China, has restricted our understanding of the dynamics of the intracountry and intercountry evolution and transmission of the 7th-pandemic clones. In this study, we obtained the genome sequences of 60 V. cholerae strains isolated in Shanghai, the largest port in the world and the largest city in China, from 1961 to 2011. Our whole-genome-based phylogeny of 7th-pandemic strains revealed that all but one fell into five "stages," most of which are single clades and share independent ancestors. Each stage dominated in succession for a period, with little overlap between them. In addition, two near-identical Shanghai strains belonging to a pre-7th-pandemic precursor and 4 nontoxigenic O1/O139 strains attributed to independent recombination events at the O-antigen loci were present. The major lineages of the 7th pandemic in Shanghai appeared to be closely related to V. cholerae strains isolated from South or Southeast Asia. Stage succession was consistently related to changes in society and human activity, implying that human-caused niche change may play a vital role in the cholera dynamics in Shanghai.IMPORTANCE V. cholerae is the causative agent of cholera, a life-threatening disease characterized by severe, watery diarrhea. The 7th pandemic started in Indonesia in 1961 and spread globally, currently infecting 1.3 million to 4 million people annually. Here, we applied whole-genome sequencing to analyze a long-term collection of V. cholerae clinical strains to reveal the phylogenetic background and evolutionary dynamics of the 7th pandemic in Shanghai, which had undergone breathtakingly rapid development in the last half-century. All but one of the Shanghai 7th-pandemic strains fell into five "stages" that were dominant in Shanghai and appeared to be closely related to 7th-pandemic strains of South or Southeast Asia. Our findings extended the understanding of the dynamics of the evolution and transmission of the 7th-pandemic clones in East Asia and the relationship between social changes and cholera epidemiology.

Collapse