1. Chaudhari NM, Pérez-Carrascal OM, Overholt WA, Totsche KU, Küsel K. Genome streamlining in Parcubacteria transitioning from soil to groundwater. Environmental Microbiome 2024; 19:41. [PMID: 38902796 PMCID: PMC11188291 DOI: 10.1186/s40793-024-00581-6] [Received: 11/28/2023] [Accepted: 06/03/2024] [Indexed: 06/22/2024]
Abstract
BACKGROUND: To better understand the influence of habitat on the genetic content of bacteria, with a focus on members of the Candidate Phyla Radiation (CPR), we used genome-resolved metagenomics to study how the transition from soil via seepage waters to groundwater affects the genomic composition of ultra-small Parcubacteria, the dominant CPR class in seepage waters.
RESULTS: Bacterial metagenome-assembled genomes (MAGs; 318 in total, 32 of Parcubacteria) were generated from seepage waters and compared directly to groundwater counterparts. The estimated average genome sizes of members of the major phyla Proteobacteria, Bacteroidota and Cand. Patescibacteria (CPR bacteria) were significantly higher in soil-seepage water than in their groundwater counterparts. Seepage water Parcubacteria (Paceibacteria) exhibited a 1.18-fold greater mean genome size and a 2-fold lower mean proportion of pseudogenes than those in groundwater. Bacteroidota and Proteobacteria showed a similar trend of reduced genomes in groundwater compared to seepage. While exploring gene loss and adaptive gains in closely related CPR lineages in groundwater, we identified a membrane protein and a lipoglycopeptide resistance gene unique to a seepage Parcubacterium genome. A nitrite reductase gene was also identified that was unique to the groundwater Parcubacteria genomes and was likely acquired from other planktonic microbes via horizontal gene transfer.
CONCLUSIONS: Overall, our data suggest that bacteria in seepage waters, including ultra-small Parcubacteria, have significantly larger genomes and higher metabolic enrichment than their groundwater counterparts, highlighting possible genome streamlining of the latter in response to habitat selection in an oligotrophic environment.
Affiliation(s)
- Narendrakumar M Chaudhari
  - Aquatic Geomicrobiology, Institute of Biodiversity, Friedrich Schiller University Jena, Jena, Germany
  - German Center for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Friedrich-Schiller-Universität, Leipzig, Germany
- Olga M Pérez-Carrascal
  - Aquatic Geomicrobiology, Institute of Biodiversity, Friedrich Schiller University Jena, Jena, Germany
  - Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, Jena, Germany
- Will A Overholt
  - Aquatic Geomicrobiology, Institute of Biodiversity, Friedrich Schiller University Jena, Jena, Germany
- Kai U Totsche
  - Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, Jena, Germany
  - Hydrogeology, Institute of Geowissenschaften, Friedrich-Schiller-Universität Jena, Burgweg 11, 07749, Jena, Germany
- Kirsten Küsel
  - Aquatic Geomicrobiology, Institute of Biodiversity, Friedrich Schiller University Jena, Jena, Germany
  - German Center for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Friedrich-Schiller-Universität, Leipzig, Germany
  - Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, Jena, Germany
2. Hanke DM, Wang Y, Dagan T. Pseudogenes in plasmid genomes reveal past transitions in plasmid mobility. Nucleic Acids Res 2024:gkae430. [PMID: 38808675 DOI: 10.1093/nar/gkae430] [Received: 11/13/2023] [Revised: 04/23/2024] [Accepted: 05/08/2024] [Indexed: 05/30/2024] Open
Abstract
Evidence for gene non-functionalization due to mutational processes is found in genomes in the form of pseudogenes. Pseudogenes are known to be rare in prokaryote chromosomes, with the exception of lineages that underwent an extreme genome reduction (e.g. obligatory symbionts). Much less is known about the frequency of pseudogenes in prokaryotic plasmids; those are genetic elements that can transfer between cells and may encode beneficial traits for their host. Non-functionalization of plasmid-encoded genes may alter the plasmid characteristics, e.g. mobility, or their effect on the host. Analyzing 10 832 prokaryotic genomes, we find that plasmid genomes are characterized by threefold-higher pseudogene density compared to chromosomes. The majority of plasmid pseudogenes correspond to deteriorated transposable elements. A detailed analysis of enterobacterial plasmids furthermore reveals frequent gene non-functionalization events associated with the loss of plasmid self-transmissibility. Reconstructing the evolution of closely related plasmids reveals that non-functionalization of the conjugation machinery led to the emergence of non-mobilizable plasmid types. Examples are virulence plasmids in Escherichia and Salmonella. Our study highlights non-functionalization of core plasmid mobility functions as one route for the evolution of domesticated plasmids. Pseudogenes in plasmids supply insights into past transitions in plasmid mobility that are akin to transitions in bacterial lifestyle.
Affiliation(s)
- Dustin M Hanke
  - Institute of General Microbiology, Kiel University, Kiel, Germany
- Yiqing Wang
  - Institute of General Microbiology, Kiel University, Kiel, Germany
- Tal Dagan
  - Institute of General Microbiology, Kiel University, Kiel, Germany
3. Yang Y, Wang P, Qaidi SE, Hardwidge PR, Huang J, Zhu G. Loss to gain: pseudogenes in microorganisms, focusing on eubacteria, and their biological significance. Appl Microbiol Biotechnol 2024; 108:328. [PMID: 38717672 PMCID: PMC11078800 DOI: 10.1007/s00253-023-12971-w] [Received: 09/26/2023] [Revised: 11/26/2023] [Accepted: 12/01/2023] [Indexed: 05/12/2024]
Abstract
Pseudogenes are defined as "non-functional" copies of corresponding parent genes. The understanding of pseudogenes continues to evolve as research findings accumulate. Previous studies have predominantly focused on mammals, and pseudogenes have received relatively little attention in the field of microbiology. Given the increasing recognition of the importance of pseudogenes, in this review we focus on several aspects of pseudogenes in microorganisms, including their classification and characteristics, their generation and fate, their identification, their abundance and distribution, their impact on virulence, their ability to recombine with functional genes, the extent to which some pseudogenes are transcribed and translated, and the relationship between pseudogenes and viruses. By summarizing and organizing the latest research progress, this review provides a comprehensive perspective on and improved understanding of pseudogenes in microorganisms.
KEY POINTS:
- Concept, classification and characteristics, identification and databases, content, and distribution of microbial pseudogenes are presented.
- How pseudogenization contributes to pathogen virulence is highlighted.
- Pseudogenes with potential functions in microorganisms are discussed.
Affiliation(s)
- Yi Yang
  - College of Veterinary Medicine, Yangzhou University, 12 East Wenhui Road, Yangzhou, 225009, Jiangsu, China
  - Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou, 225009, China
  - Joint Laboratory of International Cooperation On Prevention and Control Technology of Important Animal Diseases and Zoonoses of Jiangsu Higher Education Institutions, Yangzhou, 225009, China
- Pengzhi Wang
  - College of Veterinary Medicine, Yangzhou University, 12 East Wenhui Road, Yangzhou, 225009, Jiangsu, China
  - Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou, 225009, China
  - Joint Laboratory of International Cooperation On Prevention and Control Technology of Important Animal Diseases and Zoonoses of Jiangsu Higher Education Institutions, Yangzhou, 225009, China
- Samir El Qaidi
  - College of Veterinary Medicine, Kansas State University, Manhattan, KS, 66506, USA
- Philip R Hardwidge
  - College of Veterinary Medicine, Kansas State University, Manhattan, KS, 66506, USA
- Jinlin Huang
  - Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou, 225009, China
  - Jiangsu Key Lab of Zoonosis, Yangzhou University, Yangzhou, 225009, Jiangsu, China
  - College of Bioscience and Biotechnology, Yangzhou University, 12 East Wenhui Road Yangzhou, Jiangsu, 225009, China
- Guoqiang Zhu
  - College of Veterinary Medicine, Yangzhou University, 12 East Wenhui Road, Yangzhou, 225009, Jiangsu, China
  - Jiangsu Co-Innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou, 225009, China
  - Joint Laboratory of International Cooperation On Prevention and Control Technology of Important Animal Diseases and Zoonoses of Jiangsu Higher Education Institutions, Yangzhou, 225009, China
4. Ahator SD, Wenzl K, Hegstad K, Lentz CS, Johannessen M. Comprehensive virulence profiling and evolutionary analysis of specificity determinants in Staphylococcus aureus two-component systems. mSystems 2024; 9:e0013024. [PMID: 38470253 PMCID: PMC11019936 DOI: 10.1128/msystems.00130-24] [Received: 01/30/2024] [Accepted: 02/15/2024] [Indexed: 03/13/2024] Open
Abstract
In the Staphylococcus aureus genome, a set of highly conserved two-component systems (TCSs) composed of histidine kinases (HKs) and their cognate response regulators (RRs) sense and respond to environmental stimuli, which drive the adaptation of the bacteria. This study investigates the complex interplay between TCSs in S. aureus USA300, a predominant methicillin-resistant S. aureus strain, revealing shared and unique virulence regulatory pathways and genetic variations mediating signal specificity within TCSs. Using TCS-related mutants from the Nebraska Transposon Mutant Library, we analyzed the effects of inactivated TCS HKs and RRs on the production of various virulence factors, in vitro infection abilities, and adhesion assays. We found that the TCSs' influence on virulence determinants was not associated with their phylogenetic relationship, indicating divergent functional evolution. Using the co-crystallized structure of the DesK-DesR from Bacillus subtilis and the modeled structures of the four NarL TCSs in S. aureus, we identified interacting residues, revealing specificity determinants and conservation within the same TCS, even from different strain backgrounds. The interacting residues were highly conserved within strains but varied between species due to selection pressures and the coevolution of cognate pairs. This study unveils the complex interplay and divergent functional evolution of TCSs, highlighting their potential for future experimental exploration of phosphotransfer between cognate and non-cognate recombinant HKs and RRs.
IMPORTANCE: Given the widespread conservation of two-component systems (TCSs) in bacteria and their pivotal role in regulating metabolic and virulence pathways, they present a compelling target for anti-microbial agents, especially in the face of rising multi-drug-resistant infections. Harnessing TCSs therapeutically necessitates a profound understanding of their evolutionary trajectory in signal transduction, as this underlies their unique or shared virulence regulatory pathways. Such insights are critical for effectively targeting TCS components, ensuring an optimized impact on bacterial virulence, and mitigating the risk of resistance emergence via the evolution of alternative pathways. Our research offers an in-depth exploration of virulence determinants controlled by TCSs in S. aureus, shedding light on the evolving specificity determinants that orchestrate interactions between their cognate pairs.
Affiliation(s)
- Stephen Dela Ahator
  - Research Group for Host-Microbe Interactions, Centre for New Antibacterial Strategies (CANS), Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
- Karoline Wenzl
  - Research Group for Host-Microbe Interactions, Centre for New Antibacterial Strategies (CANS), Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
- Kristin Hegstad
  - Research Group for Host-Microbe Interactions, Centre for New Antibacterial Strategies (CANS), Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
  - Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
- Christian S. Lentz
  - Research Group for Host-Microbe Interactions, Centre for New Antibacterial Strategies (CANS), Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
- Mona Johannessen
  - Research Group for Host-Microbe Interactions, Centre for New Antibacterial Strategies (CANS), Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
5. Cooley NP, Wright ES. Many purported pseudogenes in bacterial genomes are bona fide genes. BMC Genomics 2024; 25:365. [PMID: 38622536 PMCID: PMC11017572 DOI: 10.1186/s12864-024-10137-0] [Received: 10/10/2023] [Accepted: 02/17/2024] [Indexed: 04/17/2024] Open
Abstract
BACKGROUND: Microbial genomes are largely comprised of protein coding sequences, yet some genomes contain many pseudogenes caused by frameshifts or internal stop codons. These pseudogenes are believed to result from gene degradation during evolution but could also be technical artifacts of genome sequencing or assembly.
RESULTS: Using a combination of observational and experimental data, we show that many putative pseudogenes are attributable to errors that are incorporated into genomes during assembly. Within 126,564 publicly available genomes, we observed that nearly identical genomes often substantially differed in pseudogene counts. Causal inference implicated assembler, sequencing platform, and coverage as likely causative factors. Reassembly of genomes from raw reads confirmed that each variable affects the number of putative pseudogenes in an assembly. Furthermore, simulated sequencing reads corroborated our observations that the quality and quantity of raw data can significantly impact the number of pseudogenes in an assembler-dependent fashion. The number of unexpected pseudogenes due to internal stops was highly correlated (R² = 0.96) with average nucleotide identity to the ground truth genome, implying that relative pseudogene counts can be used as a proxy for overall assembly correctness. Applying our method to assemblies in RefSeq resulted in rejection of 3.6% of assemblies due to significantly elevated pseudogene counts. Reassembly from real reads obtained from high coverage genomes showed considerable variability in spurious pseudogenes beyond that observed with simulated reads, reinforcing the finding that high coverage is necessary to mitigate assembly errors.
CONCLUSIONS: Collectively, these results demonstrate that many pseudogenes in microbial genome assemblies are actually genes. Our results suggest that high read coverage is required for correct assembly and indicate that an inflated number of pseudogenes due to internal stops is indicative of poor overall assembly quality.
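As a rough illustration of what "a pseudogene due to internal stops" means in the abstract above, the following sketch flags in-frame stop codons that occur before the last codon of a coding sequence. It is not the method used in the cited study; the function name `internal_stop_positions` and the example sequences are hypothetical, and the only biological assumption is the usual stop codon set (TAA, TAG, TGA).

```python
# Illustrative sketch only; not the pipeline used in the cited study.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codon set


def internal_stop_positions(cds):
    """Return 0-based nucleotide offsets of in-frame stop codons that
    occur before the final codon of a coding sequence."""
    seq = cds.upper()
    n_codons = len(seq) // 3  # any trailing partial codon is ignored
    positions = []
    for i in range(n_codons - 1):  # the terminal codon is excluded
        codon = seq[3 * i : 3 * i + 3]
        if codon in STOP_CODONS:
            positions.append(3 * i)
    return positions


# Hypothetical toy sequences: the second carries a premature TAG codon.
intact = "ATGAAACCCGGGTAA"
broken = "ATGAAATAGGGGTAA"
print(internal_stop_positions(intact))  # []
print(internal_stop_positions(broken))  # [6]
```

A single miscalled base in an assembly can create such a codon, which is one way an assembly error could look like gene degradation in an annotation.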
Affiliation(s)
- Nicholas P Cooley
  - Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
- Erik S Wright
  - Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
  - Center for Evolutionary Biology and Medicine, Pittsburgh, PA, USA
6. Tavis S, Hettich RL. Multi-Omics integration can be used to rescue metabolic information for some of the dark region of the Pseudomonas putida proteome. BMC Genomics 2024; 25:267. [PMID: 38468234 PMCID: PMC10926591 DOI: 10.1186/s12864-024-10082-y] [Received: 11/13/2023] [Accepted: 02/02/2024] [Indexed: 03/13/2024] Open
Abstract
In every omics experiment, genes or their products are identified for which even state-of-the-art tools are unable to assign a function. In the biotechnology chassis organism Pseudomonas putida, these proteins of unknown function make up 14% of the proteome. This missing information can bias analyses since these proteins can carry out functions which impact the engineering of organisms. As a consequence of predicting protein function across all organisms, function prediction tools generally fail to use all of the types of data available for any specific organism, including protein and transcript expression information. Additionally, the release of AlphaFold predictions for all UniProt proteins provides a novel opportunity for leveraging structural information. We constructed a bespoke machine learning model to predict the function of recalcitrant proteins of unknown function in Pseudomonas putida based on these sources of data, which annotated 1079 terms to 213 proteins. Among the predicted functions supplied by the model, we found evidence for a significant overrepresentation of nitrogen metabolism and macromolecule processing proteins. These findings were corroborated by manual analyses of selected proteins which identified, among others, a functionally unannotated operon that likely encodes a branch of the shikimate pathway.
Affiliation(s)
- Steven Tavis
  - Genome Science and Technology Graduate Program, University of Tennessee Knoxville, Knoxville, USA
  - Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
- Robert L Hettich
  - Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
7. Douglas GM, Shapiro BJ. Pseudogenes act as a neutral reference for detecting selection in prokaryotic pangenomes. Nat Ecol Evol 2024; 8:304-314. [PMID: 38177690 DOI: 10.1038/s41559-023-02268-6] [Received: 05/29/2023] [Accepted: 11/10/2023] [Indexed: 01/06/2024]
Abstract
A long-standing question is to what degree genetic drift and selection drive the divergence in rare accessory gene content between closely related bacteria. Rare genes, including singletons, make up a large proportion of pangenomes (all genes in a set of genomes), but it remains unclear how many such genes are adaptive, deleterious or neutral to their host genome. Estimates of species' effective population sizes (Ne) are positively associated with pangenome size and fluidity, which has independently been interpreted as evidence for both neutral and adaptive pangenome models. We hypothesized that pseudogenes, used as a neutral reference, could be used to distinguish these models. We find that most functional categories are depleted for rare pseudogenes when a genome encodes only a single intact copy of a gene family. In contrast, transposons are enriched in pseudogenes, suggesting they are mostly neutral or deleterious to the host genome. Thus, even if individual rare accessory genes vary in their effects on host fitness, we can confidently reject a model of entirely neutral or deleterious rare genes. We also define the ratio of singleton intact genes to singleton pseudogenes (si/sp) within a pangenome, compare this measure across 668 prokaryotic species and detect a signal consistent with the adaptive value of many rare accessory genes. Taken together, our work demonstrates that comparing with pseudogenes can improve inferences of the evolutionary forces driving pangenome variation.
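The si/sp measure described above can be sketched as a simple count ratio. This is a minimal illustration under stated assumptions: the abstract only says that si/sp is the ratio of singleton intact genes to singleton pseudogenes within a pangenome, so any normalization used in the paper itself is not reproduced here, and the names `count_singletons` and `si_sp_ratio` and the toy data are hypothetical.

```python
# Illustrative sketch only; the cited paper may normalize si/sp differently.

def count_singletons(presence):
    """Count gene families present in exactly one genome.

    `presence` maps a family name to the set of genome IDs that carry it.
    """
    return sum(1 for genomes in presence.values() if len(genomes) == 1)


def si_sp_ratio(singleton_intact, singleton_pseudo):
    """Plain ratio of singleton intact genes to singleton pseudogenes."""
    if singleton_intact < 0 or singleton_pseudo < 0:
        raise ValueError("counts must be non-negative")
    if singleton_pseudo == 0:
        raise ValueError("si/sp is undefined without singleton pseudogenes")
    return singleton_intact / singleton_pseudo


# Hypothetical toy pangenome of three genomes (g1, g2, g3).
intact = {"famA": {"g1"}, "famB": {"g1", "g2"}, "famC": {"g3"}, "famD": {"g2"}}
pseudo = {"famX": {"g1"}, "famY": {"g1", "g2", "g3"}}

si = count_singletons(intact)  # famA, famC, famD
sp = count_singletons(pseudo)  # famX
print(si, sp, si_sp_ratio(si, sp))  # 3 1 3.0
```

Treating pseudogenes as the neutral reference, a ratio above what neutrality would predict is the kind of signal the abstract interprets as consistent with adaptive rare genes.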
Affiliation(s)
- Gavin M Douglas
  - Department of Microbiology and Immunology, McGill University, Montréal, Québec, Canada
  - McGill Genome Centre, McGill University, Montréal, Québec, Canada
- B Jesse Shapiro
  - Department of Microbiology and Immunology, McGill University, Montréal, Québec, Canada
  - McGill Genome Centre, McGill University, Montréal, Québec, Canada
  - McGill Centre for Microbiome Research, McGill University, Montréal, Québec, Canada
8. Takeuchi N, Fullmer MS, Maddock DJ, Poole AM. The Constructive Black Queen hypothesis: new functions can evolve under conditions favouring gene loss. The ISME Journal 2024; 18:wrae011. [PMID: 38366199 PMCID: PMC10942775 DOI: 10.1093/ismejo/wrae011] [Received: 01/05/2024] [Revised: 01/17/2024] [Accepted: 01/19/2024] [Indexed: 02/18/2024]
Abstract
Duplication is a major route for the emergence of new gene functions. However, the emergence of new gene functions via this route may be reduced in prokaryotes, as redundant genes are often rapidly purged. In lineages with compact, streamlined genomes, it thus appears challenging for novel function to emerge via duplication and divergence. A further pressure contributing to gene loss occurs under Black Queen dynamics, as cheaters that lose the capacity to produce a public good can instead acquire it from neighbouring producers. We propose that Black Queen dynamics can favour the emergence of new function because, under an emerging Black Queen dynamic, there is high gene redundancy spread across a community of interacting cells. Using computational modelling, we demonstrate that new gene functions can emerge under Black Queen dynamics. This result holds even if there is deletion bias due to low duplication rates and selection against redundant gene copies resulting from the high cost associated with carrying a locus. However, when the public good production costs are high, Black Queen dynamics impede the fixation of new functions. Our results expand the mechanisms by which new gene functions can emerge in prokaryotic systems.
Affiliation(s)
- Nobuto Takeuchi
  - School of Biological Sciences, University of Auckland, Auckland 1010, New Zealand
  - Universal Biology Institute, University of Tokyo, Tokyo 113-0033, Japan
  - Department of Biology, Faculty of Sciences, Kyushu University, Fukuoka 819-0395, Japan
- Matthew S Fullmer
  - School of Biological Sciences, University of Auckland, Auckland 1010, New Zealand
- Danielle J Maddock
  - School of Biological Sciences, University of Auckland, Auckland 1010, New Zealand
- Anthony M Poole
  - School of Biological Sciences, University of Auckland, Auckland 1010, New Zealand
9. Fenske GJ, Pouzou JG, Pouillot R, Taylor DD, Costard S, Zagmutt FJ. The genomic and epidemiological virulence patterns of Salmonella enterica serovars in the United States. PLoS One 2023; 18:e0294624. [PMID: 38051743 DOI: 10.1371/journal.pone.0294624] [Received: 07/21/2022] [Accepted: 11/06/2023] [Indexed: 12/07/2023] Open
Abstract
The serovars of Salmonella enterica display dramatic differences in pathogenesis and host preferences. We developed a process (patent pending) for grouping Salmonella isolates and serovars by their public health risk. We collated a curated set of 12,337 S. enterica isolate genomes from human, beef, and bovine sources in the US. After annotating a virulence gene catalog for each isolate, we used unsupervised random forest methods to estimate the proximity (similarity) between isolates based upon the genomic presentation of putative virulence traits. We then grouped isolates (virulence clusters) using hierarchical clustering (Ward's method), used non-parametric bootstrapping to assess cluster stability, and externally validated the clusters against epidemiological virulence measures from FoodNet, the National Outbreak Reporting System (NORS), and US federal sampling of beef products. We identified five stable virulence clusters of S. enterica serovars. Cluster 1 (higher virulence) serovars yielded an annual incidence rate of domestically acquired sporadic cases roughly one and a half times higher than the other four clusters combined (Clusters 2-5, lower virulence). Compared to other clusters, cluster 1 also had a higher proportion of infections leading to hospitalization and was implicated in more foodborne and beef-associated outbreaks, despite being isolated at a similar frequency from beef products as other clusters. We also identified subpopulations within 11 serovars. Remarkably, we found S. Infantis and S. Typhimurium subpopulations that significantly differed in genome length and clinical case presentation. Further, we found that the presence of the pESI plasmid accounted for the genome length differences between the S. Infantis subpopulations. Our results show that S. enterica strains associated with the highest incidence of human infections share a common virulence repertoire. This work could be updated regularly and used in combination with foodborne surveillance information to prioritize serovars of public health concern.
Affiliation(s)
- Gavin J Fenske
  - EpiX Analytics, Fort Collins, Colorado, United States of America
- Jane G Pouzou
  - EpiX Analytics, Fort Collins, Colorado, United States of America
- Régis Pouillot
  - EpiX Analytics, Fort Collins, Colorado, United States of America
- Daniel D Taylor
  - EpiX Analytics, Fort Collins, Colorado, United States of America
- Solenne Costard
  - EpiX Analytics, Fort Collins, Colorado, United States of America
10. Cohen N, Veksler-Lublinsky I. A large-scale phylogeny-guided analysis of pseudogenes in Pseudomonas aeruginosa bacterium. Microbiol Spectr 2023; 11:e0170423. [PMID: 37750703 PMCID: PMC10580986 DOI: 10.1128/spectrum.01704-23] [Received: 06/08/2023] [Accepted: 08/11/2023] [Indexed: 09/27/2023] Open
Abstract
Pseudogenes, once considered "junk DNA" based on the incorrect assumption that the absence of full coding potential means a complete lack of functionality, have recently become a subject of significant interest in the scientific community. Concurrently, it is widely assumed that bacterial genomes are compact and have a high density of coding genes with little room for non-coding genes, including pseudogenes. A key aspect of genome annotation is the correct identification of genes and the distinction between coding genes and pseudogenes, as it directly impacts functional and comparative genomics studies. In this study, we analyzed the genomic data of 4,699 strains of the bacterium Pseudomonas aeruginosa (P. aeruginosa), as they exhibit high variability in the number of annotated pseudogenes. In particular, we looked for correlations between the number of pseudogenes and other genomic and meta-features of the strains. We identified clusters of orthologous genes and pseudogenes and compared cluster size distributions and length homogeneity within clusters. We then mapped and examined orthology relationships between genes and pseudogenes. Additionally, we generated a phylogenetic tree of the strains and found that phylogenetically related strains are more homogeneous in the number of pseudogenes and share a significant number of pseudogenes. Finally, we delved into clusters of orthologous genes and pseudogenes and quantified their phylogenetic neighborhood, classifying pseudogenes into evolutionarily preserved pseudogenes, mis-annotated pseudogenes, or pseudogenes formed by failed horizontal transfer events. This in-depth study provides important insights that can be incorporated into pseudogene annotation pipelines in the future.
IMPORTANCE: Accurate annotation of genes and pseudogenes is vital for comparative genomics analysis. Recent studies have shown that bacterial pseudogenes have an important role in regulatory processes and can provide insight into the evolutionary history of homologous genes or the genome as a whole. Due to pseudogenes' nature as non-functional genes, there is no commonly accepted definition of a pseudogene, which poses difficulties in verifying the annotation through experimental methods and resolving discrepancies among different annotation techniques. Our study introduces an in-depth analysis of annotated genes and pseudogenes and insights that can be incorporated into improved pseudogene annotation pipelines in the future.
Affiliation(s)
- Nimrod Cohen
  - Department of Software and Information Systems Engineering, Faculty of Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
- Isana Veksler-Lublinsky
  - Department of Software and Information Systems Engineering, Faculty of Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
11. Yan S, Jiang Z, Zhang W, Liu Z, Dong X, Li D, Liu Z, Li C, Liu X, Zhu L. Genomes-based MLST, cgMLST, wgMLST and SNP analysis of Salmonella Typhimurium from animals and humans. Comp Immunol Microbiol Infect Dis 2023; 96:101973. [PMID: 36989679 DOI: 10.1016/j.cimid.2023.101973] [Received: 01/14/2023] [Revised: 03/19/2023] [Accepted: 03/21/2023] [Indexed: 03/29/2023]
Abstract
Salmonella Typhimurium (S. Typhimurium) is an important food-borne and zoonotic pathogen that causes salmonellosis. With the development of whole genome sequencing (WGS), genome-based typing has been widely applied in bacteriology. In this study, we investigated the genotypes and phylogenetic clusters of S. Typhimurium isolates from humans and animals in different provinces of China (Beijing, Shandong, Guangxi, Shaanxi, Henan, and Shanghai) during 2009-2018 using WGS-based multilocus sequence typing (MLST), core genome MLST (cgMLST), whole genome MLST (wgMLST), and single nucleotide polymorphism (SNP) analysis. A total of 29 S. Typhimurium isolates from chickens (n = 22), sick pigeons (n = 2), patients (n = 4), and a sick swine (n = 1) were tested. MLST analysis divided the strains into four STs: ST19 (n = 14), ST34 (n = 12), ST128 (n = 2), and ST1544 (n = 1). cgMLST and wgMLST divided the 29 strains into 27 cgSTs and 29 wgSTs, respectively. Phylogenetic clustering divided the isolates into 4 clusters and 4 singletons. SNP analysis was used to examine the MLST, cgMLST, and wgMLST results. Finally, a comparison of MLST, cgMLST, wgMLST, and SNP analysis showed that their precision increased in that order. In summary, genomic typing and phylogenetic relationships of 29 S. Typhimurium strains from different sources in China were analyzed. These findings are useful for investigating the molecular pathogenesis, bacterial diversity, and traceability of Salmonella.
Affiliation(s)
- Shigan Yan
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Zhaoxu Jiang
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Wencheng Zhang
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Zhenhai Liu
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Xiaorui Dong
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Donghui Li
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Zijun Liu
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Chengyu Li
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Xu Liu
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
- Liping Zhu
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, PR China
12
Sicilia C, Corral-Lugo A, Smialowski P, McConnell MJ, Martín-Galiano AJ. Unsupervised Machine Learning Organization of the Functional Dark Proteome of Gram-Negative "Superbugs": Six Protein Clusters Amenable for Distinct Scientific Applications. ACS Omega 2022; 7:46131-46145. [PMID: 36570227 PMCID: PMC9774411 DOI: 10.1021/acsomega.2c04076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2022] [Accepted: 10/06/2022] [Indexed: 06/17/2023]
Abstract
Uncharacterized proteins have been underutilized as targets for the development of novel therapeutics for difficult-to-treat bacterial infections. To facilitate the exploration of these proteins, 2819 predicted, uncharacterized proteins (19.1% of the total) from reference strains of multidrug Acinetobacter baumannii, Klebsiella pneumoniae, and Pseudomonas aeruginosa species were organized using an unsupervised k-means machine learning algorithm. Classification using normalized values for protein length, pI, hydrophobicity, degree of conservation, structural disorder, and %AT of the coding gene rendered six natural clusters. Cluster proteins showed different trends regarding operon membership, expression, presence of unknown function domains, and interactomic relevance. Clusters 2, 4, and 5 were enriched with highly disordered proteins, nonworkable membrane proteins, and likely spurious proteins, respectively. Clusters 1, 3, and 6 showed closer distances to known antigens, antibiotic targets, and virulence factors. Up to 21.8% of proteins in these clusters were structurally covered by modeling, which allowed assessment of druggability and discontinuous B-cell epitopes. Five proteins (4 in Cluster 1) were potential druggable targets for antibiotherapy. Eighteen proteins (11 in Cluster 6) were strong B-cell and T-cell immunogen candidates for vaccine development. Conclusively, we provide a feature-based schema to fractionate the functional dark proteome of critical pathogens for fundamental and biomedical purposes.
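The clustering step described above, k-means on normalized protein features, can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the three features and their values are made up, z-score scaling is an assumed choice of normalization, and the evenly spaced centroid seeding is a simplification chosen to keep the sketch deterministic.

```python
import math

def zscore_columns(rows):
    """Rescale every feature column to mean 0 and standard deviation 1."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    # A constant column has zero spread; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((v - m) ** 2 for v in col) / len(col)) or 1.0
            for col, m in zip(columns, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]

def kmeans(points, k, max_iter=100):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    # Assumption: seed centroids with evenly spaced input rows (deterministic).
    centroids = [list(points[i * len(points) // k]) for i in range(k)]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                      for p in points]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(c) / len(members) for c in zip(*members)]
    return labels

# Made-up rows of (protein length, pI, hydrophobicity) for six proteins.
TOY_PROTEINS = [
    [120, 5.1, -0.40], [130, 5.3, -0.50], [125, 5.0, -0.45],
    [900, 9.5, 0.80], [950, 9.8, 0.90], [920, 9.6, 0.85],
]

if __name__ == "__main__":
    print(kmeans(zscore_columns(TOY_PROTEINS), k=2))
```

Normalizing first matters because features such as length (hundreds of residues) would otherwise dominate the Euclidean distance over features such as pI.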
Affiliation(s)
- Carlos Sicilia
- Intrahospital Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
- Andrés Corral-Lugo
- Intrahospital Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
- Pawel Smialowski
- Core Facility Bioinformatics, Biomedical Center Munich, Faculty of Medicine, Ludwig Maximilians Universität München, Munich 80539, Germany
- Institute of Stem Cell Research, Helmholtz Center Munich, Planegg-Martinsried 82152, Germany
- Michael J. McConnell
- Intrahospital Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
- Antonio J. Martín-Galiano
- Intrahospital Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
13
Montero-Calasanz MDC, Yaramis A, Rohde M, Schumann P, Klenk HP, Meier-Kolthoff JP. Genotype-phenotype correlations within the Geodermatophilaceae. Front Microbiol 2022; 13:975365. [PMID: 36439792 PMCID: PMC9686282 DOI: 10.3389/fmicb.2022.975365] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 06/22/2022] [Accepted: 10/11/2022] [Indexed: 11/11/2022]
Abstract
The integration of genomic information into microbial systematics along with physiological and chemotaxonomic parameters provides for a reliable classification of prokaryotes. In silico analysis of chemotaxonomic traits is now being introduced to replace characteristics traditionally determined in the laboratory with the dual goal of both increasing the speed of the description of taxa and the accuracy and consistency of taxonomic reports. Genomics has already successfully been applied in the taxonomic rearrangement of Geodermatophilaceae (Actinomycetota) but in the light of new genomic data the taxonomy of the family needs to be revisited. In conjunction with the taxonomic characterisation of four strains phylogenetically located within the family, we conducted a phylogenetic analysis of the whole proteomes of the sequenced type strains and established genotype-phenotype correlations for traits related to chemotaxonomy, cell morphology and metabolism. Results indicated that the four isolates under study represent four novel species within the genus Blastococcus. Additionally, the genera Blastococcus, Geodermatophilus and Modestobacter were shown to be paraphyletic. Consequently, the new genera Trujillonella, Pleomorpha and Goekera were proposed within the Geodermatophilaceae and Blastococcus endophyticus was reclassified as Trujillonella endophytica comb. nov., Geodermatophilus daqingensis as Pleomorpha daqingensis comb. nov. and Modestobacter deserti as Goekera deserti comb. nov. Accordingly, we also proposed emended descriptions of Blastococcus aggregatus, Blastococcus jejuensis, Blastococcus saxobsidens and Blastococcus xanthilyniticus. In silico chemotaxonomic results were overall consistent with wet-lab results. Even though in silico discriminatory levels varied depending on the respective chemotaxonomic trait, this approach is promising for effectively replacing and/or complementing chemotaxonomic analyses at taxonomic ranks above the species level. 
Finally, interesting but previously overlooked insights regarding morphology and ecology were revealed by the presence of a repertoire of genes related to flagellum synthesis, chemotaxis, spore production and pilus assembly in all representatives of the family. A rich carbon metabolism including four different CO2 fixation pathways and a battery of enzymes able to degrade complex carbohydrates were also identified in Blastococcus genomes.
Affiliation(s)
- Maria del Carmen Montero-Calasanz
- IFAPA Las Torres-Andalusian Institute of Agricultural and Fisheries Research and Training, Junta de Andalucía, Seville, Spain
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- *Correspondence: Maria del Carmen Montero-Calasanz,
- Adnan Yaramis
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- Manfred Rohde
- Central Facility for Microscopy, HZI – Helmholtz Centre for Infection Research, Braunschweig, Germany
- Peter Schumann
- Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
- Hans-Peter Klenk
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- Jan P. Meier-Kolthoff
- Department Bioinformatics and Databases, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
14
McInerney JO. Prokaryotic Pangenomes Act as Evolving Ecosystems. Mol Biol Evol 2022; 40:6775222. [PMID: 36288801 PMCID: PMC9851318 DOI: 10.1093/molbev/msac232] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 09/06/2022] [Revised: 10/11/2022] [Accepted: 10/20/2022] [Indexed: 01/24/2023]
Abstract
Understanding adaptation to the local environment is a central tenet and a major focus of evolutionary biology. But this is only part of the adaptionist story. In addition to the external environment, one of the main drivers of genome composition is genetic background. In this perspective, I argue that there is a growing body of evidence that intra-genomic selective pressures play a significant part in the composition of prokaryotic genomes and play a significant role in the origin, maintenance and structuring of prokaryotic pangenomes.
15
Adade NE, Aniweh Y, Mosi L, Valvano MA, Duodu S, Ahator SD. Comparative analysis of Vibrio cholerae isolates from Ghana reveals variations in genome architecture and adaptation of outbreak and environmental strains. Front Microbiol 2022; 13:998182. [PMID: 36312941 PMCID: PMC9608740 DOI: 10.3389/fmicb.2022.998182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2022] [Accepted: 09/20/2022] [Indexed: 12/01/2022]
Abstract
Recurrent epidemics of cholera denote robust adaptive mechanisms of Vibrio cholerae for ecological shifting and persistence despite variable stress conditions. Tracking the evolution of pathobiological traits requires comparative genomic studies of isolates from endemic areas. Here, we investigated the genetic differentiation among V. cholerae clinical and environmental isolates by highlighting the genomic divergence associated with gene decay, genome plasticity, and the acquisition of virulence and adaptive traits. The clinical isolates showed high phylogenetic relatedness due to a higher frequency of shared orthologs and fewer gene variants in contrast to the evolutionarily divergent environmental strains. Divergence of the environmental isolates is linked to extensive genomic rearrangements in regions containing mobile genetic elements resulting in numerous breakpoints, relocations, and insertions coupled with the loss of virulence determinants acf, zot, tcp, and ctx in the genomic islands. Also, four isolates possessed the CRISPR-Cas systems with spacers specific for Vibrio phages and plasmids. Genome synteny and homology analysis of the CRISPR-Cas systems suggest horizontal acquisition. The marked differences in the distribution of other phage and plasmid defense systems such as Zorya, DdmABC, DdmDE, and type-I Restriction Modification systems among the isolates indicated a higher propensity for plasmid or phage disseminated traits in the environmental isolates. Our results reveal that V. cholerae strains undergo extensive genomic rearrangements coupled with gene acquisition, reflecting their adaptation during ecological shifts and pathogenicity.
Affiliation(s)
- Nana Eghele Adade
- West African Centre for Cell Biology of Infectious Pathogens, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Department of Biochemistry, Cell, and Molecular Biology, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Wellcome-Wolfson Institute for Experimental Medicine, Queen’s University Belfast, Belfast, United Kingdom
- Department of Microbiology, Korle-Bu Teaching Hospital, Accra, Ghana
- Yaw Aniweh
- West African Centre for Cell Biology of Infectious Pathogens, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Lydia Mosi
- West African Centre for Cell Biology of Infectious Pathogens, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Department of Biochemistry, Cell, and Molecular Biology, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Miguel A. Valvano
- Wellcome-Wolfson Institute for Experimental Medicine, Queen’s University Belfast, Belfast, United Kingdom
- Samuel Duodu
- West African Centre for Cell Biology of Infectious Pathogens, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Department of Biochemistry, Cell, and Molecular Biology, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana
- Samuel Duodu,
- Stephen Dela Ahator
- Centre for New Antibacterial Strategies (CANS) and Research Group for Host-Microbe Interactions, Department of Medical Biology, Faculty of Health Sciences, UiT- The Arctic University of Norway, Tromsø, Norway
- *Correspondence: Stephen Dela Ahator,
16
Soler-Camargo NC, Silva-Pereira TT, Zimpel CK, Camacho MF, Zelanis A, Aono AH, Patané JS, Dos Santos AP, Guimarães AMS. The rate and role of pseudogenes of the Mycobacterium tuberculosis complex. Microb Genom 2022; 8. [PMID: 36250787 DOI: 10.1099/mgen.0.000876] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 11/18/2022]
Abstract
Whole-genome sequence analyses have significantly contributed to the understanding of virulence and evolution of the Mycobacterium tuberculosis complex (MTBC), the causative pathogens of tuberculosis. Most MTBC evolutionary studies are focused on single nucleotide polymorphisms and deletions, but few studies have evaluated gene content, whereas none has comprehensively evaluated pseudogenes. Accordingly, we describe an extensive study focused on quantifying and predicting possible functions of MTBC and Mycobacterium canettii pseudogenes. Using NCBI's PGAP-detected pseudogenes, we analysed 25 837 pseudogenes from 158 MTBC and M. canettii strains and combined transcriptomics and proteomics of M. tuberculosis H37Rv to gain insights about pseudogenes' expression. Our results indicate significant variability concerning rate and conservancy of in silico predicted pseudogenes among different ecotypes and lineages of tuberculous mycobacteria and pseudogenization of important virulence factors and genes of the metabolism and antimicrobial resistance/tolerance. We show that in silico predicted pseudogenes contribute considerably to MTBC genetic diversity at the population level. Moreover, the transcription machinery of M. tuberculosis can fully transcribe most pseudogenes, indicating intact promoters and recent pseudogene evolutionary emergence. Proteomics of M. tuberculosis and close evaluation of mutational lesions driving pseudogenization suggest that few in silico predicted pseudogenes are likely capable of neofunctionalization, nonsense mutation reversal, or phase variation, contradicting the classical definition of pseudogenes. Such findings indicate that genome annotation should be accompanied by proteomics and protein function assays to improve its accuracy.
While indels and insertion sequences are the main drivers of the observed mutational lesions in these species, population bottlenecks and genetic drift are likely the evolutionary processes acting on pseudogenes' emergence over time. Our findings unveil a new perspective on MTBC's evolution and genetic diversity.
Affiliation(s)
- Naila Cristina Soler-Camargo
- Laboratory of Applied Research in Mycobacteria, Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, SP, Brazil
- Department of Preventive Veterinary Medicine and Animal Health, College of Veterinary Medicine, University of São Paulo, São Paulo, SP, Brazil
- Taiana Tainá Silva-Pereira
- Laboratory of Applied Research in Mycobacteria, Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, SP, Brazil
- Cristina Kraemer Zimpel
- Laboratory of Applied Research in Mycobacteria, Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, SP, Brazil
- Department of Preventive Veterinary Medicine and Animal Health, College of Veterinary Medicine, University of São Paulo, São Paulo, SP, Brazil
- Maurício F Camacho
- Functional Proteomics Laboratory, Federal University of São Paulo (UNIFESP), São José dos Campos, SP, Brazil
- André Zelanis
- Functional Proteomics Laboratory, Federal University of São Paulo (UNIFESP), São José dos Campos, SP, Brazil
- Alexandre H Aono
- Center of Molecular Biology and Genetic Engineering, University of Campinas, Campinas, SP, Brazil
- Institute of Science and Technology, Federal University of São Paulo (UNIFESP), São José dos Campos, SP, Brazil
- Ana Marcia Sá Guimarães
- Laboratory of Applied Research in Mycobacteria, Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, SP, Brazil
- Department of Comparative Pathobiology, College of Veterinary Medicine, Purdue University
17
Abstract
Speciation is the process by which barriers to gene flow evolve between populations. Although we now know that speciation is largely driven by natural selection, knowledge of the agents of selection and the genetic and genomic mechanisms that facilitate divergence is required for a satisfactory theory of speciation. In this essay, we highlight three advances/problems in our understanding of speciation that have arisen from studies of the genes and genomic regions that underlie the evolution of reproductive isolation. First, we describe how the identification of “speciation” genes makes it possible to identify the agents of selection causing the evolution of reproductive isolation, while also noting that the link between the genetics of phenotypic divergence and intrinsic postzygotic reproductive barriers remains tenuous. Second, we discuss the important role of recombination suppressors in facilitating speciation with gene flow, but point out that the means and timing by which reproductive barriers become associated with recombination cold spots remains uncertain. Third, we establish the importance of ancient genetic variation in speciation, although we argue that the focus of speciation studies on evolutionarily young groups may bias conclusions in favor of ancient variation relative to new mutations.
18
Syberg-Olsen MJ, Garber AI, Keeling PJ, McCutcheon JP, Husnik F. Pseudofinder: detection of pseudogenes in prokaryotic genomes. Mol Biol Evol 2022; 39:6633826. [PMID: 35801562 PMCID: PMC9336565 DOI: 10.1093/molbev/msac153] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Indexed: 12/02/2022]
Abstract
Prokaryotic genomes are usually densely packed with intact and functional genes. However, in certain contexts, such as after recent ecological shifts or extreme population bottlenecks, broken and nonfunctional gene fragments can quickly accumulate and form a substantial fraction of the genome. Identification of these broken genes, called pseudogenes, is a critical step for understanding the evolutionary forces acting upon, and the functional potential encoded within, prokaryotic genomes. Here, we present Pseudofinder, open-source software dedicated to pseudogene identification and analysis in bacterial and archaeal genomes. We demonstrate that Pseudofinder’s multi-pronged, reference-based approach can detect a wide variety of pseudogenes, including those that are highly degraded and typically missed by gene-calling pipelines, as well as newly formed pseudogenes containing only one or a few inactivating mutations. Additionally, Pseudofinder can detect genes that lack inactivating substitutions but are experiencing relaxed selection. Implementation of Pseudofinder in annotation pipelines will allow more precise estimations of the functional potential of sequenced microbes, while also generating new hypotheses related to the evolutionary dynamics of bacterial and archaeal genomes.
Affiliation(s)
- Arkadiy I Garber
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Patrick J Keeling
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
- John P McCutcheon
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Howard Hughes Medical Institute, 4000 Jones Bridge Road, Chevy Chase, Maryland, USA
- Filip Husnik
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
- Okinawa Institute of Science and Technology, Okinawa, Japan
19
Feng Y, Wang Z, Chien KY, Chen HL, Liang YH, Hua X, Chiu CH. "Pseudo-pseudogenes" in bacterial genomes: Proteogenomics reveals a wide but low protein expression of pseudogenes in Salmonella enterica. Nucleic Acids Res 2022; 50:5158-5170. [PMID: 35489061 PMCID: PMC9122581 DOI: 10.1093/nar/gkac302] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 12/07/2021] [Revised: 04/11/2022] [Accepted: 04/14/2022] [Indexed: 12/03/2022]
Abstract
Pseudogenes (genes disrupted by frameshifts or in-frame stop codons) are ubiquitous in bacterial genomes and are considered nonfunctional fossils. Here, we used RNA-seq and mass-spectrometry technologies to measure the transcriptomes and proteomes of Salmonella enterica serovars Paratyphi A and Typhi. All pseudogenes’ mRNA sequences remained disrupted, and were present at levels comparable to those of their intact homologs. At the protein level, however, 101 out of 161 pseudogenes showed evidence of successful translation, with low expression regardless of growth conditions, genetic background and cause of pseudogenization. The majority of frameshifting detected was compensatory for -1 frameshift mutations. Readthrough of in-frame stop codons primarily involved UAG, and cytosine was the most frequent base adjacent to the codon. Using a fluorescence reporter system, fifteen pseudogenes were confirmed to express successfully in vivo in Escherichia coli. Expression of the intact copy of the fifteen pseudogenes in S. Typhi affected bacterial pathogenesis as revealed in human macrophage and epithelial cell infection models. The above findings suggest the need to revisit the nonstandard translation mechanism as well as the biological role of pseudogenes in the bacterial genome.
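The working definition of a pseudogene used above (disruption by a frameshift or an in-frame stop codon) can be sketched as a simple sequence check. This is a crude illustration under stated assumptions, not the annotation method used in the study: the function names are hypothetical, the three standard stop codons are assumed, and a length that is not a multiple of three stands in for a frameshift, which in practice is identified by comparison with an intact homolog.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons, DNA alphabet

def premature_stops(cds):
    """Return codon indices of in-frame stops located before the final codon."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return [i for i, codon in enumerate(codons[:-1]) if codon in STOP_CODONS]

def looks_disrupted(cds):
    """Flag a coding sequence with a premature stop or a broken reading frame."""
    # Assumption: a length not divisible by 3 is treated as a frameshift proxy.
    return len(cds) % 3 != 0 or bool(premature_stops(cds))

if __name__ == "__main__":
    print(premature_stops("ATGAAATAGGGGTAA"))  # TAG interrupts the reading frame
    print(looks_disrupted("ATGAAAGGGTAA"))     # intact toy sequence
```

A check like this only flags candidates at the DNA level; the study's point is that many such candidates are still translated through compensatory frameshifting or stop-codon readthrough.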
Affiliation(s)
- Ye Feng
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, People's Republic of China
- Institute of Translational Medicine, Zhejiang University School of Medicine, Hangzhou, People's Republic of China
- Zeyu Wang
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, People's Republic of China
- Institute of Translational Medicine, Zhejiang University School of Medicine, Hangzhou, People's Republic of China
- Kun-Yi Chien
- Graduate Institute of Biomedical Sciences, Chang Gung University College of Medicine, Taoyuan, Republic of China
- Hsiu-Ling Chen
- Molecular Infectious Disease Research Center, Chang Gung Memorial Hospital, Taoyuan, Republic of China
- Yi-Hua Liang
- Molecular Infectious Disease Research Center, Chang Gung Memorial Hospital, Taoyuan, Republic of China
- Xiaoting Hua
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, People's Republic of China
- Cheng-Hsun Chiu
- Graduate Institute of Biomedical Sciences, Chang Gung University College of Medicine, Taoyuan, Republic of China
- Molecular Infectious Disease Research Center, Chang Gung Memorial Hospital, Taoyuan, Republic of China
- Division of Pediatric Infectious Diseases, Department of Pediatrics, Chang Gung Memorial Hospital, Chang Gung University College of Medicine, Taoyuan, Republic of China
20
Neupert S, McCulloch GA, Foster BJ, Waters JM, Szyszka P. Reduced olfactory acuity in recently flightless insects suggests rapid regressive evolution. BMC Ecol Evol 2022; 22:50. [PMID: 35429979 PMCID: PMC9013461 DOI: 10.1186/s12862-022-02005-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 07/04/2021] [Accepted: 04/08/2022] [Indexed: 11/10/2022]
Abstract
Background
Insects have exceptionally fast smelling capabilities, and some can track the temporal structure of odour plumes at rates above 100 Hz. It has been hypothesized that this fast smelling capability is an adaptation for flying. We test this hypothesis by comparing the olfactory acuity of sympatric flighted versus flightless lineages within a wing-polymorphic stonefly species.
Results
Our analyses of olfactory receptor neuron responses reveal that recently-evolved flightless lineages have reduced olfactory acuity. By comparing flighted versus flightless ecotypes with similar genetic backgrounds, we eliminate other confounding factors that might have affected the evolution of their olfactory reception mechanisms. Our detection of different patterns of reduced olfactory response strength and speed in independently wing-reduced lineages suggests parallel evolution of reduced olfactory acuity.
Conclusions
These reductions in olfactory acuity echo the rapid reduction of wings themselves, and represent an olfactory parallel to the convergent phenotypic shifts seen under selective gradients in other sensory systems (e.g. parallel loss of vision in cave fauna). Our study provides evidence for the hypothesis that flight poses a selective pressure on the speed and strength of olfactory receptor neuron responses and emphasizes the energetic costs of rapid olfaction.
21
Environmental stress leads to genome streamlining in a widely distributed species of soil bacteria. The ISME Journal 2022; 16:423-434. [PMID: 34408268 PMCID: PMC8776746 DOI: 10.1038/s41396-021-01082-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 11/16/2020] [Revised: 07/14/2021] [Accepted: 07/28/2021] [Indexed: 02/07/2023]
Abstract
Bacteria have highly flexible pangenomes, which are thought to facilitate evolutionary responses to environmental change, but the impacts of environmental stress on pangenome evolution remain unclear. Using a landscape pangenomics approach, I demonstrate that environmental stress leads to consistent, continuous reduction in genome content along four environmental stress gradients (acidity, aridity, heat, salinity) in naturally occurring populations of Bradyrhizobium diazoefficiens (widespread soil-dwelling plant mutualists). Using gene-level network and duplication functional traits to predict accessory gene distributions across environments, genes predicted to be superfluous are more likely lost in high stress, while genes with multi-functional roles are more likely retained. Genes with higher probabilities of being lost with stress contain significantly higher proportions of codons under strong purifying and positive selection. Gene loss is widespread across the entire genome, with high gene-retention hotspots in close spatial proximity to core genes, suggesting Bradyrhizobium has evolved to cluster essential-function genes (accessory genes with multifunctional roles and core genes) in discrete genomic regions, which may stabilise viability during genomic decay. In conclusion, pangenome evolution through genome streamlining is an important evolutionary response to environmental change. This raises questions about impacts of genome streamlining on the adaptive capacity of bacterial populations facing rapid environmental change.
22
Marques AT, Tanoeiro L, Duarte A, Gonçalves L, Vítor JMB, Vale FF. Genomic Analysis of Prophages from Klebsiella pneumoniae Clinical Isolates. Microorganisms 2021; 9:2252. [PMID: 34835377 PMCID: PMC8617712 DOI: 10.3390/microorganisms9112252] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 09/15/2021] [Revised: 10/15/2021] [Accepted: 10/25/2021] [Indexed: 12/15/2022]
Abstract
Klebsiella pneumoniae is an increasing threat to public health and represents one of the most concerning pathogens involved in life-threatening infections. Resistance and virulence determinants are encoded by mobile genetic elements, which can easily spread between bacterial populations and co-evolve with their genomic host. In this study, we present the full genomic sequences, insertion sites and phylogenetic analysis of 150 prophages found in 40 K. pneumoniae clinical isolates obtained from an outbreak in a Portuguese hospital. All strains harbored at least one prophage and we identified 104 intact prophages (69.3%). The prophage size ranges from 29.7 to 50.6 kbp, coding between 32 and 78 putative genes. The prophage GC content is 51.2%, lower than the average GC content of 57.1% in K. pneumoniae. Complete prophages were classified into three families in the order Caudovirales: Myoviridae (59.6%), Siphoviridae (38.5%) and Podoviridae (1.9%). In addition, an alignment and phylogenetic analysis revealed nine distinct clusters. Evidence of recombination was detected within the genome of some prophages but, in most cases, proteins involved in viral structure, transcription, replication and regulation (lysogenic/lysis) were maintained. These results support the knowledge that prophages are diverse and widely disseminated in K. pneumoniae genomes, contributing to the evolution of this species and conferring additional phenotypes. Moreover, we identified a set of endolysin genes in K. pneumoniae prophages, which were found to code for proteins with lysozyme activity, cleaving the β-1,4 linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in the peptidoglycan network and thus representing genes with the potential for lysin phage therapy.
Affiliation(s)
- Andreia T. Marques
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed-ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal
- Luís Tanoeiro
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed-ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal
- Aida Duarte
- Faculty of Pharmacy, Universidade de Lisboa, Av. Gama Pinto, 1649-003 Lisboa, Portugal
- Centro de Investigação Interdisciplinar Egas Moniz, Instituto Universitário Egas Moniz, 2829-511 Monte da Caparica, Portugal
- Luisa Gonçalves
- Clinical Pathology Unit, Hospital SAMS, Cidade de Gabela, 1849-017 Lisboa, Portugal
- Jorge M. B. Vítor
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed-ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal
- Filipa F. Vale
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed-ULisboa), Faculty of Pharmacy, Universidade de Lisboa, 1649-003 Lisboa, Portugal
23
Armbruster CR, Marshall CW, Garber AI, Melvin JA, Zemke AC, Moore J, Zamora PF, Li K, Fritz IL, Manko CD, Weaver ML, Gaston JR, Morris A, Methé B, DePas WH, Lee SE, Cooper VS, Bomberger JM. Adaptation and genomic erosion in fragmented Pseudomonas aeruginosa populations in the sinuses of people with cystic fibrosis. Cell Rep 2021; 37:109829. [PMID: 34686349 PMCID: PMC8667756 DOI: 10.1016/j.celrep.2021.109829] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 07/30/2021] [Revised: 09/09/2021] [Accepted: 09/22/2021] [Indexed: 10/20/2022] Open
Abstract
Pseudomonas aeruginosa notoriously adapts to the airways of people with cystic fibrosis (CF), yet how infection-site biogeography and associated evolutionary processes vary as lifelong infections progress remains unclear. Here we test the hypothesis that early adaptations promoting aggregation influence evolutionary-genetic trajectories by examining longitudinal P. aeruginosa from the sinuses of six adults with CF. Highly host-adapted lineages harbored mutator genotypes displaying signatures of early genome degradation associated with recent host restriction. Using an advanced imaging technique (MiPACT-HCR [microbial identification after passive clarity technique]), we find population structure tracks with genome degradation, with the most host-adapted, genome-degraded P. aeruginosa (the mutators) residing in small, sparse aggregates. We propose that following initial adaptive evolution in larger populations under strong selection for aggregation, P. aeruginosa persists in small, fragmented populations that experience stronger effects of genetic drift. These conditions enrich for mutators and promote degenerative genome evolution. Our findings underscore the importance of infection-site biogeography to pathogen evolution.
Affiliation(s)
- Catherine R Armbruster
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Arkadiy I Garber
- Biodesign Center for Mechanisms of Evolution and School of Life Sciences, Arizona State University, Tempe, AZ 85281, USA
- Jeffrey A Melvin
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Anna C Zemke
- Department of Medicine, Division of Pulmonary and Critical Care Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- John Moore
- Department of Otolaryngology, University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA
- Paula F Zamora
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Kelvin Li
- Center for Medicine and the Microbiome, University of Pittsburgh and University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA
- Ian L Fritz
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Christopher D Manko
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Madison L Weaver
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Jordan R Gaston
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Alison Morris
- Center for Medicine and the Microbiome, University of Pittsburgh and University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA
- Barbara Methé
- Center for Medicine and the Microbiome, University of Pittsburgh and University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA
- William H DePas
- Department of Pediatrics, Children's Hospital of Pittsburgh and University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Stella E Lee
- Department of Otolaryngology, University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA
- Vaughn S Cooper
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA; Center for Medicine and the Microbiome, University of Pittsburgh and University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA; Pittsburgh Center for Evolutionary Biology & Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
- Jennifer M Bomberger
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15219, USA
24
Garber AI, Kupper M, Laetsch DR, Weldon SR, Ladinsky MS, Bjorkman PJ, McCutcheon JP. The Evolution of Interdependence in a Four-Way Mealybug Symbiosis. Genome Biol Evol 2021; 13:evab123. [PMID: 34061185 PMCID: PMC8331144 DOI: 10.1093/gbe/evab123] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Accepted: 05/24/2021] [Indexed: 01/03/2023] Open
Abstract
Mealybugs are insects that maintain intracellular bacterial symbionts to supplement their nutrient-poor plant sap diets. Some mealybugs have a single betaproteobacterial endosymbiont, a Candidatus Tremblaya species (hereafter Tremblaya) that alone provides the insect with its required nutrients. Other mealybugs have two nutritional endosymbionts that together provision these same nutrients, where Tremblaya has gained a gammaproteobacterial partner that resides in its cytoplasm. Previous work had established that Pseudococcus longispinus mealybugs maintain not one but two species of gammaproteobacterial endosymbionts along with Tremblaya. Preliminary genomic analyses suggested that these two gammaproteobacterial endosymbionts have large genomes with features consistent with a relatively recent origin as insect endosymbionts, but the patterns of genomic complementarity between members of the symbiosis and their relative cellular locations were unknown. Here, using long-read sequencing and various types of microscopy, we show that the two gammaproteobacterial symbionts of P. longispinus are mixed together within Tremblaya cells, and that their genomes are somewhat reduced in size compared with their closest nonendosymbiotic relatives. Both gammaproteobacterial genomes contain thousands of pseudogenes, consistent with a relatively recent shift from a free-living to an endosymbiotic lifestyle. Biosynthetic pathways of key metabolites are partitioned in complex interdependent patterns among the two gammaproteobacterial genomes, the Tremblaya genome, and horizontally acquired bacterial genes that are encoded on the mealybug nuclear genome. Although these two gammaproteobacterial endosymbionts have been acquired recently in evolutionary time, they have already evolved codependencies with each other, Tremblaya, and their insect host.
Affiliation(s)
- Arkadiy I Garber
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Biodesign Center for Mechanisms of Evolution and School of Life Sciences, Arizona State University, Tempe, Arizona, USA
- Maria Kupper
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Biodesign Center for Mechanisms of Evolution and School of Life Sciences, Arizona State University, Tempe, Arizona, USA
- Dominik R Laetsch
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
- Stephanie R Weldon
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Mark S Ladinsky
- Department of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
- Pamela J Bjorkman
- Department of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
- John P McCutcheon
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Biodesign Center for Mechanisms of Evolution and School of Life Sciences, Arizona State University, Tempe, Arizona, USA
25
Uelze L, Borowiak M, Deneke C, Fischer J, Flieger A, Simon S, Szabó I, Tausch SH, Malorny B. Comparative genomics of Salmonella enterica subsp. diarizonae serovar 61:k:1,5,(7) reveals lineage-specific host adaptation of ST432. Microb Genom 2021; 7. [PMID: 34338625 PMCID: PMC8549363 DOI: 10.1099/mgen.0.000604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022] Open
Abstract
Unlike most Salmonella enterica subsp. diarizonae, which are predominantly associated with cold-blooded animals such as reptiles, the serovar IIIb 61:k:1,5,(7) (termed SASd) is regarded as host-adapted to sheep. The bacterium is rarely associated with disease in humans but, nevertheless, SASd isolates are sporadically obtained from human clinical samples. It is unclear whether these transmissions are directly linked to sheep or whether transmissions may, for example, occur through other domestic animals also carrying SASd. For this reason, we utilized whole-genome sequencing to investigate a set of 119 diverse SASd isolates, including sheep-associated and human-associated isolates, as well as isolates obtained from other matrices. We discovered that serovar IIIb 61:k:1,5,(7) is composed of two distinct lineages defined by their sequence types ST432 and ST439. These two lineages are distinguished by a number of genetic features, as well as their prevalence and reservoir. ST432 appears to be the more prevalent sequence type, with the majority of isolates investigated in this study belonging to ST432. In contrast, only a small number of isolates were attributed to ST439. While ST432 isolates were of sheep, human or other origin, all ST439 isolates with source information available were obtained from human clinical samples. Regarding their genetic features, lineage ST432 shows increased pseudogenization, harbours a virB/D4 plasmid that encodes a type IV secretion system (T4SS) and does not possess the iro gene cluster, which encodes a salmochelin siderophore for iron acquisition. These characteristics likely contribute to the ability of ST432 to persistently colonize the intestines of sheep. Furthermore, we found isolates of the lineage ST432 to be highly clonal, with little variation over the sampling period of almost 20 years. We conclude from the genomic comparisons that SASd is subject to a microevolutionary process and that it is specifically lineage ST432 that should be considered as host-adapted to sheep.
Affiliation(s)
- Laura Uelze
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
- Maria Borowiak
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
- Carlus Deneke
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
- Jennie Fischer
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
- Antje Flieger
- Unit for Enteropathogenic Bacteria and Legionella (FG11)/National Reference Centre for Salmonella and Other Bacterial Enteric Pathogens, Robert Koch Institute (RKI), Burgstraße 37, 38855 Wernigerode, Germany
- Sandra Simon
- Unit for Enteropathogenic Bacteria and Legionella (FG11)/National Reference Centre for Salmonella and Other Bacterial Enteric Pathogens, Robert Koch Institute (RKI), Burgstraße 37, 38855 Wernigerode, Germany
- István Szabó
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
- Simon H Tausch
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
- Burkhard Malorny
- Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Straße 8-10, 10589 Berlin, Germany
26
The fish pathogen Aliivibrio salmonicida LFI1238 can degrade and metabolize chitin despite major gene loss in the chitinolytic pathway. Appl Environ Microbiol 2021; 87:e0052921. [PMID: 34319813 DOI: 10.1128/aem.00529-21] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 11/20/2022] Open
Abstract
The fish pathogen Aliivibrio (Vibrio) salmonicida LFI1238 is thought to be incapable of utilizing chitin as a nutrient source since approximately half of the genes representing the chitinolytic pathway are disrupted by insertion sequences. In the present study, we combined a broad set of analytical methods to investigate this hypothesis. Cultivation studies revealed that Al. salmonicida grew efficiently on N-acetylglucosamine (GlcNAc) and chitobiose ((GlcNAc)2), the primary soluble products resulting from enzymatic chitin hydrolysis. The bacterium was also able to grow on chitin particles, albeit at a lower rate compared to the soluble substrates. The genome of the bacterium contains five disrupted chitinase genes (pseudogenes) and three intact genes encoding a glycoside hydrolase family 18 (GH18) chitinase and two auxiliary activity family 10 (AA10) lytic polysaccharide monooxygenases (LPMOs). Biochemical characterization showed that the chitinase and LPMOs were able to depolymerize both α- and β-chitin to (GlcNAc)2 and oxidized chitooligosaccharides, respectively. Notably, the chitinase displayed up to 50-fold lower activity compared to other well-studied chitinases. Deletion of the genes encoding the intact chitinolytic enzymes showed that the chitinase was important for growth on β-chitin, whereas the LPMO gene-deletion variants only showed minor growth defects on this substrate. Finally, proteomic analysis of Al. salmonicida LFI1238 growth on β-chitin showed expression of all three chitinolytic enzymes, and intriguingly also three of the disrupted chitinases. In conclusion, our results show that Al. salmonicida LFI1238 can utilize chitin as a nutrient source and that the GH18 chitinase and the two LPMOs are needed for this ability. IMPORTANCE The ability to utilize chitin as a source of nutrients is important for the survival and spread of marine microbial pathogens in the environment. One such pathogen is Aliivibrio (Vibrio) salmonicida, the causative agent of cold water vibriosis. Due to extensive gene decay, many key enzymes in the chitinolytic pathway have been disrupted, putatively rendering this bacterium incapable of chitin degradation and utilization. In the present study we demonstrate that Al. salmonicida can degrade and metabolize chitin, the most abundant biopolymer in the ocean. Our findings shed new light on the environmental adaptation of this fish pathogen.
27
Danneels B, Viruel J, Mcgrath K, Janssens SB, Wales N, Wilkin P, Carlier A. Patterns of transmission and horizontal gene transfer in the Dioscorea sansibarensis leaf symbiosis revealed by whole-genome sequencing. Curr Biol 2021; 31:2666-2673.e4. [PMID: 33852872 DOI: 10.1016/j.cub.2021.03.049] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 08/18/2020] [Revised: 12/07/2020] [Accepted: 03/15/2021] [Indexed: 11/26/2022]
Abstract
Leaves of the wild yam species Dioscorea sansibarensis display prominent forerunner or "drip" tips filled with extracellular bacteria of the species Orrella dioscoreae.1 This species of yam is native to Madagascar and tropical Africa and reproduces mainly asexually through aerial bulbils and underground tubers, which also contain a small population of O. dioscoreae.2,3 Despite apparent vertical transmission, the genome of O. dioscoreae does not show any of the hallmarks of genome erosion often found in hereditary symbionts (e.g., small genome size and accumulation of pseudogenes).4-6 We investigated here the range and distribution of leaf symbiosis between D. sansibarensis and O. dioscoreae using preserved leaf samples from herbarium collections that were originally collected from various locations in Africa. We recovered DNA from the extracellular symbiont in all samples, showing that the symbiosis is widespread throughout continental Africa and Madagascar. Despite the degraded nature of this DNA, we constructed 17 symbiont genomes using de novo methods without relying on a reference. Phylogenetic and genomic analyses revealed that horizontal transmission of symbionts and horizontal gene transfer have shaped the evolution of the symbiont. These mechanisms could help explain the lack of signs of reductive genome evolution despite an obligate host-associated lifestyle. Furthermore, phylogenetic analysis of D. sansibarensis based on plastid genomes revealed a strong geographical clustering of samples and provided evidence that the symbiosis originated at least 13 mya, earlier than previously estimated.3
Affiliation(s)
- Bram Danneels
- Laboratory of Microbiology, Ghent University, 9000 Ghent, Belgium
- Juan Viruel
- Royal Botanical Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Krista Mcgrath
- Department of Prehistory and Institute of Environmental Science and Technology (ICTA), Autonomous University of Barcelona, 08193 Bellaterra, Spain; Department of Archaeology, University of York, Heslington, York YO10 5DD, UK
- Steven B Janssens
- Meise Botanic Garden, 1860 Meise, Belgium; Department of Biology, KU Leuven, 3000 Leuven, Belgium
- Nathan Wales
- Department of Archaeology, University of York, Heslington, York YO10 5DD, UK
- Paul Wilkin
- Royal Botanical Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Aurélien Carlier
- Laboratory of Microbiology, Ghent University, 9000 Ghent, Belgium; LIPME, Université de Toulouse, INRAE, CNRS, 31320 Castanet-Tolosan, France
28
Assis RAB, Varani AM, Sagawa CHD, Patané JSL, Setubal JC, Uceda-Campos G, da Silva AM, Zaini PA, Almeida NF, Moreira LM, Dandekar AM. A comparative genomic analysis of Xanthomonas arboricola pv. juglandis strains reveal hallmarks of mobile genetic elements in the adaptation and accelerated evolution of virulence. Genomics 2021; 113:2513-2525. [PMID: 34089784 DOI: 10.1016/j.ygeno.2021.06.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/09/2020] [Revised: 03/01/2021] [Accepted: 06/01/2021] [Indexed: 01/25/2023]
Abstract
Xanthomonas arboricola pv. juglandis (Xaj) is the most significant aboveground walnut bacterial pathogen. Disease management uses copper-based pesticides, which induce pathogen resistance. We examined the genetic repertoire associated with adaptation and virulence evolution in Xaj. Comparative genomics of 32 Xaj strains reveals the possible acquisition and propagation of virulence factors via insertion sequences (IS). Fine-scale annotation revealed a Tn3 transposon (TnXaj417) encoding copper resistance genes acquired by horizontal gene transfer and associated with adaptation and tolerance to metal-based pesticides commonly used to manage pathogens in orchard ecosystems. Phylogenomic analysis reveals IS involvement in the acquisition and diversification of type III effector proteins, which number two to eight in non-pathogenic strains and 16 to 20 in pathogenic strains, in addition to six other putative effectors with a reduced degree of identity found mostly among pathogenic strains. Yersiniabactin, xopK, xopAI, and antibiotic resistance genes are also located near ISs or inside genomic islands and structures resembling composite transposons.
Affiliation(s)
- Renata A B Assis
- Center of Research in Biological Science, Federal University of Ouro Preto, Ouro Preto, MG, Brazil; Department of Plant Sciences, University of California, Davis, CA, USA
- Alessandro M Varani
- Faculty of Agricultural and Veterinary Sciences of Jaboticabal (FCAV), Universidade Estadual Paulista (UNESP), Department of Technology, Jaboticabal, SP, Brazil
- Cintia H D Sagawa
- Department of Plant Sciences, University of California, Davis, CA, USA
- José S L Patané
- Cell Cycle Laboratory, Butantan Institute, Sao Paulo, SP, Brazil
- João Carlos Setubal
- Department of Biochemistry, Chemistry Institute, University of Sao Paulo, Sao Paulo, SP, Brazil
- Guillermo Uceda-Campos
- Department of Biochemistry, Chemistry Institute, University of Sao Paulo, Sao Paulo, SP, Brazil
- Aline Maria da Silva
- Department of Biochemistry, Chemistry Institute, University of Sao Paulo, Sao Paulo, SP, Brazil
- Paulo A Zaini
- Department of Plant Sciences, University of California, Davis, CA, USA
- Nalvo F Almeida
- School of Computing, Federal University of Mato Grosso do Sul, Mato Grosso do Sul, MS, Brazil
- Leandro Marcio Moreira
- Center of Research in Biological Science, Federal University of Ouro Preto, Ouro Preto, MG, Brazil; Department of Biological Science, Institute of Exact and Biological Science, Federal University of Ouro Preto, Ouro Preto, MG, Brazil
- Abhaya M Dandekar
- Department of Plant Sciences, University of California, Davis, CA, USA
29
Lobb B, Tremblay BJM, Moreno-Hagelsieb G, Doxey AC. An assessment of genome annotation coverage across the bacterial tree of life. Microb Genom 2020; 6. [PMID: 32124724 PMCID: PMC7200070 DOI: 10.1099/mgen.0.000341] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Indexed: 12/13/2022] Open
Abstract
Although gene-finding in bacterial genomes is relatively straightforward, the automated assignment of gene function is still challenging, resulting in a vast quantity of hypothetical sequences of unknown function. But how prevalent are hypothetical sequences across bacteria, what proportion of genes in different bacterial genomes remain unannotated, and what factors affect annotation completeness? To address these questions, we surveyed over 27 000 bacterial genomes from the Genome Taxonomy Database, and measured genome annotation completeness as a function of annotation method, taxonomy, genome size, 'research bias' and publication date. Our analysis revealed that 52 and 79 % of the average bacterial proteome could be functionally annotated based on protein and domain-based homology searches, respectively. Annotation coverage using protein homology search varied significantly from as low as 14 % in some species to as high as 98 % in others. We found that taxonomy is a major factor influencing annotation completeness, with distinct trends observed across the microbial tree (e.g. the lowest level of completeness was found in the Patescibacteria lineage). Most lineages showed a significant association between genome size and annotation incompleteness, likely reflecting a greater degree of uncharacterized sequences in 'accessory' proteomes than in 'core' proteomes. Finally, research bias, as measured by publication volume, was also an important factor influencing genome annotation completeness, with early model organisms showing high completeness levels relative to other genomes in their own taxonomic lineages. Our work highlights the disparity in annotation coverage across the bacterial tree of life and emphasizes a need for more experimental characterization of accessory proteomes as well as understudied lineages.
Affiliation(s)
- Briallen Lobb
- Department of Biology, University of Waterloo, 200 University Avenue West, Waterloo, ON N2L 3G1, Canada
- Gabriel Moreno-Hagelsieb
- Department of Biology, Wilfrid Laurier University, 75 University Avenue West, Waterloo, ON, Canada
- Andrew C Doxey
- Department of Biology, University of Waterloo, 200 University Avenue West, Waterloo, ON N2L 3G1, Canada
30
Genetic Variation and Preliminary Indications of Divergent Niche Adaptation in Cryptic Clade II of Escherichia. Microorganisms 2020; 8:microorganisms8111713. [PMID: 33142902 PMCID: PMC7716201 DOI: 10.3390/microorganisms8111713] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 09/30/2020] [Revised: 10/24/2020] [Accepted: 10/30/2020] [Indexed: 12/03/2022] Open
Abstract
The evolution, habitat, and lifestyle of the cryptic clade II of Escherichia, whose members were first recovered at low frequency from non-human hosts and later from external environments, have been poorly understood. Here, the genomes of selected strains were analyzed for preliminary indications of ecological differentiation within their population. We adopted the delta bitscore metrics to detect functional divergence of their orthologous genes and trained a random forest classifier to differentiate the genomes according to habitat (gastrointestinal vs external environment). The model was built with the inclusion of other Escherichia genomes previously demonstrated to exhibit genomic traits of adaptation to one of the habitats. Overall, gene degradation was more prominent in the gastrointestinal strains. The trained model correctly classified the genomes, identifying a set of predictor genes that were informative of habitat association. Functional divergence in many of these genes was reflective of ecological divergence. Accuracy of the trained model was confirmed by its correct prediction of the habitats of an independent set of strains with known habitat association. In summary, the cryptic clade II of Escherichia displayed genomic signatures that are consistent with divergent adaptation to gastrointestinal and external environments.
31
Malik A, Kim YR, Kim SB. Genome Mining of the Genus Streptacidiphilus for Biosynthetic and Biodegradation Potential. Genes (Basel) 2020; 11:genes11101166. [PMID: 33022985 PMCID: PMC7601586 DOI: 10.3390/genes11101166] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 07/18/2020] [Revised: 09/26/2020] [Accepted: 09/29/2020] [Indexed: 12/23/2022] Open
Abstract
The genus Streptacidiphilus represents a group of acidophilic actinobacteria within the family Streptomycetaceae, and currently encompasses 15 validly named species, which include five recent additions within the last two years. Considering the potential of the related genera within the family, namely Streptomyces and Kitasatospora, these relatively new members of the family can also be a promising source of novel secondary metabolites. At present, 15 genome sequences for 11 species from this genus are available, which can provide valuable information on their biology, including the potential for metabolite production as well as enzymatic activities, in comparison to the neighboring taxa. In this study, the genome sequences of 11 Streptacidiphilus species were subjected to comparative analysis together with selected Streptomyces and Kitasatospora genomes. This study represents the first comprehensive comparative genomic analysis of the genus Streptacidiphilus. The results indicate that the genomes of Streptacidiphilus contain various secondary metabolite (SM) producing biosynthetic gene clusters (BGCs), some of them identified exclusively in Streptacidiphilus. Several of these clusters may code for SMs with a broad range of bioactivities, such as antibacterial, antifungal, antimalarial and antitumor activities. The biodegradation capabilities of Streptacidiphilus were also explored by investigating the hydrolytic enzymes for complex carbohydrates. Although all genomes were enriched with carbohydrate-active enzymes (CAZymes), their numbers in the genomes of some strains such as Streptacidiphilus carbonis NBRC 100919T were higher as compared to well-known carbohydrate degrading organisms. These distinctive features of each Streptacidiphilus species make them interesting candidates for future studies with respect to their potential for SM production and enzymatic activities.
Affiliation(s)
- Adeel Malik
- Department of Microbiology and Molecular Biology, Chungnam National University, Daejeon 34134, Korea
- Institute of Intelligence Informatics Technology, Sangmyung University, Seoul 03016, Korea
- Yu Ri Kim
- Department of Microbiology and Molecular Biology, Chungnam National University, Daejeon 34134, Korea
- Seung Bum Kim
- Department of Microbiology and Molecular Biology, Chungnam National University, Daejeon 34134, Korea
32
Chu X, Li S, Wang S, Luo D, Luo H. Gene loss through pseudogenization contributes to the ecological diversification of a generalist Roseobacter lineage. ISME JOURNAL 2020; 15:489-502. [PMID: 32999421 DOI: 10.1038/s41396-020-00790-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 02/10/2020] [Revised: 09/13/2020] [Accepted: 09/21/2020] [Indexed: 12/11/2022]
Abstract
Ecologically relevant genes generally show patchy distributions among related bacterial genomes. This is commonly attributed to lateral gene transfer, whereas the opposite mechanism, gene loss, has rarely been explored. Pseudogenization is a major mechanism underlying gene loss, and pseudogenes are best characterized by comparing closely related genomes because of their short life spans. To explore the role of pseudogenization in microbial ecological diversification, we apply rigorous methods to characterize pseudogenes in the 279 newly sequenced Ruegeria isolates of the globally abundant Roseobacter group collected from two typical coastal habitats in Hong Kong, the coral Platygyra acuta and the macroalga Sargassum hemiphyllum. Pseudogenes contribute to ~16% of the accessory genomes of these strains. Ancestral state reconstruction reveals that many pseudogenization events are correlated with ancestral niche shifts. Specifically, genes related to resource scavenging and energy acquisition were often pseudogenized when roseobacters inhabiting carbon-limited and energy-poor coral skeleton switched to other resource-richer niches. For roseobacters inhabiting the macroalgal niches, genes for nitrogen regulation and carbohydrate utilization were important but became dispensable upon shift to coral skeleton where nitrate is abundant but carbohydrates are less available. Whereas low-energy-demanding secondary transporters are more favorable in coral skeleton, ATP-driven primary transporters are preferentially kept in the energy-replete macroalgal niches. Moreover, a large proportion of these families mediate organismal interactions, suggesting their rapid losses by pseudogenization as a potential response to host and niche shift. These findings illustrate an important role of pseudogenization in shaping genome content and driving ecological diversification of marine roseobacters.
Affiliation(s)
- Xiao Chu
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
- Siyao Li
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
- Sishuo Wang
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
- Danli Luo
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
- Haiwei Luo
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR; Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518000, China
|
33
|
Abstract
The genomes of bacteria contain fewer genes and substantially less noncoding DNA than those of eukaryotes, and as a result, they have much less raw material to invent new traits. Yet, bacteria are vastly more taxonomically diverse, numerically abundant, and globally successful in colonizing new habitats compared to eukaryotes. Although bacterial genomes are generally considered to be optimized for efficient growth and rapid adaptation, nonadaptive processes have played a major role in shaping the size, contents, and compact organization of bacterial genomes and have allowed the establishment of deleterious traits that serve as the raw materials for genetic innovation.
Affiliation(s)
- Paul C Kirchberger
- Department of Integrative Biology, University of Texas at Austin, Texas 78712, USA
- Marian L Schmidt
- Department of Integrative Biology, University of Texas at Austin, Texas 78712, USA
- Howard Ochman
- Department of Integrative Biology, University of Texas at Austin, Texas 78712, USA
|
34
|
Fumero MV, Villani A, Susca A, Haidukowski M, Cimmarusti MT, Toomajian C, Leslie JF, Chulze SN, Moretti A. Fumonisin and Beauvericin Chemotypes and Genotypes of the Sister Species Fusarium subglutinans and Fusarium temperatum. Appl Environ Microbiol 2020; 86:e00133-20. [PMID: 32358011 PMCID: PMC7301838 DOI: 10.1128/aem.00133-20] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 01/17/2020] [Accepted: 04/28/2020] [Indexed: 12/19/2022] Open
Abstract
Fusarium subglutinans and Fusarium temperatum are common maize pathogens that produce mycotoxins and cause plant disease. The ability of these species to produce beauvericin and fumonisin mycotoxins is not settled, as reports of toxin production are not concordant. Our objective was to clarify this situation by determining both the chemotypes and genotypes for strains from both species. We analyzed 25 strains from Argentina, 13 F. subglutinans and 12 F. temperatum strains, for toxin production by ultraperformance liquid chromatography mass spectrometry (UPLC-MS). We used new genome sequences from two strains of F. subglutinans and one strain of F. temperatum, plus genomes of other Fusarium species, to determine the presence of functional gene clusters for the synthesis of these toxins. None of the strains examined from either species produced fumonisins. These strains also lack Fum biosynthetic genes but retain homologs of some genes that flank the Fum cluster in Fusarium verticillioides. None of the F. subglutinans strains we examined produced beauvericin although 9 of 12 F. temperatum strains did. A complete beauvericin (Bea) gene cluster was present in all three new genome sequences. The Bea1 gene was presumably functional in F. temperatum but was not functional in F. subglutinans due to a large insertion and multiple mutations that resulted in premature stop codons. The accumulation of only a few mutations expected to disrupt Bea1 suggests that the process of its inactivation is relatively recent. Thus, none of the strains of F. subglutinans or F. temperatum we examined produce fumonisins, and the strains of F. subglutinans examined also cannot produce beauvericin. Variation in the ability of strains of F. temperatum to produce beauvericin requires further study and could reflect the recent shared ancestry of these two species.
IMPORTANCE Fusarium subglutinans and F. temperatum are sister species and maize pathogens commonly isolated worldwide that can produce several mycotoxins and cause seedling disease, stalk rot, and ear rot. The ability of these species to produce beauvericin and fumonisin mycotoxins is not settled, as reports of toxin production are not concordant at the species level. Our results are consistent with previous reports that strains of F. subglutinans produce neither fumonisins nor beauvericin. The status of toxin production by F. temperatum needs further work. Our strains of F. temperatum did not produce fumonisins, while some strains produced beauvericin and others did not. These results enable more accurate risk assessments of potential mycotoxin contamination if strains of these species are present. The nature of the genetic inactivation of BEA1 is consistent with its relatively recent occurrence and the close phylogenetic relationship of the two sister species.
Affiliation(s)
- M Veronica Fumero
- Research Institute on Mycology and Mycotoxicology, National Research Council of Argentina, National University of Rio Cuarto, Rio Cuarto, Cordoba, Argentina
- Antonia Susca
- Institute of Sciences of Food Production, CNR, Bari, Italy
- John F Leslie
- Department of Plant Pathology, Kansas State University, Manhattan, Kansas, USA
- Sofia N Chulze
- Research Institute on Mycology and Mycotoxicology, National Research Council of Argentina, National University of Rio Cuarto, Rio Cuarto, Cordoba, Argentina
|
35
|
Cervantes-Rivera R, Tronnet S, Puhar A. Complete genome sequence and annotation of the laboratory reference strain Shigella flexneri serotype 5a M90T and genome-wide transcriptional start site determination. BMC Genomics 2020; 21:285. [PMID: 32252626 PMCID: PMC7132871 DOI: 10.1186/s12864-020-6565-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 06/24/2019] [Accepted: 02/07/2020] [Indexed: 01/19/2023] Open
Abstract
Background Shigella is a Gram-negative facultative intracellular bacterium that causes bacillary dysentery in humans. Shigella invades cells of the colonic mucosa owing to its virulence plasmid-encoded Type 3 Secretion System (T3SS), and multiplies in the target cell cytosol. Although the laboratory reference strain S. flexneri serotype 5a M90T has been extensively used to understand the molecular mechanisms of pathogenesis, its complete genome sequence is not available, thereby greatly limiting studies employing high-throughput sequencing and systems biology approaches. Results We have sequenced, assembled, annotated and manually curated the full genome of S. flexneri 5a M90T. This yielded two complete circular contigs, the chromosome and the virulence plasmid (pWR100). To obtain the genome sequence, we have employed long-read PacBio DNA sequencing followed by polishing with Illumina RNA-seq data. This provides a new hybrid strategy to prepare gapless, highly accurate genome sequences, which also cover AT-rich tracks or repetitive sequences that are transcribed. Furthermore, we have performed genome-wide analysis of transcriptional start sites (TSS) and determined the length of 5′ untranslated regions (5′-UTRs) at typical culture conditions for the inoculum of in vitro infection experiments. We identified 6723 primary TSS (pTSS) and 7328 secondary TSS (sTSS). The S. flexneri 5a M90T annotated genome sequence and the transcriptional start sites are integrated into RegulonDB (http://regulondb.ccg.unam.mx) and RSAT (http://embnet.ccg.unam.mx/rsat/) databases to use their analysis tools in the S. flexneri 5a M90T genome. Conclusions We provide the first complete genome for S. flexneri serotype 5a, specifically the laboratory reference strain M90T. Our work opens the possibility of employing S. flexneri M90T in high-quality systems biology studies such as transcriptomic and differential expression analyses or in genome evolution studies. 
Moreover, the catalogue of TSS that we report here can be used in molecular pathogenesis studies as a resource to know which genes are transcribed before infection of host cells. The genome sequence, together with the analysis of transcriptional start sites, is also a valuable tool for precise genetic manipulation of S. flexneri 5a M90T. Further, we present a new hybrid strategy to prepare gapless, highly accurate genome sequences. Unlike currently used hybrid strategies combining long- and short-read DNA sequencing technologies to maximize accuracy, our workflow using long-read DNA sequencing and short-read RNA sequencing provides the added value of using non-redundant technologies, which yield distinct, exploitable datasets.
Affiliation(s)
- Ramón Cervantes-Rivera
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), 901 87 Umeå, Sweden; Umeå Centre for Microbial Research (UCMR), 901 87, Umeå, Sweden; Department of Molecular Biology, Umeå University, 901 87, Umeå, Sweden
- Sophie Tronnet
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), 901 87 Umeå, Sweden; Umeå Centre for Microbial Research (UCMR), 901 87, Umeå, Sweden; Department of Molecular Biology, Umeå University, 901 87, Umeå, Sweden
- Andrea Puhar
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), 901 87 Umeå, Sweden; Umeå Centre for Microbial Research (UCMR), 901 87, Umeå, Sweden; Department of Molecular Biology, Umeå University, 901 87, Umeå, Sweden
|
36
|
Machado JP, Antunes A. The genomic context of retrocopies increases their chance of functional relevancy in mammals. Genomics 2020; 112:2410-2417. [PMID: 31981699 DOI: 10.1016/j.ygeno.2020.01.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 10/12/2018] [Revised: 01/03/2020] [Accepted: 01/21/2020] [Indexed: 11/30/2022]
Abstract
Often described as "junk" DNA, pseudogenes are defunct remnants of previously active genes present in genomes. Pseudogenes are categorized into two main classes: processed pseudogenes, formed through retrotransposition, and non-processed pseudogenes, which typically originate from gene decay following duplication events. The term "processed pseudogene" has given way to "retrocopy" because these sequences are likely to evolve new functional roles and become retrogenes. Here, we surveyed 38,080 retrocopies from chimpanzee, dog, human, mouse, and rat genomes to assess their potential adaptive value. Retrocopies inserted in the same chromosome as the parental gene have a higher chance of remaining potentially "active" (absence of premature stop codons and frameshifts) (~26.1%), while those placed in a different chromosome have a twofold lower chance of remaining potentially "active" (~7.52%). The genomic context of their placement seems to be associated with their expression. Retrocopies placed in intragenic regions and in the same sense as the "host" gene have a higher chance of being expressed relative to other genomic contexts. The proximity of retrocopies to their parental gene is associated with a lower decay rate, and their location likely influences their expression. Thus, despite their unclear role, retrocopies are probably involved in adaptive processes. Our results provide evidence of natural selection acting on retrocopies.
Affiliation(s)
- João Paulo Machado
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Abel Salazar Biomedical Sciences Institute (ICBAS), University of Porto, Porto, Portugal
- Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Department of Biology, Faculty of Sciences, University of Porto, 4169 007 Porto, Portugal
|
37
|
Goodhead I, Blow F, Brownridge P, Hughes M, Kenny J, Krishna R, McLean L, Pongchaikul P, Beynon R, Darby AC. Large-scale and significant expression from pseudogenes in Sodalis glossinidius - a facultative bacterial endosymbiont. Microb Genom 2020; 6:e000285. [PMID: 31922467 PMCID: PMC7067036 DOI: 10.1099/mgen.0.000285] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 10/16/2017] [Accepted: 07/10/2019] [Indexed: 01/30/2023] Open
Abstract
The majority of bacterial genomes have high coding efficiencies, but there are some genomes of intracellular bacteria that have low gene density. The genome of the endosymbiont Sodalis glossinidius contains almost 50 % pseudogenes containing mutations that putatively silence them at the genomic level. We have applied multiple 'omic' strategies, combining Illumina and Pacific Biosciences Single-Molecule Real-Time DNA sequencing and annotation, stranded RNA sequencing and proteome analysis to better understand the transcriptional and translational landscape of Sodalis pseudogenes, and potential mechanisms for their control. Between 53 and 74 % of the Sodalis transcriptome remains active in cell-free culture. The mean sense transcription from coding domain sequences (CDSs) is four times greater than that from pseudogenes. Comparative genomic analysis of six Illumina-sequenced Sodalis isolates from different host Glossina species shows pseudogenes make up ~40 % of the 2729 genes in the core genome, suggesting that they are stable and/or that Sodalis is a recent introduction across the genus Glossina as a facultative symbiont. These data shed further light on the importance of transcriptional and translational control in deciphering host-microbe interactions. The combination of genomics, transcriptomics and proteomics gives a multidimensional perspective for studying prokaryotic genomes with a view to elucidating evolutionary adaptation to novel environmental niches.
Affiliation(s)
- Ian Goodhead
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- School of Science, Engineering and Environment, Peel Building, University of Salford, M5 4WT, UK
- Frances Blow
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Department of Entomology, Cornell University, Ithaca 14853, NY, USA
- Philip Brownridge
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Margaret Hughes
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- John Kenny
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Ritesh Krishna
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- IBM Research UK, STFC Daresbury Laboratory, Warrington, WA4 4AD, UK
- Lynn McLean
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Pisut Pongchaikul
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Rob Beynon
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Alistair C. Darby
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
|
38
|
Abstract
Prokaryotes commonly undergo genome reduction, particularly in the case of symbiotic bacteria. Genome reductions tend toward the energetically favorable removal of unnecessary, redundant, or nonfunctional genes. However, without mechanisms to compensate for these losses, deleterious mutation and genetic drift might otherwise overwhelm a population. Among the mechanisms employed to counter gene loss and share evolutionary success within a population, gene transfer agents (GTAs) are increasingly becoming recognized as important contributors. Although viral in origin, GTA particles package fragments of their "host" genome for distribution within a population of cells, often in a synchronized manner, rather than selfishly packaging genes necessary for their spread. Microbes as diverse as archaea and alpha-proteobacteria have been known to produce GTA particles, which are capable of transferring selective advantages such as virulence factors and antibiotic resistance. In this review, we discuss the various types of GTAs identified thus far, focusing on a defined set of symbiotic alpha-proteobacteria known to carry them. Drawing attention to the predicted presence of these genes, we discuss their potential within the selective marine and terrestrial environments occupied by mutualistic, parasitic, and endosymbiotic microbes.
Affiliation(s)
- Steen Christensen
- Department of Biological Sciences, Florida International University, Miami, FL, USA; Biomolecular Sciences Institute, Florida International University, Miami, FL, USA
- Laura R Serbus
- Department of Biological Sciences, Florida International University, Miami, FL, USA; Biomolecular Sciences Institute, Florida International University, Miami, FL, USA
|
39
|
Genetic determinants of genus-level glycan diversity in a bacterial protein glycosylation system. PLoS Genet 2019; 15:e1008532. [PMID: 31869330 PMCID: PMC6959607 DOI: 10.1371/journal.pgen.1008532] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 09/24/2019] [Revised: 01/14/2020] [Accepted: 11/22/2019] [Indexed: 12/27/2022] Open
Abstract
The human pathogens N. gonorrhoeae and N. meningitidis display robust intra- and interstrain glycan diversity associated with their O-linked protein glycosylation (pgl) systems. In an effort to better understand the evolution and function of protein glycosylation operating there, we aimed to determine if other human-restricted, Neisseria species similarly glycosylate proteins and if so, to assess the levels of glycoform diversity. Comparative genomics revealed the conservation of a subset of genes minimally required for O-linked protein glycosylation glycan and established those pgl genes as core genome constituents of the genus. In conjunction with mass spectrometric–based glycan phenotyping, we found that extant glycoform repertoires in N. gonorrhoeae, N. meningitidis and the closely related species N. polysaccharea and N. lactamica reflect the functional replacement of a progenitor glycan biosynthetic pathway. This replacement involved loss of pgl gene components of the primordial pathway coincident with the acquisition of two exogenous glycosyltransferase genes. Critical to this discovery was the identification of a ubiquitous but previously unrecognized glycosyltransferase gene (pglP) that has uniquely undergone parallel but independent pseudogenization in N. gonorrhoeae and N. meningitidis. We suggest that the pseudogenization events are driven by processes of compositional epistasis leading to gene decay. Additionally, we documented instances where inter-species recombination influences pgl gene status and creates discordant genetic interactions due ostensibly to the multi-locus nature of pgl gene networks. In summary, these findings provide a novel perspective on the evolution of protein glycosylation systems and identify phylogenetically informative, genetic differences associated with Neisseria species. Bacteria express a remarkable diversity of sugars and oligosaccharides in conjunction with protein glycosylation systems. 
Currently however, little is known about the evolutionary processes and selective forces shaping glycan biosynthetic pathways. The closely related bacterial pathogens Neisseria gonorrhoeae and Neisseria meningitidis remain serious sources of human disease and these species express antigenically variable oligosaccharides as components of their broad-spectrum, O‐linked protein glycosylation (pgl) systems. With the exception of isolates of Neisseria elongata subspecies glycolytica, the status of such post-translational modifications in related commensal species colonizing humans remains largely undefined. Here, we exploit new data from further studies of protein glycosylation in Neisseria elongata subspecies glycolytica to address these concerns. Employing comparative genomics and glycan phenotyping, we show that related pgl systems are indeed expressed by all human-restricted Neisseria species but identify unique gene gain and loss events as well as loss-of-function polymorphisms that accommodate a dramatic shift in glycoform structure occurring across the genus. These findings constitute novel perspectives on both the evolution of protein glycosylation systems in general and the macroevolutionary processes occurring in related bacterial species residing within a single host.
|
40
|
Varesio LM, Willett JW, Fiebig A, Crosson S. A Carbonic Anhydrase Pseudogene Sensitizes Select Brucella Lineages to Low CO2 Tension. J Bacteriol 2019; 201:e00509-19. [PMID: 31481543 PMCID: PMC6805109 DOI: 10.1128/jb.00509-19] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 08/05/2019] [Accepted: 08/27/2019] [Indexed: 01/01/2023] Open
Abstract
Brucella spp. are intracellular pathogens that cause a disease known as brucellosis. Though the genus is highly monomorphic at the genetic level, species have animal host preferences and some defining physiologic characteristics. Of note is the requirement for CO2 supplementation to cultivate particular species, which confounded early efforts to isolate B. abortus from diseased cattle. Differences in the capacity of Brucella species to assimilate CO2 are determined by mutations in the carbonic anhydrase gene, bcaA. Ancestral single-nucleotide insertions in bcaA have resulted in frameshifted pseudogenes in B. abortus and B. ovis lineages, which underlie their inability to grow under the low CO2 tension of a standard atmosphere. Incubation of wild-type B. ovis in air selects for mutations that "rescue" a functional bcaA reading frame, which enables growth under low CO2 and enhances the growth rate under high CO2. Accordingly, we show that heterologous expression of functional Escherichia coli carbonic anhydrases enables B. ovis growth in air. Growth of B. ovis is acutely sensitive to a reduction in CO2 tension, while frame-rescued B. ovis mutants are insensitive to CO2 shifts. B. ovis initiates a gene expression program upon CO2 downshift that resembles the stringent response and results in transcriptional activation of its type IV secretion system. Our study provides evidence that loss-of-function insertion mutations in bcaA sensitize the response of B. ovis and B. abortus to reduced CO2 tension relative to that of other Brucella lineages. CO2-dependent starvation and virulence gene expression programs in these species may influence persistence or transmission in natural hosts.
IMPORTANCE Brucella spp. are highly related, but they exhibit differences in animal host preference that must be determined by genome sequence differences. B. ovis and the majority of B. abortus strains require high CO2 tension to be cultivated in vitro and harbor conserved insertional mutations in the carbonic anhydrase gene, bcaA, which underlie this trait. Mutants that grow in a standard atmosphere, first reported nearly a century ago, are easily selected in the laboratory. These mutants harbor varied indel polymorphisms in bcaA that restore its consensus reading frame and rescue its function. Loss of bcaA function has evolved independently in the B. ovis and B. abortus lineages and results in a dramatically increased sensitivity to CO2 limitation.
Affiliation(s)
- Lydia M Varesio
- Committee on Microbiology, University of Chicago, Chicago, Illinois, USA
- Jonathan W Willett
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, USA
- Aretha Fiebig
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, USA
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, USA
- Sean Crosson
- Committee on Microbiology, University of Chicago, Chicago, Illinois, USA
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, USA
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, USA
|
41
|
Schelkunov MI, Nuraliev MS, Logacheva MD. Rhopalocnemis phalloides has one of the most reduced and mutated plastid genomes known. PeerJ 2019; 7:e7500. [PMID: 31565552 PMCID: PMC6745192 DOI: 10.7717/peerj.7500] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Received: 12/20/2018] [Accepted: 07/16/2019] [Indexed: 11/20/2022] Open
Abstract
Although most plant species are photosynthetic, several hundred species have lost the ability to photosynthesize and instead obtain nutrients via various types of heterotrophic feeding. Their plastid genomes markedly differ from the plastid genomes of photosynthetic plants. In this work, we describe the sequenced plastid genome of the heterotrophic plant Rhopalocnemis phalloides, which belongs to the family Balanophoraceae and feeds by parasitizing other plants. The genome is highly reduced (18,622 base pairs vs. approximately 150 kbp in autotrophic plants) and possesses an extraordinarily high AT content, 86.8%, which is inferior only to AT contents of plastid genomes of Balanophora, a genus from the same family. The gene content of this genome is quite typical of heterotrophic plants, with all of the genes related to photosynthesis having been lost. The remaining genes are notably distorted by a high mutation rate and the aforementioned AT content. The high AT content has led to sequence convergence between some of the remaining genes and their homologs from AT-rich plastid genomes of protists. Overall, the plastid genome of R. phalloides is one of the most unusual plastid genomes known.
Affiliation(s)
- Mikhail I Schelkunov
- Skolkovo Institute of Science and Technology, Moscow, Russia; Institute for Information Transmission Problems, Moscow, Russia
- Maxim S Nuraliev
- Faculty of Biology, Moscow State University, Moscow, Russia; Joint Russian-Vietnamese Tropical Scientific and Technological Center, Cau Giay, Hanoi, Vietnam
- Maria D Logacheva
- Skolkovo Institute of Science and Technology, Moscow, Russia; A.N. Belozersky Research Institute of Physico-Chemical Biology, Moscow State University, Moscow, Russia
|
42
|
Kim NH, Ha EJ, Ko DS, Lee CY, Kim JH, Kwon HJ. Molecular evolution of Salmonella enterica subsp. enterica serovar Gallinarum biovar Gallinarum in the field. Vet Microbiol 2019; 235:63-70. [PMID: 31282380 DOI: 10.1016/j.vetmic.2019.05.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 01/29/2019] [Revised: 05/07/2019] [Accepted: 05/23/2019] [Indexed: 01/31/2023]
Abstract
Salmonella enterica subsp. enterica serovar Gallinarum biovar Gallinarum (SG) causes fowl typhoid (FT) and substantial economic loss in Korea due to egg drop syndrome and mortality. Despite the extensive use of vaccines, FT still occurs in the field. Therefore, the emergence of more pathogenic SG or the recovered pathogenicity of a vaccine strain has been suspected. SpvB, an ADP-ribosyl transferase, is a major pathogenesis determinant, and the length of the polyproline linker (PPL) of SpvB affects pathogenic potency. SG strains accumulate pseudogenes in their genomes during host adaptation, and pseudogene profiling may provide evolutionary information. In this study, we found that the PPL length of Korean SG isolates varied from 11 to 21 prolines and was longer than that of a live vaccine strain, SG 9R (9 prolines). According to growth competition in chickens, the growth of an SG isolate with a PPL length of 17 prolines exceeded that of an SG isolate with a PPL length of 15 prolines. We investigated the pseudogenes of the field isolates, SG 9R and reference strains in GenBank by resequencing and comparative genomics. The pseudogene profiles of the field isolates were notably different from those of the foreign SG strains, and they were subdivided into 7 pseudogene subgroups. Collectively, the field isolates had gradually evolved by changing PPL length and acquiring additional pseudogenes. Thus, the characterization of PPL length and pseudogene profiling may be useful to understand the molecular evolution of SG and the epidemiology of FT.
Affiliation(s)
- Nam-Hyung Kim
- Laboratory of Avian Diseases, College of Veterinary Medicine, Seoul National University, Seoul 08826, Republic of Korea
- Eun-Jin Ha
- Laboratory of Avian Diseases, College of Veterinary Medicine, Seoul National University, Seoul 08826, Republic of Korea
- Dae-Sung Ko
- Laboratory of Avian Diseases, College of Veterinary Medicine, Seoul National University, Seoul 08826, Republic of Korea
- Chung-Young Lee
- Laboratory of Avian Diseases, College of Veterinary Medicine, Seoul National University, Seoul 08826, Republic of Korea; Research Institute for Veterinary Science, College of Veterinary Medicine, BK21 for Veterinary Science, Seoul 08826, Republic of Korea
- Jae-Hong Kim
- Laboratory of Avian Diseases, College of Veterinary Medicine, Seoul National University, Seoul 08826, Republic of Korea; Research Institute for Veterinary Science, College of Veterinary Medicine, BK21 for Veterinary Science, Seoul 08826, Republic of Korea
- Hyuk-Joon Kwon
- Department of Farm Animal Medicine, College of Veterinary Medicine, Seoul National University, Seoul 08826, Republic of Korea; Research Institute for Veterinary Science, College of Veterinary Medicine, BK21 for Veterinary Science, Seoul 08826, Republic of Korea; Farm Animal Clinical Training and Research Center (FACTRC), GBST, Seoul National University, Kangwon-do 25354, Republic of Korea
|
43
|
Sevigny JL, Rothenheber D, Diaz KS, Zhang Y, Agustsson K, Bergeron RD, Thomas WK. Marker genes as predictors of shared genomic function. BMC Genomics 2019; 20:268. [PMID: 30947688 PMCID: PMC6449922 DOI: 10.1186/s12864-019-5641-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 07/12/2018] [Accepted: 03/24/2019] [Indexed: 12/15/2022] Open
Abstract
Background Although high-throughput marker gene studies provide valuable insight into the diversity and relative abundance of taxa in microbial communities, they do not provide direct measures of their functional capacity. Recently, scientists have shown a general desire to predict functional profiles of microbial communities based on phylogenetic identification inferred from marker genes, and recent tools have been developed to link the two. However, to date, no large-scale examination has quantified the correlation between the marker gene based taxonomic identity and protein coding gene conservation. Here we utilize 4872 representative prokaryotic genomes from NCBI to investigate the relationship between marker gene identity and shared protein coding gene content. Results Even at 99–100% marker gene identity, genomes share on average less than 75% of their protein coding gene content. This occurs regardless of the marker gene(s) used: V4 region of the 16S rRNA, complete 16S rRNA, or single copy orthologs through a multi-locus sequence analysis. An important aspect related to this observation is the intra-organism variation of 16S copies from a single genome. Although the majority of 16S copies were found to have high sequence similarity (> 99%), several genomes contained copies that were highly diverged (< 97% identity). Conclusions This is the largest comparison between marker gene similarity and shared protein coding gene content to date. The study highlights the limitations of inferring a microbial community’s functions based on marker gene phylogeny. The data presented expands upon the results of previous studies that examined one or few bacterial species and supports the hypothesis that 16S rRNA and other marker genes cannot be directly used to fully predict the functional potential of a bacterial community. 
Electronic supplementary material The online version of this article (10.1186/s12864-019-5641-1) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Joseph L Sevigny
- Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, 46 College Rd, Rudman Hall, Durham, NH, 03824, USA; Hubbard Center for Genome Studies, University of New Hampshire, 35 Colovos Rd, Gregg Hall, Durham, NH, 03824, USA
- Derek Rothenheber
- Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, 46 College Rd, Rudman Hall, Durham, NH, 03824, USA
- Krystalle Sharlyn Diaz
- Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, 46 College Rd, Rudman Hall, Durham, NH, 03824, USA; Hubbard Center for Genome Studies, University of New Hampshire, 35 Colovos Rd, Gregg Hall, Durham, NH, 03824, USA
- Ying Zhang
- Department of Computer Science, University of New Hampshire, 33 Academic Way, Kingsbury Hall, Durham, NH, 03824, USA
- Kristin Agustsson
- Department of Computer Science, University of New Hampshire, 33 Academic Way, Kingsbury Hall, Durham, NH, 03824, USA
- R Daniel Bergeron
- Department of Computer Science, University of New Hampshire, 33 Academic Way, Kingsbury Hall, Durham, NH, 03824, USA
- W Kelley Thomas
- Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, 46 College Rd, Rudman Hall, Durham, NH, 03824, USA; Hubbard Center for Genome Studies, University of New Hampshire, 35 Colovos Rd, Gregg Hall, Durham, NH, 03824, USA
|
44
|
Eugenia Nuñez-Valdez M, Lanois A, Pagès S, Duvic B, Gaudriault S. Inhibition of Spodoptera frugiperda phenoloxidase activity by the products of the Xenorhabdus rhabduscin gene cluster. PLoS One 2019; 14:e0212809. [PMID: 30794697 PMCID: PMC6386379 DOI: 10.1371/journal.pone.0212809] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 11/05/2018] [Accepted: 02/08/2019] [Indexed: 12/15/2022]
Abstract
We evaluated the impact of bacterial rhabduscin synthesis on bacterial virulence and phenoloxidase inhibition in a Spodoptera model. We first showed that the rhabduscin cluster of the entomopathogenic bacterium Xenorhabdus nematophila was not necessary for virulence in the larvae of Spodoptera littoralis and Spodoptera frugiperda. Bacteria with mutations affecting the rhabduscin synthesis cluster (ΔisnAB and ΔGT mutants) were as virulent as the wild-type strain. We then developed an assay for measuring phenoloxidase activity in S. frugiperda and assessed the ability of bacterial culture supernatants to inhibit the insect phenoloxidase. Our findings confirm that the X. nematophila rhabduscin cluster is required for the inhibition of S. frugiperda phenoloxidase activity. The X. nematophila ΔisnAB mutant was unable to inhibit phenoloxidase, whereas ΔGT mutants displayed intermediate levels of phenoloxidase inhibition relative to the wild-type strain. The culture supernatants of Escherichia coli and of two entomopathogenic bacteria, Serratia entomophila and Xenorhabdus poinarii, were unable to inhibit S. frugiperda phenoloxidase activity. Heterologous expression of the X. nematophila rhabduscin cluster in these three strains was sufficient to restore inhibition. Interestingly, we observed pseudogenization of the X. poinarii rhabduscin gene cluster via the insertion of a 120 bp element into the isnA promoter. The inhibition of phenoloxidase activity by X. poinarii culture supernatants was restored by expression of the X. poinarii rhabduscin cluster under the control of an inducible Ptet promoter, consistent with recent pseudogenization. This study paves the way for advances in our understanding of the virulence of several entomopathogenic bacteria in non-model insects, such as the new invasive S. frugiperda species in Africa.
Affiliation(s)
- Anne Lanois
- DGIMI, INRA, Université de Montpellier, Montpellier, France
- Sylvie Pagès
- DGIMI, INRA, Université de Montpellier, Montpellier, France
- Bernard Duvic
- DGIMI, INRA, Université de Montpellier, Montpellier, France
- Sophie Gaudriault
- DGIMI, INRA, Université de Montpellier, Montpellier, France
|
45
|
Anand A, Olson CA, Yang L, Sastry AV, Catoiu E, Choudhary KS, Phaneuf PV, Sandberg TE, Xu S, Hefner Y, Szubin R, Feist AM, Palsson BO. Pseudogene repair driven by selection pressure applied in experimental evolution. Nat Microbiol 2019; 4:386-389. [DOI: 10.1038/s41564-018-0340-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 07/06/2018] [Accepted: 12/05/2018] [Indexed: 11/09/2022]
|
46
|
How Genomics Is Changing What We Know About the Evolution and Genome of Bordetella pertussis. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2019; 1183:1-17. [PMID: 31321755 DOI: 10.1007/5584_2019_401] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Indexed: 12/13/2022]
Abstract
The evolution of Bordetella pertussis from a common ancestor similar to Bordetella bronchiseptica has occurred through large-scale gene loss, inactivation and rearrangements, largely driven by the spread of insertion sequence element repeats throughout the genome. B. pertussis is widely considered to be monomorphic, and recent evolution of the B. pertussis genome appears to, at least in part, be driven by vaccine-based selection. Given the recent global resurgence of whooping cough despite the wide-spread use of vaccination, a more thorough understanding of B. pertussis genomics could be highly informative. In this chapter we discuss the evolution of B. pertussis, including how vaccination is changing the circulating B. pertussis population at the gene-level, and how new sequencing technologies are revealing previously unknown levels of inter- and intra-strain variation at the genome-level.
|
47
|
Danneels B, Pinto-Carbó M, Carlier A. Patterns of Nucleotide Deletion and Insertion Inferred from Bacterial Pseudogenes. Genome Biol Evol 2018; 10:1792-1802. [PMID: 29982456 PMCID: PMC6054270 DOI: 10.1093/gbe/evy140] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Accepted: 06/29/2018] [Indexed: 02/06/2023]
Abstract
Pseudogenes are a paradigm of neutral evolution and their study has the potential to reveal intrinsic mutational biases. However, this potential is mitigated by the fact that pseudogenes are quickly purged from bacterial genomes. Here, we assembled a large set of pseudogenes from genomes experiencing reductive evolution as well as functional references for which we could establish reliable phylogenetic relationships. Using this unique dataset, we identified 857 independent insertion and deletion mutations and discovered a pervasive bias towards deletions, but not insertions, whose sizes are multiples of 3 nt. We further show that selective constraints for the preservation of gene frame are unlikely to account for the observed mutational bias and propose that a mechanistic bias in alternative end-joining repair, a recombination-independent double strand break DNA repair mechanism, is responsible for the accumulation of 3n deletions.
Affiliation(s)
- Bram Danneels
- Department of Biochemistry and Microbiology, Ghent University, Belgium
- Marta Pinto-Carbó
- Department of Plant and Microbial Biology, University of Zurich, Switzerland
- Aurelien Carlier
- Department of Biochemistry and Microbiology, Ghent University, Belgium
|
48
|
Microevolution of Streptococcus agalactiae ST-261 from Australia Indicates Dissemination via Imported Tilapia and Ongoing Adaptation to Marine Hosts or Environment. Appl Environ Microbiol 2018; 84:AEM.00859-18. [PMID: 29915111 DOI: 10.1128/aem.00859-18] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 04/12/2018] [Accepted: 06/12/2018] [Indexed: 11/20/2022]
Abstract
Streptococcus agalactiae (group B Streptococcus [GBS]) causes disease in a wide range of animals. The serotype Ib lineage is highly adapted to aquatic hosts, exhibiting substantial genome reduction compared with terrestrial conspecifics. Here, we sequenced genomes from 40 GBS isolates, including 25 isolates from wild fish and captive stingrays in Australia, six local veterinary or human clinical isolates, and nine isolates from farmed tilapia in Honduras, and compared them with 42 genomes from public databases. Phylogenetic analysis based on nonrecombinant core-genome single nucleotide polymorphisms (SNPs) indicated that aquatic serotype Ib isolates from Queensland were distantly related to local veterinary and human clinical isolates. In contrast, Australian aquatic isolates are most closely related to a tilapia isolate from Israel, differing by only 63 core-genome SNPs. A consensus minimum spanning tree based on core-genome SNPs indicates the dissemination of sequence type 261 (ST-261) from an ancestral tilapia strain, which is congruent with several introductions of tilapia into Australia from Israel during the 1970s and 1980s. Pangenome analysis identified 1,440 genes as core, with the majority being dispensable or strain specific, with non-protein-coding intergenic regions (IGRs) divided among core and strain-specific genes. Aquatic serotype Ib strains have lost many virulence factors during adaptation, but six adhesins were well conserved across the aquatic isolates and might be critical for virulence in fish and for targets in vaccine development. The close relationship among recent ST-261 isolates from Ghana, the United States, and China with the Israeli tilapia isolate from 1988 implicates the global trade in tilapia seed for aquaculture in the widespread dissemination of serotype Ib fish-adapted GBS.
IMPORTANCE Streptococcus agalactiae (GBS) is a significant pathogen of humans and animals. Some lineages have become adapted to particular hosts, and serotype Ib is highly specialized to fish. Here, we show that this lineage is likely to have been distributed widely by the global trade in tilapia for aquaculture, with probable introduction into Australia in the 1970s and subsequent dissemination in wild fish populations. We report here the variability in the polysaccharide capsule among this lineage but identify a cohort of common surface proteins that may be a focus of future vaccine development to reduce the biosecurity risk in international fish trade.
|
49
|
Lo WS, Kuo CH. Horizontal Acquisition and Transcriptional Integration of Novel Genes in Mosquito-Associated Spiroplasma. Genome Biol Evol 2018; 9:3246-3259. [PMID: 29177479 PMCID: PMC5726471 DOI: 10.1093/gbe/evx244] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Accepted: 11/20/2017] [Indexed: 12/20/2022]
Abstract
Genetic differentiation among symbiotic bacteria is important in shaping biodiversity. The genus Spiroplasma contains species occupying diverse niches and is a model system for symbiont evolution. Previous studies have established that two mosquito-associated species have diverged extensively in their carbohydrate metabolism genes despite having a close phylogenetic relationship. Notably, although the commensal Spiroplasma diminutum lacks identifiable pathogenicity factors, the pathogenic Spiroplasma taiwanense was found to have acquired a virulence factor glpO and its associated genes through horizontal transfer. However, it is unclear if these acquired genes have been integrated into the regulatory network. In this study, we inferred the gene content evolution in these bacteria, as well as examined their transcriptomes in response to glucose availability. The results indicated that both species have many more gene acquisitions from the Mycoides-Entomoplasmataceae clade, which contains several important pathogens of ruminants, than previously thought. Moreover, several acquired genes have higher expression levels than the vertically inherited homologs, indicating possible functional replacement. Finally, the virulence factor and its functionally linked genes in S. taiwanense were up-regulated in response to glucose starvation, suggesting that these acquired genes are under expression regulation and the pathogenicity may be a stress response. In summary, although differential gene losses are a major process for symbiont divergence, gene gains are critical in counteracting genome degradation and driving diversification among facultative symbionts.
Affiliation(s)
- Wen-Sui Lo
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan; Molecular and Biological Agricultural Sciences Program, Taiwan International Graduate Program, National Chung Hsing University and Academia Sinica, Taipei, Taiwan; Graduate Institute of Biotechnology, National Chung Hsing University, Taichung, Taiwan
- Chih-Horng Kuo
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan; Molecular and Biological Agricultural Sciences Program, Taiwan International Graduate Program, National Chung Hsing University and Academia Sinica, Taipei, Taiwan; Biotechnology Center, National Chung Hsing University, Taichung, Taiwan
|
50
|
Wheeler NE, Gardner PP, Barquist L. Machine learning identifies signatures of host adaptation in the bacterial pathogen Salmonella enterica. PLoS Genet 2018; 14:e1007333. [PMID: 29738521 PMCID: PMC5940178 DOI: 10.1371/journal.pgen.1007333] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Received: 12/01/2017] [Accepted: 03/24/2018] [Indexed: 11/18/2022]
Abstract
Emerging pathogens are a major threat to public health; however, understanding how pathogens adapt to new niches remains a challenge. New methods are urgently required to provide functional insights into pathogens from the massive genomic data sets now being generated from routine pathogen surveillance for epidemiological purposes. Here, we measure the burden of atypical mutations in protein coding genes across independently evolved Salmonella enterica lineages, and use these as input to train a random forest classifier to identify strains associated with extraintestinal disease. Members of the species fall along a continuum, from pathovars which cause gastrointestinal infection and low mortality, associated with a broad host range, to those that cause invasive infection and high mortality, associated with a narrowed host range. Our random forest classifier learned to perfectly discriminate long-established gastrointestinal and invasive serovars of Salmonella. Additionally, it was able to discriminate recently emerged Salmonella Enteritidis and Typhimurium lineages associated with invasive disease in immunocompromised populations in sub-Saharan Africa, and within-host adaptation to invasive infection. We dissect the architecture of the model to identify the genes that were most informative of phenotype, revealing a common theme of degradation of metabolic pathways in extraintestinal lineages. This approach accurately identifies patterns of gene degradation and diversifying selection specific to invasive serovars that have been captured by more labour-intensive investigations, but can be readily scaled to larger analyses.
Affiliation(s)
- Nicole E. Wheeler
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Biomolecular Interaction Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Paul P. Gardner
- Biomolecular Interaction Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
- Lars Barquist
- Institute for Molecular Infection Biology, University of Wuerzburg, Wuerzburg, Germany
- Helmholtz Institute for RNA-based Infection Research, Wuerzburg, Germany
|