1
|
Raghuram V, Petit RA, Karol Z, Mehta R, Weissman DB, Read TD. Average nucleotide identity-based Staphylococcus aureus strain grouping allows identification of strain-specific genes in the pangenome. mSystems 2024; 9:e0014324. [PMID: 38934646 PMCID: PMC11265343 DOI: 10.1128/msystems.00143-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 04/16/2024] [Indexed: 06/28/2024] Open
Abstract
Staphylococcus aureus causes both hospital- and community-acquired infections in humans worldwide. Due to the high incidence of infection, S. aureus is also one of the most sampled and sequenced pathogens today, providing an outstanding resource to understand variation at the bacterial subspecies level. We processed and downsampled 83,383 public S. aureus Illumina whole-genome shotgun sequences and 1,263 complete genomes to produce 7,954 representative substrains. Pairwise comparison of average nucleotide identity revealed a natural boundary of 99.5% that could be used to define 145 distinct strains within the species. We found that intermediate frequency genes in the pangenome (present in 10%-95% of genomes) could be divided into those closely linked to strain background ("strain-concentrated") and those highly variable within strains ("strain-diffuse"). Non-core genes had different patterns of chromosome location. Notably, strain-diffuse genes were associated with prophages; strain-concentrated genes were associated with the vSaβ genome island and rare genes (<10% frequency) concentrated near the origin of replication. Antibiotic resistance genes were enriched in the strain-diffuse class, while virulence genes were distributed between strain-diffuse, strain-concentrated, core, and rare classes. This study shows how different patterns of gene movement help create strains as distinct subspecies entities and provide insight into the diverse histories of important S. aureus functions. IMPORTANCE We analyzed the genomic diversity of Staphylococcus aureus, a globally prevalent bacterial species that causes serious infections in humans. Our goal was to build a genetic picture of the different strains of S. aureus and which genes may be associated with them. We reprocessed >84,000 genomes and subsampled to remove redundancy. We found that individual samples sharing >99.5% of their genome could be grouped into strains. We also showed that a portion of genes that are present in intermediate frequency in the species are strongly associated with some strains but completely absent from others, suggesting a role in strain specificity. This work lays the foundation for understanding individual gene histories of the S. aureus species and also outlines strategies for processing large bacterial genomic data sets.
Collapse
Affiliation(s)
- Vishnu Raghuram
- Microbiology and Molecular Genetics Program, Graduate Division of Biological and Biomedical Sciences, Laney Graduate School, Emory University, Atlanta, Georgia, USA
| | - Robert A. Petit
- Division of Infectious Diseases, Department of Medicine, Emory University, Atlanta, Georgia, USA
| | - Zach Karol
- Department of Physics, Emory University, Atlanta, Georgia, USA
| | - Rohan Mehta
- Department of Physics, Emory University, Atlanta, Georgia, USA
| | | | - Timothy D. Read
- Division of Infectious Diseases, Department of Medicine, Emory University, Atlanta, Georgia, USA
| |
Collapse
|
2
|
Raghuram V, Petit RA, Karol Z, Mehta R, Weissman DB, Read TD. Average Nucleotide Identity based Staphylococcus aureus strain grouping allows identification of strain-specific genes in the pangenome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.29.577756. [PMID: 38352482 PMCID: PMC10862745 DOI: 10.1101/2024.01.29.577756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
Staphylococcus aureus causes both hospital and community acquired infections in humans worldwide. Due to the high incidence of infection S. aureus is also one of the most sampled and sequenced pathogens today, providing an outstanding resource to understand variation at the bacterial subspecies level. We processed and downsampled 83,383 public S. aureus Illumina whole genome shotgun sequences and 1,263 complete genomes to produce 7,954 representative substrains. Pairwise comparison of core gene Average Nucleotide Identity (ANI) revealed a natural boundary of 99.5% that could be used to define 145 distinct strains within the species. We found that intermediate frequency genes in the pangenome (present in 10-95% of genomes) could be divided into those closely linked to strain background ("strain-concentrated") and those highly variable within strains ("strain-diffuse"). Non-core genes had different patterns of chromosome location; notably, strain-diffuse associated with prophages, strain-concentrated with the vSaβ genome island and rare genes (<10% frequency) concentrated near the origin of replication. Antibiotic genes were enriched in the strain-diffuse class, while virulence genes were distributed between strain-diffuse, strain-concentrated, core and rare classes. This study shows how different patterns of gene movement help create strains as distinct subspecies entities and provide insight into the diverse histories of important S. aureus functions.
Collapse
Affiliation(s)
- Vishnu Raghuram
- Microbiology and Molecular Genetics Program, Graduate Division of Biological and Biomedical Sciences, Laney Graduate School, Emory University, Atlanta, Georgia, USA
| | - Robert A Petit
- Division of Infectious Diseases, Department of Medicine, Emory University, Atlanta, Georgia, USA
| | - Zach Karol
- Department of Physics, Emory University, Atlanta, Georgia, USA
| | - Rohan Mehta
- Department of Physics, Emory University, Atlanta, Georgia, USA
| | | | - Timothy D. Read
- Division of Infectious Diseases, Department of Medicine, Emory University, Atlanta, Georgia, USA
| |
Collapse
|
3
|
Canário Viana MV, Profeta R, Cerqueira JC, Wattam AR, Barh D, Silva A, Azevedo V. Evidence of episodic positive selection in Corynebacterium diphtheriae complex of species and its implementations in identification of drug and vaccine targets. PeerJ 2022; 10:e12662. [PMID: 35190783 PMCID: PMC8857904 DOI: 10.7717/peerj.12662] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 11/30/2021] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND Within the pathogenic bacterial species Corynebacterium genus, six species that can produce diphtheria toxin (C. belfantii, C. diphtheriae, C. pseudotuberculosis, C. rouxii, C. silvaticum and C. ulcerans) form a clade referred to as the C. diphtheria complex. These species have been found in humans and other animals, causing diphtheria or other diseases. Here we show the results of a genome scale analysis to identify positive selection in protein-coding genes that may have resulted in the adaptations of these species to their ecological niches and suggest drug and vaccine targets. METHODS Forty genomes were sampled to represent species, subspecies or biovars of Corynebacterium. Ten phylogenetic groups were tested for positive selection using the PosiGene pipeline, including species and biovars from the C. diphtheria complex. The detected genes were tested for recombination and had their sequences alignments and homology manually examined. The final genes were investigated for their function and a probable role as vaccine or drug targets. RESULTS Nineteen genes were detected in the species C. diphtheriae (two), C. pseudotuberculosis (10), C. rouxii (one), and C. ulcerans (six). Those were found to be involved in defense, translation, energy production, and transport and in the metabolism of carbohydrates, amino acids, nucleotides, and coenzymes. Fourteen were identified as essential genes, and six as virulence factors. Thirteen from the 19 genes were identified as potential drug targets and four as potential vaccine candidates. These genes could be important in the prevention and treatment of the diseases caused by these bacteria.
Collapse
Affiliation(s)
- Marcus Vinicius Canário Viana
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
- Departamento de Genética, Universidade Federal do Pará, Belém, Pará, Brazil
| | - Rodrigo Profeta
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Janaína Canário Cerqueira
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Alice Rebecca Wattam
- Biocomplexity Institute, University of Virginia, Charlottesville, Virginia, United States
| | - Debmalya Barh
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
- Institute of Integrative Omics and Applied Biotechnology, Nonakuri, West Bengal, India
| | - Artur Silva
- Departamento de Genética, Universidade Federal do Pará, Belém, Pará, Brazil
| | - Vasco Azevedo
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| |
Collapse
|
4
|
Park S, Ronholm J. Staphylococcus aureus in Agriculture: Lessons in Evolution from a Multispecies Pathogen. Clin Microbiol Rev 2021; 34:e00182-20. [PMID: 33568553 PMCID: PMC7950364 DOI: 10.1128/cmr.00182-20] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Staphylococcus aureus is a formidable bacterial pathogen that is responsible for infections in humans and various species of wild, companion, and agricultural animals. The ability of S. aureus to move between humans and livestock is due to specific characteristics of this bacterium as well as modern agricultural practices. Pathoadaptive clonal lineages of S. aureus have emerged and caused significant economic losses in the agricultural sector. While humans appear to be a primary reservoir for S. aureus, the continued expansion of the livestock industry, globalization, and ubiquitous use of antibiotics has increased the dissemination of pathoadaptive S. aureus in this environment. This review comprehensively summarizes the available literature on the epidemiology, pathophysiology, genomics, antibiotic resistance (ABR), and clinical manifestations of S. aureus infections in domesticated livestock. The availability of S. aureus whole-genome sequence data has provided insight into the mechanisms of host adaptation and host specificity. Several lineages of S. aureus are specifically adapted to a narrow host range on a short evolutionary time scale. However, on a longer evolutionary time scale, host-specific S. aureus has jumped the species barrier between livestock and humans in both directions several times. S. aureus illustrates how close contact between humans and animals in high-density environments can drive evolution. The use of antibiotics in agriculture also drives the emergence of antibiotic-resistant strains, making the possible emergence of human-adapted ABR strains from agricultural practices concerning. Addressing the concerns of ABR S. aureus, without negatively affecting agricultural productivity, is a challenging priority.
Collapse
Affiliation(s)
- Soyoun Park
- Faculty of Agricultural and Environmental Sciences, McGill University, Sainte-Anne-de-Bellevue, Quebec, Canada
| | - Jennifer Ronholm
- Faculty of Agricultural and Environmental Sciences, McGill University, Sainte-Anne-de-Bellevue, Quebec, Canada
| |
Collapse
|
5
|
Asrani P, Hasan GM, Sohal SS, Hassan MI. Molecular Basis of Pathogenesis of Coronaviruses: A Comparative Genomics Approach to Planetary Health to Prevent Zoonotic Outbreaks in the 21st Century. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020; 24:634-644. [PMID: 32940573 DOI: 10.1089/omi.2020.0131] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
In the first quarter of the 21st century, we are already facing the third emergence of a coronavirus outbreak, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) responsible for the coronavirus disease 2019 (COVID-19) pandemic. Comparative genomics can inform a deeper understanding of the pathogenesis of COVID-19. Previous strains of coronavirus, SARS-CoV, and Middle-East respiratory syndrome-coronavirus (MERS-CoV), have been known to cause acute lung injuries in humans. SARS-CoV-2 shares genetic similarity with SARS-CoV with some modification in the S protein leading to their enhanced binding affinity toward the angiotensin-converting enzyme 2 (ACE2) receptors of human lung cells. This expert review examines the features of all three coronaviruses through a conceptual lens of comparative genomics. In particular, the life cycle of SARS-CoV-2 that enables its survival within the host is highlighted. Susceptibility of humans to coronavirus outbreaks in the 21st century calls for comparisons of the transmission history, hosts, reservoirs, and fatality rates of these viruses so that evidence-based and effective planetary health interventions can be devised to prevent future zoonotic outbreaks. Comparative genomics offers new insights on putative and novel viral targets with an eye to both therapeutic innovation and prevention. We conclude the expert review by (1) articulating the lessons learned so far, whereas the research is still being actively sought after in the field, and (2) the challenges and prospects in deciphering the linkages among multiomics biological variability and COVID-19 pathogenesis.
Collapse
Affiliation(s)
- Purva Asrani
- Division of Biochemistry, Indian Agricultural Research Institute, New Delhi, India
| | - Gulam Mustafa Hasan
- Department of Biochemistry, College of Medicine, Prince Sattam Bin Abdulaziz University, Al-Kharj, Kingdom of Saudi Arabia
| | - Sukhwinder Singh Sohal
- Respiratory Translational Research Group, Department of Laboratory Medicine, School of Health Sciences, College of Health and Medicine, University of Tasmania, Launceston, Australia
| | - Md Imtaiyaz Hassan
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| |
Collapse
|
6
|
Current status of pan-genome analysis for pathogenic bacteria. Curr Opin Biotechnol 2020; 63:54-62. [DOI: 10.1016/j.copbio.2019.12.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 11/16/2019] [Accepted: 12/02/2019] [Indexed: 02/07/2023]
|