1
|
Multilocus sequence analysis reveals genetic diversity in Staphylococcus aureus isolate of goat with mastitis persistent after treatment with enrofloxacin. Sci Rep 2021; 11:17252. [PMID: 34446803 PMCID: PMC8390490 DOI: 10.1038/s41598-021-96764-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 07/28/2021] [Indexed: 02/07/2023] Open
Abstract
Staphylococcus aureus is one of the main bacterial agents responsible for cases of mastitis in ruminants, playing an important role in the persistence and chronicity of diseases treated with antimicrobials. Using the multilocus sequence typing technique, network approaches and study of the population diversity of microorganisms, we performed analyzes of S. aureus (ES-GPM) isolated from goats with persistent mastitis (GPM). The most strains of ES-GPM were categorically different phylogenetically from the others and could be divided into two lineages: one with a majority belonging to ES-GPM and the other to varied strains. These two lineages were separated by 27 nuclear polymorphisms. The 43 strains comprised 22 clonal complexes (CCs), of which the ES-GPM strains were present in CC133, CC5 and a new complex formed by the sequence type 4966. The genetic diversity of some alleles showed be greater diversity and polymorphism than others, such as of the aroE and yqiL genes less than glpF gene. In addition, the sequences ES-GPM to the arc gene and glpF alleles showed the greatest number of mutations for ES-GPM in relation to non-ES-GPM. Therefore, this study identified genetic polymorphisms characteristic of S. aureus isolated from milk of goats diagnosed with persistent mastitis after the failed treatment with the antibiotic enrofloxacin. This study may help in the future to identify and discriminate this agent in cases of mastitis, and with that, the most appropriate antibiotic treatment can be performed in advance of the appearance of persistent mastitis caused by the agent, reducing the chances of premature culling and animal suffering.
Collapse
|
2
|
Zondervan NA, Martins Dos Santos VAP, Suarez-Diez M, Saccenti E. Phenotype and multi-omics comparison of Staphylococcus and Streptococcus uncovers pathogenic traits and predicts zoonotic potential. BMC Genomics 2021; 22:102. [PMID: 33541265 PMCID: PMC7860044 DOI: 10.1186/s12864-021-07388-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 01/13/2021] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Staphylococcus and Streptococcus species can cause many different diseases, ranging from mild skin infections to life-threatening necrotizing fasciitis. Both genera consist of commensal species that colonize the skin and nose of humans and animals, and of which some can display a pathogenic phenotype. RESULTS We compared 235 Staphylococcus and 315 Streptococcus genomes based on their protein domain content. We show the relationships between protein persistence and essentiality by integrating essentiality predictions from two metabolic models and essentiality measurements from six large-scale transposon mutagenesis experiments. We identified clusters of strains within species based on proteins associated to similar biological processes. We built Random Forest classifiers that predicted the zoonotic potential. Furthermore, we identified shared attributes between of Staphylococcus aureus and Streptococcus pyogenes that allow them to cause necrotizing fasciitis. CONCLUSIONS Differences observed in clustering of strains based on functional groups of proteins correlate with phenotypes such as host tropism, capability to infect multiple hosts and drug resistance. Our method provides a solid basis towards large-scale prediction of phenotypes based on genomic information.
Collapse
Affiliation(s)
- Niels A Zondervan
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Stippeneng 4, 6708WE, Wageningen, Netherlands
| | - Vitor A P Martins Dos Santos
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Stippeneng 4, 6708WE, Wageningen, Netherlands
- LifeGlimmer GmBH, Markelstraße 38, 12163, Berlin, Germany
| | - Maria Suarez-Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Stippeneng 4, 6708WE, Wageningen, Netherlands
| | - Edoardo Saccenti
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Stippeneng 4, 6708WE, Wageningen, Netherlands.
| |
Collapse
|
3
|
Zhou Y, Zhang C. Evolutionary patterns of chimeric retrogenes in Oryza species. Sci Rep 2019; 9:17733. [PMID: 31776387 PMCID: PMC6881317 DOI: 10.1038/s41598-019-54085-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 10/30/2019] [Indexed: 11/23/2022] Open
Abstract
Chimeric retroposition is a process by which RNA is reverse transcribed and the resulting cDNA is integrated into the genome along with flanking sequences. This process plays essential roles and drives genome evolution. Although the origination rates of chimeric retrogenes are high in plant genomes, the evolutionary patterns of the retrogenes and their parental genes are relatively uncharacterised in the rice genome. In this study, we evaluated the substitution ratio of 24 retrogenes and their parental genes to clarify their evolutionary patterns. The results indicated that seven gene pairs were under positive selection. Additionally, soon after new chimeric retrogenes were formed, they rapidly evolved. However, an unexpected pattern was also revealed. Specifically, after an undefined period following the formation of new chimeric retrogenes, the parental genes, rather than the new chimeric retrogenes, rapidly evolved under positive selection. We also observed that one retro chimeric gene (RCG3) was highly expressed in infected calli, whereas its parental gene was not. Finally, a comparison of our Ka/Ks analysis with that of other species indicated that the proportion of genes under positive selection is greater for chimeric retrogenes than for non-chimeric retrogenes in the rice genome.
Collapse
Affiliation(s)
- Yanli Zhou
- The Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, No. 132 Lanhei Road, Kunming, 650201, Yunnan, China
| | - Chengjun Zhang
- The Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, No. 132 Lanhei Road, Kunming, 650201, Yunnan, China. .,Haiyan Engineering & Technology Center, Kunming Institute of Botany, Chinese Academy of Science, Jiaxing, 314300, Zhejiang, China.
| |
Collapse
|
4
|
Dunn KA, Kenney T, Gu H, Bielawski JP. Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates. BMC Evol Biol 2019; 19:22. [PMID: 30642241 PMCID: PMC6332903 DOI: 10.1186/s12862-018-1326-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/11/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND An excess of nonsynonymous substitutions, over neutrality, is considered evidence of positive Darwinian selection. Inference for proteins often relies on estimation of the nonsynonymous to synonymous ratio (ω = dN/dS) within a codon model. However, to ease computational difficulties, ω is typically estimated assuming an idealized substitution process where (i) all nonsynonymous substitutions have the same rate (regardless of impact on organism fitness) and (ii) instantaneous double and triple (DT) nucleotide mutations have zero probability (despite evidence that they can occur). It follows that estimates of ω represent an imperfect summary of the intensity of selection, and that tests based on the ω > 1 threshold could be negatively impacted. RESULTS We developed a general-purpose parametric (GPP) modelling framework for codons. This novel approach allows specification of all possible instantaneous codon substitutions, including multiple nonsynonymous rates (MNRs) and instantaneous DT nucleotide changes. Existing codon models are specified as special cases of the GPP model. We use GPP models to implement likelihood ratio tests for ω > 1 that accommodate MNRs and DT mutations. Through both simulation and real data analysis, we find that failure to model MNRs and DT mutations reduces power in some cases and inflates false positives in others. False positives under traditional M2a and M8 models were very sensitive to DT changes. This was exacerbated by the choice of frequency parameterization (GY vs. MG), with rates sometimes > 90% under MG. By including MNRs and DT mutations, accuracy and power was greatly improved under the GPP framework. However, we also find that over-parameterized models can perform less well, and this can contribute to degraded performance of LRTs. CONCLUSIONS We suggest GPP models should be used alongside traditional codon models. Further, all codon models should be deployed within an experimental design that includes (i) assessing robustness to model assumptions, and (ii) investigation of non-standard behaviour of MLEs. As the goal of every analysis is to avoid false conclusions, more work is needed on model selection methods that consider both the increase in fit engendered by a model parameter and the degree to which that parameter is affected by un-modelled evolutionary processes.
Collapse
Affiliation(s)
- Katherine A. Dunn
- Department of Biology, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
| | - Toby Kenney
- Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
| | - Hong Gu
- Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
| | - Joseph P. Bielawski
- Department of Biology, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
- Department of Mathematics & Statistics, Dalhousie University, Halifax, Nova Scotia B3H 4J1 Canada
- Centre Comparative Genomics and Evolutionary Bioinformatics (CGEB) at Dalhousie University, Halifax, Canada
| |
Collapse
|
5
|
Genes under positive selection in the core genome of pathogenic Bacillus cereus group members. INFECTION GENETICS AND EVOLUTION 2018; 65:55-64. [PMID: 30006047 DOI: 10.1016/j.meegid.2018.07.009] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 07/06/2018] [Accepted: 07/07/2018] [Indexed: 11/24/2022]
Abstract
In this comparative genomics study our aim was to unravel genes under positive selection in the core genome of the Bacillus cereus group. Indeed, the members of this group share close genetic relationships but display a rather large phenotypic and ecological diversity, providing a unique opportunity for studying how genomic changes reflect ecological adaptation during the divergence of a bacterial group. For this purpose, we screened ten completely sequenced genomes of four pathogenic Bacillus species, finding that 254 out of 3093 genes have codon sites with dN/dS (ω) values above one. These results remained unchanged after having disentangled the confounding effects of recombination and selection signature in a Bayesian framework. The presumably adaptive nucleotide polymorphisms are distributed over a wide range of biological functions, such as antibiotic resistance, DNA repair, nutrient uptake, metabolism, cell wall assembly and spore structure. Our results indicate that adaptation to animal hosts, whether as pathogens, saprophytes or symbionts, is the major driving force in the evolution of the Bacillus cereus group. Future work should seek to understand the evolutionary dynamics of both core and accessory genes in an integrative framework to ultimately unravel the key networks involved in host adaptation.
Collapse
|
6
|
Yu D, Yin Z, Li B, Jin Y, Ren H, Zhou J, Zhou W, Liang L, Yue J. Gene flow, recombination, and positive selection in Stenotrophomonas maltophilia: mechanisms underlying the diversity of the widespread opportunistic pathogen. Genome 2016; 59:1063-1075. [PMID: 27696900 DOI: 10.1139/gen-2016-0073] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Stenotrophomonas maltophilia is a global multidrug-resistant human opportunistic pathogen in clinical environments. Stenotrophomonas maltophilia is also ubiquitous in aqueous environments, soil, and plants. Various molecular typing methods have revealed that S. maltophilia exhibits high levels of phenotypic and genotypic diversity. However, information regarding the genomic diversity within S. maltophilia and the corresponding genetic mechanisms resulting in said diversity remain scarce. The genome sequences of 17 S. maltophilia strains were selected to investigate the mechanisms contributing to genetic diversity at the genome level. The core and large pan-genomes of the species were first estimated, resulting in a large, open pan-genome. A species phylogeny was also reconstructed based on 344 orthologous genes with one copy per genome, and the contribution of four evolutionary mechanisms to the species genome diversity was quantified: 15%-35% of the genes showed evidence for recombination, 0%-25% of the genes in one genome were likely gained, 0%-44% of the genes in some genomes were likely lost, and less than 0.3% of the genes in a genome were under positive selection pressures. We observed that, among the four main mechanisms, homologous recombination plays a key role in maintaining diversity in S. maltophilia. In this study, we provide an overview of evolution in S. maltophilia to provide a better understanding of its evolutionary dynamics and its relationship with genome diversity.
Collapse
Affiliation(s)
- Dong Yu
- a Beijing Institute of Biotechnology, Beijing, China.,b Institute of Translational Medicine, The Second Military Medical University, Shanghai, China
| | - Zhiqiu Yin
- a Beijing Institute of Biotechnology, Beijing, China.,c College of Life Science, Anhui University, Hefei, Anhui, China
| | - Beiping Li
- a Beijing Institute of Biotechnology, Beijing, China
| | - Yuan Jin
- a Beijing Institute of Biotechnology, Beijing, China
| | - Hongguang Ren
- a Beijing Institute of Biotechnology, Beijing, China
| | - Jing Zhou
- a Beijing Institute of Biotechnology, Beijing, China
| | - Wei Zhou
- a Beijing Institute of Biotechnology, Beijing, China
| | - Long Liang
- a Beijing Institute of Biotechnology, Beijing, China.,c College of Life Science, Anhui University, Hefei, Anhui, China
| | - Junjie Yue
- a Beijing Institute of Biotechnology, Beijing, China
| |
Collapse
|
7
|
Mingrone J, Susko E, Bielawski J. Smoothed Bootstrap Aggregation for Assessing Selection Pressure at Amino Acid Sites. Mol Biol Evol 2016; 33:2976-2989. [PMID: 27486222 DOI: 10.1093/molbev/msw160] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
To detect positive selection at individual amino acid sites, most methods use an empirical Bayes approach. After parameters of a Markov process of codon evolution are estimated via maximum likelihood, they are passed to Bayes formula to compute the posterior probability that a site evolved under positive selection. A difficulty with this approach is that parameter estimates with large errors can negatively impact Bayesian classification. By assigning priors to some parameters, Bayes Empirical Bayes (BEB) mitigates this problem. However, as implemented, it imposes uniform priors, which causes it to be overly conservative in some cases. When standard regularity conditions are not met and parameter estimates are unstable, inference, even under BEB, can be negatively impacted. We present an alternative to BEB called smoothed bootstrap aggregation (SBA), which bootstraps site patterns from an alignment of protein coding DNA sequences to accommodate the uncertainty in the parameter estimates. We show that deriving the correction for parameter uncertainty from the data in hand, in combination with kernel smoothing techniques, improves site specific inference of positive selection. We compare BEB to SBA by simulation and real data analysis. Simulation results show that SBA balances accuracy and power at least as well as BEB, and when parameter estimates are unstable, the performance gap between BEB and SBA can widen in favor of SBA. SBA is applicable to a wide variety of other inference problems in molecular evolution.
Collapse
Affiliation(s)
- Joseph Mingrone
- Department of Mathematics and Statistics, Dalhousie University, Halifax, NS, Canada Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada
| | - Edward Susko
- Department of Mathematics and Statistics, Dalhousie University, Halifax, NS, Canada Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada
| | - Joseph Bielawski
- Department of Mathematics and Statistics, Dalhousie University, Halifax, NS, Canada Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada Department of Biology, Dalhousie University, Halifax, NS, Canada
| |
Collapse
|
8
|
Microbial Virulence and Interactions With Metals. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2016; 142:27-49. [DOI: 10.1016/bs.pmbts.2016.05.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
9
|
Simón M, Montiel R, Smerling A, Solórzano E, Díaz N, Álvarez-Sandoval BA, Jiménez-Marín AR, Malgosa A. Molecular analysis of ancient caries. Proc Biol Sci 2015; 281:rspb.2014.0586. [PMID: 25056622 DOI: 10.1098/rspb.2014.0586] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
An 84 base pair sequence of the Streptococcus mutans virulence factor, known as dextranase, has been obtained from 10 individuals from the Bronze Age to the Modern Era in Europe and from before and after the colonization in America. Modern samples show four polymorphic sites that have not been found in the ancient samples studied so far. The nucleotide and haplotype diversity of this region have increased over time, which could be reflecting the footprint of a population expansion. While this segment has apparently evolved according to neutral evolution, we have been able to detect one site that is under positive selection pressure both in present and past populations. This study is a first step to study the evolution of this microorganism, analysed using direct evidence obtained from ancient remains.
Collapse
Affiliation(s)
- Marc Simón
- Unitat d'Antropologia Biològica, Department of Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, Spain
| | - Rafael Montiel
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada, CINVESTAV-IPN. Km. 9.6 Libramiento Norte Carretera Irapuato - León, Irapuato, Guanajuato, Mexico
| | - Andrea Smerling
- Unitat d'Antropologia Biològica, Department of Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, Spain
| | - Eduvigis Solórzano
- Unitat d'Antropologia Biològica, Department of Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, Spain
| | - Nancy Díaz
- Unitat d'Antropologia Biològica, Department of Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, Spain
| | - Brenda A Álvarez-Sandoval
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada, CINVESTAV-IPN. Km. 9.6 Libramiento Norte Carretera Irapuato - León, Irapuato, Guanajuato, Mexico
| | - Andrea R Jiménez-Marín
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada, CINVESTAV-IPN. Km. 9.6 Libramiento Norte Carretera Irapuato - León, Irapuato, Guanajuato, Mexico
| | - Assumpció Malgosa
- Unitat d'Antropologia Biològica, Department of Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, Spain
| |
Collapse
|
10
|
Genomics of the Genus Bifidobacterium Reveals Species-Specific Adaptation to the Glycan-Rich Gut Environment. Appl Environ Microbiol 2015; 82:980-991. [PMID: 26590291 DOI: 10.1128/aem.03500-15] [Citation(s) in RCA: 140] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Bifidobacteria represent one of the dominant microbial groups that occur in the gut of various animals, being particularly prevalent during the suckling period of humans and other mammals. Their ability to compete with other gut bacteria is largely attributed to their saccharolytic features. Comparative and functional genomic as well as transcriptomic analyses have revealed the genetic background that underpins the overall saccharolytic phenotype for each of the 47 bifidobacterial (sub)species representing the genus Bifidobacterium, while also generating insightful information regarding carbohydrate resource sharing and cross-feeding among bifidobacteria. The abundance of bifidobacterial saccharolytic features in human microbiomes supports the notion that metabolic accessibility to dietary and/or host-derived glycans is a potent evolutionary force that has shaped the bifidobacterial genome.
Collapse
|
11
|
Hongo JA, de Castro GM, Cintra LC, Zerlotini A, Lobo FP. POTION: an end-to-end pipeline for positive Darwinian selection detection in genome-scale data through phylogenetic comparison of protein-coding genes. BMC Genomics 2015; 16:567. [PMID: 26231214 PMCID: PMC4521464 DOI: 10.1186/s12864-015-1765-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Accepted: 07/10/2015] [Indexed: 11/29/2022] Open
Abstract
Background Detection of genes evolving under positive Darwinian evolution in genome-scale data is nowadays a prevailing strategy in comparative genomics studies to identify genes potentially involved in adaptation processes. Despite the large number of studies aiming to detect and contextualize such gene sets, there is virtually no software available to perform this task in a general, automatic, large-scale and reliable manner. This certainly occurs due to the computational challenges involved in this task, such as the appropriate modeling of data under analysis, the computation time to perform several of the required steps when dealing with genome-scale data and the highly error-prone nature of the sequence and alignment data structures needed for genome-wide positive selection detection. Results We present POTION, an open source, modular and end-to-end software for genome-scale detection of positive Darwinian selection in groups of homologous coding sequences. Our software represents a key step towards genome-scale, automated detection of positive selection, from predicted coding sequences and their homology relationships to high-quality groups of positively selected genes. POTION reduces false positives through several sophisticated sequence and group filters based on numeric, phylogenetic, quality and conservation criteria to remove spurious data and through multiple hypothesis corrections, and considerably reduces computation time thanks to a parallelized design. Our software achieved a high classification performance when used to evaluate a curated dataset of Trypanosoma brucei paralogs previously surveyed for positive selection. When used to analyze predicted groups of homologous genes of 19 strains of Mycobacterium tuberculosis as a case study we demonstrated the filters implemented in POTION to remove sources of errors that commonly inflate errors in positive selection detection. A thorough literature review found no other software similar to POTION in terms of customization, scale and automation. Conclusion To the best of our knowledge, POTION is the first tool to allow users to construct and check hypotheses regarding the occurrence of site-based evidence of positive selection in non-curated, genome-scale data within a feasible time frame and with no human intervention after initial configuration. POTION is available at http://www.lmb.cnptia.embrapa.br/share/POTION/. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1765-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jorge A Hongo
- Laboratório Multiusuário de Bioinformática, Embrapa Informática Agropecuária, Empresa Brasileira de Pesquisa Agropecuária (Embrapa), Campinas, São Paulo, 13083-886, Brazil.
| | - Giovanni M de Castro
- Laboratório Multiusuário de Bioinformática, Embrapa Informática Agropecuária, Empresa Brasileira de Pesquisa Agropecuária (Embrapa), Campinas, São Paulo, 13083-886, Brazil.
| | - Leandro C Cintra
- Laboratório Multiusuário de Bioinformática, Embrapa Informática Agropecuária, Empresa Brasileira de Pesquisa Agropecuária (Embrapa), Campinas, São Paulo, 13083-886, Brazil.
| | - Adhemar Zerlotini
- Laboratório Multiusuário de Bioinformática, Embrapa Informática Agropecuária, Empresa Brasileira de Pesquisa Agropecuária (Embrapa), Campinas, São Paulo, 13083-886, Brazil.
| | - Francisco P Lobo
- Laboratório Multiusuário de Bioinformática, Embrapa Informática Agropecuária, Empresa Brasileira de Pesquisa Agropecuária (Embrapa), Campinas, São Paulo, 13083-886, Brazil.
| |
Collapse
|
12
|
Bessen DE, McShan WM, Nguyen SV, Shetty A, Agrawal S, Tettelin H. Molecular epidemiology and genomics of group A Streptococcus. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2015; 33:393-418. [PMID: 25460818 PMCID: PMC4416080 DOI: 10.1016/j.meegid.2014.10.011] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Revised: 10/11/2014] [Accepted: 10/13/2014] [Indexed: 12/15/2022]
Abstract
Streptococcus pyogenes (group A Streptococcus; GAS) is a strict human pathogen with a very high prevalence worldwide. This review highlights the genetic organization of the species and the important ecological considerations that impact its evolution. Recent advances are presented on the topics of molecular epidemiology, population biology, molecular basis for genetic change, genome structure and genetic flux, phylogenomics and closely related streptococcal species, and the long- and short-term evolution of GAS. The application of whole genome sequence data to addressing key biological questions is discussed.
Collapse
Affiliation(s)
- Debra E Bessen
- Department of Microbiology & Immunology, New York Medical College, Valhalla, NY 10595, USA.
| | - W Michael McShan
- University of Oklahoma Health Sciences Center, Department of Pharmaceutical Sciences, College of Pharmacy, Oklahoma City, OK 73117, USA.
| | - Scott V Nguyen
- University of Oklahoma Health Sciences Center, Department of Pharmaceutical Sciences, College of Pharmacy, Oklahoma City, OK 73117, USA.
| | - Amol Shetty
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA.
| | - Sonia Agrawal
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA.
| | - Hervé Tettelin
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA.
| |
Collapse
|
13
|
Couvigny B, Thérial C, Gautier C, Renault P, Briandet R, Guédon E. Streptococcus thermophilus Biofilm Formation: A Remnant Trait of Ancestral Commensal Life? PLoS One 2015; 10:e0128099. [PMID: 26035177 PMCID: PMC4452758 DOI: 10.1371/journal.pone.0128099] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 04/23/2015] [Indexed: 12/27/2022] Open
Abstract
Microorganisms have a long history of use in food production and preservation. Their adaptation to food environments has profoundly modified their features, mainly through genomic flux. Streptococcus thermophilus, one of the most frequent starter culture organisms consumed daily by humans emerged recently from a commensal ancestor. As such, it is a useful model for genomic studies of bacterial domestication processes. Many streptococcal species form biofilms, a key feature of the major lifestyle of these bacteria in nature. However, few descriptions of S. thermophilus biofilms have been reported. An analysis of the ability of a representative collection of natural isolates to form biofilms revealed that S. thermophilus was a poor biofilm producer and that this characteristic was associated with an inability to attach firmly to surfaces. The identification of three biofilm-associated genes in the strain producing the most biofilms shed light on the reasons for the rarity of this trait in this species. These genes encode proteins involved in crucial stages of biofilm formation and are heterogeneously distributed between strains. One of the biofilm genes appears to have been acquired by horizontal transfer. The other two are located in loci presenting features of reductive evolution, and are absent from most of the strains analyzed. Their orthologs in commensal bacteria are involved in adhesion to host cells, suggesting that they are remnants of ancestral functions. The biofilm phenotype appears to be a commensal trait that has been lost during the genetic domestication of S. thermophilus, consistent with its adaptation to the milk environment and the selection of starter strains for dairy fermentations.
Collapse
Affiliation(s)
- Benoit Couvigny
- INRA, UMR 1319 Micalis, Domaine de Vilvert, F-78352 Jouy-en-Josas, France
- AgroParisTech, UMR MICALIS, Jouy-en-Josas, France
| | - Claire Thérial
- INRA, UMR 1319 Micalis, Domaine de Vilvert, F-78352 Jouy-en-Josas, France
- AgroParisTech, UMR MICALIS, Jouy-en-Josas, France
| | - Céline Gautier
- INRA, UMR 1319 Micalis, Domaine de Vilvert, F-78352 Jouy-en-Josas, France
- AgroParisTech, UMR MICALIS, Jouy-en-Josas, France
| | - Pierre Renault
- INRA, UMR 1319 Micalis, Domaine de Vilvert, F-78352 Jouy-en-Josas, France
- AgroParisTech, UMR MICALIS, Jouy-en-Josas, France
| | - Romain Briandet
- INRA, UMR 1319 Micalis, Domaine de Vilvert, F-78352 Jouy-en-Josas, France
- AgroParisTech, UMR MICALIS, Jouy-en-Josas, France
| | - Eric Guédon
- INRA, UMR 1319 Micalis, Domaine de Vilvert, F-78352 Jouy-en-Josas, France
- AgroParisTech, UMR MICALIS, Jouy-en-Josas, France
- * E-mail:
| |
Collapse
|
14
|
Marra NJ, DeWoody JA. Transcriptomic characterization of the immunogenetic repertoires of heteromyid rodents. BMC Genomics 2014; 15:929. [PMID: 25341737 PMCID: PMC4216838 DOI: 10.1186/1471-2164-15-929] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Accepted: 10/16/2014] [Indexed: 11/18/2022] Open
Abstract
Background When populations evolve under disparate environmental conditions, they experience different selective pressures that shape patterns of sequence evolution and gene expression. These may be manifested in genetic and phenotypic differences such as a diverse immunogenetic repertoire in species from tropical latitudes that have greater and/or different parasite burdens than more temperate species. To test this idea, we compared the transcriptomes of one tropical species (Heteromys desmarestianus) and two species from temperate latitudes (Dipodomys spectabilis and Chaetodipus baileyi) from the Heteromyidae. We did so in a search for positive selection on sequences and/or differential expression, while controlling for phylogenetic history in our choice of species. Results We identified 127,812 contigs and annotated 34,878 of these, identifying immune genes associated with interleukins, cytokines, and the production of mast cells. We identified 632 genes that were upregulated in H. desmarestianus (8.7% of genes tested) and 492 (6.7%) that were downregulated. Gene ontology terms including “immune response” were associated with 31 (4.9%) of the 632 upregulated genes. We found preliminary evidence for positive selection on three genes (Palmitoyltransferase ZDHHC5 Ubiquitin-conjugating enzyme E2 N, Krueppel-like factor 10, and Spindle and kinetochore-associated protein 1) along the H. desmarestianus lineage. Conclusions Overall our findings pinpoint genes in species from disparate environments that are on different evolutionary trajectories in terms of expression levels and/or nucleotide sequence. Our data indicate there are significant differences in the expression of genes among the spleen transcriptomes of these species and that a number of these differentially expressed genes do not show the same pattern of differential expression in another tissue type. This points to the possibility of expression differences between these species specific to the spleen transcriptome. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-929) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nicholas J Marra
- Department of Population Medicine and Diagnostic Sciences, Cornell University, S3-111 Schurman Hall, Tower Road, Ithaca, NY 14853, USA.
| | | |
Collapse
|
15
|
Investigation of the evolutionary development of the genus Bifidobacterium by comparative genomics. Appl Environ Microbiol 2014; 80:6383-94. [PMID: 25107967 DOI: 10.1128/aem.02004-14] [Citation(s) in RCA: 93] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The Bifidobacterium genus currently encompasses 48 recognized taxa, which have been isolated from different ecosystems. However, the current phylogeny of bifidobacteria is hampered by the relative paucity of genotypic data. Here, we reassessed the taxonomy of this bacterial genus using genome-based approaches, which demonstrated that the previous taxonomic view of bifidobacteria contained several inconsistencies. In particular, high levels of genetic relatedness were shown to exist between particular Bifidobacterium taxa which would not justify their status as separate species. The results presented are here based on average nucleotide identity analysis involving the genome sequences for each type strain of the 48 bifidobacterial taxa, as well as phylogenetic comparative analysis of the predicted core genome of the Bifidobacterium genus. The results of this study demonstrate that the availability of complete genome sequences allows the reconstruction of a more robust bifidobacterial phylogeny than that obtained from a single gene-based sequence comparison, thus discouraging the assignment of a new or separate bifidobacterial taxon without such a genome-based validation.
Collapse
|
16
|
Marra NJ, Romero A, DeWoody JA. Natural selection and the genetic basis of osmoregulation in heteromyid rodents as revealed by RNA-seq. Mol Ecol 2014; 23:2699-711. [PMID: 24754676 DOI: 10.1111/mec.12764] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Revised: 04/12/2014] [Accepted: 04/17/2014] [Indexed: 12/23/2022]
Abstract
One adaptation of ecological and evolutionary interest is the extraordinary ability of desert rodents to retain water during waste production. Much is known regarding the unique kidney physiology of kangaroo rats (Dipodomys spp.) and their ability to retain water during waste production, yet the genetic basis of these physiological adaptations is relatively unknown. Herein, we utilized RNA-seq data to conduct a comparative study to identify osmoregulatory genes expressed in heteromyid rodents. We sequenced kidney tissue from two temperate desert species (Dipodomys spectabilis and Chaetodipus baileyi) from two separate subfamilies of the Heteromyidae and compared these transcriptomes to a tropical mesic species (Heteromys desmarestianus) from a third subfamily. The evolutionary history of these subfamilies provided a robust phylogenetic control that allowed us to separate shared evolutionary history from convergence. Using two methods to detect differential expression (DE), we identified 1890 genes that showed consistent patterns of DE between the arid and mesic species. A three-species reciprocal BLAST analysis revealed 3511 sets of putative orthologues that, upon comparison to known Mus musculus sequences, revealed 323 annotated and full-length genic regions. Selection tests displayed evidence of positive selection (dn/ds > 1) on six genes in the two desert species and remained significant for one of these genes after correction for multiple testing. Thus, our data suggest that both the coding sequence and expression of genes have been shaped by natural selection to provide the genetic architecture for efficient osmoregulation in desert-adapted heteromyid rodents.
Collapse
Affiliation(s)
- Nicholas J Marra
- Department of Forestry & Natural Resources, Purdue University, West Lafayette, IN, 47907, USA
| | | | | |
Collapse
|
17
|
Cheng CP, Liu YC, Tsai YL, Tseng VS. An efficient method for mining cross-timepoint gene regulation sequential patterns from time course gene expression datasets. BMC Bioinformatics 2013; 14 Suppl 12:S3. [PMID: 24267918 PMCID: PMC3848764 DOI: 10.1186/1471-2105-14-s12-s3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Observation of gene expression changes implying gene regulations using a repetitive experiment in time course has become more and more important. However, there is no effective method which can handle such kind of data. For instance, in a clinical/biological progression like inflammatory response or cancer formation, a great number of differentially expressed genes at different time points could be identified through a large-scale microarray approach. For each repetitive experiment with different samples, converting the microarray datasets into transactional databases with significant singleton genes at each time point would allow sequential patterns implying gene regulations to be identified. Although traditional sequential pattern mining methods have been successfully proposed and widely used in different interesting topics, like mining customer purchasing sequences from a transactional database, to our knowledge, the methods are not suitable for such biological dataset because every transaction in the converted database may contain too many items/genes. RESULTS In this paper, we propose a new algorithm called CTGR-Span (Cross-Timepoint Gene Regulation Sequential pattern) to efficiently mine CTGR-SPs (Cross-Timepoint Gene Regulation Sequential Patterns) even on larger datasets where traditional algorithms are infeasible. The CTGR-Span includes several biologically designed parameters based on the characteristics of gene regulation. We perform an optimal parameter tuning process using a GO enrichment analysis to yield CTGR-SPs more meaningful biologically. The proposed method was evaluated with two publicly available human time course microarray datasets and it was shown that it outperformed the traditional methods in terms of execution efficiency. After evaluating with previous literature, the resulting patterns also strongly correlated with the experimental backgrounds of the datasets used in this study. CONCLUSIONS We propose an efficient CTGR-Span to mine several biologically meaningful CTGR-SPs. We postulate that the biologist can benefit from our new algorithm since the patterns implying gene regulations could provide further insights into the mechanisms of novel gene regulations during a biological or clinical progression. The Java source code, program tutorial and other related materials used in this program are available at http://websystem.csie.ncku.edu.tw/CTGR-Span.rar.
Collapse
|
18
|
Alves JMP, Serrano MG, Maia da Silva F, Voegtly LJ, Matveyev AV, Teixeira MMG, Camargo EP, Buck GA. Genome evolution and phylogenomic analysis of Candidatus Kinetoplastibacterium, the betaproteobacterial endosymbionts of Strigomonas and Angomonas. Genome Biol Evol 2013; 5:338-50. [PMID: 23345457 PMCID: PMC3590767 DOI: 10.1093/gbe/evt012] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
It has been long known that insect-infecting trypanosomatid flagellates from the genera Angomonas and Strigomonas harbor bacterial endosymbionts (Candidatus Kinetoplastibacterium or TPE [trypanosomatid proteobacterial endosymbiont]) that supplement the host metabolism. Based on previous analyses of other bacterial endosymbiont genomes from other lineages, a stereotypical path of genome evolution in such bacteria over the duration of their association with the eukaryotic host has been characterized. In this work, we sequence and analyze the genomes of five TPEs, perform their metabolic reconstruction, do an extensive phylogenomic analyses with all available Betaproteobacteria, and compare the TPEs with their nearest betaproteobacterial relatives. We also identify a number of housekeeping and central metabolism genes that seem to have undergone positive selection. Our genome structure analyses show total synteny among the five TPEs despite millions of years of divergence, and that this lineage follows the common path of genome evolution observed in other endosymbionts of diverse ancestries. As previously suggested by cell biology and biochemistry experiments, Ca. Kinetoplastibacterium spp. preferentially maintain those genes necessary for the biosynthesis of compounds needed by their hosts. We have also shown that metabolic and informational genes related to the cooperation with the host are overrepresented amongst genes shown to be under positive selection. Finally, our phylogenomic analysis shows that, while being in the Alcaligenaceae family of Betaproteobacteria, the closest relatives of these endosymbionts are not in the genus Bordetella as previously reported, but more likely in the Taylorella genus.
Collapse
Affiliation(s)
- João M P Alves
- Department of Microbiology and Immunology and the Center for the Study of Biological Complexity, Virginia Commonwealth University, VA, USA.
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Verma M, Lal D, Kaur J, Saxena A, Kaur J, Anand S, Lal R. Phylogenetic analyses of phylum Actinobacteria based on whole genome sequences. Res Microbiol 2013; 164:718-28. [DOI: 10.1016/j.resmic.2013.04.002] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2012] [Accepted: 03/26/2013] [Indexed: 11/25/2022]
|
20
|
Shao ZQ, Zhang YM, Pan XZ, Wang B, Chen JQ. Insight into the evolution of the histidine triad protein (HTP) family in Streptococcus. PLoS One 2013; 8:e60116. [PMID: 23527301 PMCID: PMC3603884 DOI: 10.1371/journal.pone.0060116] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 02/21/2013] [Indexed: 12/19/2022] Open
Abstract
The Histidine Triad Proteins (HTPs), also known as Pht proteins in Streptococcus pneumoniae, constitute a family of surface-exposed proteins that exist in many pathogenic streptococcal species. Although many studies have revealed the importance of HTPs in streptococcal physiology and pathogenicity, little is known about their origin and evolution. In this study, after identifying all htp homologs from 105 streptococcal genomes representing 38 different species/subspecies, we analyzed their domain structures, positions in genome, and most importantly, their evolutionary histories. By further projecting this information onto the streptococcal phylogeny, we made several major findings. First, htp genes originated earlier than the Streptococcus genus and gene-loss events have occurred among three streptococcal groups, resulting in the absence of the htp gene in the Bovis, Mutans and Salivarius groups. Second, the copy number of htp genes in other groups of Streptococcus is variable, ranging from one to four functional copies. Third, both phylogenetic evidence and domain structure analyses support the division of two htp subfamilies, designated as htp I and htp II. Although present mainly in the pyogenic group and in Streptococcus suis, htp II members are distinct from htp I due to the presence of an additional leucine-rich-repeat domain at the C-terminus. Finally, htp genes exhibit a faster nucleotide substitution rate than do housekeeping genes. Specifically, the regions outside the HTP domains are under strong positive selection. This distinct evolutionary pattern likely helped Streptococcus to easily escape from recognition by host immunity.
Collapse
Affiliation(s)
- Zhu-Qing Shao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
| | - Yan-Mei Zhang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
| | - Xiu-Zhen Pan
- Department of Epidemiology, Research Institute for Medicine of Nanjing Command, Nanjing, China
| | - Bin Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
- * E-mail: (BW); (JQC)
| | - Jian-Qun Chen
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
- * E-mail: (BW); (JQC)
| |
Collapse
|
21
|
Bryant J, Chewapreecha C, Bentley SD. Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences. Future Microbiol 2012; 7:1283-1296. [PMID: 23075447 PMCID: PMC3996552 DOI: 10.2217/fmb.12.108] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Evolution of bacterial pathogen populations has been detected in a variety of ways including phenotypic tests, such as metabolic activity, reaction to antisera and drug resistance and genotypic tests that measure variation in chromosome structure, repetitive loci and individual gene sequences. While informative, these methods only capture a small subset of the total variation and, therefore, have limited resolution. Advances in sequencing technologies have made it feasible to capture whole-genome sequence variation for each sample under study, providing the potential to detect all changes at all positions in the genome from single nucleotide changes to large-scale insertions and deletions. In this review, we focus on recent work that has applied this powerful new approach and summarize some of the advances that this has brought in our understanding of the details of how bacterial pathogens evolve.
Collapse
Affiliation(s)
- Josephine Bryant
- Pathogen Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, UK
| | - Claire Chewapreecha
- Pathogen Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, UK
| | - Stephen D Bentley
- Pathogen Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, UK
- Department of Medicine, University of Cambridge, Addenbrooke’s Hospital, Cambridge, CB2 0QQ, UK
| |
Collapse
|
22
|
Directional evolution of Chlamydia trachomatis towards niche-specific adaptation. J Bacteriol 2012; 194:6143-53. [PMID: 22961851 DOI: 10.1128/jb.01291-12] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
On behalf of the host-pathogen "arms race," a cutting-edge approach for elucidating genotype-phenotype relationships relies on the identification of positively selected loci involved in pathoadaptation. We studied the obligate intracellular bacterium Chlamydia trachomatis, for which same-species strains display a nearly identical core and pan genome, while presenting a wide range of tissue tropism and ecological success. We sought to evaluate the evolutionary patterns underlying species separation (divergence) and C. trachomatis serovar radiation (polymorphism) and to establish genotype-phenotype associations. By analyzing 60 Chlamydia strains, we detected traces of Muller's ratchet as a result of speciation and identified positively selected genes and codons hypothetically involved in the infection of different human cell types (e.g., columnar epithelial cells of ocular or genital mucosae and mononuclear phagocytes) and also events likely driving pathogenic and ecological success dissimilarities. In general, these genes code for proteins involved in immune response elicitation, proteolysis, and the subversion of host-cell functions, and also for proteins with unknown function(s). Several genes are potentially involved in more than one adaptive process, suggesting multiple functions or a distinct modus operandi for a specific function, and thus should be considered as crucial research targets. In addition, six of the nine genes encoding the putative antigen/adhesin polymorphic membrane proteins seem to be under positive selection along specific serovars, which sustains an essential biological role of this extra-large paralogue family in chlamydial pathobiology. This study provides insight into how evolutionary inferences illuminate ecological processes such as adaptation to different niches, pathogenicity, or ecological success driven by arms races.
Collapse
|
23
|
Quesada-Ocampo LM, Landers NA, Lebeis AC, Fulbright DW, Hausbeck MK. Genetic Structure of Clavibacter michiganensis subsp. michiganensis Populations in Michigan Commercial Tomato Fields. PLANT DISEASE 2012; 96:788-796. [PMID: 30727358 DOI: 10.1094/pdis-05-11-0402] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Clavibacter michiganensis subsp. michiganensis, causal agent of bacterial canker of tomato, is distinguished into four fingerprint types (A, B, C, and D) using BOX-polymerase chain reaction (PCR). To characterize the variation within the C. michiganensis subsp. michiganensis population in Michigan, 718 strains of C. michiganensis subsp. michiganensis were isolated from infected foliage and fruit collected from 14 and 9 Michigan commercial tomato fields in 1997 and 1998, respectively. The frequency of PCR types detected with BOX-PCR in all strains, and Bayesian cluster analysis, pairwise differentiation index comparisons, and genetic diversity estimates of 96 strains genotyped for six virulence-related genes revealed that C. michiganensis subsp. michiganensis populations in Michigan tomato fields are geographically structured. A multilocus haplotype cladogram was also consistent with geographic stratification in C. michiganensis subsp. Michiganensis populations. Some regions had strains predominantly of only one PCR type or belonging mostly to one genetic cluster, while other regions presented more diversity of occurrence of PCR types and genetic clusters. Results derived from this study provide information about the genetic structure of C. michiganensis subsp. michiganensis populations in Michigan and genetic diversity of C. michiganensis subsp. michiganensis inocula, which is key in developing disease management strategies.
Collapse
Affiliation(s)
| | | | | | | | - M K Hausbeck
- Professor, Department of Plant Pathology, Michigan State University, East Lansing 48824
| |
Collapse
|
24
|
Choi SC, Rasmussen MD, Hubisz MJ, Gronau I, Stanhope MJ, Siepel A. Replacing and additive horizontal gene transfer in Streptococcus. Mol Biol Evol 2012; 29:3309-20. [PMID: 22617954 DOI: 10.1093/molbev/mss138] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The prominent role of Horizontal Gene Transfer (HGT) in the evolution of bacteria is now well documented, but few studies have differentiated between evolutionary events that predominantly cause genes in one lineage to be replaced by homologs from another lineage ("replacing HGT") and events that result in the addition of substantial new genomic material ("additive HGT"). Here in, we make use of the distinct phylogenetic signatures of replacing and additive HGTs in a genome-wide study of the important human pathogen Streptococcus pyogenes (SPY) and its close relatives S. dysgalactiae subspecies equisimilis (SDE) and S. dysgalactiae subspecies dysgalactiae (SDD). Using recently developed statistical models and computational methods, we find evidence for abundant gene flow of both kinds within each of the SPY and SDE clades and of reduced levels of exchange between SPY and SDD. In addition, our analysis strongly supports a pronounced asymmetry in SPY-SDE gene flow, favoring the SPY-to-SDE direction. This finding is of particular interest in light of the recent increase in virulence of pathogenic SDE. We find much stronger evidence for SPY-SDE gene flow among replacing than among additive transfers, suggesting a primary influence from homologous recombination between co-occurring SPY and SDE cells in human hosts. Putative virulence genes are correlated with transfer events, but this correlation is found to be driven by additive, not replacing, HGTs. The genes affected by additive HGTs are enriched for functions having to do with transposition, recombination, and DNA integration, consistent with previous findings, whereas replacing HGTs seen to influence a more diverse set of genes. Additive transfers are also found to be associated with evidence of positive selection. These findings shed new light on the manner in which HGT has shaped pathogenic bacterial genomes.
Collapse
Affiliation(s)
- Sang Chul Choi
- Department of Biological Statistics and Computational Biology, Cornell University
| | | | | | | | | | | |
Collapse
|
25
|
Liu W, Fang L, Li M, Li S, Guo S, Luo R, Feng Z, Li B, Zhou Z, Shao G, Chen H, Xiao S. Comparative genomics of Mycoplasma: analysis of conserved essential genes and diversity of the pan-genome. PLoS One 2012; 7:e35698. [PMID: 22536428 PMCID: PMC3335003 DOI: 10.1371/journal.pone.0035698] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2011] [Accepted: 03/20/2012] [Indexed: 12/04/2022] Open
Abstract
Mycoplasma, the smallest self-replicating organism with a minimal metabolism and little genomic redundancy, is expected to be a close approximation to the minimal set of genes needed to sustain bacterial life. This study employs comparative evolutionary analysis of twenty Mycoplasma genomes to gain an improved understanding of essential genes. By analyzing the core genome of mycoplasmas, we finally revealed the conserved essential genes set for mycoplasma survival. Further analysis showed that the core genome set has many characteristics in common with experimentally identified essential genes. Several key genes, which are related to DNA replication and repair and can be disrupted in transposon mutagenesis studies, may be critical for bacteria survival especially over long period natural selection. Phylogenomic reconstructions based on 3,355 homologous groups allowed robust estimation of phylogenetic relatedness among mycoplasma strains. To obtain deeper insight into the relative roles of molecular evolution in pathogen adaptation to their hosts, we also analyzed the positive selection pressures on particular sites and lineages. There appears to be an approximate correlation between the divergence of species and the level of positive selection detected in corresponding lineages.
Collapse
Affiliation(s)
- Wei Liu
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Liurong Fang
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Mao Li
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Sha Li
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Shaohua Guo
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Rui Luo
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Zhixin Feng
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Nanjing, People's Republic of China
| | - Bin Li
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Nanjing, People's Republic of China
| | - Zhemin Zhou
- Environmental Research Institute, University College Cork, Cork, Ireland
| | - Guoqing Shao
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Nanjing, People's Republic of China
| | - Huanchun Chen
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Shaobo Xiao
- Division of Animal Infectious Diseases, State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan, People's Republic of China
- * E-mail:
| |
Collapse
|
26
|
Suzuki H, Stanhope MJ. Functional bias of positively selected genes in Streptococcus genomes. INFECTION GENETICS AND EVOLUTION 2011; 12:274-7. [PMID: 22155358 DOI: 10.1016/j.meegid.2011.11.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2011] [Revised: 11/07/2011] [Accepted: 11/10/2011] [Indexed: 11/26/2022]
Abstract
Rates of nonsynonymous substitution (dN) significantly higher than rates of synonymous substitution (dS) have been used as evidence of positive selection for the fixation of advantageous point mutations. It has been suggested that positive selection contributes to the evolution of virulence factors and certain functional categories in bacterial pathogens. The genus Streptococcus contains a number of important human and agricultural pathogens. Here we assessed positive selection across 13 Streptococcus species, and their relationship with virulence factors and functional categories. We found that known virulence genes were subject to positive selection pressure as much as other genes. After false discovery rate correction for multiple comparisons, no functional categories were significantly over- or under-represented in positively selected genes relative to other genes. Our results suggest that within the genus Streptococcus positive selection based on dN/dS ratios is not distributed with bias across biological functions.
Collapse
Affiliation(s)
- Haruo Suzuki
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
27
|
Brieuc MSO, Naish KA. Detecting signatures of positive selection in partial sequences generated on a large scale: pitfalls, procedures and resources. Mol Ecol Resour 2011; 11 Suppl 1:172-83. [PMID: 21429173 DOI: 10.1111/j.1755-0998.2010.02948.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Studying the actions of selection provides insight into adaptation, population divergence and gene function. Next-generation sequencing produces large amounts of partial sequences, potentially facilitating efforts to detect signatures of selection based on comparisons between synonymous (d(S)) and nonsynonymous (d(N)) substitutions, and single nucleotide polymorphism assays placed in selected genes would improve the ability to study adaptation in population surveys. However, sequences generated by these technologies are typically short. In nonmodel organisms that are a focus of evolutionary studies, the lack of a reference genome that facilitates the assembly of short sequences has limited surveys of positive selection in large numbers of genes. Here, we describe a series of steps to facilitate these surveys. We provide PERL scripts to assist data analysis, and describe the use of commonly available programs. We demonstrate these approaches in six salmon species, which have partially duplicated genomes. We recommend using multiway blast to optimize the number of alignments between partial coding sequences. Reading frames should be manually detected after alignment with sequences in Genbank using the BLASTX program. We encourage the use of a phylogenetic approach to separate orthologs from paralogs in duplicated genomes. Simple simulations on a gene known to have undergone selection in salmon species, transferrin, showed that the ability to detect selection in short sequences (<600 bp) depended on the proportion of codons under selection (1-2%) within that sequence. This relationship was less relevant in longer sequences. In this exploratory study, we detected 11 genes showing evidence of positive selection.
Collapse
Affiliation(s)
- Marine S O Brieuc
- School of Aquatic and Fishery Sciences, University of Washington, Seattle, 98195, USA.
| | | |
Collapse
|
28
|
Esteban DJ, Hutchinson AP. Genes in the terminal regions of orthopoxvirus genomes experience adaptive molecular evolution. BMC Genomics 2011; 12:261. [PMID: 21605412 PMCID: PMC3123329 DOI: 10.1186/1471-2164-12-261] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2010] [Accepted: 05/23/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Orthopoxviruses are dsDNA viruses with large genomes, some encoding over 200 genes. Genes essential for viral replication are located in the center of the linear genome and genes encoding host response modifiers and other host interacting proteins are located in the terminal regions. The central portion of the genome is highly conserved, both in gene content and sequence, while the terminal regions are more diverse. In this study, we investigated the role of adaptive molecular evolution in poxvirus genes and the selective pressures that act on the different regions of the genome. The relative fixation rates of synonymous and non-synonymous mutations (the d(N)/d(S) ratio) are an indicator of the mechanism of evolution of sequences, and can be used to identify purifying, neutral, or diversifying selection acting on a gene. Like highly conserved residues, amino acids under diversifying selection may be functionally important. Many genes experiencing diversifying selection are involved in host-pathogen interactions, such as antigen-antibody interactions, or the "host-pathogen arms race." RESULTS We analyzed 175 gene families from orthopoxviruses for evidence of diversifying selection. 79 genes were identified as experiencing diversifying selection, 25 with high confidence. Many of these genes are located in the terminal regions of the genome and function to modify the host response to infection or are virion-associated, indicating a greater role for diversifying selection in host-interacting genes. Of the 79 genes, 20 are of unknown function, and implicating diversifying selection as an important mechanism in their evolution may help characterize their function or identify important functional residues. CONCLUSIONS We conclude that diversifying selection is an important mechanism of orthopoxvirus evolution. Diversifying selection in poxviruses may be the result of interaction with host defense mechanisms.
Collapse
Affiliation(s)
- David J Esteban
- Biology Department, Vassar College, Poughkeepsie, NY 12604, USA.
| | | |
Collapse
|
29
|
Markova-Raina P, Petrov D. High sensitivity to aligner and high rate of false positives in the estimates of positive selection in the 12 Drosophila genomes. Genome Res 2011; 21:863-74. [PMID: 21393387 DOI: 10.1101/gr.115949.110] [Citation(s) in RCA: 122] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
We investigate the effect of aligner choice on inferences of positive selection using site-specific models of molecular evolution. We find that independently of the choice of aligner, the rate of false positives is unacceptably high. Our study is a whole-genome analysis of all protein-coding genes in 12 Drosophila genomes annotated in either all 12 species (~6690 genes) or in the six melanogaster group species. We compare six popular aligners: PRANK, T-Coffee, ClustalW, ProbCons, AMAP, and MUSCLE, and find that the aligner choice strongly influences the estimates of positive selection. Differences persist when we use (1) different stringency cutoffs, (2) different selection inference models, (3) alignments with or without gaps, and/or additional masking, (4) per-site versus per-gene statistics, (5) closely related melanogaster group species versus more distant 12 Drosophila genomes. Furthermore, we find that these differences are consequential for downstream analyses such as determination of over/under-represented GO terms associated with positive selection. Visual analysis indicates that most sites inferred as positively selected are, in fact, misaligned at the codon level, resulting in false positive rates of 48%-82%. PRANK, which has been reported to outperform other aligners in simulations, performed best in our empirical study as well. Unfortunately, PRANK still had a high, and unacceptable for most applications, false positives rate of 50%-55%. We identify misannotations and indels, many of which appear to be located in disordered protein regions, as primary culprits for the high misalignment-related error levels and discuss possible workaround approaches to this apparently pervasive problem in genome-wide evolutionary analyses.
Collapse
Affiliation(s)
- Penka Markova-Raina
- Department of Biology, Stanford University, Stanford, California 94305, USA.
| | | |
Collapse
|
30
|
Ijaz M, Velineni S, Timoney JF. Selective pressure for allelic diversity in SeM of Streptococcus equi does not affect immunoreactive proteins SzPSe or Se18.9. INFECTION GENETICS AND EVOLUTION 2011; 11:1159-63. [PMID: 21256981 DOI: 10.1016/j.meegid.2011.01.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2010] [Revised: 12/22/2010] [Accepted: 01/16/2011] [Indexed: 01/25/2023]
Abstract
Streptococcus equi, a clone or biovar of an ancestral Streptococcus zooepidemicus of Lancefield group C causes equine strangles, a highly contagious tonsillitis and lymphadenitis of the head and neck. At least 74 alleles based on N-terminal amino acid sequence of the anti-phagocytic SeM have been observed among isolates of S. equi from N. America, Europe and Japan. A d(N)/d(S) ratio of 5.93 for the 5' region of sem is indicative of positive selective pressure. The aim of this study was to determine whether variations in SeM were accompanied by variations in the surface exposed SzPSe and secreted Se18.9, both of which bind to equine tonsillar epithelium and, along with SeM, elicit strong nasopharyngeal IgA responses during convalescence. Sequences of genes for these proteins from 25 S. equi expressing 19 different SeM alleles isolated over 40 years in different countries were compared. No variation was observed in szpse, except for an Australian isolate with a deletion of a single repeat in the 3' end of the gene. Interestingly, only two SNP loci were detected in se18.9 compared to 93 and 55 in sem and szpse, respectively. The high frequency of nucleotide substitutions in szpse may be related to its mosaic structure since this gene in S. zooepidemicus exists in a variety of combinations of sequence segments and has a central hypervariable region that includes exogenous DNA sequence based on an atypical G-C percentage. In summary, the results of this study document very different responses of streptococcal genes for 3 immunoreactive proteins to selection pressure of the nasopharyngeal mucosal immune response.
Collapse
Affiliation(s)
- Muhammad Ijaz
- Maxwell H Gluck Equine Research Center, University of Kentucky, Lexington, KY 40546-0099, United States
| | | | | |
Collapse
|
31
|
Moreno M, Marinotti O, Krzywinski J, Tadei WP, James AA, Achee NL, Conn JE. Complete mtDNA genomes of Anopheles darlingi and an approach to anopheline divergence time. Malar J 2010; 9:127. [PMID: 20470395 PMCID: PMC2877063 DOI: 10.1186/1475-2875-9-127] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2010] [Accepted: 05/14/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The complete sequences of the mitochondrial genomes (mtDNA) of members of the northern and southern genotypes of Anopheles (Nyssorhynchus) darlingi were used for comparative studies to estimate the time to the most recent common ancestor for modern anophelines, to evaluate differentiation within this taxon, and to seek evidence of incipient speciation. METHODS The mtDNAs were sequenced from mosquitoes from Belize and Brazil and comparative analyses of structure and base composition, among others, were performed. A maximum likelihood approach linked with phylogenetic information was employed to detect evidence of selection and a Bayesian approach was used to date the split between the subgenus Nyssorhynchus and other Anopheles subgenera. RESULTS The comparison of mtDNA sequences within the Anopheles darlingi taxon does not provide sufficient resolution to establish different units of speciation within the species. In addition, no evidence of positive selection in any protein-coding gene of the mtDNA was detected, and purifying selection likely is the basis for this lack of diversity. Bayesian analysis supports the conclusion that the most recent ancestor of Nyssorhynchus and Anopheles+Cellia was extant ~94 million years ago. CONCLUSION Analyses of mtDNA genomes of Anopheles darlingi do not provide support for speciation in the taxon. The dates estimated for divergence among the anopheline groups tested is in agreement with the geological split of western Gondwana (95 mya), and provides additional support for explaining the absence of Cellia in the New World, and Nyssorhynchus in the Afro-Eurasian continents.
Collapse
Affiliation(s)
- Marta Moreno
- Griffin Laboratory, New York State Department of Health, Wadsworth Center, 5668 State Farm Road, Slingerlands, NY 12159, USA.
| | | | | | | | | | | | | |
Collapse
|
32
|
Three paralogous LysR-type transcriptional regulators control sulfur amino acid supply in Streptococcus mutans. J Bacteriol 2010; 192:3464-73. [PMID: 20418399 DOI: 10.1128/jb.00119-10] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The genome of Streptococcus mutans encodes 4 LysR-type transcriptional regulators (LTTRs), three of which, MetR, CysR (cysteine synthesis regulator), and HomR (homocysteine synthesis regulator), are phylogenetically related. MetR was previously shown to control methionine metabolic gene expression. Functional analysis of CysR and HomR was carried out by phenotypical studies and transcriptional analysis. CysR is required to activate the transcription of cysK encoding the cysteine biosynthesis enzyme, tcyABC and gshT genes encoding cysteine and glutathione transporter systems, and homR. HomR activates the transcription of metBC encoding methionine biosynthesis enzymes, tcyDEFGH involved in cysteine transport, and still uncharacterized thiosulfate assimilation genes. Control of HomR by CysR provides evidence of a cascade regulation for sulfur amino acid metabolism in S. mutans. Two conserved motifs were found in the promoter regions of CysR and HomR target genes, suggesting their role in the regulator binding recognition site. Both CysR and HomR require O-acetylserine to activate transcription. A global sulfur amino acid supply gene regulatory pathway is proposed for S. mutans, including the cascade regulation consequent to transcriptional activation of HomR by CysR. Phylogenetic study of MetR, CysR, and HomR homologues and comparison of their potential regulatory patterns among the Streptococcaceae suggest their rapid evolution.
Collapse
|
33
|
Positive selection identifies an in vivo role for FimH during urinary tract infection in addition to mannose binding. Proc Natl Acad Sci U S A 2009; 106:22439-44. [PMID: 20018753 DOI: 10.1073/pnas.0902179106] [Citation(s) in RCA: 130] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
FimH, the type 1 pilus adhesin of uropathogenic Escherichia coli (UPEC), contains a receptor-binding domain with an acidic binding pocket specific for mannose. The fim operon, and thus type 1 pilus production, is under transcriptional control via phase variation of an invertible promoter element. FimH is critical during urinary tract infection for mediating colonization and invasion of the bladder epithelium and establishment of intracellular bacterial communities (IBCs). In silico analysis of FimH gene sequences from 279 E. coli strains identified specific amino acids evolving under positive selection outside of its mannose-binding pocket. Mutating two of these residues (A27V/V163A) had no effect on phase variation, pilus assembly, or mannose binding in vitro. However, compared to wild-type, this double mutant strain exhibited a 10,000-fold reduction in mouse bladder colonization 24 h after inoculation and was unable to form IBCs even though it bound normally to mannosylated receptors in the urothelium. In contrast, the single A62S mutation altered phase variation, reducing the proportion of piliated cells, reduced mannose binding 8-fold, and decreased bladder colonization 30-fold in vivo compared to wild-type. A phase-locked ON A62S mutant restored virulence to wild-type levels even though in vitro mannose binding remained impaired. Thus, positive selection analysis of FimH has separated mannose binding from in vivo fitness, suggesting that IBC formation is critical for successful infection of the mammalian bladder, providing support for more general use of in silico positive selection analysis to define the molecular underpinnings of bacterial pathogenesis.
Collapse
|
34
|
Aguileta G, Lengelle J, Marthey S, Chiapello H, Rodolphe F, Gendrault A, Yockteng R, Vercken E, Devier B, Fontaine MC, Wincker P, Dossat C, Cruaud C, Couloux A, Giraud T. Finding candidate genes under positive selection in Non-model species: examples of genes involved in host specialization in pathogens. Mol Ecol 2009; 19:292-306. [PMID: 20041992 DOI: 10.1111/j.1365-294x.2009.04454.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Numerous genes in diverse organisms have been shown to be under positive selection, especially genes involved in reproduction, adaptation to contrasting environments, hybrid inviability, and host-pathogen interactions. Looking for genes under positive selection in pathogens has been a priority in efforts to investigate coevolution dynamics and to develop vaccines or drugs. To elucidate the functions involved in host specialization, here we aimed at identifying candidate sequences that could have evolved under positive selection among closely related pathogens specialized on different hosts. For this goal, we sequenced c. 17,000-32,000 ESTs from each of four Microbotryum species, which are fungal pathogens responsible for anther smut disease on host plants in the Caryophyllaceae. Forty-two of the 372 predicted orthologous genes showed significant signal of positive selection, which represents a good number of candidate genes for further investigation. Sequencing 16 of these genes in 9 additional Microbotryum species confirmed that they have indeed been rapidly evolving in the pathogen species specialized on different hosts. The genes showing significant signals of positive selection were putatively involved in nutrient uptake from the host, secondary metabolite synthesis and secretion, respiration under stressful conditions and stress response, hyphal growth and differentiation, and regulation of expression by other genes. Many of these genes had transmembrane domains and may therefore also be involved in pathogen recognition by the host. Our approach thus revealed fruitful and should be feasible for many non-model organisms for which candidate genes for diversifying selection are needed.
Collapse
Affiliation(s)
- G Aguileta
- Ecologie, Systématique et Evolution, Université Paris-Sud, F-91405 Orsay cedex, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Ho Sui SJ, Fedynak A, Hsiao WWL, Langille MGI, Brinkman FSL. The association of virulence factors with genomic islands. PLoS One 2009; 4:e8094. [PMID: 19956607 PMCID: PMC2779486 DOI: 10.1371/journal.pone.0008094] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Accepted: 11/07/2009] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND It has been noted that many bacterial virulence factor genes are located within genomic islands (GIs; clusters of genes in a prokaryotic genome of probable horizontal origin). However, such studies have been limited to single genera or isolated observations. We have performed the first large-scale analysis of multiple diverse pathogens to examine this association. We additionally identified genes found predominantly in pathogens, but not non-pathogens, across multiple genera using 631 complete bacterial genomes, and we identified common trends in virulence for genes in GIs. Furthermore, we examined the relationship between GIs and clustered regularly interspaced palindromic repeats (CRISPRs) proposed to confer resistance to phage. METHODOLOGY/PRINCIPAL FINDINGS We show quantitatively that GIs disproportionately contain more virulence factors than the rest of a given genome (p<1E-40 using three GI datasets) and that CRISPRs are also over-represented in GIs. Virulence factors in GIs and pathogen-associated virulence factors are enriched for proteins having more "offensive" functions, e.g. active invasion of the host, and are disproportionately components of type III/IV secretion systems or toxins. Numerous hypothetical pathogen-associated genes were identified, meriting further study. CONCLUSIONS/SIGNIFICANCE This is the first systematic analysis across diverse genera indicating that virulence factors are disproportionately associated with GIs. "Offensive" virulence factors, as opposed to host-interaction factors, may more often be a recently acquired trait (on an evolutionary time scale detected by GI analysis). Newly identified pathogen-associated genes warrant further study. We discuss the implications of these results, which cement the significant role of GIs in the evolution of many pathogens.
Collapse
Affiliation(s)
- Shannan J. Ho Sui
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Amber Fedynak
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - William W. L. Hsiao
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Morgan G. I. Langille
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Fiona S. L. Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
- * E-mail:
| |
Collapse
|
36
|
High frequency of hotspot mutations in core genes of Escherichia coli due to short-term positive selection. Proc Natl Acad Sci U S A 2009; 106:12412-7. [PMID: 19617543 DOI: 10.1073/pnas.0906217106] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Core genes comprising the ubiquitous backbone of bacterial genomes are not subject to frequent horizontal transfer and generally are not thought to contribute to the adaptive evolution of bacterial pathogens. We determined, however, that at least one-third and possibly more than one-half of the core genes in Escherichia coli genomes are targeted by repeated replacement substitutions in the same amino acid positions-hotspot mutations. Occurrence of hotspot mutations is driven by positive selection, as their rate is significantly higher than expected by random chance alone, and neither intragenic recombination nor increased mutability can explain the observed patterns. Also, commensal E. coli strains have a significantly lower frequency of mutated genes and mutations per genome than pathogenic strains. E. coli strains causing extra-intestinal infections accumulate hotspot mutations at the highest rate, whereas the highest total number of mutated genes has been found among Shigella isolates, suggesting the pathoadaptive nature of such mutations. The vast majority of hotspot mutations are of recent evolutionary origin, implying short-term positive selection, where adaptive mutations emerge repeatedly but are not sustained in natural circulation for long. Such pattern of dynamics is consistent with source-sink model of virulence evolution.
Collapse
|
37
|
Bessen DE. Population biology of the human restricted pathogen, Streptococcus pyogenes. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2009; 9. [PMID: 19460325 PMCID: PMC2685916 DOI: 10.1016/j.meegid.2009.03.00] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Streptococcus pyogenes, also referred to as beta-hemolytic group A streptococci, are strictly human pathogens with a global distribution and high prevalence of infection. The organisms are characterized by high levels of genetic recombination, extensive strain diversity, and a narrow habitat. This review highlights many key features of the population genetics and molecular epidemiology of this biologically diverse bacterial species, with special emphasis on ecological subdivisions and tissue-specific infections, strain diversity and population dynamics in communities, selection pressures arising from the specific host immune response and antibiotic exposure, and within-host selection during the course of invasive disease.
Collapse
Affiliation(s)
- Debra E. Bessen
- Department of Microbiology & Immunology, New York Medical College, Valhalla, NY, USA 10595, , +1-914-594-4193
| |
Collapse
|
38
|
Merkl R, Wiezer A. GO4genome: a prokaryotic phylogeny based on genome organization. J Mol Evol 2009; 68:550-62. [PMID: 19436929 PMCID: PMC3085772 DOI: 10.1007/s00239-009-9233-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2008] [Revised: 03/10/2009] [Accepted: 04/03/2009] [Indexed: 11/24/2022]
Abstract
Determining the phylogeny of closely related prokaryotes may fail in an analysis of rRNA or a small set of sequences. Whole-genome phylogeny utilizes the maximally available sample space. For a precise determination of genome similarity, two aspects have to be considered when developing an algorithm of whole-genome phylogeny: (1) gene order conservation is a more precise signal than gene content; and (2) when using sequence similarity, failures in identifying orthologues or the in situ replacement of genes via horizontal gene transfer may give misleading results. GO4genome is a new paradigm, which is based on a detailed analysis of gene function and the location of the respective genes. For characterization of genes, the algorithm uses gene ontology enabling a comparison of function independent of evolutionary relationship. After the identification of locally optimal series of gene functions, their length distribution is utilized to compute a phylogenetic distance. The outcome is a classification of genomes based on metabolic capabilities and their organization. Thus, the impact of effects on genome organization that are not covered by methods of molecular phylogeny can be studied. Genomes of strains belonging to Escherichia coli, Shigella, Streptococcus, Methanosarcina, and Yersinia were analyzed. Differences from the findings of classical methods are discussed.
Collapse
Affiliation(s)
- Rainer Merkl
- Institut für Biophysik und Physikalische Biochemie, Universität Regensburg, 93040, Regensburg, Germany.
| | | |
Collapse
|
39
|
Bessen DE. Population biology of the human restricted pathogen, Streptococcus pyogenes. INFECTION GENETICS AND EVOLUTION 2009; 9:581-93. [PMID: 19460325 DOI: 10.1016/j.meegid.2009.03.002] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2008] [Revised: 02/24/2009] [Accepted: 03/04/2009] [Indexed: 12/31/2022]
Abstract
Streptococcus pyogenes, also referred to as beta-hemolytic group A streptococci, are strictly human pathogens with a global distribution and high prevalence of infection. The organisms are characterized by high levels of genetic recombination, extensive strain diversity, and a narrow habitat. This review highlights many key features of the population genetics and molecular epidemiology of this biologically diverse bacterial species, with special emphasis on ecological subdivisions and tissue-specific infections, strain diversity and population dynamics in communities, selection pressures arising from the specific host immune response and antibiotic exposure, and within-host selection during the course of invasive disease.
Collapse
Affiliation(s)
- Debra E Bessen
- Department of Microbiology & Immunology, New York Medical College, Valhalla, NY 10595, USA.
| |
Collapse
|
40
|
Lefébure T, Stanhope MJ. Pervasive, genome-wide positive selection leading to functional divergence in the bacterial genus Campylobacter. Genome Res 2009; 19:1224-32. [PMID: 19304960 DOI: 10.1101/gr.089250.108] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
An open question in bacterial genomics is the role that adaptive evolution of the core genome plays in diversification and adaptation of bacterial species, and how this might differ between groups of bacteria occupying different environmental circumstances. The genus Campylobacter encompasses several important human and animal enteric pathogens, with genome sequence data available for eight species. We estimate the Campylobacter core genome at 647 genes, with 92.5% of the nonrecombinant core genome loci under positive selection in at least one lineage and the same gene frequently under positive selection in multiple lineages. Tests are provided that reject recombination, saturation, and variation in codon usage bias as factors contributing to this high level of selection. We suggest this genome-wide adaptive evolution may result from a Red Queen macroevolutionary dynamic, in which species are involved in competition for resources within the mammalian and/or vertebrate gastrointestinal tract. Much reduced levels of positive selection evident in Streptococcus, as reported by the authors in an earlier work, may be a consequence of these taxa inhabiting less species-rich habitats, and more unique niches. Despite many common loci under positive selection in multiple Campylobacter lineages, we found no evidence for molecular adaptive convergence at the level of the same or adjacent codons, or even protein domains. Taken collectively, these results describe the diversification of a bacterial genus that involves pervasive natural selection pressure across virtually the entire genome, with this adaptation occurring in different ways in different lineages, despite the species tendency toward a common gastrointestinal habitat.
Collapse
Affiliation(s)
- Tristan Lefébure
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, New York 14853, USA
| | | |
Collapse
|
41
|
Devault A, Bañuls AL. The promastigote surface antigen gene family of the Leishmania parasite: differential evolution by positive selection and recombination. BMC Evol Biol 2008; 8:292. [PMID: 18950494 PMCID: PMC2584048 DOI: 10.1186/1471-2148-8-292] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2008] [Accepted: 10/24/2008] [Indexed: 11/10/2022] Open
Abstract
Background PSA (promastigote surface antigen) is one of the major classes of membrane proteins present at the surface of the parasitic protozoan Leishmania. While it harbours leucine rich repeats, which are suggestive of its involvement in parasite-to-host physical interactions, its exact role is largely unknown. Furthermore, the extent of diversity of this gene family, both in copy number and sequence has not been established. Results From the newly available complete genome sequences of L. major, L. infantum and L. braziliensis, we have established the complete list of PSA genes, based on the conservation of specific domain architecture. The latter includes an array of leucine rich repeats of unique signature flanked by conserved cysteine-rich domains. All PSA genes code either for secreted or membrane-anchored surface proteins. Besides the few previously identified PSA genes, which are shown here to be part of a relatively large subclass of PSA genes located on chromosome 12, this study identifies seven other PSA subtypes. The latter, whose genes lie on chromosomes 5, 9, 21 and 31 in all three species, form single gene (two genes in one instance) subfamilies, which phylogenetically cluster as highly related orthologs. On the other hand, genes found on chromosome 12 generally show high diversification, as reflected in greater sequence divergence between species, and in an extended set of divergent paralogs. Moreover, we show that the latter genes are submitted to strong positive selection. We also provide evidence that evolution of these genes is driven by intra- and intergenic recombination, thereby modulating the number of LRRs in protein and generating chimeric genes. Conclusion PSA is a Leishmania family of membrane-bound or secreted proteins, whose main signature consists in a specific LRR sequence. All PSA genes found in the genomes of three sequenced Leishmania species unambiguously distribute into eight subfamilies of orthologs. Seven of these are evolving relatively slowly and could correspond to basic functions related to parasite/host interactions. On the opposite, the other PSA gene class, which include all so far experimentally studied PSA genes, could be involved in more specialised adaptative functions.
Collapse
Affiliation(s)
- Alain Devault
- Génétique et Evolution des Maladies Infectieuses, IRD/CNRS (UMR 2724), Montpellier F-34394, France.
| | | |
Collapse
|
42
|
Anisimova M, Kosiol C. Investigating protein-coding sequence evolution with probabilistic codon substitution models. Mol Biol Evol 2008; 26:255-71. [PMID: 18922761 DOI: 10.1093/molbev/msn232] [Citation(s) in RCA: 127] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
This review is motivated by the true explosion in the number of recent studies both developing and ameliorating probabilistic models of codon evolution. Traditionally parametric, the first codon models focused on estimating the effects of selective pressure on the protein via an explicit parameter in the maximum likelihood framework. Likelihood ratio tests of nested codon models armed the biologists with powerful tools, which provided unambiguous evidence for positive selection in real data. This, in turn, triggered a new wave of methodological developments. The new generation of models views the codon evolution process in a more sophisticated way, relaxing several mathematical assumptions. These models make a greater use of physicochemical amino acid properties, genetic code machinery, and the large amounts of data from the public domain. The overview of the most recent advances on modeling codon evolution is presented here, and a wide range of their applications to real data is discussed. On the downside, availability of a large variety of models, each accounting for various biological factors, increases the margin for misinterpretation; the biological meaning of certain parameters may vary among models, and model selection procedures also deserve greater attention. Solid understanding of the modeling assumptions and their applicability is essential for successful statistical data analysis.
Collapse
Affiliation(s)
- Maria Anisimova
- Institute of Computational Science, Swiss Federal Institute of Technology, Zurich, Switzerland.
| | | |
Collapse
|
43
|
Evidence for positive selection in putative virulence factors within the Paracoccidioides brasiliensis species complex. PLoS Negl Trop Dis 2008; 2:e296. [PMID: 18820744 PMCID: PMC2553485 DOI: 10.1371/journal.pntd.0000296] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2008] [Accepted: 08/20/2008] [Indexed: 11/19/2022] Open
Abstract
Paracoccidioides brasiliensis is a dimorphic fungus that is the causative agent of paracoccidioidomycosis, the most important prevalent systemic mycosis in Latin America. Recently, the existence of three genetically isolated groups in P. brasiliensis was demonstrated, enabling comparative studies of molecular evolution among P. brasiliensis lineages. Thirty-two gene sequences coding for putative virulence factors were analyzed to determine whether they were under positive selection. Our maximum likelihood–based approach yielded evidence for selection in 12 genes that are involved in different cellular processes. An in-depth analysis of four of these genes showed them to be either antigenic or involved in pathogenesis. Here, we present evidence indicating that several replacement mutations in gp43 are under positive balancing selection. The other three genes (fks, cdc42 and p27) show very little variation among the P. brasiliensis lineages and appear to be under positive directional selection. Our results are consistent with the more general observations that selective constraints are variable across the genome, and that even in the genes under positive selection, only a few sites are altered. We present our results within an evolutionary framework that may be applicable for studying adaptation and pathogenesis in P. brasiliensis and other pathogenic fungi. The fungus Paracoccidioides brasiliensis is the causative agent of paracoccidioidomycosis, a severe pulmonary mycosis that is endemic to Latin America, where an estimated 10 million people are infected with the fungus. Despite the importance of this disease, we know little about the ecological and evolutionary history of this fungus. Here, we present a survey of genetic variation in putative virulence genes in P. brasiliensis in what constitutes the first systematic approach to understand the molecular evolution of the fungus. We used a population genetics approach to determine the role has natural selection played in the coding genes for proteins involved in pathogenesis. We found that nonsynonymous mutations are more common in genes that code for virulence factors than in housekeeping genes. Our results suggest that positive selection has played an important role in the evolution of virulence factors of P. brasiliensis and is therefore an important factor in the host–pathogen dynamics. Our results also have implications for the possible development of a vaccine against paracoccidioidomycosis, since gp43—the main vaccine candidate—has a high level of polymorphism maintained by natural selection.
Collapse
|