101
|
Niu Z, Xue Q, Wang H, Xie X, Zhu S, Liu W, Ding X. Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus. Int J Mol Sci 2017; 18:E2307. [PMID: 29099062 PMCID: PMC5713276 DOI: 10.3390/ijms18112307] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Revised: 09/27/2017] [Accepted: 10/20/2017] [Indexed: 01/03/2023] Open
Abstract
The variation of GC content is a key genome feature because it is associated with fundamental elements of genome organization. However, the reason for this variation is still an open question. Different kinds of hypotheses have been proposed to explain the variation of GC content during genome evolution. However, these hypotheses have not been explicitly investigated in whole plastome sequences. Dendrobium is one of the largest genera in the orchid species. Evolutionary studies of the plastomic organization and base composition are limited in this genus. In this study, we obtained the high-quality plastome sequences of D. loddigesii and D. devonianum. The comparison results showed a nearly identical organization in Dendrobium plastomes, indicating that the plastomic organization is highly conserved in Dendrobium genus. Furthermore, the impact of three evolutionary forces-selection, mutational biases, and GC-biased gene conversion (gBGC)-on the variation of GC content in Dendrobium plastomes was evaluated. Our results revealed: (1) consistent GC content evolution trends and mutational biases in single-copy (SC) and inverted repeats (IRs) regions; and (2) that gBGC has influenced the plastome-wide GC content evolution. These results suggest that both mutational biases and gBGC affect GC content in the plastomes of Dendrobium genus.
Collapse
Affiliation(s)
- Zhitao Niu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Qingyun Xue
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Hui Wang
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Xuezhu Xie
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Shuying Zhu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Wei Liu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Xiaoyu Ding
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| |
Collapse
|
102
|
Déraspe M, Raymond F, Boisvert S, Culley A, Roy PH, Laviolette F, Corbeil J. Phenetic Comparison of Prokaryotic Genomes Using k-mers. Mol Biol Evol 2017; 34:2716-2729. [PMID: 28957508 PMCID: PMC5850840 DOI: 10.1093/molbev/msx200] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Bacterial genomics studies are getting more extensive and complex, requiring new ways to envision analyses. Using the Ray Surveyor software, we demonstrate that comparison of genomes based on their k-mer content allows reconstruction of phenetic trees without the need of prior data curation, such as core genome alignment of a species. We validated the methodology using simulated genomes and previously published phylogenomic studies of Streptococcus pneumoniae and Pseudomonas aeruginosa. We also investigated the relationship of specific genetic determinants with bacterial population structures. By comparing clusters from the complete genomic content of a genome population with clusters from specific functional categories of genes, we can determine how the population structures are correlated. Indeed, the strain clustering based on a subset of k-mers allows determination of its similarity with the whole genome clusters. We also applied this methodology on 42 species of bacteria to determine the correlational significance of five important bacterial genomic characteristics. For example, intrinsic resistance is more important in P. aeruginosa than in S. pneumoniae, and the former has increased correlation of its population structure with antibiotic resistance genes. The global view of the pangenome of bacteria also demonstrated the taxa-dependent interaction of population structure with antibiotic resistance, bacteriophage, plasmid, and mobile element k-mer data sets.
Collapse
Affiliation(s)
- Maxime Déraspe
- Centre de Recherche en Infectiologie, CHU de Québec-Université Laval, Quebec City, QC, Canada
- Centre de Recherche en Données Massives de l’Université Laval, Quebec City, QC, Canada
- Département de Médecine Moléculaire, Université Laval, Quebec City, QC, Canada
| | - Frédéric Raymond
- Centre de Recherche en Infectiologie, CHU de Québec-Université Laval, Quebec City, QC, Canada
- Centre de Recherche en Données Massives de l’Université Laval, Quebec City, QC, Canada
| | | | - Alexander Culley
- Département de Biochimie, Microbiologie et Bio-informatique, Université Laval, Quebec City, QC, Canada
| | - Paul H. Roy
- Centre de Recherche en Infectiologie, CHU de Québec-Université Laval, Quebec City, QC, Canada
- Département de Biochimie, Microbiologie et Bio-informatique, Université Laval, Quebec City, QC, Canada
| | - François Laviolette
- Centre de Recherche en Données Massives de l’Université Laval, Quebec City, QC, Canada
- Département d’Informatique et de Génie Logiciel, Université Laval, Quebec City, QC, Canada
| | - Jacques Corbeil
- Centre de Recherche en Infectiologie, CHU de Québec-Université Laval, Quebec City, QC, Canada
- Centre de Recherche en Données Massives de l’Université Laval, Quebec City, QC, Canada
- Département de Médecine Moléculaire, Université Laval, Quebec City, QC, Canada
| |
Collapse
|
103
|
Vandamme P, Peeters C, De Smet B, Price EP, Sarovich DS, Henry DA, Hird TJ, Zlosnik JEA, Mayo M, Warner J, Baker A, Currie BJ, Carlier A. Comparative Genomics of Burkholderia singularis sp. nov., a Low G+C Content, Free-Living Bacterium That Defies Taxonomic Dissection of the Genus Burkholderia. Front Microbiol 2017; 8:1679. [PMID: 28932212 PMCID: PMC5592201 DOI: 10.3389/fmicb.2017.01679] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Accepted: 08/21/2017] [Indexed: 12/03/2022] Open
Abstract
Four Burkholderia pseudomallei-like isolates of human clinical origin were examined by a polyphasic taxonomic approach that included comparative whole genome analyses. The results demonstrated that these isolates represent a rare and unusual, novel Burkholderia species for which we propose the name B. singularis. The type strain is LMG 28154T (=CCUG 65685T). Its genome sequence has an average mol% G+C content of 64.34%, which is considerably lower than that of other Burkholderia species. The reduced G+C content of strain LMG 28154T was characterized by a genome wide AT bias that was not due to reduced GC-biased gene conversion or reductive genome evolution, but might have been caused by an altered DNA base excision repair pathway. B. singularis can be differentiated from other Burkholderia species by multilocus sequence analysis, MALDI-TOF mass spectrometry and a distinctive biochemical profile that includes the absence of nitrate reduction, a mucoid appearance on Columbia sheep blood agar, and a slowly positive oxidase reaction. Comparisons with publicly available whole genome sequences demonstrated that strain TSV85, an Australian water isolate, also represents the same species and therefore, to date, B. singularis has been recovered from human or environmental samples on three continents.
Collapse
Affiliation(s)
- Peter Vandamme
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent UniversityGhent, Belgium
| | - Charlotte Peeters
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent UniversityGhent, Belgium
| | - Birgit De Smet
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent UniversityGhent, Belgium
| | - Erin P. Price
- Global and Tropical Health Division, Menzies School of Health Research, DarwinNT, Australia
- Centre for Animal Health Innovation, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sippy DownsQLD, Australia
| | - Derek S. Sarovich
- Global and Tropical Health Division, Menzies School of Health Research, DarwinNT, Australia
- Centre for Animal Health Innovation, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sippy DownsQLD, Australia
| | - Deborah A. Henry
- Centre for Understanding and Preventing Infection in Children, Department of Pediatrics, University of British Columbia, VancouverBC, Canada
| | - Trevor J. Hird
- Centre for Understanding and Preventing Infection in Children, Department of Pediatrics, University of British Columbia, VancouverBC, Canada
| | - James E. A. Zlosnik
- Centre for Understanding and Preventing Infection in Children, Department of Pediatrics, University of British Columbia, VancouverBC, Canada
| | - Mark Mayo
- Global and Tropical Health Division, Menzies School of Health Research, DarwinNT, Australia
| | - Jeffrey Warner
- College of Public Health, Medical and Veterinary Sciences, Australian Institute of Tropical Health and Medicine, James Cook University, TownsvilleQLD, Australia
| | - Anthony Baker
- Tasmanian Institute of Agriculture, University of Tasmania, HobartTAS, Australia
| | - Bart J. Currie
- Global and Tropical Health Division, Menzies School of Health Research, DarwinNT, Australia
| | - Aurélien Carlier
- Laboratory of Microbiology, Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent UniversityGhent, Belgium
| |
Collapse
|
104
|
A strategy for the identification of patterns in the biosynthesis of nonribosomal peptides by Betaproteobacteria species. Sci Rep 2017; 7:10400. [PMID: 28871139 PMCID: PMC5583390 DOI: 10.1038/s41598-017-11314-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 08/22/2017] [Indexed: 11/17/2022] Open
Abstract
Nonribosomal peptides have an important pharmacological role due to their extensive biological properties. The singularities in the biosynthesis of these natural products allowed the development of genome-mining strategies which associate them to their original biosynthetic gene clusters. Generally, these compounds present complex architectures that make their identification difficult. Based on these evidences, genomes from species of the class Betaproteobacteria were studied with the purpose of finding biosynthetic similarities among them. These organisms were applied as templates due to their large number of biosynthetic gene clusters and the natural products isolated from them. The strategy for Rapid Identification of Nonribosomal Peptides Portions (RINPEP) proposed in this work was built by reorganizing the data obtained from antiSMASH and NCBI with a product-centered way. The verification steps of RINPEP comprehended the fragments of existent compounds and predictions obtained in silico with the purpose of finding common subunits expressed by different genomic sequences. The results of this strategy revealed patterns in a global overview of the biosynthesis of nonribosomal peptides by Betaproteobacteria.
Collapse
|
105
|
diCenzo GC, Finan TM. The Divided Bacterial Genome: Structure, Function, and Evolution. Microbiol Mol Biol Rev 2017; 81:e00019-17. [PMID: 28794225 PMCID: PMC5584315 DOI: 10.1128/mmbr.00019-17] [Citation(s) in RCA: 135] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Approximately 10% of bacterial genomes are split between two or more large DNA fragments, a genome architecture referred to as a multipartite genome. This multipartite organization is found in many important organisms, including plant symbionts, such as the nitrogen-fixing rhizobia, and plant, animal, and human pathogens, including the genera Brucella, Vibrio, and Burkholderia. The availability of many complete bacterial genome sequences means that we can now examine on a broad scale the characteristics of the different types of DNA molecules in a genome. Recent work has begun to shed light on the unique properties of each class of replicon, the unique functional role of chromosomal and nonchromosomal DNA molecules, and how the exploitation of novel niches may have driven the evolution of the multipartite genome. The aims of this review are to (i) outline the literature regarding bacterial genomes that are divided into multiple fragments, (ii) provide a meta-analysis of completed bacterial genomes from 1,708 species as a way of reviewing the abundant information present in these genome sequences, and (iii) provide an encompassing model to explain the evolution and function of the multipartite genome structure. This review covers, among other topics, salient genome terminology; mechanisms of multipartite genome formation; the phylogenetic distribution of multipartite genomes; how each part of a genome differs with respect to genomic signatures, genetic variability, and gene functional annotation; how each DNA molecule may interact; as well as the costs and benefits of this genome structure.
Collapse
Affiliation(s)
- George C diCenzo
- Department of Biology, McMaster University, Hamilton, Ontario, Canada
| | - Turlough M Finan
- Department of Biology, McMaster University, Hamilton, Ontario, Canada
| |
Collapse
|
106
|
Krasovec M, Eyre-Walker A, Sanchez-Ferandin S, Piganeau G. Spontaneous Mutation Rate in the Smallest Photosynthetic Eukaryotes. Mol Biol Evol 2017; 34:1770-1779. [PMID: 28379581 PMCID: PMC5455958 DOI: 10.1093/molbev/msx119] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Mutation is the ultimate source of genetic variation, and knowledge of mutation rates is fundamental for our understanding of all evolutionary processes. High throughput sequencing of mutation accumulation lines has provided genome wide spontaneous mutation rates in a dozen model species, but estimates from nonmodel organisms from much of the diversity of life are very limited. Here, we report mutation rates in four haploid marine bacterial-sized photosynthetic eukaryotic algae; Bathycoccus prasinos, Ostreococcus tauri, Ostreococcus mediterraneus, and Micromonas pusilla. The spontaneous mutation rate between species varies from μ = 4.4 × 10-10 to 9.8 × 10-10 mutations per nucleotide per generation. Within genomes, there is a two-fold increase of the mutation rate in intergenic regions, consistent with an optimization of mismatch and transcription-coupled DNA repair in coding sequences. Additionally, we show that deviation from the equilibrium GC content increases the mutation rate by ∼2% to ∼12% because of a GC bias in coding sequences. More generally, the difference between the observed and equilibrium GC content of genomes explains some of the inter-specific variation in mutation rates.
Collapse
Affiliation(s)
- Marc Krasovec
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Observatoire Océanologique, Banyuls/Mer, France
| | - Adam Eyre-Walker
- Evolution, behaviour and environment, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Sophie Sanchez-Ferandin
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Observatoire Océanologique, Banyuls/Mer, France
| | - Gwenael Piganeau
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Observatoire Océanologique, Banyuls/Mer, France
| |
Collapse
|
107
|
Szövényi P, Ullrich KK, Rensing SA, Lang D, van Gessel N, Stenøien HK, Conti E, Reski R. Selfing in Haploid Plants and Efficacy of Selection: Codon Usage Bias in the Model Moss Physcomitrella patens. Genome Biol Evol 2017; 9:1528-1546. [PMID: 28549175 PMCID: PMC5507605 DOI: 10.1093/gbe/evx098] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2017] [Indexed: 12/15/2022] Open
Abstract
A long-term reduction in effective population size will lead to major shift in genome evolution. In particular, when effective population size is small, genetic drift becomes dominant over natural selection. The onset of self-fertilization is one evolutionary event considerably reducing effective size of populations. Theory predicts that this reduction should be more dramatic in organisms capable for haploid than for diploid selfing. Although theoretically well-grounded, this assertion received mixed experimental support. Here, we test this hypothesis by analyzing synonymous codon usage bias of genes in the model moss Physcomitrella patens frequently undergoing haploid selfing. In line with population genetic theory, we found that the effect of natural selection on synonymous codon usage bias is very weak. Our conclusion is supported by four independent lines of evidence: 1) Very weak or nonsignificant correlation between gene expression and codon usage bias, 2) no increased codon usage bias in more broadly expressed genes, 3) no evidence that codon usage bias would constrain synonymous and nonsynonymous divergence, and 4) predominant role of genetic drift on synonymous codon usage predicted by a model-based analysis. These findings show striking similarity to those observed in AT-rich genomes with weak selection for optimal codon usage and GC content overall. Our finding is in contrast to a previous study reporting adaptive codon usage bias in the moss P. patens.
Collapse
Affiliation(s)
- Péter Szövényi
- Department of Systematic and Evolutionary Botany, University of Zurich, Switzerland
| | - Kristian K. Ullrich
- Plant Cell Biology, Faculty of Biology, University of Marburg, Germany
- Present address: Max-Planck-Insitut für Evolutionsbiologie, Plön, Germany
| | - Stefan A. Rensing
- Plant Cell Biology, Faculty of Biology, University of Marburg, Germany
- BIOSS—Centre for Biological Signalling Studies, University of Freiburg, Germany
| | - Daniel Lang
- Plant Genome and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Nico van Gessel
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
| | | | - Elena Conti
- Department of Systematic and Evolutionary Botany, University of Zurich, Switzerland
| | - Ralf Reski
- BIOSS—Centre for Biological Signalling Studies, University of Freiburg, Germany
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
| |
Collapse
|
108
|
Mao M, Yang X, Poff K, Bennett G. Comparative Genomics of the Dual-Obligate Symbionts from the Treehopper, Entylia carinata (Hemiptera: Membracidae), Provide Insight into the Origins and Evolution of an Ancient Symbiosis. Genome Biol Evol 2017; 9:1803-1815. [PMID: 28854637 PMCID: PMC5533117 DOI: 10.1093/gbe/evx134] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/08/2017] [Indexed: 12/20/2022] Open
Abstract
Insect species in the Auchenorrhyncha suborder (Hemiptera) maintain ancient obligate symbioses with bacteria that provide essential amino acids (EAAs) deficient in their plant-sap diets. Molecular studies have revealed that two complementary symbiont lineages, "Candidatus Sulcia muelleri" and a betaproteobacterium ("Ca. Zinderia insecticola" in spittlebugs [Cercopoidea] and "Ca. Nasuia deltocephalinicola" in leafhoppers [Cicadellidae]) may have persisted in the suborder since its origin ∼300 Ma. However, investigation of how this pair has co-evolved on a genomic level is limited to only a few host lineages. We sequenced the complete genomes of Sulcia and a betaproteobacterium from the treehopper, Entylia carinata (Membracidae: ENCA), as the first representative from this species-rich group. It also offers the opportunity to compare symbiont evolution across a major insect group, the Membracoidea (leafhoppers + treehoppers). Genomic analyses show that the betaproteobacteria in ENCA is a member of the Nasuia lineage. Both symbionts have larger genomes (Sulcia = 218 kb and Nasuia = 144 kb) than related lineages in Deltocephalinae leafhoppers, retaining genes involved in basic cellular functions and information processing. Nasuia-ENCA further exhibits few unique gene losses, suggesting that its parent lineage in the common ancestor to the Membracoidea was already highly reduced. Sulcia-ENCA has lost the abilities to synthesize menaquinone cofactor and to complete the synthesis of the branched-chain EAAs. Both capabilities are conserved in other Sulcia lineages sequenced from across the Auchenorrhyncha. Finally, metagenomic sequencing recovered the partial genome of an Arsenophonus symbiont, although it infects only 20% of individuals indicating a facultative role.
Collapse
Affiliation(s)
- Meng Mao
- Department of Plant and Environmental Protection Sciences, University of Hawaii at Mānoa
| | - Xiushuai Yang
- Department of Plant and Environmental Protection Sciences, University of Hawaii at Mānoa
| | - Kirsten Poff
- Department of Plant and Environmental Protection Sciences, University of Hawaii at Mānoa
| | - Gordon Bennett
- Department of Plant and Environmental Protection Sciences, University of Hawaii at Mānoa
| |
Collapse
|
109
|
Vélez JR, Cameron M, Rodríguez-Lecompte JC, Xia F, Heider LC, Saab M, McClure JT, Sánchez J. Whole-Genome Sequence Analysis of Antimicrobial Resistance Genes in Streptococcus uberis and Streptococcus dysgalactiae Isolates from Canadian Dairy Herds. Front Vet Sci 2017; 4:63. [PMID: 28589129 PMCID: PMC5438997 DOI: 10.3389/fvets.2017.00063] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Accepted: 04/19/2017] [Indexed: 11/20/2022] Open
Abstract
The objectives of this study are to determine the occurrence of antimicrobial resistance (AMR) genes using whole-genome sequence (WGS) of Streptococcus uberis (S. uberis) and Streptococcus dysgalactiae (S. dysgalactiae) isolates, recovered from dairy cows in the Canadian Maritime Provinces. A secondary objective included the exploration of the association between phenotypic AMR and the genomic characteristics (genome size, guanine–cytosine content, and occurrence of unique gene sequences). Initially, 91 isolates were sequenced, and of these isolates, 89 were assembled. Furthermore, 16 isolates were excluded due to larger than expected genomic sizes (>2.3 bp × 1,000 bp). In the final analysis, 73 were used with complete WGS and minimum inhibitory concentration records, which were part of the previous phenotypic AMR study, representing 18 dairy herds from the Maritime region of Canada (1). A total of 23 unique AMR gene sequences were found in the bacterial genomes, with a mean number of 8.1 (minimum: 5; maximum: 13) per genome. Overall, there were 10 AMR genes [ANT(6), TEM-127, TEM-163, TEM-89, TEM-95, Linb, Lnub, Ermb, Ermc, and TetS] present only in S. uberis genomes and 2 genes unique (EF-TU and TEM-71) to the S. dysgalactiae genomes; 11 AMR genes [APH(3′), TEM-1, TEM-136, TEM-157, TEM-47, TetM, bl2b, gyrA, parE, phoP, and rpoB] were found in both bacterial species. Two-way tabulations showed association between the phenotypic susceptibility to lincosamides and the presence of linB (P = 0.002) and lnuB (P < 0.001) genes and the between the presence of tetM (P = 0.015) and tetS (P = 0.064) genes and phenotypic resistance to tetracyclines only for the S. uberis isolates. The logistic model showed that the odds of resistance (to any of the phenotypically tested antimicrobials) was 4.35 times higher when there were >11 AMR genes present in the genome, compared with <7 AMR genes (P < 0.001). The odds of resistance was lower for S. dysgalactiae than S. uberis (P = 0.031). When the within-herd somatic cell count was >250,000 cells/mL, a trend toward higher odds of resistance compared with the baseline category of <150,000 cells/mL was observed. When the isolate corresponded to a post-mastitis sample, there were lower odds of resistance when compared with non-clinical isolates (P = 0.01). The results of this study showed the strength of associations between phenotypic AMR resistance of both mastitis pathogens and their genotypic resistome and other epidemiological characteristics.
Collapse
Affiliation(s)
- Julián Reyes Vélez
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| | - Marguerite Cameron
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| | - Juan Carlos Rodríguez-Lecompte
- Department of Pathology and Microbiology, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| | - Fangfang Xia
- Mathematics and Computer Science Division, Argonne National Laboratory, Chicago, IL, USA
| | - Luke C Heider
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| | - Matthew Saab
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada.,Diagnostic Services, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| | - J Trenton McClure
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| | - Javier Sánchez
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PE, Canada
| |
Collapse
|
110
|
Evolutionary forces affecting synonymous variations in plant genomes. PLoS Genet 2017; 13:e1006799. [PMID: 28531201 PMCID: PMC5460877 DOI: 10.1371/journal.pgen.1006799] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Revised: 06/06/2017] [Accepted: 05/04/2017] [Indexed: 01/04/2023] Open
Abstract
Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow to jointly estimating the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones. In protein coding genes, base composition strongly varies within and among plant genomes, especially at positions where changes do not alter the coded protein (synonymous variations). Some species, such as the model plant Arabidopsis thaliana, are relatively GC-poor and homogeneous while others, such as grasses, are highly heterogeneous and GC-rich. The causes of these variations are still debated: are they mainly due to selective or neutral processes? Answering to this question is important to correctly infer whether variations in base composition may have functional roles or not. We extended a population genetics method to jointly estimate the different forces that may affect synonymous variations and applied it to genomic datasets in 11 flowering plant species. We found that GC-biased gene conversion, a neutral process associated with recombination that mimics selection by favouring G and C bases, is a widespread and stronger process than selection and that it could explain the large variation in base composition observed in plant genomes. Our results bear implications for analysing plant genomes and for correctly interpreting what could be functional or not.
Collapse
|
111
|
Bohlin J, Eldholm V, Pettersson JHO, Brynildsrud O, Snipen L. The nucleotide composition of microbial genomes indicates differential patterns of selection on core and accessory genomes. BMC Genomics 2017; 18:151. [PMID: 28187704 PMCID: PMC5303225 DOI: 10.1186/s12864-017-3543-7] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 02/02/2017] [Indexed: 12/02/2022] Open
Abstract
Background The core genome consists of genes shared by the vast majority of a species and is therefore assumed to have been subjected to substantially stronger purifying selection than the more mobile elements of the genome, also known as the accessory genome. Here we examine intragenic base composition differences in core genomes and corresponding accessory genomes in 36 species, represented by the genomes of 731 bacterial strains, to assess the impact of selective forces on base composition in microbes. We also explore, in turn, how these results compare with findings for whole genome intragenic regions. Results We found that GC content in coding regions is significantly higher in core genomes than accessory genomes and whole genomes. Likewise, GC content variation within coding regions was significantly lower in core genomes than in accessory genomes and whole genomes. Relative entropy in coding regions, measured as the difference between observed and expected trinucleotide frequencies estimated from mononucleotide frequencies, was significantly higher in the core genomes than in accessory and whole genomes. Relative entropy was positively associated with coding region GC content within the accessory genomes, but not within the corresponding coding regions of core or whole genomes. Conclusion The higher intragenic GC content and relative entropy, as well as the lower GC content variation, observed in the core genomes is most likely associated with selective constraints. It is unclear whether the positive association between GC content and relative entropy in the more mobile accessory genomes constitutes signatures of selection or selective neutral processes. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3543-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jon Bohlin
- Infectious Disease Control and Environmental Health, Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway.
| | - Vegard Eldholm
- Infectious Disease Control and Environmental Health, Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| | - John H O Pettersson
- Infectious Disease Control and Environmental Health, Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| | - Ola Brynildsrud
- Infectious Disease Control and Environmental Health, Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| | - Lars Snipen
- Department of Chemistry, Biotechnology and Food Sciences, Norwegian University of Life Sciences, 1430, Ås, Norway
| |
Collapse
|
112
|
Chen Q, Zobel J, Verspoor K. Duplicates, redundancies and inconsistencies in the primary nucleotide databases: a descriptive study. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:baw163. [PMID: 28077566 PMCID: PMC5225397 DOI: 10.1093/database/baw163] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 11/17/2016] [Accepted: 11/21/2016] [Indexed: 01/22/2023]
Abstract
GenBank, the EMBL European Nucleotide Archive and the DNA DataBank of Japan, known collectively as the International Nucleotide Sequence Database Collaboration or INSDC, are the three most significant nucleotide sequence databases. Their records are derived from laboratory work undertaken by different individuals, by different teams, with a range of technologies and assumptions and over a period of decades. As a consequence, they contain a great many duplicates, redundancies and inconsistencies, but neither the prevalence nor the characteristics of various types of duplicates have been rigorously assessed. Existing duplicate detection methods in bioinformatics only address specific duplicate types, with inconsistent assumptions; and the impact of duplicates in bioinformatics databases has not been carefully assessed, making it difficult to judge the value of such methods. Our goal is to assess the scale, kinds and impact of duplicates in bioinformatics databases, through a retrospective analysis of merged groups in INSDC databases. Our outcomes are threefold: (1) We analyse a benchmark dataset consisting of duplicates manually identified in INSDC—a dataset of 67 888 merged groups with 111 823 duplicate pairs across 21 organisms from INSDC databases – in terms of the prevalence, types and impacts of duplicates. (2) We categorize duplicates at both sequence and annotation level, with supporting quantitative statistics, showing that different organisms have different prevalence of distinct kinds of duplicate. (3) We show that the presence of duplicates has practical impact via a simple case study on duplicates, in terms of GC content and melting temperature. We demonstrate that duplicates not only introduce redundancy, but can lead to inconsistent results for certain tasks. Our findings lead to a better understanding of the problem of duplication in biological databases. Database URL: the merged records are available at https://cloudstor.aarnet.edu.au/plus/index.php/s/Xef2fvsebBEAv9w
Collapse
Affiliation(s)
- Qingyu Chen
- Department of Computing and Information Systems, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Justin Zobel
- Department of Computing and Information Systems, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Karin Verspoor
- Department of Computing and Information Systems, The University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
113
|
Liu SS, Hockenberry AJ, Lancichinetti A, Jewett MC, Amaral LAN. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents. PLoS Comput Biol 2016; 12:e1005184. [PMID: 27835644 PMCID: PMC5106001 DOI: 10.1371/journal.pcbi.1005184] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Accepted: 10/05/2016] [Indexed: 01/08/2023] Open
Abstract
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. The generation of random sequences is instrumental to the accurate identification of non-random motifs within genomes, yet there are currently no tools available that allow users to simultaneously specify amino acid and GC composition to create random coding sequences. Here, we develop an algorithm based on maximum entropy that consistently generates fully random nucleotide sequences with the desired amino acid composition and GC content.
Collapse
Affiliation(s)
- Sophia S. Liu
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
| | - Adam J. Hockenberry
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
- Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, Illinois, United States of America
| | - Andrea Lancichinetti
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
| | - Michael C. Jewett
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
- Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, Illinois, United States of America
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, Illinois, United States of America
- Chemistry of Life Processes Institute, Northwestern University, Evanston, Illinois, United States of America
| | - Luís A. N. Amaral
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, Illinois, United States of America
- Department of Physics and Astronomy, Northwestern University, Evanston, Illinois, United States of America
- * E-mail:
| |
Collapse
|
114
|
Patel S. Drivers of bacterial genomes plasticity and roles they play in pathogen virulence, persistence and drug resistance. INFECTION GENETICS AND EVOLUTION 2016; 45:151-164. [DOI: 10.1016/j.meegid.2016.08.030] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2016] [Revised: 08/26/2016] [Accepted: 08/27/2016] [Indexed: 12/11/2022]
|
115
|
Dillon MM, Sung W, Sebra R, Lynch M, Cooper VS. Genome-Wide Biases in the Rate and Molecular Spectrum of Spontaneous Mutations in Vibrio cholerae and Vibrio fischeri. Mol Biol Evol 2016; 34:93-109. [PMID: 27744412 PMCID: PMC5854121 DOI: 10.1093/molbev/msw224] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The vast diversity in nucleotide composition and architecture among bacterial genomes may be partly explained by inherent biases in the rates and spectra of spontaneous mutations. Bacterial genomes with multiple chromosomes are relatively unusual but some are relevant to human health, none more so than the causative agent of cholera, Vibrio cholerae Here, we present the genome-wide mutation spectra in wild-type and mismatch repair (MMR) defective backgrounds of two Vibrio species, the low-%GC squid symbiont V. fischeri and the pathogen V. cholerae, collected under conditions that greatly minimize the efficiency of natural selection. In apparent contrast to their high diversity in nature, both wild-type V. fischeri and V. cholerae have among the lowest rates for base-substitution mutations (bpsms) and insertion-deletion mutations (indels) that have been measured, below 10-3/genome/generation. Vibrio fischeri and V. cholerae have distinct mutation spectra, but both are AT-biased and produce a surprising number of multi-nucleotide indels. Furthermore, the loss of a functional MMR system caused the mutation spectra of these species to converge, implying that the MMR system itself contributes to species-specific mutation patterns. Bpsm and indel rates varied among genome regions, but do not explain the more rapid evolutionary rates of genes on chromosome 2, which likely result from weaker purifying selection. More generally, the very low mutation rates of Vibrio species correlate inversely with their immense population sizes and suggest that selection may not only have maximized replication fidelity but also optimized other polygenic traits relative to the constraints of genetic drift.
Collapse
Affiliation(s)
- Marcus M Dillon
- Microbiology Graduate Program, University of New Hampshire, Durham, NH
| | - Way Sung
- Department of Bioinformatics and Genomics, University of North Carolina Charlotte, Charlotte, NC.,Department of Biology, Indiana University, Bloomington, IN
| | - Robert Sebra
- Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN
| | - Vaughn S Cooper
- Microbiology Graduate Program, University of New Hampshire, Durham, NH .,Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA
| |
Collapse
|
116
|
Abstract
As genes originate at different evolutionary times, they harbor distinctive genomic signatures of evolutionary ages. Although previous studies have investigated different gene age-related signatures, what signatures dominantly associate with gene age remains unresolved. Here we address this question via a combined approach of comprehensive assignment of gene ages, gene family identification, and multivariate analyses. We first provide a comprehensive and improved gene age assignment by combining homolog clustering with phylogeny inference and categorize human genes into 26 age classes spanning the whole tree of life. We then explore the dominant age-related signatures based on a collection of 10 potential signatures (including gene composition, gene length, selection pressure, expression level, connectivity in protein–protein interaction network and DNA methylation). Our results show that GC content and connectivity in protein–protein interaction network (PPIN) associate dominantly with gene age. Furthermore, we investigate the heterogeneity of dominant signatures in duplicates and singletons. We find that GC content is a consistent primary factor of gene age in duplicates and singletons, whereas PPIN is more strongly associated with gene age in singletons than in duplicates. Taken together, GC content and PPIN are two dominant signatures in close association with gene age, exhibiting heterogeneity in duplicates and singletons and presumably reflecting complex differential interplays between natural selection and mutation.
Collapse
Affiliation(s)
- Hongyan Yin
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
| | - Guangyu Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
| | - Lina Ma
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China
| | - Soojin V Yi
- School of Biology, Georgia Institute of Technology, Atlanta
| | - Zhang Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
117
|
Sarower MG, Shahriar SIM, Nakamura H, Rouf MA, Okada S. Taxonomic confirmation of mud crab species (genus Scylla) in Bangladesh by nuclear and mitochondrial DNA markers. Mitochondrial DNA A DNA Mapp Seq Anal 2016; 28:935-940. [DOI: 10.1080/24701394.2016.1214726] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Mohammed Golam Sarower
- Department of Aquatic Bioscience, Graduate School of Agricultural and Life Sciences, the University of Tokyo, Bunkyo, Tokyo, Japan
- Fisheries and Marine Resource Technology Discipline, Khulna University, Khulna, Bangladesh
| | | | - Hiromasa Nakamura
- Department of Aquatic Bioscience, Graduate School of Agricultural and Life Sciences, the University of Tokyo, Bunkyo, Tokyo, Japan
| | - Muhammad Abdur Rouf
- Fisheries and Marine Resource Technology Discipline, Khulna University, Khulna, Bangladesh
| | - Shigeru Okada
- Department of Aquatic Bioscience, Graduate School of Agricultural and Life Sciences, the University of Tokyo, Bunkyo, Tokyo, Japan
| |
Collapse
|
118
|
Xue C, Chen H, Yu F. Base-Biased Evolution of Disease-Associated Mutations in the Human Genome. Hum Mutat 2016; 37:1209-1214. [PMID: 27507420 DOI: 10.1002/humu.23065] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Revised: 08/02/2016] [Accepted: 08/07/2016] [Indexed: 11/08/2022]
Abstract
Understanding the evolution of disease-associated mutations is fundamental to analyze pathogenetics of diseases. Mutation, recombination (by GC-biased gene conversion, gBGC), and selection have been known to shape the evolution of disease-associated mutations, but how these evolutionary forces work together is still an open question. In this study, we analyzed several human large-scale datasets (1000 Genomes, ESP6500, ExAC and ClinVar), and found that base-biased mutagenesis generates more GC→AT than AT→GC mutations, while gBGC promotes the fixation of AT→GC mutations to balance the impact of base-biased mutation on genome. Due to this effect of gBGC, purifying selection removes more deleterious AT→GC mutations than GC→AT from population, but many high-frequency (fixed and nearly fixed) deleterious AT→GC mutations are remained possibly due to high genetic load. As a special subset, disease-associated mutations follow this evolutionary rule, in which disease-associated GC→AT mutations are more enriched in rare mutations compared with AT→GC, while disease-associated AT→GC are more enriched in mutations with high frequency. Thus, we presented a base-biased evolutionary framework that explains the base-biased generation and accumulation of disease-associated mutations in human populations.
Collapse
Affiliation(s)
- Cheng Xue
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Hua Chen
- Center for Computational Genomics, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Fuli Yu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas.
| |
Collapse
|
119
|
Pouyet F, Bailly-Bechet M, Mouchiroud D, Guéguen L. SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage. Genome Biol Evol 2016; 8:2427-41. [PMID: 27401173 PMCID: PMC5010899 DOI: 10.1093/gbe/evw165] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences.
Collapse
Affiliation(s)
- Fanny Pouyet
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| | - Marc Bailly-Bechet
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| | - Dominique Mouchiroud
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| | - Laurent Guéguen
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| |
Collapse
|
120
|
Abstract
Mutations are the ultimate source of variation used for evolutionary adaptation, while also being predominantly deleterious and a source of genetic disorders. Understanding the rate of insertion-deletion mutations (indels) is essential to understanding evolutionary processes, especially in coding regions, where such mutations can disrupt production of essential proteins. Using direct estimates of indel rates from 14 phylogenetically diverse eukaryotic and bacterial species, along with measures of standing variation in such species, we obtain results that imply an inverse relationship of mutation rate and effective population size. These results, which corroborate earlier observations on the base-substitution mutation rate, appear most compatible with the hypothesis that natural selection reduces mutation rates per effective genome to the point at which the power of random genetic drift (approximated by the inverse of effective population size) becomes overwhelming. Given the substantial differences in DNA metabolism pathways that give rise to these two types of mutations, this consistency of results raises the possibility that refinement of other molecular and cellular traits may be inversely related to species-specific levels of random genetic drift.
Collapse
|
121
|
Sun S, Xiao J, Zhang H, Zhang Z. Pangenome Evidence for Higher Codon Usage Bias and Stronger Translational Selection in Core Genes of Escherichia coli. Front Microbiol 2016; 7:1180. [PMID: 27536275 PMCID: PMC4971109 DOI: 10.3389/fmicb.2016.01180] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 07/18/2016] [Indexed: 11/25/2022] Open
Abstract
Codon usage bias, as a combined interplay from mutation and selection, has been intensively studied in Escherichia coli. However, codon usage analysis in an E. coli pangenome remains unexplored and the relative importance of mutation and selection acting on core genes and strain-specific genes is unknown. Here we perform comprehensive codon usage analyses based on a collection of multiple complete genome sequences of E. coli. Our results show that core genes that are present in all strains have higher codon usage bias than strain-specific genes that are unique to single strains. We further explore the forces in influencing codon usage and investigate the difference of the major force between core and strain-specific genes. Our results demonstrate that although mutation may exert genome-wide influences on codon usage acting similarly in different gene sets, selection dominates as an important force to shape biased codon usage as genes are present in an increased number of strains. Together, our results provide important insights for better understanding genome plasticity and complexity as well as evolutionary mechanisms behind codon usage bias.
Collapse
Affiliation(s)
- Shixiang Sun
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of SciencesBeijing, China; BIG Data Center, Beijing Institute of Genomics, Chinese Academy of SciencesBeijing, China; University of Chinese Academy of SciencesBeijing, China
| | - Jingfa Xiao
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of SciencesBeijing, China; BIG Data Center, Beijing Institute of Genomics, Chinese Academy of SciencesBeijing, China
| | - Huiyong Zhang
- College of Life Sciences, Henan Agricultural University Zhengzhou, China
| | - Zhang Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of SciencesBeijing, China; BIG Data Center, Beijing Institute of Genomics, Chinese Academy of SciencesBeijing, China
| |
Collapse
|
122
|
Chi PB, Liberles DA. Selection on protein structure, interaction, and sequence. Protein Sci 2016; 25:1168-78. [PMID: 26808055 PMCID: PMC4918422 DOI: 10.1002/pro.2886] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Revised: 01/18/2016] [Accepted: 01/19/2016] [Indexed: 11/10/2022]
Abstract
Characterizing the probabilities of observing amino acid substitutions at specific sites in a protein over evolutionary time is a major goal in the field of molecular evolution. While purely statistical approaches at different levels of complexity exist, approaches rooted in underlying biological processes are necessary to characterize both the context-dependence of sequence changes (epistasis) and to extrapolate to sequences not observed in biological databases. To develop such approaches, an understanding of the different selective forces that act on amino acid substitution is necessary. Here, an overview of selection on and corresponding modeling of folding stability, folding specificity, binding affinity and specificity for ligands, the evolution of new binding sites on protein surfaces, protein dynamics, intrinsic disorder, and protein aggregation as well as the interplay with protein expression level (concentration) and biased mutational processes are presented.
Collapse
Affiliation(s)
- Peter B Chi
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, 19122
- Department of Mathematics and Computer Science, Ursinus College, Collegeville, Pennsylvania, 19426
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, 19122
| |
Collapse
|
123
|
Sharma A, Gilbert JA, Lal R. (Meta)genomic insights into the pathogenome of Cellulosimicrobium cellulans. Sci Rep 2016; 6:25527. [PMID: 27151933 PMCID: PMC4858710 DOI: 10.1038/srep25527] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 04/14/2016] [Indexed: 11/17/2022] Open
Abstract
Despite having serious clinical manifestations, Cellulosimicrobium cellulans remain under-reported with only three genome sequences available at the time of writing. Genome sequences of C. cellulans LMG16121, C. cellulans J36 and Cellulosimicrobium sp. strain MM were used to determine distribution of pathogenicity islands (PAIs) across C. cellulans, which revealed 49 potential marker genes with known association to human infections, e.g. Fic and VbhA toxin-antitoxin system. Oligonucleotide composition-based analysis of orthologous proteins (n = 791) across three genomes revealed significant negative correlation (P < 0.05) between frequency of optimal codons (Fopt) and gene G+C content, highlighting the G+C-biased gene conversion (gBGC) effect across Cellulosimicrobium strains. Bayesian molecular-clock analysis performed on three virulent PAI proteins (Fic; D-alanyl-D-alanine-carboxypeptidase; transposase) dated the divergence event at 300 million years ago from the most common recent ancestor. Synteny-based annotation of hypothetical proteins highlighted gene transfers from non-pathogenic bacteria as a key factor in the evolution of PAIs. Additonally, deciphering the metagenomic islands using strain MM’s genome with environmental data from the site of isolation (hot-spring biofilm) revealed (an)aerobic respiration as population segregation factor across the in situ cohorts. Using reference genomes and metagenomic data, our results highlight the emergence and evolution of PAIs in the genus Cellulosimicrobium.
Collapse
Affiliation(s)
| | - Jack A Gilbert
- Biosciences Division (BIO), Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL, USA.,Department of Surgery, University of Chicago, 5841 S Maryland Ave, Chicago, IL, USA.,Marine Biological Laboratory, Woods Hole, MA, USA
| | - Rup Lal
- Department of Zoology, University of Delhi, Delhi, India
| |
Collapse
|
124
|
Daubin V, Szöllősi GJ. Horizontal Gene Transfer and the History of Life. Cold Spring Harb Perspect Biol 2016; 8:a018036. [PMID: 26801681 DOI: 10.1101/cshperspect.a018036] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Microbes acquire DNA from a variety of sources. The last decades, which have seen the development of genome sequencing, have revealed that horizontal gene transfer has been a major evolutionary force that has constantly reshaped genomes throughout evolution. However, because the history of life must ultimately be deduced from gene phylogenies, the lack of methods to account for horizontal gene transfer has thrown into confusion the very concept of the tree of life. As a result, many questions remain open, but emerging methodological developments promise to use information conveyed by horizontal gene transfer that remains unexploited today.
Collapse
Affiliation(s)
- Vincent Daubin
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, 69000 Lyon, France Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5558, Université Lyon 1, 69622 Villeurbanne, France
| | | |
Collapse
|
125
|
Yahara K, Didelot X, Jolley KA, Kobayashi I, Maiden MCJ, Sheppard SK, Falush D. The Landscape of Realized Homologous Recombination in Pathogenic Bacteria. Mol Biol Evol 2016; 33:456-71. [PMID: 26516092 PMCID: PMC4866539 DOI: 10.1093/molbev/msv237] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Recombination enhances the adaptive potential of organisms by allowing genetic variants to be tested on multiple genomic backgrounds. Its distribution in the genome can provide insight into the evolutionary forces that underlie traits, such as the emergence of pathogenicity. Here, we examined landscapes of realized homologous recombination of 500 genomes from ten bacterial species and found all species have "hot" regions with elevated rates relative to the genome average. We examined the size, gene content, and chromosomal features associated with these regions and the correlations between closely related species. The recombination landscape is variable and evolves rapidly. For example in Salmonella, only short regions of around 1 kb in length are hot whereas in the closely related species Escherichia coli, some hot regions exceed 100 kb, spanning many genes. Only Streptococcus pyogenes shows evidence for the positive correlation between GC content and recombination that has been reported for several eukaryotes. Genes with function related to the cell surface/membrane are often found in recombination hot regions but E. coli is the only species where genes annotated as "virulence associated" are consistently hotter. There is also evidence that some genes with "housekeeping" functions tend to be overrepresented in cold regions. For example, ribosomal proteins showed low recombination in all of the species. Among specific genes, transferrin-binding proteins are recombination hot in all three of the species in which they were found, and are subject to interspecies recombination.
Collapse
Affiliation(s)
- Koji Yahara
- Biostatistics Center, Kurume University, Kurume, Fukuoka, Japan College of Medicine, Institute of Life Science, Swansea University, Swansea, United Kingdom
| | - Xavier Didelot
- Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Keith A Jolley
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Ichizo Kobayashi
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | | | - Samuel K Sheppard
- College of Medicine, Institute of Life Science, Swansea University, Swansea, United Kingdom Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Daniel Falush
- College of Medicine, Institute of Life Science, Swansea University, Swansea, United Kingdom Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| |
Collapse
|
126
|
Diversity and Evolution of the Tn5801-tet(M)-Like Integrative and Conjugative Elements among Enterococcus, Streptococcus, and Staphylococcus. Antimicrob Agents Chemother 2016; 60:1736-46. [PMID: 26729505 DOI: 10.1128/aac.01864-15] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 12/27/2015] [Indexed: 12/11/2022] Open
Abstract
This work describes the diversity and evolution of Tn5801 among enterococci, staphylococci, and streptococci based on analysis of the 5,073 genomes of these bacterial groups available in gene databases. We also examined 610 isolates of Enterococcus (from 10 countries, 1987 to 2010) for the presence of this and other known CTn-tet(M) elements due to the scarcity of data about Tn5801 among enterococci. Genome location (by ICeu-I-pulsed-field gel electrophoresis [PFGE] hybridization/integration site identification), conjugation and fitness (by standard methods), Tn5801 characterization (by long-PCR mapping/sequencing), and clonality (by PFGE/multilocus sequence typing [MLST]) were studied. Twenty-three Tn5801 variants (17 unpublished) clustered in two groups, designated "A" (25 kb; n = 14; predominant in Staphylococcus aureus) and "B" (20 kb; n = 9; predominant in Streptococcus agalactiae). The percent GC content of the common backbone suggests a streptococcal origin of Tn5801 group B, with further acquisition of a 5-kb fragment that resulted in group A. Deep sequence analysis allowed identification of variants associated with clonal lineages of S. aureus (clonal complex 8 [CC8], sequence type 239 [ST239]), S. agalactiae (CC17), Enterococcus faecium (ST17/ST18), or Enterococcus faecalis (ST8), local variants, or variants located in different species and geographical areas. All Tn5801 elements were chromosomally located upstream of the guaA gene, which serves as an integration hot spot. Transferability was demonstrated only for Tn5801 type B among E. faecalis clonal backgrounds, which eventually harbored another Tn5801 copy. The study documents early acquisition of Tn5801 by Enterococcus, Staphylococcus, and Streptococcus. Clonal waves of these pathogens seem to have contributed to the geographical spread and local evolution of the transposon. Horizontal transfer, also demonstrated, could explain the variability observed, with the isolates often containing sequences of different origins.
Collapse
|
127
|
Schandry N, de Lange O, Prior P, Lahaye T. TALE-Like Effectors Are an Ancestral Feature of the Ralstonia solanacearum Species Complex and Converge in DNA Targeting Specificity. FRONTIERS IN PLANT SCIENCE 2016; 7:1225. [PMID: 27582755 PMCID: PMC4987410 DOI: 10.3389/fpls.2016.01225] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Accepted: 08/02/2016] [Indexed: 05/19/2023]
Abstract
Ralstonia solanacearum, a species complex of bacterial plant pathogens divided into four monophyletic phylotypes, causes plant diseases in tropical climates around the world. Some strains exhibit a broad host range on solanaceous hosts, while others are highly host-specific as for example some banana-pathogenic strains. Previous studies showed that transcription activator-like (TAL) effectors from Ralstonia, termed RipTALs, are capable of activating reporter genes in planta, if these are preceded by a matching effector binding element (EBE). RipTALs target DNA via their central repeat domain (CRD), where one repeat pairs with one DNA-base of the given EBE. The repeat variable diresidue dictates base repeat specificity in a predictable fashion, known as the TALE code. In this work, we analyze RipTALs across all phylotypes of the Ralstonia solanacearum species complex. We find that RipTALs are prevalent in phylotypes I and IV but absent from most phylotype III and II strains (10/12, 8/14, 1/24, and 1/5 strains contained a RipTAL, respectively). RipTALs originating from strains of the same phylotype show high levels of sequence similarity (>98%) in the N-terminal and C-terminal regions, while RipTALs isolated from different phylotypes show 47-91% sequence similarity in those regions, giving rise to four RipTAL classes. We show that, despite sequence divergence, the base preference for guanine, mediated by the N-terminal region, is conserved across RipTALs of all classes. Using the number and order of repeats found in the CRD, we functionally sub-classify RipTALs, introduce a new simple nomenclature, and predict matching EBEs for all seven distinct RipTALs identified. We experimentally study RipTAL EBEs and uncover that some RipTALs are able to target the EBEs of other RipTALs, referred to as cross-reactivity. In particular, RipTALs from strains with a broad host range on solanaceous hosts cross-react on each other's EBEs. Investigation of sequence divergence between RipTAL repeats allows for a reconstruction of repeat array biogenesis, for example through slipped strand mispairing or gene conversion. Using these studies we show how RipTALs of broad host range strains evolved convergently toward a shared target sequence. Finally, we discuss the differences between TALE-likes of plant pathogens in the context of disease ecology.
Collapse
Affiliation(s)
- Niklas Schandry
- Center for Plant Molecular Biology, University of TübingenTübingen, Germany
| | - Orlando de Lange
- Center for Plant Molecular Biology, University of TübingenTübingen, Germany
| | - Philippe Prior
- UMR Peuplements Végétaux et Bioagresseurs en Milieu Tropical, Centre de Coopération Internationale en Recherche Agronomique pour le Développement – Institut National de la Recherche AgronomiqueSaint-Pierre, France
| | - Thomas Lahaye
- Center for Plant Molecular Biology, University of TübingenTübingen, Germany
- *Correspondence: Thomas Lahaye,
| |
Collapse
|
128
|
Weakly Deleterious Mutations and Low Rates of Recombination Limit the Impact of Natural Selection on Bacterial Genomes. mBio 2015; 6:e01302-15. [PMID: 26670382 PMCID: PMC4701828 DOI: 10.1128/mbio.01302-15] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Free-living bacteria are usually thought to have large effective population sizes, and so tiny selective differences can drive their evolution. However, because recombination is infrequent, “background selection” against slightly deleterious alleles should reduce the effective population size (Ne) by orders of magnitude. For example, for a well-mixed population with 1012 individuals and a typical level of homologous recombination (r/m = 3, i.e., nucleotide changes due to recombination [r] occur at 3 times the mutation rate [m]), we predict that Ne is <107. An argument for high Ne values for bacteria has been the high genetic diversity within many bacterial “species,” but this diversity may be due to population structure: diversity across subpopulations can be far higher than diversity within a subpopulation, which makes it difficult to estimate Ne correctly. Given an estimate of Ne, standard population genetics models imply that selection should be sufficient to drive evolution if Ne × s is >1, where s is the selection coefficient. We found that this remains approximately correct if background selection is occurring or when population structure is present. Overall, we predict that even for free-living bacteria with enormous populations, natural selection is only a significant force if s is above 10−7 or so. Because bacteria form huge populations with trillions of individuals, the simplest theoretical prediction is that the better allele at a site would predominate even if its advantage was just 10−9 per generation. In other words, virtually every nucleotide would be at the local optimum in most individuals. A more sophisticated theory considers that bacterial genomes have millions of sites each and selection events on these many sites could interfere with each other, so that only larger effects would be important. However, bacteria can exchange genetic material, and in principle, this exchange could eliminate the interference between the evolution of the sites. We used simulations to confirm that during multisite evolution with realistic levels of recombination, only larger effects are important. We propose that advantages of less than 10−7 are effectively neutral.
Collapse
|
129
|
Lassalle F, Muller D, Nesme X. Ecological speciation in bacteria: reverse ecology approaches reveal the adaptive part of bacterial cladogenesis. Res Microbiol 2015; 166:729-41. [DOI: 10.1016/j.resmic.2015.06.008] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Revised: 06/28/2015] [Accepted: 06/30/2015] [Indexed: 11/30/2022]
|
130
|
Biased Gene Conversion in Rhizobium etli Is Caused by Preferential Double-Strand Breaks on One of the Recombining Homologs. J Bacteriol 2015; 198:591-9. [PMID: 26598365 DOI: 10.1128/jb.00768-15] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 11/16/2015] [Indexed: 12/25/2022] Open
Abstract
UNLABELLED Gene conversion, the nonreciprocal transfer of information during homologous recombination, is the main process that maintains identity between members of multigene families. Gene conversion in the nitrogenase (nifH) multigene family of Rhizobium etli was analyzed by using a two-plasmid system, where each plasmid carried a copy of nifH. One of the nifH copies was modified, creating restriction fragment length polymorphisms (RFLPs) spaced along the gene. Once the modified plasmid was introduced into R. etli, selection was done for cointegration with a resident plasmid lacking the RFLPs. Most of the cointegrate molecules harbor gene conversion events, biased toward a gain of RFLPs. This bias may be explained under the double-strand break repair model by proposing that the nifH gene lacking the RFLPs suffers a DNA double-strand break, so the incoming plasmid functions as a template for repairing the homolog on the resident plasmid. To support this proposal, we cloned an SceI site into the nifH homolog that had the RFLPs used for scoring gene conversion. In vivo expression of the meganuclease I-SceI allowed the generation of a double-strand break on this homolog. Upon introduction of this modified plasmid into an R. etli strain lacking I-SceI, biased gene conversion still favored the retention of markers on the incoming plasmid. In contrast, when the recipient strain ectopically expressed I-SceI, a dramatic reversal in gene conversion bias was seen, favoring the preservation of resident sequences. These results show that biased gene conversion is caused by preferential double-strand breaks on one of the recombining homologs. IMPORTANCE In this work, we analyzed gene conversion by using a system that entails horizontal gene transfer followed by homologous recombination in the recipient cell. Most gene conversion events are biased toward the acquisition of the incoming sequences, ranging in size from 120 bp to 800 bp. This bias is due to preferential cutting of resident DNA and can be reversed upon introduction of a double-strand break on the incoming sequence. Since conditions used in this work are similar to those in horizontal gene transfer, it provides evidence that, upon transfer, the resident DNA preferentially acquires gene variants.
Collapse
|
131
|
Mugal CF, Weber CC, Ellegren H. GC-biased gene conversion links the recombination landscape and demography to genomic base composition. Bioessays 2015; 37:1317-26. [DOI: 10.1002/bies.201500058] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Carina F. Mugal
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| | - Claudia C. Weber
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
- Department of Biology; Center for Computational Genetics and Genomics; Temple University; Philadelphia PA USA
| | - Hans Ellegren
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| |
Collapse
|
132
|
Bolívar P, Mugal CF, Nater A, Ellegren H. Recombination Rate Variation Modulates Gene Sequence Evolution Mainly via GC-Biased Gene Conversion, Not Hill-Robertson Interference, in an Avian System. Mol Biol Evol 2015; 33:216-27. [PMID: 26446902 PMCID: PMC4693978 DOI: 10.1093/molbev/msv214] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill–Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C (“strong,” S) over A:T (“weak,” W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
Collapse
Affiliation(s)
- Paulina Bolívar
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Carina F Mugal
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Alexander Nater
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
133
|
Hershberg R. Mutation--The Engine of Evolution: Studying Mutation and Its Role in the Evolution of Bacteria. Cold Spring Harb Perspect Biol 2015; 7:a018077. [PMID: 26330518 DOI: 10.1101/cshperspect.a018077] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Mutation is the engine of evolution in that it generates the genetic variation on which the evolutionary process depends. To understand the evolutionary process we must therefore characterize the rates and patterns of mutation. Starting with the seminal Luria and Delbruck fluctuation experiments in 1943, studies utilizing a variety of approaches have revealed much about mutation rates and patterns and about how these may vary between different bacterial strains and species along the chromosome and between different growth conditions. This work provides a critical overview of the results and conclusions drawn from these studies, of the debate surrounding some of these conclusions, and of the challenges faced when studying mutation and its role in bacterial evolution.
Collapse
Affiliation(s)
- Ruth Hershberg
- Rachel & Menachem Mendelovitch Evolutionary Processes of Mutation & Natural Selection Research Laboratory, Department of Genetics and Developmental Biology, The Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa 31096, Israel
| |
Collapse
|
134
|
Wu CS, Chaw SM. Evolutionary Stasis in Cycad Plastomes and the First Case of Plastome GC-Biased Gene Conversion. Genome Biol Evol 2015; 7:2000-9. [PMID: 26116919 PMCID: PMC4524490 DOI: 10.1093/gbe/evv125] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/22/2015] [Indexed: 01/09/2023] Open
Abstract
In angiosperms, gene conversion has been known to reduce the mutational load of plastid genomes (the plastomes). Particularly, more frequent gene conversions in inverted repeat (IR) than in single copy (SC) regions result in contrasting substitution rates between these two regions. However, little has been known about the effect of gene conversion in the evolution of gymnosperm plastomes. Cycads (Cycadophyta) are the second largest gymnosperm group. Evolutionary study of their plastomes is limited to the basal cycad genus, Cycas. In this study, we addressed three questions. 1) Do the plastomes of other cycad genera evolve slowly as previously observed in the plastome of Cycas taitungensis? 2) Do substitution rates differ between their SC and IR regions? And 3) Does gene conversion occur in the cycad plastomes? If yes, is it AT-biased or GC-biased? Plastomes of eight species from other eight genera of cycads were sequenced. These plastomes are highly conserved in genome organization. Excluding ginkgo, cycad plastomes have significantly lower synonymous and nonsynonymous substitution rates than other gymnosperms, reflecting their evolutionary stasis in nucleotide mutations. In the IRs of cycad plastomes, the reduced substitution rates and GC-biased mutations are associated with a GC-biased gene conversion (gBGC) mechanism. Further investigations suggest that in cycads, gBGC is able to rectify plastome-wide mutations. Therefore, this study is the first to uncover the plastomic gBGC in seed plants. We also propose a gBGC model to interpret the dissimilar evolutionary patterns as well as the compositionally biased mutations in the SC and IR regions of cycad plastomes.
Collapse
Affiliation(s)
- Chung-Shien Wu
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Shu-Miaw Chaw
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
135
|
Luo H, Thompson LR, Stingl U, Hughes AL. Selection Maintains Low Genomic GC Content in Marine SAR11 Lineages. Mol Biol Evol 2015; 32:2738-48. [DOI: 10.1093/molbev/msv149] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
|
136
|
The Rate and Molecular Spectrum of Spontaneous Mutations in the GC-Rich Multichromosome Genome of Burkholderia cenocepacia. Genetics 2015; 200:935-46. [PMID: 25971664 DOI: 10.1534/genetics.115.176834] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Accepted: 05/07/2015] [Indexed: 12/18/2022] Open
Abstract
Spontaneous mutations are ultimately essential for evolutionary change and are also the root cause of many diseases. However, until recently, both biological and technical barriers have prevented detailed analyses of mutation profiles, constraining our understanding of the mutation process to a few model organisms and leaving major gaps in our understanding of the role of genome content and structure on mutation. Here, we present a genome-wide view of the molecular mutation spectrum in Burkholderia cenocepacia, a clinically relevant pathogen with high %GC content and multiple chromosomes. We find that B. cenocepacia has low genome-wide mutation rates with insertion-deletion mutations biased toward deletions, consistent with the idea that deletion pressure reduces prokaryotic genome sizes. Unlike prior studies of other organisms, mutations in B. cenocepacia are not AT biased, which suggests that at least some genomes with high %GC content experience unusual base-substitution mutation pressure. Importantly, we also observe variation in both the rates and spectra of mutations among chromosomes and elevated G:C > T:A transversions in late-replicating regions. Thus, although some patterns of mutation appear to be highly conserved across cellular life, others vary between species and even between chromosomes of the same species, potentially influencing the evolution of nucleotide composition and genome architecture.
Collapse
|
137
|
Abstract
The source and significance of the wide variation in the genomic base composition of bacteria have been a matter of continued debate. Although the variation was originally attributed to a strictly neutral process, i.e., species-specific differences in mutational patterns, recent genomic comparisons have shown that bacteria with G+C-rich genomes experience a mutational bias toward A+T. This difference between the mutational input to a genome and its overall base composition suggests the action of natural selection. Here, we examine if selection acts on G+C contents in Caulobacter crescentus and Pseudomonas aeruginosa, which both have very G+C-rich genomes, by testing whether the expression of gene variants that differ only in their base compositions at synonymous sites affects cellular growth rates. In C. crescentus, expression of the more A+T-rich gene variants decelerated growth, indicating that selection on genic base composition is, in part, responsible for the high G+C content of this genome. In contrast, no comparable effect was observed in P. aeruginosa, which has similarly high genome G+C contents. Selection for increased genic G+C-contents in C. crescentus acts independently of the species-specific codon usage pattern and represents an additional selective force operating in bacterial genomes.
Collapse
|
138
|
Reichenberger ER, Rosen G, Hershberg U, Hershberg R. Prokaryotic nucleotide composition is shaped by both phylogeny and the environment. Genome Biol Evol 2015; 7:1380-9. [PMID: 25861819 PMCID: PMC4453058 DOI: 10.1093/gbe/evv063] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/06/2015] [Indexed: 02/07/2023] Open
Abstract
The causes of the great variation in nucleotide composition of prokaryotic genomes have long been disputed. Here, we use extensive metagenomic and whole-genome data to demonstrate that both phylogeny and the environment shape prokaryotic nucleotide content. We show that across environments, various phyla are characterized by different mean guanine and cytosine (GC) values as well as by the extent of variation on that mean value. At the same time, we show that GC-content varies greatly as a function of environment, in a manner that cannot be entirely explained by disparities in phylogenetic composition. We find environmentally driven differences in nucleotide content not only between highly diverged environments (e.g., soil, vs. aquatic vs. human gut) but also within a single type of environment. More specifically, we demonstrate that some human guts are associated with a microbiome that is consistently more GC-rich across phyla, whereas others are associated with a more AT-rich microbiome. These differences appear to be driven both by variations in phylogenetic composition and by environmental differences-which are independent of these phylogenetic composition differences. Combined, our results demonstrate that both phylogeny and the environment significantly affect nucleotide composition and that the environmental differences affecting nucleotide composition are far subtler than previously appreciated.
Collapse
Affiliation(s)
- Erin R Reichenberger
- Department of Biomedical Engineering, Science & Health Systems, Drexel University
| | - Gail Rosen
- Department of Computer and Electrical Engineering, Drexel University
| | - Uri Hershberg
- Department of Biomedical Engineering, Science & Health Systems, Drexel University Department of Microbiology and Immunology, Drexel University College of Medicine
| | - Ruth Hershberg
- Rachel and Menachem Mendelovitch Evolutionary Processes of Mutation and Natural Selection Research Laboratory, Department of Genetics and Developmental Biology, The Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| |
Collapse
|