1
|
Perturbation and resilience of the gut microbiome up to 3 months after β-lactams exposure in healthy volunteers suggest an important role of microbial β-lactamases. MICROBIOME 2024; 12:50. [PMID: 38468305 DOI: 10.1186/s40168-023-01746-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 12/20/2023] [Indexed: 03/13/2024]
Abstract
BACKGROUND Antibiotics notoriously perturb the gut microbiota. We treated healthy volunteers either with cefotaxime or ceftriaxone for 3 days, and collected in each subject 12 faecal samples up to day 90. Using untargeted and targeted phenotypic and genotypic approaches, we studied the changes in the bacterial, phage and fungal components of the microbiota as well as the metabolome and the β-lactamase activity of the stools. This allowed assessing their degrees of perturbation and resilience. RESULTS While only two subjects had detectable concentrations of antibiotics in their faeces, suggesting important antibiotic degradation in the gut, the intravenous treatment perturbed very significantly the bacterial and phage microbiota, as well as the composition of the metabolome. In contrast, treatment impact was relatively low on the fungal microbiota. At the end of the surveillance period, we found evidence of resilience across the gut system since most components returned to a state like the initial one, even if the structure of the bacterial microbiota changed and the dynamics of the different components over time were rarely correlated. The observed richness of the antibiotic resistance genes repertoire was significantly reduced up to day 30, while a significant increase in the relative abundance of β-lactamase encoding genes was observed up to day 10, consistent with a concomitant increase in the β-lactamase activity of the microbiota. The level of β-lactamase activity at baseline was positively associated with the resilience of the metabolome content of the stools. CONCLUSIONS In healthy adults, antibiotics perturb many components of the microbiota, which return close to the baseline state within 30 days. These data suggest an important role of endogenous β-lactamase-producing anaerobes in protecting the functions of the microbiota by de-activating the antibiotics reaching the colon. Video Abstract.
Collapse
|
2
|
Chance Favors the Prepared Genomes: Horizontal Transfer Shapes the Emergence of Antibiotic Resistance Mutations in Core Genes. Mol Biol Evol 2023; 40:msad217. [PMID: 37788575 PMCID: PMC10575684 DOI: 10.1093/molbev/msad217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 09/08/2023] [Accepted: 09/19/2023] [Indexed: 10/05/2023] Open
Abstract
Bacterial lineages acquire novel traits at diverse rates in part because the genetic background impacts the successful acquisition of novel genes by horizontal transfer. Yet, how horizontal transfer affects the subsequent evolution of core genes remains poorly understood. Here, we studied the evolution of resistance to quinolones in Escherichia coli accounting for population structure. We found 60 groups of genes whose gain or loss induced an increase in the probability of subsequently becoming resistant to quinolones by point mutations in the gyrase and topoisomerase genes. These groups include functions known to be associated with direct mitigation of the effect of quinolones, with metal uptake, cell growth inhibition, biofilm formation, and sugar metabolism. Many of them are encoded in phages or plasmids. Although some of the chronologies may reflect epidemiological trends, many of these groups encoded functions providing latent phenotypes of antibiotic low-level resistance, tolerance, or persistence under quinolone treatment. The mutations providing resistance were frequent and accumulated very quickly. Their emergence was found to increase the rate of acquisition of other antibiotic resistances setting the path for multidrug resistance. Hence, our findings show that horizontal gene transfer shapes the subsequent emergence of adaptive mutations in core genes. In turn, these mutations further affect the subsequent evolution of resistance by horizontal gene transfer. Given the substantial gene flow within bacterial genomes, interactions between horizontal transfer and point mutations in core genes may be a key to the success of adaptation processes.
Collapse
|
3
|
Selfish, promiscuous and sometimes useful: how mobile genetic elements drive horizontal gene transfer in microbial populations. Philos Trans R Soc Lond B Biol Sci 2022; 377:20210234. [PMID: 35989606 PMCID: PMC9393566 DOI: 10.1098/rstb.2021.0234] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Horizontal gene transfer (HGT) drives microbial adaptation but is often under the control of mobile genetic elements (MGEs) whose interests are not necessarily aligned with those of their hosts. In general, transfer is costly to the donor cell while potentially beneficial to the recipients. The diversity and plasticity of cell–MGEs interactions, and those among MGEs, result in complex evolutionary processes where the source, or even the existence of selection for maintaining a function in the genome, is often unclear. For example, MGE-driven HGT depends on cell envelope structures and defense systems, but many of these are transferred by MGEs themselves. MGEs can spur periods of intense gene transfer by increasing their own rates of horizontal transmission upon communicating, eavesdropping, or sensing the environment and the host physiology. This may result in high-frequency transfer of host genes unrelated to the MGE. Here, we review how MGEs drive HGT and how their transfer mechanisms, selective pressures and genomic traits affect gene flow, and therefore adaptation, in microbial populations. The encoding of many adaptive niche-defining microbial traits in MGEs means that intragenomic conflicts and alliances between cells and their MGEs are key to microbial functional diversification. This article is part of a discussion meeting issue ‘Genomic population structures of microbial pathogens’.
Collapse
|
4
|
Abstract
Bacteria and archaea have developed multiple antiviral mechanisms, and genomic evidence indicates that several of these antiviral systems co-occur in the same strain. Here, we introduce DefenseFinder, a tool that automatically detects known antiviral systems in prokaryotic genomes. We use DefenseFinder to analyse 21000 fully sequenced prokaryotic genomes, and find that antiviral strategies vary drastically between phyla, species and strains. Variations in composition of antiviral systems correlate with genome size, viral threat, and lifestyle traits. DefenseFinder will facilitate large-scale genomic analysis of antiviral defense systems and the study of host-virus interactions in prokaryotes.
Collapse
|
5
|
Causes and Consequences of Bacteriophage Diversification via Genetic Exchanges across Lifestyles and Bacterial Taxa. Mol Biol Evol 2021; 38:2497-2512. [PMID: 33570565 PMCID: PMC8136500 DOI: 10.1093/molbev/msab044] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Bacteriophages (phages) evolve rapidly by acquiring genes from other phages. This results in mosaic genomes. Here, we identify numerous genetic transfers between distantly related phages and aim at understanding their frequency, consequences, and the conditions favoring them. Gene flow tends to occur between phages that are enriched for recombinases, transposases, and nonhomologous end joining, suggesting that both homologous and illegitimate recombination contribute to gene flow. Phage family and host phyla are strong barriers to gene exchange, but phage lifestyle is not. Even if we observe four times more recent transfers between temperate phages than between other pairs, there is extensive gene flow between temperate and virulent phages, and between the latter. These predominantly involve virulent phages with large genomes previously classed as low gene flux, and lead to the preferential transfer of genes encoding functions involved in cell energetics, nucleotide metabolism, DNA packaging and injection, and virion assembly. Such exchanges may contribute to the observed twice larger genomes of virulent phages. We used genetic transfers, which occur upon coinfection of a host, to compare phage host range. We found that virulent phages have broader host ranges and can mediate genetic exchanges between narrow host range temperate phages infecting distant bacterial hosts, thus contributing to gene flow between virulent phages, as well as between temperate phages. This gene flow drastically expands the gene repertoires available for phage and bacterial evolution, including the transfer of functional innovations across taxa.
Collapse
|
6
|
Erratum: Causes and Consequences of Bacteriophage Diversification via Genetic Exchanges across Lifestyles and Bacterial Taxa. Mol Biol Evol 2021; 38:2675. [PMID: 33846748 PMCID: PMC8136495 DOI: 10.1093/molbev/msab082] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
|
7
|
Bacteria have numerous distinctive groups of phage-plasmids with conserved phage and variable plasmid gene repertoires. Nucleic Acids Res 2021; 49:2655-2673. [PMID: 33590101 PMCID: PMC7969092 DOI: 10.1093/nar/gkab064] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 01/20/2021] [Accepted: 01/25/2021] [Indexed: 01/16/2023] Open
Abstract
Plasmids and temperate phages are key contributors to bacterial evolution. They are usually regarded as very distinct. However, some elements, termed phage–plasmids, are known to be both plasmids and phages, e.g. P1, N15 or SSU5. The number, distribution, relatedness and characteristics of these phage–plasmids are poorly known. Here, we screened for these elements among ca. 2500 phages and 12000 plasmids and identified 780 phage–plasmids across very diverse bacterial phyla. We grouped 92% of them by similarity of gene repertoires to eight defined groups and 18 other broader communities of elements. The existence of these large groups suggests that phage–plasmids are ancient. Their gene repertoires are large, the average element is larger than an average phage or plasmid, and they include slightly more homologs to phages than to plasmids. We analyzed the pangenomes and the genetic organization of each group of phage–plasmids and found the key phage genes to be conserved and co-localized within distinct groups, whereas genes with homologs in plasmids are much more variable and include most accessory genes. Phage–plasmids are a sizeable fraction of the sequenced plasmids (∼7%) and phages (∼5%), and could have key roles in bridging the genetic divide between phages and other mobile genetic elements.
Collapse
|
8
|
Phylogenetic background and habitat drive the genetic diversification of Escherichia coli. PLoS Genet 2020; 16:e1008866. [PMID: 32530914 PMCID: PMC7314097 DOI: 10.1371/journal.pgen.1008866] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 06/24/2020] [Accepted: 05/18/2020] [Indexed: 12/15/2022] Open
Abstract
Escherichia coli is mostly a commensal of birds and mammals, including humans, where it can act as an opportunistic pathogen. It is also found in water and sediments. We investigated the phylogeny, genetic diversification, and habitat-association of 1,294 isolates representative of the phylogenetic diversity of more than 5,000 isolates from the Australian continent. Since many previous studies focused on clinical isolates, we investigated mostly other isolates originating from humans, poultry, wild animals and water. These strains represent the species genetic diversity and reveal widespread associations between phylogroups and isolation sources. The analysis of strains from the same sequence types revealed very rapid change of gene repertoires in the very early stages of divergence, driven by the acquisition of many different types of mobile genetic elements. These elements also lead to rapid variations in genome size, even if few of their genes rise to high frequency in the species. Variations in genome size are associated with phylogroup and isolation sources, but the latter determine the number of MGEs, a marker of recent transfer, suggesting that gene flow reinforces the association of certain genetic backgrounds with specific habitats. After a while, the divergence of gene repertoires becomes linear with phylogenetic distance, presumably reflecting the continuous turnover of mobile element and the occasional acquisition of adaptive genes. Surprisingly, the phylogroups with smallest genomes have the highest rates of gene repertoire diversification and fewer but more diverse mobile genetic elements. This suggests that smaller genomes are associated with higher, not lower, turnover of genetic information. Many of these genomes are from freshwater isolates and have peculiar traits, including a specific capsule, suggesting adaptation to this environment. Altogether, these data contribute to explain why epidemiological clones tend to emerge from specific phylogenetic groups in the presence of pervasive horizontal gene transfer across the species. Previous large scale studies on the evolution of E. coli focused on clinical isolates emphasizing virulence and antibiotic resistance in medically important lineages. Yet, most E. coli strains are either human commensals or not associated with humans at all. Here, we analyzed a large collection of non-clinical isolates of the species to assess the mechanisms of gene repertoire diversification in the light of isolation sources and phylogeny. We show that gene repertoires evolve so rapidly by the high turnover of mobile genetic elements that epidemiologically indistinguishable strains can be phenotypically extremely heterogeneous, illustrating the velocity of bacterial adaptation and the importance of accounting for the information on the whole genome at the epidemiological scale. Phylogeny and habitat shape the genetic diversification of E. coli to similar extents. Surprisingly, freshwater strains seem specifically adapted to this environment, breaking the paradigm that E. coli environmental isolates are systematically fecal contaminations. As a consequence, the evolution of this species is also shaped by environmental habitats, and it may diversify by acquiring genes and mobile elements from environmental bacteria (and not just from gut bacteria). This may facilitate the acquisition of virulence factors and antibiotic resistance in the strains that become pathogenic.
Collapse
|
9
|
CRISPRCasdb a successor of CRISPRdb containing CRISPR arrays and cas genes from complete genome sequences, and tools to download and query lists of repeats and spacers. Nucleic Acids Res 2020; 48:D535-D544. [PMID: 31624845 PMCID: PMC7145573 DOI: 10.1093/nar/gkz915] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2019] [Revised: 09/20/2019] [Accepted: 10/04/2019] [Indexed: 12/28/2022] Open
Abstract
In Archaea and Bacteria, the arrays called CRISPRs for 'clustered regularly interspaced short palindromic repeats' and the CRISPR associated genes or cas provide adaptive immunity against viruses, plasmids and transposable elements. Short sequences called spacers, corresponding to fragments of invading DNA, are stored in-between repeated sequences. The CRISPR-Cas systems target sequences homologous to spacers leading to their degradation. To facilitate investigations of CRISPRs, we developed 12 years ago a website holding the CRISPRdb. We now propose CRISPRCasdb, a completely new version giving access to both CRISPRs and cas genes. We used CRISPRCasFinder, a program that identifies CRISPR arrays and cas genes and determine the system's type and subtype, to process public whole genome assemblies. Strains are displayed either in an alphabetic list or in taxonomic order. The database is part of the CRISPR-Cas++ website which also offers the possibility to analyse submitted sequences and to download programs. A BLAST search against lists of repeats and spacers extracted from the database is proposed. To date, 16 990 complete prokaryote genomes (16 650 bacteria from 2973 species and 340 archaea from 300 species) are included. CRISPR-Cas systems were found in 36% of Bacteria and 75% of Archaea strains. CRISPRCasdb is freely accessible at https://crisprcas.i2bc.paris-saclay.fr/.
Collapse
|
10
|
A matter of background: DNA repair pathways as a possible cause for the sparse distribution of CRISPR-Cas systems in bacteria. Philos Trans R Soc Lond B Biol Sci 2020; 374:20180088. [PMID: 30905287 PMCID: PMC6452273 DOI: 10.1098/rstb.2018.0088] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The absence of CRISPR-Cas systems in more than half of the sequenced bacterial genomes is intriguing, because their role in adaptive immunity and their frequent transfer between species should have made them almost ubiquitous, as is the case in Archaea. Here, we investigate the possibility that the success of CRISPR-Cas acquisition by horizontal gene transfer is affected by the interactions of these systems with the host genetic background and especially with components of double-strand break repair systems (DSB-RS). We first described the distribution of systems specialized in the repair of double-strand breaks in Bacteria: homologous recombination and non-homologous end joining. This allowed us to show that such systems are more often positively or negatively correlated with the frequency of CRISPR-Cas systems than random genes of similar frequency. The detailed analysis of these co-occurrence patterns shows that our method identifies previously known cases of mechanistic interactions between these systems. It also reveals other positive and negative patterns of co-occurrence between DSB-RS and CRISPR-Cas systems. Notably, it shows that the patterns of distribution of CRISPR-Cas systems in Proteobacteria are strongly dependent on the epistatic groups including RecBCD and AddAB. Our results suggest that the genetic background plays an important role in the success of adaptive immunity in different bacterial clades and provide insights to guide further experimental research on the interactions between CRISPR-Cas and DSB-RS. This article is part of a discussion meeting issue 'The ecology and evolution of prokaryotic CRISPR-Cas adaptive immune systems'.
Collapse
|
11
|
Atypical organizations and epistatic interactions of CRISPRs and cas clusters in genomes and their mobile genetic elements. Nucleic Acids Res 2020; 48:748-760. [PMID: 31745554 PMCID: PMC7145637 DOI: 10.1093/nar/gkz1091] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Revised: 11/01/2019] [Accepted: 11/05/2019] [Indexed: 12/20/2022] Open
Abstract
Prokaryotes use CRISPR-Cas systems for adaptive immunity, but the reasons for the frequent existence of multiple CRISPRs and cas clusters remain poorly understood. Here, we analysed the joint distribution of CRISPR and cas genes in a large set of fully sequenced bacterial genomes and their mobile genetic elements. Our analysis suggests few negative and many positive epistatic interactions between Cas subtypes. The latter often result in complex genetic organizations, where a locus has a single adaptation module and diverse interference mechanisms that might provide more effective immunity. We typed CRISPRs that could not be unambiguously associated with a cas cluster and found that such complex loci tend to have unique type I repeats in multiple CRISPRs. Many chromosomal CRISPRs lack a neighboring Cas system and they often have repeats compatible with the Cas systems encoded in trans. Phages and 25 000 prophages were almost devoid of CRISPR-Cas systems, whereas 3% of plasmids had CRISPR-Cas systems or isolated CRISPRs. The latter were often compatible with the chromosomal cas clusters, suggesting that plasmids can co-opt the latter. These results highlight the importance of interactions between CRISPRs and cas present in multiple copies and in distinct genomic locations in the function and evolution of bacterial immunity.
Collapse
|
12
|
CRISPRCasdb a successor of CRISPRdb containing CRISPR arrays and cas genes from complete genome sequences, and tools to download and query lists of repeats and spacers. Nucleic Acids Res 2020. [PMID: 31624845 DOI: 10.1093/nar/gkz915.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In Archaea and Bacteria, the arrays called CRISPRs for 'clustered regularly interspaced short palindromic repeats' and the CRISPR associated genes or cas provide adaptive immunity against viruses, plasmids and transposable elements. Short sequences called spacers, corresponding to fragments of invading DNA, are stored in-between repeated sequences. The CRISPR-Cas systems target sequences homologous to spacers leading to their degradation. To facilitate investigations of CRISPRs, we developed 12 years ago a website holding the CRISPRdb. We now propose CRISPRCasdb, a completely new version giving access to both CRISPRs and cas genes. We used CRISPRCasFinder, a program that identifies CRISPR arrays and cas genes and determine the system's type and subtype, to process public whole genome assemblies. Strains are displayed either in an alphabetic list or in taxonomic order. The database is part of the CRISPR-Cas++ website which also offers the possibility to analyse submitted sequences and to download programs. A BLAST search against lists of repeats and spacers extracted from the database is proposed. To date, 16 990 complete prokaryote genomes (16 650 bacteria from 2973 species and 340 archaea from 300 species) are included. CRISPR-Cas systems were found in 36% of Bacteria and 75% of Archaea strains. CRISPRCasdb is freely accessible at https://crisprcas.i2bc.paris-saclay.fr/.
Collapse
|
13
|
A simple, reproducible and cost-effective procedure to analyse gut phageome: from phage isolation to bioinformatic approach. Sci Rep 2019; 9:11331. [PMID: 31383878 PMCID: PMC6683287 DOI: 10.1038/s41598-019-47656-w] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 07/18/2019] [Indexed: 01/08/2023] Open
Abstract
The microbiota of the human gut is a complex and rich community where bacteria and their viruses, the bacteriophages, are dominant. There are few studies on the phage community and no clear standard for isolating them, sequencing and analysing their genomes. Since this makes comparisons between studies difficult, we aimed at defining an easy, low-cost, and reproducible methodology. We analysed five different techniques to isolate phages from human adult faeces and developed an approach to analyse their genomes in order to quantify contamination and classify phage contigs in terms of taxonomy and lifestyle. We chose the polyethylene glycol concentration method to isolate phages because of its simplicity, low cost, reproducibility, and of the high number and diversity of phage sequences that we obtained. We also tested the reproducibility of this method with multiple displacement amplification (MDA) and showed that MDA severely decreases the phage genetic diversity of the samples and the reproducibility of the method. Lastly, we studied the influence of sequencing depth on the analysis of phage diversity and observed the beginning of a plateau for phage contigs at 20,000,000 reads. This work contributes to the development of methods for the isolation of phages in faeces and for their comparative analysis.
Collapse
|
14
|
Genetic exchanges are more frequent in bacteria encoding capsules. PLoS Genet 2018; 14:e1007862. [PMID: 30576310 PMCID: PMC6322790 DOI: 10.1371/journal.pgen.1007862] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 01/07/2019] [Accepted: 11/29/2018] [Indexed: 12/12/2022] Open
Abstract
Capsules allow bacteria to colonize novel environments, to withstand numerous stresses, and to resist antibiotics. Yet, even though genetic exchanges with other cells should be adaptive under such circumstances, it has been suggested that capsules lower the rates of homologous recombination and horizontal gene transfer. We analysed over one hundred pan-genomes and thousands of bacterial genomes for the evidence of an association between genetic exchanges (or lack thereof) and the presence of a capsule system. We found that bacteria encoding capsules have larger pan-genomes, higher rates of horizontal gene transfer, and higher rates of homologous recombination in their core genomes. Accordingly, genomes encoding capsules have more plasmids, conjugative elements, transposases, prophages, and integrons. Furthermore, capsular loci are frequent in plasmids, and can be found in prophages. These results are valid for Bacteria, independently of their ability to be naturally transformable. Since we have shown previously that capsules are commonly present in nosocomial pathogens, we analysed their co-occurrence with antibiotic resistance genes. Genomes encoding capsules have more antibiotic resistance genes, especially those encoding efflux pumps, and they constitute the majority of the most worrisome nosocomial bacteria. We conclude that bacteria with capsule systems are more genetically diverse and have fast-evolving gene repertoires, which may further contribute to their success in colonizing novel niches such as humans under antibiotic therapy. Previous works showed that bacteria encoding capsules are better colonizers and are dominant in most environments suggesting a positive role for capsules in the genetic diversification of bacteria. Yet, it has been repeatedly suggested, based almost exclusively studies in few model species, that such bacteria are less diverse and engage in fewer genetic exchanges. Here, we reverse the current paradigm and show that bacteria encoding capsules have larger and more diverse gene repertoires, which change faster by horizontal gene transfer and recombination. Our study alters the traditional view of the capsule as a barrier to gene flow and raises novel questions about the role of capsules in bacterial adaptation.
Collapse
|
15
|
CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res 2018; 46:W246-W251. [PMID: 29790974 PMCID: PMC6030898 DOI: 10.1093/nar/gky425] [Citation(s) in RCA: 718] [Impact Index Per Article: 119.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 05/09/2018] [Indexed: 12/25/2022] Open
Abstract
CRISPR (clustered regularly interspaced short palindromic repeats) arrays and their associated (Cas) proteins confer bacteria and archaea adaptive immunity against exogenous mobile genetic elements, such as phages or plasmids. CRISPRCasFinder allows the identification of both CRISPR arrays and Cas proteins. The program includes: (i) an improved CRISPR array detection tool facilitating expert validation based on a rating system, (ii) prediction of CRISPR orientation and (iii) a Cas protein detection and typing tool updated to match the latest classification scheme of these systems. CRISPRCasFinder can either be used online or as a standalone tool compatible with Linux operating system. All third-party software packages employed by the program are freely available. CRISPRCasFinder is available at https://crisprcas.i2bc.paris-saclay.fr.
Collapse
|
16
|
Inhibition of NHEJ repair by type II-A CRISPR-Cas systems in bacteria. Nat Commun 2017; 8:2094. [PMID: 29234047 PMCID: PMC5727150 DOI: 10.1038/s41467-017-02350-1] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2017] [Accepted: 11/22/2017] [Indexed: 12/26/2022] Open
Abstract
Type II CRISPR-Cas systems introduce double-strand breaks into DNA of invading genetic material and use DNA fragments to acquire novel spacers during adaptation. These breaks can be the substrate of several DNA repair pathways, paving the way for interactions. We report that non-homologous end-joining (NHEJ) and type II-A CRISPR-Cas systems only co-occur once among 5563 fully sequenced prokaryotic genomes. We investigated experimentally the possible molecular interactions using the NHEJ pathway from Bacillus subtilis and the type II-A CRISPR-Cas systems from Streptococcus thermophilus and Streptococcus pyogenes. Our results suggest that the NHEJ system has no effect on CRISPR immunity. On the other hand, we provide evidence for the inhibition of NHEJ repair by the Csn2 protein. Our findings give insights on the complex interactions between CRISPR-Cas systems and repair mechanisms in bacteria, contributing to explain the scattered distribution of CRISPR-Cas systems in bacterial genome.
Collapse
|
17
|
Metagenomic assessment of the interplay between the environment and the genetic diversification of Acinetobacter. Environ Microbiol 2017; 19:5010-5024. [PMID: 28967182 PMCID: PMC5767740 DOI: 10.1111/1462-2920.13949] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Revised: 09/08/2017] [Accepted: 09/26/2017] [Indexed: 01/22/2023]
Abstract
Most bacteria have poorly characterized environmental reservoirs and unknown closely related species. This hampers the study of bacterial evolutionary ecology because both the environment and the genetic background of ancestral lineages are unknown. We combined metagenomics, comparative genomics and phylogenomics to overcome this limitation, to identify novel taxa and to propose environments where they can be isolated. We applied this method to characterize the ecological distribution of known and novel lineages of Acinetobacter spp. We observed two major environmental transitions at deep phylogenetic levels, splitting the genus into three ecologically differentiated clades. One of these has rapidly shifted towards host‐association by acquiring genes involved in bacteria‐eukaryote interactions. We show that environmental perturbations affect species distribution in predictable ways: bovines have very diverse communities of Acinetobacter, unless they were administered antibiotics, in which case they show highly uniform communities of Acinetobacter spp. that resemble those of humans. Our results uncover the diversity of bacterial lineages, overpassing the limitations of classical cultivation methods and highlight the role of the environment in shaping their evolution.
Collapse
|
18
|
Integrative and conjugative elements and their hosts: composition, distribution and organization. Nucleic Acids Res 2017; 45:8943-8956. [PMID: 28911112 PMCID: PMC5587801 DOI: 10.1093/nar/gkx607] [Citation(s) in RCA: 77] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 07/04/2017] [Indexed: 12/22/2022] Open
Abstract
Conjugation of single-stranded DNA drives horizontal gene transfer between bacteria and was widely studied in conjugative plasmids. The organization and function of integrative and conjugative elements (ICE), even if they are more abundant, was only studied in a few model systems. Comparative genomics of ICE has been precluded by the difficulty in finding and delimiting these elements. Here, we present the results of a method that circumvents these problems by requiring only the identification of the conjugation genes and the species’ pan-genome. We delimited 200 ICEs and this allowed the first large-scale characterization of these elements. We quantified the presence in ICEs of a wide set of functions associated with the biology of mobile genetic elements, including some that are typically associated with plasmids, such as partition and replication. Protein sequence similarity networks and phylogenetic analyses revealed that ICEs are structured in functional modules. Integrases and conjugation systems have different evolutionary histories, even if the gene repertoires of ICEs can be grouped in function of conjugation types. Our characterization of the composition and organization of ICEs paves the way for future functional and evolutionary analyses of their cargo genes, composed of a majority of unknown function genes.
Collapse
|
19
|
The chromosomal organization of horizontal gene transfer in bacteria. Nat Commun 2017; 8:841. [PMID: 29018197 PMCID: PMC5635113 DOI: 10.1038/s41467-017-00808-w] [Citation(s) in RCA: 116] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 07/31/2017] [Indexed: 01/02/2023] Open
Abstract
Bacterial adaptation is accelerated by the acquisition of novel traits through horizontal gene transfer, but the integration of these genes affects genome organization. We found that transferred genes are concentrated in only ~1% of the chromosomal regions (hotspots) in 80 bacterial species. This concentration increases with genome size and with the rate of transfer. Hotspots diversify by rapid gene turnover; their chromosomal distribution depends on local contexts (neighboring core genes), and content in mobile genetic elements. Hotspots concentrate most changes in gene repertoires, reduce the trade-off between genome diversification and organization, and should be treasure troves of strain-specific adaptive genes. Most mobile genetic elements and antibiotic resistance genes are in hotspots, but many hotspots lack recognizable mobile genetic elements and exhibit frequent homologous recombination at flanking core genes. Overrepresentation of hotspots with fewer mobile genetic elements in naturally transformable bacteria suggests that homologous recombination and horizontal gene transfer are tightly linked in genome evolution. Horizontal gene transfer (HGT) is an important mechanism for genome evolution and adaptation in bacteria. Here, Oliveira and colleagues find HGT hotspots comprising ~ 1% of the chromosomal regions in 80 bacterial species.
Collapse
|
20
|
Comparative genomic analysis of Acinetobacter strains isolated from murine colonic crypts. BMC Genomics 2017; 18:525. [PMID: 28697749 PMCID: PMC5505149 DOI: 10.1186/s12864-017-3925-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 07/06/2017] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND A restricted set of aerobic bacteria dominated by the Acinetobacter genus was identified in murine intestinal colonic crypts. The vicinity of such bacteria with intestinal stem cells could indicate that they protect the crypt against cytotoxic and genotoxic signals. Genome analyses of these bacteria were performed to better appreciate their biodegradative capacities. RESULTS Two taxonomically different clusters of Acinetobacter were isolated from murine proximal colonic crypts, one was identified as A. modestus and the other as A. radioresistens. Their identification was performed through biochemical parameters and housekeeping gene sequencing. After selection of one strain of each cluster (A. modestus CM11G and A. radioresistens CM38.2), comparative genomic analysis was performed on whole-genome sequencing data. The antibiotic resistance pattern of these two strains is different, in line with the many genes involved in resistance to heavy metals identified in both genomes. Moreover whereas the operon benABCDE involved in benzoate metabolism is encoded by the two genomes, the operon antABC encoding the anthranilate dioxygenase, and the phenol hydroxylase gene cluster are absent in the A. modestus genomic sequence, indicating that the two strains have different capacities to metabolize xenobiotics. A common feature of the two strains is the presence of a type IV pili system, and the presence of genes encoding proteins pertaining to secretion systems such as Type I and Type II secretion systems. CONCLUSIONS Our comparative genomic analysis revealed that different Acinetobacter isolated from the same biological niche, even if they share a large majority of genes, possess unique features that could play a specific role in the protection of the intestinal crypt.
Collapse
|
21
|
Embracing the enemy: the diversification of microbial gene repertoires by phage-mediated horizontal gene transfer. Curr Opin Microbiol 2017; 38:66-73. [PMID: 28527384 DOI: 10.1016/j.mib.2017.04.010] [Citation(s) in RCA: 159] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Revised: 04/20/2017] [Accepted: 04/21/2017] [Indexed: 01/21/2023]
Abstract
Bacteriophages and archaeal viruses contribute, through lysogenic conversion or transduction, to the horizontal transfer of genetic material between microbial genomes. Recent genomics, metagenomics, and single cell studies have shown that lysogenic conversion is widespread and provides hosts with adaptive traits often associated with biotic interactions. The quantification of the evolutionary impact of transduction has lagged behind and requires further theoretical and experimental work. Nevertheless, recent studies suggested that generalized transduction plays a role in the transfer of antibiotic resistance genes and in the acquisition of novel genes during intra-specific bacterial competition. The characteristics of transduction and lysogenic conversion complement those of other mechanisms of transfer, and could play a key role in the spread of adaptive genes between communities.
Collapse
|
22
|
Erratum: Corrigendum: Uncovering Listeria monocytogenes hypervirulence by harnessing its biodiversity. Nat Genet 2017; 49:651. [DOI: 10.1038/ng0417-651b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
23
|
Metabolic diversity of the emerging pathogenic lineages of Klebsiella pneumoniae. Environ Microbiol 2017; 19:1881-1898. [PMID: 28181409 DOI: 10.1111/1462-2920.13689] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Revised: 02/01/2017] [Accepted: 02/01/2017] [Indexed: 11/30/2022]
Abstract
Multidrug resistant and hypervirulent clones of Klebsiella pneumoniae are emerging pathogens. To understand the association between genotypic and phenotypic diversity in this process, we combined genomic, phylogenomic and phenotypic analysis of a diverse set of K. pneumoniae and closely related species. These species were able to use an unusually large panel of metabolic substrates for growth, many of which were shared between all strains. We analysed the substrates used by only a fraction of the strains, identified some of their genetic basis, and found that many could not be explained by the phylogeny of the strains. Puzzlingly, few traits were associated with the ecological origin of the strains. One noticeable exception was the ability to use D-arabinose, which was much more frequent in hypervirulent strains. The broad carbon and nitrogen core metabolism of K. pneumoniae might contribute to its ability to thrive in diverse environments. Accordingly, even the hypervirulent and multidrug resistant clones have the metabolic signature of ubiquitous bacteria. The apparent few metabolic differences between hypervirulent, multi-resistant and environmental strains may favour the emergence of dual-risk strains that combine resistance and hypervirulence.
Collapse
|
24
|
Genetic and life-history traits associated with the distribution of prophages in bacteria. THE ISME JOURNAL 2016; 10:2744-2754. [PMID: 27015004 PMCID: PMC5113838 DOI: 10.1038/ismej.2016.47] [Citation(s) in RCA: 211] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2015] [Revised: 02/17/2016] [Accepted: 02/24/2016] [Indexed: 01/07/2023]
Abstract
Nearly half of the sequenced bacteria are lysogens and many of their prophages encode adaptive traits. Yet, the variables driving prophage distribution remain undetermined. We identified 2246 prophages in complete bacterial genomes to study the genetic and life-history traits associated with lysogeny. While optimal growth temperatures and average cell volumes were not associated with lysogeny, prophages were more frequent in pathogens and in bacteria with small minimal doubling times. Their frequency also increased with genome size, but only for genomes smaller than 6 Mb. The number of spacers in CRISPR-Cas systems and the frequency of type III systems were anticorrelated with prophage frequency, but lysogens were more likely to encode type I and type II systems. The minimal doubling time was the trait most correlated with lysogeny, followed by genome size and pathogenicity. We propose that bacteria with highly variable growth rates often encounter lower opportunity costs for lysogeny relative to lysis. These results contribute to explain the paucity of temperate phages in certain bacterial clades and of bacterial lysogens in certain environments. They suggest that genetic and life-history traits affect the contributions of temperate phages to bacterial genomes.
Collapse
|
25
|
Whole genome-based population biology and epidemiological surveillance of Listeria monocytogenes. Nat Microbiol 2016; 2:16185. [PMID: 27723724 DOI: 10.1038/nmicrobiol.2016.185] [Citation(s) in RCA: 397] [Impact Index Per Article: 49.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Accepted: 08/30/2016] [Indexed: 01/31/2023]
Abstract
Listeria monocytogenes (Lm) is a major human foodborne pathogen. Numerous Lm outbreaks have been reported worldwide and associated with a high case fatality rate, reinforcing the need for strongly coordinated surveillance and outbreak control. We developed a universally applicable genome-wide strain genotyping approach and investigated the population diversity of Lm using 1,696 isolates from diverse sources and geographical locations. We define, with unprecedented precision, the population structure of Lm, demonstrate the occurrence of international circulation of strains and reveal the extent of heterogeneity in virulence and stress resistance genomic features among clinical and food isolates. Using historical isolates, we show that the evolutionary rate of Lm from lineage I and lineage II is low (∼2.5 × 10-7 substitutions per site per year, as inferred from the core genome) and that major sublineages (corresponding to so-called 'epidemic clones') are estimated to be at least 50-150 years old. This work demonstrates the urgent need to monitor Lm strains at the global level and provides the unified approach needed for global harmonization of Lm genome-based typing and population biology.
Collapse
|
26
|
Identification and analysis of integrons and cassette arrays in bacterial genomes. Nucleic Acids Res 2016; 44:4539-50. [PMID: 27130947 PMCID: PMC4889954 DOI: 10.1093/nar/gkw319] [Citation(s) in RCA: 178] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 04/13/2016] [Indexed: 12/13/2022] Open
Abstract
Integrons recombine gene arrays and favor the spread of antibiotic resistance. Their broader roles in bacterial adaptation remain mysterious, partly due to lack of computational tools. We made a program – IntegronFinder – to identify integrons with high accuracy and sensitivity. IntegronFinder is available as a standalone program and as a web application. It searches for attC sites using covariance models, for integron-integrases using HMM profiles, and for other features (promoters, attI site) using pattern matching. We searched for integrons, integron-integrases lacking attC sites, and clusters of attC sites lacking a neighboring integron-integrase in bacterial genomes. All these elements are especially frequent in genomes of intermediate size. They are missing in some key phyla, such as α-Proteobacteria, which might reflect selection against cell lineages that acquire integrons. The similarity between attC sites is proportional to the number of cassettes in the integron, and is particularly low in clusters of attC sites lacking integron-integrases. The latter are unexpectedly abundant in genomes lacking integron-integrases or their remains, and have a large novel pool of cassettes lacking homologs in the databases. They might represent an evolutionary step between the acquisition of genes within integrons and their stabilization in the new genome.
Collapse
|
27
|
Abstract
The cytoplasm of prokaryotes contains many molecular machines interacting directly with the chromosome. These vital interactions depend on the chromosome structure, as a molecule, and on the genome organization, as a unit of genetic information. Strong selection for the organization of the genetic elements implicated in these interactions drives replicon ploidy, gene distribution, operon conservation, and the formation of replication-associated traits. The genomes of prokaryotes are also very plastic with high rates of horizontal gene transfer and gene loss. The evolutionary conflicts between plasticity and organization lead to the formation of regions with high genetic diversity whose impact on chromosome structure is poorly understood. Prokaryotic genomes are remarkable documents of natural history because they carry the imprint of all of these selective and mutational forces. Their study allows a better understanding of molecular mechanisms, their impact on microbial evolution, and how they can be tinkered in synthetic biology.
Collapse
|
28
|
The chromosomal accommodation and domestication of mobile genetic elements. Curr Opin Microbiol 2014; 22:22-9. [DOI: 10.1016/j.mib.2014.09.010] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 09/17/2014] [Indexed: 11/17/2022]
|
29
|
Bacteria in Ostreococcus tauri cultures - friends, foes or hitchhikers? Front Microbiol 2014; 5:505. [PMID: 25426102 PMCID: PMC4224133 DOI: 10.3389/fmicb.2014.00505] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2014] [Accepted: 09/08/2014] [Indexed: 11/13/2022] Open
Abstract
Marine phytoplankton produce half of the oxygen we breathe and their astounding diversity is just starting to be unraveled. Many microbial phytoplankton are thought to be phototrophic, depending solely on inorganic sources of carbon and minerals for growth rather than preying on other planktonic cells. However, there is increasing evidence that symbiotic associations, to a large extent with bacteria, are required for vitamin or nutrient uptake for many eukaryotic microalgae. Here, we use in silico approaches to look for putative symbiotic interactions by analysing the gene content of microbial communities associated with 13 different Ostreococcus tauri (Chlorophyta, Mamilleophyceae) cultures sampled from the Mediterranean Sea. While we find evidence for bacteria in all cultures, there is no ubiquitous bacterial group, and the most prevalent group, Flavobacteria, is present in 10 out of 13 cultures. Among seven of the microbiomes, we detected genes predicted to encode type 3 secretion systems (T3SS, in 6/7 microbiomes) and/or putative type 6 secretion systems (T6SS, in 4/7 microbiomes). Phylogenetic analyses show that the corresponding genes are closely related to genes of systems identified in bacterial-plant interactions, suggesting that these T3SS might be involved in cell-to-cell interactions with O. tauri.
Collapse
|
30
|
MacSyFinder: a program to mine genomes for molecular systems with an application to CRISPR-Cas systems. PLoS One 2014; 9:e110726. [PMID: 25330359 PMCID: PMC4201578 DOI: 10.1371/journal.pone.0110726] [Citation(s) in RCA: 206] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 09/15/2014] [Indexed: 01/21/2023] Open
Abstract
Motivation Biologists often wish to use their knowledge on a few experimental models of a given molecular system to identify homologs in genomic data. We developed a generic tool for this purpose. Results Macromolecular System Finder (MacSyFinder) provides a flexible framework to model the properties of molecular systems (cellular machinery or pathway) including their components, evolutionary associations with other systems and genetic architecture. Modelled features also include functional analogs, and the multiple uses of a same component by different systems. Models are used to search for molecular systems in complete genomes or in unstructured data like metagenomes. The components of the systems are searched by sequence similarity using Hidden Markov model (HMM) protein profiles. The assignment of hits to a given system is decided based on compliance with the content and organization of the system model. A graphical interface, MacSyView, facilitates the analysis of the results by showing overviews of component content and genomic context. To exemplify the use of MacSyFinder we built models to detect and class CRISPR-Cas systems following a previously established classification. We show that MacSyFinder allows to easily define an accurate “Cas-finder” using publicly available protein profiles. Availability and Implementation MacSyFinder is a standalone application implemented in Python. It requires Python 2.7, Hmmer and makeblastdb (version 2.2.28 or higher). It is freely available with its source code under a GPLv3 license at https://github.com/gem-pasteur/macsyfinder. It is compatible with all platforms supporting Python and Hmmer/makeblastdb. The “Cas-finder” (models and HMM profiles) is distributed as a compressed tarball archive as Supporting Information.
Collapse
|
31
|
The genomic diversification of the whole Acinetobacter genus: origins, mechanisms, and consequences. Genome Biol Evol 2014; 6:2866-82. [PMID: 25313016 PMCID: PMC4224351 DOI: 10.1093/gbe/evu225] [Citation(s) in RCA: 175] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Bacterial genomics has greatly expanded our understanding of microdiversification patterns within a species, but analyses at higher taxonomical levels are necessary to understand and predict the independent rise of pathogens in a genus. We have sampled, sequenced, and assessed the diversity of genomes of validly named and tentative species of the Acinetobacter genus, a clade including major nosocomial pathogens and biotechnologically important species. We inferred a robust global phylogeny and delimited several new putative species. The genus is very ancient and extremely diverse: Genomes of highly divergent species share more orthologs than certain strains within a species. We systematically characterized elements and mechanisms driving genome diversification, such as conjugative elements, insertion sequences, and natural transformation. We found many error-prone polymerases that may play a role in resistance to toxins, antibiotics, and in the generation of genetic variation. Surprisingly, temperate phages, poorly studied in Acinetobacter, were found to account for a significant fraction of most genomes. Accordingly, many genomes encode clustered regularly interspaced short palindromic repeats (CRISPR)-Cas systems with some of the largest CRISPR-arrays found so far in bacteria. Integrons are strongly overrepresented in Acinetobacter baumannii, which correlates with its frequent resistance to antibiotics. Our data suggest that A. baumannii arose from an ancient population bottleneck followed by population expansion under strong purifying selection. The outstanding diversification of the species occurred largely by horizontal transfer, including some allelic recombination, at specific hotspots preferentially located close to the replication terminus. Our work sets a quantitative basis to understand the diversification of Acinetobacter into emerging resistant and versatile pathogens.
Collapse
|
32
|
The interplay of restriction-modification systems with mobile genetic elements and their prokaryotic hosts. Nucleic Acids Res 2014; 42:10618-31. [PMID: 25120263 PMCID: PMC4176335 DOI: 10.1093/nar/gku734] [Citation(s) in RCA: 179] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2014] [Revised: 07/29/2014] [Accepted: 07/30/2014] [Indexed: 01/21/2023] Open
Abstract
The roles of restriction-modification (R-M) systems in providing immunity against horizontal gene transfer (HGT) and in stabilizing mobile genetic elements (MGEs) have been much debated. However, few studies have precisely addressed the distribution of these systems in light of HGT, its mechanisms and its vectors. We analyzed the distribution of R-M systems in 2261 prokaryote genomes and found their frequency to be strongly dependent on the presence of MGEs, CRISPR-Cas systems, integrons and natural transformation. Yet R-M systems are rare in plasmids, in prophages and nearly absent from other phages. Their abundance depends on genome size for small genomes where it relates with HGT but saturates at two occurrences per genome. Chromosomal R-M systems might evolve under cycles of purifying and relaxed selection, where sequence conservation depends on the biochemical activity and complexity of the system and total gene loss is frequent. Surprisingly, analysis of 43 pan-genomes suggests that solitary R-M genes rarely arise from the degradation of R-M systems. Solitary genes are transferred by large MGEs, whereas complete systems are more frequently transferred autonomously or in small MGEs. Our results suggest means of testing the roles for R-M systems and their associations with MGEs.
Collapse
|
33
|
Abstract
The human bacterial pathogen Listeria monocytogenes is emerging as a model organism to study RNA-mediated regulation in pathogenic bacteria. A class of non-coding RNAs called CRISPRs (clustered regularly interspaced short palindromic repeats) has been described to confer bacterial resistance against invading bacteriophages and conjugative plasmids. CRISPR function relies on the activity of CRISPR associated (cas) genes that encode a large family of proteins with nuclease or helicase activities and DNA and RNA binding domains. Here, we characterized a CRISPR element (RliB) that is expressed and processed in the L. monocytogenes strain EGD-e, which is completely devoid of cas genes. Structural probing revealed that RliB has an unexpected secondary structure comprising basepair interactions between the repeats and the adjacent spacers in place of canonical hairpins formed by the palindromic repeats. Moreover, in contrast to other CRISPR-Cas systems identified in Listeria, RliB-CRISPR is ubiquitously present among Listeria genomes at the same genomic locus and is never associated with the cas genes. We showed that RliB-CRISPR is a substrate for the endogenously encoded polynucleotide phosphorylase (PNPase) enzyme. The spacers of the different Listeria RliB-CRISPRs share many sequences with temperate and virulent phages. Furthermore, we show that a cas-less RliB-CRISPR lowers the acquisition frequency of a plasmid carrying the matching protospacer, provided that trans encoded cas genes of a second CRISPR-Cas system are present in the genome. Importantly, we show that PNPase is required for RliB-CRISPR mediated DNA interference. Altogether, our data reveal a yet undescribed CRISPR system whose both processing and activity depend on PNPase, highlighting a new and unexpected function for PNPase in “CRISPRology”. CRISPR-Cas systems confer to bacteria and archaea an adaptive immunity that protects them against invading bacteriophages and plasmids. In this study, we characterize a CRISPR (RliB-CRISPR) that is present in all L. monocytogenes strains at the same genomic locus but is never associated with a cas operon. It is an unusual CRISPR that, as we demonstrate, has a secondary structure consisting of basepair interactions between the repeat sequence and the adjacent spacer. We show that the RliB-CRISPR is processed by the endogenously encoded polynucleotide phosphorylase enzyme (PNPase). In addition, we show that the RliB-CRISPR system requires PNPase and presence of trans encoded cas genes of a second CRISPR-Cas system, to mediate DNA interference directed against a plasmid carrying a matching protospacer. Altogether, our data reveal a novel type of CRISPR system in bacteria that requires endogenously encoded PNPase enzyme for its processing and interference activity.
Collapse
|
34
|
Manipulating or superseding host recombination functions: a dilemma that shapes phage evolvability. PLoS Genet 2013; 9:e1003825. [PMID: 24086157 PMCID: PMC3784561 DOI: 10.1371/journal.pgen.1003825] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 08/08/2013] [Indexed: 11/18/2022] Open
Abstract
Phages, like many parasites, tend to have small genomes and may encode autonomous functions or manipulate those of their hosts'. Recombination functions are essential for phage replication and diversification. They are also nearly ubiquitous in bacteria. The E. coli genome encodes many copies of an octamer (Chi) motif that upon recognition by RecBCD favors repair of double strand breaks by homologous recombination. This might allow self from non-self discrimination because RecBCD degrades DNA lacking Chi. Bacteriophage Lambda, an E. coli parasite, lacks Chi motifs, but escapes degradation by inhibiting RecBCD and encoding its own autonomous recombination machinery. We found that only half of 275 lambdoid genomes encode recombinases, the remaining relying on the host's machinery. Unexpectedly, we found that some lambdoid phages contain extremely high numbers of Chi motifs concentrated between the phage origin of replication and the packaging site. This suggests a tight association between replication, packaging and RecBCD-mediated recombination in these phages. Indeed, phages lacking recombinases strongly over-represent Chi motifs. Conversely, phages encoding recombinases and inhibiting host recombination machinery select for the absence of Chi motifs. Host and phage recombinases use different mechanisms and the latter are more tolerant to sequence divergence. Accordingly, we show that phages encoding their own recombination machinery have more mosaic genomes resulting from recent recombination events and have more diverse gene repertoires, i.e. larger pan genomes. We discuss the costs and benefits of superseding or manipulating host recombination functions and how this decision shapes phage genome structure and evolvability. Bacterial viruses, called bacteriophages, are extremely abundant in the biosphere. They have key roles in the regulation of bacterial populations and in the diversification of bacterial genomes. Among these viruses, lambdoid phages are very abundant in enterobacteria and exchange genetic material very frequently. This latter process is thought to increase phage diversity and therefore facilitate adaptation to hosts. Recombination is also essential for the replication of many lambdoid phages. Lambdoids have been described to encode their own recombination genes and inhibit their hosts'. In this study, we show that lambdoids are split regarding their capacity to encode autonomous recombination functions and that this affects the abundance of recombination-related sequence motifs. Half of the phages encode an autonomous system and inhibit their hosts'. The trade-off between superseding and manipulating the hosts' recombination functions has important consequences. The phages encoding autonomous recombination functions have more diverse gene repertoires and recombine more frequently. Viruses, as many other parasites, have small genomes and depend on their hosts for several housekeeping functions. Hence, they often face trade-offs between supersession and manipulation of molecular machineries. Our results suggest these trade-offs may shape viral gene repertoires, their sequence composition and even influence their evolvability.
Collapse
|
35
|
From the Flavobacterium genus to the phylum Bacteroidetes: genomic analysis of dnd gene clusters. FEMS Microbiol Lett 2013; 348:26-35. [PMID: 23965156 DOI: 10.1111/1574-6968.12239] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 08/12/2013] [Accepted: 08/18/2013] [Indexed: 01/19/2023] Open
Abstract
Phosphorothioate modification of DNA and the corresponding DNA degradation (Dnd) phenotype that occurs during gel electrophoresis are caused by dnd genes. Although widely distributed among Bacteria and Archaea, dnd genes have been found in only very few, taxonomically unrelated, bacterial species so far. Here, we report the presence of dnd genes and their associated Dnd phenotype in two Flavobacterium species. Comparison with dnd gene clusters previously described led us to report a noncanonical genetic organization and to identify a gene likely encoding a hybrid DndE protein. Hence, we showed that dnd genes are also present in members of the family Flavobacteriaceae, a bacterial group occurring in a variety of habitats with an interesting diversity of lifestyle. Two main types of genomic organization of dnd loci were uncovered probably denoting their spreading in the phylum Bacteroidetes via distinct genetic transfer events.
Collapse
|
36
|
Abstract
Rapid turnover of mobile elements drives the plasticity of bacterial genomes. Integrated bacteriophages (prophages) encode host-adaptive traits and represent a sizable fraction of bacterial chromosomes. We hypothesized that natural selection shapes prophage integration patterns relative to the host genome organization. We tested this idea by detecting and studying 500 prophages of 69 strains of Escherichia and Salmonella. Phage integrases often target not only conserved genes but also intergenic positions, suggesting purifying selection for integration sites. Furthermore, most integration hotspots are conserved between the two host genera. Integration sites seem also selected at the large chromosomal scale, as they are nonrandomly organized in terms of the origin-terminus axis and the macrodomain structure. The genes of lambdoid prophages are systematically co-oriented with the bacterial replication fork and display the host high frequency of polarized FtsK-orienting polar sequences motifs required for chromosome segregation. matS motifs are strongly avoided by prophages suggesting counter selection of motifs disrupting macrodomains. These results show how natural selection for seamless integration of prophages in the chromosome shapes the evolution of the bacterium and the phage. First, integration sites are highly conserved for many millions of years favoring lysogeny over the lytic cycle for temperate phages. Second, the global distribution of prophages is intimately associated with the chromosome structure and the patterns of gene expression. Third, the phage endures selection for DNA motifs that pertain exclusively to the biology of the prophage in the bacterial chromosome. Understanding prophage genetic adaptation sheds new lights on the coexistence of horizontal transfer and organized bacterial genomes.
Collapse
|
37
|
Antibiotic resistance plasmids spread among natural isolates of Escherichia coli in spite of CRISPR elements. Microbiology (Reading) 2012; 158:2997-3004. [DOI: 10.1099/mic.0.060814-0] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
|
38
|
Abstract
Proteins secreted to the extracellular environment or to the periphery of the cell envelope, the secretome, play essential roles in foraging, antagonistic and mutualistic interactions. We hypothesize that arms races, genetic conflicts and varying selective pressures should lead to the rapid change of sequences and gene repertoires of the secretome. The analysis of 42 bacterial pan-genomes shows that secreted, and especially extracellular proteins, are predominantly encoded in the accessory genome, i.e. among genes not ubiquitous within the clade. Genes encoding outer membrane proteins might engage more frequently in intra-chromosomal gene conversion because they are more often in multi-genic families. The gene sequences encoding the secretome evolve faster than the rest of the genome and in particular at non-synonymous positions. Cell wall proteins in Firmicutes evolve particularly fast when compared with outer membrane proteins of Proteobacteria. Virulence factors are over-represented in the secretome, notably in outer membrane proteins, but cell localization explains more of the variance in substitution rates and gene repertoires than sequence homology to known virulence factors. Accordingly, the repertoires and sequences of the genes encoding the secretome change fast in the clades of obligatory and facultative pathogens and also in the clades of mutualists and free-living bacteria. Our study shows that cell localization shapes genome evolution. In agreement with our hypothesis, the repertoires and the sequences of genes encoding secreted proteins evolve fast. The particularly rapid change of extracellular proteins suggests that these public goods are key players in bacterial adaptation.
Collapse
|
39
|
CRISPR distribution within the Escherichia coli species is not suggestive of immunity-associated diversifying selection. J Bacteriol 2011; 193:2460-7. [PMID: 21421763 PMCID: PMC3133152 DOI: 10.1128/jb.01307-10] [Citation(s) in RCA: 110] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2010] [Accepted: 03/07/2011] [Indexed: 01/21/2023] Open
Abstract
In order to get further insights into the role of the clustered, regularly interspaced, short palindromic repeats (CRISPRs) in Escherichia coli, we analyzed the CRISPR diversity in a collection of 290 strains, in the phylogenetic framework of the strains represented by multilocus sequence typing (MLST). The set included 263 natural E. coli isolates exposed to various environments and isolated over a 20-year period from humans and animals, as well as 27 fully sequenced strains. Our analyses confirm that there are two largely independent pairs of CRISPR loci (CRISPR1 and -2 and CRISPR3 and -4), each associated with a different type of cas genes (Ecoli and Ypest, respectively), but that each pair of CRISPRs has similar dynamics. Strikingly, the major phylogenetic group B2 is almost devoid of CRISPRs. The majority of genomes analyzed lack Ypest cas genes and contain CRISPR3 with spacers matching Ypest cas genes. The analysis of relatedness between strains in terms of spacer repertoire and the MLST tree shows a pattern where closely related strains (MLST phylogenetic distance of <0.005 corresponding to at least hundreds of thousands of years) often exhibit identical CRISPRs while more distantly related strains (MLST distance of >0.01) exhibit completely different CRISPRs. This suggests rare but radical turnover of spacers in CRISPRs rather than CRISPR gradual change. We found no link between the presence, size, or content of CRISPRs and the lifestyle of the strains. Our data suggest that, within the E. coli species, CRISPRs do not have the expected characteristics of a classical immune system.
Collapse
|
40
|
The small, slow and specialized CRISPR and anti-CRISPR of Escherichia and Salmonella. PLoS One 2010; 5:e11126. [PMID: 20559554 PMCID: PMC2886076 DOI: 10.1371/journal.pone.0011126] [Citation(s) in RCA: 174] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Accepted: 05/25/2010] [Indexed: 11/30/2022] Open
Abstract
Prokaryotes thrive in spite of the vast number and diversity of their viruses. This partly results from the evolution of mechanisms to inactivate or silence the action of exogenous DNA. Among these, Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are unique in providing adaptive immunity against elements with high local resemblance to genomes of previously infecting agents. Here, we analyze the CRISPR loci of 51 complete genomes of Escherichia and Salmonella. CRISPR are in two pairs of loci in Escherichia, one single pair in Salmonella, each pair showing a similar turnover rate, repeat sequence and putative linkage to a common set of cas genes. Yet, phylogeny shows that CRISPR and associated cas genes have different evolutionary histories, the latter being frequently exchanged or lost. In our set, one CRISPR pair seems specialized in plasmids often matching genes coding for the replication, conjugation and antirestriction machinery. Strikingly, this pair also matches the cognate cas genes in which case these genes are absent. The unexpectedly high conservation of this anti-CRISPR suggests selection to counteract the invasion of mobile elements containing functional CRISPR/cas systems. There are few spacers in most CRISPR, which rarely match genomes of known phages. Furthermore, we found that strains divergent less than 250 thousand years ago show virtually identical CRISPR. The lack of congruence between cas, CRISPR and the species phylogeny and the slow pace of CRISPR change make CRISPR poor epidemiological markers in enterobacteria. All these observations are at odds with the expectedly abundant and dynamic repertoire of spacers in an immune system aiming at protecting bacteria from phages. Since we observe purifying selection for the maintenance of CRISPR these results suggest that alternative evolutionary roles for CRISPR remain to be uncovered.
Collapse
|
41
|
No evidence for elemental-based streamlining of prokaryotic genomes. Trends Ecol Evol 2010; 25:319-20; author reply 320-1. [PMID: 20381897 DOI: 10.1016/j.tree.2010.03.001] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2010] [Revised: 03/04/2010] [Accepted: 03/04/2010] [Indexed: 11/15/2022]
|
42
|
Horizontal gene transfer of the secretome drives the evolution of bacterial cooperation and virulence. Curr Biol 2009; 19:1683-91. [PMID: 19800234 PMCID: PMC2773837 DOI: 10.1016/j.cub.2009.08.056] [Citation(s) in RCA: 162] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2009] [Revised: 08/10/2009] [Accepted: 08/20/2009] [Indexed: 01/09/2023]
Abstract
Background Microbes engage in a remarkable array of cooperative behaviors, secreting shared proteins that are essential for foraging, shelter, microbial warfare, and virulence. These proteins are costly, rendering populations of cooperators vulnerable to exploitation by nonproducing cheaters arising by gene loss or migration. In such conditions, how can cooperation persist? Results Our model predicts that differential gene mobility drives intragenomic variation in investment in cooperative traits. More mobile loci generate stronger among-individual genetic correlations at these loci (higher relatedness) and thereby allow the maintenance of more cooperative traits via kin selection. By analyzing 21 Escherichia genomes, we confirm that genes coding for secreted proteins—the secretome—are very frequently lost and gained and are associated with mobile elements. We show that homologs of the secretome are overrepresented among human gut metagenomics samples, consistent with increased relatedness at secretome loci across multiple species. The biosynthetic cost of secreted proteins is shown to be under intense selective pressure, even more than for highly expressed proteins, consistent with a cost of cooperation driving social dilemmas. Finally, we demonstrate that mobile elements are in conflict with their chromosomal hosts over the chimeric ensemble's social strategy, with mobile elements enforcing cooperation on their otherwise selfish hosts via the cotransfer of secretome genes with “mafia strategy” addictive systems (toxin-antitoxin and restriction-modification). Conclusion Our analysis matches the predictions of our model suggesting that horizontal transfer promotes cooperation, as transmission increases local genetic relatedness at mobile loci and enforces cooperation on the resident genes. As a consequence, horizontal transfer promoted by agents such as plasmids, phages, or integrons drives microbial cooperation.
Collapse
|
43
|
Abstract
DNA repeats are causes and consequences of genome plasticity. Repeats are created by intrachromosomal recombination or horizontal transfer. They are targeted by recombination processes leading to amplifications, deletions and rearrangements of genetic material. The identification and analysis of repeats in nearly 700 genomes of bacteria and archaea is facilitated by the existence of sequence data and adequate bioinformatic tools. These have revealed the immense diversity of repeats in genomes, from those created by selfish elements to the ones used for protection against selfish elements, from those arising from transient gene amplifications to the ones leading to stable duplications. Experimental works have shown that some repeats do not carry any adaptive value, while others allow functional diversification and increased expression. All repeats carry some potential to disorganize and destabilize genomes. Because recombination and selection for repeats vary between genomes, the number and types of repeats are also quite diverse and in line with ecological variables, such as host-dependent associations or population sizes, and with genetic variables, such as the recombination machinery. From an evolutionary point of view, repeats represent both opportunities and problems. We describe how repeats are created and how they can be found in genomes. We then focus on the functional and genomic consequences of repeats that dictate their fate.
Collapse
|
44
|
Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet 2009; 5:e1000344. [PMID: 19165319 PMCID: PMC2617782 DOI: 10.1371/journal.pgen.1000344] [Citation(s) in RCA: 775] [Impact Index Per Article: 51.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Accepted: 12/16/2008] [Indexed: 01/01/2023] Open
Abstract
The Escherichia coli species represents one of the best-studied model organisms, but also encompasses a variety of commensal and pathogenic strains that diversify by high rates of genetic change. We uniformly (re-) annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species), including seven that we sequenced to completion. Within the ∼18,000 families of orthologous genes, we found ∼2,000 common to all strains. Although recombination rates are much higher than mutation rates, we show, both theoretically and using phylogenetic inference, that this does not obscure the phylogenetic signal, which places the B2 phylogenetic group and one group D strain at the basal position. Based on this phylogeny, we inferred past evolutionary events of gain and loss of genes, identifying functional classes under opposite selection pressures. We found an important adaptive role for metabolism diversification within group B2 and Shigella strains, but identified few or no extraintestinal virulence-specific genes, which could render difficult the development of a vaccine against extraintestinal infections. Genome flux in E. coli is confined to a small number of conserved positions in the chromosome, which most often are not associated with integrases or tRNA genes. Core genes flanking some of these regions show higher rates of recombination, suggesting that a gene, once acquired by a strain, spreads within the species by homologous recombination at the flanking genes. Finally, the genome's long-scale structure of recombination indicates lower recombination rates, but not higher mutation rates, at the terminus of replication. The ensuing effect of background selection and biased gene conversion may thus explain why this region is A+T-rich and shows high sequence divergence but low sequence polymorphism. Overall, despite a very high gene flow, genes co-exist in an organised genome. Although abundant knowledge has been accumulated regarding the E. coli laboratory strain K-12, little is known about the evolutionary trajectories that have driven the high diversity observed among natural isolates of the species, which encompass both commensal and highly virulent intestinal and extraintestinal pathogenic strains. We have annotated or re-annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species), including seven that we sequenced to completion. Although recombination rates are much higher than mutation rates, we were able to reconstruct a robust phylogeny based on the ∼2,000 genes common to all strains. Based on this phylogeny, we established the evolutionary scenario of gains and losses of thousands of specific genes, identifying functional classes under opposite selection pressures. This genome flux is confined to very few positions in the chromosome, which are the same for every genome. Notably, we identified few or no extraintestinal virulence-specific genes. We also defined a long-scale structure of recombination in the genome with lower recombination rates at the terminus of replication. These findings demonstrate that, despite a very high gene flow, genes can co-exist in an organised genome.
Collapse
|
45
|
DNA replication timing data corroborate in silico human replication origin predictions. PHYSICAL REVIEW LETTERS 2007; 99:248102. [PMID: 18233493 DOI: 10.1103/physrevlett.99.248102] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2007] [Indexed: 05/25/2023]
Abstract
We develop a wavelet-based multiscale pattern recognition methodology to disentangle the replication- from the transcription-associated compositional strand asymmetries observed in the human genome. Comparing replication skew profiles to recent high-resolution replication timing data reveals that most of the putative replication origins that border the so-identified replication domains are replicated earlier than their surroundings whereas the central regions replicate late in the S phase. We discuss the implications of this first experimental confirmation of these replication origin predictions that are likely to be early replicating and active in most tissues.
Collapse
|
46
|
From GC skews to wavelets: a gentle guide to the analysis of compositional asymmetries in genomic data. Biochimie 2007; 90:648-59. [PMID: 17988781 DOI: 10.1016/j.biochi.2007.09.015] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2007] [Accepted: 09/21/2007] [Indexed: 12/29/2022]
Abstract
Compositional asymmetries are pervasive in DNA sequences. They are the result of the asymmetric interactions between DNA and cellular mechanisms such as replication and transcription. Here, we review many of the methods that have been proposed over the years to analyse compositional asymmetries in DNA sequences. Among these we list GC skews, oligonucleotide skews and wavelets, which among other uses have been extensively employed to delimitate origins and termini of replication in genomes. We also review the use of multivariate methods, such as factorial correspondence analysis, discriminant analysis and analysis of variance, which allow assigning compositional strand asymmetries to the different biological processes shaping sequence composition. Finally, we review methods that have been used to infer substitution matrices and allow understanding the mutational processes underlying strand asymmetry. We focus on replication asymmetries because they have been more thoroughly studied, but the methods may be adapted, and often are, to other problems. Although strand asymmetry has been studied more frequently through compositional skews of nucleotides or oligonucleotides, we recall that, depending on the goal of the analysis, other methods may be more appropriate to answer certain biological questions. We also refer to programs freely available to analyse strand asymmetry.
Collapse
|
47
|
Abstract
In this work, we investigated a large-scale organization of the human genes with respect to putative replication origins. We developed an appropriate multiscale method to analyze the nucleotide compositional skew along the genome and found that in more than one-quarter of the genome, the skew profile presents characteristic patterns consisting of successions of N-shaped structures, designated here N-domains, bordered by putative replication origins. Our analysis of recent experimental timing data confirmed that, in a number of cases, domain borders coincide with replication initiation zones active in the early S phase, whereas the central regions replicate in the late S phase. Around the putative origins, genes are abundant and broadly expressed, and their transcription is co-oriented with replication fork progression. These features weaken progressively with the distance from putative replication origins. At the center of domains, genes are rare and expressed in few tissues. We propose that this specific organization could result from the constraints of accommodating the replication and transcription initiation processes at chromatin level, and reducing head-on collisions between the two machineries. Our findings provide a new model of gene organization in the human genome, which integrates transcription, replication, and chromatin structure as coordinated determinants of genome architecture.
Collapse
|
48
|
Bifractality of human DNA strand-asymmetry profiles results from transcription. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007; 75:032902. [PMID: 17500744 DOI: 10.1103/physreve.75.032902] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2006] [Indexed: 05/15/2023]
Abstract
We use the wavelet transform modulus maxima method to investigate the multifractal properties of strand-asymmetry DNA walk profiles in the human genome. This study reveals the bifractal nature of these profiles, which involve two competing scale-invariant (up to repeat-masked distances less, or similar 40 kbp) components characterized by Hölder exponents h{1}=0.78 and h{2}=1, respectively. The former corresponds to the long-range-correlated homogeneous fluctuations previously observed in DNA walks generated with structural codings. The latter is associated with the presence of jumps in the original strand-asymmetry noisy signal S. We show that a majority of upward (downward) jumps co-locate with gene transcription start (end) sites. Here 7228 human gene transcription start sites from the refGene database are found within 2 kbp from an upward jump of amplitude DeltaS > or = 0.1 which suggests that about 36% of annotated human genes present significant transcription-induced strand asymmetry and very likely high expression rate.
Collapse
|
49
|
Abstract
Insertion sequences (ISs) are the smallest and most frequent transposable elements in prokaryotes where they play an important evolutionary role by promoting gene inactivation and genome plasticity. Their genomic abundance varies by several orders of magnitude for reasons largely unknown and widely speculated. The current availability of hundreds of genomes renders testable many of these hypotheses, notably that IS abundance correlates positively with the frequency of horizontal gene transfer (HGT), genome size, pathogenicity, nonobligatory ecological associations, and human association. We thus reannotated ISs in 262 prokaryotic genomes and tested these hypotheses showing that when using appropriate controls, there is no empirical basis for IS family specificity, pathogenicity, or human association to influence IS abundance or density. HGT seems necessary for the presence of ISs, but cannot alone explain the absence of ISs in more than 20% of the organisms, some of which showing high rates of HGT. Gene transfer is also not a significant determinant of the abundance of IS elements in genomes, suggesting that IS abundance is controlled at the level of transposition and ensuing natural selection and not at the level of infection. Prokaryotes engaging in obligatory associations have fewer ISs when controlled for genome size, but this may be caused by some being sexually isolated. Surprisingly, genome size is the only significant predictor of IS numbers and density. Alone, it explains over 40% of the variance of IS abundance. Because we find that genome size and IS abundance correlate negatively with minimal doubling times, we conclude that selection for rapid replication cannot account for the few ISs found in small genomes. Instead, we show evidence that IS numbers are controlled by the frequency of highly deleterious insertion targets. Indeed, IS abundance increases quickly with genome size, which is the exact inverse trend found for the density of genes under strong selection such as essential genes. Hence, for ISs, the bigger the genome the better.
Collapse
|
50
|
Abstract
Compositional replication strand bias, commonly referred to as GC skew, is present in many genomes of prokaryotes, eukaryotes, and viruses. Although cytosine deamination in ssDNA (resulting in C-->T changes on the leading strand) is often invoked as its major cause, the precise contributions of this and other substitution types are currently unknown. It is also unclear if the underlying mutational asymmetries are the same among taxa, are stable over time, or how closely the observed biases are to mutational equilibrium. We analyzed nearly neutral sites of seven taxa each with between three and six complete bacterial genomes, and inferred the substitution spectra of fourfold degenerate positions in nonhighly expressed genes. Using a bootstrap procedure, we extracted compositional biases associated with replication and identified the significant asymmetries. Although all taxa showed an overrepresentation of G relative to C on the leading strand (and imbalances between A and T), widely variable substitution asymmetries are noted. Surprisingly, all substitution types show significant asymmetry in at least one taxon, but none were universally biased in all taxa. Notably, in the two most biased genomes, A-->G, rather than C-->T, shapes the compositional bias. Given the variability in these biases, we propose that the process is multifactorial. Finally, we also find that most genomes are not at compositional equilibrium, and suggest that mutational-based heterotachy is deeply imprinted in the history of biological macromolecules. This shows that similar compositional biases associated with the same essential well-conserved process, replication, do not reflect similar mutational processes in different genomes, and that caution is required in inferring the roles of specific mutational biases on the basis of contemporary patterns of sequence composition.
Collapse
|