1
|
Torrance EL, Diop A, Bobay LM. Homologous Recombination Shapes the Architecture and Evolution of Bacterial Genomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.31.596828. [PMID: 38895235 PMCID: PMC11185547 DOI: 10.1101/2024.05.31.596828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Homologous recombination is a key evolutionary force that varies considerably across bacterial species. However, how the landscape of homologous recombination varies across genes and within individual genomes has only been studied in a few species. Here, we used Approximate Bayesian Computation to estimate the recombination rate along the genomes of 145 bacterial species. Our results show that homologous recombination varies greatly along bacterial genomes and shapes many aspects of genome architecture and evolution. The genomic landscape of recombination presents several key signatures: rates are highest near the origin of replication in most species, patterns of recombination generally appear symmetrical in both replichores (i.e. replicational halves of circular chromosomes) and most species have genomic hotpots of recombination. Furthermore, many closely related species share conserved landscapes of recombination across orthologs indicating that recombination landscapes are conserved over significant evolutionary distances. We show evidence that recombination drives the evolution of GC-content through increasing the effectiveness of selection and not through biased gene conversion, thereby contributing to an ongoing debate. Finally, we demonstrate that the rate of recombination varies across gene function and that many hotspots of recombination are associated with adaptive and mobile regions often encoding genes involved in pathogenicity.
Collapse
Affiliation(s)
- Ellis L Torrance
- Dept. of Biology, University of North Carolina Greensboro, Greensboro, NC 27412
- Systems Biology Dept., Sandia National Laboratories, Livermore, CA 94551
| | - Awa Diop
- Dept. of Biological Sciences, North Carolina State University, Raleigh, NC 27695
| | - Louis-Marie Bobay
- Dept. of Biology, University of North Carolina Greensboro, Greensboro, NC 27412
- Dept. of Biological Sciences, North Carolina State University, Raleigh, NC 27695
| |
Collapse
|
2
|
Torrance EL, Burton C, Diop A, Bobay LM. Evolution of homologous recombination rates across bacteria. Proc Natl Acad Sci U S A 2024; 121:e2316302121. [PMID: 38657048 PMCID: PMC11067023 DOI: 10.1073/pnas.2316302121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 03/08/2024] [Indexed: 04/26/2024] Open
Abstract
Bacteria are nonsexual organisms but are capable of exchanging DNA at diverse degrees through homologous recombination. Intriguingly, the rates of recombination vary immensely across lineages where some species have been described as purely clonal and others as "quasi-sexual." However, estimating recombination rates has proven a difficult endeavor and estimates often vary substantially across studies. It is unclear whether these variations reflect natural variations across populations or are due to differences in methodologies. Consequently, the impact of recombination on bacterial evolution has not been extensively evaluated and the evolution of recombination rate-as a trait-remains to be accurately described. Here, we developed an approach based on Approximate Bayesian Computation that integrates multiple signals of recombination to estimate recombination rates. We inferred the rate of recombination of 162 bacterial species and one archaeon and tested the robustness of our approach. Our results confirm that recombination rates vary drastically across bacteria; however, we found that recombination rate-as a trait-is conserved in several lineages but evolves rapidly in others. Although some traits are thought to be associated with recombination rate (e.g., GC-content), we found no clear association between genomic or phenotypic traits and recombination rate. Overall, our results provide an overview of recombination rate, its evolution, and its impact on bacterial evolution.
Collapse
Affiliation(s)
- Ellis L Torrance
- Department of Biology, University of North Carolina, Greensboro, NC 27412
| | - Corey Burton
- Department of Biology, University of North Carolina, Greensboro, NC 27412
| | - Awa Diop
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695
| | - Louis-Marie Bobay
- Department of Biology, University of North Carolina, Greensboro, NC 27412
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695
| |
Collapse
|
3
|
Atre M, Joshi B, Babu J, Sawant S, Sharma S, Sankar TS. Origin, evolution, and maintenance of gene-strand bias in bacteria. Nucleic Acids Res 2024; 52:3493-3509. [PMID: 38442257 DOI: 10.1093/nar/gkae155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 02/06/2024] [Accepted: 02/19/2024] [Indexed: 03/07/2024] Open
Abstract
Gene-strand bias is a characteristic feature of bacterial genome organization wherein genes are preferentially encoded on the leading strand of replication, promoting co-orientation of replication and transcription. This co-orientation bias has evolved to protect gene essentiality, expression, and genomic stability from the harmful effects of head-on replication-transcription collisions. However, the origin, variation, and maintenance of gene-strand bias remain elusive. Here, we reveal that the frequency of inversions that alter gene orientation exhibits large variation across bacterial populations and negatively correlates with gene-strand bias. The density, distance, and distribution of inverted repeats show a similar negative relationship with gene-strand bias explaining the heterogeneity in inversions. Importantly, these observations are broadly evident across the entire bacterial kingdom uncovering inversions and inverted repeats as primary factors underlying the variation in gene-strand bias and its maintenance. The distinct catalytic subunits of replicative DNA polymerase have co-evolved with gene-strand bias, suggesting a close link between replication and the origin of gene-strand bias. Congruently, inversion frequencies and inverted repeats vary among bacteria with different DNA polymerases. In summary, we propose that the nature of replication determines the fitness cost of replication-transcription collisions, establishing a selection gradient on gene-strand bias by fine-tuning DNA sequence repeats and, thereby, gene inversions.
Collapse
Affiliation(s)
- Malhar Atre
- School of Biology, Indian Institute of Science Education and Research, Thiruvananthapuram, Kerala 695551, India
| | - Bharat Joshi
- School of Biology, Indian Institute of Science Education and Research, Thiruvananthapuram, Kerala 695551, India
| | - Jebin Babu
- School of Biology, Indian Institute of Science Education and Research, Thiruvananthapuram, Kerala 695551, India
| | - Shabduli Sawant
- School of Biology, Indian Institute of Science Education and Research, Thiruvananthapuram, Kerala 695551, India
| | - Shreya Sharma
- School of Biology, Indian Institute of Science Education and Research, Thiruvananthapuram, Kerala 695551, India
| | - T Sabari Sankar
- School of Biology, Indian Institute of Science Education and Research, Thiruvananthapuram, Kerala 695551, India
| |
Collapse
|
4
|
Diop K, Pidgeon R, Diop A, Benlaïfaoui M, Belkaid W, Malo J, Bernet E, Veyrier F, Jacq M, Brun Y, Elkrief A, Castagner B, Routy B, Richard C. Characterization and description of Gabonibacter chumensis sp. nov., isolated from feces of a patient with non-small cell lung cancer treated with immunotherapy. Arch Microbiol 2023; 205:338. [PMID: 37742282 PMCID: PMC10518271 DOI: 10.1007/s00203-023-03671-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 08/21/2023] [Accepted: 08/29/2023] [Indexed: 09/26/2023]
Abstract
A polyphasic taxonomic approach, incorporating analysis of phenotypic features, cellular fatty acid profiles, 16S rRNA gene sequences, and determination of average nucleotide identity (ANI) plus digital DNA-DNA hybridization (dDDH), was applied to characterize an anaerobic bacterial strain designated KD22T isolated from human feces. 16S rRNA gene-based phylogenetic analysis showed that strain KD22T was found to be most closely related to species of the genus Gabonibacter. At the 16S rRNA gene level, the closest species from the strain KD22T corresponded with Gabonibacter massiliensis GM7T, with a similarity of 97.58%. Cells of strain KD22T were Gram-negative coccobacillus, positive for indole and negative for catalase, nitrate reduction, oxidase, and urease activities. The fatty acid analysis demonstrated the presence of a high concentration of iso-C15: 0 (51.65%). Next, the complete whole-genome sequence of strain KD22T was 3,368,578 bp long with 42 mol% of DNA G + C contents. The DDH and ANI values between KD22T and type strains of phylogenetically related species were 67.40% and 95.43%, respectively. These phylogenetic, phenotypic, and genomic results supported the affiliation of strain KD22T as a novel bacterial species within the genus Gabonibacter. The proposed name is Gabonibacter chumensis and the type strain is KD22T (= CSUR Q8104T = DSM 115208 T).
Collapse
Affiliation(s)
- Khoudia Diop
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada.
| | - Reilly Pidgeon
- Department of Pharmacology & Therapeutics, Faculty of Medicine and Health Sciences, McGill University, 3655 Promenade Sir-William-Osler, Montreal, QC, H3G 1Y6, Canada
| | - Awa Diop
- Department of Biology, University of North Carolina Greensboro, 321 McIver Street, PO Box 26170, Greensboro, NC, 27402, USA
| | - Myriam Benlaïfaoui
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada
| | - Wiam Belkaid
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada
| | - Julie Malo
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada
| | - Eve Bernet
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, QC, H7V 1B7, Canada
| | - Frederic Veyrier
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, QC, H7V 1B7, Canada
| | - Maxime Jacq
- Faculty of Medicine, Department of Microbiology and Immunology, University of Montreal, Montreal, QC, Canada
| | - Yves Brun
- Faculty of Medicine, Department of Microbiology and Immunology, University of Montreal, Montreal, QC, Canada
| | - Arielle Elkrief
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada
| | - Bastien Castagner
- Department of Pharmacology & Therapeutics, Faculty of Medicine and Health Sciences, McGill University, 3655 Promenade Sir-William-Osler, Montreal, QC, H3G 1Y6, Canada
| | - Bertrand Routy
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada.
- Hematology-Oncology Service, Department of Medicine, University of Montreal Healthcare Centre (CHUM), Montreal, QC, H2X 0A9, Canada.
| | - Corentin Richard
- Laboratory of Immunotherapy and Onco-Microbiome, University of Montreal Healthcare Research Center (CRCHUM), 900 Rue Saint-Denis, Montreal, QC, H2X 0A9, Canada
| |
Collapse
|
5
|
Liu D, Liu LL, Zheng XQ, Chen R, Lin LR, Yang TC, Tong ML. Genetic Profiling of the Full-Length tprK Gene in Patients with Primary and Secondary Syphilis. Microbiol Spectr 2023; 11:e0493122. [PMID: 37036342 PMCID: PMC10269439 DOI: 10.1128/spectrum.04931-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 03/17/2023] [Indexed: 04/11/2023] Open
Abstract
TprK antigenic variation is acknowledged as an important strategy developed by Treponema pallidum to achieve immune evasion. Previous studies applied short-read sequencing to explore tprK gene sequence diversity in clinical samples; however, due to the limitations of short-read sequencing, it was difficult to determine the linkage between the seven V regions, and crucial information about full-length tprK variants was lost. Although two recent studies explored complete tprK gene profiles in natural human syphilis infection, there are still too few profiled full-length tprK variants among clinical T. pallidum isolates to fully understand the characteristics of TprK coding diversity. Here, Pacific Biosciences (PacBio) long-read sequencing was applied to examine the diversity of full-length tprK variants in 21 clinical T. pallidum isolates from 11 patients with primary syphilis and 10 patients with secondary syphilis. A total of 398 high-confidence full-length sequences, which presented remarkable sequence heterogeneity, were found. However, these full-length tprK variants exhibited limited variation in length and GC content, showing 24 length types and average GC content of 51.5 ± 0.42% and 51.6 ± 0.26% for primary and secondary syphilis samples, respectively. Additionally, the combined patterns of mutated V regions generating new tprK variants were obviously different in primary and secondary syphilis samples. The diversity of tprK gene sequences in primary syphilis samples may represent the underlying variability of the bacterium; conversely, the variability of the tprK gene in secondary syphilis samples may more accurately reflect how T. pallidum escapes host immune clearance. These data highlight the tprK gene as an important coding gene that shows conflicting genetic characteristics but underlies the persistence of spirochete infection. IMPORTANCE The resurgence of syphilis in both low- and high-income countries has attracted attention, and persistent infection by the pathogen has long been a research focus. The tprK gene, encoding the hypervariable outer membrane protein, is thought to be responsible for pathogen immune evasion and persistent infection. Here, PacBio long-read sequencing was applied to examine the diversity of full-length tprK variants in 21 clinical T. pallidum isolates from 11 patients with primary syphilis and 10 patients with secondary syphilis. The results showed that the sequences of the tprK gene were remarkably heterogeneous; however, the sequences presented limited variation in length and GC content. The investigation of the combined patterns of the V regions allowed us to gain insight into the features of the tprK gene generating new variants at different clinical stages. The findings of this study will be helpful for further exploration of the pathogenesis of syphilis.
Collapse
Affiliation(s)
- Dan Liu
- Center of Clinical Laboratory, Zhongshan Hospital, School of Medicine, Xiamen University, Xiamen, China
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| | - Li-Li Liu
- Center of Clinical Laboratory, Zhongshan Hospital, School of Medicine, Xiamen University, Xiamen, China
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| | - Xin-Qi Zheng
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| | - Rui Chen
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| | - Li-Rong Lin
- Center of Clinical Laboratory, Zhongshan Hospital, School of Medicine, Xiamen University, Xiamen, China
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| | - Tian-Ci Yang
- Center of Clinical Laboratory, Zhongshan Hospital, School of Medicine, Xiamen University, Xiamen, China
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| | - Man-Li Tong
- Center of Clinical Laboratory, Zhongshan Hospital, School of Medicine, Xiamen University, Xiamen, China
- Institute of Infectious Disease, School of Medicine, Xiamen University, Xiamen, China
| |
Collapse
|
6
|
Diop A, Torrance EL, Stott CM, Bobay LM. Gene flow and introgression are pervasive forces shaping the evolution of bacterial species. Genome Biol 2022; 23:239. [PMID: 36357919 PMCID: PMC9650840 DOI: 10.1186/s13059-022-02809-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 10/24/2022] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Although originally thought to evolve clonally, studies have revealed that most bacteria exchange DNA. However, it remains unclear to what extent gene flow shapes the evolution of bacterial genomes and maintains the cohesion of species. RESULTS Here, we analyze the patterns of gene flow within and between >2600 bacterial species. Our results show that fewer than 10% of bacterial species are truly clonal, indicating that purely asexual species are rare in nature. We further demonstrate that the taxonomic criterion of ~95% genome sequence identity routinely used to define bacterial species does not accurately represent a level of divergence that imposes an effective barrier to gene flow across bacterial species. Interruption of gene flow can occur at various sequence identities across lineages, generally from 90 to 98% genome identity. This likely explains why a ~95% genome sequence identity threshold has empirically been judged as a good approximation to define bacterial species. Our results support a universal mechanism where the availability of identical genomic DNA segments required to initiate homologous recombination is the primary determinant of gene flow and species boundaries in bacteria. We show that these barriers of gene flow remain porous since many distinct species maintain some level of gene flow, similar to introgression in sexual organisms. CONCLUSIONS Overall, bacterial evolution and speciation are likely shaped by similar forces driving the evolution of sexual organisms. Our findings support a model where the interruption of gene flow-although not necessarily the initial cause of speciation-leads to the establishment of permanent and irreversible species borders.
Collapse
Affiliation(s)
- Awa Diop
- grid.266860.c0000 0001 0671 255XDepartment of Biology, University of North Carolina Greensboro, Greensboro, North Carolina, 321 McIver Street, PO Box 26170, Greensboro, NC 27402 USA
| | - Ellis L. Torrance
- grid.266860.c0000 0001 0671 255XDepartment of Biology, University of North Carolina Greensboro, Greensboro, North Carolina, 321 McIver Street, PO Box 26170, Greensboro, NC 27402 USA
| | - Caroline M. Stott
- grid.266860.c0000 0001 0671 255XDepartment of Biology, University of North Carolina Greensboro, Greensboro, North Carolina, 321 McIver Street, PO Box 26170, Greensboro, NC 27402 USA
| | - Louis-Marie Bobay
- grid.266860.c0000 0001 0671 255XDepartment of Biology, University of North Carolina Greensboro, Greensboro, North Carolina, 321 McIver Street, PO Box 26170, Greensboro, NC 27402 USA
| |
Collapse
|
7
|
Bohlin J. A simple stochastic model describing the evolution of genomic GC content in asexually reproducing organisms. Sci Rep 2022; 12:18569. [PMID: 36329129 PMCID: PMC9631610 DOI: 10.1038/s41598-022-21709-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022] Open
Abstract
A genome's nucleotide composition can usually be summarized with (G)uanine + (C)ytosine (GC) or (A)denine + (T)hymine (AT) frequencies as GC% = 100% - AT%. Genomic AT/GC content has been linked to environment and selective processes in asexually reproducing organisms. A model is presented relating the evolution of genomic GC content over time to AT [Formula: see text] GC and GC [Formula: see text] AT mutation rates. By employing Itô calculus it is shown that if mutation rates are subject to random perturbations, that can vary over time, several implications follow. In particular, an extra Brownian motion term appears influencing genomic nucleotide variability; the greater the random perturbations the more genomic nucleotide variability. This can have several interpretations depending on the context. For instance, reducing the influence of the random perturbations on the AT/GC mutation rates and thus genomic nucleotide variability, to limit fitness decreasing and deleterious mutations, will likely suggest channeling of resources. On the other hand, increased genomic nucleotide diversity may be beneficial in variable environments. In asexually reproducing organisms, the Brownian motion term can be considered to be inversely reflective of the selective pressures an organism is subjected to at the molecular level. The presented model is a generalization of a previous model, limited to microbial symbionts, to all asexually reproducing, non-recombining organisms. Last, a connection between the presented model and the classical Luria-Delbrück mutation model is presented in an Itô calculus setting.
Collapse
Affiliation(s)
- Jon Bohlin
- grid.418193.60000 0001 1541 4204Division of Infection Control, Department of Methods Development and Analysis, Norwegian Institute of Public Health, Oslo, Norway ,grid.418193.60000 0001 1541 4204Centre for Fertility and Health, Norwegian Institute of Public Health, P.O. Box 4404, Lovisenberggata 8, 0403 Oslo, Norway
| |
Collapse
|
8
|
Mahajan S, Agashe D. Evolutionary jumps in bacterial GC content. G3 GENES|GENOMES|GENETICS 2022; 12:6586800. [PMID: 35579351 PMCID: PMC9339322 DOI: 10.1093/g3journal/jkac108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 04/20/2022] [Indexed: 11/29/2022]
Abstract
Genomic GC (Guanine-Cytosine) content is a fundamental molecular trait linked with many key genomic features such as codon and amino acid use. Across bacteria, GC content is surprisingly diverse and has been studied for many decades; yet its evolution remains incompletely understood. Since it is difficult to observe GC content evolve on laboratory time scales, phylogenetic comparative approaches are instrumental; but this dimension is rarely studied systematically in the case of bacterial GC content. We applied phylogenetic comparative models to analyze GC content evolution in multiple bacterial groups across 2 major bacterial phyla. We find that GC content diversifies via a combination of gradual evolution and evolutionary “jumps.” Surprisingly, unlike prior reports that solely focused on reductions in GC, we found a comparable number of jumps with both increased and decreased GC content. Overall, many of the identified jumps occur in lineages beyond the well-studied peculiar examples of endosymbiotic and AT-rich marine bacteria and do not support the predicted role of oxygen dependence. Our analysis of rapid and large shifts in GC content thus identifies new clades and novel contexts to further understand the ecological and evolutionary drivers of this important genomic trait.
Collapse
Affiliation(s)
- Saurabh Mahajan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research , Bengaluru 560065, India
- Atria University , Bengaluru 560024, India
| | - Deepa Agashe
- National Centre for Biological Sciences, Tata Institute of Fundamental Research , Bengaluru 560065, India
| |
Collapse
|
9
|
Detecting Potentially Adaptive Mutations from the Parallel and Fixed Patterns in SARS-CoV-2 Evolution. Viruses 2022; 14:v14051087. [PMID: 35632828 PMCID: PMC9147038 DOI: 10.3390/v14051087] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 05/10/2022] [Accepted: 05/17/2022] [Indexed: 02/01/2023] Open
Abstract
Early identification of adaptive mutations could provide timely help for the control and prevention of the COVID-19 pandemic. The fast accumulation of SARS-CoV-2 sequencing data provides important support, while also raising a great challenge for the recognition of adaptive mutations. Here, we proposed a computational strategy to detect potentially adaptive mutations from their fixed and parallel patterns in the phylogenetic trajectory. We found that the biological meanings of fixed substitution and parallel mutation are highly complementary, and can reasonably be integrated as a fixed and parallel (paraFix) mutation, to identify potentially adaptive mutations. Tracking the dynamic evolution of SARS-CoV-2, 37 sites in spike protein were identified as having experienced paraFix mutations. Interestingly, 70% (26/37) of them have already been experimentally confirmed as adaptive mutations. Moreover, most of the mutations could be inferred as paraFix mutations one month earlier than when they became regionally dominant. Overall, we believe that the concept of paraFix mutations will help researchers to identify potentially adaptive mutations quickly and accurately, which will provide invaluable clues for disease control and prevention.
Collapse
|
10
|
Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping. Life (Basel) 2022; 12:life12050670. [PMID: 35629339 PMCID: PMC9147691 DOI: 10.3390/life12050670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 04/24/2022] [Accepted: 04/25/2022] [Indexed: 12/02/2022] Open
Abstract
Epidemiological surveillance of bacterial pathogens requires real-time data analysis with a fast turnaround, while aiming at generating two main outcomes: (1) species-level identification and (2) variant mapping at different levels of genotypic resolution for population-based tracking and surveillance, in addition to predicting traits such as antimicrobial resistance (AMR). Multi-locus sequence typing (MLST) aids this process by identifying sequence types (ST) based on seven ubiquitous genome-scattered loci. In this paper, we selected one assembly-dependent and one assembly-free method for ST mapping and applied them with the default settings and ST schemes they are distributed with, and systematically assessed their accuracy and scalability across a wide array of phylogenetically divergent Public Health-relevant bacterial pathogens with available MLST databases. Our data show that the optimal k-mer length for stringMLST is species-specific and that genome-intrinsic and -extrinsic features can affect the performance and accuracy of the program. Although suitable parameters could be identified for most organisms, there were instances where this program may not be directly deployable in its current format. Next, we integrated stringMLST into our freely available and scalable hierarchical-based population genomics platform, ProkEvo, and further demonstrated how the implementation facilitates automated, reproducible bacterial population analysis.
Collapse
|
11
|
Lin X, McNichol J, Chu X, Qian Y, Luo H. Cryptic niche differentiation of novel sediment ecotypes of Rugeria pomeroyi correlates with nitrate respiration. Environ Microbiol 2021; 24:390-403. [PMID: 34964547 DOI: 10.1111/1462-2920.15882] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 12/16/2021] [Accepted: 12/18/2021] [Indexed: 10/19/2022]
Abstract
Marine intertidal sediments fluctuate in redox conditions and nutrient availability, and they are also known as an important sink of nitrogen mainly through denitrification, yet how denitrifying bacteria adapt to this dynamic habitat remains largely untapped. Here, we investigated novel intertidal benthic ecotypes of the model pelagic marine bacterium Ruegeria pomeroyi DSS-3 with a population genomic approach. While differing by only 1.3% at the 16S rRNA gene level, members of the intertidal benthic ecotypes are complete denitrifiers whereas the pelagic ecotype representative (DSS-3) is a partial denitrifier lacking a nitrate reductase. The intertidal benthic ecotypes are further differentiated by using non-homologous nitrate reductases and a different set of genes that allow alleviating oxidative stress and acquiring organic substrates. In the presence of nitrate, the two ecotypes showed contrasting growth patterns under initial oxygen concentrations at 1 vol% versus 7 vol% and supplemented with different carbon sources abundant in intertidal sediments. Collectively, this combination of evidence indicates that there are cryptic niches in coastal intertidal sediments that support divergent evolution of denitrifying bacteria. This knowledge will in turn help understand how these benthic environments operate to effectively remove nitrogen.
Collapse
Affiliation(s)
- Xingqin Lin
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518000, China
| | - Jesse McNichol
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
| | - Xiao Chu
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
| | - Yang Qian
- Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
| | - Haiwei Luo
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518000, China.,Simon F. S. Li Marine Science Laboratory, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
| |
Collapse
|
12
|
Estimating maximal microbial growth rates from cultures, metagenomes, and single cells via codon usage patterns. Proc Natl Acad Sci U S A 2021; 118:2016810118. [PMID: 33723043 PMCID: PMC8000110 DOI: 10.1073/pnas.2016810118] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Despite the wide perception that microbes have rapid growth rates, many environments like seawater and soil are often dominated by microorganisms that can only grow very slowly. Our knowledge about growth is necessarily biased toward easily culturable organisms, which tend to be those that grow fast, because microbial growth rates have traditionally been measured using laboratory growth experiments. However, how are potential growth rates distributed in nature? Using genomic data, we predicted the growth rates of over 200,000 organisms, including many as yet uncultivated species. These data reveal how current culture collections are strongly biased toward fast-growing organisms. Finally, we noticed a bimodal distribution of maximal growth rates, suggesting a natural division of microbial growth strategies into two classes. Maximal growth rate is a basic parameter of microbial lifestyle that varies over several orders of magnitude, with doubling times ranging from a matter of minutes to multiple days. Growth rates are typically measured using laboratory culture experiments. Yet, we lack sufficient understanding of the physiology of most microbes to design appropriate culture conditions for them, severely limiting our ability to assess the global diversity of microbial growth rates. Genomic estimators of maximal growth rate provide a practical solution to survey the distribution of microbial growth potential, regardless of cultivation status. We developed an improved maximal growth rate estimator and predicted maximal growth rates from over 200,000 genomes, metagenome-assembled genomes, and single-cell amplified genomes to survey growth potential across the range of prokaryotic diversity; extensions allow estimates from 16S rRNA sequences alone as well as weighted community estimates from metagenomes. We compared the growth rates of cultivated and uncultivated organisms to illustrate how culture collections are strongly biased toward organisms capable of rapid growth. Finally, we found that organisms naturally group into two growth classes and observed a bias in growth predictions for extremely slow-growing organisms. These observations ultimately led us to suggest evolutionary definitions of oligotrophy and copiotrophy based on the selective regime an organism occupies. We found that these growth classes are associated with distinct selective regimes and genomic functional potentials.
Collapse
|
13
|
Wasser D, Borst A, Hammelmann M, Ludt K, Soppa J. Characterization of Non-selected Intermolecular Gene Conversion in the Polyploid Haloarchaeon Haloferax volcanii. Front Microbiol 2021; 12:680854. [PMID: 34177864 PMCID: PMC8223754 DOI: 10.3389/fmicb.2021.680854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 05/14/2021] [Indexed: 11/13/2022] Open
Abstract
Gene conversion is defined as the non-reciprocal transfer of genetic information from one site to a homologous, but not identical site of the genome. In prokaryotes, gene conversion can increase the variance of sequences, like in antigenic variation, but can also lead to a homogenization of sequences, like in the concerted evolution of multigene families. In contrast to these intramolecular mechanisms, the intermolecular gene conversion in polyploid prokaryotes, which leads to the equalization of the multiple genome copies, has hardly been studied. We have previously shown the intermolecular gene conversion in halophilic and methanogenic archaea is so efficient that it can be studied without selecting for conversion events. Here, we have established an approach to characterize unselected intermolecular gene conversion in Haloferax volcanii making use of two genes that encode enzymes involved in carotenoid biosynthesis. Heterozygous strains were generated by protoplast fusion, and gene conversion was quantified by phenotype analysis or/and PCR. It was verified that unselected gene conversion is extremely efficient and it was shown that gene conversion tracts are much longer than in antigenic variation or concerted evolution in bacteria. Two sites were nearly always co-converted when they were 600 bp apart, and more than 30% co-conversion even occurred when two sites were 5 kbp apart. The gene conversion frequency was independent from the extent of genome differences, and even a one nucleotide difference triggered conversion.
Collapse
Affiliation(s)
- Daniel Wasser
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Andreas Borst
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Mathias Hammelmann
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Katharina Ludt
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Jörg Soppa
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| |
Collapse
|
14
|
Gabashvili E, Kobakhidze S, Koulouris S, Robinson T, Kotetishvili M. Bi- and Multi-directional Gene Transfer in the Natural Populations of Polyvalent Bacteriophages, and Their Host Species Spectrum Representing Foodborne Versus Other Human and/or Animal Pathogens. FOOD AND ENVIRONMENTAL VIROLOGY 2021; 13:179-202. [PMID: 33484405 DOI: 10.1007/s12560-021-09460-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Accepted: 01/06/2021] [Indexed: 06/12/2023]
Abstract
Unraveling the trends of phage-host versus phage-phage coevolution is critical for avoiding possible undesirable outcomes from the use of phage preparations intended for therapeutic, food safety or environmental safety purposes. We aimed to investigate a phenomenon of intergeneric recombination and its trajectories across the natural populations of phages predominantly linked to foodborne pathogens. The results from the recombination analyses, using a large array of the recombination detection algorithms imbedded in SplitsTree, RDP4, and Simplot software packages, provided strong evidence (fit: 100; P ≤ 0.014) for both bi- and multi-directional intergeneric recombination of the genetic loci involved collectively in phage morphogenesis, host specificity, virulence, replication, and persistence. Intergeneric recombination was determined to occur not only among conspecifics of the virulent versus temperate phages but also between the phages with these different lifestyles. The recombining polyvalent phages were suggested to interact with fairly large host species networks, including sometimes genetically very distinct species, such as e.g., Salmonella enterica and/or Escherichia coli versus Staphylococcus aureus or Yersinia pestis. Further studies are needed to understand whether phage-driven intergeneric recombination can lead to undesirable changes of intestinal and other microbiota in humans and animals.
Collapse
Affiliation(s)
- Ekaterine Gabashvili
- School of Natural Sciences and Medicine, Ilia State University, 1 Giorgi Tsereteli exit, 0162, Tbilisi, Georgia
- Division of Risk Assessment, Scientific-Research Center of Agriculture, 6 Marshal Gelovani ave., 0159, Tbilisi, Georgia
| | - Saba Kobakhidze
- Division of Risk Assessment, Scientific-Research Center of Agriculture, 6 Marshal Gelovani ave., 0159, Tbilisi, Georgia
| | - Stylianos Koulouris
- Engagement and Cooperation Unit, European Food Safety Authority, Via Carlo Magno 1A, 43126, Parma, Italy
| | - Tobin Robinson
- Scientific Committee, and Emerging Risks Unit, European Food Safety Authority, Via Carlo Magno 1A, 43126, Parma, Italy
| | - Mamuka Kotetishvili
- Division of Risk Assessment, Scientific-Research Center of Agriculture, 6 Marshal Gelovani ave., 0159, Tbilisi, Georgia.
- Hygiene and Medical Ecology, G. Natadze Scientific-Research Institute of Sanitation, 78 D. Uznadze St., 0102, Tbilisi, Georgia.
| |
Collapse
|
15
|
Duan B, Ding P, Navarre WW, Liu J, Xia B. Xenogeneic Silencing and Bacterial Genome Evolution: Mechanisms for DNA Recognition Imply Multifaceted Roles of Xenogeneic Silencers. Mol Biol Evol 2021; 38:4135-4148. [PMID: 34003286 PMCID: PMC8476142 DOI: 10.1093/molbev/msab136] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 04/08/2021] [Indexed: 12/14/2022] Open
Abstract
Horizontal gene transfer (HGT) is a major driving force for bacterial evolution. To avoid the deleterious effects due to the unregulated expression of newly acquired foreign genes, bacteria have evolved specific proteins named xenogeneic silencers to recognize foreign DNA sequences and suppress their transcription. As there is considerable diversity in genomic base compositions among bacteria, how xenogeneic silencers distinguish self- from nonself DNA in different bacteria remains poorly understood. This review summarizes the progress in studying the DNA binding preferences and the underlying molecular mechanisms of known xenogeneic silencer families, represented by H-NS of Escherichia coli, Lsr2 of Mycobacterium, MvaT of Pseudomonas, and Rok of Bacillus. Comparative analyses of the published data indicate that the differences in DNA recognition mechanisms enable these xenogeneic silencers to have clear characteristics in DNA sequence preferences, which are further correlated with different host genomic features. These correlations provide insights into the mechanisms of how these xenogeneic silencers selectively target foreign DNA in different genomic backgrounds. Furthermore, it is revealed that the genomic AT contents of bacterial species with the same xenogeneic silencer family proteins are distributed in a limited range and are generally lower than those species without any known xenogeneic silencers in the same phylum/class/genus, indicating that xenogeneic silencers have multifaceted roles on bacterial genome evolution. In addition to regulating horizontal gene transfer, xenogeneic silencers also act as a selective force against the GC to AT mutational bias found in bacterial genomes and help the host genomic AT contents maintained at relatively low levels.
Collapse
Affiliation(s)
- Bo Duan
- Beijing Nuclear Magnetic Resonance Center, College of Chemistry and Molecular Engineering, and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Pengfei Ding
- Beijing Nuclear Magnetic Resonance Center, College of Chemistry and Molecular Engineering, and School of Life Sciences, Peking University, Beijing, 100871, China
| | - William Wiley Navarre
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, M5G 1M1, Canada
| | - Jun Liu
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, M5G 1M1, Canada
| | - Bin Xia
- Beijing Nuclear Magnetic Resonance Center, College of Chemistry and Molecular Engineering, and School of Life Sciences, Peking University, Beijing, 100871, China
| |
Collapse
|
16
|
Nguyen DT, Wu B, Xiao S, Hao W. Evolution of a Record-Setting AT-Rich Genome: Indel Mutation, Recombination, and Substitution Bias. Genome Biol Evol 2020; 12:2344-2354. [PMID: 32986811 PMCID: PMC7846184 DOI: 10.1093/gbe/evaa202] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2020] [Indexed: 12/16/2022] Open
Abstract
Genome-wide nucleotide composition varies widely among species. Despite extensive research, the source of genome-wide nucleotide composition diversity remains elusive. Yeast mitochondrial genomes (mitogenomes) are highly A + T rich, and they provide a unique opportunity to study the evolution of AT-biased landscape. In this study, we sequenced ten complete mitogenomes of the Saccharomycodes ludwigii yeast with 8% G + C content, the lowest genome-wide %(G + C) in all published genomes to date. The S. ludwigii mitogenomes have high densities of short tandem repeats but severely underrepresented mononucleotide repeats. Comparative population genomics of these record-setting A + T-rich genomes shows dynamic indel mutations and strong mutation bias toward A/T. Indel mutations play a greater role in genomic variation among very closely related strains than nucleotide substitutions. Indels have resulted in presence–absence polymorphism of tRNAArg (ACG) among S. ludwigii mitogenomes. Interestingly, these mitogenomes have undergone recombination, a genetic process that can increase G + C content by GC-biased gene conversion. Finally, the expected equilibrium G + C content under mutation pressure alone is higher than observed G + C content, suggesting existence of mechanisms other than AT-biased mutation operating to increase A/T. Together, our findings shed new lights on mechanisms driving extremely AT-rich genomes.
Collapse
Affiliation(s)
- Duong T Nguyen
- Department of Biological Sciences, Wayne State University
| | - Baojun Wu
- Department of Biological Sciences, Wayne State University
| | - Shujie Xiao
- Department of Biological Sciences, Wayne State University
| | - Weilong Hao
- Department of Biological Sciences, Wayne State University
| |
Collapse
|
17
|
Bohlin J, Rose B, Brynildsrud O, Birgitte Freiesleben De Blasio. A simple stochastic model describing genomic evolution over time of GC content in microbial symbionts. J Theor Biol 2020; 503:110389. [PMID: 32634385 DOI: 10.1016/j.jtbi.2020.110389] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 04/21/2020] [Accepted: 06/24/2020] [Indexed: 11/29/2022]
Abstract
An organism's genomic base composition is usually summarized by its AT or GC content due to Chargaff's parity laws. Variation in prokaryotic GC content can be substantial between taxa but is generally small within microbial genomes. This variation has been found to correlate with both phylogeny and environmental factors. Since novel single-nucleotide polymorphisms (SNPs) within genomes are at least partially linked to the environment through natural selection, SNP GC content can be considered a compound measure of an organism's environmental influences, lifestyle, phylogeny as well as other more or less random processes. While there are several models describing genomic GC content few, if any, consider AT/GC mutation rates subjected to random perturbations. We present a mathematical model that describes how GC content in microbial genomes evolves over time as a function of the AT → GC and GC → AT mutation rates with Gaussian white noise disturbances. The model, which is suited specifically to non-recombining vertically transmitted prokaryotic symbionts, suggests that small differences in the AT/GC mutation rates can lead to profound differences in outcome due to the ensuing stochastic process. In other words, the model indicates that time to extinction could be a consequence of the mutation rate trajectory on which the symbiont embarked early on in its evolutionary history.
Collapse
Affiliation(s)
- Jon Bohlin
- Division of Infection Control and Environmental Health, Norwegian Institute of Public Health, Oslo, Norway; Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway; Department of Production Animals, Faculty of Veterinary Medicine, Norwegian University of Life Science, Oslo, Norway
| | - Brittany Rose
- Division of Infection Control and Environmental Health, Norwegian Institute of Public Health, Oslo, Norway; Department of Biostatistics, Oslo Centre for Biostatistics and Epidemiology, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
| | - Ola Brynildsrud
- Division of Infection Control and Environmental Health, Norwegian Institute of Public Health, Oslo, Norway; Department of Production Animals, Faculty of Veterinary Medicine, Norwegian University of Life Science, Oslo, Norway
| | - Birgitte Freiesleben De Blasio
- Division of Infection Control and Environmental Health, Norwegian Institute of Public Health, Oslo, Norway; Department of Biostatistics, Oslo Centre for Biostatistics and Epidemiology, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
| |
Collapse
|
18
|
Gupta A, Nair S. Dynamics of Insect-Microbiome Interaction Influence Host and Microbial Symbiont. Front Microbiol 2020; 11:1357. [PMID: 32676060 PMCID: PMC7333248 DOI: 10.3389/fmicb.2020.01357] [Citation(s) in RCA: 85] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 05/27/2020] [Indexed: 12/21/2022] Open
Abstract
Insects share an intimate relationship with their gut microflora and this symbiotic association has developed into an essential evolutionary outcome intended for their survival through extreme environmental conditions. While it has been clearly established that insects, with very few exceptions, associate with several microbes during their life cycle, information regarding several aspects of these associations is yet to be fully unraveled. Acquisition of bacteria by insects marks the onset of microbial symbiosis, which is followed by the adaptation of these bacterial species to the gut environment for prolonged sustenance and successful transmission across generations. Although several insect-microbiome associations have been reported and each with their distinctive features, diversifications and specializations, it is still unclear as to what led to these diversifications. Recent studies have indicated the involvement of various evolutionary processes operating within an insect body that govern the transition of a free-living microbe to an obligate or facultative symbiont and eventually leading to the establishment and diversification of these symbiotic relationships. Data from various studies, summarized in this review, indicate that the symbiotic partners, i.e., the bacteria and the insect undergo several genetic, biochemical and physiological changes that have profound influence on their life cycle and biology. An interesting outcome of the insect-microbe interaction is the compliance of the microbial partner to its eventual genome reduction. Endosymbionts possess a smaller genome as compared to their free-living forms, and thus raising the question what is leading to reductive evolution in the microbial partner. This review attempts to highlight the fate of microbes within an insect body and its implications for both the bacteria and its insect host. While discussion on each specific association would be too voluminous and outside the scope of this review, we present an overview of some recent studies that contribute to a better understanding of the evolutionary trajectory and dynamics of the insect-microbe association and speculate that, in the future, a better understanding of the nature of this interaction could pave the path to a sustainable and environmentally safe way for controlling economically important pests of crop plants.
Collapse
Affiliation(s)
| | - Suresh Nair
- Plant-Insect Interaction Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| |
Collapse
|
19
|
Castillo AI, Chacón-Díaz C, Rodríguez-Murillo N, Coletta-Filho HD, Almeida RPP. Impacts of local population history and ecology on the evolution of a globally dispersed pathogen. BMC Genomics 2020; 21:369. [PMID: 32434538 PMCID: PMC7238557 DOI: 10.1186/s12864-020-06778-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 05/12/2020] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Pathogens with a global distribution face diverse biotic and abiotic conditions across populations. Moreover, the ecological and evolutionary history of each population is unique. Xylella fastidiosa is a xylem-dwelling bacterium infecting multiple plant hosts, often with detrimental effects. As a group, X. fastidiosa is divided into distinct subspecies with allopatric historical distributions and patterns of multiple introductions from numerous source populations. The capacity of X. fastidiosa to successfully colonize and cause disease in naïve plant hosts varies among subspecies, and potentially, among populations. Within Central America (i.e. Costa Rica) two X. fastidiosa subspecies coexist: the native subsp. fastidiosa and the introduced subsp. pauca. Using whole genome sequences, the patterns of gene gain/loss, genomic introgression, and genetic diversity were characterized within Costa Rica and contrasted to other X. fastidiosa populations. RESULTS Within Costa Rica, accessory and core genome analyses showed a highly malleable genome with numerous intra- and inter-subspecific gain/loss events. Likewise, variable levels of inter-subspecific introgression were found within and between both coexisting subspecies; nonetheless, the direction of donor/recipient subspecies to the recombinant segments varied. Some strains appeared to recombine more frequently than others; however, no group of genes or gene functions were overrepresented within recombinant segments. Finally, the patterns of genetic diversity of subsp. fastidiosa in Costa Rica were consistent with those of other native populations (i.e. subsp. pauca in Brazil). CONCLUSIONS Overall, this study shows the importance of characterizing local evolutionary and ecological history in the context of world-wide pathogen distribution.
Collapse
Affiliation(s)
- Andreina I Castillo
- Department of Environmental Science, Policy and Management, University of California, Berkeley, CA, USA
| | - Carlos Chacón-Díaz
- Centro de Investigación en Enfermedades Tropicales, Facultad de Microbiología, Universidad de Costa Rica, San José, Costa Rica
| | - Neysa Rodríguez-Murillo
- Centro de Investigación en Enfermedades Tropicales, Facultad de Microbiología, Universidad de Costa Rica, San José, Costa Rica
| | | | - Rodrigo P P Almeida
- Department of Environmental Science, Policy and Management, University of California, Berkeley, CA, USA.
| |
Collapse
|
20
|
Bohlin J, Rose B, Pettersson JHO. Estimation of AT and GC content distributions of nucleotide substitution rates in bacterial core genomes. BIG DATA ANALYTICS 2019. [DOI: 10.1186/s41044-019-0042-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
21
|
Weissman JL, Fagan WF, Johnson PLF. Linking high GC content to the repair of double strand breaks in prokaryotic genomes. PLoS Genet 2019; 15:e1008493. [PMID: 31703064 PMCID: PMC6867656 DOI: 10.1371/journal.pgen.1008493] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 11/20/2019] [Accepted: 10/25/2019] [Indexed: 01/21/2023] Open
Abstract
Genomic GC content varies widely among microbes for reasons unknown. While mutation bias partially explains this variation, prokaryotes near-universally have a higher GC content than predicted solely by this bias. Debate surrounds the relative importance of the remaining explanations of selection versus biased gene conversion favoring GC alleles. Some environments (e.g. soils) are associated with a high genomic GC content of their inhabitants, which implies that either high GC content is a selective adaptation to particular habitats, or that certain habitats favor increased rates of gene conversion. Here, we report a novel association between the presence of the non-homologous end joining DNA double-strand break repair pathway and GC content; this observation suggests that DNA damage may be a fundamental driver of GC content, leading in part to the many environmental patterns observed to-date. We discuss potential mechanisms accounting for the observed association, and provide preliminary evidence that sites experiencing higher rates of double-strand breaks are under selection for increased GC content relative to the genomic background. The overall nucleotide composition of an organism’s genome varies greatly between species. Previous work has identified certain environmental factors (e.g., oxygen availability) associated with the relative number of GC bases as opposed to AT bases in the genomes of species. Many of these environments that are associated with high GC content are also associated with relatively high rates of DNA damage. We show that organisms possessing the non-homologous end-joining DNA repair pathway, which is one mechanism to repair DNA double-strand breaks, have an elevated GC content relative to expectation. We also show that certain sites on the genome that are particularly susceptible to double strand breaks have an elevated GC content. This leads us to suggest that an important underlying driver of variability in nucleotide composition across environments is the rate of DNA damage (specifically double-strand breaks) to which an organism living in each environment is exposed.
Collapse
Affiliation(s)
- JL Weissman
- Department of Biology, University of Maryland - College Park, College Park, Maryland, United States of America
| | - William F. Fagan
- Department of Biology, University of Maryland - College Park, College Park, Maryland, United States of America
| | - Philip L. F. Johnson
- Department of Biology, University of Maryland - College Park, College Park, Maryland, United States of America
- * E-mail:
| |
Collapse
|
22
|
Timilsina S, Pereira-Martin JA, Minsavage GV, Iruegas-Bocardo F, Abrahamian P, Potnis N, Kolaczkowski B, Vallad GE, Goss EM, Jones JB. Multiple Recombination Events Drive the Current Genetic Structure of Xanthomonas perforans in Florida. Front Microbiol 2019; 10:448. [PMID: 30930868 PMCID: PMC6425879 DOI: 10.3389/fmicb.2019.00448] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Accepted: 02/20/2019] [Indexed: 11/23/2022] Open
Abstract
Prior to the identification of Xanthomonas perforans associated with bacterial spot of tomato in 1991, X. euvesicatoria was the only known species in Florida. Currently, X. perforans is the Xanthomonas sp. associated with tomato in Florida. Changes in pathogenic race and sequence alleles over time signify shifts in the dominant X. perforans genotype in Florida. We previously reported recombination of X. perforans strains with closely related Xanthomonas species as a potential driving factor for X. perforans evolution. However, the extent of recombination across the X. perforans genomes was unknown. We used a core genome multilocus sequence analysis approach to identify conserved genes and evaluated recombination-associated evolution of these genes in X. perforans. A total of 1,356 genes were determined to be "core" genes conserved among the 58 X. perforans genomes used in the study. Our approach identified three genetic groups of X. perforans in Florida based on the principal component analysis (PCA) using core genes. Nucleotide variation in 241 genes defined these groups, that are referred as Phylogenetic-group Defining (PgD) genes. Furthermore, alleles of many of these PgD genes showed 100% sequence identity with X. euvesicatoria, suggesting that variation likely has been introduced by recombination at multiple locations throughout the bacterial chromosome. Site-specific recombinase genes along with plasmid mobilization and phage associated genes were observed at different frequencies in the three phylogenetic groups and were associated with clusters of recombinant genes. Our analysis of core genes revealed the extent, source, and mechanisms of recombination events that shaped the current population and genomic structure of X. perforans in Florida.
Collapse
Affiliation(s)
- Sujan Timilsina
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States
| | | | - Gerald V. Minsavage
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States
| | | | - Peter Abrahamian
- Gulf Coast Research and Education Center, University of Florida, Gainesville, FL, United States
| | - Neha Potnis
- Department of Entomology and Plant Pathology, Auburn University, Auburn, AL, United States
| | - Bryan Kolaczkowski
- Microbiology and Cell Science, University of Florida, Gainesville, FL, United States
| | - Gary E. Vallad
- Gulf Coast Research and Education Center, University of Florida, Gainesville, FL, United States
| | - Erica M. Goss
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States
- Emerging Pathogens Institute, University of Florida, Gainesville, FL, United States
| | - Jeffrey B. Jones
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States
| |
Collapse
|
23
|
Bohlin J, Pettersson JHO. Evolution of Genomic Base Composition: From Single Cell Microbes to Multicellular Animals. Comput Struct Biotechnol J 2019; 17:362-370. [PMID: 30949307 PMCID: PMC6429543 DOI: 10.1016/j.csbj.2019.03.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Revised: 02/28/2019] [Accepted: 03/01/2019] [Indexed: 01/07/2023] Open
Abstract
Whole genome sequencing (WGS) of thousands of microbial genomes has provided considerable insight into evolutionary mechanisms in the microbial world. While substantially fewer eukaryotic genomes are available for analyses the number is rapidly increasing. This mini-review summarizes broadly evolutionary dynamics of base composition in the different domains of life from the perspective of prokaryotes. Common and different evolutionary mechanisms influencing genomic base composition in eukaryotes and prokaryotes are discussed. The conclusion from the data currently available suggests that while there are similarities there are also striking differences in how genomic base composition has evolved within prokaryotes and eukaryotes. For instance, homologous recombination appears to increase GC content locally in eukaryotes due to a non-selective process termed GC-biased gene conversion (gBGC). For prokaryotes on the other hand, increase in genomic GC content seems to be driven by the environment and selection. We find that similar phenomena observed for some organisms in each respective domain may be caused by very different mechanisms: while gBGC and recombination rates appear to explain the negative correlation between GC3 (GC content based on the third codon nucleotides) and genome size in some eukaryotes uptake of AT rich DNA sequences is the main reason for a similar negative correlation observed in prokaryotes. We provide further examples that indicate that base composition in prokaryotes and eukaryotes have evolved under very different constraints.
Collapse
Affiliation(s)
- Jon Bohlin
- Norwegian Institute of Public Health, Division of Infection Control and Environmental Health, Department of Infectious Disease Epidemiology and Modelling, Lovisenberggata 8, 0456 Oslo, Norway.,Centre for Fertility and Health, Norwegian Institute of Public Health, PO-Box 222 Skøyen, N-0213 Oslo, Norway.,Norwegian University of Life Sciences, Faculty of Veterinary Sciences, Production Animal Clinical Sciences, Ullevålsveien 72, 0454 Oslo, Norway
| | - John H-O Pettersson
- Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School the University of Sydney, New South Wales 2006, Australia.,Zoonosis Science Center, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.,Public Health Agency of Sweden, Nobels vg 18, SE-171 82 Solna, Sweden
| |
Collapse
|
24
|
Abstract
Microbial populations exchange genetic material through a process called homologous recombination. Although this process has been studied in particular organisms, we lack an understanding of its differential impact over the genome and across microbes with different life-styles. We used a common analytical framework to assess this process in a representative set of microorganisms. Our results uncovered important trends. First, microbes with different lifestyles are differentially impacted, with endosymbionts and obligate pathogens being those less prone to undergo this process. Second, certain genetic elements such as restriction-modification systems seem to be associated with higher rates of recombination. Most importantly, recombined genomes show the footprints of natural selection in which recombined regions preferentially contain genes that can be related to specific ecological adaptations. Taken together, our results clarify the relative contributions of factors modulating homologous recombination and show evidence for a clear a role of this process in shaping microbial genomes and driving ecological adaptations. Homologous recombination (HR) enables the exchange of genetic material between and within species. Recent studies suggest that this process plays a major role in the microevolution of microbial genomes, contributing to core genome homogenization and to the maintenance of cohesive population structures. However, we still have a very poor understanding of the possible adaptive roles of intraspecific HR and of the factors that determine its differential impact across clades and lifestyles. Here we used a unified methodological framework to assess HR in 338 complete genomes from 54 phylogenetically diverse and representative prokaryotic species, encompassing different lifestyles and a broad phylogenetic distribution. Our results indicate that lifestyle and presence of restriction-modification (RM) machineries are among the main factors shaping HR patterns, with symbionts and intracellular pathogens having the lowest HR levels. Similarly, the size of exchanged genomic fragments correlated with the presence of RM and competence machineries. Finally, genes exchanged by HR showed functional enrichments which could be related to adaptations to different environments and ecological strategies. Taken together, our results clarify the factors underlying HR impact and suggest important adaptive roles of genes exchanged through this mechanism. Our results also revealed that the extent of genetic exchange correlated with lifestyle and some genomic features. Moreover, the genes in exchanged regions were enriched for functions that reflected specific adaptations, supporting identification of HR as one of the main evolutionary mechanisms shaping prokaryotic core genomes.
Collapse
|
25
|
Crispell J, Balaz D, Gordon SV. HomoplasyFinder: a simple tool to identify homoplasies on a phylogeny. Microb Genom 2019; 5:e000245. [PMID: 30663960 PMCID: PMC6412054 DOI: 10.1099/mgen.0.000245] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 11/26/2018] [Indexed: 01/10/2023] Open
Abstract
A homoplasy is a nucleotide identity resulting from a process other than inheritance from a common ancestor. Importantly, by distorting the ancestral relationships between nucleotide sequences, homoplasies can change the structure of the phylogeny. Homoplasies can emerge naturally, especially under high selection pressures and/or high mutation rates, or be created during the generation and processing of sequencing data. Identification of homoplasies is critical, both to understand their influence on the analyses of phylogenetic data and to allow an investigation into how they arose. Here we present HomoplasyFinder, a java application that can be used as a stand-a-lone tool or within the statistical programming environment R. Within R and Java, HomoplasyFinder is shown to be able to automatically, and quickly, identify any homoplasies present in simulated and real phylogenetic data. HomoplasyFinder can easily be incorporated into existing analysis pipelines, either within or outside of R, allowing the user to quickly identify homoplasies to inform downstream analyses and interpretation.
Collapse
Affiliation(s)
- Joseph Crispell
- School of Veterinary Medicine, College of Health and Agricultural Sciences, University College Dublin, Republic of Ireland
| | - Daniel Balaz
- Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, Scotland
| | - Stephen Vincent Gordon
- School of Veterinary Medicine, College of Health and Agricultural Sciences, University College Dublin, Republic of Ireland
| |
Collapse
|
26
|
Bohlin J, Eldholm V, Brynildsrud O, Petterson JHO, Alfsnes K. Modeling of the GC content of the substituted bases in bacterial core genomes. BMC Genomics 2018; 19:589. [PMID: 30081825 PMCID: PMC6080486 DOI: 10.1186/s12864-018-4984-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 07/31/2018] [Indexed: 12/13/2022] Open
Abstract
Background The purpose of the present study was to examine the GC content of substituted bases (sbGC) in the core genomes of 35 bacterial species. Each species, or core genome, constituted genomes from at least 10 strains. We also wanted to explore whether sbGC for each strain was associated with the corresponding species’ core genome GC content (cgGC). We present a simple mathematical model that estimates sbGC from cgGC. The model assumes only that the estimated sbGC is a function of cgGC proportional to fixed AT→GC (α) and GC → AT (β) mutation rates. Non-linear regression was used to estimate parameters α and β from the empirical data described above. Results We found that sbGC for each strain showed a non-linear association with the corresponding cgGC with a bias towards higher GC content for most core genomes (66.3% of the strains), assuming as a null-hypothesis that sbGC should be approximately equal to cgGC. The most GC rich core genomes (i.e. approximately %GC > 60), on the other hand, exhibited slightly less GC-biased sbGC than expected. The best fitted regression model indicates that GC → AT mutation rates β = (1.91 ± 0.13) p < 0.001 are approximately (1.91/0.79) = 2.42 times as high, on average, as AT→GC α = (− 0.79 ± 0.25) p < 0.001 mutation rates. Whether the observed sbGC GC-bias for all but the most GC-rich prokaryotic species is due to selection, compensating for the GC → AT mutation bias, and/or selective neutral processes is currently debated. Residual standard error was found to be σ = 0.076 indicating estimated errors of sbGC to be approximately within ±15.2% GC (95% confidence interval) for the strains of all species in the study. Conclusion Not only did our mathematical model give reasonable estimates of sbGC it also provides further support to previous observations that mutation rates in prokaryotes exhibit a universal GC → AT bias that appears to be remarkably consistent between taxa. Electronic supplementary material The online version of this article (10.1186/s12864-018-4984-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jon Bohlin
- Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway.
| | - Vegard Eldholm
- Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| | - Ola Brynildsrud
- Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| | - John H-O Petterson
- Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| | - Kristian Alfsnes
- Norwegian Institute of Public Health, Lovisenberggata 8, P.O. Box 4404, 0403, Oslo, Norway
| |
Collapse
|
27
|
Rocha EPC. Neutral Theory, Microbial Practice: Challenges in Bacterial Population Genetics. Mol Biol Evol 2018; 35:1338-1347. [DOI: 10.1093/molbev/msy078] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Affiliation(s)
- Eduardo P C Rocha
- Microbial Evolutionary Genomics, Institut Pasteur, Paris, France
- CNRS, UMR3525, Paris, France
| |
Collapse
|
28
|
Alves RJE, Minh BQ, Urich T, von Haeseler A, Schleper C. Unifying the global phylogeny and environmental distribution of ammonia-oxidising archaea based on amoA genes. Nat Commun 2018; 9:1517. [PMID: 29666365 PMCID: PMC5904100 DOI: 10.1038/s41467-018-03861-1] [Citation(s) in RCA: 157] [Impact Index Per Article: 26.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 03/19/2018] [Indexed: 12/30/2022] Open
Abstract
Ammonia-oxidising archaea (AOA) are ubiquitous and abundant in nature and play a major role in nitrogen cycling. AOA have been studied intensively based on the amoA gene (encoding ammonia monooxygenase subunit A), making it the most sequenced functional marker gene. Here, based on extensive phylogenetic and meta-data analyses of 33,378 curated archaeal amoA sequences, we define a highly resolved taxonomy and uncover global environmental patterns that challenge many earlier generalisations. Particularly, we show: (i) the global frequency of AOA is extremely uneven, with few clades dominating AOA diversity in most ecosystems; (ii) characterised AOA do not represent most predominant clades in nature, including soils and oceans; (iii) the functional role of the most prevalent environmental AOA clade remains unclear; and (iv) AOA harbour molecular signatures that possibly reflect phenotypic traits. Our work synthesises information from a decade of research and provides the first integrative framework to study AOA in a global context.
Collapse
Affiliation(s)
- Ricardo J Eloy Alves
- Archaea Biology and Ecogenomics Division, Department of Ecogenomics and Systems Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria
| | - Bui Quang Minh
- Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Campus Vienna Biocenter 5, Dr. Bohr Gasse 9, 1030, Vienna, Austria
- Ecology and Evolution, Research School of Biology, Australian National University, 2601, Canberra, ACT, Australia
| | - Tim Urich
- Institute of Microbiology, Ernst-Moritz-Arndt University, Felix-Hausdorff-Strasse 8, 17487, Greifswald, Germany
| | - Arndt von Haeseler
- Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Campus Vienna Biocenter 5, Dr. Bohr Gasse 9, 1030, Vienna, Austria
| | - Christa Schleper
- Archaea Biology and Ecogenomics Division, Department of Ecogenomics and Systems Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria.
| |
Collapse
|
29
|
Carbon limitation drives GC content evolution of a marine bacterium in an individual-based genome-scale model. ISME JOURNAL 2018; 12:1180-1187. [PMID: 29330536 DOI: 10.1038/s41396-017-0023-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2017] [Revised: 11/14/2017] [Accepted: 11/21/2017] [Indexed: 01/13/2023]
Abstract
An important unanswered question in evolutionary genomics is the source of considerable variation of genomic base composition (GC content) even among organisms that share one habitat. Evolution toward GC-poor genomes has been considered a major adaptive pathway in the oligotrophic ocean, but GC-rich bacteria are also prevalent and highly successful in this environment. We quantify the contribution of multiple factors to the change of genomic GC content of Ruegeria pomeroyi DSS-3, a representative and GC-rich member in the globally abundant Roseobacter clade, using an agent-based model. The model simulates 2 × 108 cells, which allows random genetic drift to act in a realistic manner. Each cell has a whole genome subject to base-substitution mutation and recombination, which affect the carbon and nitrogen requirements of DNA and protein pools. Nonsynonymous changes can be functionally deleterious. Together, these factors affect the growth and fitness. Simulations show that experimentally determined mutation bias toward GC is not sufficient to build the GC-rich genome of DSS-3. While nitrogen availability has been repeatedly hypothesized to drive the evolution of GC content in marine bacterioplankton, our model instead predicts that DSS-3 and its ancestors have been evolving in environments primarily limited by carbon.
Collapse
|
30
|
Tetrad analysis in plants and fungi finds large differences in gene conversion rates but no GC bias. Nat Ecol Evol 2017; 2:164-173. [PMID: 29158556 PMCID: PMC5733138 DOI: 10.1038/s41559-017-0372-7] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Accepted: 10/09/2017] [Indexed: 11/29/2022]
Abstract
GC-favoring gene conversion enables fixation of deleterious alleles, disturbs tests of natural selection and potentially explains both the evolution of recombination as well as the commonly reported intra-genomic correlation between G+C content and recombination rate. In addition, gene conversion disturbs linkage disequilibrium, potentially affecting the ability to detect causative variants. However, the importance and generality of these effects is unresolved, not simply because direct analyses are technically challenging but also because prior within- and between-species discrepant results can be hard to appraise owing to methodological differences. Here we report results of methodologically uniform whole-genome sequencing of all tetrad products in Saccharomyces, Neurospora, Chlamydomonas and Arabidopsis. The proportion of polymorphic markers converted varies over three orders of magnitude between species (from 2% of markers converted in yeast to only ~0.005% in the two plants) with at least 87.5% of the variance in per tetrad conversion rates being between-species. This is largely owing to differences in recombination rate and median tract length. Despite three of the species showing a positive GC-recombination correlation, there is no significant net AT->GC conversion bias in any, despite relatively high resolution in the two taxa (Saccharomyces and Neurospora) with relatively common gene conversion. The absence of a GC bias means: 1) that there should be no presumption that gene conversion is GC biased, nor 2) that a GC-recombination correlation necessarily implies biased gene conversion, 3) that Ka/Ks tests should be unaffected in these species and 4) it is unlikely that gene conversion explains the evolution of recombination.
Collapse
|