1. Bohlin J, Pettersson JHO. Compression rates of microbial genomes are associated with genome size and base composition. Genomics Inform 2024; 22:16. [PMID: 39390533 PMCID: PMC11468749 DOI: 10.1186/s44342-024-00018-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/10/2024] [Accepted: 09/10/2024] [Indexed: 10/12/2024]
Abstract
BACKGROUND To what degree a string of symbols can be compressed reveals important details about its complexity. For instance, strings that are not compressible are random and carry a low information potential while the opposite is true for highly compressible strings. We explore to what extent microbial genomes are amenable to compression as they vary considerably both with respect to size and base composition. For instance, microbial genome sizes vary from less than 100,000 base pairs in symbionts to more than 10 million in soil-dwellers. Genomic base composition, often summarized as genomic AT or GC content due to the similar frequencies of adenine and thymine on one hand and cytosine and guanine on the other, also varies substantially; the most extreme microbes can have genomes with AT content below 25% or above 85%. Base composition determines the frequency of DNA words, consisting of multiple nucleotides or oligonucleotides, and may therefore also influence compressibility. Using 4,713 RefSeq genomes, we examined the association between compressibility, using both a DNA-based (MBGC) and a general-purpose (ZPAQ) compression algorithm, and genome size, AT content, and genomic oligonucleotide usage variance (OUV) using generalized additive models. RESULTS We find that genome size (p < 0.001) and OUV (p < 0.001) are both strongly associated with genome redundancy for both types of file compressor. The DNA-based MBGC compressor improved compression by approximately 3% on average relative to ZPAQ. Moreover, MBGC detected a significant (p < 0.001) compression ratio difference between AT-poor and AT-rich genomes which was not detected with ZPAQ. CONCLUSION As lack of compressibility is equivalent to randomness, our findings suggest that smaller and AT-rich genomes may have accumulated more random mutations on average than larger and AT-poor genomes which, in turn, were significantly more redundant.
Moreover, we find that OUV is a strong proxy for genome compressibility in microbial genomes. The ZPAQ compressor was found to agree with the MBGC compressor, albeit with a poorer performance, except for the compressibility of AT-rich and AT-poor/GC-rich genomes.
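As a rough illustration of the quantities compared in the abstract above, the following minimal Python sketch computes AT content and a compression ratio for toy DNA strings. It uses the standard-library `zlib` compressor purely as a stand-in; it is not MBGC or ZPAQ, and the function names and toy sequences are assumptions introduced here, not taken from the study.

```python
import random
import zlib


def at_content(seq: str) -> float:
    """Fraction of A and T bases in a DNA string."""
    seq = seq.upper()
    return (seq.count("A") + seq.count("T")) / len(seq)


def compression_ratio(seq: str) -> float:
    """Compressed size divided by original size (lower means more redundant).

    zlib is an assumed stand-in for the compressors named in the abstract.
    """
    raw = seq.encode("ascii")
    return len(zlib.compress(raw, 9)) / len(raw)


# Toy sequences for illustration only; these are not real genomes.
rng = random.Random(0)
random_seq = "".join(rng.choice("ACGT") for _ in range(20_000))
repetitive_seq = "ATGCGTTA" * 2_500

for name, seq in [("random", random_seq), ("repetitive", repetitive_seq)]:
    print(name, round(at_content(seq), 3), round(compression_ratio(seq), 3))
```

The repetitive string compresses far better than the random one, which mirrors the abstract's point that lack of compressibility corresponds to randomness. Real genomes would need a FASTA parser and a DNA-aware compressor to approach the reported figures.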
Affiliation(s)
- Jon Bohlin
  - Norwegian Institute of Public Health, Domain for Infection Control, Section for Modeling and Bioinformatics, Oslo, Norway
- John H-O Pettersson
  - Zoonosis Science Center, Clinical Microbiology, Department of Medical Sciences, University of Uppsala, 751 85, Uppsala, Sweden
  - Clinical Microbiology and Hospital Hygiene, Uppsala University Hospital, 751 85, Uppsala, Sweden
  - Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
2. Teng W, Liao B, Chen M, Shu W. Genomic Legacies of Ancient Adaptation Illuminate GC-Content Evolution in Bacteria. Microbiol Spectr 2023; 11:e0214522. [PMID: 36511682 PMCID: PMC9927291 DOI: 10.1128/spectrum.02145-22] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Indexed: 12/15/2022]
Abstract
Bacterial evolution is characterized by strong purifying selection as well as rapid adaptive evolution in changing environments. In this context, the genomic GC content (genomic GC) varies greatly but presents some level of phylogenetic stability, making it challenging to explain based on current hypotheses. To illuminate the evolutionary mechanisms of the genomic GC, we analyzed the base composition and functional inventory of 11,083 representative genomes. A phylogenetically constrained bimodal distribution of the genomic GC, which mainly originated from parallel divergences in early evolution, was demonstrated. Such variation of the genomic GC can be well explained by DNA replication and repair (DRR), in which multiple pathways correlate with the genomic GC. Furthermore, the biased conservation of various stress-related genes, especially the DRR-related ones, implies distinct adaptive processes in the ancestral lineages of high- or low-GC clades which were likely induced by major environmental changes. Our findings support the view that the mutational biases resulting from these legacies of ancient adaptation have changed the course of adaptive evolution and generated great variation in the genomic GC. This highlights the importance of indirect effects of natural selection, which indicates a new model for bacterial evolution. IMPORTANCE GC content has been shown to be an important factor in microbial ecology and evolution, and the genomic GC of bacteria can be characterized by great intergenomic heterogeneity, high intragenomic homogeneity, and strong phylogenetic inertia, as well as being associated with the environment. Current hypotheses concerning direct selection or mutational biases cannot adequately explain these features simultaneously.
Our findings show that ancient adaptations have transformed the DRR system and that the resulting mutational biases further contributed to a bimodal distribution of the genomic GC, which offers a more reasonable scenario for the mechanism. This implies that, when thinking about the evolution of life, diverse processes of adaptation exist, and combined effects of natural selection should be considered.
Affiliation(s)
- Wenkai Teng
  - School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
- Bin Liao
  - School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
- Mengyun Chen
  - School of Life Sciences, South China Normal University, Guangzhou, Guangdong, China
- Wensheng Shu
  - School of Life Sciences, South China Normal University, Guangzhou, Guangdong, China
3. Wei TS, Gao ZM, Gong L, Li QM, Zhou YL, Chen HG, He LS, Wang Y. Genome-centric view of the microbiome in a new deep-sea glass sponge species Bathydorus sp. Front Microbiol 2023; 14:1078171. [PMID: 36846759 PMCID: PMC9944714 DOI: 10.3389/fmicb.2023.1078171] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 10/24/2022] [Accepted: 01/12/2023] [Indexed: 02/10/2023]
Abstract
Sponges are widely distributed in the global ocean and harbor diverse symbiotic microbes with mutualistic relationships. However, sponge symbionts in the deep sea remain poorly studied at the genome level. Here, we report a new glass sponge species of the genus Bathydorus and provide a genome-centric view of its microbiome. We obtained 14 high-quality prokaryotic metagenome-assembled genomes (MAGs) affiliated with the phyla Nitrososphaerota, Pseudomonadota, Nitrospirota, Bdellovibrionota, SAR324, Bacteroidota, and Patescibacteria. In total, 13 of these MAGs probably represent new species, suggesting the high novelty of the deep-sea glass sponge microbiome. An ammonia-oxidizing Nitrososphaerota MAG B01, which accounted for up to 70% of the metagenome reads, dominated the sponge microbiomes. The B01 genome had a highly complex CRISPR array, which likely represents an advantageous evolution toward a symbiotic lifestyle and a strong ability to defend against phages. A sulfur-oxidizing Gammaproteobacteria species was the second most dominant symbiont, and a nitrite-oxidizing Nitrospirota species could also be detected, but with lower relative abundance. Bdellovibrio species represented by two MAGs, B11 and B12, are reported for the first time as potential predatory symbionts in deep-sea glass sponges and have undergone dramatic genome reduction. Comprehensive functional analysis indicated that most of the sponge symbionts encoded CRISPR-Cas systems and eukaryotic-like proteins for symbiotic interactions with the host. Metabolic reconstruction further illustrated their essential roles in carbon, nitrogen, and sulfur cycles. In addition, diverse putative phages were identified from the sponge metagenomes. Our study expands the knowledge of microbial diversity, evolutionary adaptation, and metabolic complementarity in deep-sea glass sponges.
Affiliation(s)
- Tao-Shu Wei
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
  - University of Chinese Academy of Sciences, Beijing, China
- Zhao-Ming Gao
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
  - Correspondence: Zhao-Ming Gao
- Lin Gong
  - Institute of Oceanology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qing-Mei Li
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
- Ying-Li Zhou
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
- Hua-Guan Chen
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
  - University of Chinese Academy of Sciences, Beijing, China
- Li-Sheng He
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
- Yong Wang
  - Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
  - Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
  - Correspondence: Yong Wang
4. Bohlin J. A simple stochastic model describing the evolution of genomic GC content in asexually reproducing organisms. Sci Rep 2022; 12:18569. [PMID: 36329129 PMCID: PMC9631610 DOI: 10.1038/s41598-022-21709-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/01/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022]
Abstract
A genome's nucleotide composition can usually be summarized with (G)uanine + (C)ytosine (GC) or (A)denine + (T)hymine (AT) frequencies as GC% = 100% - AT%. Genomic AT/GC content has been linked to environment and selective processes in asexually reproducing organisms. A model is presented relating the evolution of genomic GC content over time to AT → GC and GC → AT mutation rates. By employing Itô calculus it is shown that if mutation rates are subject to random perturbations that can vary over time, several implications follow. In particular, an extra Brownian motion term appears influencing genomic nucleotide variability; the greater the random perturbations, the greater the genomic nucleotide variability. This can have several interpretations depending on the context. For instance, reducing the influence of the random perturbations on the AT/GC mutation rates and thus genomic nucleotide variability, to limit fitness-decreasing and deleterious mutations, will likely suggest channeling of resources. On the other hand, increased genomic nucleotide diversity may be beneficial in variable environments. In asexually reproducing organisms, the Brownian motion term can be considered to be inversely reflective of the selective pressures an organism is subjected to at the molecular level. The presented model is a generalization of a previous model, limited to microbial symbionts, to all asexually reproducing, non-recombining organisms. Last, a connection between the presented model and the classical Luria-Delbrück mutation model is presented in an Itô calculus setting.
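To make the idea of randomly perturbed mutation rates more concrete, here is a minimal Python sketch of an Euler-Maruyama simulation of a GC fraction. Everything in it is an assumption made for illustration: the abstract does not give the paper's equations, so the drift term (a generic two-rate balance between AT → GC and GC → AT mutation), the noise term `sigma * gc * (1 - gc)`, and all parameter names and values are hypothetical choices, not the published model.

```python
import math
import random


def simulate_gc(gc0, at_to_gc, gc_to_at, sigma, steps, dt, seed=0):
    """Euler-Maruyama path of a GC fraction under an ASSUMED two-rate model.

    Drift: at_to_gc * (1 - gc) - gc_to_at * gc   (generic mutation balance)
    Noise: sigma * gc * (1 - gc) * dW            (hypothetical noise form)
    """
    rng = random.Random(seed)
    gc = gc0
    path = [gc]
    for _ in range(steps):
        drift = at_to_gc * (1.0 - gc) - gc_to_at * gc
        d_w = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment over dt
        gc = gc + drift * dt + sigma * gc * (1.0 - gc) * d_w
        gc = min(1.0, max(0.0, gc))  # keep the fraction inside [0, 1]
        path.append(gc)
    return path


# Without noise the path relaxes to at_to_gc / (at_to_gc + gc_to_at).
quiet = simulate_gc(0.8, 0.02, 0.03, sigma=0.0, steps=5000, dt=0.1)
# With noise the GC fraction keeps fluctuating around that value.
noisy = simulate_gc(0.4, 0.02, 0.03, sigma=0.05, steps=5000, dt=0.1)

print(round(quiet[-1], 4), round(min(noisy), 4), round(max(noisy), 4))
```

Setting `sigma` to zero recovers a deterministic relaxation toward the balance point of the two rates, while a larger `sigma` widens the spread of the simulated GC fraction, in line with the abstract's statement that greater random perturbations produce more nucleotide variability.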
Affiliation(s)
- Jon Bohlin
  - Division of Infection Control, Department of Methods Development and Analysis, Norwegian Institute of Public Health, Oslo, Norway (GRID grid.418193.6; ISNI 0000 0001 1541 4204)
  - Centre for Fertility and Health, Norwegian Institute of Public Health, P.O. Box 4404, Lovisenberggata 8, 0403 Oslo, Norway (GRID grid.418193.6; ISNI 0000 0001 1541 4204)
5. Smith G, Manzano-Marín A, Reyes-Prieto M, Antunes CSR, Ashworth V, Goselle ON, Jan AAA, Moya A, Latorre A, Perotti MA, Braig HR. Human follicular mites: Ectoparasites becoming symbionts. Mol Biol Evol 2022; 39:msac125. [PMID: 35724423 PMCID: PMC9218549 DOI: 10.1093/molbev/msac125] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 10/11/2021] [Revised: 05/23/2022] [Accepted: 05/31/2022] [Indexed: 12/13/2022]
Abstract
Most humans carry mites in the hair follicles of their skin for their entire lives. Follicular mites are the only metazoans that continuously live on humans. We propose that Demodex folliculorum (Acari) represents a transitional stage from a host-injuring obligate parasite to an obligate symbiont. Here, we describe the profound impact of this transition on the genome and physiology of the mite. Genome sequencing revealed that the permanent host association of D. folliculorum led to an extensive genome reduction through relaxed selection and genetic drift, resulting in the smallest number of protein-coding genes yet identified among panarthropods. Confocal microscopy revealed that this gene loss coincided with an extreme reduction in the number of cells. Single uninucleate muscle cells are sufficient to operate each of the three segments that form each walking leg. While it has been assumed that the reduction of the cell number in parasites starts early in development, we identified a greater total number of cells in the last developmental stage (nymph) than in the terminal adult stage, suggesting that reduction starts at the adult or ultimate stage of development. This is the first evolutionary step in an arthropod species adopting a reductive, parasitic or endosymbiotic lifestyle. Somatic nuclei show underreplication at the diploid stage. Novel eye structures or photoreceptors as well as a unique human host melatonin-guided day/night rhythm are proposed for the first time. The loss of DNA repair genes coupled with extreme endogamy might have set this mite species on an evolutionary dead-end trajectory.
Affiliation(s)
- Gilbert Smith
  - School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
- Alejandro Manzano-Marín
  - Centre for Microbiology and Environmental Systems Science (CMESS), University of Vienna, Vienna, Austria
- Mariana Reyes-Prieto
  - Institute of Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), València, Spain
  - Foundation for the Promotion of Health and Biomedical Research of the Valencian Community (FISABIO), València, Spain
- Victoria Ashworth
  - School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
- Obed Nanjul Goselle
  - School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
- Andrés Moya
  - Institute of Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), València, Spain
  - Foundation for the Promotion of Health and Biomedical Research of the Valencian Community (FISABIO), València, Spain
  - Center for Networked Biomedical Research in Epidemiology and Public Health (CIBEResp), Madrid, Spain
- Amparo Latorre
  - Institute of Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), València, Spain
  - Foundation for the Promotion of Health and Biomedical Research of the Valencian Community (FISABIO), València, Spain
  - Center for Networked Biomedical Research in Epidemiology and Public Health (CIBEResp), Madrid, Spain
- M Alejandra Perotti
  - School of Biological Sciences, University of Reading, Reading, United Kingdom
- Henk R Braig
  - School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
  - Institute and Museum of Natural Sciences, National University of San Juan, San Juan, Argentina