1
|
Ding H, Gao J, Yang J, Zhang S, Han S, Yi R, Ye Y, Kan X. Genome evolution of Buchnera aphidicola (Gammaproteobacteria): Insights into strand compositional asymmetry, codon usage bias, and phylogenetic implications. Int J Biol Macromol 2023; 253:126738. [PMID: 37690648 DOI: 10.1016/j.ijbiomac.2023.126738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 08/15/2023] [Accepted: 08/25/2023] [Indexed: 09/12/2023]
Abstract
Taxa of Buchnera aphidicola (hereafter "Buchnera") are mutualistic intracellular symbionts of aphids, known for their remarkable biological traits such as genome reduction, strand compositional asymmetry, and symbiont-host coevolution. With the growing availability of genomic data, we performed a comprehensive analysis of 103 genomes of Buchnera strains from 12 host subfamilies, focusing on the genomic characterizations, codon usage patterns, and phylogenetic implications. Our findings revealed consistent features among all genomes, including small genome sizes, low GC contents, and gene losses. We also identified strong strand compositional asymmetries in all strains at the genome level. Further investigation suggested that mutation pressure may have played a crucial role in shaping codon usage of Buchnera. Moreover, the genomic asymmetries were reflected in asymmetric codon usage preferences within chromosomal genes. Notably, the levels of these asymmetries were varied among strains and were significantly influenced by the degrees of genome shrinkages. Lastly, our phylogenetic analyses presented an alternative topology of Aphididae, based on the Buchnera symbionts, providing robust confirmation of the paraphylies of Eriosomatinae, and Macrosiphini. Our objectives are to further understand the strand compositional asymmetry and codon usage bias of Buchnera taxa, and provide new perspectives for phylogenetic studies of Aphididae.
Collapse
Affiliation(s)
- Hengwu Ding
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu 610041, China
| | - Jinming Gao
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Jianke Yang
- School of Basic Medical Sciences, Wannan Medical College, Wuhu 241000, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Ran Yi
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Yuanxin Ye
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China.
| |
Collapse
|
2
|
Koppenhöfer S, Tomasch J, Lang AS. Shared properties of gene transfer agent and core genes revealed by comparative genomics of Alphaproteobacteria. Microb Genom 2022; 8:mgen000890. [PMID: 36350115 PMCID: PMC9836097 DOI: 10.1099/mgen.0.000890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Gene transfer agents (GTAs) are phage-like particles that transfer pieces of cellular genomic DNA to other cells. Homologues of the Rhodobacter capsulatus GTA (RcGTA) structural genes are widely distributed in the Alphaproteobacteria and particularly well conserved in the order Rhodobacterales. Possible reasons for their widespread conservation are still being discussed. It has been suggested that these alphaproteobacterial elements originate from a prophage that was present in an ancestral bacterium and subsequently evolved into a GTA that is now widely maintained in extant descendant lineages. Here, we analysed genomic properties that might relate to the conservation of these alphaproteobacterial GTAs. This revealed that the chromosomal locations of the GTA gene clusters are biased. They primarily occur on the leading strand of DNA replication, at large distances from long repetitive elements, and thus are in regions of lower plasticity, and in areas of extreme GC skew, which also accumulate core genes. These extreme GC skew regions arise from the preferential use of codons with an excess of G over C, a distinct phenomenon from the elevated GC content that has previously been found to be associated with GTA genes. The observed properties, along with their high level of conservation, show that GTA genes share multiple features with core genes in the examined lineages of the Alphaproteobacteria.
Collapse
Affiliation(s)
- Sonja Koppenhöfer
- Department of Biology, Memorial University of Newfoundland, St John’s, Newfoundland and Labrador, Canada
| | - Jürgen Tomasch
- Laboratory of Anoxygenic Phototrophs, Institute of Microbiology of the Czech Academy of Science – Centre Algatech, Třeboň, Czech Republic
| | - Andrew S. Lang
- Department of Biology, Memorial University of Newfoundland, St John’s, Newfoundland and Labrador, Canada,*Correspondence: Andrew S. Lang,
| |
Collapse
|
3
|
Dineen RL, Penno C, Kelleher P, Bourin MJB, O'Connell‐Motherway M, van Sinderen D. Molecular analysis of the replication functions of the bifidobacterial conjugative megaplasmid pMP7017. Microb Biotechnol 2021; 14:1494-1511. [PMID: 33939264 PMCID: PMC8313286 DOI: 10.1111/1751-7915.13810] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 03/22/2021] [Indexed: 11/29/2022] Open
Abstract
pMP7017 is a conjugative megaplasmid isolated from the gut commensal Bifidobacterium breve JCM7017 and was shown to encode two putative replicases, designated here as RepA and RepB. In the current work, RepB was identified as the pMP7017 replicative initiator, as the repB gene, and its surrounding region was shown to be sufficient to allow autonomous replication in two bifidobacterial species, B. breve and Bifidobacterium longum subsp. longum. RepB was shown to bind to repeat sequence downstream of its coding sequence and this region was determined to be essential for efficient replication. Based on our results, we hypothesize that pMP7017 is an iteron-regulated plasmid (IRP) under strict auto-regulatory control. Recombinantly produced and purified RepB was determined to exist as a dimer in solution, differing from replicases of other IRPs, which exist as a mix of dimers and monomers. Furthermore, a stable low-copy Bifidobacterium-E. coli shuttle vector, pRD1.3, was created which can be employed for cloning and expression of large genes, as was demonstrated by the cloning and heterologous expression of the 5.1 kb apuB gene encoding the extracellular amylopullulanase from B. breve UCC2003 into B. longum subsp. longum NCIMB8809.
Collapse
Affiliation(s)
- Rebecca L. Dineen
- APC Microbiome IrelandUniversity College CorkWestern RoadCorkIreland
- School of MicrobiologyUniversity College CorkWestern RoadCorkIreland
| | - Christophe Penno
- CNRS UMR 6553 EcoBioUniversite de Rennes 1Campus de Beaulieu, Bat. 14ARennes cedex35042France
| | - Philip Kelleher
- APC Microbiome IrelandUniversity College CorkWestern RoadCorkIreland
- School of MicrobiologyUniversity College CorkWestern RoadCorkIreland
| | - Maxence J. B. Bourin
- APC Microbiome IrelandUniversity College CorkWestern RoadCorkIreland
- School of MicrobiologyUniversity College CorkWestern RoadCorkIreland
| | | | - Douwe van Sinderen
- APC Microbiome IrelandUniversity College CorkWestern RoadCorkIreland
- School of MicrobiologyUniversity College CorkWestern RoadCorkIreland
| |
Collapse
|
4
|
Quan CL, Gao F. Quantitative analysis and assessment of base composition asymmetry and gene orientation bias in bacterial genomes. FEBS Lett 2019; 593:918-925. [PMID: 30941752 DOI: 10.1002/1873-3468.13374] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 03/28/2019] [Accepted: 03/31/2019] [Indexed: 11/10/2022]
Abstract
Base composition asymmetry and gene orientation bias are two common genomic structures in bacterial genomes. Here, correlation coefficients between nucleotide disparities and coding sequence (CDS) skew have been calculated, which provides insights into their relationship from an individual genome perspective. Consequently, we find GC and RY disparities correlate significantly with CDS skew, since around 60% of the bacterial genomes under study have correlation coefficients > 0.9. Then, we present a model for quantitative assessment of nucleotide disparity and CDS skew in which a numerical index R2 is used for evaluation. We find that skew curves with higher R2 perform better on the prediction of replication origins in bacteria.
Collapse
Affiliation(s)
- Chun-Lan Quan
- Department of Physics, School of Science, Tianjin University, China
| | - Feng Gao
- Department of Physics, School of Science, Tianjin University, China.,Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin University, China.,SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), China
| |
Collapse
|
5
|
Lin JH, Wu ZY, Gong L, Wong CH, Chao WC, Yen CM, Wang CP, Wei CL, Huang YT, Liu PY. Complex Microbiome in Brain Abscess Revealed by Whole-Genome Culture-Independent and Culture-Based Sequencing. J Clin Med 2019; 8:jcm8030351. [PMID: 30871085 PMCID: PMC6462986 DOI: 10.3390/jcm8030351] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 03/04/2019] [Accepted: 03/07/2019] [Indexed: 02/07/2023] Open
Abstract
Brain abscess is a severe infectious disease with high mortality and mobility. Although culture-based techniques have been widely used for the investigation of microbial composition of brain abscess, these approaches are inherent biased. Recent studies using 16S ribosomal sequencing approaches revealed high complexity of the bacterial community involved in brain abscess but fail to detect fungal and viral composition. In the study, both culture-independent nanopore metagenomic sequencing and culture-based whole-genome sequencing using both the Illumina and the Nanopore platforms were conducted to investigate the microbial composition and genomic characterization in brain abscess. Culture-independent metagenomic sequencing revealed not only a larger taxonomic diversity of bacteria but also the presence of fungi and virus communities. The culture-based whole-genome sequencing identified a novel species in Prevotella and reconstructs a Streptococcus constellatus with a high GC-skew genome. Antibiotic-resistance genes CfxA and ErmF associated with resistance to penicillin and clindamycin were also identified in culture-based and culture-free sequencing. This study implies current understanding of brain abscess need to consider the broader diversity of microorganisms.
Collapse
Affiliation(s)
- Jyun-Hong Lin
- Department of Computer Science and Information Engineering, National Chung Cheng University, Chia-Yi 62102, Taiwan.
| | - Zong-Yen Wu
- Department of Veterinary Medicine, National Chung Hsing University, Taichung 40227, Taiwan.
| | - Liang Gong
- Genome Technologies, The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA.
| | - Chee-Hong Wong
- Genome Technologies, The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA.
| | - Wen-Cheng Chao
- Department of Medical Research, Taichung Veterans General Hospital, Taichung 40705, Taiwan.
| | - Chun-Ming Yen
- Program in Translational Medicine, National Chung Hsing University, Taichung 40227, Taiwan.
- Department of Neurosurgery, Neurological Institute, Taichung Veterans General Hospital, Taichung 40705, Taiwan.
| | - Ching-Ping Wang
- Department of Otolaryngology-Head and Neck Surgery, Taichung Veterans General Hospital, Taichung 40705, Taiwan.
| | - Chia-Lin Wei
- Genome Technologies, The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA.
| | - Yao-Ting Huang
- Department of Computer Science and Information Engineering, National Chung Cheng University, Chia-Yi 62102, Taiwan.
| | - Po-Yu Liu
- Program in Translational Medicine, National Chung Hsing University, Taichung 40227, Taiwan.
- Division of Infectious Diseases, Department of Internal Medicine, Taichung Veterans General Hospital, Taichung 40705, Taiwan.
| |
Collapse
|
6
|
Merino N, Zhang S, Tomita M, Suzuki H. Comparative genomics of Bacteria commonly identified in the built environment. BMC Genomics 2019; 20:92. [PMID: 30691394 PMCID: PMC6350394 DOI: 10.1186/s12864-018-5389-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Accepted: 12/18/2018] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND The microbial community of the built environment (BE) can impact the lives of people and has been studied for a variety of indoor, outdoor, underground, and extreme locations. Thus far, these microorganisms have mainly been investigated by culture-based methods or amplicon sequencing. However, both methods have limitations, complicating multi-study comparisons and limiting the knowledge gained regarding in-situ microbial lifestyles. A greater understanding of BE microorganisms can be achieved through basic information derived from the complete genome. Here, we investigate the level of diversity and genomic features (genome size, GC content, replication strand skew, and codon usage bias) from complete genomes of bacteria commonly identified in the BE, providing a first step towards understanding these bacterial lifestyles. RESULTS Here, we selected bacterial genera commonly identified in the BE (or "Common BE genomes") and compared them against other prokaryotic genera ("Other genomes"). The "Common BE genomes" were identified in various climates and in indoor, outdoor, underground, or extreme built environments. The diversity level of the 16S rRNA varied greatly between genera. The genome size, GC content and GC skew strength of the "Common BE genomes" were statistically larger than those of the "Other genomes" but were not practically significant. In contrast, the strength of selected codon usage bias (S value) was statistically higher with a large effect size in the "Common BE genomes" compared to the "Other genomes." CONCLUSION Of the four genomic features tested, the S value could play a more important role in understanding the lifestyles of bacteria living in the BE. This parameter could be indicative of bacterial growth rates, gene expression, and other factors, potentially affected by BE growth conditions (e.g., temperature, humidity, and nutrients). However, further experimental evidence, species-level BE studies, and classification by BE location is needed to define the relationship between genomic features and the lifestyles of BE bacteria more robustly.
Collapse
Affiliation(s)
- Nancy Merino
- Earth-Life Science Institute, Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, 152-8550, Japan.,Department of Earth Sciences, University of Southern California, Stauffer Hall of Science, Los Angeles, CA, 90089, USA
| | - Shu Zhang
- Global Research Center for Environment and Energy based on Nanomaterials Science, National Institute for Material Science, 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan.,Section of Infection and Immunity, Herman Ostrow School of Dentistry of USC, University of Southern California, Los Angeles, CA, 90089-0641, USA
| | - Masaru Tomita
- Faculty of Environment and Information Studies, Keio University, Fujisawa, Kanagawa, 252-0882, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, 997-0035, Japan
| | - Haruo Suzuki
- Faculty of Environment and Information Studies, Keio University, Fujisawa, Kanagawa, 252-0882, Japan. .,Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, 997-0035, Japan.
| |
Collapse
|
7
|
Joesch-Cohen LM, Robinson M, Jabbari N, Lausted CG, Glusman G. Novel metrics for quantifying bacterial genome composition skews. BMC Genomics 2018; 19:528. [PMID: 29996771 PMCID: PMC6042203 DOI: 10.1186/s12864-018-4913-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Accepted: 07/02/2018] [Indexed: 11/17/2022] Open
Abstract
Background Bacterial genomes have characteristic compositional skews, which are differences in nucleotide frequency between the leading and lagging DNA strands across a segment of a genome. It is thought that these strand asymmetries arise as a result of mutational biases and selective constraints, particularly for energy efficiency. Analysis of compositional skews in a diverse set of bacteria provides a comparative context in which mutational and selective environmental constraints can be studied. These analyses typically require finished and well-annotated genomic sequences. Results We present three novel metrics for examining genome composition skews; all three metrics can be computed for unfinished or partially-annotated genomes. The first two metrics, (dot-skew and cross-skew) depend on sequence and gene annotation of a single genome, while the third metric (residual skew) highlights unusual genomes by subtracting a GC content-based model of a library of genome sequences. We applied these metrics to 7738 available bacterial genomes, including partial drafts, and identified outlier species. A phylogenetically diverse set of these outliers (i.e., Borrelia, Ehrlichia, Kinetoplastibacterium, and Phytoplasma) display similar skew patterns but share lifestyle characteristics, such as intracellularity and biosynthetic dependence on their hosts. Conclusions Our novel metrics appear to reflect the effects of biosynthetic constraints and adaptations to life within one or more hosts on genome composition. We provide results for each analyzed genome, software and interactive visualizations at http://db.systemsbiology.net/gestalt/skew_metrics. Electronic supplementary material The online version of this article (10.1186/s12864-018-4913-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lena M Joesch-Cohen
- Institute for Systems Biology, 401 Terry Ave N, Seattle, WA, 98109, USA.,Brown University, Providence, RI, 02912, USA
| | - Max Robinson
- Institute for Systems Biology, 401 Terry Ave N, Seattle, WA, 98109, USA
| | - Neda Jabbari
- Institute for Systems Biology, 401 Terry Ave N, Seattle, WA, 98109, USA
| | | | - Gustavo Glusman
- Institute for Systems Biology, 401 Terry Ave N, Seattle, WA, 98109, USA.
| |
Collapse
|
8
|
Luo H, Quan CL, Peng C, Gao F. Recent development of Ori-Finder system and DoriC database for microbial replication origins. Brief Bioinform 2018; 20:1114-1124. [DOI: 10.1093/bib/bbx174] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Revised: 12/04/2017] [Indexed: 01/28/2023] Open
Abstract
Abstract
DNA replication begins at replication origins in all three domains of life. Identification and characterization of replication origins are important not only in providing insights into the structure and function of the replication origins but also in understanding the regulatory mechanisms of the initiation step in DNA replication. The Z-curve method has been used in the identification of replication origins in archaeal genomes successfully since 2002. Furthermore, the Web servers of Ori-Finder and Ori-Finder 2 have been developed to predict replication origins in both bacterial and archaeal genomes based on the Z-curve method, and the replication origins with manual curation have been collected into an online database, DoriC. Ori-Finder system and DoriC database are currently used in the research field of DNA replication origins in prokaryotes, including: (i) identification of oriC regions in bacterial and archaeal genomes; (ii) discovery and analysis of the conserved sequences within oriC regions; and (iii) strand-biased analysis of bacterial genomes.
Up to now, more and more predicted results by Ori-Finder system were supported by subsequent experiments, and Ori-Finder system has been used to identify the replication origins in > 100 newly sequenced prokaryotes in their genome reports. In addition, the data in DoriC database have been widely used in the large-scale analyses of replication origins and strand bias in prokaryotic genomes. Here, we review the development of Ori-Finder system and DoriC database as well as their applications. Some future directions and aspects for extending the application of Ori-Finder and DoriC are also presented.
Collapse
|
9
|
Khrustalev VV, Khrustaleva TA, Sharma N, Giri R. Mutational Pressure in Zika Virus: Local ADAR-Editing Areas Associated with Pauses in Translation and Replication. Front Cell Infect Microbiol 2017; 7:44. [PMID: 28275585 PMCID: PMC5319961 DOI: 10.3389/fcimb.2017.00044] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 02/07/2017] [Indexed: 12/21/2022] Open
Abstract
Zika virus (ZIKV) spread led to the recent medical health emergency of international concern. Understanding the variations in virus system is of utmost need. Using available complete sequences of ZIKV we estimated directions of mutational pressure along the length of consensus sequences of three lineages of the virus. Results showed that guanine usage is growing in ZIKV RNA plus strand due to adenine to guanine transitions, while adenine usage is growing due to cytosine to adenine transversions. Especially high levels of guanine have been found in two-fold degenerated sites of certain areas of RNA plus strand with high amount of secondary structure. The usage of cytosine in two-fold degenerated sites shows direct dependence on the amount of secondary structure in 52% (consensus sequence of East African ZIKV lineage)—32% (consensus sequence of epidemic strains) of the length of RNA minus strand. These facts are the evidences of ADAR-editing of both strands of ZIKV genome during pauses in replication. RNA plus strand can also be edited by ADAR during pauses in translation caused by the appearance of groups of rare codons. According to our results, RNA minus strand of epidemic ZIKV strain has lower number of points in which polymerase can be stalled (allowing ADAR-editing) compared to other strains. The data on preferable directions of mutational pressure in epidemic ZIKV strain is useful for future vaccine development and understanding the evolution of new strains.
Collapse
Affiliation(s)
| | - Tatyana A Khrustaleva
- Laboratory of Cellular Technologies, Institute of Physiology of the National Academy of Sciences of Belarus Minsk, Belarus
| | - Nitin Sharma
- School of Basic Sciences, Indian Institute of Technology Mandi Mandi, India
| | - Rajanish Giri
- School of Basic Sciences, Indian Institute of Technology Mandi Mandi, India
| |
Collapse
|