1
|
Pouresmaeil M, Azizi-Dargahlou S. Investigation of CaMV-host co-evolution through synonymous codon pattern. J Basic Microbiol 2024; 64:e2300664. [PMID: 38436477 DOI: 10.1002/jobm.202300664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/20/2024] [Accepted: 02/10/2024] [Indexed: 03/05/2024]
Abstract
Cauliflower mosaic virus (CaMV) has a double-stranded DNA genome and is globally distributed. The phylogeny tree of 121 CaMV isolates was categorized into two primary groups, with Iranian isolates showing the greatest genetic variations. Nucleotide A demonstrated the highest percentage (36.95%) in the CaMV genome and the dinucleotide odds ratio analysis revealed that TC dinucleotide (1.34 ≥ 1.23) and CG dinucleotide (0.63 ≤ 0.78) are overrepresented and underrepresented, respectively. Relative synonymous codon usage (RSCU) analysis confirmed codon usage bias in CaMV and its hosts. Brassica oleracea and Brassica rapa, among the susceptible hosts of CaMV, showed a codon adaptation index (CAI) value above 0.8. Additionally, relative codon deoptimization index (RCDI) results exhibited the highest degree of deoptimization in Raphanus sativus. These findings suggest that the genes of CaMV underwent codon adaptation with its hosts. Among the CaMV open reading frames (ORFs), genes that produce reverse transcriptase and virus coat proteins showed the highest CAI value of 0.83. These genes are crucial for the creation of new virion particles. The results confirm that CaMV co-evolved with its host to ensure the optimal expression of its genes in the hosts, allowing for easy infection and effective spread. To detect the force behind codon usage bias, an effective number of codons (ENC)-plot and neutrality plot were conducted. The results indicated that natural selection is the primary factor influencing CaMV codon usage bias.
Collapse
Affiliation(s)
- Mahin Pouresmaeil
- Faculty of Agriculture and Natural Resources, University of Mohaghegh Ardabili, Ardabil, Iran
| | - Shahnam Azizi-Dargahlou
- Agricultural Biotechnology, Seed and Plant Certification and Registration Institute, Ardabil Agricultural and Natural Resources Research Center, Agricultural Research, Education and Extension Organization (AREEO), Karaj, Iran
| |
Collapse
|
2
|
Yang J, Chu Q, Meng G, Kong W. The complete chloroplast genome sequences of three Broussonetia species and comparative analysis within the Moraceae. PeerJ 2022; 10:e14293. [PMID: 36340196 PMCID: PMC9632464 DOI: 10.7717/peerj.14293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 10/03/2022] [Indexed: 01/22/2023] Open
Abstract
Background Species of Broussonetia (family Moraceae) are commonly used to make textiles and high-grade paper. The distribution of Broussonetia papyrifera L. is considered to be related to the spread and location of humans. The complete chloroplast (cp) genomes of B. papyrifera, Broussonetia kazinoki Sieb., and Broussonetia kaempferi Sieb. were analyzed to better understand the status and evolutionary biology of the genus Broussonetia. Methods The cp genomes were assembled and characterized using SOAPdenovo2 and DOGMA. Phylogenetic and molecular dating analysis were performed using the concatenated nucleotide sequences of 35 species in the Moraceae family and were based on 66 protein-coding genes (PCGs). An analysis of the sequence divergence (pi) of each PCG among the 35 cp genomes was conducted using DnaSP v6. Codon usage indices were calculated using the CodonW program. Results All three cp genomes had the typical land plant quadripartite structure, ranging in size from 160,239 bp to 160,841 bp. The ribosomal protein L22 gene (RPL22) was either incomplete or missing in all three Broussonetia species. Phylogenetic analysis revealed two clades. Clade 1 included Morus and Artocarpus, whereas clade 2 included the other seven genera. Malaisia scandens Lour. was clustered within the genus Broussonetia. The differentiation of Broussonetia was estimated to have taken place 26 million years ago. The PCGs' pi values ranged from 0.0005 to 0.0419, indicating small differences within the Moraceae family. The distribution of most of the genes in the effective number of codons plot (ENc-plot) fell on or near the trend line; the slopes of the trend line of neutrality plots were within the range of 0.0363-0.171. These results will facilitate the identification, taxonomy, and utilization of the Broussonetia species and further the evolutionary studies of the Moraceae family.
Collapse
Affiliation(s)
- Jinhong Yang
- Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
| | - Qu Chu
- Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
| | - Gang Meng
- Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
| | - Weiqing Kong
- Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
| |
Collapse
|
3
|
Cao Q, Gao Q, Ma X, Zhang F, Xing R, Chi X, Chen S. Plastome structure, phylogenomics and evolution of plastid genes in Swertia (Gentianaceae) in the Qing-Tibetan Plateau. BMC PLANT BIOLOGY 2022; 22:195. [PMID: 35413790 PMCID: PMC9004202 DOI: 10.1186/s12870-022-03577-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 03/28/2022] [Indexed: 05/08/2023]
Abstract
BACKGROUND The genus Swertia is of great medicinal importance and one of the most taxonomically challenging taxa within Gentianaceae, largely due to the morphological similarities of species within this genus and with its closely related genera. Previous molecular studies confirmed its polyphyly but suffered from low phylogenetic resolutions because only limited sequence loci were used. Thus, we conducted the structural, gene evolutionary, and phylogenetic analyses of 11 newly obtained plastomes of Swertia. Our result greatly improved the phylogenetic resolutions in Swertia, shed new light on the plastome evolution and phylogenetic relationships of this genus. RESULTS The 11 Swertia plastomes together with the published seven species proved highly similar in overall size, structure, gene order, and content, but revealed some structural variations caused by the expansion and contraction of the IRb region into the LSC region, due to the heterogeneous length of the ψycf1. The gene rps16 was found to be in a state flux with pseudogenes or completely lost. Similar situation was also documented in other genera of Gentianaceae. This might imply loss of the gene in the common ancestor of Gentianaceae. The distribution plot of ENC vs. GC3 showed all these plastomes arranging very close in the Wright line with an expected ENC value (49-52%), suggesting the codon usage of Swertia was mainly constrained by a GC mutation bias. Most of the genes remained under the purifying selection, however, the cemA was identified under positive selection, possibly reflecting an adaptive response to low CO2 atmospheric conditions during the Late Miocene. Our phylogenomic analyses, based on 74 protein-coding genes (CDS), supported the polyphyly of Swertia with its close allies in the subtribe Swertiinae, presumably due to recent rapid radiation. The topology inferred from our phylogenetic analyses partly supported the current taxonomic treatment. Finally, several highly variable loci were identified, which can be used in future phylogenetic studies and accurate identification of medicinal genuineness of Swertia. CONCLUSIONS Our study confirmed the polyphyly of Swertia and demonstrated the power of plastome phylogenomics in improvement of phylogenetic resolution, thus contributing to a better understanding of the evolutionary history of Swertia.
Collapse
Affiliation(s)
- Qian Cao
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Qingbo Gao
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China
| | - Xiaolei Ma
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Faqi Zhang
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China
| | - Rui Xing
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China
| | - Xiaofeng Chi
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China
| | - Shilong Chen
- Key Laboratory of Crop Molecular Breeding of Qinghai Province, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810008, China.
| |
Collapse
|
4
|
Tikhomirova TS, Matyunin MA, Lobanov MY, Galzitskaya OV. In-depth analysis of amino acid and nucleotide sequences of Hsp60: how conserved is this protein? Proteins 2021; 90:1119-1141. [PMID: 34964171 DOI: 10.1002/prot.26294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 12/21/2021] [Accepted: 12/23/2021] [Indexed: 11/07/2022]
Abstract
Chaperonin Hsp60, as a protein found in all organisms, is of great interest in medicine, since it is present in many tissues and can be used both as a drug and as an object of targeted therapy. Hence, Hsp60 deserves a fundamental comparative analysis to assess its evolutionary characteristics. It was found that the percent identity of Hsp60 amino acid sequences both within and between phyla was not high enough to identify Hsp60s as highly conserved proteins. However, their ATP binding sites are largely conserved. The amino acid composition of Hsp60s remained relatively constant. At the same time, the analysis of the nucleotide sequences showed that GC content in the Hsp60 genes was comparable to or greater than the genomic values, which may indicate a high resistance to mutations due to tight control of the nucleotide composition by DNA repair systems. Natural selection plays a dominant role in the evolution of Hsp60 genes. The degree of mutational pressure affecting the Hsp60 genes is quite low, and its direction does not depend on taxonomy. Interestingly, for the Hsp60 genes from Chordata, Arthropoda, and Proteobacteria the exact direction of mutational pressure could not be determined. However, upon further division into classes, it was found that the direction of the mutational pressure for Hsp60 genes from Fish differs from that for other chordates. The direction of the mutational pressure affects the synonymous codon usage bias. The number of high and low represented codons increases with increasing GC content, which can improve codon usage. Special server has been created for bioinformatics analysis of Hsp60: http://oka.protres.ru:4202/.
Collapse
Affiliation(s)
- Tatyana S Tikhomirova
- Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Moscow Region, Russia
| | - Maxim A Matyunin
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| | - Michail Yu Lobanov
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| | - Oxana V Galzitskaya
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
- Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| |
Collapse
|
5
|
Saha J, Saha BK, Pal Sarkar M, Roy V, Mandal P, Pal A. Comparative Genomic Analysis of Soil Dwelling Bacteria Utilizing a Combinational Codon Usage and Molecular Phylogenetic Approach Accentuating on Key Housekeeping Genes. Front Microbiol 2019; 10:2896. [PMID: 31921071 PMCID: PMC6928123 DOI: 10.3389/fmicb.2019.02896] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 12/02/2019] [Indexed: 01/02/2023] Open
Abstract
Soil is a diversified and complex ecological niche, home to a myriad of microorganisms particularly bacteria. The physico-chemical complexities of soil results in a plethora of physiological variations to exist within the different types of soil dwelling bacteria, giving rise to a wide variation in genome structure and complexity. This serves as an attractive proposition to analyze and compare the genome of a large number soil bacteria to comprehend their genome complexity and evolution. In this study a combination of codon usage and molecular phylogenetics of the whole genome and key housekeeping genes like infB (translation initiation factor 2), trpB (tryptophan synthase, beta subunit), atpD (ATP synthase, beta subunit), and rpoB (RNA polymerase, beta subunit) of 92 soil bacterial species spread across the entire eubacterial domain and residing in different soil types was performed. The results indicated the direct relationship of genome size with codon bias and coding frequency in the studied bacteria. The codon usage profile demonstrated by the gene trpB was found to be relatively different from the rest of the housekeeping genes with a large number of bacteria having a greater percentage of genes with Nc values less than the Nc of trpB. The results from the overall codon usage bias profile also depicted that the codon usage bias in the key housekeeping genes of soil bacteria was majorly due to selectional pressure and not mutation. The analysis of hydrophobicity of the gene product encoded by the rpoB coding sequences demonstrated tight clustering across all the soil bacteria suggesting conservation of protein structure for maintenance of form and function. The phylogenetic affinities inferred using 16S rRNA gene and the housekeeping genes demonstrated conflicting signals with trpB gene being the noisiest one. The housekeeping gene atpD was found to depict the least amount of evolutionary change in the soil bacteria considered in this study except in two Clostridium species. The phylogenetic and codon usage analysis of the soil bacteria consistently demonstrated the relatedness of Azotobacter chroococcum with different species of the genus Pseudomonas.
Collapse
Affiliation(s)
- Jayanti Saha
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Barnan K. Saha
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Monalisha Pal Sarkar
- Mycology & Plant Pathology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Vivek Roy
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Parimal Mandal
- Mycology & Plant Pathology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Ayon Pal
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| |
Collapse
|
6
|
Pal A, Saha BK, Saha J. Comparative in silico analysis of ftsZ gene from different bacteria reveals the preference for core set of codons in coding sequence structuring and secondary structural elements determination. PLoS One 2019; 14:e0219231. [PMID: 31841523 PMCID: PMC6913975 DOI: 10.1371/journal.pone.0219231] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 11/28/2019] [Indexed: 11/19/2022] Open
Abstract
The deluge of sequence information in the recent times provide us with an excellent opportunity to compare organisms on a large genomic scale. In this study we have tried to decipher the variation in the gene organization and structuring of a vital bacterial gene called ftsZ which codes for an integral component of the bacterial cell division, the FtsZ protein. FtsZ is homologous to tubulin protein and has been found to be ubiquitous in eubacteria. FtsZ is showing increasing promise as a target for antibacterial drug discovery. Our study of ftsZ protein from 143 different bacterial species spanning a wider range of morphological and physiological type demonstrates that the ftsZ gene of about ninety three percent of the organisms show relatively biased codon usage profile and significant GC deviation from their genomic GC content. Comparative codon usage analysis of ftsZ and a core housekeeping gene rpoB demonstrated that codon usage pattern of ftsZ CDS is shaped by natural selection to a large extent and mimics that of a housekeeping gene. We have also detected a tendency among the different organisms to utilize a core set of codons in structuring the ftsZ coding sequence. We observed that the compositional frequency of the amino acid serine in the FtsZ protein appears to be a indicator of the bacterial lifestyle. Our meticulous analysis of the ftsZ gene linked with the corresponding FtsZ protein show that there is a bias towards the use of specific synonymous codons particularly in the helix and strand regions of the multi-domain FtsZ protein. Overall our findings suggest that in an indispensable and vital protein such as FtsZ, there is an inherent tendency to maintain form for optimized performance in spite of the extrinsic variability in coding features.
Collapse
Affiliation(s)
- Ayon Pal
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, India
| | - Barnan Kumar Saha
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, India
| | - Jayanti Saha
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, India
| |
Collapse
|
7
|
Liu SS, Hockenberry AJ, Jewett MC, Amaral LAN. A novel framework for evaluating the performance of codon usage bias metrics. J R Soc Interface 2019; 15:rsif.2017.0667. [PMID: 29386398 DOI: 10.1098/rsif.2017.0667] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Accepted: 01/04/2018] [Indexed: 11/12/2022] Open
Abstract
The unequal utilization of synonymous codons affects numerous cellular processes including translation rates, protein folding and mRNA degradation. In order to understand the biological impact of variable codon usage bias (CUB) between genes and genomes, it is crucial to be able to accurately measure CUB for a given sequence. A large number of metrics have been developed for this purpose, but there is currently no way of systematically testing the accuracy of individual metrics or knowing whether metrics provide consistent results. This lack of standardization can result in false-positive and false-negative findings if underpowered or inaccurate metrics are applied as tools for discovery. Here, we show that the choice of CUB metric impacts both the significance and measured effect sizes in numerous empirical datasets, raising questions about the generality of findings in published research. To bring about standardization, we developed a novel method to create synthetic protein-coding DNA sequences according to different models of codon usage. We use these benchmark sequences to identify the most accurate and robust metrics with regard to sequence length, GC content and amino acid heterogeneity. Finally, we show how our benchmark can aid the development of new metrics by providing feedback on its performance compared to the state of the art.
Collapse
Affiliation(s)
- Sophia S Liu
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| | - Adam J Hockenberry
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.,Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, IL, USA
| | - Michael C Jewett
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA .,Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, IL, USA.,Center for Synthetic Biology, Northwestern University, Evanston, IL, USA.,Simpson Querrey BioNanotechnology Institute, Northwestern University, Evanston, IL, USA.,Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
| | - Luís A N Amaral
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA .,Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA.,Department of Physics and Astronomy, Northwestern University, Evanston, IL, USA
| |
Collapse
|