1
|
Sharma M, Zhang H, Ehrenkaufer G, Singh U. Stress Response in Entamoeba histolytica Is Associated with Robust Processing of tRNA to tRNA Halves. mBio 2023; 14:e0345022. [PMID: 36809068 PMCID: PMC10127584 DOI: 10.1128/mbio.03450-22] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 01/18/2023] [Indexed: 02/23/2023] Open
Abstract
tRNA-derived fragments have been reported in many different organisms and have diverse cellular roles, such as regulating gene expression, inhibiting protein translation, silencing transposable elements, and modulating cell proliferation. In particular, tRNA halves, a class of tRNA fragments produced by the cleavage of tRNAs in the anti-codon loop, have been widely reported to accumulate under stress and regulate translation in cells. Here, we report the presence of tRNA-derived fragments in Entamoeba, with tRNA halves being the most abundant. We further established that tRNA halves accumulate in the parasites upon different stress stimuli such as oxidative stress, heat shock, and serum starvation. We also observed differential expression of tRNA halves during developmental changes of trophozoite-to-cyst conversion, with various tRNA halves accumulating during early encystation. In contrast to other systems, the stress response does not appear to be mediated by a few specific tRNA halves, as multiple tRNAs appear to be processed during the various stresses. Furthermore, we identified some tRNA-derived fragments associated with Entamoeba Argonaute proteins, EhAgo2-2 and EhAgo2-3, which have a preference for different tRNA-derived fragment species. Finally, we show that tRNA halves are packaged inside extracellular vesicles secreted by amoebas. The ubiquitous presence of tRNA-derived fragments, their association with the Argonaute proteins, and the accumulation of tRNA halves during multiple different stresses, including encystation, suggest a nuanced level of gene expression regulation mediated by different tRNA-derived fragments in Entamoeba. IMPORTANCE In the present study, we report for the first time the presence of tRNA-derived fragments in Entamoeba. tRNA-derived fragments were identified by bioinformatics analyses of small-RNA sequencing data sets from the parasites and also confirmed experimentally. We found that tRNA halves accumulated in parasites exposed to environmental stress or during the developmental process of encystation. We also found that shorter tRNA-derived fragments are bound to Entamoeba Argonaute proteins, indicating that they may have a potential role in the Argonaute-mediated RNA-interference pathway, which mediates robust gene silencing in Entamoeba. We noticed that in response to heat shock, the protein translation levels were elevated in the parasites. This effect was reversed in the presence of an analog of leucine, which also reduced the levels of the tRNA halves in the stressed cells. Our results suggest that tRNA-derived fragments in Entamoeba have a possible role in regulating gene expression during environmental stress.
Collapse
Affiliation(s)
- Manu Sharma
- Division of Infectious Diseases, Stanford University School of Medicine, Stanford, California, USA
| | - Hanbang Zhang
- Division of Infectious Diseases, Stanford University School of Medicine, Stanford, California, USA
| | - Gretchen Ehrenkaufer
- Division of Infectious Diseases, Stanford University School of Medicine, Stanford, California, USA
| | - Upinder Singh
- Division of Infectious Diseases, Stanford University School of Medicine, Stanford, California, USA
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
2
|
Rahman SU, Rehman HU, Rahman IU, Khan MA, Rahim F, Ali H, Chen D, Ma W. Evolution of codon usage in Taenia saginata genomes and its impact on the host. Front Vet Sci 2023; 9:1021440. [PMID: 36713873 PMCID: PMC9875090 DOI: 10.3389/fvets.2022.1021440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 10/03/2022] [Indexed: 01/13/2023] Open
Abstract
The beef tapeworm, also known as Taenia saginata, is a zoonotic tapeworm from the genus Taenia in the order Cyclophyllidea. Taenia saginata is a food-borne zoonotic parasite with a worldwide distribution. It poses serious health risks to the host and has a considerable negative socioeconomic impact. Previous studies have explained the population structure of T. saginata within the evolutionary time scale and adaptive evolution. However, it is still unknown how synonymous codons are used by T. saginata. In this study, we used 90 T. saginata strains, applying the codon usage bias (CUB). Both base content and relative synonymous codon usage (RSCU) analysis revealed that AT-ended codons were more frequently used in the genome of T. saginata. Further low CUB was observed from the effective number of codons (ENC) value. The neutrality plot analysis suggested that the dominant factor of natural selection was involved in the structuring of CUB in T. saginata. Further analysis showed that T. saginata has adapted host-specific codon usage patterns to sustain successful replication and transmission chains within hosts (Bos taurus and Homo sapiens). Generally, both natural selection and mutational pressure have an impact on the codon usage patterns of the protein-coding genes in T. saginata. This study is important because it characterized the codon usage pattern in the T. saginata genomes and provided the necessary data for a basic evolutionary study on them.
Collapse
Affiliation(s)
- Siddiq Ur Rahman
- Department of Computer Science and Bioinformatics, Khushal Khan Khattak University, Karak, Pakistan
| | - Hassan Ur Rehman
- Department of Computer Science and Bioinformatics, Khushal Khan Khattak University, Karak, Pakistan
| | - Inayat Ur Rahman
- Department of Botany, Khushal Khan Khattak University, Karak, Pakistan
| | - Muazzam Ali Khan
- Department of Botany, Bacha Khan University, Charsadda, KP, Pakistan
| | - Fazli Rahim
- Department of Botany, Bacha Khan University, Charsadda, KP, Pakistan
| | - Hamid Ali
- Department of Biotechnology and Genetic Engineering, Hazara University, Mansehra, Pakistan
| | - Dekun Chen
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi, China
| | - Wentao Ma
- Veterinary Immunology Laboratory, College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi, China,*Correspondence: Wentao Ma ✉
| |
Collapse
|
3
|
Analysis of Codon Usage Patterns in Giardia duodenalis Based on Transcriptome Data from GiardiaDB. Genes (Basel) 2021; 12:genes12081169. [PMID: 34440343 PMCID: PMC8393687 DOI: 10.3390/genes12081169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 07/24/2021] [Accepted: 07/27/2021] [Indexed: 12/03/2022] Open
Abstract
Giardia duodenalis, a flagellated parasitic protozoan, the most common cause of parasite-induced diarrheal diseases worldwide. Codon usage bias (CUB) is an important evolutionary character in most species. However, G. duodenalis CUB remains unclear. Thus, this study analyzes codon usage patterns to assess the restriction factors and obtain useful information in shaping G. duodenalis CUB. The neutrality analysis result indicates that G. duodenalis has a wide GC3 distribution, which significantly correlates with GC12. ENC-plot result—suggesting that most genes were close to the expected curve with only a few strayed away points. This indicates that mutational pressure and natural selection played an important role in the development of CUB. The Parity Rule 2 plot (PR2) result demonstrates that the usage of GC and AT was out of proportion. Interestingly, we identified 26 optimal codons in the G. duodenalis genome, ending with G or C. In addition, GC content, gene expression, and protein size also influence G. duodenalis CUB formation. This study systematically analyzes G. duodenalis codon usage pattern and clarifies the mechanisms of G. duodenalis CUB. These results will be very useful to identify new genes, molecular genetic manipulation, and study of G. duodenalis evolution.
Collapse
|
4
|
Biswas R, Panja AS, Bandopadhyay R. In Silico Analyses of Burial Codon Bias Among the Species of Dipterocarpaceae Through Molecular and Phylogenetic Data. Evol Bioinform Online 2019; 15:1176934319834888. [PMID: 31223230 PMCID: PMC6563522 DOI: 10.1177/1176934319834888] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Accepted: 02/07/2019] [Indexed: 11/15/2022] Open
Abstract
Introduction: DNA barcode, a molecular marker, is used to distinguish among the closely
related species, and it can be applied across a broad range of taxa to
understand ecology and evolution. MaturaseK gene (matK) and
rubisco bisphosphate carboxylase/oxygenase form I gene
(rbcL) of the chloroplast are highly conserved in a
plant system, which are used as core barcode. This present endeavor entails
the comprehensive examination of the under threat plant species based on
success of discrimination on DNA barcode under selection pressure. Result: The family Dipterocarpaceae comprising of 15 genera is under threat due to
some factors, namely, deforestation, habitat alteration, poor seed, pollen
dispersal, etc. Species of this family was grouped into 6 clusters for
matK and 5 clusters and 2 sub-clusters for
rbcL in the phylogenetic tree by using neighbor-joining
method. Cluster I to cluster VI of matK and cluster I to
cluster V of rbcL genes were analyzed by various codon and
substitution bias tools. Mutational pressure guided the codon bias which was
favored by the avoidance of higher GC content and significant negative
correlation between GC12 and GC3 (in sub-cluster I of cluster I
[0.03 < P], cluster I
[0.00001 < P], and cluster II
[0.01 < P] of rbcL, and cluster IV
[0.013 < P] of matK). After
refining the results, it could be speculated that the lower null expectation
values (R = 0.5 or <0.5) were less divergent from the
evolutionary perspective. Apart from that, the higher null expectation
values (R = >0.85) also showed the same result, which
possibly could be due to the negative impact of very high and low transition
rate than transversion. Conclusion: Through the analysis of inter-generic, inter/intra-specific variation and
phylogenetic data, it was found that both selection and mutation played an
important role in synonymous codon choice in these genes, but they acted
inconsistently on the genes, both matK and
rbcL. In vitro stable proteins of both
matK and rbcL were selected through
natural selection rather than mutational selection. matK
gene had higher individual discrimination and barcode success compared with
rbcL. These discriminatory approaches may describe the
problem related to the extinction of plant species. Hence, it becomes very
imperative to identify and detect the under threat plant species in
advance.
Collapse
Affiliation(s)
- Raju Biswas
- UGC-Center of Advanced Study, Department of Botany, The University of Burdwan, Bardhaman, India
| | - Anindya Sundar Panja
- Department of Biotechnology, Oriental Institute of Science and Technology, Vidyasagar University, Midnapore, India
| | - Rajib Bandopadhyay
- UGC-Center of Advanced Study, Department of Botany, The University of Burdwan, Bardhaman, India
| |
Collapse
|
5
|
Delgado-Galván CJ, Padilla-Vaca F, Montiel FBR, Rangel-Serrano Á, Paramo-Pérez I, Anaya-Velázquez F, Franco B. Red fluorescent protein (DsRFP) optimization for Entamoeba histolytica expression. Exp Parasitol 2018; 187:86-92. [PMID: 29476758 DOI: 10.1016/j.exppara.2018.01.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Revised: 01/05/2018] [Accepted: 01/30/2018] [Indexed: 12/01/2022]
Abstract
Entamoeba histolytica genetic organization and genome structure is complex and under intense research. The genome is fully sequenced, and several tools have been developed for the molecular study of this organism. Nevertheless, good protein tracking tags that are easy to measure and image, like the fluorescent proteins are lacking. In this report, we codon-optimized the red fluorescent protein from the coral Discosoma striata (DsRFP) for its use in E. histolytica and demonstrated functionality in vivo. We envision that this protein can be widely used for the development of transcriptional reporter systems and protein-tagging applications.
Collapse
Affiliation(s)
- Cindy Jazmín Delgado-Galván
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico
| | - Felipe Padilla-Vaca
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico
| | - Fátima Berenice Ramírez Montiel
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico
| | - Ángeles Rangel-Serrano
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico
| | - Itzel Paramo-Pérez
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico
| | - Fernando Anaya-Velázquez
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico
| | - Bernardo Franco
- Departamento de Biología, División de Ciencias Naturales y Exactas, Universidad de Guanajuato, Noria Alta s/n, Guanajuato, Gto. 36050, Mexico.
| |
Collapse
|
6
|
Lin D, Li L, Xie T, Yin Q, Saksena N, Wu R, Li W, Dai G, Ma J, Zhou X, Chen X. Codon usage variation of Zika virus: The potential roles of NS2B and NS4A in its global pandemic. Virus Res 2018; 247:71-83. [PMID: 29428601 DOI: 10.1016/j.virusres.2018.01.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Revised: 01/26/2018] [Accepted: 01/29/2018] [Indexed: 12/18/2022]
Abstract
A comprehensive demonstration of Zika virus (ZIKV) molecular evolution is essential for understanding its adaptation and expansion in its recent pandemics. Despite several studies on mutations and codon usage in ZIKVs, the variations in codon usage patterns across individual genes and their biological implication remains unclear. Here, we performed a gene-by-gene comparison of the codon usage variation in ZIKVs of the African and Asian lineages. We found that besides the evidence of positive selection (Ka/Ks >1) in the Asian lineage of the ZIKV genome, codon usage patterns were gene-specific and codon usage variation of ZIKV genes, was possibly constrained by their individual functional features, such as transmembrane domains, or antigenicity. In particular, the NS2B and NS4A genes showed distinct codon usage patterns, clearly separating them from the clusters of other genes in the correspondence analysis (CA). In the Asian lineage, the NS2B and NS4A genes showed the highest codon usage bias (ENC values: 51.01 ± 0.72 and 48.89 ± 0.99 respectively), and were subjected to the highest translation selection (ENCobs/ENCexp ratio: 0.847 ± 0.0297 and 0.828 ± 0.0233 respectively) in comparison to the African lineages of ZIKV. The CpG frequency of the NS2B showed a gradual ascending trend in the Asian ZIKV lineages, while in NS4A it was constrained along with the expansion of the Asian lineage. Furthermore, between the African and Asian lineages, differentiated and specific over-represented codons were more prominent in the NS2B and NS4A. Together, our study implies that ZIKVs are in the process of evolutionary fine tuning their codon as seen in the recent pandemics, and NS2B and NS4A could have played a potential role in the molecular evolution of the Asian lineage and their establishment.
Collapse
Affiliation(s)
- Dechun Lin
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, Guangdong, China; BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China.
| | - Liqiang Li
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China.
| | - Tian Xie
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
| | - Qingqing Yin
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
| | - Nitin Saksena
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China.
| | - Rangke Wu
- The School of Foreign Studies, Southern Medical University, Guangzhou 510515, Guangdong, China.
| | - Wanyu Li
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
| | - Geyang Dai
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
| | - Jinmin Ma
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China; Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Xiaohong Zhou
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
| | - Xiaoguang Chen
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
| |
Collapse
|
7
|
Genome-wide analysis of codon usage bias patterns in an enterotoxigenic Escherichia coli F18 strain. Genes Genomics 2017. [DOI: 10.1007/s13258-017-0519-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
8
|
Factors affecting the codon usage bias of SRY gene across mammals. Gene 2017; 630:13-20. [PMID: 28827114 DOI: 10.1016/j.gene.2017.08.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 07/20/2017] [Accepted: 08/03/2017] [Indexed: 11/24/2022]
Abstract
Codon usage bias (CUB) is extensively found in a wide variety genomes and it is mostly affected by mutation pressure and natural selection. Analysis of CUB helps in studying the evolutionary features of a genome. The SRY gene plays an important role in male reproductive organ and a good candidate to study the evolutionary forces, since little work was reported earlier on this gene. We used bioinformatic methods to analyze the protein-coding sequences of SRY gene in 172 different mammalian species to understand the patterns of codon usage and the evolutionary forces acting on it. We found that the codon bias of SRY gene varies widely across mammals. Relative synonymous codon usage (RSCU) value revealed that the codons such as TCG, CCG, CAT, ATT, ACT, GCT, GTT, GCG, GGG and GGT were over-represented. Correspondence analysis indicated that the distribution of codons was more close to the axes indicating that compositional constraints might correlate to codon bias. Z-score analysis on RSCU values of codons identified a set of 11 codons viz. TCT, TTT, CTA, CTC, TAT, CAG, CGT, ATA, ACC, AAT and GTA which differed significantly at p<0.01 between 5% high and low gene expression datasets. Further, it was evident from the neutrality plot that GC12 was influenced by both mutation pressure and natural selection. From the study we concluded that natural selection played a dominant role, but mutational pressure played a minor role in the codon usage pattern of SRY gene across mammals.
Collapse
|
9
|
Analysis of Codon Usage Patterns in Herbaceous Peony (Paeonia lactiflora Pall.) Based on Transcriptome Data. Genes (Basel) 2015; 6:1125-39. [PMID: 26506393 PMCID: PMC4690031 DOI: 10.3390/genes6041125] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Revised: 10/10/2015] [Accepted: 10/13/2015] [Indexed: 01/27/2023] Open
Abstract
Codon usage bias, which exists in many genomes, is mainly determined by mutation and selection. To elucidate the genetic features and evolutionary history of herbaceous peony (Paeonia lactiflora), a well-known symbol of prosperity in China, we examined synonymous codon usage in 24,216 reconstructed genes from the P. lactiflora transcriptome. The mean GC content was 44.4%, indicating that the nucleotide content of P. lactiflora genes is slightly AT rich and GC poor. The P. lactiflora genome has a wide range of GC3 (GC content at the third synonymous codon position) distribution, with a significant correlation between GC12 and GC3. ENC (effective number of codons) analysis suggested that mutational bias played a major role in shaping codon usage. Parity Rule 2 (PR2) analysis revealed that GC and AU were not used proportionally. We identified 22 “optimal codons”, most ending with an A or U. Our results suggested that nucleotide composition mutation bias and translational selection were the main driving factors of codon usage bias in P. lactiflora. These results lay the foundation for exploring the evolutionary mechanisms and heterologous expression of functionally-important proteins in P. lactiflora.
Collapse
|
10
|
Analysis of codon usage pattern evolution in avian rotaviruses and their preferred host. INFECTION GENETICS AND EVOLUTION 2015; 34:17-25. [PMID: 26086995 DOI: 10.1016/j.meegid.2015.06.018] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2015] [Revised: 05/26/2015] [Accepted: 06/14/2015] [Indexed: 10/23/2022]
Abstract
Rotavirus infection is a worldwide problem, with occurrence of highly divergent viruses classified in 8 species (A-H). We report here the evolution assessment of codon usage patterns in virus-host system in avian rotavirus (AvRV) of species RVA, RVD, RVF and RVG (preferentially affecting birds). The nucleotide contents, codon usage bias (CUB), relative synonymous codon usage (RSCU), and effective number of codons (ENCs) values were investigated targeting overexpressing major inner capsid viral protein (VP6) of these AvRV species. The results confirm that the evolutionary characteristics influences the rotavirus (RV) genetic diversity and impact of host's natural selection on the AvRVs codons. Synonymous codon usage patterns were evaluated following multivariate statistical procedures on all available AvRV coding gene sequences. RSCU trees accommodated all AvRV species and preferred host sequences in one topology confirming greater imminence of AvRVs with the host chicken cell genes. Similarly, the codon adaptation index (CAI) results also displayed a higher adaptation of AvRVs to its chicken host. The codon preference analysis of RVs revealed that VP6 gene express more proficiently in the yeast system, whereas, codon optimization might be required for the effectual expression in Escherichia coli and Homo sapiens. The findings provide basic evidence on the dynamics of AvRV evolution and its host adaptation, which could be exploited for additional research on avian species in future.
Collapse
|
11
|
Yang X, Luo X, Cai X. Analysis of codon usage pattern in Taenia saginata based on a transcriptome dataset. Parasit Vectors 2014; 7:527. [PMID: 25440955 PMCID: PMC4268816 DOI: 10.1186/s13071-014-0527-1] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 11/06/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon usage bias is an important evolutionary feature in a genome and has been widely documented in many genomes. Analysis of codon usage bias has significance for mRNA translation, design of transgenes, new gene discovery, and studies of molecular biology and evolution, etc. However, the information about synonymous codon usage pattern of T. saginata genome remains unclear. T. saginata is a food-borne zoonotic cestode which infects approximataely 50 million humans worldwide, and causes significant health problems to the host and considerable socio-economic losses as a consequence. In this study, synonymous codon usage in T. saginata were examined. METHODS Total RNA was isolated from T. saginata cysticerci and 91,487 unigenes were generated using Illumina sequencing technology. After filtering, the final sequence collection containing 11,399 CDSs was used for our analysis. RESULTS Neutrality analysis showed that the T. saginata had a wide GC3 distribution and a significant correlation was observed between GC12 and GC3. NC-plot showed most of genes on or close to the expected curve, but only a few points with low-ENC values were below it, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified twenty-three optimal codons in the T. saginata genome, all of which were ended with a G or C residue. These results suggest that mutational and selection forces are probably driving factors of codon usage bias in T. saginata genome. Meanwhile, other factors such as protein length, gene expression, GC content of genes, the hydropathicity of each protein also influence codon usage. CONCLUSIONS Here, we systematically analyzed the codon usage pattern and identified factors shaping in codon usage bias in T. saginata. Currently, no complete nuclear genome is available for codon usage analysis at the genome level in T. saginata. This is the first report to investigate codon biology in T. sagninata. Such information does not only bring about a new perspective for understanding the mechanisms of biased usage of synonymous codons but also provide useful clues for molecular genetic engineering and evolutionary studies.
Collapse
Affiliation(s)
- Xing Yang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China. .,College of Veterinary Medicine, Jilin University, Changchun, 130000, PR China.
| | - Xuenong Luo
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China.
| | - Xuepeng Cai
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China. .,College of Veterinary Medicine, Jilin University, Changchun, 130000, PR China.
| |
Collapse
|
12
|
Sateriale A, Roy NH, Huston CD. SNAP-tag technology optimized for use in Entamoeba histolytica. PLoS One 2013; 8:e83997. [PMID: 24391864 PMCID: PMC3877135 DOI: 10.1371/journal.pone.0083997] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Accepted: 11/08/2013] [Indexed: 01/11/2023] Open
Abstract
Entamoeba histolytica is a protozoan parasite responsible for invasive intestinal and extraintestinal amebiasis. The pathology of amebiasis is still poorly understood, which can be largely attributed to lack of molecular tools. Here we present the optimization of SNAP-tag technology via codon optimization specific for E. histolytica. The resultant SNAP protein is highly expressed in amebic trophozoites, and shows proper localization when tagged with an endoplasmic reticulum retention signal. We further demonstrate the capabilities of this system using super resolution microscopy, done for the first time in E. histolytica.
Collapse
Affiliation(s)
- Adam Sateriale
- University of Vermont Cellular, Molecular, and Biomedical Sciences Program, Burlington, Vermont, United States of America
- University of Vermont Department of Medicine, Burlington, Vermont, United States of America
| | - Nathan H. Roy
- University of Vermont Cellular, Molecular, and Biomedical Sciences Program, Burlington, Vermont, United States of America
- University of Vermont Microbiology and Molecular Genetics, Burlington, Vermont, United States of America
| | - Christopher D. Huston
- University of Vermont Cellular, Molecular, and Biomedical Sciences Program, Burlington, Vermont, United States of America
- University of Vermont Microbiology and Molecular Genetics, Burlington, Vermont, United States of America
- University of Vermont Department of Medicine, Burlington, Vermont, United States of America
- * E-mail:
| |
Collapse
|
13
|
Evolution of tryptophan biosynthetic pathway in microbial genomes: a comparative genetic study. SYSTEMS AND SYNTHETIC BIOLOGY 2013; 8:59-72. [PMID: 24592292 DOI: 10.1007/s11693-013-9127-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2013] [Revised: 10/05/2013] [Accepted: 10/08/2013] [Indexed: 10/26/2022]
Abstract
Biosynthetic pathway evolution needs to consider the evolution of a group of genes that code for enzymes catalysing the multiple chemical reaction steps leading to the final end product. Tryptophan biosynthetic pathway has five chemical reaction steps that are highly conserved in diverse microbial genomes, though the genes of the pathway enzymes show considerable variations in arrangements, operon structure (gene fusion and splitting) and regulation. We use a combined bioinformatic and statistical analyses approach to address the question if the pathway genes from different microbial genomes, belonging to a wide range of groups, show similar evolutionary relationships within and between them. Our analyses involved detailed study of gene organization (fusion/splitting events), base composition, relative synonymous codon usage pattern of the genes, gene expressivity, amino acid usage, etc. to assess inter- and intra-genic variations, between and within the pathway genes, in diverse group of microorganisms. We describe these genetic and genomic variations in the tryptophan pathway genes in different microorganisms to show the similarities across organisms, and compare the same genes across different organisms to find the possible variability arising possibly due to horizontal gene transfers. Such studies form the basis for moving from single gene evolution to pathway evolutionary studies that are important steps towards understanding the systems biology of intracellular pathways.
Collapse
|
14
|
Roy A, Bhattacharya S, Bothra AK, Sen A. A database for Mycobacterium secretome analysis: 'MycoSec' to accelerate global health research. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2013; 17:502-9. [PMID: 23952586 DOI: 10.1089/omi.2013.0015] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Abstract Members of the genus Mycobacterium are notorious for their pathogenesis. Investigations from various perspectives have identified the pathogenic strategies employed by these lethal pathogens. Secretomes are believed to play crucial roles in host cell recognition and cross-talks, in cellular attachment, and in triggering other functions related to host pathogen interactions. However, a proper idea of the mycobacterial secretomes and their mechanism of functionality still remains elusive. In the present study, we have developed a comprehensive database of potential mycobacterial secretomes (MycoSec) using pre-existing algorithms for secretome prediction for researchers interested in this particular field. The database provides a platform for retrieval and analysis of identified secretomes in all finished genomes of the family Mycobacteriaceae. The database contains valuable information regarding secretory signal peptides (Sec type), lipoprotein signal peptides (Lipo type), and Twin arginine (RR/KR) signal peptides (TAT type), prevalent in mycobacteria. Information pertaining to COG analysis, codon usage, and gene expression of the predicted secretomes has also been incorporated in the database. MycoSec promises to be a useful repertoire providing a plethora of information regarding mycobacterial secretomes and may well be a platform to speed global health research. MycoSec is freely accessible at http://www.bicnbu.in/mycosec .
Collapse
Affiliation(s)
- Ayan Roy
- 1 Bioinformatics Facility, Department of Botany, University of North Bengal , Siliguri, India
| | | | | | | |
Collapse
|
15
|
Reply to "codon usage frequency of RNA virus genomes from high-temperature acidic-environment metagenomes". J Virol 2013; 87:1920-1. [PMID: 23308028 DOI: 10.1128/jvi.02883-12] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
16
|
Yang J, Li S, Liu YX. Systematic analysis of diabetes- and glucose metabolism-related proteins and its application to Alzheimer’s disease. ACTA ACUST UNITED AC 2013. [DOI: 10.4236/jbise.2013.66078] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
17
|
Abstract
The analysis on codon usage bias of OmpA/MotB gene of Riemerella anatipestifer (RA) may provide a basis for understanding the evolution and pathogenesis of RA and for selecting appropriate host expression systems to improve the expression of target genes in vivo and in vitro. In our study, a comparative analysis of the codon usage bias in the newly discovered RA OmpA/MotB gene and the OmpA/MotB gene of 20 reference flavobacteriaceae was performed. The results of the codon adaptation indes (CAI), effective number of codon (ENC), and GC3s values indicated that synonymous codon usage bias in the OmpA/MotB gene of flavobacteriaceae. The results showed that codon usage bias in the RA OmpA/MotB gene was strong bias towards the synonymous codons with A and T at the third codon position. A high level of diversity in codon usage bias existed, and the effective number of codons used in a gene plot revealed that the G+C compositional constraint is the main factor that determines the codon usage bias in OmpA/MotB gene of flavobacteriaceae. Comparison of the codon usage in the OmpA/MotB gene of different organisms revealed that there were 31 codons showing distinct codon usage differences between the RA and E. coli, 41 between the RA and humans, but 29 between the RA and yeast. Therefore the yeast expression system may be more suitable for the expression of RA OmpA/MotB gene. These results may improve our understanding of the evolution, pathogenesis and functional studies of RA, as well as contribute significantly to the area of flavobacteriaceae research.
Collapse
|
18
|
Hu JS, Wang QQ, Zhang J, Chen HT, Xu ZW, Zhu L, Ding YZ, Ma LN, Xu K, Gu YX, Liu YS. The characteristic of codon usage pattern and its evolution of hepatitis C virus. INFECTION GENETICS AND EVOLUTION 2011; 11:2098-102. [DOI: 10.1016/j.meegid.2011.08.025] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Revised: 08/22/2011] [Accepted: 08/24/2011] [Indexed: 01/29/2023]
|
19
|
Xu C, Cai X, Chen Q, Zhou H, Cai Y, Ben A. Factors affecting synonymous codon usage bias in chloroplast genome of oncidium gower ramsey. Evol Bioinform Online 2011; 7:271-8. [PMID: 22253533 PMCID: PMC3255522 DOI: 10.4137/ebo.s8092] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Oncidium Gower Ramsey is a fascinating and important ornamental flower in floral industry. In this research, the complete nucleotide sequence of the chloroplast genome in Oncidium Gower Ramsey was studied, then analyzed using Codonw software. Correspondence analysis and method of effective number of codon as Nc-plot were conducted to analyze synonymous codon usage. According to the corresponding analysis, codon bias in the chloroplast genome of Oncidium Gower Ramsey is related to their gene length, mutation bias, gene hydropathy level of each protein, gene function and selection or gene expression only subtly affect codon usage. This study will provide insights into the molecular evolution study and high-level transgene expression.
Collapse
Affiliation(s)
- Chen Xu
- School of Biochemical and Environmental Engineering, Nanjing Xiaozhuang University, Nanjing 211171, Jiangsu, China
| | | | | | | | | | | |
Collapse
|
20
|
Anti-inflammatory defense mechanisms of Entamoeba histolytica. Inflamm Res 2010; 60:111-7. [PMID: 20938711 DOI: 10.1007/s00011-010-0261-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2010] [Revised: 09/02/2010] [Accepted: 09/22/2010] [Indexed: 01/23/2023] Open
Abstract
The monocyte locomotion inhibitory factor (MLIF), a heat-stable oligopeptide found in the supernatant fluid of Entamoeba histolytica axenic cultures, may contribute to the delayed inflammation observed in amoebic hepatic abscess. This factor was isolated by ultra-filtration and high powered liquid chromatography, obtaining a primary Met-Gln-Cys-Asn-Ser structure, identified afterwards as the carboxyl-terminal (…Cys-Asn-Ser) active site. The selective anti-inflammatory effects of the pentapeptide have been observed in both in vitro and in vivo models, using a synthetic pentapeptide to maintain the same anti-inflammatory conditions during the experimental assays. Anti-inflammatory effects observed include inhibition of human monocyte locomotion and the respiratory burst in monocytes and neutrophils, increasing expression of anti-inflammatory cytokines and inhibiting expression of the adhesion molecules VLA-4 and VCAM, among others. In this review, we will describe the effects of MLIF detected so far and how it might be used as a therapeutical agent against inflammatory diseases.
Collapse
|
21
|
Zhou JH, Zhang J, Chen HT, Ma LN, Liu YS. Analysis of synonymous codon usage in foot-and-mouth disease virus. Vet Res Commun 2010; 34:393-404. [PMID: 20425142 DOI: 10.1007/s11259-010-9359-4] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/29/2010] [Indexed: 10/19/2022]
Abstract
In this study, we calculate the relative synonymous codon usage (RSCU) values and codon usage bias (CUB) values to carry out a comparative analysis of codon usage pattern for open reading frames (ORFs) among 85 samples which belong to all seven serotypes of foot-and-mouth disease virus (FMDV). Although the degree of CUB for ORFs is a relatively slight, there is a significant variation for CUB among different serotypes, which is mainly determined by codon usage pattern depending on RSCU. By comparison with RSCU values for all samples, although RSCU values fail to show the relationship of specific-lineage serotype, there are two main genetic populations existing in FMDV, namely (i) serotypes Asia 1, A, C & O; (ii) serotypes SAT 1, 2 & 3. This interesting characteristic may be formed by the mechanism of RNA virus recombination. The analysis of quantitative & qualitative evaluation based on CUB indicates interesting characteristic of codon usage, which suggests that more FMDV genome diversity may exist in specific-lineage serotypes rather than exist randomly. Furthermore, the relationship between amino acids and codon usage pattern indicates that mutation pressure rather than translational selection in nature is the important determinant of the codon usage bias observed. Our work might give some sight into some characteristics of FMDV ORF and some evolutionary information of this virus.
Collapse
Affiliation(s)
- Jian-Hua Zhou
- Key Laboratory of Animal Virology of Ministry of Agriculture, State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, China.
| | | | | | | | | |
Collapse
|
22
|
Codon Usage Patterns in Corynebacterium glutamicum: Mutational Bias, Natural Selection and Amino Acid Conservation. Comp Funct Genomics 2010; 2010:343569. [PMID: 20445740 PMCID: PMC2860111 DOI: 10.1155/2010/343569] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Revised: 01/29/2010] [Accepted: 02/04/2010] [Indexed: 11/17/2022] Open
Abstract
The alternative synonymous codons in Corynebacterium glutamicum, a well-known bacterium used in industry for the production of amino acid, have been investigated by multivariate analysis. As C. glutamicum is a GC-rich organism, G and C are expected to predominate at the third position of codons. Indeed, overall codon usage analyses have indicated that C and/or G ending codons are predominant in this organism. Through multivariate statistical analysis, apart from mutational selection, we identified three other trends of codon usage variation among the genes. Firstly, the majority of highly expressed genes are scattered towards the positive end of the first axis, whereas the majority of lowly expressed genes are clustered towards the other end of the first axis. Furthermore, the distinct difference in the two sets of genes was that the C ending codons are predominate in putatively highly expressed genes, suggesting that the C ending codons are translationally optimal in this organism. Secondly, the majority of the putatively highly expressed genes have a tendency to locate on the leading strand, which indicates that replicational and transciptional selection might be invoked. Thirdly, highly expressed genes are more conserved than lowly expressed genes by synonymous and nonsynonymous substitutions among orthologous genes fromthe genomes of C. glutamicum and C. diphtheriae. We also analyzed other factors such as the length of genes and hydrophobicity that might influence codon usage and found their contributions to be weak.
Collapse
|
23
|
Gilchrist MA, Shah P, Zaretzki R. Measuring and detecting molecular adaptation in codon usage against nonsense errors during protein translation. Genetics 2009; 183:1493-505. [PMID: 19822731 PMCID: PMC2787434 DOI: 10.1534/genetics.109.108209] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 09/26/2009] [Indexed: 11/18/2022] Open
Abstract
Codon usage bias (CUB) has been documented across a wide range of taxa and is the subject of numerous studies. While most explanations of CUB invoke some type of natural selection, most measures of CUB adaptation are heuristically defined. In contrast, we present a novel and mechanistic method for defining and contextualizing CUB adaptation to reduce the cost of nonsense errors during protein translation. Using a model of protein translation, we develop a general approach for measuring the protein production cost in the face of nonsense errors of a given allele as well as the mean and variance of these costs across its coding synonyms. We then use these results to define the nonsense error adaptation index (NAI) of the allele or a contiguous subset thereof. Conceptually, the NAI value of an allele is a relative measure of its elevation on a specific and well-defined adaptive landscape. To illustrate its utility, we calculate NAI values for the entire coding sequence and across a set of nonoverlapping windows for each gene in the Saccharomyces cerevisiae S288c genome. Our results provide clear evidence of adaptation to reduce the cost of nonsense errors and increasing adaptation with codon position and expression. The magnitude and nature of this adaptation are also largely consistent with simulation results in which nonsense errors are the only selective force driving CUB evolution. Because NAI is derived from mechanistic models, it is both easier to interpret and more amenable to future refinement than other commonly used measures of codon bias. Further, our approach can also be used as a starting point for developing other mechanistically derived measures of adaptation such as for translational accuracy.
Collapse
Affiliation(s)
- Michael A Gilchrist
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, Tennessee 37996-1610, USA.
| | | | | |
Collapse
|
24
|
Chen X, Shi Z. Sequence Analysis of the Full-length cDNA and Protein Structure Homology Modeling of FABP2 from Paralichthys Olivaceus. Bioinform Biol Insights 2009; 3:29-35. [PMID: 20140063 PMCID: PMC2808173 DOI: 10.4137/bbi.s2287] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Using zebrafish intestinal fatty acid-binding protein 2 (FABP2) mRNA sequence as the initial query probe, four highly homologous Paralichthys olivaceus EST sequences were retrieved from Genbank database. The assembled full-length cDNA contains the open reading frame of P. olivaceus FABP2 gene, which was validated by subsequent RT-PCR cloning. In the coding region, the average GC content is 56%, but it would reach 76.8% if only counting for the third base of the codons. The deduced P. olivaceus FABP2 polypeptide contains 132 amino acids (aa), with a predicted molecular size of 15.3 kD and pI at 6.74. This protein multiple-alignment has shown that this peptide is 75.7% identical to the corresponding homologous protein in Danio rerio. Among the 7 aa that are essential for FABP2 function, 3 were found to be conserved among P. olivaceus, Danio rerio, Tetraodon nigroviridi, Rattus norvegicus, and Homo sapiens. The study provides essential information on molecular evolution and function of FABP family.
Collapse
Affiliation(s)
- Xiaowu Chen
- College of Fisheries and Life Science, Shanghai Ocean University, Shanghai, China
| | | |
Collapse
|
25
|
Jiang P, Sun X, Lu Z. Analysis of synonymous codon usage in Aeropyrum pernix K1 and other Crenarchaeota microorganisms. J Genet Genomics 2009; 34:275-84. [PMID: 17498625 PMCID: PMC7129909 DOI: 10.1016/s1673-8527(07)60029-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2006] [Accepted: 08/22/2006] [Indexed: 11/18/2022]
Abstract
In this study, a comparative analysis of the codon usage bias was performed in Aeropyrum pernix K1 and two other phylogenetically related Crenarchaeota microorganisms (i.e., Pyrobaculum aerophilum str. IM2 and Sulfolobus acidocaldarius DSM 639). The results indicated that the synonymous codon usage in A. pernix K1 was less biased, which was highly correlated with the GC(3S) value. The codon usage patterns were phylogenetically conserved among these Crenarchaeota microorganisms. Comparatively, it is the species function rather than the gene function that determines their gene codon usage patterns. A. pernix K1, P. aerophilum str. IM2, and S. acidocaldarius DSM 639 live in differently extreme conditions. It is presumed that the living environment played an important role in determining the codon usage pattern of these microorganisms. Besides, there was no strain-specific codon usage among these microorganisms. The extent of codon bias in A. pernix K1 and S. acidocaldarius DSM 639 were highly correlated with the gene expression level, but no such association was detected in P. aerophilum str. IM2 genomes.
Collapse
Affiliation(s)
- Peng Jiang
- State Key Laboratory of Bioelectronics, Department of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
| | | | | |
Collapse
|
26
|
Ma J, Nguyen MN, Rajapakse JC. Gene classification using codon usage and support vector machines. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2009; 6:134-143. [PMID: 19179707 DOI: 10.1109/tcbb.2007.70240] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
A novel approach for gene classification, which adopts codon usage bias as input feature vector for classification by support vector machines (SVM) is proposed. The DNA sequence is first converted to a 59-dimensional feature vector where each element corresponds to the relative synonymous usage frequency of a codon. As the input to the classifier is independent of sequence length and variance, our approach is useful when the sequences to be classified are of different lengths, a condition that homology-based methods tend to fail. The method is demonstrated by using 1,841 Human Leukocyte Antigen (HLA) sequences which are classified into two major classes: HLA-I and HLA-II; each major class is further subdivided into sub-groups of HLA-I and HLA-II molecules. Using codon usage frequencies, binary SVM achieved accuracy rate of 99.3% for HLA major class classification and multi-class SVM achieved accuracy rates of 99.73% and 98.38% for sub-class classification of HLA-I and HLA-II molecules, respectively. The results show that gene classification based on codon usage bias is consistent with the molecular structures and biological functions of HLA molecules.
Collapse
Affiliation(s)
- Jianmin Ma
- BioInformatics Research Center, NanyangTechnological University, Singapore 637553.
| | | | | |
Collapse
|
27
|
Analysis of synonymous codon usage in the UL24 gene of duck enteritis virus. Virus Genes 2008; 38:96-103. [PMID: 18958612 DOI: 10.1007/s11262-008-0295-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2008] [Accepted: 10/09/2008] [Indexed: 10/21/2022]
Abstract
The analysis on codon usage bias of UL24 gene of duck enteritis virus (DEV) may improve our understanding of the evolution and pathogenesis of DEV and provide a basis for understanding the relevant mechanism for biased usage of synonymous codons and for selecting appropriate expression systems to improve the expression of target genes. The codon usage bias of UL24 genes of DEV and 27 reference herpesviruses were analyzed. The results showed that codon of UL24 gene of DEV was strong bias toward the synonymous codons with A and T at the third codon position. A high level of diversity in codon usage bias existed, and the effective number of codons used in a gene plot revealed that the genetic heterogeneity in UL24 gene of herpesviruses was constrained by the G + C content. The phylogentic analysis suggested that DEV was evolutionarily closer to Alphaherpesvirinae and that there was no significant deviation in codon usage in different virus strains. There were 20 codons showing distinct usage differences between DEV and Escherichia coli, 23 between DEV and Homo sapiens, but only 16 codons between DEV and yeast. Therefore the yeast expression system may be more suitable for the expression of DEV genes.
Collapse
|
28
|
Kang JF, Li XL, Zhou RY, Li LH, Feng FJ, Guo XL. Bioinformatics analysis of lactoferrin gene for several species. Biochem Genet 2008; 46:312-22. [PMID: 18228129 DOI: 10.1007/s10528-008-9147-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2007] [Accepted: 10/12/2007] [Indexed: 11/30/2022]
Abstract
Much attention has been focused on the study of lactoferrin due to its function in antibacterial, antiviral, antifungal, anti-inflammatory, anti-oxidant, and immunomodulatory activities. A total of 60 lactoferrin (LF) gene sequences with the complete coding regions (CDS) and corresponding amino acids belonging to 11 species were analyzed, and the differentiation within and among the species was also studied. The results showed that most of the species have the stop codon TAA, with the variation of TGA for Mus musculus. The length of the LF gene with the complete CDS varies greatly, from 2,055 to 2,190 bp, due to deletion, insertion, and stop codon mutation resulting in elongation. Observed genetic diversity was higher among species than within species, and Sus scrofa had more polymorphisms than any other species. Novel amino acid variation sites were detected within several species (8 in Homo sapiens, 6 in Mus musculus, 6 in Capra hircus, 10 in Bos taurus, and 20 in Sus scrofa), which might be used to illustrate the functional variation. Differentiation of the LF gene was obvious among species, and the clustering result was consistent with the taxonomy in the National Center for Biotechnology Information.
Collapse
Affiliation(s)
- Jing-Fen Kang
- College of Animal Science and Technology, Agricultural University of Hebei, Baoding 071001, P.R. China
| | | | | | | | | | | |
Collapse
|
29
|
Incorporating PCA and fuzzy-ART techniques into achieve organism classification based on codon usage consideration. Comput Biol Med 2008; 38:886-93. [DOI: 10.1016/j.compbiomed.2008.05.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2006] [Revised: 03/26/2008] [Accepted: 05/19/2008] [Indexed: 11/20/2022]
|
30
|
The implication of life style on codon usage patterns and predicted highly expressed genes for three Frankia genomes. Antonie van Leeuwenhoek 2008; 93:335-46. [DOI: 10.1007/s10482-007-9211-1] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2007] [Accepted: 11/12/2007] [Indexed: 11/27/2022]
|
31
|
Banerjee T, Ghosh TC. Gene expression level shapes the amino acid usages in Prochlorococcus marinus MED4. J Biomol Struct Dyn 2006; 23:547-54. [PMID: 16494504 DOI: 10.1080/07391102.2006.10507079] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Prochlorococcus species are the first example of free-living bacteria with reduced genome. Codon and amino acid usages bias of Prochlorococcus marinus MED4 was investigated using all protein coding genes having length greater than or equal to 100 amino acids. Correspondence analysis on relative synonymous codon usage (RSCU) values shows that there is no such influence of translational selection in shaping the codon usage variation among the genes in this organism. However, amino acid usages were markedly different between the highly and lowly expressed genes in this organism and in particular, GC rich amino acids were found to occur significantly higher in highly expressed genes than the lowly expressed genes. Comparative analysis of the homologous genes of Synechococcus sp. WH8102 and Prochlorococcus marinus MED4 shows that amino acids conservation in highly expressed genes is significantly higher than lowly expressed genes. Based on our results we concluded that conservation of GC rich amino acids in the highly expressed genes to its ancestor is the major source of variation in amino acid usages in the organism.
Collapse
Affiliation(s)
- T Banerjee
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India
| | | |
Collapse
|
32
|
Gilchrist MA, Wagner A. A model of protein translation including codon bias, nonsense errors, and ribosome recycling. J Theor Biol 2006; 239:417-34. [PMID: 16171830 DOI: 10.1016/j.jtbi.2005.08.007] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2005] [Revised: 08/05/2005] [Accepted: 08/08/2005] [Indexed: 11/15/2022]
Abstract
We present and analyse a model of protein translation at the scale of an individual messenger RNA (mRNA) transcript. The model we develop is unique in that it incorporates the phenomena of ribosome recycling and nonsense errors. The model conceptualizes translation as a probabilistic wave of ribosome occupancy traveling down a heterogeneous medium, the mRNA transcript. Our results show that the heterogeneity of the codon translation rates along the mRNA results in short-scale spikes and dips in the wave. Nonsense errors attenuate this wave on a longer scale while ribosome recycling reinforces it. We find that the combination of nonsense errors and codon usage bias can have a large effect on the probability that a ribosome will completely translate a transcript. We also elucidate how these forces interact with ribosome recycling to determine the overall translation rate of an mRNA transcript. We derive a simple cost function for nonsense errors using our model and apply this function to the yeast (Saccharomyces cervisiae) genome. Using this function we are able to detect position dependent selection on codon bias which correlates with gene expression levels as predicted a priori. These results indirectly validate our underlying model assumptions and confirm that nonsense errors can play an important role in shaping codon usage bias.
Collapse
Affiliation(s)
- Michael A Gilchrist
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, 37996, USA.
| | | |
Collapse
|
33
|
Sau K, Gupta SK, Sau S, Ghosh TC. Synonymous codon usage bias in 16 Staphylococcus aureus phages: implication in phage therapy. Virus Res 2005; 113:123-31. [PMID: 15970346 DOI: 10.1016/j.virusres.2005.05.001] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2005] [Revised: 05/06/2005] [Accepted: 05/10/2005] [Indexed: 11/22/2022]
Abstract
To reveal the factors influencing architecture of protein-coding genes in staphylococcal phages, relative synonymous codon usage variation has been investigated in 920 protein-coding genes of 16 staphylococcal phages. As expected for AT rich genomes, there are predominantly A and T ending codons in all 16 phages. Both Nc plot and correspondence analysis on relative synonymous codon usage indicates that mutation bias influences codon usage variation in the 16 phages. Correspondence analysis also suggests that translational selection and gene length also influence the codon usage variation in the phages to some extent and codon usage in staphylococcal phages is phage-specific but not S. aureus-specific. Further analysis indicates that among 16 staphylococcal phages, 44AHJD, P68 and K may be extremely virulent in nature as most of their genes have high translation efficiency. If this is true, then above three phages may be useful for curing staphylococcal infections.
Collapse
Affiliation(s)
- K Sau
- Bioinformatics Centre, Bose Institute, P1/12, CIT Scheme VII M, Calcutta 700 054, India.
| | | | | | | |
Collapse
|
34
|
Gupta SK, Banerjee T, Basak S, Sahu K, Sau S, Ghosh TC. Studies on codon usage inThermoplasma acidophilum and its possible implications on the occurrences of lateral gene transfer. J Basic Microbiol 2005; 45:344-54. [PMID: 16187257 DOI: 10.1002/jobm.200510576] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Codon usage studies have been carried out on the coding sequences of Thermoplasma acidophilum, which is an archaeon and grows at very low pH and high temperature. Overall codon usage data analysis indicates that all the four bases are almost equifrequent at the third position of codons, which is expected (since genomic GC % of this genome is about 46%). However, multivariate statistical analysis indicates that there are two major trends in the codon usage variation among the genes in this organism. In the first major trend it is observed that genes having G and C ending codons are clustered at one end while, A and T ending ones are clustered at the other end. We have also found a significant positive correlation between the expressivities of genes and GC contents at the synonymous third codon positions. In the second major trend, it is seen that the genes are clustered into three distinct parts. A comparative analyses of codon usage data of T. acidophilum and Sulfolobus solfataricus reveals that one of the three clusters of genes of T. acidophilum is very similar to a considerable number of S. solfataricus genes, suggesting possible occurrences of lateral gene transfer between these two microorganisms as reported by earlier workers.
Collapse
Affiliation(s)
- S K Gupta
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme, VII M Calcutta 700 054. India
| | | | | | | | | | | |
Collapse
|
35
|
Sau K, Sau S, Mandal SC, Ghosh TC. Factors influencing the synonymous codon and amino acid usage bias in AT-rich Pseudomonas aeruginosa phage PhiKZ. Acta Biochim Biophys Sin (Shanghai) 2005; 37:625-33. [PMID: 16143818 PMCID: PMC7109957 DOI: 10.1111/j.1745-7270.2005.00089.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
To reveal how the AT-rich genome of bacteriophage PhiKZ has been shaped in order to carry out its growth in the GC-rich host Pseudomonas aeruginosa, synonymous codon and amino acid usage bias of PhiKZ was investigated and the data were compared with that of P. aeruginosa. It was found that synonymous codon and amino acid usage of PhiKZ was distinct from that of P. aeruginosa. In contrast to P. aeruginosa, the third codon position of the synonymous codons of PhiKZ carries mostly A or T base; codon usage bias in PhiKZ is dictated mainly by mutational bias and, to a lesser extent, by translational selection. A cluster analysis of the relative synonymous codon usage values of 16 myoviruses including PhiKZ shows that PhiKZ is evolutionary much closer to Escherichia coli phage T4. Further analysis reveals that the three factors of mean molecular weight, aromaticity and cysteine content are mostly responsible for the variation of amino acid usage in PhiKZ proteins, whereas amino acid usage of P. aeruginosa proteins is mainly governed by grand average of hydropathicity, aromaticity and cysteine content. Based on these observations, we suggest that codons of the phage-like PhiKZ have evolved to preferentially incorporate the smaller amino acid residues into their proteins during translation, thereby economizing the cost of its development in GC-rich P. aeruginosa.
Collapse
Affiliation(s)
- K. Sau
- Department of Mathematics, Jadavpur UniversityCalcutta 700 032, India
| | - S. Sau
- Department of Biochemistry, Bose Institute, P1/12-CIT Scheme VII MCalcutta 700 054, India
| | - S. C. Mandal
- Department of Mathematics, Jadavpur UniversityCalcutta 700 032, India
- Corresponding authors: S. C. MANDAL: E-mail,
| | - T. C. Ghosh
- Bioinformatics Centre, Bose Institute, P1/12-CIT Scheme VII MCalcutta 700 054, India
- T. C. GHOSH: Tel, +91-33-2334 6626; Fax, +91-33-2334 3886; E-mail,
| |
Collapse
|
36
|
Sahu K, Gupta SK, Sau S, Ghosh TC. Comparative Analysis of the Base Composition and Codon Usages in Fourteen Mycobacteriophage Genomes. J Biomol Struct Dyn 2005; 23:63-71. [PMID: 15918677 DOI: 10.1080/07391102.2005.10507047] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
To study the possible codon usage and base composition variation in the bacteriophages, fourteen mycobacteriophages were used as a model system here and both the parameters in all these phages and their plating bacteria, M. smegmatis had been determined and compared. As all the organisms are GC-rich, the GC contents at third codon positions were found in fact higher than the second codon positions as well as the first + second codon positions in all the organisms indicating that directional mutational pressure is strongly operative at the synonymous third codon positions. Nc plot indicates that codon usage variation in all these organisms are governed by the forces other than compositional constraints. Correspondence analysis suggests that: (i) there are codon usage variation among the genes and genomes of the fourteen mycobacteriophages and M. smegmatis, i.e., codon usage patterns in the mycobacteriophages is phage-specific but not the M. smegmatis-specific; (ii) synonymous codon usage patterns of Barnyard, Che8, Che9d, and Omega are more similar than the rest mycobacteriophages and M. smegmatis; (iii) codon usage bias in the mycobacteriophages are mainly determined by mutational pressure; and (iv) the genes of comparatively GC rich genomes are more biased than the GC poor genomes. Translational selection in determining the codon usage variation in highly expressed genes can be invoked from the predominant occurrences of C ending codons in the highly expressed genes. Cluster analysis based on codon usage data also shows that there are two distinct branches for the fourteen mycobacteriophages and there is codon usage variation even among the phages of each branch.
Collapse
Affiliation(s)
- K Sahu
- Bioinformatics Centre, Bose Institute, P1/12 - CIT Scheme VII M, Calcutta 700 054, India
| | | | | | | |
Collapse
|
37
|
Zhou T, Gu W, Ma J, Sun X, Lu Z. Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses. Biosystems 2005; 81:77-86. [PMID: 15917130 DOI: 10.1016/j.biosystems.2005.03.002] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2004] [Revised: 03/05/2005] [Accepted: 03/07/2005] [Indexed: 10/25/2022]
Abstract
In this study, we calculated the codon usage bias in H5N1 virus and performed a comparative analysis of synonymous codon usage patterns in H5N1 virus, five other evolutionary related influenza A viruses and a influenza B virus. Codon usage bias in H5N1 genome is a little slight, which is mainly determined by the base compositions on the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that the codon usage pattern of H5N1 virus is similar with other influenza A viruses, but not influenza B virus, and the synonymous codon usage in influenza A virus genes is phylogenetically conservative, but not strain-specific. Synonymous codon usage in genes encoded by different influenza A viruses is genus conservative. Compositional constraints could explain most of the variation of synonymous codon usage among these virus genes, while gene function is also correlated to synonymous codon usages to a certain extent. However, translational selection and gene length have no effect on the variations of synonymous codon usage in these virus genes.
Collapse
Affiliation(s)
- Tong Zhou
- Key Laboratory of Molecular and Biomolecular Electronics of the Ministry of Education, Southeast University, Nanjing, Jiangsu 210096, China
| | | | | | | | | |
Collapse
|
38
|
Banerjee T, Basak S, Gupta SK, Ghosh TC. Evolutionary forces in shaping the codon and amino acid usages in Blochmannia floridanus. J Biomol Struct Dyn 2005; 22:13-23. [PMID: 15214801 DOI: 10.1080/07391102.2004.10506976] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Endosymbiotic relationship has great effect on ecological system. Codon and amino acid usages bias of endosymbiotic bacteria Blochmannia floridanus (whose host is an ant Camponotus floridanus) was investigated using experimentally known genes of this organism. Correspondence Analysis on RSCU values show that there exists only one single explanatory major axis that is linked to the strand specific mutational biases. Majority of the genes have a tendency to concentrate on the leading strand, which may be related to the adaptive property related to the replication mechanisms. Amino acid usages were markedly different between the highly and lowly expressed genes in this organism and in particular, GC rich amino acids were found to occur significantly higher in highly expressed genes than the lowly expressed genes. Comparative analyses of the orthologous genes of Escherichia coli and Blochmannia floridanus show that highly expressed genes are significantly more conserved than lowly expressed genes. Based on our results we concluded that strand specific mutational bias is strongly operational in selecting the codon usage in this organism. Replicational-transcriptional selection can be invoked from the presence of majority of highly expressed genes in the leading strand. Conservation of GC rich amino acids in the highly expressed genes to its ancestor is the major source of variation in amino acid usages in the organism. Hydrophobicity of the genes is the second major source in differentiating the genes according to their amino acid usages in this organism.
Collapse
Affiliation(s)
- T Banerjee
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India
| | | | | | | |
Collapse
|
39
|
Fadiel A, Lithwick S, Naftolin F. The influence of environmental adaptation on bacterial genome structure. Lett Appl Microbiol 2005; 40:12-8. [PMID: 15612996 DOI: 10.1111/j.1472-765x.2004.01619.x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
AIMS Researchers have long been puzzled by the diversity of life. Now that the complete genomic sequence of many organisms has been determined, it is possible to evaluate the impact of organismal variation on sequence structure or vice versa. The aim of this investigation was to explore genomic changes mandated by organismal adaptation to its ecological niches. METHODS AND RESULTS Coding sequences from three phylogenetically related bacterial species namely Mycoplasma genitalium, M. pneumoniae and Ureaplasma urealyticum were subject to in depth sequence analyses. M. genitalium and M. pneumoniae both belong to the genus Mycoplasma while U. urealyticum is a member of the genus Ureaplasma. However, M. genitalium and U. urealyticum are urogenital pathogens while M. pneumoniae is a respiratory pathogen. Complete transcriptomes were downloaded from NCBI for each species, and were subject to in silico investigation using in-house software, and public sequence analysis tools. Clear similarities in transcriptome structure were identified among the functionally similar species M. genitalium and U. urealyticum while no such relationship was identified among the phylogenetically related species M. genitalium and M. pneumoniae. CONCLUSIONS It is plausible to conclude that, in these bacterial species, environmental stimuli might be more influential in shaping sequence signatures than phylogenetic relationships. SIGNIFICANCE AND IMPACT OF THE STUDY This study suggests that molecular signatures within the transcriptomes of the species examined are likely to be a product of evolutionary adaptation to diverse environmental ecological stimuli, and not a result of common phylogeny.
Collapse
Affiliation(s)
- A Fadiel
- The Bioinformatics Supercomputing Centre, The Hospital for Sick Children, Toronto, ON, Canada.
| | | | | |
Collapse
|
40
|
Gu W, Zhou T, Ma J, Sun X, Lu Z. The relationship between synonymous codon usage and protein structure in Escherichia coli and Homo sapiens. Biosystems 2004; 73:89-97. [PMID: 15013221 DOI: 10.1016/j.biosystems.2003.10.001] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2003] [Revised: 10/05/2003] [Accepted: 10/05/2003] [Indexed: 11/15/2022]
Abstract
The role of silent position in the codon on the protein structure is an interesting and yet unclear problem. In this paper, 563 Homo sapiens genes and 417 Escherichia coli genes coding for proteins with four different folding types have been analyzed using variance analysis, a multivariate analysis method newly used in codon usage analysis, to find the correlation between amino acid composition, synonymous codon, and protein structure in different organisms. It has been found that in E. coli, both amino acid compositions in differently folded proteins and synonymous codon usage in different gene classes coding for differently folded proteins are significantly different. It was also found that only amino acid composition is different in different protein classes in H. sapiens. There is no universal correlation between synonymous codon usage and protein structure in these two different organisms. Further analysis has shown that GC content on the second codon position can distinguish coding genes for different folded proteins in both organisms.
Collapse
Affiliation(s)
- Wanjun Gu
- Key Laboratory of Molecular and Biomolecular Electronics (Southeast University), Ministry of Education, Nanjing 210096, China
| | | | | | | | | |
Collapse
|
41
|
Gu W, Zhou T, Ma J, Sun X, Lu Z. Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales. Virus Res 2004; 101:155-61. [PMID: 15041183 PMCID: PMC7127446 DOI: 10.1016/j.virusres.2004.01.006] [Citation(s) in RCA: 129] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2003] [Revised: 01/09/2004] [Accepted: 01/09/2004] [Indexed: 12/04/2022]
Abstract
In this study, we calculated the codon usage bias in severe acute respiratory syndrome Coronavirus (SARSCoV) and performed a comparative analysis of synonymous codon usage patterns in SARSCoV and 10 other evolutionary related viruses in the Nidovirales. Although there is a significant variation in codon usage bias among different SARSCoV genes, codon usage bias in SARSCoV is a little slight, which is mainly determined by the base compositions on the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that synonymous codon usage pattern in these virus genes was virus specific and phylogenetically conserved, but it was not host specific. Phylogenetic analysis based on codon usage pattern suggested that SARSCoV was diverged far from all three known groups of Coronavirus. Compositional constraints could explain most of the variation of synonymous codon usage among these virus genes, while gene function is also correlated to synonymous codon usages to a certain extent. However, translational selection and gene length have no effect on the variations of synonymous codon usage in these virus genes.
Collapse
Affiliation(s)
| | | | | | | | - Zuhong Lu
- Corresponding author. Tel.: +86-25-83619983; fax: +86-25-83619983.
| |
Collapse
|
42
|
Gupta SK, Bhattacharyya TK, Ghosh TC. Synonymous Codon Usage inLactococcus lactis: Mutational Bias Versus Translational Selection. J Biomol Struct Dyn 2004; 21:527-36. [PMID: 14692797 DOI: 10.1080/07391102.2004.10506946] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
In this study codon usage bias of all experimentally known genes of Lactococcus lactis has been analyzed. Since Lactococcus lactis is an AT rich organism, it is expected to occur A and/or T at the third position of codons and detailed analysis of overall codon usage data indicates that A and/or T ending codons are predominant in this organism. However, multivariate statistical analyses based both on codon count and on relative synonymous codon usage (RSCU) detect a large number of genes, which are supposed to be highly expressed are clustered at one end of the first major axis, while majority of the putatively lowly expressed genes are clustered at the other end of the first major axis. It was observed that in the highly expressed genes C and T ending codons are significantly higher than the lowly expressed genes and also it was observed that C ending codons are predominant in the duets of highly expressed genes, whereas the T endings codons are abundant in the quartets. Abundance of C and T ending codons in the highly expressed genes suggest that, besides, compositional biases, translational selection are also operating in shaping the codon usage variation among the genes in this organism as observed in other compositionally skewed organisms. The second major axis generated by correspondence analysis on simple codon counts differentiates the genes into two distinct groups according to their hydrophobicity values, but the same analysis computed with relative synonymous codon usage values could not discriminate the genes according to the hydropathy values. This suggests that amino acid composition exerts constraints on codon usage in this organism. On the other hand the second major axis produced by correspondence analysis on RSCU values differentiates the genes into two groups according to the synonymous codon usage for cysteine residues (rarest amino acids in this organism), which is nothing but a artifactual effect induced by the RSCU values. Other factors such as length of the genes and the positions of the genes in the leading and lagging strand of replication have practically no influence in the codon usage variation among the genes in this organism.
Collapse
Affiliation(s)
- S K Gupta
- Bioinformatics Centre Bose Institute, P 1/12, CIT Scheme VII M, Kolkata 700 054, India
| | | | | |
Collapse
|
43
|
von Samson-Himmelstjerna G, Harder A, Failing K, Pape M, Schnieder T. Analysis of codon usage in beta-tubulin sequences of helminths. Parasitol Res 2003; 90:294-300. [PMID: 12684885 DOI: 10.1007/s00436-003-0840-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2002] [Accepted: 01/21/2003] [Indexed: 11/27/2022]
Abstract
Codon usage bias has been shown to be correlated with gene expression levels in many organisms, including the nematode Caenorhabditis elegans. Here, the codon usage (cu) characteristics for a set of currently available beta-tubulin coding sequences of helminths were assessed by calculating several indices, including the effective codon number (Nc), the intrinsic codon deviation index (ICDI), the P2 value and the mutational response index (MRI). The P2 value gives a measure of translational pressure, which has been shown to be correlated to high gene expression levels in some organisms, but it has not yet been analysed in that respect in helminths. For all but two of the C. elegans beta-tubulin coding sequences investigated, the P2 value was the only index that indicated the presence of codon usage bias. Therefore, we propose that in general the helminth beta-tubulin sequences investigated here are not expressed at high levels. Furthermore, we calculated the correlation coefficients for the cu patterns of the helminth beta-tubulin sequences compared with those of highly expressed genes in organisms such as Escherichia coli and C. elegans. It was found that beta-tubulin cu patterns for all sequences of members of the Strongylida were significantly correlated to those for highly expressed C. elegans genes. This approach provides a new measure for comparing the adaptation of cu of a particular coding sequence with that of highly expressed genes in possible expression systems.Finally, using the cu patterns of the sequences studied, a phylogenetic tree was constructed. The topology of this tree was very much in concordance with that of a phylogeny based on small subunit ribosomal DNA sequence alignments.
Collapse
Affiliation(s)
- G von Samson-Himmelstjerna
- Institute of Parasitology, Hannover School of Veterinary Medicine, Buenteweg 17, 30559 Hannover, Germany.
| | | | | | | | | |
Collapse
|
44
|
Ma J, Zhou T, Gu W, Sun X, Lu Z. Cluster analysis of the codon use frequency of MHC genes from different species. Biosystems 2002; 65:199-207. [PMID: 12069729 DOI: 10.1016/s0303-2647(02)00016-3] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The relative synonymous codon use frequency of 135 MHC genes from four mammal species (Homo sapiens, Pan troglodyte, Macaca mulanta and Rattus norvegicus) is analyzed using a hierarchical cluster method. The result suggests that gene function is the dominant factor that determines codon usage bias, while species is a minor factor that determines further difference in codon usage bias for genes with similar functions. The conclusion may be useful in gene classification and gene function prediction.
Collapse
Affiliation(s)
- Jianmin Ma
- Chien-Shiung Wu Laboratory, Southeast University, 210096 Jiangsu Province, Nanjing, People's Republic of China.
| | | | | | | | | |
Collapse
|
45
|
Gupta SK, Ghosh TC. Gene expressivity is the main factor in dictating the codon usage variation among the genes in Pseudomonas aeruginosa. Gene 2001; 273:63-70. [PMID: 11483361 DOI: 10.1016/s0378-1119(01)00576-5] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Codon usage biases of all DNA sequences (length greater than or equal to 300 bp) from the complete genome of Pseudomonas aeruginosa have been analyzed. As P. aeruginosa is a GC-rich organism, G and/or C are expected to predominate in their codons. Overall codon usage data analysis indicates that indeed codons ending in G and/or C are predominant in this organism. But multivariate statistical analysis indicates that there is a single major trend in the codon usage variation among the genes in this organism, which has a strong negative correlation with the expressivities of the genes. The majority of the lowly expressed genes are scattered towards the positive end of the major axis whereas the highly expressed genes are clustered towards the negative end. This is the first report where the prokaryotic organism having highly skewed base composition is dictated mainly by translational selection, though some other factors such as the lengths of the genes as well as the hydrophobicity of genes also influence the codon usage variation among the genes in this organism in a minor way.
Collapse
Affiliation(s)
- S K Gupta
- Distributed Information Centre, Bose Institute, P 1/12, C.I.T. Scheme, VII M, Calcutta 700 054, India
| | | |
Collapse
|
46
|
Naya H, Romero H, Carels N, Zavala A, Musto H. Translational selection shapes codon usage in the GC-rich genome of Chlamydomonas reinhardtii. FEBS Lett 2001; 501:127-30. [PMID: 11470270 DOI: 10.1016/s0014-5793(01)02644-8] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
In unicellular species codon usage is determined by mutational biases and natural selection. Among prokaryotes, the influence of these factors is different if the genome is skewed towards AT or GC, since in AT-rich organisms translational selection is absent. On the other hand, in AT-rich unicellular eukaryotes the two factors are present. In order to understand if GC-rich genomes display a similar behavior, the case of Chlamydomonas reinhardtii was studied. Since we found that translational selection strongly influences codon usage in this species, we conclude that there is not a common pattern among unicellular organisms.
Collapse
Affiliation(s)
- H Naya
- Departamento de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | | | | | | | | |
Collapse
|
47
|
Fadiel A, Lithwick S, Wanas MQ, Cuticchia AJ. Influence of intercodon and base frequencies on codon usage in filarial parasites. Genomics 2001; 74:197-210. [PMID: 11386756 DOI: 10.1006/geno.2001.6531] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Base frequency, codon usage, and intercodon identity were analyzed in five filarial parasite species representing five Onchocercidae genera. Wucheria bancrofti, Brugia malayi, Onchocerca volvulus, Acanthocheilonema viteae, and Dirofilaria immitis gene sequences were downloaded from NCBI, and analysis was performed using locally designed computer programs and other freely available applications. A clear sequence bias was observed among the nematode species examined. At the nucleotide level, AT basepairs were present in gene sequences at higher frequencies than GC. In addition, codons ending in A or T were used proportionately more than those with G or C in the third-codon position. In addition, the amino acids used most often corresponded to codons ending in AT basepairs. Intercodon base proportion was biased in that A was found most often at N4, second only to T in certain specific cases. Since all of these sequence biases were observed in a relatively consistent fashion among all of the organisms studied, we conclude that sequence bias is a genetic characteristic, which is associated with multiple filarial genera.
Collapse
Affiliation(s)
- A Fadiel
- Bioinformatics Supercomputing Centre, The Hospital for Sick Children, Toronto, Ontario M5G 1Z8, Canada.
| | | | | | | |
Collapse
|
48
|
Grocock RJ, Sharp PM. Synonymous codon usage in Cryptosporidium parvum: identification of two distinct trends among genes. Int J Parasitol 2001; 31:402-12. [PMID: 11306119 DOI: 10.1016/s0020-7519(01)00129-1] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The usage of alternative synonymous codons in the apicomplexan Cryptosporidium parvum has been investigated. A data set of 54 genes was analysed. Overall, A- and U-ending codons predominate, as expected in an A+T-rich genome. Two trends of codon usage variation among genes were identified using correspondence analysis. The primary trend is in the extent of usage of a subset of presumably translationally optimal codons, that are used at significantly higher frequencies in genes expected to be expressed at high levels. Fifteen of the 18 codons identified as optimal are more G+C-rich than the otherwise common codons, so that codon selection associated with translation opposes the general mutation bias. Among 40 genes with lower frequencies of these optimal codons, a secondary trend in G+C content was identified. In these genes, G+C content at synonymously variable third positions of codons is correlated with that in 5' and 3' flanking sequences, indicative of regional variation in G+C content, perhaps reflecting regional variation in mutational biases.
Collapse
Affiliation(s)
- R J Grocock
- Institute of Genetics, University of Nottingham, Queens Medical Centre, Nottingham, NG7 2UH, UK
| | | |
Collapse
|