51
|
Deka H, Nath D, Uddin A, Chakraborty S. DNA compositional dynamics and codon usage patterns of M1 and M2 matrix protein genes in influenza A virus. INFECTION GENETICS AND EVOLUTION 2018; 67:7-16. [PMID: 30367980 DOI: 10.1016/j.meegid.2018.10.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Revised: 10/11/2018] [Accepted: 10/23/2018] [Indexed: 11/30/2022]
Abstract
Influenza A virus subtype H3N2 has been a serious health issue across the globe with approximately 36 thousand annual casualties in the United States of America only. Co-circulation in avian, swine and human hosts has led to frequent mutations in the virus genome, due to which development of successful antivirals against the virus has become a formidable challenge. Recently, focussed research is being carried out targeting the matrix proteins of this strain as vaccine candidates. This study is carried out to unravel the key features of the genes encoding the matrix proteins that manoeuvre the codon usage profile in the H3N2 strains. The findings reveal differential codon choice for both matrix protein 1 and matrix protein 2. The overall codon usage bias is less pronounced in both the datasets which is evident from higher value of effective number of codons (>55). Comparison of the codon usage for both the genes under study with that of humans revealed that the viral codon usage is not fully optimized for the human host conditions. Both the genes enrolled in the study showed variation which was reflected in almost all the indices used for codon usage studies. Neutrality analysis revealed a weak role of mutation pressure while selection was the major contributor towards codon usage.
Collapse
Affiliation(s)
- Himangshu Deka
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Durbba Nath
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Hailakandi 788150, Assam, India.
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
Collapse
|
52
|
Barbhuiya PA, Uddin A, Chakraborty S. Compositional properties and codon usage of TP73 gene family. Gene 2018; 683:159-168. [PMID: 30316927 DOI: 10.1016/j.gene.2018.10.030] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Revised: 10/03/2018] [Accepted: 10/11/2018] [Indexed: 12/19/2022]
Abstract
The TP73 gene is considered as one of the members of TP53 gene family and shows much homology to p53 gene. TP73 gene plays a pivotal role in cancer studies in addition to other biological functions. Codon usage bias (CUB) is the phenomenon of unequal usage of synonymous codons for an amino acid wherein some codons are more frequently used than others and it reveals the evolutionary relationship of a gene. Here, we report the pattern of codon usage in TP73 gene using various bioinformatic tools as no work was reported yet. Nucleotide composition analysis suggested that the mean nucleobase C was the highest, followed by G and the gene was GC rich. Correlation analysis between codon usage and GC3 suggested that most of the GC-ending codons showed positive correlation while most of the AT-ending codons showed negative correlation with GC3 in the coding sequences of TP73 gene variants in human. The CUB is moderate in human TP73 gene as evident from intrinsic codon deviation index (ICDI) analysis. Nature selected against two codons namely ATA (isoleucine) and AGA (arginine) in the coding sequences of TP73 gene during the course of evolution. A significant correlation (p < 0.05) was found between overall nucleotide composition and its composition at the 3rd codon position, indicating that both mutation pressure and natural selection might influence the CUB. The correlation analysis between ICDI and biochemical properties of protein suggested that variation of CUB was associated with degree of hydrophobicity and length of protein.
Collapse
Affiliation(s)
- Parvin A Barbhuiya
- Departments of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi 788150, Assam, India
| | - Supriyo Chakraborty
- Departments of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
Collapse
|
53
|
Mazumder GA, Uddin A, Chakraborty S. Preference of A/T ending codons in mitochondrial ATP6 gene under phylum Platyhelminthes. Mol Biochem Parasitol 2018; 225:15-26. [DOI: 10.1016/j.molbiopara.2018.08.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Revised: 08/17/2018] [Accepted: 08/22/2018] [Indexed: 11/27/2022]
|
54
|
Maldonado LL, Stegmayer G, Milone DH, Oliveira G, Rosenzvit M, Kamenetzky L. Whole genome analysis of codon usage in Echinococcus. Mol Biochem Parasitol 2018; 225:54-66. [DOI: 10.1016/j.molbiopara.2018.08.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2018] [Revised: 07/20/2018] [Accepted: 08/01/2018] [Indexed: 01/15/2023]
|
55
|
Chakraborty S, Uddin A, Mazumder TH, Choudhury MN, Malakar AK, Paul P, Halder B, Deka H, Mazumder GA, Barbhuiya RA, Barbhuiya MA, Devi WJ. Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents. Mitochondrion 2018; 42:64-76. [DOI: 10.1016/j.mito.2017.11.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 10/09/2017] [Accepted: 11/27/2017] [Indexed: 02/03/2023]
|
56
|
Guan DL, Ma LB, Khan MS, Zhang XX, Xu SQ, Xie JY. Analysis of codon usage patterns in Hirudinaria manillensis reveals a preference for GC-ending codons caused by dominant selection constraints. BMC Genomics 2018; 19:542. [PMID: 30016953 PMCID: PMC6050667 DOI: 10.1186/s12864-018-4937-x] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 07/10/2018] [Indexed: 02/07/2023] Open
Abstract
Background Hirudinaria manillensis is an ephemeral, blood-sucking ectoparasite, possessing anticoagulant capacities with potential medical applications. Analysis of codon usage patterns would contribute to our understanding of the evolutionary mechanisms and genetic architecture of H. manillensis, which in turn would provide insight into the characteristics of other leeches. We analysed codon usage and related indices using 18,000 coding sequences (CDSs) retrieved from H. manillensis RNA-Seq data. Results We identified four highly preferred codons in H. manillensis that have G/C-endings. Points generated in an effective number of codons (ENC) plot distributed below the standard curve and the slope of a neutrality plot was less than 1. Highly expressed CDSs had lower ENC content and higher GC content than weakly expressed CDSs. Principal component analysis conducted on relative synonymous codon usage (RSCU) values divided CDSs according to GC content and divided codons according to ending bases. Moreover, by determining codon usage, we found that the majority of blood-diet related genes have undergone less adaptive evolution in H. manillensis, except for those with homologous sequences in the host species. Conclusions Codon usage in H. manillensis had an overall preference toward C-endings and indicated that codon usage patterns are mediated by differential expression, GC content, and biological function. Although mutation pressure effects were also notable, the majority of genetic evolution in H. manillensis was driven by natural selection. Electronic supplementary material The online version of this article (10.1186/s12864-018-4937-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- De-Long Guan
- College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China
| | - Li-Bin Ma
- College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China
| | - Muhammad Salabat Khan
- College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China
| | - Xiu-Xiu Zhang
- College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China
| | - Sheng-Quan Xu
- College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China.
| | - Juan-Ying Xie
- School of Computer Science, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China.
| |
Collapse
|
57
|
Uddin A, Chakraborty S. Codon Usage Pattern of Genes Involved in Central Nervous System. Mol Neurobiol 2018; 56:1737-1748. [PMID: 29922982 DOI: 10.1007/s12035-018-1173-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 06/01/2018] [Indexed: 11/28/2022]
Abstract
Codon usage bias (CUB) is the non-uniform usage of synonymous codons in which some codons are more preferred to others in the transcript. Analysis of codon usage bias has applications in understanding the basics of molecular biology, genetics, gene expression, and molecular evolution. To understand the patterns of codon usage in genes involved in the central nervous system (CNS), we used bioinformatic approaches to analyze the protein-coding sequences of genes involved in the CNS. The improved effective number of codons (ENC) suggested that the overall codon usage bias was low. The relative synonymous codon usage (RSCU) revealed that the most frequently occurring codons had a G or C at the third codon position. The codons namely TCC, AGC, CTG, CAG, CGC, ATC, ACC, GTG, GCC, GGC, and CGG (average RSCU > 1.6) were over-represented. Both mutation pressure and natural selection might affect the codon usage pattern as evident from correspondence and parity plot analyses. The overall GC content (59.93) was higher than AT content, i.e., genes were GC-rich. The correlation of GC12 with GC3 suggested that mutation pressure might affect the codon usage pattern.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi, Assam, 788150, India.
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, Assam, 788011, India.
| |
Collapse
|
58
|
Dissimilar substitution rates between two strands of DNA influence codon usage pattern in some human genes. Gene 2018; 645:179-187. [PMID: 29229516 DOI: 10.1016/j.gene.2017.12.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 12/05/2017] [Accepted: 12/07/2017] [Indexed: 11/23/2022]
Abstract
We illustrated the descriptive aspects of codon usage of some important human genes and their expression potential in E. coli. By comparing the results of various codon usage parameters, effects that are due to selection and mutational pressures have been deciphered. The variation in GC3s explains a significant proportion of the variation in codon usage patterns. The codons CGC, CGG, CTG and GCG showed strong positive correlation with GC3, which suggested that codon usage had been influenced by GC bias. We also found that ACC (Thr, RSCU-1.77), GCC (Ala, RSCU-1.67), CCC (Pro, RSCU-1.54), TCC (Ser, RSCU-1.47) were frequently used which signified that C was common at 2nd and 3rd codon positions. Correspondence analysis revealed that F1 axis had significant correlation with various GC contents suggesting that compositional properties under mutation pressure might affect codon usage bias. Nc-GC3 plot analysis suggested that both mutation pressure and natural selection might affect the codon usage bias which is also supported by neutrality plot analysis. The dinucleotide CT, TG and AG were significantly over-represented and CG, TA, AT, TT, and GT were underrepresented due to high rate of spontaneous mutation resulting from cytosine deamination.
Collapse
|
59
|
Paul P, Malakar AK, Chakraborty S. Codon usage vis-a-vis start and stop codon context analysis of three dicot species. J Genet 2018; 97:97-107. [PMID: 29666329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
To understand the variation in genomic composition and its effect on codon usage, we performed the comparative analysis of codon usage and nucleotide usage in the genes of three dicots, Glycine max, Arabidopsis thaliana and Medicago truncatula. The dicot genes were found to be A/T rich and have predominantly A-ending and/or T-ending codons. GC3s directly mimic theusage pattern of global GC content. Relative synonymous codon usage analysis suggests that the high usage frequency of A/T over G/C mononucleotide containing codons in AT-rich dicot genome is due to compositional constraint as a factor of codon usage bias. Odds ratio analysis identified the dinucleotides TpG, TpC, GpA, CpA and CpT as over-represented, where, CpG and TpA as under-represented dinucleotides. The results of (NcExp-NcObs)/NcExp plot suggests that selection pressure other than mutation played a significant role in influencing the pattern of codon usage in these dicots. PR2 analysis revealed the significant role of selection pressure on codon usage. Analysis of varience on codon usage at start and stop site showed variation in codon selection in these sites. This study provides evidence that the dicot genes were subjected to compositional selection pressure.
Collapse
Affiliation(s)
- Prosenjit Paul
- Department of Biotechnology, Assam University, Silchar 788 011, India.
| | | | | |
Collapse
|
60
|
Deb B, Uddin A, Mazumder GA, Chakraborty S. Analysis of codon usage pattern of mitochondrial protein-coding genes in different hookworms. Mol Biochem Parasitol 2018; 219:24-32. [DOI: 10.1016/j.molbiopara.2017.11.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2017] [Revised: 11/14/2017] [Accepted: 11/16/2017] [Indexed: 12/11/2022]
|
61
|
Paul P, Malakar AK, Chakraborty S. Compositional bias coupled with selection and mutation pressure drives codon usage in Brassica campestris genes. Food Sci Biotechnol 2017; 27:725-733. [PMID: 30263798 DOI: 10.1007/s10068-017-0285-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2017] [Revised: 11/28/2017] [Accepted: 12/03/2017] [Indexed: 11/25/2022] Open
Abstract
The plant Brassica campestris includes the vegetables turnip and Chinese cabbage, important plants of economic importance. Here, we have analysed the codon usage bias of B. campestris for 116 protein coding genes. Neutrality analysis showed that B. campestris had a wide range of GC3s, and a significant correlation was observed between GC12 and GC3. Nc versus GC3s plot showed a few genes on or proximate to the expected curve, but the majority of points were found to be scattered distantly from the expected curve. Correspondence analysis on codon usage revealed that the position preference of codons on multidimensional space totally depends on the presence of A and T at synonymous third codon position. These results altogether suggest that composition bias along with selection (major) and mutation pressure (minor) affects the codon usage pattern of the protein coding genes in Brassica campestris.
Collapse
Affiliation(s)
- Prosenjit Paul
- Department of Biotechnology, Assam University, Silchar, Assam 788011 India
| | - Arup Kumar Malakar
- Department of Biotechnology, Assam University, Silchar, Assam 788011 India
| | | |
Collapse
|
62
|
Goswami AM. Codon usage patterns of 3β-hydroxysteroid dehydrogenase type 2 gene across mammalian species and the influence of mutation and selection pressure. GENE REPORTS 2017. [DOI: 10.1016/j.genrep.2017.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
63
|
Genome-wide analysis of codon usage bias patterns in an enterotoxigenic Escherichia coli F18 strain. Genes Genomics 2017. [DOI: 10.1007/s13258-017-0519-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
64
|
Factors affecting the codon usage bias of SRY gene across mammals. Gene 2017; 630:13-20. [PMID: 28827114 DOI: 10.1016/j.gene.2017.08.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 07/20/2017] [Accepted: 08/03/2017] [Indexed: 11/24/2022]
Abstract
Codon usage bias (CUB) is extensively found in a wide variety genomes and it is mostly affected by mutation pressure and natural selection. Analysis of CUB helps in studying the evolutionary features of a genome. The SRY gene plays an important role in male reproductive organ and a good candidate to study the evolutionary forces, since little work was reported earlier on this gene. We used bioinformatic methods to analyze the protein-coding sequences of SRY gene in 172 different mammalian species to understand the patterns of codon usage and the evolutionary forces acting on it. We found that the codon bias of SRY gene varies widely across mammals. Relative synonymous codon usage (RSCU) value revealed that the codons such as TCG, CCG, CAT, ATT, ACT, GCT, GTT, GCG, GGG and GGT were over-represented. Correspondence analysis indicated that the distribution of codons was more close to the axes indicating that compositional constraints might correlate to codon bias. Z-score analysis on RSCU values of codons identified a set of 11 codons viz. TCT, TTT, CTA, CTC, TAT, CAG, CGT, ATA, ACC, AAT and GTA which differed significantly at p<0.01 between 5% high and low gene expression datasets. Further, it was evident from the neutrality plot that GC12 was influenced by both mutation pressure and natural selection. From the study we concluded that natural selection played a dominant role, but mutational pressure played a minor role in the codon usage pattern of SRY gene across mammals.
Collapse
|
65
|
Nath Choudhury M, Uddin A, Chakraborty S. Codon usage bias and its influencing factors for Y-linked genes in human. Comput Biol Chem 2017; 69:77-86. [DOI: 10.1016/j.compbiolchem.2017.05.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Revised: 05/04/2017] [Accepted: 05/20/2017] [Indexed: 11/30/2022]
|
66
|
Sadhasivam A, Vetrivel U. Genome-wide codon usage profiling of ocular infective Chlamydia trachomatis serovars and drug target identification. J Biomol Struct Dyn 2017. [PMID: 28627970 DOI: 10.1080/07391102.2017.1343685] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Chlamydia trachomatis (C.t) is a Gram-negative obligate intracellular bacteria and is a major causative of infectious blindness and sexually transmitted diseases. Among the varied serovars of this organism, A, B and C are reported as prominent ocular pathogens. Genomic studies of these strains shall aid in deciphering potential drug targets and genomic influence on pathogenesis. Hence, in this study we performed deep statistical profiling of codon usage in these serovars. The overall base composition analysis reveals that these serovars are over biased to AU than GC. Similarly, relative synonymous codon usage also showed preference towards A/U ending codons. Parity Rule 2 analysis inferred unequal distribution of AT and GC, indicative of other unknown factors acting along with mutational pressure to influence codon usage bias (CUB). Moreover, absolute quantification of CUB also revealed lower bias across these serovars. The effect of natural selection on CUB was also confirmed by neutrality plot, reinforcing natural selection under mutational pressure turned to be a pivotal role in shaping the CUB in the strains studied. Correspondence analysis (COA) clarified that, C.t C/TW-3 to show a unique trend in codon usage variation. Host influence analysis on shaping the codon usage pattern also inferred some speculative relativity. In a nutshell, our finding suggests that mutational pressure is the dominating factor in shaping CUB in the strains studied, followed by natural selection. We also propose potential drug targets based on cumulative analysis of strand bias, CUB and human non-homologue screening.
Collapse
Affiliation(s)
- Anupriya Sadhasivam
- a Centre for Bioinformatics , Kamalnayan Bajaj Institute for Research in Vision and Ophthalmology, Vision Research Foundation, Sankara Nethralaya , Chennai 600 006 , Tamil Nadu , India
| | - Umashankar Vetrivel
- a Centre for Bioinformatics , Kamalnayan Bajaj Institute for Research in Vision and Ophthalmology, Vision Research Foundation, Sankara Nethralaya , Chennai 600 006 , Tamil Nadu , India
| |
Collapse
|
67
|
Pathak J, Kannaujiya VK, Singh SP, Sinha RP. Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats. 3 Biotech 2017; 7:192. [PMID: 28664377 DOI: 10.1007/s13205-017-0826-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 05/31/2017] [Indexed: 12/17/2022] Open
Abstract
Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
Collapse
Affiliation(s)
- Jainendra Pathak
- Laboratory of Photobiology and Molecular Microbiology, Centre of Advanced Study in Botany, Institute of Science, Banaras Hindu University, Varanasi, 221005, India
| | - Vinod K Kannaujiya
- Laboratory of Photobiology and Molecular Microbiology, Centre of Advanced Study in Botany, Institute of Science, Banaras Hindu University, Varanasi, 221005, India
| | - Shailendra P Singh
- Laboratory of Photobiology and Molecular Microbiology, Centre of Advanced Study in Botany, Institute of Science, Banaras Hindu University, Varanasi, 221005, India
| | - Rajeshwar P Sinha
- Laboratory of Photobiology and Molecular Microbiology, Centre of Advanced Study in Botany, Institute of Science, Banaras Hindu University, Varanasi, 221005, India.
| |
Collapse
|
68
|
Uddin A, Choudhury MN, Chakraborty S. Factors influencing codon usage of mitochondrial ND1 gene in pisces, aves and mammals. Mitochondrion 2017; 37:17-26. [PMID: 28668667 DOI: 10.1016/j.mito.2017.06.004] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Revised: 05/19/2017] [Accepted: 06/26/2017] [Indexed: 01/05/2023]
Abstract
Animal mitochondrial genome harbours 13 protein coding genes which regulate the process of respiration. The mitochondrial NADH dehydrogenase 1 (MT-ND1) gene, one of the 13 protein-coding genes, encodes the NADH dehydrogenase 1 enzyme of the respiratory chain. Analysis of codon usage bias (CUB) acquires importance for better understanding of the molecular biology, new gene discovery, design of transgenes and gene evolution. The MT-ND1 gene seems to be a good candidate for analyzing codon usage pattern, since no work has yet been reported. Moreover, it is still not clear which factors significantly influence the codon usage pattern. In the present study, comparative analysis of codon usage pattern, expression level and influencing factors for MT-ND1 gene from 100 different species each of pisces, aves and mammals were used for CUB analysis. Our result suggests that the gene is AT rich in pisces, aves and mammals and most of the nucleotides significantly differ among them as revealed from t-test. CUB was not remarkable as reflected by high value of effective number of codons and it also significantly differs among pisces, aves and mammals. Although we found that CUB is mainly influenced by natural selection and mutation pressure for MT-ND1 gene as suggested by correlation and correspondence analysis but neutrality plot further revealed that natural selection played a major role and mutation pressure played a minor role in codon usage pattern. Additionally, t-test analysis showed that the MT-ND1 gene has a wide significant discrepancy in codon choices in pisces, aves and mammals. This study has contributed to boost our understanding about the mechanism of distribution of the codons and the factors that may influence the evolution of the MT-ND1 gene.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi 788150, Assam, India.
| | | | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
Collapse
|
69
|
Bae YA. Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis. THE KOREAN JOURNAL OF PARASITOLOGY 2017; 55:175-183. [PMID: 28506040 PMCID: PMC5450960 DOI: 10.3347/kjp.2017.55.2.175] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Revised: 04/05/2017] [Accepted: 04/06/2017] [Indexed: 11/28/2022]
Abstract
Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC12 and GC3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
Collapse
|
70
|
Huang X, Xu J, Chen L, Wang Y, Gu X, Peng X, Yang G. Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps. BMC Genomics 2017; 18:308. [PMID: 28427327 PMCID: PMC5397707 DOI: 10.1186/s12864-017-3704-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Accepted: 04/12/2017] [Indexed: 12/04/2022] Open
Abstract
Background Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Results Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as “optimal codons”. Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. Conclusions In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3704-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xing Huang
- Department of Parasitology, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, 611130, China.,Chengdu Agricultural College, Chengdu, 611130, China
| | - Jing Xu
- Department of Parasitology, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, 611130, China
| | - Lin Chen
- Meat-processing Application Key Laboratory of Sichuan Province, College of Pharmacy and Biological Engineering, Chengdu University, Chengdu, 610106, China
| | - Yu Wang
- Department of Parasitology, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, 611130, China
| | - Xiaobin Gu
- Department of Parasitology, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, 611130, China
| | - Xuerong Peng
- College of Science, Sichuan Agricultural University, Ya'an, 625014, China
| | - Guangyou Yang
- Department of Parasitology, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, 611130, China.
| |
Collapse
|
71
|
Gene expression, nucleotide composition and codon usage bias of genes associated with human Y chromosome. Genetica 2017; 145:295-305. [PMID: 28421323 DOI: 10.1007/s10709-017-9965-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Accepted: 04/08/2017] [Indexed: 10/19/2022]
Abstract
Analysis of codon usage pattern is important to understand the genetic and evolutionary characteristics of genomes. We have used bioinformatic approaches to analyze the codon usage bias (CUB) of the genes located in human Y chromosome. Codon bias index (CBI) indicated that the overall extent of codon usage bias was low. The relative synonymous codon usage (RSCU) analysis suggested that approximately half of the codons out of 59 synonymous codons were most frequently used, and possessed a T or G at the third codon position. The codon usage pattern was different in different genes as revealed from correspondence analysis (COA). A significant correlation between effective number of codons (ENC) and various GC contents suggests that both mutation pressure and natural selection affect the codon usage pattern of genes located in human Y chromosome. In addition, Y-linked genes have significant difference in GC contents at the second and third codon positions, expression level, and codon usage pattern of some codons like the SPANX genes in X chromosome.
Collapse
|
72
|
Choudhury MN, Uddin A, Chakraborty S. Nucleotide composition and codon usage bias of SRY gene. Andrologia 2017; 50. [PMID: 28124482 DOI: 10.1111/and.12787] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/24/2016] [Indexed: 11/27/2022] Open
Abstract
The SRY gene is present within the sex-determining region of the Y chromosome which is responsible for maleness in mammals. The nonuniform usage of synonymous codons in the mRNA transcript for encoding a particular amino acid is the codon usage bias (CUB). Analysis of codon usage pattern is important to understand the genetic and molecular organisation of a gene. It also helps in heterologous gene expression, design of primer and synthetic gene. However, the analysis of codon usage bias of SRY gene was not yet studied. We have used bioinformatic tools to analyse codon usage bias of SRY gene across mammals. Codon bias index (CBI) indicated that the overall extent of codon usage bias was weak. The relative synonymous codon usage (RSCU) analysis suggested that most frequently used codons had an A or C at the third codon position. Compositional constraint played an important role in codon usage pattern as evident from correspondence analysis (CA). Significant correlation among nucleotides constraints indicated that both mutation pressure and natural selection affect the codon usage pattern. Neutrality plot suggested that natural selection might play a major role, while mutation pressure might play a minor role in codon usage pattern in SRY gene in different species of mammals.
Collapse
Affiliation(s)
- M N Choudhury
- Department of Biotechnology, Assam University, Silchar, Assam, India
| | - A Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi, India
| | - S Chakraborty
- Department of Biotechnology, Assam University, Silchar, Assam, India
| |
Collapse
|
73
|
Analysis of codon usage patterns in Ginkgo biloba reveals codon usage tendency from A/U-ending to G/C-ending. Sci Rep 2016; 6:35927. [PMID: 27808241 PMCID: PMC5093902 DOI: 10.1038/srep35927] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 10/07/2016] [Indexed: 11/08/2022] Open
Abstract
As one of the most ancient tree species, the codon usage pattern analysis of Ginkgo biloba is a useful way to understand its evolutionary and genetic mechanisms. Several studies have been conducted on angiosperms, but seldom on gymnosperms. Based on RNA-Seq data of the G. biloba transcriptome, amount to 17,579 unigenes longer than 300 bp were selected and analyzed from 68,547 candidates. The codon usage pattern tended towards more frequently use of A/U-ending codons, which showed an obvious gradient progressing from gymnosperms to dicots to monocots. Meanwhile, analysis of high/low-expression unigenes revealed that high-expression unigenes tended to use G/C-ending codons together with more codon usage bias. Variation of unigenes with different functions suggested that unigenes involving in environment adaptation use G/C-ending codons more frequently with more usage bias, and these results were consistent with the conclusion that the formation of G. biloba codon usage bias was dominated by natural selection.
Collapse
|
74
|
Zhao Y, Zheng H, Xu A, Yan D, Jiang Z, Qi Q, Sun J. Analysis of codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) and its relation to evolution. BMC Genomics 2016; 17:677. [PMID: 27558469 PMCID: PMC4997668 DOI: 10.1186/s12864-016-3021-7] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Accepted: 08/16/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. RESULTS A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. CONCLUSIONS There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Collapse
Affiliation(s)
- Yongchao Zhao
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Hao Zheng
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Anying Xu
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang Jiangsu, 212018, People's Republic of China
| | - Donghua Yan
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Zijian Jiang
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Qi Qi
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Jingchen Sun
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| |
Collapse
|
75
|
Malakar AK, Halder B, Paul P, Chakraborty S. Cytochrome P450 genes in coronary artery diseases: Codon usage analysis reveals genomic GC adaptation. Gene 2016; 590:35-43. [PMID: 27275533 DOI: 10.1016/j.gene.2016.06.011] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2016] [Revised: 04/12/2016] [Accepted: 06/03/2016] [Indexed: 10/21/2022]
Abstract
Establishing codon usage biases are imperative for understanding the etiology of coronary artery diseases (CAD) as well as the genetic factors associated with these diseases. The aim of this study was to evaluate the contribution of 18 responsible cytochrome P450 (CYP) genes for the risk of CAD. Effective number of codon (Nc) showed a negative correlation with both GC3 and synonymous codon usage order (SCUO) suggesting an antagonistic relationship between codon usage and Nc of genes. The dinucleotide analysis revealed that CG and TA dinucleotides have the lowest odds ratio in these genes. Principal component analysis showed that GC composition has a profound effect in separating the genes along the first major axis. Our findings revealed that mutational pressure and natural selection could possibly be the major factors responsible for codon bias in these genes. The study not only offers an insight into the mechanisms of genomic GC adaptation, but also illustrates the complexity of CYP genes in CAD.
Collapse
Affiliation(s)
- Arup Kumar Malakar
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Binata Halder
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Prosenjit Paul
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
Collapse
|
76
|
Expression levels and codon usage patterns in nuclear genes of the filarial nematode Wucheraria bancrofti and the blood fluke Schistosoma haematobium. J Helminthol 2016; 91:72-79. [PMID: 27048929 DOI: 10.1017/s0022149x16000092] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Synonymous codons are used with different frequencies, a phenomenon known as codon bias, which exists in many genomes and is mainly resolute by mutation and selection. To elucidate the genetic characteristics and evolutionary relationship of Wucheraria bancrofti and Schistosoma haematobium we examined the pattern of synonymous codon usage in nuclear genes of both the species. The mean overall GC contents of W. bancrofti and S. haematobium were 43.41 and 36.37%, respectively, which suggests that genes in both the species were AT rich. The value of the High Effective Number of Codons in both species suggests that codon usage bias was weak. Both species had a wide range of P3 distribution in the neutrality plot, with a significant correlation between P12 and P3. The codons were closer to the axes in correspondence analysis, suggesting that mutation pressure influenced the codon usage pattern in these species. We have identified the more frequently used codons in these species, most codons ending with an A or T. The nucleotides A/T and C/G were not proportionally used at the third position of codons, which reveals that natural selection might influence the codon usage patterns. The regression equation of P12 on P3 suggests that natural selection might have played a major role, while mutational pressure played a minor role in codon usage pattern in both species. These results form the basis of exploring the evolutionary mechanisms and the heterologous expression of medically important proteins of W. bancrofti and S. haematobium.
Collapse
|
77
|
García-Montoya GM, Mesa-Arango JA, Isaza-Agudelo JP, Agudelo-Lopez SP, Cabarcas F, Barrera LF, Alzate JF. Transcriptome profiling of the cysticercus stage of the laboratory model Taenia crassiceps, strain ORF. Acta Trop 2016; 154:50-62. [PMID: 26571070 DOI: 10.1016/j.actatropica.2015.11.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Revised: 11/04/2015] [Accepted: 11/05/2015] [Indexed: 11/30/2022]
Abstract
Neurocysticercosis (NC) is a serious public health problem mainly in developing countries. NC caused by the cysticercus stage from cestode Taenia solium is considered by the WHO and ITFDE as a potentially eradicable disease. Definitive diagnosis of NC is challenging because of the unspecific clinical manifestations such as the non-definitive evidence presented by neuroimaging (in most cases) and the lack of definitive serological test. Taenia crassiceps (ORF strain) is a cestode closely related to T. solium and it has frequently been used as a source of antigens for immunodiagnostics. A murine model to study host immune response to infection has also been established by using T. crassiceps. Despite the extensive use of T. crassiceps for research, molecular information for this cestode is scarce in public databases. With the aim of providing more extensive information on T. crassiceps biology, an RNA-seq experiment and subsequent bioinformatic transcriptome processing of this cestode parasite mRNA in its cysticercus stage were carried out. A total of 227,082 read/ESTs were sequenced using the 454-GS FLX Titanium technology and assembled into 10,787 contigs. This transcriptome dataset represents new and valuable molecular information of the cestode T. crassiceps (ORF). This information will substantially improve public information and will help to achieve a better understanding of the biology of T. crassiceps and to identify target proteins for serodiagnosis and vaccination.
Collapse
Affiliation(s)
| | - Jairo A Mesa-Arango
- Grupo de Parasitología, Facultad de Medicina, Universidad de Antioquia, Colombia; Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia, Colombia
| | - Juan P Isaza-Agudelo
- Grupo de Parasitología, Facultad de Medicina, Universidad de Antioquia, Colombia; Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia, Colombia
| | | | - Felipe Cabarcas
- Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia, Colombia; Grupo Sistemas Embebidos e Inteligencia Computacional-SISTEMIC, Departamento de Ingeniería Electrónica, Facultad de Ingeniería, Universidad de Antioquia, Colombia
| | - Luis F Barrera
- Grupo de Inmunología Celular e Inmunogenética, Facultad de Medicina, Universidad de Antioquia-GICIG, Colombia
| | - Juan F Alzate
- Grupo de Parasitología, Facultad de Medicina, Universidad de Antioquia, Colombia; Centro Nacional de Secuenciación Genómica-CNSG, Sede de Investigación Universitaria-SIU, Universidad de Antioquia, Colombia.
| |
Collapse
|
78
|
Analysis of Codon Usage Patterns in Herbaceous Peony (Paeonia lactiflora Pall.) Based on Transcriptome Data. Genes (Basel) 2015; 6:1125-39. [PMID: 26506393 PMCID: PMC4690031 DOI: 10.3390/genes6041125] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Revised: 10/10/2015] [Accepted: 10/13/2015] [Indexed: 01/27/2023] Open
Abstract
Codon usage bias, which exists in many genomes, is mainly determined by mutation and selection. To elucidate the genetic features and evolutionary history of herbaceous peony (Paeonia lactiflora), a well-known symbol of prosperity in China, we examined synonymous codon usage in 24,216 reconstructed genes from the P. lactiflora transcriptome. The mean GC content was 44.4%, indicating that the nucleotide content of P. lactiflora genes is slightly AT rich and GC poor. The P. lactiflora genome has a wide range of GC3 (GC content at the third synonymous codon position) distribution, with a significant correlation between GC12 and GC3. ENC (effective number of codons) analysis suggested that mutational bias played a major role in shaping codon usage. Parity Rule 2 (PR2) analysis revealed that GC and AU were not used proportionally. We identified 22 “optimal codons”, most ending with an A or U. Our results suggested that nucleotide composition mutation bias and translational selection were the main driving factors of codon usage bias in P. lactiflora. These results lay the foundation for exploring the evolutionary mechanisms and heterologous expression of functionally-important proteins in P. lactiflora.
Collapse
|