1
|
Li Y, Hu X, Xiao M, Huang J, Lou Y, Hu F, Fu X, Li Y, He H, Cheng J. An analysis of codon utilization patterns in the chloroplast genomes of three species of Coffea. BMC Genom Data 2023; 24:42. [PMID: 37558997 PMCID: PMC10413492 DOI: 10.1186/s12863-023-01143-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 07/28/2023] [Indexed: 08/11/2023] Open
Abstract
BACKGROUND The chloroplast genome of plants is known for its small size and low mutation and recombination rates, making it a valuable tool in plant phylogeny, molecular evolution, and population genetics studies. Codon usage bias, an important evolutionary feature, provides insights into species evolution, gene function, and the expression of exogenous genes. Coffee, a key crop in the global tropical agricultural economy, trade, and daily life, warrants investigation into its codon usage bias to guide future research, including the selection of efficient heterologous expression systems for coffee genetic transformation. RESULTS Analysis of the codon utilization patterns in the chloroplast genomes of three Coffea species revealed a high degree of similarity among them. All three species exhibited similar base compositions, with high A/T content and low G/C content and a preference for A/T-ending codons. Among the 30 high-frequency codons identified, 96.67% had A/T endings. Fourteen codons were identified as ideal. Multiple mechanisms, including natural selection, were found to influence the codon usage patterns in the three coffee species, as indicated by ENc-GC3s mapping, PR2 analysis, and neutral analysis. Nicotiana tabacum and Saccharomyces cerevisiae have potential value as the heterologous expression host for three species of coffee genes. CONCLUSION This study highlights the remarkable similarity in codon usage patterns among the three coffee genomes, primarily driven by natural selection. Understanding the gene expression characteristics of coffee and elucidating the laws governing its genetic evolution are facilitated by investigating the codon preferences in these species. The findings can enhance the efficacy of exogenous gene expression and serve as a basis for future studies on coffee evolution.
Collapse
Affiliation(s)
- Yaqi Li
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Xiang Hu
- Institute of Tropical Eco-Agricultural, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, China
| | - Mingkun Xiao
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Jiaxiong Huang
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Yuqiang Lou
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Faguang Hu
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Xingfei Fu
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Yanan Li
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Hongyan He
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
- Yunnan Academy of Agricultural Engineering, Kunming, Yunnan, China
| | - Jinhuan Cheng
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China.
- Yunnan Academy of Agricultural Engineering, Kunming, Yunnan, China.
| |
Collapse
|
2
|
Savinkova LK, Sharypova EB, Kolchanov NA. On the Role of TATA Boxes and TATA-Binding Protein in Arabidopsis thaliana. PLANTS (BASEL, SWITZERLAND) 2023; 12:1000. [PMID: 36903861 PMCID: PMC10005294 DOI: 10.3390/plants12051000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 01/13/2023] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
For transcription initiation by RNA polymerase II (Pol II), all eukaryotes require assembly of basal transcription machinery on the core promoter, a region located approximately in the locus spanning a transcription start site (-50; +50 bp). Although Pol II is a complex multi-subunit enzyme conserved among all eukaryotes, it cannot initiate transcription without the participation of many other proteins. Transcription initiation on TATA-containing promoters requires the assembly of the preinitiation complex; this process is triggered by an interaction of TATA-binding protein (TBP, a component of the general transcription factor TFIID (transcription factor II D)) with a TATA box. The interaction of TBP with various TATA boxes in plants, in particular Arabidopsis thaliana, has hardly been investigated, except for a few early studies that addressed the role of a TATA box and substitutions in it in plant transcription systems. This is despite the fact that the interaction of TBP with TATA boxes and their variants can be used to regulate transcription. In this review, we examine the roles of some general transcription factors in the assembly of the basal transcription complex, as well as functions of TATA boxes of the model plant A. thaliana. We review examples showing not only the involvement of TATA boxes in the initiation of transcription machinery assembly but also their indirect participation in plant adaptation to environmental conditions in responses to light and other phenomena. Examples of an influence of the expression levels of A. thaliana TBP1 and TBP2 on morphological traits of the plants are also examined. We summarize available functional data on these two early players that trigger the assembly of transcription machinery. This information will deepen the understanding of the mechanisms underlying transcription by Pol II in plants and will help to utilize the functions of the interaction of TBP with TATA boxes in practice.
Collapse
|
3
|
Tyagi S, Kabade PG, Gnanapragasam N, Singh UM, Gurjar AKS, Rai A, Sinha P, Kumar A, Singh VK. Codon Usage Provide Insights into the Adaptation of Rice Genes under Stress Condition. Int J Mol Sci 2023; 24:ijms24021098. [PMID: 36674611 PMCID: PMC9861248 DOI: 10.3390/ijms24021098] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 12/14/2022] [Accepted: 12/17/2022] [Indexed: 01/09/2023] Open
Abstract
Plants experience different stresses, i.e., abiotic, or biotic, and to combat them, plants re-program the expression of growth-, metabolism-, and resistance-related genes. These genes differ in their synonymous codon usage frequency and show codon usage bias. Here, we investigated the correlation among codon usage bias, gene expression, and underlying mechanisms in rice under abiotic and biotic stress conditions. The results indicated that genes with higher expression (up- or downregulated) levels had high GC content (≥60%), a low effective number of codon usage (≤40), and exhibited strong biases towards the codons with C/G at the third nucleotide position, irrespective of stress received. TTC, ATC, and CTC were the most preferred codons, while TAC, CAC, AAC, GAC, and TGC were moderately preferred under any stress (abiotic or biotic) condition. Additionally, downregulated genes are under mutational pressure (R2 ≥ 0.5) while upregulated genes are under natural selection pressure (R2 ≤ 0.5). Based on these results, we also identified the possible target codons that can be used to design an optimized set of genes with specific codons to develop climate-resilient varieties. Conclusively, under stress, rice has a bias towards codon usage which is correlated with GC content, gene expression level, and gene length.
Collapse
Affiliation(s)
- Swati Tyagi
- International Rice Research Institute-South Asia Regional Centre (ISARC), Varanasi 221106, India
| | | | - Niranjani Gnanapragasam
- International Rice Research Institute (IRRI)-South-Asia Hub, International Crops Research Institute for the Semi-Arid Tropics, Hyderabad 502324, India
| | - Uma Maheshwar Singh
- International Rice Research Institute-South Asia Regional Centre (ISARC), Varanasi 221106, India
| | | | - Ashutosh Rai
- International Rice Research Institute-South Asia Regional Centre (ISARC), Varanasi 221106, India
| | - Pallavi Sinha
- International Rice Research Institute (IRRI)-South-Asia Hub, International Crops Research Institute for the Semi-Arid Tropics, Hyderabad 502324, India
| | - Arvind Kumar
- International Rice Research Institute-South Asia Regional Centre (ISARC), Varanasi 221106, India
| | - Vikas Kumar Singh
- International Rice Research Institute-South Asia Regional Centre (ISARC), Varanasi 221106, India
- International Rice Research Institute (IRRI)-South-Asia Hub, International Crops Research Institute for the Semi-Arid Tropics, Hyderabad 502324, India
- Correspondence:
| |
Collapse
|
4
|
Sahoo S, Rakshit R. The pattern of coding sequences in the chloroplast genome of Atropa belladonna and a comparative analysis with other related genomes in the nightshade family. Genomics Inform 2022; 20:e43. [PMID: 36617650 PMCID: PMC9847383 DOI: 10.5808/gi.22045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/12/2022] [Indexed: 12/31/2022] Open
Abstract
Atropa belladonna is a valuable medicinal plant and a commercial source of tropane alkaloids, which are frequently utilized in therapeutic practice. In this study, bioinformaticmethodologies were used to examine the pattern of coding sequences and the factors thatmight influence codon usage bias in the chloroplast genome of Atropa belladonna andother nightshade genomes. The chloroplast engineering being a promising field in modernbiotechnology, the characterization of chloroplast genome is very important. The resultsrevealed that the chloroplast genomes of Nicotiana tabacum, Solanum lycopersicum, Capsicum frutescens, Datura stramonium, Lyciumbarbarum, Solanum melongena, and Solanumtuberosum exhibited comparable codon usage patterns. In these chloroplast genomes, weobserved a weak codon usage bias. According to the correspondence analysis, the genesisof the codon use bias in these chloroplast genes might be explained by natural selection,directed mutational pressure, and other factors. GC12 and GC3S were shown to have nomeaningful relationship. Further research revealed that natural selection primarily shapedthe codon usage in A. belladonna and other nightshade genomes for translational efficiency. The sequencing properties of these chloroplast genomes were also investigated by investing the occurrences of palindromes and inverted repeats, which would be useful forfuture research on medicinal plants.
Collapse
Affiliation(s)
- Satyabrata Sahoo
- Department of Physics, Dhruba Chand Halder College, Dakshin Barasat 743372, India,*Corresponding author E-mail:
| | - Ria Rakshit
- Department of Botany, Baruipur College, Baruipur 743610, India
| |
Collapse
|
5
|
Tyagi A, Nagar V. Genome dynamics, codon usage patterns and influencing factors in Aeromonas hydrophila phages. Virus Res 2022; 320:198900. [PMID: 36029927 DOI: 10.1016/j.virusres.2022.198900] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 08/22/2022] [Accepted: 08/23/2022] [Indexed: 11/29/2022]
Abstract
In the present study, genome characteristics and codon usage patterns of 44 Aeromonas hydrophila phages were studied. Phage genomes varied from 30.8 to 262.0 kb with mean±SD and median values of 111.3 ± 81.4 kb and 79.4 kb, respectively. Though the great variation in phage GC contents (35.1-62.2%) was observed, GC contents of all phages (except two phages) were significantly less than the GC content (62.4 ± 5.6%) of the host. The effective number of codons (ENC) values of phage genes ranged from 27.7 to 61 with a mean±SD value of 47.4 ± 6.8. Out of a total 5773 phage genes, 207 (3.6%), 3,528 (61.1%) and 2,012 (34.9%) genes had strong (ENC < 35), moderate (35 < ENC < 50) and low (ENC ≥ 50) codon usage bias, respectively. During relative synonymous codon usage (RSCU) analysis, shared usage of preferred codons was also observed between the phages and host. During codon adaptation index (CAI) analysis, 1028 (17.8%) phage genes showed significant adaptation towards the host. Among these genes, 797 (78.0%) genes encoded hypothetical proteins or proteins of unknown function; whereas 118 (12%) genes encoded the phage structural and packaging proteins. Segregation of ENC, RSCU and CAI analysis results based on genome size also indicated that codon usage bias was more prominent in phages with small genomes. Correlation, neutrality and GC3 versus ENC analyzes indicated a more dominant role of natural selection in shaping the codon usage patterns of A. hydrophila phages. The findings of the current study could be useful from evolutionary and host-pathogen interaction perspectives leading to efficient utilization of phages for therapeutic and other applications.
Collapse
Affiliation(s)
- Anuj Tyagi
- Department of Aquatic Environment, College of Fisheries, Guru Angad Dev Veterinary and Animal Sciences University, Ludhiana, Punjab 141004, India.
| | - Vandan Nagar
- Food Microbiology Group, Food Technology Division, Bhabha Atomic Research Centre, Trombay, Mumbai, Maharashtra 400085, India
| |
Collapse
|
6
|
Sophiarani Y, Chakraborty S. Comparison of compositional constraints: Nuclear genome vs plasmid genome of Pseudomonas syringae pv. tomato DC3000. J Biosci 2022. [DOI: 10.1007/s12038-022-00296-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
7
|
Khandia R, Saeed M, Alharbi AM, Ashraf GM, Greig NH, Kamal MA. Codon Usage Bias Correlates With Gene Length in Neurodegeneration Associated Genes. Front Neurosci 2022; 16:895607. [PMID: 35860292 PMCID: PMC9289476 DOI: 10.3389/fnins.2022.895607] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/08/2022] [Indexed: 11/13/2022] Open
Abstract
Codon usage analysis is a crucial part of molecular characterization and is used to determine the factors affecting the evolution of a gene. The length of a gene is an important parameter that affects the characteristics of the gene, such as codon usage, compositional parameters, and sometimes, its functions. In the present study, we investigated the association of various parameters related to codon usage with the length of genes. Gene expression is affected by nucleotide disproportion. In sixty genes related to neurodegenerative disorders, the G nucleotide was the most abundant and the T nucleotide was the least. The nucleotide T exhibited a significant association with the length of the gene at both the overall compositional level and the first and second codon positions. Codon usage bias (CUB) of these genes was affected by pyrimidine and keto skews. Gene length was found to be significantly correlated with codon bias in neurodegeneration associated genes. In gene segments with lengths below 1,200 bp and above 2,400 bp, CUB was positively associated with length. Relative synonymous CUB, which is another measure of CUB, showed that codons TTA, GTT, GTC, TCA, GGT, and GGA exhibited a positive association with length, whereas codons GTA, AGC, CGT, CGA, and GGG showed a negative association. GC-ending codons were preferred over AT-ending codons. Overall analysis indicated that the association between CUB and length varies depending on the segment size; however, CUB of 1,200–2,000 bp gene segments appeared not affected by gene length. In synopsis, analysis suggests that length of the genes correlates with various imperative molecular signatures including A/T nucleotide disproportion and codon choices. In the present study we additionally evaluated various molecular features and their correlation with different indices of codon usage, like the Codon Adaptation Index (CAI) and Relative Dynonymous Codon Usage (RSCU) of codons. We also considered the impact of gene fragment size on different molecular features in genes related to neurodegeneration. This analysis will aid our understanding of and in potentially modulating gene expression in cases of defective gene functioning in clinical settings.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
- *Correspondence: Rekha Khandia, ;
| | - Mohd. Saeed
- Department of Biology, College of Sciences, University of Hail, Hail, Saudi Arabia
| | - Ahmed M. Alharbi
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, University of Hail, Hail, Saudi Arabia
| | - Ghulam Md. Ashraf
- Pre-clinical Research Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Nigel H. Greig
- Drug Design and Development Section, Translational Gerontology Branch, Intramural Research Program National Institute on Aging, NIH, Baltimore, MD, United States
| | - Mohammad Amjad Kamal
- Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Dhaka, Bangladesh
- Enzymoics, Novel Global Community Educational Foundation, Hebersham, NSW, Australia
| |
Collapse
|
8
|
Wang Z, Cai Q, Wang Y, Li M, Wang C, Wang Z, Jiao C, Xu C, Wang H, Zhang Z. Comparative Analysis of Codon Bias in the Chloroplast Genomes of Theaceae Species. Front Genet 2022; 13:824610. [PMID: 35360853 PMCID: PMC8961065 DOI: 10.3389/fgene.2022.824610] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Theaceae species are dicotyledonous angiosperms with extremely high ornamental and economic value. The chloroplast genome is traditionally used to study species evolution, expression of chloroplast genes and chloroplast transformation. Codon usage bias (CUB) analysis is beneficial for investigations of evolutionary relationships and can be used to improve gene expression efficiency in genetic transformation research. However, there are relatively few systematic studies of the CUB in the chloroplast genomes of Theaceae species. In this study, CUB and nucleotide compositions parameters were determined by the scripts written in the Perl language, CodonW 1.4.2, CU.Win2000, RStudio and SPSS 23.0. The chloroplast genome data of 40 Theaceae species were obtained to analyse the codon usage (CU) characteristics of the coding regions and the influence of the source of variation on CUB. To explore the relationship between the CUB and gene expression levels in these 40 Theaceae plastomes, the synonymous codon usage order (SCUO) and measure independent of length and composition (MILC) values were determined. Finally, phylogenetic analysis revealed the genetic evolutionary relationships among these Theaceae species. Our results showed that based on the chloroplast genomes of these 40 Theaceae species, the CUB was for codons containing A/T bases and those that ended with A/T bases. Moreover, there was great commonality in the CUB of the Theaceae species according to comparative analysis of relative synonymous codon usage (RSCU) and relative frequency of synonymous codon (RFSC): these species had 29 identical codons with bias (RSCU > 1), and there were 19 identical high-frequency codons. The CUB of Theaceae species is mainly affected by natural selection. The SCUO value of the 40 Theaceae species was 0.23 or 0.24, and the chloroplast gene expression level was moderate, according to MILC values. Additionally, we observed a positive correlation between the SCUO and MILC values, which indicated that CUB might affect gene expression. Furthermore, the phylogenetic analysis showed that the evolutionary relationships in these 40 Theaceae species were relatively conserved. A systematic study on the CUB and expression of Theaceae species provides further evidence for their evolution and phylogeny.
Collapse
Affiliation(s)
- Zhanjun Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
| | - Qianwen Cai
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Yue Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Minhui Li
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Chenchen Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Zhaoxia Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Chunyan Jiao
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Congcong Xu
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Hongyan Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Zhaoliang Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
- *Correspondence: Zhaoliang Zhang,
| |
Collapse
|
9
|
Andargie M, Congyi Z. Genome-wide analysis of codon usage in sesame ( Sesamum indicum L.). Heliyon 2022; 8:e08687. [PMID: 35106386 PMCID: PMC8789531 DOI: 10.1016/j.heliyon.2021.e08687] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 11/20/2021] [Accepted: 12/24/2021] [Indexed: 10/28/2022] Open
Abstract
Sesamum indicum is an ancient oil crop grown in tropical and subtropical areas of the world. We have analyzed 23,538 coding sequences (CDS) of S. indicum to understand the factors shaping codon usage in this important oil crop plant. We identified eleven highly preferred codons in S. indicum that have AT-endings. The slope of a neutrality plot was less than one while effective number of codons (ENC) plot showed distribution above and below the standard curve. There is a significant relationship between protein length and relative synonymous codon usage (RSCU) at the primary axis while there is a weak correlation between protein length and Nc values. Correspondence analysis conducted on RSCU values differentiated CDS based on their GC content and their characteristic feature and showed a discrete distribution. Moreover, by determining codon usage, we found out that majority of the lignan biosynthesis related genes showed a weaker codon usage bias. These results provide insights into understanding codon evolution in sesame.
Collapse
Affiliation(s)
- Mebeaselassie Andargie
- University of Goettingen, Molecular Phytopathology and Mycotoxin Research, Grisebachstrasse 6, 37077 Goettingen, Germany
| | - Zhu Congyi
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization (MOA), Guangdong Province Key Laboratory of Tropical and Subtropical Fruit Tree Research, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| |
Collapse
|
10
|
Abstract
Codon usage bias is the preferential or non-random use of synonymous codons, a ubiquitous phenomenon observed in bacteria, plants and animals. Different species have consistent and characteristic codon biases. Codon bias varies not only with species, family or group within kingdom, but also between the genes within an organism. Codon usage bias has evolved through mutation, natural selection, and genetic drift in various organisms. Genome composition, GC content, expression level and length of genes, position and context of codons in the genes, recombination rates, mRNA folding, and tRNA abundance and interactions are some factors influencing codon bias. The factors shaping codon bias may also be involved in evolution of the universal genetic code. Codon-usage bias is critical factor determining gene expression and cellular function by influencing diverse processes such as RNA processing, protein translation and protein folding. Codon usage bias reflects the origin, mutation patterns and evolution of the species or genes. Investigations of codon bias patterns in genomes can reveal phylogenetic relationships between organisms, horizontal gene transfers, molecular evolution of genes and identify selective forces that drive their evolution. Most important application of codon bias analysis is in the design of transgenes, to increase gene expression levels through codon optimization, for development of transgenic crops. The review gives an overview of deviations of genetic code, factors influencing codon usage or bias, codon usage bias of nuclear and organellar genes, computational methods to determine codon usage and the significance as well as applications of codon usage analysis in biological research, with emphasis on plants.
Collapse
Affiliation(s)
| | - Varatharajalu Udayasuriyan
- Department of Biotechnology, Centre for Plant Molecular Biology and Biotechnology, Tamil Nadu Agricultural University, Coimbatore, 641003, India
| | - Vijaipal Bhadana
- ICAR-Indian Institute of Agricultural Biotechnology, Ranchi, Jharkhand, 834010, India
| |
Collapse
|
11
|
Mazumder TH, Alqahtani AM, Alqahtani T, Emran TB, A. Aldahish A, Uddin A. Analysis of Codon Usage of Speech Gene FoxP2 among Animals. BIOLOGY 2021; 10:biology10111078. [PMID: 34827071 PMCID: PMC8614651 DOI: 10.3390/biology10111078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 10/12/2021] [Accepted: 10/16/2021] [Indexed: 12/03/2022]
Abstract
Simple Summary We evaluated codon usage bias in the FoxP2 gene in fishes, birds, reptiles, and mammals. Fishes use C or G—ending codons, while birds, reptiles, and mammals employ T or A—ending codons. Apart from the nucleotide composition, natural selection and mutation pressure might influence the CUB. The ENC observed/ENC expected ratio demonstrated that mutation pressure influences FoxP2 codon usage patterns. Natural selection may have had a key influence in shaping the CUB, although mutation pressure may have played a minor role. FoxP2 gene codon usage is affected by the base composition under mutation bias. Abstract The protein-coding gene FoxP2 (fork head box protein P2) plays a major role in communication and evolutionary changes. The present study carried out a comprehensive codon usage bias analysis in the FoxP2 gene among a diverse group of animals including fishes, birds, reptiles, and mammals. We observed that in the genome of fishes for the FoxP2 gene, codons ending with C or G were most frequently used, while in birds, reptiles, and mammals, codons ending with T or A were most frequently used. A higher ENC value was observed for the FoxP2 gene indicating a lower CUB. Parity role two-bias plots suggested that apart from mutation pressure, other factors such as natural selection might have influenced the CUB. The frequency distribution of the ENC observed and ENC expected ratio revealed that mutation pressure plays a key role in the patterns of codon usage of FoxP2. Besides, correspondence analysis exposed the composition of the nucleobase under mutation bias affects the codon usage of the FoxP2 gene. However, neutrality plots revealed the major role of natural selection over mutation pressure in the CUB of FoxP2. In addition, the codon usage patterns for FoxP2 among the selected genomes suggested that nature has favored nearly all the synonymous codons for encoding the corresponding amino acid. The uniform usage of 12 synonymous codons for FoxP2 was observed among the species of birds. The amino acid usage frequency for FoxP2 revealed that the amino acids Leucine, Glutamine, and Serine were predominant over other amino acids among all the species of fishes, birds, reptiles, and mammals.
Collapse
Affiliation(s)
| | - Ali M. Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia; (A.M.A.); (T.A.); (A.A.A.)
| | - Taha Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia; (A.M.A.); (T.A.); (A.A.A.)
| | - Talha Bin Emran
- Department of Pharmacy, BGC Trust University Bangladesh, Chittagong 4381, Bangladesh;
| | - Afaf A. Aldahish
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia; (A.M.A.); (T.A.); (A.A.A.)
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial College, Hailakandi 788150, Assam, India
- Correspondence:
| |
Collapse
|
12
|
Wang J, Lin Y, Xi M. Analysis of Codon Usage Patterns of Six Sequenced Brachypodium distachyon Lines Reveals a Declining CG Skew of the CDSs from the 5'-ends to the 3'-ends. Genes (Basel) 2021; 12:1467. [PMID: 34680862 PMCID: PMC8535453 DOI: 10.3390/genes12101467] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/09/2021] [Accepted: 09/20/2021] [Indexed: 02/01/2023] Open
Abstract
Brachypodium distachyon, a new monocotyledonous model plant, has received wide attention in biological research due to its small genome and numerous genetic resources. Codon usage bias is an important feature of genes and genomes, and it can be used in transgenic and evolutionary studies. In this study, the nucleotide compositions and patterns of codon usage bias were calculated using Codon W. Additionally, an ENC plot, Parity rule 2 and correspondence analyses were used to explore the major factors influencing codon usage bias patterns. The numbers of hydrogen bonds and skews were used to analyze the GC trend in the 5'-ends of the coding sequences. The results showed that minor differences in the codon usage bias patterns were revealed by the ENC plot, Parity rule 2 and correspondence analyses. The analyses of the CG-skew and the number of hydrogen bonds showed a declining trend in the number of cytosines at the 5'-ends of the CDSs (from the 5'-ends to the 3'-ends), indicating that GC may play a major role in codon usage bias. In addition, our results laid a foundation for the study of codon usage bias patterns in Brachypodium genus and suggested that the GC plays a major role in determining these patterns.
Collapse
Affiliation(s)
- Jianyong Wang
- Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
| | - Yujing Lin
- Shanghai Center for Plant Stress Biology and Center for Excellence in Molecular Plant Sciences, University of Chinese Academy of Sciences, Shanghai 200032, China;
| | - Mengli Xi
- Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
| |
Collapse
|
13
|
Yang C, Zhao Q, Wang Y, Zhao J, Qiao L, Wu B, Yan S, Zheng J, Zheng X. Comparative Analysis of Genomic and Transcriptome Sequences Reveals Divergent Patterns of Codon Bias in Wheat and Its Ancestor Species. Front Genet 2021; 12:732432. [PMID: 34490050 PMCID: PMC8417831 DOI: 10.3389/fgene.2021.732432] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 07/29/2021] [Indexed: 11/29/2022] Open
Abstract
The synonymous codons usage shows a characteristic pattern of preference in each organism. This codon usage bias is thought to have evolved for efficient protein synthesis. Synonymous codon usage was studied in genes of the hexaploid wheat Triticum aestivum (AABBDD) and its progenitor species, Triticum urartu (AA), Aegilops tauschii (DD), and Triticum turgidum (AABB). Triticum aestivum exhibited stronger usage bias for G/C-ending codons than did the three progenitor species, and this bias was especially higher compared to T. turgidum and Ae. tauschii. High GC content is a primary factor influencing codon usage in T. aestivum. Neutrality analysis showed a significant positive correlation (p<0.001) between GC12 and GC3 in the four species with regression line slopes near zero (0.16–0.20), suggesting that the effect of mutation on codon usage was only 16–20%. The GC3s values of genes were associated with gene length and distribution density within chromosomes. tRNA abundance data indicated that codon preference corresponded to the relative abundance of isoaccepting tRNAs in the four species. Both mutation and selection have affected synonymous codon usage in hexaploid wheat and its progenitor species. GO enrichment showed that GC biased genes were commonly enriched in physiological processes such as photosynthesis and response to acid chemical. In some certain gene families with important functions, the codon usage of small parts of genes has changed during the evolution process of T. aestivum.
Collapse
Affiliation(s)
- Chenkang Yang
- School of Life Science, Shanxi University, Taiyuan, China
| | - Qi Zhao
- School of Life Science, Shanxi University, Taiyuan, China
| | - Ying Wang
- School of Life Science, Shanxi University, Taiyuan, China
| | - Jiajia Zhao
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Ling Qiao
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Bangbang Wu
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Suxian Yan
- State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Jun Zheng
- School of Life Science, Shanxi University, Taiyuan, China.,State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| | - Xingwei Zheng
- School of Life Science, Shanxi University, Taiyuan, China.,State Key Laboratory of Sustainable Dryland Agriculture, Institute of Wheat Research, Shanxi Agricultural University, Linfen, China
| |
Collapse
|
14
|
Deb B, Uddin A, Chakraborty S. Analysis of codon usage of Horseshoe Bat Hepatitis B virus and its host. Virology 2021; 561:69-79. [PMID: 34171764 DOI: 10.1016/j.virol.2021.05.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 05/07/2021] [Accepted: 05/19/2021] [Indexed: 11/28/2022]
Abstract
In the present analysis, codon usage strategies and base distribution of Horseshoe bat hepatitis B virus (HBHBV) were analyzed and compared with its host Rhinolophus sinicus, as no work was yet reported. The magnitude of synonymous codon usage bias (CUB) in the virus and its host was low with higher proportion of the base C. Notably, 21 more frequently used codons, 19 less frequently used codons and 3 underrepresented codons (TCG, ACG and GCG) were found to be similar in both virus and its host coding sequences. Neutrality plot analysis reported greater role of natural selection in HBHBV (67.84%) and R. sinicus (76.90%) over mutation pressure. Base skewness and protein properties also influenced the CUB of genes. Further, codon usage analysis depicted, HBHBV and R. sinicus had many similarities in codon usage patterns that might reflect viral adaptation to its host.
Collapse
Affiliation(s)
- Bornali Deb
- Department of Biotechnology, Assam University, Silchar, 788150, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi, 788150, Assam, India
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, 788150, Assam, India.
| |
Collapse
|
15
|
|
16
|
Duan H, Zhang Q, Wang C, Li F, Tian F, Lu Y, Hu Y, Yang H, Cui G. Analysis of codon usage patterns of the chloroplast genome in Delphinium grandiflorum L. reveals a preference for AT-ending codons as a result of major selection constraints. PeerJ 2021; 9:e10787. [PMID: 33552742 PMCID: PMC7819120 DOI: 10.7717/peerj.10787] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 12/24/2020] [Indexed: 01/28/2023] Open
Abstract
Background Codon usage bias analysis is a suitable strategy for identifying the principal evolutionary driving forces in different organisms. Delphinium grandiflorum L. is a perennial herb with high economic value and typical biological characteristics. Evolutionary analysis of D. grandiflorum can provide a rich resource of genetic information for developing hybridization resources of the genus Delphinium. Methods Synonymous codon usage (SCU) and related indices of 51 coding sequences from the D. grandiflorum chloroplast (cp) genome were calculated using Codon W, Cups of EMBOSS, SPSS and Microsoft Excel. Multivariate statistical analysis combined by principal component analysis (PCA), correspondence analysis (COA), PR2-plot mapping analysis and ENC plot analysis was then conducted to explore the factors affecting the usage of synonymous codons. Results The SCU bias of D. grandiflorum was weak and codons preferred A/T ending. A SCU imbalance between A/T and G/C at the third base position was revealed by PR2-plot mapping analysis. A total of eight codons were identified as the optimal codons. The PCA and COA results indicated that base composition (GC content, GC3 content) and gene expression were important for SCU bias. A majority of genes were distributed below the expected curve from the ENC plot analysis and up the standard curve by neutrality plot analysis. Our results showed that with the exception of notable mutation pressure effects, the majority of genetic evolution in the D. grandiflorum cp genome might be driven by natural selection. Discussions Our results provide a theoretical foundation for elucidating the genetic architecture and mechanisms of D. grandiflorum, and contribute to enriching D. grandiflorum genetic resources.
Collapse
Affiliation(s)
- Huirong Duan
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Qian Zhang
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Chunmei Wang
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Fang Li
- Institute of Grassland Science, Chinese Academy of Agricultural Sciences, Hohhot, China
| | - Fuping Tian
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Yuan Lu
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Yu Hu
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Hongshan Yang
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| | - Guangxin Cui
- Chinese Academy of Agricultural Sciences Lanzhou Institute of Husbandry and Pharmaceutical Science, Lanzhou, China
| |
Collapse
|
17
|
Uddin A, Chakraborty S. Analysis of mitochondrial protein-coding genes of Antheraea assamensis: Muga silkworm of Assam. ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY 2021; 106:e21750. [PMID: 33075174 DOI: 10.1002/arch.21750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 09/18/2020] [Accepted: 09/30/2020] [Indexed: 06/11/2023]
Abstract
To understand the synonymous codon usage pattern in mitochondrial genome of Antheraea assamensis, we analyzed the 13 mitochondrial protein-coding genes of this species using a bioinformatic approach as no work was reported yet. The nucleotide composition analysis suggested that the percentages of A, T, G,and C were 33.73, 46.39, 9.7 and 10.17, respectively and the overall GC content was 19.86, that is, lower than 50% and the genes were AT rich. The mean effective number of codons of mitochondrial protein-coding genes was 36.30 and it indicated low codon usage bias (CUB). Relative synonymous codon usage analysis suggested overrepresented and underrepresented codons in each gene and the pattern of codon usage was different among genes. Neutrality plot analysis revealed a narrow range of distribution for GC content at the third codon position and some points were diagonally distributed, suggesting both mutation pressure and natural selection influenced the CUB.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Assam, India
| | | |
Collapse
|
18
|
Li Y, Yang X, Wang N, Wang H, Yin B, Yang X, Jiang W. GC usage of SARS-CoV-2 genes might adapt to the environment of human lung expressed genes. Mol Genet Genomics 2020; 295:1537-1546. [PMID: 32888056 PMCID: PMC7473593 DOI: 10.1007/s00438-020-01719-0] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 08/20/2020] [Indexed: 12/13/2022]
Abstract
Understanding how SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2) efficiently reproduces itself by taking resources from the human host could facilitate the development of drugs against the virus. SARS-CoV-2 translates its own proteins by using the host tRNAs, so that its GC or codon usage should fit that of the host cells. It is necessary to study both the virus and human genomes in the light of evolution and adaptation. The SARS-CoV-2 virus has significantly lower GC content and GC3 as compared to human. However, when we selected a set of human genes that have similar GC properties to SARS-CoV-2, we found that these genes were enriched in particular pathways. Moreover, these human genes have the codon composition perfectly correlated with the SARS-CoV-2, and were extraordinarily highly expressed in human lung tissues, demonstrating that the SARS-CoV-2 genes have similar GC usage as compared to the lung expressed human genes. RSCU (relative synonymous codon usage) and CAI (codon adaptation index) profiles further support the matching between SARS-CoV-2 and lungs. Our study indicates that SARS-CoV-2 might have adapted to the human lung environment by observing the high correlation between GC usage of SARS-CoV-2 and human lung genes, which suggests the GC content of SARS-CoV-2 is optimized to take advantage of human lung tissues.
Collapse
Affiliation(s)
- Yue Li
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China
| | - Xinai Yang
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China
| | - Na Wang
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China
| | - Haiyan Wang
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China
| | - Bin Yin
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China
| | - Xiaoping Yang
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China
| | - Wenqing Jiang
- Department of Respiratory Diseases, Qingdao Haici Hospital, Qingdao, China.
| |
Collapse
|
19
|
Chamani Mohasses F, Solouki M, Ghareyazie B, Fahmideh L, Mohsenpour M. Correlation between gene expression levels under drought stress and synonymous codon usage in rice plant by in-silico study. PLoS One 2020; 15:e0237334. [PMID: 32776991 PMCID: PMC7416939 DOI: 10.1371/journal.pone.0237334] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Accepted: 07/23/2020] [Indexed: 11/24/2022] Open
Abstract
We studied the correlation of synonymous codon usage (SCU) on gene expression levels under drought stress in rice. Sixty genes related to drought stress (with high, intermediate and low expression) were selected from rice meta-analysis data and various codon usage indices such as the effective number of codon usage (ENC), codon adaptation index (CAI) and relative synonymous codon usage (RSCU) were calculated. We found that in genes highly expressing under drought 1) GC content was higher, 2) ENC value was lower, 3) the preferred codons of some amino acids changed and 4) the RSCU ratio of GC-end codons relative to AT-end codons for 18 amino acids increased significantly compared with those in other genes. We introduce ARSCU as the Average ratio of RSCUs of GC-end codons to AT-end codons in each gene that could significantly separate high-expression genes under drought from low-expression genes. ARSCU is calculated using the program ARSCU-Calculator developed by our group to help predicting expression level of rice genes under drought. An index above ARSCU threshold is expected to indicate that the gene under study may belong to the "high expression group under drought". This information may be applied for codon optimization of genes for rice genetic engineering. To validate these findings, we further used 60 other genes (randomly selected subset of 43233 genes studied for their response to drought stress). ARSCU value was able to predict the level of expression at 88.33% of the cases. Using third set of 60 genes selected amongst high expressing genes not related to drought, only 31.65% of the genes showed ARSCU value of higher than the set threshold. This indicates that the phenomenon we described in this report may be unique for drought related genes. To justify the observed correlation between CUB and high expressing genes under drought, possible role of tRNA post transcriptional modification and tRFs was hypothesized as possible underlying biological mechanism.
Collapse
Affiliation(s)
- Fatemeh Chamani Mohasses
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
| | - Mahmood Solouki
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
| | - Behzad Ghareyazie
- Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO), Karaj, Iran
| | - Leila Fahmideh
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
| | - Motahhareh Mohsenpour
- Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO), Karaj, Iran
| |
Collapse
|
20
|
Chu D, Wei L. Genome-wide analysis on the maize genome reveals weak selection on synonymous mutations. BMC Genomics 2020; 21:333. [PMID: 32349669 PMCID: PMC7190201 DOI: 10.1186/s12864-020-6745-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 04/21/2020] [Indexed: 02/06/2023] Open
Abstract
Background Synonymous mutations are able to change the tAI (tRNA adaptation index) of a codon and consequently affect the local translation rate. Intuitively, one may hypothesize that those synonymous mutations which increase the tAI values are favored by natural selection. Results We use the maize (Zea mays) genome to test our assumption. The first supporting evidence is that the tAI-increasing synonymous mutations have higher fixed-to-polymorphic ratios than the tAI-decreasing ones. Next, the DAF (derived allele frequency) or MAF (minor allele frequency) of the former is significantly higher than the latter. Moreover, similar results are obtained when we investigate CAI (codon adaptation index) instead of tAI. Conclusion The synonymous mutations in the maize genome are not strictly neutral. The tAI-increasing mutations are positively selected while those tAI-decreasing ones undergo purifying selection. This selection force might be weak but should not be automatically ignored.
Collapse
Affiliation(s)
- Duan Chu
- College of Life Sciences, Beijing Normal University, No. 19 Xinjiekouwai Street, Haidian District, Beijing, China
| | - Lai Wei
- College of Life Sciences, Beijing Normal University, No. 19 Xinjiekouwai Street, Haidian District, Beijing, China.
| |
Collapse
|
21
|
Majeed A, Kaur H, Bhardwaj P. Selection constraints determine preference for A/U-ending codons in Taxus contorta. Genome 2020; 63:215-224. [PMID: 31986060 DOI: 10.1139/gen-2019-0165] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Unequal utilization of synonymous codons is a well-known phenomenon among living organisms. This phenomenon plays a major role in the enhancement of the accuracy and efficiency of translation. Gymnosperms are rarely paid attention in this aspect. Understanding the degree of and determining the forces influencing codon usage bias (CUB) in Taxus contorta, an endangered Himalayan gymnosperm, will prove useful in interpreting the evolutionary characteristics of this species. Using RNAseq data, 93 790 assembled transcripts were clustered into 32 701 unigenes. Around 13 061 full-length sequences were utilized for the analysis of CUB. Compositional properties showed that GC-content ranged from 28.76% to 65.22%, with an average value of 44.28%, suggesting an AT-rich genome. The mean effective number of codons (ENC) value revealed that CUB is not strong in T. contorta. The preferred codons tended to be A/U ending, whereas the avoided codons tended to be G/C ending. A P2 index of 0.54 and a Mutation Responsive Index (MRI) value of -0.02 in addition to the results revealed by the neutrality, ENC, and parity plots showed that natural selection is a predominating factor governing CUB. Mutational pressure, gene length, hydropathiciy, aromaticity, and nucleotide composition influence CUB weakly.
Collapse
Affiliation(s)
- Aasim Majeed
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India.,Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India
| | - Harpreet Kaur
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India.,Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India
| | - Pankaj Bhardwaj
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India.,Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India
| |
Collapse
|
22
|
|
23
|
De Mandal S, Mazumder TH, Panda AK, Kumar NS, Jin F. Analysis of synonymous codon usage patterns of HPRT1 gene across twelve mammalian species. Genomics 2020; 112:304-311. [DOI: 10.1016/j.ygeno.2019.02.010] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2018] [Revised: 02/05/2019] [Accepted: 02/14/2019] [Indexed: 01/01/2023]
|
24
|
Liu H, Lu Y, Lan B, Xu J. Codon usage by chloroplast gene is bias in Hemiptelea davidii. J Genet 2020; 99:8. [PMID: 32089527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The base composition of the chloroplast genes is of great interest because they play a highly significant role in the evolutionary development of the plants. Evaluation of the 48 chloroplast protein-coding genes of Hemiptelea davidii showed that the average GC content was about 37.32%, while at the third codon base position alone the average GC content was only 27.80%. The 48 genes were classified into five groups based on the gene function and each group displayed specific codon characteristics. Based on the relative synonymous codon usage analysis, a total of 30 high-frequency codons and 11 optimal codons were identified, most of them ended with A or T. Neutrality plot, ENC-plot and PR2-plot analyses showed that the codon usage bias of the chloroplast genes of H. davidii was greatly influenced by natural selection pressures. Meanwhile, the frequency of codon usage of chloroplast genes among different plant species displayed similarities, with some synonymous codons were preferred to be used in H. davidii. In this study, the codon usage pattern of the chloroplast protein coding genes of H. davidii provides us with a better understanding of the expression of chloroplast genes, and may advice the future molecular breeding programmes.
Collapse
Affiliation(s)
- Huabo Liu
- National Engineering Laboratory for Tree Breeding, Beijing Forestry University, Beijing 100083, People's Republic of China.
| | | | | | | |
Collapse
|
25
|
Nariyampet SA, Hajamohideen AJA. A study on codon usage bias in cytochrome c oxidase I (COI) gene of solitary ascidian Herdmania momus Savigny, 1816. GENE REPORTS 2019. [DOI: 10.1016/j.genrep.2019.100523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
26
|
Priya R, Sneha P, Dass JFP, Doss C GP, Manickavasagam M, Siva R. Exploring the codon patterns between CCD and NCED genes among different plant species. Comput Biol Med 2019; 114:103449. [DOI: 10.1016/j.compbiomed.2019.103449] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2019] [Revised: 09/13/2019] [Accepted: 09/13/2019] [Indexed: 01/16/2023]
|
27
|
Mazumder TH, Uddin A, Chakraborty S. Insights into the nucleotide composition and codon usage pattern of human tumor suppressor genes. Mol Carcinog 2019; 59:15-23. [PMID: 31583785 DOI: 10.1002/mc.23124] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Revised: 09/05/2019] [Accepted: 09/21/2019] [Indexed: 01/21/2023]
Abstract
Tumor suppressor genes encode different proteins that inhibit the uncontrolled proliferation of cell growth and tumor development. To acquire clues for predicting gene expression level, it is essential to understand the codon usage bias (CUB) of genes to characterize genome which possesses its own compositional characteristics and unique coding sequences. We used bioinformatic tools to analyze the codon usage patterns of 637 human tumor suppressor genes as no work was reported earlier. The mean effective number of codons of these genes was 48, indicating low CUB. Our results exhibited a significant positive correlation among different nucleotide compositions and the codons ending with C base was most frequently used along with the most over-represented codon CTG and GTG codifying leucine and valine amino acid, respectively, in human tumor suppressor genes. The neutrality plot showed a significant positive correlation (Pearson, r = 0. 646; P < .01) suggesting that mutation on GC bias might affect the CUB. However, the linear regression coefficient of GC12 on GC3 in human tumor suppressor genes suggested that natural selection played a major role while mutation pressure played a minor role in the codon usage patterns of tumor suppressor genes in human. Our study would throw light into the factors that affect CUB and the codon usage patterns in the human tumor suppressor genes.
Collapse
Affiliation(s)
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Hailakandi, Assam, India
| | | |
Collapse
|
28
|
Uddin A, Paul N, Chakraborty S. The codon usage pattern of genes involved in ovarian cancer. Ann N Y Acad Sci 2019; 1440:67-78. [PMID: 30843242 DOI: 10.1111/nyas.14019] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 01/04/2019] [Accepted: 01/14/2019] [Indexed: 12/20/2022]
Abstract
In this study, we analyzed the compositional dynamics and codon usage pattern of genes involved in ovarian cancer (OC) using a computational method. Mutations in specific genes are associated with OC, and some genes are risk factors for progression of OC, but no work has been reported yet on the codon usage pattern of genes involved in OC. Nucleotide composition analysis of OC-related genes suggested that the overall GC content was higher than AT content; that is, the genes were GC rich. The improved effective number of codons indicated that the overall extent of codon usage bias of genes involved in OC was low. The codons AGC, CTG, ATC, ACC, GTG, and GCC were overrepresented, while the codons TCG, TTA, CTA, CCG, CAA, CGT, ATA, ACG, GTA, GTT, GCG, and GGT were underrepresented in the genes. Correspondence analysis suggested that the codon usage pattern was different in different genes. A highly significant correlation was observed between GC12 and GC3 (r = 0.587, P < 0.01) of genes, suggesting that directional mutation affected the three codon positions. Our report on the codon usage pattern of genes involved in OC includes a new perspective for elucidating the mechanisms of biased usage of synonymous codons, as well as providing useful clues for molecular genetic engineering.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Assam, India
| | - Nirmal Paul
- Department of Biotechnology, Assam University, Assam, India
| | | |
Collapse
|
29
|
Compositional dynamics and codon usage pattern of BRCA1 gene across nine mammalian species. Genomics 2019; 111:167-176. [DOI: 10.1016/j.ygeno.2018.01.013] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Revised: 12/22/2017] [Accepted: 01/22/2018] [Indexed: 11/19/2022]
|
30
|
Chakraborty S, Uddin A, Mazumder TH, Choudhury MN, Malakar AK, Paul P, Halder B, Deka H, Mazumder GA, Barbhuiya RA, Barbhuiya MA, Devi WJ. Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents. Mitochondrion 2018; 42:64-76. [DOI: 10.1016/j.mito.2017.11.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 10/09/2017] [Accepted: 11/27/2017] [Indexed: 02/03/2023]
|
31
|
Mazumdar P, Binti Othman R, Mebus K, Ramakrishnan N, Ann Harikrishna J. Codon usage and codon pair patterns in non-grass monocot genomes. ANNALS OF BOTANY 2017; 120:893-909. [PMID: 29155926 PMCID: PMC5710610 DOI: 10.1093/aob/mcx112] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 09/19/2017] [Indexed: 05/19/2023]
Abstract
BACKGROUND AND AIMS Studies on codon usage in monocots have focused on grasses, and observed patterns of this taxon were generalized to all monocot species. Here, non-grass monocot species were analysed to investigate the differences between grass and non-grass monocots. METHODS First, studies of codon usage in monocots were reviewed. The current information was then extended regarding codon usage, as well as codon-pair context bias, using four completely sequenced non-grass monocot genomes (Musa acuminata, Musa balbisiana, Phoenix dactylifera and Spirodela polyrhiza) for which comparable transcriptome datasets are available. Measurements were taken regarding relative synonymous codon usage, effective number of codons, derived optimal codon and GC content and then the relationships investigated to infer the underlying evolutionary forces. KEY RESULTS The research identified optimal codons, rare codons and preferred codon-pair context in the non-grass monocot species studied. In contrast to the bimodal distribution of GC3 (GC content in third codon position) in grasses, non-grass monocots showed a unimodal distribution. Disproportionate use of G and C (and of A and T) in two- and four-codon amino acids detected in the analysis rules out the mutational bias hypothesis as an explanation of genomic variation in GC content. There was found to be a positive relationship between CAI (codon adaptation index; predicts the level of expression of a gene) and GC3. In addition, a strong correlation was observed between coding and genomic GC content and negative correlation of GC3 with gene length, indicating a strong impact of GC-biased gene conversion (gBGC) in shaping codon usage and nucleotide composition in non-grass monocots. CONCLUSION Optimal codons in these non-grass monocots show a preference for G/C in the third codon position. These results support the concept that codon usage and nucleotide composition in non-grass monocots are mainly driven by gBGC.
Collapse
Affiliation(s)
- Purabi Mazumdar
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
| | - RofinaYasmin Binti Othman
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Katharina Mebus
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
| | - N Ramakrishnan
- Electrical and Computer System Engineering, School of Engineering, Monash University Malaysia, Bandar Sunway, Malaysia
| | - Jennifer Ann Harikrishna
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
- For correspondence. E-mail:
| |
Collapse
|
32
|
Song H, Gao H, Liu J, Tian P, Nan Z. Comprehensive analysis of correlations among codon usage bias, gene expression, and substitution rate in Arachis duranensis and Arachis ipaënsis orthologs. Sci Rep 2017; 7:14853. [PMID: 29093502 PMCID: PMC5665869 DOI: 10.1038/s41598-017-13981-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 10/04/2017] [Indexed: 11/22/2022] Open
Abstract
The relationship between evolutionary rates and gene expression in model plant orthologs is well documented. However, little is known about the relationships between gene expression and evolutionary trends in Arachis orthologs. We identified 7,435 one-to-one orthologs, including 925 single-copy and 6,510 multiple-copy sequences in Arachis duranensis and Arachis ipaënsis. Codon usage was stronger for shorter polypeptides, which were encoded by codons with higher GC contents. Highly expressed coding sequences had higher codon usage bias, GC content, and expression breadth. Additionally, expression breadth was positively correlated with polypeptide length, but there was no correlation between gene expression and polypeptide length. Inferred selective pressure was also negatively correlated with both gene expression and expression breadth in all one-to-one orthologs, while positively but non-significantly correlated with gene expression in sequences with signatures of positive selection. Gene expression levels and expression breadth were significantly higher for single-copy genes than for multiple-copy genes. Similarly, the gene expression and expression breadth in sequences with signatures of purifying selection were higher than those of sequences with positive selective signatures. These results indicated that gene expression differed between single-copy and multiple-copy genes as well as sequences with signatures of positive and purifying selection.
Collapse
Affiliation(s)
- Hui Song
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China.
| | - Hongjuan Gao
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China
| | - Jing Liu
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China
| | - Pei Tian
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China
| | - Zhibiao Nan
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China.
| |
Collapse
|
33
|
Nath Choudhury M, Uddin A, Chakraborty S. Codon usage bias and its influencing factors for Y-linked genes in human. Comput Biol Chem 2017; 69:77-86. [DOI: 10.1016/j.compbiolchem.2017.05.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Revised: 05/04/2017] [Accepted: 05/20/2017] [Indexed: 11/30/2022]
|
34
|
Analysis of Ribosome-Associated mRNAs in Rice Reveals the Importance of Transcript Size and GC Content in Translation. G3-GENES GENOMES GENETICS 2017; 7:203-219. [PMID: 27852012 PMCID: PMC5217110 DOI: 10.1534/g3.116.036020] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Gene expression is controlled at transcriptional and post-transcriptional levels including decoding of messenger RNA (mRNA) into polypeptides via ribosome-mediated translation. Translational regulation has been intensively studied in the model dicot plant Arabidopsis thaliana, and in this study, we assessed the translational status [proportion of steady-state mRNA associated with ribosomes] of mRNAs by Translating Ribosome Affinity Purification followed by mRNA-sequencing (TRAP-seq) in rice (Oryza sativa), a model monocot plant and the most important food crop. A survey of three tissues found that most transcribed rice genes are translated whereas few transposable elements are associated with ribosomes. Genes with short and GC-rich coding regions are overrepresented in ribosome-associated mRNAs, suggesting that the GC-richness characteristic of coding sequences in grasses may be an adaptation that favors efficient translation. Transcripts with retained introns and extended 5′ untranslated regions are underrepresented on ribosomes, and rice genes belonging to different evolutionary lineages exhibited differential enrichment on the ribosomes that was associated with GC content. Genes involved in photosynthesis and stress responses are preferentially associated with ribosomes, whereas genes in epigenetic regulation pathways are the least enriched on ribosomes. Such variation is more dramatic in rice than that in Arabidopsis and is correlated with the wide variation of GC content of transcripts in rice. Taken together, variation in the translation status of individual transcripts reflects important mechanisms of gene regulation, which may have a role in evolution and diversification.
Collapse
|
35
|
Analysis of codon usage patterns in Ginkgo biloba reveals codon usage tendency from A/U-ending to G/C-ending. Sci Rep 2016; 6:35927. [PMID: 27808241 PMCID: PMC5093902 DOI: 10.1038/srep35927] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 10/07/2016] [Indexed: 11/08/2022] Open
Abstract
As one of the most ancient tree species, the codon usage pattern analysis of Ginkgo biloba is a useful way to understand its evolutionary and genetic mechanisms. Several studies have been conducted on angiosperms, but seldom on gymnosperms. Based on RNA-Seq data of the G. biloba transcriptome, amount to 17,579 unigenes longer than 300 bp were selected and analyzed from 68,547 candidates. The codon usage pattern tended towards more frequently use of A/U-ending codons, which showed an obvious gradient progressing from gymnosperms to dicots to monocots. Meanwhile, analysis of high/low-expression unigenes revealed that high-expression unigenes tended to use G/C-ending codons together with more codon usage bias. Variation of unigenes with different functions suggested that unigenes involving in environment adaptation use G/C-ending codons more frequently with more usage bias, and these results were consistent with the conclusion that the formation of G. biloba codon usage bias was dominated by natural selection.
Collapse
|
36
|
Li N, Li Y, Zheng C, Huang J, Zhang S. Genome-wide comparative analysis of the codon usage patterns in plants. Genes Genomics 2016. [DOI: 10.1007/s13258-016-0417-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
37
|
Expression levels and codon usage patterns in nuclear genes of the filarial nematode Wucheraria bancrofti and the blood fluke Schistosoma haematobium. J Helminthol 2016; 91:72-79. [PMID: 27048929 DOI: 10.1017/s0022149x16000092] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Synonymous codons are used with different frequencies, a phenomenon known as codon bias, which exists in many genomes and is mainly resolute by mutation and selection. To elucidate the genetic characteristics and evolutionary relationship of Wucheraria bancrofti and Schistosoma haematobium we examined the pattern of synonymous codon usage in nuclear genes of both the species. The mean overall GC contents of W. bancrofti and S. haematobium were 43.41 and 36.37%, respectively, which suggests that genes in both the species were AT rich. The value of the High Effective Number of Codons in both species suggests that codon usage bias was weak. Both species had a wide range of P3 distribution in the neutrality plot, with a significant correlation between P12 and P3. The codons were closer to the axes in correspondence analysis, suggesting that mutation pressure influenced the codon usage pattern in these species. We have identified the more frequently used codons in these species, most codons ending with an A or T. The nucleotides A/T and C/G were not proportionally used at the third position of codons, which reveals that natural selection might influence the codon usage patterns. The regression equation of P12 on P3 suggests that natural selection might have played a major role, while mutational pressure played a minor role in codon usage pattern in both species. These results form the basis of exploring the evolutionary mechanisms and the heterologous expression of medically important proteins of W. bancrofti and S. haematobium.
Collapse
|
38
|
Abstract
Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites.
Collapse
Affiliation(s)
- Rosina Savisaar
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
39
|
Sundararajan A, Dukowic-Schulze S, Kwicklis M, Engstrom K, Garcia N, Oviedo OJ, Ramaraj T, Gonzales MD, He Y, Wang M, Sun Q, Pillardy J, Kianian SF, Pawlowski WP, Chen C, Mudge J. Gene Evolutionary Trajectories and GC Patterns Driven by Recombination in Zea mays. FRONTIERS IN PLANT SCIENCE 2016; 7:1433. [PMID: 27713757 PMCID: PMC5031598 DOI: 10.3389/fpls.2016.01433] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 09/08/2016] [Indexed: 05/20/2023]
Abstract
Recombination occurring during meiosis is critical for creating genetic variation and plays an essential role in plant evolution. In addition to creating novel gene combinations, recombination can affect genome structure through altering GC patterns. In maize (Zea mays) and other grasses, another intriguing GC pattern exists. Maize genes show a bimodal GC content distribution that has been attributed to nucleotide bias in the third, or wobble, position of the codon. Recombination may be an underlying driving force given that recombination sites are often associated with high GC content. Here we explore the relationship between recombination and genomic GC patterns by comparing GC gene content at each of the three codon positions (GC1, GC2, and GC3, collectively termed GCx) to instances of a variable GC-rich motif that underlies double strand break (DSB) hotspots and to meiocyte-specific gene expression. Surprisingly, GCx bimodality in maize cannot be fully explained by the codon wobble hypothesis. High GCx genes show a strong overlap with the DSB hotspot motif, possibly providing a mechanism for the high evolutionary rates seen in these genes. On the other hand, genes that are turned on in meiosis (early prophase I) are biased against both high GCx genes and genes with the DSB hotspot motif, possibly allowing important meiotic genes to avoid DSBs. Our data suggests a strong link between the GC-rich motif underlying DSB hotspots and high GCx genes.
Collapse
Affiliation(s)
| | | | | | | | - Nathan Garcia
- National Center for Genome Resources, Santa FeNM, USA
| | | | | | | | - Yan He
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, IthacaNY, USA
| | - Minghui Wang
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, IthacaNY, USA
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, IthacaNY, USA
| | - Qi Sun
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, IthacaNY, USA
| | - Jaroslaw Pillardy
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, IthacaNY, USA
| | - Shahryar F. Kianian
- Cereal Disease Laboratory, United States Department of Agriculture – Agricultural Research Service, St. PaulMN, USA
| | - Wojciech P. Pawlowski
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, IthacaNY, USA
| | - Changbin Chen
- Department of Horticultural Science, University of Minnesota, St. PaulMN, USA
| | - Joann Mudge
- National Center for Genome Resources, Santa FeNM, USA
- *Correspondence: Joann Mudge,
| |
Collapse
|
40
|
|
41
|
A Comprehensive Analysis of Codon Usage Patterns in Blunt Snout Bream (Megalobrama amblycephala) Based on RNA-Seq Data. Int J Mol Sci 2015; 16:11996-2013. [PMID: 26016504 PMCID: PMC4490425 DOI: 10.3390/ijms160611996] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2015] [Accepted: 05/19/2015] [Indexed: 11/21/2022] Open
Abstract
Blunt snout bream (Megalobrama amblycephala) is an important fish species for its delicacy and high economic value in China. Codon usage analysis could be helpful to understand its codon biology, mRNA translation and vertebrate evolution. Based on RNA-Seq data for M. amblycephala, high-frequency codons (CUG, AGA, GUG, CAG and GAG), as well as low-frequency ones (NUA and NCG codons) were identified. A total of 724 high-frequency codon pairs were observed. Meanwhile, 14 preferred and 199 avoided neighboring codon pairs were also identified, but bias was almost not shown with one or more intervening codons inserted between the same pairs. Codon usage bias in the regions close to start and stop codons indicated apparent heterogeneity, which even occurs in the flanking nucleotide sequence. Codon usage bias (RSCU and SCUO) was related to GC3 (GC content of 3rd nucleotide in codon) bias. Six GO (Gene ontology) categories and the number of methylation targets were influenced by GC3. Codon usage patterns comparison among 23 vertebrates showed species specificities by using GC contents, codon usage and codon context analysis. This work provided new insights into fish biology and new information for breeding projects.
Collapse
|
42
|
Yuan Y, Long P, Jiang C, Li M, Huang L. Development and characterization of simple sequence repeat (SSR) markers based on a full-length cDNA library of Scutellaria baicalensis. Genomics 2015; 105:61-7. [DOI: 10.1016/j.ygeno.2014.10.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Revised: 10/13/2014] [Accepted: 10/21/2014] [Indexed: 12/17/2022]
|
43
|
Wei L, He J, Jia X, Qi Q, Liang Z, Zheng H, Ping Y, Liu S, Sun J. Analysis of codon usage bias of mitochondrial genome in Bombyx mori and its relation to evolution. BMC Evol Biol 2014; 14:262. [PMID: 25515024 PMCID: PMC4276022 DOI: 10.1186/s12862-014-0262-4] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Accepted: 12/09/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Synonymous codon usage bias (SCUB) is an inevitable phenomenon in organismic taxa, generally referring to differences in the occurrence frequency of codons across different species or within the genome of the same species. SCUB happens in various degrees under pressure from nature selection, mutation bias and other factors in different ways. It also attaches great significance to gene expression and species evolution, however, a systematic investigation towards the codon usage in Bombyx mori (B. mori) has not been reported yet. Moreover, it is still indistinct about the reasons contributing to the bias or the relationship between the bias and the evolution of B. mori. RESULTS The comparison of the codon usage pattern between the genomic DNA (gDNA) and the mitochondrial DNA (mtDNA) from B. mori suggests that mtDNA has a higher level of codon bias. Furthermore, the correspondence analysis suggests that natural selection, such as gene length, gene function and translational selection, dominates the codon preference of mtDNA, while the composition constraints for mutation bias only plays a minor role. Additionally, the clustering results of the silkworm superfamily suggest a lack of explicitness in the relationship between the codon usage of mitogenome and species evolution. CONCLUSIONS Among the complicated influence factors leading to codon bias, natural selection is found to play a major role in shaping the high bias in the mtDNA of B. mori from our current data. Although the cluster analysis reveals that codon bias correlates little with the species evolution, furthermore, a detailed analysis of codon usage of mitogenome provides better insight into the evolutionary relationships in Lepidoptera. However, more new methods and data are needed to investigate the relationship between the mtDNA bias and evolution.
Collapse
Affiliation(s)
- Lei Wei
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Jian He
- Guangzhou East Campus Lab Center, Sun Yat-sen University, Guangzhou, 510006, China.
| | - Xian Jia
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Qi Qi
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Zhisheng Liang
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Hao Zheng
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Yao Ping
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Shuyu Liu
- Guangzhou East Campus Lab Center, Sun Yat-sen University, Guangzhou, 510006, China.
| | - Jingchen Sun
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
44
|
Yang X, Luo X, Cai X. Analysis of codon usage pattern in Taenia saginata based on a transcriptome dataset. Parasit Vectors 2014; 7:527. [PMID: 25440955 PMCID: PMC4268816 DOI: 10.1186/s13071-014-0527-1] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 11/06/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon usage bias is an important evolutionary feature in a genome and has been widely documented in many genomes. Analysis of codon usage bias has significance for mRNA translation, design of transgenes, new gene discovery, and studies of molecular biology and evolution, etc. However, the information about synonymous codon usage pattern of T. saginata genome remains unclear. T. saginata is a food-borne zoonotic cestode which infects approximataely 50 million humans worldwide, and causes significant health problems to the host and considerable socio-economic losses as a consequence. In this study, synonymous codon usage in T. saginata were examined. METHODS Total RNA was isolated from T. saginata cysticerci and 91,487 unigenes were generated using Illumina sequencing technology. After filtering, the final sequence collection containing 11,399 CDSs was used for our analysis. RESULTS Neutrality analysis showed that the T. saginata had a wide GC3 distribution and a significant correlation was observed between GC12 and GC3. NC-plot showed most of genes on or close to the expected curve, but only a few points with low-ENC values were below it, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified twenty-three optimal codons in the T. saginata genome, all of which were ended with a G or C residue. These results suggest that mutational and selection forces are probably driving factors of codon usage bias in T. saginata genome. Meanwhile, other factors such as protein length, gene expression, GC content of genes, the hydropathicity of each protein also influence codon usage. CONCLUSIONS Here, we systematically analyzed the codon usage pattern and identified factors shaping in codon usage bias in T. saginata. Currently, no complete nuclear genome is available for codon usage analysis at the genome level in T. saginata. This is the first report to investigate codon biology in T. sagninata. Such information does not only bring about a new perspective for understanding the mechanisms of biased usage of synonymous codons but also provide useful clues for molecular genetic engineering and evolutionary studies.
Collapse
Affiliation(s)
- Xing Yang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China. .,College of Veterinary Medicine, Jilin University, Changchun, 130000, PR China.
| | - Xuenong Luo
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China.
| | - Xuepeng Cai
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China. .,College of Veterinary Medicine, Jilin University, Changchun, 130000, PR China.
| |
Collapse
|
45
|
Liu Q, Hu H, Wang H. Mutational bias is the driving force for shaping the synonymous codon usage pattern of alternatively spliced genes in rice (Oryza sativa L.). Mol Genet Genomics 2014; 290:649-60. [PMID: 25407289 DOI: 10.1007/s00438-014-0951-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Accepted: 11/04/2014] [Indexed: 11/30/2022]
Abstract
Alternative splicing plays important roles in diverse aspects of plant development, metabolism, and stress responses. However, the regulatory mechanisms of alternative splicing of genes still remain incompletely elucidated, especially in plants. In this study, the synonymous codon usage pattern of alternatively spliced (AS) genes in rice was firstly explored using the combination of correspondence analysis (CA), internal CA, correlation and ANOVA analyses. The results show that alternatively and non-alternatively spliced (non-AS) genes have similar tendency for overall codon usage, but exhibit significant difference in 58 out of 64 codons. AS and non-AS genes are both under strong purifying selection, but the former ones have significant lower mutation rate and are prone to be enriched towards the chromosomal ends. In the group of AS genes, the variability in synonymous codon usage between genes is mainly due to the variations in GC content, CDS length, as well as gene functions. Mutational bias that accounts for 25.85 % of the total codon usage variability plays a major role in shaping the codon usage pattern of AS genes. In contrast, no obvious evidence is found for the contributions of translational selection, AS types, the conservation of AS events, and numbers of AS variants to the codon usage divergence between AS genes. These findings may be useful for further understanding the mechanisms of origination, differentiation and regulation of alternatively spliced genes in plants.
Collapse
Affiliation(s)
- Qingpo Liu
- Department of Agronomy, and the Key Laboratory for Quality Improvement of Agricultural Products of Zhejiang Province, College of Agriculture and Food Science, Zhejiang A & F University, Lin'an, 311300, Zhejiang, China,
| | | | | |
Collapse
|
46
|
Chen H, Sun S, Norenburg JL, Sundberg P. Mutation and selection cause codon usage and bias in mitochondrial genomes of ribbon worms (Nemertea). PLoS One 2014; 9:e85631. [PMID: 24454907 PMCID: PMC3893253 DOI: 10.1371/journal.pone.0085631] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Accepted: 12/05/2013] [Indexed: 12/14/2022] Open
Abstract
The phenomenon of codon usage bias is known to exist in many genomes and it is mainly determined by mutation and selection. To understand the patterns of codon usage in nemertean mitochondrial genomes, we use bioinformatic approaches to analyze the protein-coding sequences of eight nemertean species. Neutrality analysis did not find a significant correlation between GC12 and GC3. ENc-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENc values are below it. ENc-plot suggested that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally and we propose that codons containing A or U at third position are used preferentially in nemertean species, regardless of whether corresponding tRNAs are encoded in the mitochondrial DNA. Context-dependent analysis indicated that the nucleotide at the second codon position slightly affects synonymous codon choices. These results suggested that mutational and selection forces are probably acting to codon usage bias in nemertean mitochondrial genomes.
Collapse
Affiliation(s)
- Haixia Chen
- Department of Biological and Environmental Sciences, University of Gothenburg, Sweden
- * E-mail: (HXC); (PS)
| | - Shichun Sun
- Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao, China
| | - Jon L. Norenburg
- Department of Invertebrate Zoology, Smithsonian's National Museum of Natural History, Washington D.C., United States of America
| | - Per Sundberg
- Department of Biological and Environmental Sciences, University of Gothenburg, Sweden
- * E-mail: (HXC); (PS)
| |
Collapse
|
47
|
Feng C, Xu CJ, Wang Y, Liu WL, Yin XR, Li X, Chen M, Chen KS. Codon usage patterns in Chinese bayberry (Myrica rubra) based on RNA-Seq data. BMC Genomics 2013; 14:732. [PMID: 24160180 PMCID: PMC4008310 DOI: 10.1186/1471-2164-14-732] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 10/21/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon usage analysis has been a classical topic for decades and has significances for studies of evolution, mRNA translation, and new gene discovery, etc. While the codon usage varies among different members of the plant kingdom, indicating the necessity for species-specific study, this work has mostly been limited to model organisms. Recently, the development of deep sequencing, especial RNA-Seq, has made it possible to carry out studies in non-model species. RESULT RNA-Seq data of Chinese bayberry was analyzed to investigate the bias of codon usage and codon pairs. High frequency codons (AGG, GCU, AAG and GAU), as well as low frequency ones (NCG and NUA codons) were identified, and 397 high frequency codon pairs were observed. Meanwhile, 26 preferred and 141 avoided neighboring codon pairs were also identified, which showed more significant bias than the same pairs with one or more intervening codons. Codon patterns were also analyzed at the plant kingdom, organism and gene levels. Changes during plant evolution were evident using RSCU (relative synonymous codon usage), which was even more significant than GC3s (GC content of 3rd synonymous codons). Nine GO categories were differentially and independently influenced by CAI (codon adaptation index) or GC3s, especially in 'Molecular function' category. Within a gene, the average CAI increased from 0.720 to 0.785 in the first 50 codons, and then more slowly thereafter. Furthermore, the preferred as well as avoided codons at the position just following the start codon AUG were identified and discussed in relation to the key positions in Kozak sequences. CONCLUSION A comprehensive codon usage Table and number of high-frequency codon pairs were established. Bias in codon usage as well as in neighboring codon pairs was observed, and the significance of this in avoiding DNA mutation, increasing protein production and regulating protein synthesis rate was proposed. Codon usage patterns at three levels were revealed and the significance in plant evolution analysis, gene function classification, and protein translation start site predication were discussed. This work promotes the study of codon biology, and provides some reference for analysis and comprehensive application of RNA-Seq data from other non-model species.
Collapse
Affiliation(s)
- Chao Feng
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Chang-jie Xu
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Yue Wang
- Department of Bioinformatics / The State Key Laboratory of Plant Physiology and Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Wen-li Liu
- Department of Mathematics, Zhejiang University, Hangzhou, 310027, China
| | - Xue-ren Yin
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Xian Li
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Ming Chen
- Department of Bioinformatics / The State Key Laboratory of Plant Physiology and Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Kun-song Chen
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| |
Collapse
|
48
|
Pan LL, Wang Y, Hu JH, Ding ZT, Li C. Analysis of codon use features of stearoyl-acyl carrier protein desaturase gene in Camellia sinensis. J Theor Biol 2013; 334:80-6. [PMID: 23774066 DOI: 10.1016/j.jtbi.2013.06.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2013] [Revised: 06/03/2013] [Accepted: 06/06/2013] [Indexed: 11/19/2022]
Abstract
The stearoyl-acyl carrier protein desaturase (SAD) gene widely exists in all kinds of plants. In this paper, the Camellia sinensis SAD gene (CsSAD) sequence was firstly analyzed by Codon W, CHIPS, and CUSP programs online, and then compared with genomes of the tea plant, other species and SAD genes from 11 plant species. The results show that the CsSAD gene and the selected 73 of C. sinensis genes have similar codon usage bias. The CsSAD gene has a bias toward the synonymous codons with A and T at the third codon position, the same as the 73 of C. sinensis genes. Compared with monocotyledons such as Triticum aestivum and Zea mays, the differences in codon usage frequency between the CsSAD gene and dicotyledons such as Arabidopsis thaliana and Nicotiana tobacum are less. Therefore, A. thaliana and N. tobacum expression systems may be more suitable for the expression of the CsSAD gene. The analysis result of SAD genes from 12 plant species also shows that most of the SAD genes are biased toward the synonymous codons with G and C at the third codon position. We believe that the codon usage bias analysis presented in this study will be essential for providing a theoretical basis for discussing the structure and function of the CsSAD gene.
Collapse
Affiliation(s)
- Lu-Lu Pan
- Tea Research Institute, Qingdao Agricultural University, Changcheng Road 700#, Chengyang District, Qingdao, Shandong 266109, China.
| | | | | | | | | |
Collapse
|
49
|
Sablok G, Wu X, Kuo J, Nayak KC, Baev V, Varotto C, Zhou F. Combinational effect of mutational bias and translational selection for translation efficiency in tomato (Solanum lycopersicum) cv. Micro-Tom. Genomics 2013; 101:290-5. [PMID: 23474140 DOI: 10.1016/j.ygeno.2013.02.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Revised: 01/21/2013] [Accepted: 02/21/2013] [Indexed: 11/24/2022]
Abstract
We conducted a comprehensive analysis of codon usage bias (CUB) based on the available non-redundant full-length cDNA (nrFLcDNA) and expressed sequence tags (ESTs) data of cultivar Micro-Tom and evaluated the associations of observed CUB and measurements of transcriptional and translational effectiveness. The analysis presented in our study suggests a correlation, which is negative but highly correlated between Axis 1 and GC3s (r=-0.827, P<0.01), indicating that mutational bias has a significant and dominant repressive role to the choices of GC3. We also observed a strong positive correlation between codon adaptation index (CAI) and translational adaptation index (tAIg) (0.407, P<0.01), which demonstrates the facilitation of efficient translation by the optimal codon usage patterns of the highly expressed genes. We believe that the complete set of optimal codon usage patterns detected in this study will serve as a model to enhance the transgenesis in the studied cultivar of Solanum lycopersicum.
Collapse
Affiliation(s)
- Gaurav Sablok
- Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E Mach 1, 38010 S. Michele all'Adige (TN), Italy.
| | | | | | | | | | | | | |
Collapse
|
50
|
Lee J, Noh EK, Choi HS, Shin SC, Park H, Lee H. Transcriptome sequencing of the Antarctic vascular plant Deschampsia antarctica Desv. under abiotic stress. PLANTA 2013; 237:823-36. [PMID: 23135329 DOI: 10.1007/s00425-012-1797-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2012] [Accepted: 10/23/2012] [Indexed: 05/15/2023]
Abstract
Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been studied as an extremophile that has successfully adapted to marginal land with the harshest environment for terrestrial plants. However, limited genetic research has focused on this species due to the lack of genomic resources. Here, we present the first de novo assembly of its transcriptome by massive parallel sequencing and its expression profile using D. antarctica grown under various stress conditions. Total sequence reads generated by pyrosequencing were assembled into 60,765 unigenes (28,177 contigs and 32,588 singletons). A total of 29,173 unique protein-coding genes were identified based on sequence similarities to known proteins. The combined results from all three stress conditions indicated differential expression of 3,110 genes. Quantitative reverse transcription polymerase chain reaction showed that several well-known stress-responsive genes encoding late embryogenesis abundant protein, dehydrin 1, and ice recrystallization inhibition protein were induced dramatically and that genes encoding U-box-domain-containing protein, electron transfer flavoprotein-ubiquinone, and F-box-containing protein were induced by abiotic stressors in a manner conserved with other plant species. We identified more than 2,000 simple sequence repeats that can be developed as functional molecular markers. This dataset is the most comprehensive transcriptome resource currently available for D. antarctica and is therefore expected to be an important foundation for future genetic studies of grasses and extremophiles.
Collapse
Affiliation(s)
- Jungeun Lee
- Division of Life Sciences, Korea Polar Research Institute, 12 Gaetbeol-ro, Yeonsu-gu, Incheon, 406-840, Korea
| | | | | | | | | | | |
Collapse
|