1
|
Brew-Appiah RAT, York ZB, Krishnan V, Roalson EH, Sanguinet KA. Genome-wide identification and analysis of the ALTERNATIVE OXIDASE gene family in diploid and hexaploid wheat. PLoS One 2018; 13:e0201439. [PMID: 30074999 PMCID: PMC6075773 DOI: 10.1371/journal.pone.0201439] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 07/16/2018] [Indexed: 11/19/2022] Open
Abstract
A comprehensive understanding of wheat responses to environmental stress will contribute to the long-term goal of feeding the planet. ALERNATIVE OXIDASE (AOX) genes encode proteins involved in a bypass of the electron transport chain and are also known to be involved in stress tolerance in multiple species. Here, we report the identification and characterization of the AOX gene family in diploid and hexaploid wheat. Four genes each were found in the diploid ancestors Triticum urartu, and Aegilops tauschii, and three in Aegilops speltoides. In hexaploid wheat (Triticum aestivum), 20 genes were identified, some with multiple splice variants, corresponding to a total of 24 proteins for those with observed transcription and translation. These proteins were classified as AOX1a, AOX1c, AOX1e or AOX1d via phylogenetic analysis. Proteins lacking most or all signature AOX motifs were assigned to putative regulatory roles. Analysis of protein-targeting sequences suggests mixed localization to the mitochondria and other organelles. In comparison to the most studied AOX from Trypanosoma brucei, there were amino acid substitutions at critical functional domains indicating possible role divergence in wheat or grasses in general. In hexaploid wheat, AOX genes were expressed at specific developmental stages as well as in response to both biotic and abiotic stresses such as fungal pathogens, heat and drought. These AOX expression patterns suggest a highly regulated and diverse transcription and expression system. The insights gained provide a framework for the continued and expanded study of AOX genes in wheat for stress tolerance through breeding new varieties, as well as resistance to AOX-targeted herbicides, all of which can ultimately be used synergistically to improve crop yield.
Collapse
Affiliation(s)
- Rhoda A. T. Brew-Appiah
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, United States of America
| | - Zara B. York
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, United States of America
| | - Vandhana Krishnan
- Stanford Center for Genomics and Personalized Medicine, Department of Genetics, Stanford University, Stanford, United States of America
| | - Eric H. Roalson
- School of Biological Sciences, Washington State University, Pullman, Washington, United States of America
| | - Karen A. Sanguinet
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, United States of America
| |
Collapse
|
2
|
Tatarinova T, Elhaik E, Pellegrini M. Cross-species analysis of genic GC3 content and DNA methylation patterns. Genome Biol Evol 2013; 5:1443-56. [PMID: 23833164 PMCID: PMC3762193 DOI: 10.1093/gbe/evt103] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The GC content in the third codon position (GC3) exhibits a unimodal distribution in many plant and animal genomes. Interestingly, grasses and homeotherm vertebrates exhibit a unique bimodal distribution. High GC3 was previously found to be associated with variable expression, higher frequency of upstream TATA boxes, and an increase of GC3 from 5′ to 3′. Moreover, GC3-rich genes are predominant in certain gene classes and are enriched in CpG dinucleotides that are potential targets for methylation. Based on the GC3 bimodal distribution we hypothesize that GC3 has a regulatory role involving methylation and gene expression. To test that hypothesis, we selected diverse taxa (rice, thale cress, bee, and human) that varied in the modality of their GC3 distribution and tested the association between GC3, DNA methylation, and gene expression. We examine the relationship between cytosine methylation levels and GC3, gene expression, genome signature, gene length, and other gene compositional features. We find a strong negative correlation (Pearson’s correlation coefficient r = −0.67, P value < 0.0001) between GC3 and genic CpG methylation. The comparison between 5′-3′ gradients of CG3-skew and genic methylation for the taxa in the study suggests interplay between gene-body methylation and transcription-coupled cytosine deamination effect. Compositional features are correlated with methylation levels of genes in rice, thale cress, human, bee, and fruit fly (which acts as an unmethylated control). These patterns allow us to generate evolutionary hypotheses about the relationships between GC3 and methylation and how these affect expression patterns. Specifically, we propose that the opposite effects of methylation and compositional gradients along coding regions of GC3-poor and GC3-rich genes are the products of several competing processes.
Collapse
Affiliation(s)
- Tatiana Tatarinova
- Laboratory of Applied Pharmacokinetics and Bioinformatics, University of Southern California.
| | | | | |
Collapse
|
3
|
Kochetov AV, Prayaga PD, Volkova OA, Sankararamakrishnan R. Hidden coding potential of eukaryotic genomes: nonAUG started ORFs. J Biomol Struct Dyn 2012; 31:103-14. [PMID: 22804099 DOI: 10.1080/07391102.2012.691367] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
It is widely considered that the vast majority of eukaryotic mRNAs contain only one open reading frame (ORF) and encode single protein. However, eukaryotic ribosomes can initiate translation at alternative start codons due to leaky scanning or reinitiation mechanisms that provides an opportunity to synthesize several protein products. Recent investigations also demonstrated that alternative translation from nonAUG start codons and AUG codons in a weak nucleotide context could make an important contribution to eukaryotic proteomes. However, accurate prediction of alternative start codons demands detailed investigation of mRNA features influencing their recognition by eukaryotic ribosomes. In this work, we present the results of computational analysis of characteristics of yeast and mammalian mRNAs potentially involved in the recognition of nonAUG start codons. It was found that sequence features of nonAUG started Saccharomyces cerevisiae upstream ORFs (uORFs) were adjusted for efficient translation and these uORFs could frequently encode functional polypeptides. In particular, our initial studies revealed that predicted tertiary structures downstream of nonAUG start sites in mammalian mRNAs were energetically more stable than those predicted for AUG start sites with strong Kozak context. We hypothesize that presence of such stable tertiary structure downstream of nonAUG start sites could be an important factor for the ribosome to recognize noncanonical start codons.
Collapse
Affiliation(s)
- Alex V Kochetov
- Institute of Cytology and Genetics, 10, Lavrentiev Ave, Novosibirsk, 630090, Russia
| | | | | | | |
Collapse
|
4
|
Du MZ, Guo FB, Chen YY. Gene re-annotation in genome of the extremophile Pyrobaculum aerophilum by using bioinformatics methods. J Biomol Struct Dyn 2012; 29:391-401. [PMID: 21875157 DOI: 10.1080/07391102.2011.10507393] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
In this paper, we re-annotated the genome of Pyrobaculum aerophilum str. IM2, particularly for hypothetical ORFs. The annotation process includes three parts. Firstly and most importantly, 23 new genes, which were missed in the original annotation, are found by combining similarity search and the ab initio gene finding approaches. Among these new genes, five have significant similarities with function-known genes and the rest have significant similarities with hypothetical ORFs contained in other genomes. Secondly, the coding potentials of the 1645 hypothetical ORFs are re-predicted by using 33 Z curve variables combined with Fisher linear discrimination method. With the accuracy being 99.68%, 25 originally annotated hypothetical ORFs are recognized as non-coding by our method. Thirdly, 80 hypothetical ORFs are assigned with potential functions by using similarity search with BLAST program. Re-annotation of the genome will benefit related researches on this hyperthermophilic crenarchaeon. Also, the re-annotation procedure could be taken as a reference for other archaeal genomes. Details of the revised annotation are freely available at http://cobi.uestc.edu.cn/resource/paero/
Collapse
Affiliation(s)
- Meng-Ze Du
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | | | | |
Collapse
|
5
|
Chakraborty J, Dutta TK. From lipid transport to oxygenation of aromatic compounds: evolution within the Bet v1-like superfamily. J Biomol Struct Dyn 2011; 29:67-78. [PMID: 21696226 DOI: 10.1080/07391102.2011.10507375] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
In absence of significant sequence similarity, remote homology between proteins can be confused with analogy and in such a case, shared ancestry can be inferred in light of certain unique and common features. In the present study, to understand the evolutionary origin of catalytic domain of large subunit of ring-hydroxylating oxygenases (RHOs), belonging to the Bet v1-like superfamily, structure-based phylogenies have been derived from structural alignment of representative proteins of the superfamily. A careful inspection of the structural relatedness of RHOs with the rest of the families showed closest similarity between RHO catalytic domain and PA1206-like protein. In addition, phylogenetic relationship of the Rieske domain of the large subunit of RHOs with functionally and structurally similar proteins has also been elucidated so as to postulate the most possible events leading to the genesis of the large subunit of RHOs.
Collapse
Affiliation(s)
- Joydeep Chakraborty
- Department of Microbiology, Bose Institute, P-1/12 C.I.T. Scheme VII M, Kolkata 700054, India
| | | |
Collapse
|
6
|
Favorov A, Lvovs D, Speier W, Parmigiani G, Ochs MF. OnionTree XML: A Format to Exchange Gene-Related Probabilities. J Biomol Struct Dyn 2011; 29:417-23. [DOI: 10.1080/073911011010524994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
7
|
Al-Khatib RM, Rashid NAA, Abdullah R. Thermodynamic Heuristics with Case-Based Reasoning: Combined Insights for RNA Pseudoknot Secondary Structure. J Biomol Struct Dyn 2011; 29:1-26. [DOI: 10.1080/07391102.2011.10507373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
8
|
Kochetov AV, Volkova OA, Poliakov A, Dubchak I, Rogozin IB. Tandem termination signal in plant mRNAs. Gene 2011; 481:1-6. [PMID: 21539902 DOI: 10.1016/j.gene.2011.04.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2010] [Revised: 02/17/2011] [Accepted: 04/15/2011] [Indexed: 12/12/2022]
Abstract
It was proposed that if some mRNA characteristics resulted in a low efficiency of termination signal, an additional closely located stop codon (tandem stop codons) could be used to prevent the harmful readthrough. However, the role of tandem terminators in higher eukaryotes was not verified and remains hypothetical. In this work the sequence features of Arabidopsis thaliana and Oryza sativa mRNAs were analyzed. It was found that plant mRNAs with UGA terminator were characterized by a higher frequency of nonsense codons in the first triplet position of 3'-UTR that could result from a weak natural selection for "reserve" stop signal. Interestingly, the presence of tandem stop codons positively correlated with a specific amino acid composition in the C-terminal position of the encoded proteins. In particular, C-terminal glycine positively correlated with significantly higher frequencies of reserve terminators at the beginning positions of 3'-UTR in UGA-containing mRNAs. This finding coincides with some earlier observations concerning the role of glycine and its codons in inefficient termination of translation and recoding (e.g., 2A oligopeptide).
Collapse
Affiliation(s)
- Alex V Kochetov
- Institute of Cytology and Genetics, Lavrentieva ave. 10, Novosibirsk, 630090, Russia.
| | | | | | | | | |
Collapse
|
9
|
Chang G, Wang T. Weighted relative entropy for alignment-free sequence comparison based on Markov model. J Biomol Struct Dyn 2011; 28:545-55. [PMID: 21142223 DOI: 10.1080/07391102.2011.10508594] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
In this paper, we introduce a probabilistic measure for computing the similarity between two biological sequences without alignment. The computation of the similarity measure is based on the Kullback-Leibler divergence of two constructed Markov models. We firstly validate the method on clustering nine chromosomes from three species. Secondly, we give the result of similarity search based on our new method. We lastly apply the measure to the construction of phylogenetic tree of 48 HEV genome sequences. Our results indicate that the weighted relative entropy is an efficient and powerful alignment-free measure for the analysis of sequences in the genomic scale.
Collapse
Affiliation(s)
- Guisong Chang
- School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, PR China.
| | | |
Collapse
|
10
|
Cao J, Shi F, Liu X, Jia J, Zeng J, Huang G. Genome-wide identification and evolutionary analysis of Arabidopsis sm genes family. J Biomol Struct Dyn 2011; 28:535-44. [PMID: 21142222 DOI: 10.1080/07391102.2011.10508593] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Sm proteins are members of a family of small proteins that are widespread in biosphere and found associated with RNA metabolism. To date, to our knowledge, only Arabidopsis SAD1 gene has been studied functionally in plant. In this study, 42 Sm genes are identified through comprehensive analysis in Arabidopsis. And a complete overview of this gene family is presented, including the gene structures, phylogeny, chromosome locations, selection pressure and expression. The results reveal that gene duplication contributes to the expansion of the Sm gene family in Arabidopsis genome, diverse expression patterns suggest their functional differentiation and divergence analysis indicates purifying selection as a key role in evolution. Our comparative genomics analysis of Sm genes will provide the first step towards the future experimental research on determining the functions of these genes.
Collapse
Affiliation(s)
- Jun Cao
- Institute of Life Science, Jiangsu University, Xuefu Road 301, Zhenjiang 212013, Jiangsu, PR China.
| | | | | | | | | | | |
Collapse
|
11
|
Das S, Mitra S, Sahoo S, Chakrabarti J. Novel Hybrid Encodes both Continuous and Split tRNA Genes? J Biomol Struct Dyn 2011; 28:827-31. [DOI: 10.1080/07391102.2011.10508610] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
12
|
Liu X, Dai Q, Li L, He Z. An efficient binomial model-based measure for sequence comparison and its application. J Biomol Struct Dyn 2011; 28:833-43. [PMID: 21294594 DOI: 10.1080/07391102.2011.10508611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Sequence comparison is one of the major tasks in bioinformatics, which could serve as evidence of structural and functional conservation, as well as of evolutionary relations. There are several similarity/dissimilarity measures for sequence comparison, but challenges remains. This paper presented a binomial model-based measure to analyze biological sequences. With help of a random indicator, the occurrence of a word at any position of sequence can be regarded as a random Bernoulli variable, and the distribution of a sum of the word occurrence is well known to be a binomial one. By using a recursive formula, we computed the binomial probability of the word count and proposed a binomial model-based measure based on the relative entropy. The proposed measure was tested by extensive experiments including classification of HEV genotypes and phylogenetic analysis, and further compared with alignment-based and alignment-free measures. The results demonstrate that the proposed measure based on binomial model is more efficient.
Collapse
Affiliation(s)
- Xiaoqing Liu
- School of Science, Hangzhou Dianzi Unviersity, Hangzhou 310018, People's Republic of China
| | | | | | | |
Collapse
|
13
|
Zhang Y, Chen W. A Measure of DNA Sequence Dissimilarity Based on Free Energy of Nearest-neighbor Interaction. J Biomol Struct Dyn 2011; 28:557-65. [DOI: 10.1080/07391102.2011.10508595] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|