1
|
Sophiarani Y, Chakraborty S. Synonymous sites for accessibility around microRNA binding sites in bacterial spot and speck disease resistance genes of tomato. Funct Integr Genomics 2023; 23:247. [PMID: 37468805 DOI: 10.1007/s10142-023-01178-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 05/15/2023] [Accepted: 07/13/2023] [Indexed: 07/21/2023]
Abstract
The major causes of mass tomato infections in both covered and open ground are agents of bacterial spot and bacterial speck diseases. MicroRNAs (miRNAs) are 16-21 nucleotides in length, non-coding RNAs that inhibit translation and trigger mRNA degradation. MiRNAs play a significant part in plant resistance to abiotic and biotic stresses by mediating gene regulation via post-transcriptional RNA silencing. In this study, we analyzed a collection of bacterial resistance genes of tomato and their binding sites for tomato miRNAs and Pseudomonas syringe pv. tomato miRNAs. Our study found that two genes, bacterial spot disease resistance gene (Bs4) and bacterial speck disease resistance gene (Prf), have a 7mer-m8 perfect seed match with miRNAs. Bs4 was targeted by one tomato miRNA (sly-miR9470-3p) and three Pseudomonas syringe pv. tomato miRNAs (PSTJ4_3p_27246, PSTJ4_3p_27246, and PSTJ4_3p_27246). Again, Prf gene was found to be targeted by two tomato miRNAs namely, sly-miR9469-5p and sly-miR9474-3p. The accessibility of the miRNA-target site and its flanking regions and the relationship between relative synonymous codon usage and tRNAs were compared. Strong access to miRNA targeting regions and decreased rate of translations suggested that miRNAs might be efficient in binding to their particular targets. We also found the existence of rare codons, which suggests that it could enhance miRNA targeting even more. The codon usage pattern analysis of the two genes revealed that both were AT-rich (Bs4 = 63.2%; Prf = 60.8%). We found a low codon usage bias in both genes, suggesting that selective restriction might regulate them. The silencing property of miRNAs would allow researchers to discover the involvement of plant miRNAs in pathogen invasion. However, the efficient validation of direct targets of miRNAs is an urgent need that might be highly beneficial in enhancing plant resistance to multiple pathogenic diseases.
Collapse
Affiliation(s)
- Yengkhom Sophiarani
- Department of Biotechnology, Assam University, Silchar, Assam, 788011, India
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, Assam, 788011, India.
| |
Collapse
|
2
|
Barbieri M. Evolution of the genetic code: The ambiguity-reduction theory. Biosystems 2019; 185:104024. [DOI: 10.1016/j.biosystems.2019.104024] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 08/26/2019] [Accepted: 08/26/2019] [Indexed: 10/26/2022]
|
3
|
A general model on the origin of biological codes. Biosystems 2019; 181:11-19. [DOI: 10.1016/j.biosystems.2019.04.010] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 04/16/2019] [Accepted: 04/16/2019] [Indexed: 01/09/2023]
|
4
|
Wrublewski DT. Analysis for Science Librarians of the 2018 Nobel Prize in Chemistry: Directed Evolution of Enzymes and Phage Display of Peptides and Antibodies. ACTA ACUST UNITED AC 2019. [DOI: 10.1080/0194262x.2019.1579159] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
5
|
Wan J, Gao X, Mao Y, Zhang X, Qian SB. A Coding Sequence-Embedded Principle Governs Translational Reading Frame Fidelity. RESEARCH (WASHINGTON, D.C.) 2018; 2018:7089174. [PMID: 31549036 PMCID: PMC6750092 DOI: 10.1155/2018/7089174] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2018] [Accepted: 08/28/2018] [Indexed: 06/10/2023]
Abstract
Upon initiation at a start codon, the ribosome must maintain the correct reading frame for hundreds of codons in order to produce functional proteins. While some sequence elements are able to trigger programmed ribosomal frameshifting (PRF), very little is known about how the ribosome normally prevents spontaneous frameshift errors that can have dire consequences if uncorrected. Using high resolution ribosome profiling data sets, we discovered that the translating ribosome uses the 3' end of 18S rRNA to scan the AUG-like codons after the decoding process. The postdecoding mRNA:rRNA interaction not only contributes to predominant translational pausing, but also provides a retrospective mechanism to safeguard the ribosome in the correct reading frame. Partially eliminating the AUG-like "sticky" codons in the reporter message leads to increased +1 frameshift errors. Remarkably, mutating the highly conserved CAU triplet of 18S rRNA globally changes the codon "stickiness". Further supporting the role of "sticky" sequences in reading frame maintenance, the codon composition of open reading frames is highly optimized across eukaryotic genomes. These results suggest an important layer of information embedded within the protein-coding sequences that instructs the ribosome to ensure reading frame fidelity during translation.
Collapse
Affiliation(s)
- Ji Wan
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Xiangwei Gao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Yuanhui Mao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Xingqian Zhang
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
- Graduate Programs in Genetics Genomics and Development, Biochemistry Molecular and Cellular Biology, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
6
|
Vasanthi S, Dass JFP. Comparative genome-wide analysis of codon usage of different bacterial species infecting Oryza sativa. J Cell Biochem 2018; 119:9346-9356. [PMID: 30105828 DOI: 10.1002/jcb.27214] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2018] [Accepted: 06/13/2018] [Indexed: 11/05/2022]
Abstract
Oryza sativa is vastly affected by microbial pathogen, causing blight-related diseases, which in turn deplete the growth and productivity of rice. In this study, we analyzed four bacterial rice pathogen genomes and reported on their codon usage that might have greater implication in mutation-related research. Differential codon usage indices, such as codon adaptation index (CAI), codon bias index (CBI), effective number of codons (ENc), relative synonymous codon usage (RSCU), correspondence analysis (COA), and parity plots, were applied on coding sequences of Pseudomonas fuscovaginae, Pseudomonas syringae, Xanthomonas oryzae, and Pseudomonas avenae speices. The RSCU results proposed a high-frequency usage of CUG and CGC that codes for leucine and arginine in all of the species. The CBI and CAI values between the genomes range from 0.17 to 0.3 and from 0.26 to 0.35, respectively, indicating a direct proportionality between these indexes. The mean ENc value of P. avenae coding sequence showed high codon bias compared with other genomes. The axis I variation from COA analysis shows a mean value of 42.28% codon variations in these bacterial species. Correlation studies between axis I and ENc-GC3, along with CAI and CBI, suggested the presence of nucleotide bias and mutational pressure as major forces for codon bias within these species. Hence, certain genes with high CAI-CBI have been correlated for better gene expression. Our study highlights the importance of nucleotide biasness, mutation pressure, and natural selection in shaping protein-coding genes in these four rice-affecting bacteria. This would further help in investigating the evolution of pathogenic gene families, which may direct research toward synthetic genes that could be suppressed or overrepresented based on their codon usage pattern toward pathogenicity.
Collapse
Affiliation(s)
- S Vasanthi
- Department of Integrative Biology, School of Biosciences and Technology, VIT, Vellore, Tamil Nadu, India
| | - J Febin Prabhu Dass
- Department of Integrative Biology, School of Biosciences and Technology, VIT, Vellore, Tamil Nadu, India
| |
Collapse
|
7
|
Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes. Biosystems 2018; 167:33-46. [DOI: 10.1016/j.biosystems.2018.03.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Revised: 03/18/2018] [Accepted: 03/19/2018] [Indexed: 12/11/2022]
|
8
|
Barbieri M. What is code biology? Biosystems 2018; 164:1-10. [DOI: 10.1016/j.biosystems.2017.10.005] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 10/04/2017] [Accepted: 10/05/2017] [Indexed: 01/29/2023]
|
9
|
Gonzalez DL, Giannerini S, Rosa R. The non-power model of the genetic code: a paradigm for interpreting genomic information. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2016; 374:rsta.2015.0062. [PMID: 26857679 DOI: 10.1098/rsta.2015.0062] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 10/27/2015] [Indexed: 06/05/2023]
Abstract
In this article, we present a mathematical framework based on redundant (non-power) representations of integer numbers as a paradigm for the interpretation of genomic information. The core of the approach relies on modelling the degeneracy of the genetic code. The model allows one to explain many features and symmetries of the genetic code and to uncover hidden symmetries. Also, it provides us with new tools for the analysis of genomic sequences. We review briefly three main areas: (i) the Euplotid nuclear code, (ii) the vertebrate mitochondrial code, and (iii) the main coding/decoding strategies used in the three domains of life. In every case, we show how the non-power model is a natural unified framework for describing degeneracy and deriving sound biological hypotheses on protein coding. The approach is rooted on number theory and group theory; nevertheless, we have kept the technical level to a minimum by focusing on key concepts and on the biological implications.
Collapse
Affiliation(s)
- Diego Luis Gonzalez
- Dipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126 Bologna, Italy CNR-IMM, Sezione di Bologna, Via Gobetti 101, 40129 Bologna, Italy
| | - Simone Giannerini
- Dipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126 Bologna, Italy
| | - Rodolfo Rosa
- CNR-IMM, Sezione di Bologna, Via Gobetti 101, 40129 Bologna, Italy
| |
Collapse
|
10
|
Fimmel E, Strüngmann L. Maximal dinucleotide comma-free codes. J Theor Biol 2015; 389:206-13. [PMID: 26562635 DOI: 10.1016/j.jtbi.2015.10.022] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2015] [Revised: 10/16/2015] [Accepted: 10/19/2015] [Indexed: 10/22/2022]
Abstract
The problem of retrieval and maintenance of the correct reading frame plays a significant role in RNA transcription. Circular codes, and especially comma-free codes, can help to understand the underlying mechanisms of error-detection in this process. In recent years much attention has been paid to the investigation of trinucleotide circular codes (see, for instance, Fimmel et al., 2014; Fimmel and Strüngmann, 2015a; Michel and Pirillo, 2012; Michel et al., 2012, 2008), while dinucleotide codes had been touched on only marginally, even though dinucleotides are associated to important biological functions. Recently, all maximal dinucleotide circular codes were classified (Fimmel et al., 2015; Michel and Pirillo, 2013). The present paper studies maximal dinucleotide comma-free codes and their close connection to maximal dinucleotide circular codes. We give a construction principle for such codes and provide a graphical representation that allows them to be visualized geometrically. Moreover, we compare the results for dinucleotide codes with the corresponding situation for trinucleotide maximal self-complementary C(3)-codes. Finally, the results obtained are discussed with respect to Crick׳s hypothesis about frame-shift-detecting codes without commas.
Collapse
Affiliation(s)
- Elena Fimmel
- Institute of Mathematical Biology, Faculty of Computer Sciences, Mannheim University of Applied Sciences, 68163 Mannheim, Germany.
| | - Lutz Strüngmann
- Institute of Mathematical Biology, Faculty of Computer Sciences, Mannheim University of Applied Sciences, 68163 Mannheim, Germany.
| |
Collapse
|
11
|
|
12
|
Limitations of the ‘ambush hypothesis’ at the single-gene scale: what codon biases are to blame? Mol Genet Genomics 2014; 290:493-504. [DOI: 10.1007/s00438-014-0937-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Accepted: 10/01/2014] [Indexed: 10/24/2022]
|
13
|
Abstract
Whole-genome and functional analyses suggest a wealth of secondary or auxiliary genetic information (AGI) within the redundancy component of the genetic code. Although there are multiple aspects of biased codon use, we focus on two types of auxiliary information: codon-specific translational pauses that can be used by particular proteins toward their unique folding and biased codon patterns shared by groups of functionally related mRNAs with coordinate regulation. AGI is important to genetics in general and to human disease; here, we consider influences of its three major components, biased codon use itself, variations in the tRNAome, and anticodon modifications that distinguish synonymous decoding. AGI is plastic and can be used by different species to different extents, with tissue-specificity and in stress responses. Because AGI is species-specific, it is important to consider codon-sensitive experiments when using heterologous systems; for this we focus on the tRNA anticodon loop modification enzyme, CDKAL1, and its link to type 2 diabetes. Newly uncovered tRNAome variability among humans suggests roles in penetrance and as a genetic modifier and disease modifier. Development of experimental and bioinformatics methods are needed to uncover additional means of auxiliary genetic information.
Collapse
Affiliation(s)
- Richard J. Maraia
- Intramural Research Program on Genomics of Differentiation, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
- Corresponding authorE-mail
| | - James R. Iben
- Intramural Research Program on Genomics of Differentiation, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
14
|
Abstract
Frameshifting results from two main mechanisms: genomic insertions or deletions (indels) or programmed ribosomal frameshifting. Whereas indels can disrupt normal protein function, programmed ribosomal frameshifting can result in dual-coding genes, each of which can produce multiple functional products. Here, I summarize technical advances that have made it possible to identify programmed ribosomal frameshifting events in a systematic way. The results of these studies suggest that such frameshifting occurs in all genomes, and I will discuss methods that could help characterize the resulting alternative proteomes.
Collapse
Affiliation(s)
- Robin Ketteler
- MRC Laboratory for Molecular Cell Biology, Translational Research Resource Centre, University College London London, UK
| |
Collapse
|
15
|
Selection on Synonymous Sites for Increased Accessibility around miRNA Binding Sites in Plants. Mol Biol Evol 2012; 29:3037-44. [DOI: 10.1093/molbev/mss109] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
16
|
Kloc M, Foreman V, Reddy SA. Binary function of mRNA. Biochimie 2011; 93:1955-61. [PMID: 21784124 DOI: 10.1016/j.biochi.2011.07.008] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2011] [Accepted: 07/08/2011] [Indexed: 12/21/2022]
Abstract
Since the discovery of messenger RNA (mRNA) over half a century ago, the assumption has always been that the only function of mRNA is to make a protein. However, recent studies of prokaryotic and eukaryotic organisms unexpectedly show that some mRNAs may be functionally binary and have additional structural functions that are unrelated to their translation product. These findings imply that some of the phenotypic features of cells and organisms can also be binary, that is, they depend both on the function of a protein and the independent structural function of its mRNA. In this review, we will discuss this concept within the framework of multifunctional RNA molecules and the RNA World Hypothesis.
Collapse
Affiliation(s)
- Malgorzata Kloc
- Department of Surgery, The Methodist Hospital and The Methodist Hospital Research Institute, 6565 Fannin St., Houston, TX 77030, USA.
| | | | | |
Collapse
|
17
|
Dinger ME, Gascoigne DK, Mattick JS. The evolution of RNAs with multiple functions. Biochimie 2011; 93:2013-8. [PMID: 21802485 DOI: 10.1016/j.biochi.2011.07.018] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2011] [Accepted: 07/12/2011] [Indexed: 01/21/2023]
Abstract
Increasing numbers of transcripts have been reported to transmit both protein-coding and regulatory information. Apart from challenging our conception of the gene, this observation raises the question as to what extent this phenomenon occurs across the genome and how and why such dual encoding of function has evolved in the eukaryotic genome. To address this question, we consider the evolutionary path of genes in the earliest forms of life on Earth, where it is generally regarded that proteins evolved from a cellular machinery based entirely within RNA. This led to the domination of protein-coding genes in the genomes of microorganisms, although it is likely that RNA never lost its other capacities and functionalities, as evidenced by cis-acting riboswitches and UTRs. On the basis that the subsequent evolution of a more sophisticated regulatory architecture to provide higher levels of epigenetic control and accurate spatiotemporal expression in developmentally complex organisms is a complicated task, we hypothesize: (i) that mRNAs have been and remain subject to secondary selection to provide trans-acting regulatory capability in parallel with protein-coding functions; (ii) that some and perhaps many protein-coding loci, possibly as a consequence of gene duplication, have lost protein-coding functions en route to acquiring more sophisticated trans-regulatory functions; (iii) that many transcripts have become subject to secondary processing to release different products; and (iv) that novel proteins have emerged within loci that previously evolved functionality as regulatory RNAs. In support of the idea that there is a dynamic flux between different types of informational RNAs in both evolutionary and real time, we review recent observations that have arisen from transcriptomic surveys of complex eukaryotes and reconsider how these observations impact on the notion that apparently discrete loci may express transcripts with more than one function. In conclusion, we posit that many eukaryotic loci have evolved the capacity to transact a multitude of overlapping and potentially independent functions as both regulatory and protein-coding RNAs.
Collapse
Affiliation(s)
- Marcel E Dinger
- Institute for Molecular Bioscience, University of Queensland, 306 Carmody Road, St Lucia, QLD 4072, Australia.
| | | | | |
Collapse
|
18
|
Error compensation of tRNA misacylation by codon-anticodon mismatch prevents translational amino acid misinsertion. Comput Biol Chem 2011; 35:81-95. [PMID: 21470914 DOI: 10.1016/j.compbiolchem.2011.03.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2010] [Revised: 02/22/2011] [Accepted: 03/01/2011] [Indexed: 11/20/2022]
Abstract
Codon-anticodon mismatches and tRNA misloadings cause translational amino acid misinsertions, producing dysfunctional proteins. Here I explore the original hypothesis whether mismatches tend to compensate misacylation, so as to insert the amino acid coded by the codon. This error compensation is promoted by the fact that codon-anticodon mismatch stabilities increase with tRNA misacylation potentials (predicted by 'tfam') by non-cognate amino acids coded by the mismatched codons for most tRNAs examined. Error compensation is independent of preferential misacylation by non-cognate amino acids physico-chemically similar to cognate amino acids, a phenomenon that decreases misinsertion impacts. Error compensation correlates negatively with (a) codon/anticodon abundance (in human mitochondria and Escherichia coli); (b) developmental instability (estimated by fluctuating asymmetry in bilateral counts of subdigital lamellae, in each of two lizard genera, Anolis and Sceloporus); and (c) pathogenicity of human mitochondrial tRNA polymorphisms. Patterns described here suggest that tRNA misacylation is sometimes compensated by codon-anticodon mismatches. Hence translation inserts the amino acid coded by the mismatched codon, despite mismatch and misloading. Results suggest that this phenomenon is sufficiently important to affect whole organism phenotypes, as shown by correlations with pathologies and morphological estimates of developmental stability.
Collapse
|
19
|
Abstract
Alterations to the genetic code--codon reassignments--have occurred many times in life's history, despite the fact that genomes are coadapted to their genetic codes and therefore alterations are likely to be maladaptive. A potential mechanism for adaptive codon reassignment, which could trigger either a temporary period of codon ambiguity or a permanent genetic code change, is the reactivation of a pseudogene by a nonsense suppressor mutant transfer RNA. I examine the population genetics of each stage of this process and find that pseudogene rescue is plausible and also readily explains some features of extant variability in genetic codes.
Collapse
Affiliation(s)
- L J Johnson
- School of Biological Sciences, University of Reading, Reading, UK.
| |
Collapse
|
20
|
Görnerup O, Jacobi MN. A model-independent approach to infer hierarchical codon substitution dynamics. BMC Bioinformatics 2010; 11:201. [PMID: 20412602 PMCID: PMC2868013 DOI: 10.1186/1471-2105-11-201] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2009] [Accepted: 04/23/2010] [Indexed: 12/03/2022] Open
Abstract
Background Codon substitution constitutes a fundamental process in molecular biology that has been studied extensively. However, prior studies rely on various assumptions, e.g. regarding the relevance of specific biochemical properties, or on conservation criteria for defining substitution groups. Ideally, one would instead like to analyze the substitution process in terms of raw dynamics, independently of underlying system specifics. In this paper we propose a method for doing this by identifying groups of codons and amino acids such that these groups imply closed dynamics. The approach relies on recently developed spectral and agglomerative techniques for identifying hierarchical organization in dynamical systems. Results We have applied the techniques on an empirically derived Markov model of the codon substitution process that is provided in the literature. Without system specific knowledge of the substitution process, the techniques manage to "blindly" identify multiple levels of dynamics; from amino acid substitutions (via the standard genetic code) to higher order dynamics on the level of amino acid groups. We hypothesize that the acquired groups reflect earlier versions of the genetic code. Conclusions The results demonstrate the applicability of the techniques. Due to their generality, we believe that they can be used to coarse grain and identify hierarchical organization in a broad range of other biological systems and processes, such as protein interaction networks, genetic regulatory networks and food webs.
Collapse
Affiliation(s)
- Olof Görnerup
- Complex Systems Group, Department of Energy and Environment, Chalmers University of Technology, 412 96 Göteborg, Sweden.
| | | |
Collapse
|
21
|
Salato VK, Rediske NW, Zhang C, Hastings ML, Munroe SH. An exonic splicing enhancer within a bidirectional coding sequence regulates alternative splicing of an antisense mRNA. RNA Biol 2010; 7:179-90. [PMID: 20200494 DOI: 10.4161/rna.7.2.11182] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The discovery of increasing numbers of genes with overlapping sequences highlights the problem of expression in the context of constraining regulatory elements from more than one gene. This study identifies regulatory sequences encompassed within two genes that overlap in an antisense orientation at their 3' ends. The genes encode the alpha-thyroid hormone receptor gene (TRalpha or NR1A1) and Rev-erbalpha (NR1D1). In mammals TRalpha pre-mRNAs are alternatively spliced to yield mRNAs encoding functionally antagonistic proteins: TRalpha1, an authentic thyroid hormone receptor; and TRalpha2, a non-hormone-binding variant that acts as a repressor. TRalpha2-specific splicing requires two regulatory elements that overlap with Rev-erbalpha sequences. Functional mapping of these elements reveals minimal splicing enhancer elements that have evolved within the constraints of the overlapping Rev-erbalpha sequence. These results provide insight into the evolution of regulatory elements within the context of bidirectional coding sequences. They also demonstrate the ability of the genetic code to accommodate multiple layers of information within a given sequence, an important property of the code recently suggested on theoretical grounds.
Collapse
Affiliation(s)
- Valerie K Salato
- Department of Biological Sciences, Marquette University, Milwaukee, WI, USA
| | | | | | | | | |
Collapse
|
22
|
Das S, Yennamalli RM, Vishnoi A, Gupta P, Bhattacharya A. Single-nucleotide variations associated with Mycobacterium tuberculosis KwaZulu-Natal strains. J Biosci 2009; 34:397-404. [PMID: 19805901 DOI: 10.1007/s12038-009-0046-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The occurrence of drug resistance in Mycobacterium tuberculosis, the aetiological agent of tuberculosis (TB), is hampering the management and control of TB in the world. Here we present a computational analysis of recently sequenced drug-sensitive (DS), multidrug-resistant (MDR) and extensively drug-resistant (XDR) strains of M. tuberculosis. Single-nucleotide variations (SNVs) were identified in a pair-wise manner using the anchor-based whole genome comparison (ABWGC) tool and its modified version. For this analysis, four fully sequenced genomes of different strains of M. tuberculosis were taken along with three KwaZulu-Natal (KZN) strains isolated from South Africa including one XDR and one MDR strain. KZN strains were compared with other fully sequenced strains and also among each other. The variations were analysed with respect to their biological influence as a result of either altered structure or synthesis. The results suggest that the DR phenotype may be due to changes in a number of genes. The database on KZN strains can be accessed through the website http://mirna.jnu.ac.in/mgdd/.
Collapse
Affiliation(s)
- Sarbashis Das
- Center for Computational Biology and Bioinformatics, School of Information Technology, Jawaharlal Nehru University, New Delhi 110 067, India
| | | | | | | | | |
Collapse
|
23
|
Bina M, Wyss P, Lazarus SA, Shah SR, Ren W, Szpankowski W, Crawford GE, Park SP, Song XC. Discovering sequences with potential regulatory characteristics. Genomics 2009; 93:314-22. [DOI: 10.1016/j.ygeno.2008.11.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2007] [Revised: 05/28/2008] [Accepted: 11/17/2008] [Indexed: 11/25/2022]
|
24
|
Frenkel FE, Korotkov EV. Using triplet periodicity of nucleotide sequences for finding potential reading frame shifts in genes. DNA Res 2009; 16:105-14. [PMID: 19261626 PMCID: PMC2671204 DOI: 10.1093/dnares/dsp002] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
We introduce a novel approach for the detection of possible mutations leading to a reading frame (RF) shift in a gene. Deletions and insertions of DNA coding regions are considerable events for genes because an RF shift results in modifications of the extensive region of amino acid sequence coded by a gene. The suggested method is based on the phenomenon of triplet periodicity (TP) in coding regions of genes and its relative resistance to substitutions in DNA sequence. We attempted to extend 326 933 regions of continuous TP found in genes from the KEGG databank by considering possible insertions and deletions. We revealed totally 824 genes where such extension was possible and statistically significant. Then we generated amino acid sequences according to active (KEGG's) and hypothetically ancient RFs in order to find confirmation of a shift at a protein level. Consequently, 64 sequences have protein similarities only for ancient RF, 176 only for active RF, 3 for both and 581 have no protein similarity at all. We aimed to have revealed lower bound for the number of genes in which a shift between RF and TP is possible. Further ways to increase the number of revealed RF shifts are discussed.
Collapse
Affiliation(s)
- F E Frenkel
- Bioengineering Centre of RAS, 60-letiya Oktyabrya prosp., 7/1, Moscow, Russia.
| | | |
Collapse
|
25
|
Abstract
Non-protein-coding sequences increasingly dominate the genomes of multicellular organisms as their complexity increases, in contrast to protein-coding genes, which remain relatively static. Most of the mammalian genome and indeed that of all eukaryotes is expressed in a cell- and tissue-specific manner, and there is mounting evidence that much of this transcription is involved in the regulation of differentiation and development. Different classes of small and large noncoding RNAs (ncRNAs) have been shown to regulate almost every level of gene expression, including the activation and repression of homeotic genes and the targeting of chromatin-remodeling complexes. ncRNAs are involved in developmental processes in both simple and complex eukaryotes, and we illustrate this in the latter by focusing on the animal germline, brain, and eye. While most have yet to be systematically studied, the emerging evidence suggests that there is a vast hidden layer of regulatory ncRNAs that constitutes the majority of the genomic programming of multicellular organisms and plays a major role in controlling the epigenetic trajectories that underlie their ontogeny.
Collapse
|
26
|
DNA codes and information: formal structures and relational causes. Acta Biotheor 2008; 56:205-32. [PMID: 18465197 DOI: 10.1007/s10441-008-9049-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2007] [Accepted: 03/09/2008] [Indexed: 02/03/2023]
Abstract
Recently the terms "codes" and "information" as used in the context of molecular biology have been the subject of much discussion. Here I propose that a variety of structural realism can assist us in rethinking the concepts of DNA codes and information apart from semantic criteria. Using the genetic code as a theoretical backdrop, a necessary distinction is made between codes qua symbolic representations and information qua structure that accords with data. Structural attractors are also shown to be entailed by the mapping relation that any DNA code is a part of (as the domain). In this framework, these attractors are higher-order informational structures that obviate any "DNA-centric" reductionism. In addition to the implications that are discussed, this approach validates the array of coding systems now recognized in molecular biology.
Collapse
|
27
|
Abstract
The codon table for the canonical genetic code can be rearranged in such a way that the code is divided into four quarters and two halves according to the variability of their GC and purine contents, respectively. For prokaryotic genomes, when the genomic GC content increases, their amino acid contents tend to be restricted to the GC-rich quarter and the purine-content insensitive half, where all codons are fourfold degenerate and relatively mutation-tolerant. Conversely, when the genomic GC content decreases, most of the codons retract to the AU-rich quarter and the purine-content sensitive half; most of the codons not only remain encoding physicochemically diversified amino acids but also vary when transversion (between purine and pyrimidine) happens. Amino acids with sixfold-degenerate codons are distributed into all four quarters and across the two halves; their fourfold-degenerate codons are all partitioned into the purine-insensitive half in favorite of robustness against mutations. The features manifested in the rearranged codon table explain most of the intrinsic relationship between protein coding sequences (the informational content) and amino acid compositions (the functional content). The renovated codon table is useful in predicting abundant amino acids and positioning the amino acids with related or distinct physicochemical properties.
Collapse
Affiliation(s)
- Jun Yu
- Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China.
| |
Collapse
|
28
|
Abstract
While less than 1.5% of the mammalian genome encodes proteins, it is now evident that the vast majority is transcribed, mainly into non-protein-coding RNAs. This raises the question of what fraction of the genome is functional, i.e., composed of sequences that yield functional products, are required for the expression (regulation or processing) of these products, or are required for chromosome replication and maintenance. Many of the observed noncoding transcripts are differentially expressed, and, while most have not yet been studied, increasing numbers are being shown to be functional and/or trafficked to specific subcellular locations, as well as exhibit subtle evidence of selection. On the other hand, analyses of conservation patterns indicate that only approximately 5% (3%-8%) of the human genome is under purifying selection for functions common to mammals. However, these estimates rely on the assumption that reference sequences (usually ancient transposon-derived sequences) have evolved neutrally, which may not be the case, and if so would lead to an underestimate of the fraction of the genome under evolutionary constraint. These analyses also do not detect functional sequences that are evolving rapidly and/or have acquired lineage-specific functions. Indeed, many regulatory sequences and known functional noncoding RNAs, including many microRNAs, are not conserved over significant evolutionary distances, and recent evidence from the ENCODE project suggests that many functional elements show no detectable level of sequence constraint. Thus, it is likely that much more than 5% of the genome encodes functional information, and although the upper bound is unknown, it may be considerably higher than currently thought.
Collapse
Affiliation(s)
- Michael Pheasant
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia, Queensland 4072, Australia
| | | |
Collapse
|