Reference Citation Analysis

For: Michel CJ. New statistical approach to discriminate between protein coding and non-coding regions in DNA sequences and its evaluation. J Theor Biol 1986;120:223-36. [PMID: 3784581 DOI: 10.1016/s0022-5193(86)80176-x] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.6] [Indexed: 01/07/2023]

Cited by Other Article(s)

Michel CJ. Circular code in introns. Biosystems 2024;239:105215. [PMID: 38641199 DOI: 10.1016/j.biosystems.2024.105215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/12/2024] [Revised: 04/14/2024] [Accepted: 04/15/2024] [Indexed: 04/21/2024]

Michel CJ, Sereni JS. Reading Frame Retrieval of Genes: A New Parameter of Codon Usage Based on the Circular Code Theory. Bull Math Biol 2023;85:24. [PMID: 36826719 PMCID: PMC9950712 DOI: 10.1007/s11538-023-01129-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/25/2022] [Accepted: 01/26/2023] [Indexed: 02/25/2023]

Michel CJ, Thompson JD. Identification of a circular code periodicity in the bacterial ribosome: origin of codon periodicity in genes? RNA Biol 2020;17:571-583. [PMID: 31960748 DOI: 10.1080/15476286.2020.1719311] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Indexed: 02/09/2023]

Abstract

Three-base periodicity (TBP), where nucleotides and higher order n-tuples are preferentially spaced by 3, 6, 9, etc. bases, is a well-known intrinsic property of protein-coding DNA sequences. However, its origins are still not fully understood. One hypothesis is that the periodicity reflects a primordial coding system that was used before the emergence of the modern standard genetic code (SGC). Recent evidence suggests that the X circular code, a set of 20 trinucleotides allowing the reading frames in genes to be retrieved locally, represents a possible ancestor of the SGC. Motifs from the X circular code have been found in the reading frame of protein-coding regions in extant organisms from bacteria to eukaryotes, in many transfer RNA (tRNA) genes and in important functional regions of the ribosomal RNA (rRNA), notably in the peptidyl transferase centre and the decoding centre. Here, we have used a powerful correlation function to search for periodicity patterns involving the 20 trinucleotides of the X circular code in a large set of bacterial protein-coding genes, as well as in the translation machinery, including rRNA and tRNA sequences. As might be expected, we found a strong circular code periodicity 0 modulo 3 in the protein-coding genes. More surprisingly, we also identified a similar circular code periodicity in a large region of the 16S rRNA. This region includes the 3' major domain corresponding to the primordial proto-ribosome decoding centre and containing numerous sites that interact with the tRNA and messenger RNA (mRNA) during translation. Furthermore, 3D structural analysis shows that the periodicity region surrounds the mRNA channel that lies between the head and the body of the SSU. Our results support the hypothesis that the X circular code may constitute an ancestral translation code involved in reading frame retrieval and maintenance, traces of which persist in modern mRNA, tRNA and rRNA despite their long evolution and adaptation to the SGC.
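
The kind of periodicity search described in this abstract can be illustrated with a minimal sketch. It is not the correlation function used in the paper, which the abstract does not define. The helper `x_autocorrelation` is a hypothetical name introduced here: it only measures how often two trinucleotides of X start at positions separated by a given distance. The set `X` is the 20-trinucleotide code as it is commonly listed in the circular code literature.

```python
# The 20 trinucleotides of the X circular code (as commonly listed in the literature).
X = frozenset({
    "AAC", "AAT", "ACC", "ATC", "ATT", "CAG", "CTC", "CTG", "GAA", "GAC",
    "GAG", "GAT", "GCC", "GGC", "GGT", "GTA", "GTC", "GTT", "TAC", "TTC",
})


def x_autocorrelation(seq, max_dist):
    """Hypothetical helper: for each distance d in 1..max_dist, return the
    fraction of position pairs (i, i + d) at which both trinucleotides
    starting there belong to X."""
    seq = seq.upper()
    # hits[i] is True when the trinucleotide starting at position i is in X.
    hits = [seq[i:i + 3] in X for i in range(len(seq) - 2)]
    profile = {}
    for d in range(1, max_dist + 1):
        pairs = len(hits) - d
        if pairs <= 0:
            break  # sequence too short for this distance
        both = sum(1 for i in range(pairs) if hits[i] and hits[i + d])
        profile[d] = both / pairs
    return profile


if __name__ == "__main__":
    # Toy sequence (an assumption for illustration): GAA repeated in frame.
    profile = x_autocorrelation("GAA" * 10, 9)
    for d, value in profile.items():
        print(d, d % 3, round(value, 3))
```

On a sequence with a periodicity 0 modulo 3, the values at distances 3, 6, 9 would stand out from those at the other distances; real analyses would need a proper statistical baseline, which this sketch does not provide.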

Fimmel E, Michel CJ, Pirot F, Sereni JS, Strüngmann L. Mixed circular codes. Math Biosci 2019;317:108231. [PMID: 31325443 DOI: 10.1016/j.mbs.2019.108231] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 09/07/2018] [Revised: 07/16/2019] [Accepted: 07/17/2019] [Indexed: 12/11/2022]

Fimmel E, Gumbel M, Karpuzoglu A, Petoukhov S. On comparing composition principles of long DNA sequences with those of random ones. Biosystems 2019;180:101-108. [DOI: 10.1016/j.biosystems.2019.04.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 02/11/2019] [Revised: 04/05/2019] [Accepted: 04/06/2019] [Indexed: 11/25/2022]

Diletter circular codes over finite alphabets. Math Biosci 2017;294:120-129. [PMID: 29024747 DOI: 10.1016/j.mbs.2017.10.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Received: 05/27/2017] [Revised: 08/26/2017] [Accepted: 10/08/2017] [Indexed: 11/22/2022]

Fimmel E, Michel CJ, Strüngmann L. n-Nucleotide circular codes in graph theory. Philos Trans A Math Phys Eng Sci 2016;374:rsta.2015.0058. [PMID: 26857680 DOI: 10.1098/rsta.2015.0058] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Accepted: 09/05/2015] [Indexed: 06/05/2023]

Abstract

The circular code theory proposes that genes are constituted of two trinucleotide codes: the classical genetic code with 61 trinucleotides for coding the 20 amino acids (except the three stop codons {TAA,TAG,TGA}) and a circular code based on 20 trinucleotides for retrieving, maintaining and synchronizing the reading frame. It relies on two main results: the identification of a maximal C(3) self-complementary trinucleotide circular code X in genes of bacteria, eukaryotes, plasmids and viruses (Michel 2015 J. Theor. Biol. 380, 156-177. (doi:10.1016/j.jtbi.2015.04.009); Arquès & Michel 1996 J. Theor. Biol. 182, 45-58. (doi:10.1006/jtbi.1996.0142)) and the finding of X circular code motifs in tRNAs and rRNAs, in particular in the ribosome decoding centre (Michel 2012 Comput. Biol. Chem. 37, 24-37. (doi:10.1016/j.compbiolchem.2011.10.002); El Soufi & Michel 2014 Comput. Biol. Chem. 52, 9-17. (doi:10.1016/j.compbiolchem.2014.08.001)). The universally conserved nucleotides A1492 and A1493 and the conserved nucleotide G530 are included in X circular code motifs. Recently, dinucleotide circular codes were also investigated (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. (doi:10.1155/2013/538631); Fimmel et al. 2015 J. Theor. Biol. 386, 159-165. (doi:10.1016/j.jtbi.2015.08.034)). As genetic motifs of different lengths are ubiquitous in genes and genomes, we introduce a new approach based on graph theory to study in full generality n-nucleotide circular codes X, i.e. of length 2 (dinucleotide), 3 (trinucleotide), 4 (tetranucleotide), etc. Indeed, we prove that an n-nucleotide code X is circular if and only if the corresponding graph [Formula: see text] is acyclic. Moreover, the maximal length of a path in [Formula: see text] corresponds to the window of nucleotides in a sequence for detecting the correct reading frame. Finally, the graph theory of tournaments is applied to the study of dinucleotide circular codes. It is fully equivalent to both the combinatorics theory (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. (doi:10.1155/2013/538631)) and the group theory (Fimmel et al. 2015 J. Theor. Biol. 386, 159-165. (doi:10.1016/j.jtbi.2015.08.034)) of dinucleotide circular codes, while its mathematical approach is simpler.
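
The acyclicity criterion stated in this abstract can be sketched in a few lines. The abstract elides the graph definition ("[Formula: see text]"), so the construction below is an assumption based on the usual one in the circular code literature: every word of the code is cut at each internal position, and each cut adds a directed edge from the prefix to the suffix. The names `code_graph` and `is_circular` are hypothetical helpers, not taken from the paper.

```python
from collections import defaultdict


def code_graph(code):
    """Assumed construction: for each word and each internal cut position,
    add a directed edge prefix -> suffix."""
    graph = defaultdict(set)
    for word in code:
        for i in range(1, len(word)):
            graph[word[:i]].add(word[i:])
    return graph


def is_circular(code):
    """Return True when the graph associated with the code has no directed cycle."""
    graph = code_graph(code)
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the current DFS path, finished
    state = {}

    def has_cycle(vertex):
        state[vertex] = GREY
        for nxt in graph.get(vertex, ()):
            colour = state.get(nxt, WHITE)
            # A GREY neighbour closes a cycle; a WHITE one is explored recursively.
            if colour == GREY or (colour == WHITE and has_cycle(nxt)):
                return True
        state[vertex] = BLACK
        return False

    return not any(
        state.get(v, WHITE) == WHITE and has_cycle(v) for v in list(graph)
    )


if __name__ == "__main__":
    # The 20 trinucleotides of the X circular code, as commonly listed.
    X = {
        "AAC", "AAT", "ACC", "ATC", "ATT", "CAG", "CTC", "CTG", "GAA", "GAC",
        "GAG", "GAT", "GCC", "GGC", "GGT", "GTA", "GTC", "GTT", "TAC", "TTC",
    }
    print(is_circular(X))
    print(is_circular({"ACG", "CGA"}))  # two circular permutations of one word
```

A depth-first search is used here only because it is the simplest way to detect a directed cycle; the longest-path computation mentioned in the abstract is not covered by this sketch.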

Gonzalez D, Giannerini S, Rosa R. Circular codes revisited: A statistical approach. J Theor Biol 2011;275:21-8. [DOI: 10.1016/j.jtbi.2011.01.028] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Received: 09/01/2010] [Revised: 01/18/2011] [Accepted: 01/19/2011] [Indexed: 11/29/2022]

Michel CJ. Evolution probabilities and phylogenetic distance of dinucleotides. J Theor Biol 2007;249:271-7. [PMID: 17884102 DOI: 10.1016/j.jtbi.2007.07.032] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 02/18/2007] [Revised: 07/18/2007] [Accepted: 07/20/2007] [Indexed: 11/15/2022]

Laskin AA, Kudryashov NA, Skryabin KG, Korotkov EV. Latent periodicity of serine-threonine and tyrosine protein kinases and other protein families. Comput Biol Chem 2005;29:229-43. [PMID: 15979043 DOI: 10.1016/j.compbiolchem.2005.04.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Received: 11/27/2004] [Revised: 04/18/2005] [Accepted: 04/18/2005] [Indexed: 11/22/2022]

Xu R, Xiao Y. A common sequence-associated physicochemical feature for proteins of beta-trefoil family. Comput Biol Chem 2005;29:79-82. [PMID: 15680588 DOI: 10.1016/j.compbiolchem.2004.12.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Received: 11/12/2004] [Revised: 12/10/2004] [Accepted: 12/10/2004] [Indexed: 11/26/2022]

Vendramini D. Noncoding DNA and the teem theory of inheritance, emotions and innate behaviour. Med Hypotheses 2005;64:512-9. [PMID: 15617858 DOI: 10.1016/j.mehy.2004.08.022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 08/16/2004] [Accepted: 08/25/2004] [Indexed: 10/26/2022]

Holste D, Grosse I, Buldyrev SV, Stanley HE, Herzel H. Optimization of coding potentials using positional dependence of nucleotide frequencies. J Theor Biol 2000;206:525-37. [PMID: 11013113 DOI: 10.1006/jtbi.2000.2144] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Indexed: 11/22/2022]

Gelfand MS. Prediction of function in DNA sequence analysis. J Comput Biol 1995;2:87-115. [PMID: 7497122 DOI: 10.1089/cmb.1995.2.87] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.1] [Indexed: 01/25/2023]

Fickett JW, Tung CS. Assessment of protein coding measures. Nucleic Acids Res 1992;20:6441-50. [PMID: 1480466 PMCID: PMC334555 DOI: 10.1093/nar/20.24.6441] [Citation(s) in RCA: 259] [Impact Index Per Article: 8.1] [Indexed: 12/27/2022]

Michel CJ. A study of the purine/pyrimidine codon occurrence with a reduced centered variable and an evaluation compared to the frequency statistic. Math Biosci 1989;97:161-77. [PMID: 2520209 DOI: 10.1016/0025-5564(89)90003-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Indexed: 01/01/2023]

Sibbald PR. Patterns of base usage, nearest neighbour analysis and identification of genes in two completely sequenced chloroplast genomes. Curr Genet 1988. [DOI: 10.1007/bf02427759] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Indexed: 10/24/2022]

Louis BG, Ganoza MC. Signals determining translational start-site recognition in eukaryotes and their role in prediction of genetic reading frames. Mol Biol Rep 1988;13:103-15. [PMID: 3221841 DOI: 10.1007/bf00539058] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Indexed: 01/04/2023]

Abstract

A special methionyl-tRNA (tRNAi) is universally required to initiate translation. The conservation of this reactant throughout evolution, as well as its unusual decoding properties, suggested an alternate mechanism for tRNA-mRNA interactions at initiation. We have reported that the sequences of bases neighboring the start codons of many eubacterial genes are complementary not only to the 16S rRNA 3' end and to the anticodon of tRNAi but also have the potential to base-pair with the D, T or extended anticodon loops of this tRNAi. The coding properties of tRNAi and mutations that affect translation suggest that these signals may function. This hypothesis explains the observation that unusual triplets can start prokaryotic and mitochondrial genes and predicts the occurrence of other reading frames. Furthermore, it suggests a unifying model of chain initiation based on RNA-RNA contacts and displacements. Here we examine the start domain of 290 eukaryotic genes for their ability to base-pair with the tRNAi loops and the 18S rRNA. We observe that both methionine start and methionine coding regions have the potential to pair with the 18S rRNA, but that the nucleotide distribution about start codons strongly favoured such pairings over that near internal AUGs. The 5' extended anticodon of tRNAi is methylated, and was not represented in the mRNA with high frequency. However, the tetramer AUGg did occur with high frequency in the start domain. A modification of the tRNAi T loop also decreases its base-pairing potential. Interestingly, complementarity to the T loop did not occur with high frequency in the start sites. The early coding region, 10 to 34 nucleotides 3' to the initiator AUG, is complementary to the tRNAi D loop in many cases, while no such affinity is found near internal AUGs. The nucleotides around initiator AUGs were heavily biased toward the sequence gccaccAUGgcg. No such tendency was noted around internal AUGs. Although the role of this sequence bias is unclear, the sequence gccaccAUGg has been shown by Kozak to promote initiation. Another distinguishing feature was a C-rich tract 7 to 34 nucleotides 5' to the initiator AUGs. Ability to pair with more than eight bases of the start consensus sequence, matching of 6 or 7 nucleotides to the D loop on the 3' side, and C-richness on the 5' side were used as criteria for distinguishing start AUGs. (ABSTRACT TRUNCATED AT 400 WORDS)

Arquès DG, Michel CJ. Periodicities in introns. Nucleic Acids Res 1987;15:7581-92. [PMID: 3658704 PMCID: PMC306269 DOI: 10.1093/nar/15.18.7581] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.7] [Indexed: 01/06/2023]

Study of a perturbation in the coding periodicity. Math Biosci 1987. [DOI: 10.1016/0025-5564(87)90060-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Indexed: 11/24/2022]