Brown AP. Codon-level analysis of histone primary sequence: evidence of a repeat tetrapeptide origin and later inclusion of transcribed sequence.
J Theor Biol 1983;
104:401-16. [PMID:
6361392 DOI:
10.1016/0022-5193(83)90114-5]
[Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
This work is directed to the question of protein sequence conservation. By reference to the genetic code the aminoacyl sequence of histones H2A, H4, H3, H2B and H1 (fragment) were rewritten as the codon sequences. The N-terminal regions were set aside on the grounds of different composition and sequence. The remainder of the molecule could be referred to simple repeat-tetrapeptide proteins by codon composition (high Gxy, low xGy content) and by sequence. Random segments of three to six residues occur characterized by composition and sequence as originating from the complimentary DNA strand, i.e. as codon "transcript". Ancestral features are probably best seen in H3, point mutations appear to be more extensive in H2B and H1. Segments in reverse order in H2A and in "transcript" in H4 distinguish these two from the other three histones. There is a tenuous possibility the N-terminals also originated as repeat-tetrapeptide now intensively modified. At codon-level the 50S ribosomal protein (L7/L12) of E. coli has features in common with histones (including a palindrome-containing N-terminal). It has the composition and sequence of a well-conserved tetrapeptide-repeat strand (statistical support). If interpretations made here are substantially correct, the 50S r-protein illustrates a significant stage in evolution of histone codon strands.
Collapse