Adindla S, Inampudi KK, Guruprasad K, Guruprasad L. Identification and analysis of novel tandem repeats in the cell surface proteins of archaeal and bacterial genomes using computational tools.
Comp Funct Genomics 2010;
5:2-16. [PMID:
18629042 PMCID:
PMC2447325 DOI:
10.1002/cfg.358]
[Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2003] [Revised: 10/21/2003] [Accepted: 10/23/2003] [Indexed: 11/30/2022] Open
Abstract
We have identified four novel repeats and two domains in cell surface proteins
encoded by the Methanosarcina acetivorans genome and in some archaeal and
bacterial genomes. The repeats correspond to a certain number of amino acid residues
present in tandem in a protein sequence and each repeat is characterized by conserved
sequence motifs. These correspond to: (a) a 42 amino acid (aa) residue RIVW repeat;
(b) a 45 aa residue LGxL repeat; (c) a 42 aa residue LVIVD repeat; and (d) a 54 aa
residue LGFP repeat. The domains correspond to a certain number of aa residues in
a protein sequence that do not comprise internal repeats. These correspond to: (a) a
200 aa residue DNRLRE domain; and (b) a 70 aa residue PEGA domain. We discuss
the occurrence of these repeats and domains in the different proteins and genomes
analysed in this work.
Collapse