Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Neuwald AF, Liu JS, Lipman DJ, Lawrence CE. Extracting protein alignment models from the sequence database. Nucleic Acids Res 1997;25:1665-77. [PMID: 9108146 PMCID: PMC146639 DOI: 10.1093/nar/25.9.1665] [Citation(s) in RCA: 180] [Impact Index Per Article: 6.7] [Indexed: 02/04/2023] Open

Cited by Other Article(s)

Lee I, Hong W. RAP--a putative RNA-binding domain. Trends Biochem Sci 2005;29:567-70. [PMID: 15501674 DOI: 10.1016/j.tibs.2004.09.005] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Indexed: 11/29/2022]

Frère JM, Galleni M, Bush K, Dideberg O. Is it necessary to change the classification of beta-lactamases? J Antimicrob Chemother 2005;55:1051-3. [PMID: 15886262 DOI: 10.1093/jac/dki155] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Indexed: 11/12/2022] Open

Likić VA, Perry A, Hulett J, Derby M, Traven A, Waller RF, Keeling PJ, Koehler CM, Curran SP, Gooley PR, Lithgow T. Patterns that Define the Four Domains Conserved in Known and Novel Isoforms of the Protein Import Receptor Tom20. J Mol Biol 2005;347:81-93. [PMID: 15733919 DOI: 10.1016/j.jmb.2004.12.057] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Received: 08/15/2004] [Revised: 12/17/2004] [Accepted: 12/27/2004] [Indexed: 11/22/2022]

Spence P, Bard J, Jones P, Betty M. The identification of G-protein coupled receptors in sequence databases. Expert Opin Ther Pat 2005. [DOI: 10.1517/13543776.8.3.235] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 11/05/2022]

Raphael B, Zhi D, Tang H, Pevzner P. A novel method for multiple alignment of sequences with repeated and shuffled elements. Genome Res 2005;14:2336-46. [PMID: 15520295 PMCID: PMC525693 DOI: 10.1101/gr.2657504] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Indexed: 11/24/2022]

Conklin D. Recognition of the Helical Cytokine Fold. J Comput Biol 2004;11:1189-200. [PMID: 15662206 DOI: 10.1089/cmb.2004.11.1189] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Indexed: 11/13/2022] Open

Neuwald AF, Liu JS. Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model. BMC Bioinformatics 2004;5:157. [PMID: 15504234 PMCID: PMC538276 DOI: 10.1186/1471-2105-5-157] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.6] [Received: 05/19/2004] [Accepted: 10/25/2004] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Certain protein families are highly conserved across distantly related organisms and belong to large and functionally diverse superfamilies. The patterns of conservation present in these protein sequences presumably are due to selective constraints maintaining important but unknown structural mechanisms with some constraints specific to each family and others shared by a larger subset or by the entire superfamily. To exploit these patterns as a source of functional information, we recently devised a statistically based approach called contrast hierarchical alignment and interaction network (CHAIN) analysis, which infers the strengths of various categories of selective constraints from co-conserved patterns in a multiple alignment. The power of this approach strongly depends on the quality of the multiple alignments, which thus motivated development of theoretical concepts and strategies to improve alignment of conserved motifs within large sets of distantly related sequences.

RESULTS

Here we describe a hidden Markov model (HMM), an algebraic system, and Markov chain Monte Carlo (MCMC) sampling strategies for alignment of multiple sequence motifs. The MCMC sampling strategies are useful both for alignment optimization and for adjusting position specific background amino acid frequencies for alignment uncertainties. Associated statistical formulations provide an objective measure of alignment quality as well as automatic gap penalty optimization. Improved alignments obtained in this way are compared with PSI-BLAST based alignments within the context of CHAIN analysis of three protein families: Gialpha subunits, prolyl oligopeptidases, and transitional endoplasmic reticulum (p97) AAA+ ATPases.

CONCLUSION

While not entirely replacing PSI-BLAST based alignments, which likewise may be optimized for CHAIN analysis using this approach, these motif-based methods often more accurately align very distantly related sequences and thus can provide a better measure of selective constraints. In some instances, these new approaches also provide a better understanding of family-specific constraints, as we illustrate for p97 ATPases. Programs implementing these procedures and supplementary information are available from the authors.

Iyer LM, Makarova KS, Koonin EV, Aravind L. Comparative genomics of the FtsK-HerA superfamily of pumping ATPases: implications for the origins of chromosome segregation, cell division and viral capsid packaging. Nucleic Acids Res 2004;32:5260-79. [PMID: 15466593 PMCID: PMC521647 DOI: 10.1093/nar/gkh828] [Citation(s) in RCA: 246] [Impact Index Per Article: 12.3] [Indexed: 11/12/2022] Open

Abstract

Recently, it has been shown that a predicted P-loop ATPase (the HerA or MlaA protein), which is highly conserved in archaea and also present in many bacteria but absent in eukaryotes, has a bidirectional helicase activity and forms hexameric rings similar to those described for the TrwB ATPase. In this study, the FtsK-HerA superfamily of P-loop ATPases, in which the HerA clade comprises one of the major branches, is analyzed in detail. We show that, in addition to the FtsK and HerA clades, this superfamily includes several families of characterized or predicted ATPases which are predominantly involved in extrusion of DNA and peptides through membrane pores. The DNA-packaging ATPases of various bacteriophages and eukaryotic double-stranded DNA viruses also belong to the FtsK-HerA superfamily. The FtsK protein is the essential bacterial ATPase that is responsible for the correct segregation of daughter chromosomes during cell division. The structural and evolutionary relationship between HerA and FtsK and the nearly perfect complementarity of their phyletic distributions suggest that HerA similarly mediates DNA pumping into the progeny cells during archaeal cell division. It appears likely that the HerA and FtsK families diverged concomitantly with the archaeal-bacterial division and that the last universal common ancestor of modern life forms had an ancestral DNA-pumping ATPase that gave rise to these families. Furthermore, the relationship of these cellular proteins with the packaging ATPases of diverse DNA viruses suggests that a common DNA pumping mechanism might be operational in both cellular and viral genome segregation. The herA gene forms a highly conserved operon with the gene for the NurA nuclease and, in many archaea, also with the orthologs of eukaryotic double-strand break repair proteins MRE11 and Rad50. 
HerA is predicted to function in a complex with these proteins in DNA pumping and repair of double-stranded breaks introduced during this process and, possibly, also during DNA replication. Extensive comparative analysis of the 'genomic context' combined with in-depth sequence analysis led to the prediction of numerous previously unnoticed nucleases of the NurA superfamily, including a specific version that is likely to be the endonuclease component of a novel restriction-modification system. This analysis also led to the identification of previously uncharacterized nucleases, such as a novel predicted nuclease of the Sir2-type Rossmann fold, and phosphatases of the HAD superfamily that are likely to function as partners of the FtsK-HerA superfamily ATPases.

Leipe DD, Koonin EV, Aravind L. STAND, a Class of P-Loop NTPases Including Animal and Plant Regulators of Programmed Cell Death: Multiple, Complex Domain Architectures, Unusual Phyletic Patterns, and Evolution by Horizontal Gene Transfer. J Mol Biol 2004;343:1-28. [PMID: 15381417 DOI: 10.1016/j.jmb.2004.08.023] [Citation(s) in RCA: 325] [Impact Index Per Article: 16.3] [Received: 05/14/2004] [Revised: 07/27/2004] [Accepted: 08/10/2004] [Indexed: 10/26/2022]

Abstract

Using sequence profile analysis and sequence-based structure predictions, we define a previously unrecognized, widespread class of P-loop NTPases. The signal transduction ATPases with numerous domains (STAND) class includes the AP-ATPases (animal apoptosis regulators CED4/Apaf-1, plant disease resistance proteins, and bacterial AfsR-like transcription regulators) and NACHT NTPases (e.g. NAIP, TLP1, Het-E-1) that have been studied extensively in the context of apoptosis, pathogen response in animals and plants, and transcriptional regulation in bacteria. We show that, in addition to these well-characterized protein families, the STAND class includes several other groups of (predicted) NTPase domains from diverse signaling and transcription regulatory proteins from bacteria and eukaryotes, and three Archaea-specific families. We identified the STAND domain in several biologically well-characterized proteins that have not been suspected to have NTPase activity, including soluble adenylyl cyclases, nephrocystin 3 (implicated in polycystic kidney disease), and Rolling pebble (a regulator of muscle development); these findings are expected to facilitate elucidation of the functions of these proteins. The STAND class belongs to the additional strand, catalytic E division of P-loop NTPases together with the AAA+ ATPases, RecA/helicase-related ATPases, ABC-ATPases, and VirD4/PilT-like ATPases. The STAND proteins are distinguished from other P-loop NTPases by the presence of unique sequence motifs associated with the N-terminal helix and the core strand-4, as well as a C-terminal helical bundle that is fused to the NTPase domain. This helical module contains a signature GxP motif in the loop between the two distal helices. With the exception of the archaeal families, almost all STAND NTPases are multidomain proteins containing three or more domains. 
In addition to the NTPase domain, these proteins typically contain DNA-binding or protein-binding domains, superstructure-forming repeats, such as WD40 and TPR, and enzymatic domains involved in signal transduction, including adenylate cyclases and kinases. By analogy to the AAA+ ATPases, it can be predicted that STAND NTPases use the C-terminal helical bundle as a "lever" to transmit the conformational changes brought about by NTP hydrolysis to effector domains. STAND NTPases represent a novel paradigm in signal transduction, whereby adaptor, regulatory switch, scaffolding, and, in some cases, signal-generating moieties are combined into a single polypeptide. The STAND class consists of 14 distinct families, and the evolutionary history of most of these families is riddled with dramatic instances of lineage-specific expansion and apparent horizontal gene transfer. The STAND NTPases are most abundant in developmentally and organizationally complex prokaryotes and eukaryotes. Transfer of genes for STAND NTPases from bacteria to eukaryotes on several occasions might have played a significant role in the evolution of eukaryotic signaling systems.

Roesner A, Fuchs C, Hankeln T, Burmester T. A globin gene of ancient evolutionary origin in lower vertebrates: evidence for two distinct globin families in animals. Mol Biol Evol 2004;22:12-20. [PMID: 15356282 DOI: 10.1093/molbev/msh258] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Indexed: 01/08/2023] Open

Garau G, García-Sáez I, Bebrone C, Anne C, Mercuri P, Galleni M, Frère JM, Dideberg O. Update of the standard numbering scheme for class B beta-lactamases. Antimicrob Agents Chemother 2004;48:2347-9. [PMID: 15215079 PMCID: PMC434215 DOI: 10.1128/aac.48.7.2347-2349.2004] [Citation(s) in RCA: 231] [Impact Index Per Article: 11.6] [Indexed: 11/20/2022] Open

Ascenzi P, Bocedi A, de Sanctis D, Pesce A, Bolognesi M, Marden MC, Dewilde S, Moens L, Hankeln T, Burmester T. Neuroglobin and cytoglobin: Two new entries in the hemoglobin superfamily. Biochem Mol Biol Educ 2004;32:305-313. [PMID: 21706744 DOI: 10.1002/bmb.2004.494032050386] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Indexed: 05/31/2023]

Okamoto Y, Morishita J, Tsuboi K, Tonai T, Ueda N. Molecular Characterization of a Phospholipase D Generating Anandamide and Its Congeners. J Biol Chem 2004;279:5298-305. [PMID: 14634025 DOI: 10.1074/jbc.m306642200] [Citation(s) in RCA: 580] [Impact Index Per Article: 29.0] [Indexed: 11/06/2022] Open

de Sanctis D, Dewilde S, Pesce A, Moens L, Ascenzi P, Hankeln T, Burmester T, Bolognesi M. Crystal Structure of Cytoglobin: The Fourth Globin Type Discovered in Man Displays Heme Hexa-coordination. J Mol Biol 2004;336:917-27. [PMID: 15095869 DOI: 10.1016/j.jmb.2003.12.063] [Citation(s) in RCA: 114] [Impact Index Per Article: 5.7] [Indexed: 10/26/2022]

Andersen C. Channel-tunnels: outer membrane components of type I secretion systems and multidrug efflux pumps of Gram-negative bacteria. Rev Physiol Biochem Pharmacol 2003;147:122-65. [PMID: 12783268 DOI: 10.1007/s10254-003-0008-y] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.3] [Indexed: 12/19/2022]

Sitbon E, Pietrokovski S. New types of conserved sequence domains in DNA-binding regions of homing endonucleases. Trends Biochem Sci 2003;28:473-7. [PMID: 13678957 DOI: 10.1016/s0968-0004(03)00170-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Indexed: 10/27/2022]

Anantharaman V, Aravind L. New connections in the prokaryotic toxin-antitoxin network: relationship with the eukaryotic nonsense-mediated RNA decay system. Genome Biol 2003;4:R81. [PMID: 14659018 PMCID: PMC329420 DOI: 10.1186/gb-2003-4-12-r81] [Citation(s) in RCA: 195] [Impact Index Per Article: 9.3] [Received: 08/21/2003] [Revised: 10/13/2003] [Accepted: 10/10/2003] [Indexed: 11/28/2022] Open

Abstract

Sequence profile analysis of the RelE- and ParE-type post-segregational cell killing (PSK) toxins from diverse bacteria and archaea has unified these proteins into a single superfamily. Further comparative analysis suggests that the core of the eukaryotic nonsense-mediated RNA decay system has probably evolved from a PSK-related system.

Background

Several prokaryotic plasmids maintain themselves in their hosts by means of diverse post-segregational cell killing systems. Recent findings suggest that chromosomally encoded copies of toxins and antitoxins of post-segregational cell killing systems - such as the RelE system - might function as regulatory switches under stress conditions. The RelE toxin cleaves ribosome-associated transcripts, whereas another post-segregational cell killing toxin, ParE, functions as a gyrase inhibitor.

Results

Using sequence profile analysis we were able to unify the RelE- and ParE-type toxins with several families of small, uncharacterized proteins from diverse bacteria and archaea into a single superfamily. Gene neighborhood analysis showed that the majority of these proteins were encoded by genes in characteristic neighborhoods, in which genes encoding toxins always co-occurred with genes encoding transcription factors that are also antitoxins. The transcription factors accompanying the RelE/ParE superfamily may belong to unrelated or distantly related superfamilies, however. We used this conserved neighborhood template to transitively search genomes and identify novel post-segregational cell killing-related systems. One of these novel systems, observed in several prokaryotes, contained a predicted toxin with a PilT-N terminal (PIN) domain, which is also found in proteins of the eukaryotic nonsense-mediated RNA decay system. These searches also identified novel transcription factors (antitoxins) in post-segregational cell killing systems. Furthermore, the toxin Doc defines a potential metalloenzyme superfamily, with novel representatives in bacteria, archaea and eukaryotes, that probably acts on nucleic acids.

Conclusions

The tightly maintained gene neighborhoods of post-segregational cell killing-related systems appear to have evolved by in situ displacement of genes for toxins or antitoxins by functionally equivalent but evolutionarily unrelated genes. We predict that the novel post-segregational cell killing-related systems containing a PilT-N terminal domain toxin and the eukaryotic nonsense-mediated RNA decay system are likely to function via a common mechanism, in which the PilT-N terminal domain cleaves ribosome-associated transcripts. The core of the eukaryotic nonsense-mediated RNA decay system has probably evolved from a post-segregational cell killing-related system.

Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA. The COG database: an updated version includes eukaryotes. BMC Bioinformatics 2003;4:41. [PMID: 12969510 PMCID: PMC222959 DOI: 10.1186/1471-2105-4-41] [Citation(s) in RCA: 3241] [Impact Index Per Article: 154.3] [Received: 05/20/2003] [Accepted: 09/11/2003] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies.

RESULTS

We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.

CONCLUSION

The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.

Affiliation(s)

Roman L Tatusov National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Natalie D Fedorova National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
John D Jackson National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Aviva R Jacobs National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Boris Kiryutin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Eugene V Koonin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Dmitri M Krylov National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Raja Mazumder Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA
Sergei L Mekhedov National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Anastasia N Nikolskaya Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA
B Sridhar Rao National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Sergei Smirnov National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Alexander V Sverdlov National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Sona Vasudevan National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Yuri I Wolf National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Jodie J Yin National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
Darren A Natale Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA

Qian B, Goldstein RA. Detecting distant homologs using phylogenetic tree-based HMMs. Proteins 2003;52:446-53. [PMID: 12866055 DOI: 10.1002/prot.10373] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Indexed: 11/11/2022]

Jackson DB, Minch E, Munro RE. Bioinformatics. EXS 2003:31-69. [PMID: 12613171 DOI: 10.1007/978-3-0348-7997-2_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/01/2023]

Teufel M, Saudek V, Ledig JP, Bernhardt A, Boularand S, Carreau A, Cairns NJ, Carter C, Cowley DJ, Duverger D, Ganzhorn AJ, Guenet C, Heintzelmann B, Laucher V, Sauvage C, Smirnova T. Sequence identification and characterization of human carnosinase and a closely related non-specific dipeptidase. J Biol Chem 2003;278:6521-31. [PMID: 12473676 DOI: 10.1074/jbc.m209764200] [Citation(s) in RCA: 241] [Impact Index Per Article: 11.5] [Indexed: 11/06/2022] Open

Iyer LM, Koonin EV, Aravind L. Evolutionary connection between the catalytic subunits of DNA-dependent RNA polymerases and eukaryotic RNA-dependent RNA polymerases and the origin of RNA polymerases. BMC Struct Biol 2003;3:1. [PMID: 12553882 PMCID: PMC151600 DOI: 10.1186/1472-6807-3-1] [Citation(s) in RCA: 165] [Impact Index Per Article: 7.9] [Received: 01/18/2003] [Accepted: 01/28/2003] [Indexed: 12/02/2022]

Abstract

BACKGROUND

The eukaryotic RNA-dependent RNA polymerase (RDRP) is involved in the amplification of regulatory microRNAs during post-transcriptional gene silencing. This enzyme is highly conserved in most eukaryotes but is missing in archaea and bacteria. No evolutionary relationship between RDRP and other polymerases has been reported so far, hence the origin of this eukaryote-specific polymerase remains a mystery.

RESULTS

Using extensive sequence profile searches, we identified bacteriophage homologs of the eukaryotic RDRP. The comparison of the eukaryotic RDRP and their homologs from bacteriophages led to the delineation of the conserved portion of these enzymes, which is predicted to harbor the catalytic site. Further, detailed sequence comparison, aided by examination of the crystal structure of the DNA-dependent RNA polymerase (DDRP), showed that the RDRP and the beta' subunit of DDRP (and its orthologs in archaea and eukaryotes) contain a conserved double-psi beta-barrel (DPBB) domain. This DPBB domain contains the signature motif DbDGD (b is a bulky residue), which is conserved in all RDRPs and DDRPs and contributes to catalysis via a coordinated divalent cation. Apart from the DPBB domain, no similarity was detected between RDRP and DDRP, which leaves open two scenarios for the origin of RDRP: i) RDRP evolved at the onset of the evolution of eukaryotes via a duplication of the DDRP beta' subunit followed by dramatic divergence that obliterated the sequence similarity outside the core catalytic domain and ii) the primordial RDRP, which consisted primarily of the DPBB domain, evolved from a common ancestor with the DDRP at a very early stage of evolution, during the RNA world era. The latter hypothesis implies that RDRP had been subsequently eliminated from cellular life forms and might have been reintroduced into the eukaryotic genomes through a bacteriophage. Sequence and structure analysis of the DDRP led to further insights into the evolution of RNA polymerases. In addition to the beta' subunit, the beta subunit of DDRP also contains a DPBB domain, which is, however, distorted by large inserts and does not harbor a counterpart of the DbDGD motif. The DPBB domains of the two DDRP subunits together form the catalytic cleft, with the domain from the beta' subunit supplying the metal-coordinating DbDGD motif and the one from the beta subunit providing two lysine residues involved in catalysis. Given that the two DPBB domains of DDRP contribute completely different sets of active residues to the catalytic center, it is hypothesized that the ultimate ancestor of RNA polymerases functioned as a homodimer of a generic, RNA-binding DPBB domain. This ancestral protein probably did not have catalytic activity and served as a cofactor for a ribozyme RNA polymerase. Subsequent evolution of DDRP and RDRP involved accretion of distinct sets of additional domains. In the DDRPs, these included an RNA-binding Zn-ribbon, an AT-hook-like module and a sandwich-barrel hybrid motif (SBHM) domain. Further, lineage-specific accretion of SBHM domains and other, DDRP-specific domains is observed in bacterial DDRPs. In contrast, the orthologs of the beta' subunit in archaea and eukaryotes contain a four-stranded alpha + beta domain that is shared with the alpha-subunit of bacterial DDRP, eukaryotic DDRP subunit RBP11, translation factor eIF1 and type II topoisomerases. The additional domains of the RDRPs remain to be characterized.

CONCLUSIONS

Eukaryotic RNA-dependent RNA polymerases share the catalytic double-psi beta-barrel domain, containing a signature metal-coordinating motif, with the universally conserved beta' subunit of DNA-dependent RNA polymerases. Beyond this core catalytic domain, the two classes of RNA polymerases do not have common domains, suggesting early divergence from a common ancestor, with subsequent independent domain accretion. The beta-subunit of DDRP contains another, highly diverged DPBB domain. The presence of two distinct DPBB domains in two subunits of DDRP is compatible with the hypothesis that the ultimate ancestor of RNA polymerases was an RNA-binding DPBB domain that had no catalytic activity but rather functioned as a homodimeric cofactor for a ribozyme polymerase.

Panchenko AR. Finding weak similarities between proteins by sequence profile comparison. Nucleic Acids Res 2003;31:683-9. [PMID: 12527777 PMCID: PMC140518 DOI: 10.1093/nar/gkg154] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Indexed: 11/14/2022] Open

Tulin A, Chinenov Y, Spradling A. Regulation of chromatin structure and gene activity by poly(ADP-ribose) polymerases. Curr Top Dev Biol 2003;56:55-83. [PMID: 14584726 DOI: 10.1016/s0070-2153(03)01007-x] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.4] [Indexed: 12/15/2022]

Pesce A, Bolognesi M, Bocedi A, Ascenzi P, Dewilde S, Moens L, Hankeln T, Burmester T. Neuroglobin and cytoglobin. Fresh blood for the vertebrate globin family. EMBO Rep 2002;3:1146-51. [PMID: 12475928 PMCID: PMC1308314 DOI: 10.1093/embo-reports/kvf248] [Citation(s) in RCA: 232] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Wilk S, Wilk E, Magnusson RP. Identification of histidine residues important in the catalysis and structure of aspartyl aminopeptidase. Arch Biochem Biophys 2002;407:176-83. [PMID: 12413488 DOI: 10.1016/s0003-9861(02)00494-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Shaw E, Dordick JS. Predicting amino acid residues responsible for enzyme specificity solely from protein sequences. Biotechnol Bioeng 2002;79:295-300. [PMID: 12115418 DOI: 10.1002/bit.10289] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Zgurskaya HI. Molecular analysis of efflux pump-based antibiotic resistance. Int J Med Microbiol 2002;292:95-105. [PMID: 12195740 DOI: 10.1078/1438-4221-00195] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Aravind L, Anantharaman V, Koonin EV. Monophyly of class I aminoacyl tRNA synthetase, USPA, ETFP, photolyase, and PP-ATPase nucleotide-binding domains: implications for protein evolution in the RNA world. Proteins 2002;48:1-14. [PMID: 12012333 DOI: 10.1002/prot.10064] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Koretke KK, Russell RB, Lupas AN. Fold recognition without folds. Protein Sci 2002;11:1575-9. [PMID: 12021456 PMCID: PMC2373620 DOI: 10.1110/ps.3590102] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]

Pesce A, Nardini M, Dewilde S, Geuens E, Yamauchi K, Ascenzi P, Riggs AF, Moens L, Bolognesi M. The 109 residue nerve tissue minihemoglobin from Cerebratulus lacteus highlights striking structural plasticity of the alpha-helical globin fold. Structure 2002;10:725-35. [PMID: 12015154 DOI: 10.1016/s0969-2126(02)00763-3] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]

Shaw E, McCue LA, Lawrence CE, Dordick JS. Identification of a novel class in the alpha/beta hydrolase fold superfamily: the N-myc differentiation-related proteins. Proteins 2002;47:163-8. [PMID: 11933063 DOI: 10.1002/prot.10083] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Bujnicki JM, Rychlewski L. RNA:(guanine-N2) methyltransferases RsmC/RsmD and their homologs revisited--bioinformatic analysis and prediction of the active site based on the uncharacterized Mj0882 protein structure. BMC Bioinformatics 2002;3:10. [PMID: 11929612 PMCID: PMC102759 DOI: 10.1186/1471-2105-3-10] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2001] [Accepted: 04/03/2002] [Indexed: 01/01/2023] Open

Burmester T, Ebner B, Weich B, Hankeln T. Cytoglobin: a novel globin type ubiquitously expressed in vertebrate tissues. Mol Biol Evol 2002;19:416-21. [PMID: 11919282 DOI: 10.1093/oxfordjournals.molbev.a004096] [Citation(s) in RCA: 364] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Dlakić M. A new family of putative insulin receptor-like proteins in C. elegans. Curr Biol 2002;12:R155-7. [PMID: 11882301 DOI: 10.1016/s0960-9822(02)00729-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Purkayastha A, McCue LA, McDonough KA. Identification of a Mycobacterium tuberculosis putative classical nitroreductase gene whose expression is coregulated with that of the acr gene within macrophages, in standing versus shaking cultures, and under low oxygen conditions. Infect Immun 2002;70:1518-29. [PMID: 11854240 PMCID: PMC127740 DOI: 10.1128/iai.70.3.1518-1529.2002] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Chinenov Y. A second catalytic domain in the Elp3 histone acetyltransferases: a candidate for histone demethylase activity? Trends Biochem Sci 2002;27:115-7. [PMID: 11893502 DOI: 10.1016/s0968-0004(02)02058-3] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Aravind L, Koonin EV. Classification of the caspase-hemoglobinase fold: detection of new families and implications for the origin of the eukaryotic separins. Proteins 2002;46:355-67. [PMID: 11835511 DOI: 10.1002/prot.10060] [Citation(s) in RCA: 134] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Abstract

A comprehensive sequence and structural comparative analysis of the caspase-hemoglobinase protein fold resulted in the delineation of the minimal structural core of the protease domain and the identification of numerous, previously undetected members, including a new protease family typified by the HetF protein from the cyanobacterium Nostoc. The first bacterial homologs of legumains and hemoglobinases were also identified. Most proteins containing this fold are known or predicted to be active proteases, but multiple, independent inactivations were noticed in nearly all lineages. Together with the tendency of caspase-related proteases to form intramolecular or intermolecular dimers, this suggests a widespread regulatory role for the inactive forms. A classification of the caspase-hemoglobinase fold was developed to reflect the inferred evolutionary relationships between the constituent protein families. Proteins containing this domain were so far detected almost exclusively in bacteria and eukaryotes. This analysis indicates that caspase-hemoglobinase-fold proteases and their inactivated derivatives are widespread in diverse bacteria, particularly those with a complex development, such as Streptomyces, Anabaena, Mesorhizobium, and Myxococcus. The eukaryotic separin family was shown to be most closely related to the mainly prokaryotic HetF family. The phyletic patterns and evolutionary relationships between these proteins suggest that they probably were acquired by eukaryotes from bacteria during the primary, promitochondrial endosymbiosis. A similar scenario, supported by phylogenetic analysis, seems to apply to metacaspases and paracaspases, with the latter, perhaps, being acquired in an independent horizontal transfer to the eukaryotes. The acquisition of the caspase-hemoglobinase-fold domains by eukaryotes might have been critical in the evolution of important eukaryotic processes, such as mitosis and programmed cell death.

Panchenko AR, Bryant SH. A comparison of position-specific score matrices based on sequence and structure alignments. Protein Sci 2002;11:361-70. [PMID: 11790846 PMCID: PMC2373449 DOI: 10.1110/ps.19902] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]

Bornberg-Bauer E. Randomness, Structural Uniqueness, Modularity and Neutral Evolution in Sequence Space of Model Proteins. Z Phys Chem 2002;216:139. [DOI: 10.1524/zpch.2002.216.2.139] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Håkansson K, Miller CG. Structure of peptidase T from Salmonella typhimurium. Eur J Biochem 2002;269:443-50. [PMID: 11856302 DOI: 10.1046/j.0014-2956.2001.02665.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Bujnicki JM. In silico analysis of the tRNA:m1A58 methyltransferase family: homology-based fold prediction and identification of new members from Eubacteria and Archaea. FEBS Lett 2001;507:123-7. [PMID: 11684083 DOI: 10.1016/s0014-5793(01)02962-3] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Sharff A, Fanutti C, Shi J, Calladine C, Luisi B. The role of the TolC family in protein transport and multidrug efflux. From stereochemical certainty to mechanistic hypothesis. Eur J Biochem 2001;268:5011-26. [PMID: 11589692 DOI: 10.1046/j.0014-2956.2001.02442.x] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Spencer J, Clarke AR, Walsh TR. Novel mechanism of hydrolysis of therapeutic beta-lactams by Stenotrophomonas maltophilia L1 metallo-beta-lactamase. J Biol Chem 2001;276:33638-44. [PMID: 11443136 DOI: 10.1074/jbc.m105550200] [Citation(s) in RCA: 82] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Florczyk MA, McCue LA, Stack RF, Hauer CR, McDonough KA. Identification and characterization of mycobacterial proteins differentially expressed under standing and shaking culture conditions, including Rv2623 from a novel class of putative ATP-binding proteins. Infect Immun 2001;69:5777-85. [PMID: 11500455 PMCID: PMC98695 DOI: 10.1128/iai.69.9.5777-5785.2001] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Daiyasu H, Osaka K, Ishino Y, Toh H. Expansion of the zinc metallo-hydrolase family of the beta-lactamase fold. FEBS Lett 2001;503:1-6. [PMID: 11513844 DOI: 10.1016/s0014-5793(01)02686-2] [Citation(s) in RCA: 258] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Grigoriev IV, Zhang C, Kim SH. Sequence-based detection of distantly related proteins with the same fold. Protein Eng 2001;14:455-8. [PMID: 11522917 DOI: 10.1093/protein/14.7.455] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Lewis JA, Hatfull GF. Control of directionality in integrase-mediated recombination: examination of recombination directionality factors (RDFs) including Xis and Cox proteins. Nucleic Acids Res 2001;29:2205-16. [PMID: 11376138 PMCID: PMC55702 DOI: 10.1093/nar/29.11.2205] [Citation(s) in RCA: 147] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2001] [Revised: 03/28/2001] [Accepted: 04/11/2001] [Indexed: 11/12/2022] Open

Lecompte O, Thompson JD, Plewniak F, Thierry J, Poch O. Multiple alignment of complete sequences (MACS) in the post-genomic era. Gene 2001;270:17-30. [PMID: 11403999 DOI: 10.1016/s0378-1119(01)00461-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Kwak J, McCue LA, Trczianka K, Kendrick KE. Identification and characterization of a developmentally regulated protein, EshA, required for sporogenic hyphal branches in Streptomyces griseus. J Bacteriol 2001;183:3004-15. [PMID: 11325927 PMCID: PMC95199 DOI: 10.1128/jb.183.10.3004-3015.2001] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open