51
|
Abstract
A novel approximately 60-residue domain has been identified in Homo sapiens MGC5297 and various other proteins in eukaryotes. Sequence searches reveal that the domain is particularly abundant in apicomplexans and is predicted to be involved in diverse RNA-binding activities.
Collapse
Affiliation(s)
- Ian Lee
- Computational Molecular Biology Programme, Institute of Molecular and Cell Biology, 61 Biopolis Drive, Proteos, Singapore 138673.
| | | |
Collapse
|
52
|
Frère JM, Galleni M, Bush K, Dideberg O. Is it necessary to change the classification of {beta}-lactamases? J Antimicrob Chemother 2005; 55:1051-3. [PMID: 15886262 DOI: 10.1093/jac/dki155] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
53
|
Likić VA, Perry A, Hulett J, Derby M, Traven A, Waller RF, Keeling PJ, Koehler CM, Curran SP, Gooley PR, Lithgow T. Patterns that Define the Four Domains Conserved in Known and Novel Isoforms of the Protein Import Receptor Tom20. J Mol Biol 2005; 347:81-93. [PMID: 15733919 DOI: 10.1016/j.jmb.2004.12.057] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2004] [Revised: 12/17/2004] [Accepted: 12/27/2004] [Indexed: 11/22/2022]
Abstract
Tom20 is the master receptor for protein import into mitochondria. Analysis of motifs present in Tom20 sequences from fungi and animals found several highly conserved regions, including features of the transmembrane segment, the ligand-binding domain and functionally important flexible segments at the N terminus and the C terminus of the protein. Hidden Markov model searches of genome sequence data revealed novel isoforms of Tom20 in vertebrate and invertebrate animals. A three-dimensional comparative model of the novel type I Tom20, based on the structurally characterized type II isoform, shows important differences in the amino acid residues lining the ligand-binding groove, where the type I protein from animals is more similar to the fungal form of Tom20. Given that the two receptor types from mouse interact with the same set of precursor protein substrates, comparative analysis of the substrate-binding site provides unique insight into the mechanism of substrate recognition. No Tom20-related protein was found in genome sequence data from plants or protozoans, suggesting the receptor Tom20 evolved after the split of animals and fungi from the main lineage of eukaryotes.
Collapse
Affiliation(s)
- Vladimir A Likić
- Russell Grimwade School of Biochemistry and Molecular Biology, University of Melbourne, Melbourne 3010, Australia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
54
|
Spence P, Bard J, Jones P, Betty M. The identification of G-protein coupled receptors in sequence databases. Expert Opin Ther Pat 2005. [DOI: 10.1517/13543776.8.3.235] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
55
|
Raphael B, Zhi D, Tang H, Pevzner P. A novel method for multiple alignment of sequences with repeated and shuffled elements. Genome Res 2005; 14:2336-46. [PMID: 15520295 PMCID: PMC525693 DOI: 10.1101/gr.2657504] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We describe ABA (A-Bruijn alignment), a new method for multiple alignment of biological sequences. The major difference between ABA and existing multiple alignment methods is that ABA represents an alignment as a directed graph, possibly containing cycles. This representation provides more flexibility than does a traditional alignment matrix or the recently introduced partial order alignment (POA) graph by allowing a larger class of evolutionary relationships between the aligned sequences. Our graph representation is particularly well-suited to the alignment of protein sequences with shuffled and/or repeated domain structure, and allows one to construct multiple alignments of proteins containing (1) domains that are not present in all proteins, (2) domains that are present in different orders in different proteins, and (3) domains that are present in multiple copies in some proteins. In addition, ABA is useful in the alignment of genomic sequences that contain duplications and inversions. We provide several examples illustrating the applications of ABA.
Collapse
Affiliation(s)
- Benjamin Raphael
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California 92093-0114, USA.
| | | | | | | |
Collapse
|
56
|
Abstract
Cytokines are an important class of proteins responsible for intercellular communication. The helical cytokines have a four-helix bundle fold, and they have remained largely intractable for sequence alignment methods due to their high evolutionary divergence. This paper presents a method that is specifically designed to recognize the helical cytokine fold in preprotein sequences such as full-length cDNA translations or transcripts predicted by gene finding methods. A protein fold is modeled by multiple sequence profiles, each representing a structurally conserved region. Nonstructural profiles are used to represent additional signals found in preprotein sequences. Profiles are connected by loop regions, each of a specified minimum and maximum length. A model for the helical cytokines is created by progressively improving a placement of four amphipathic helices onto training sequences. The sensitivity and specificity of the method are evaluated by a cross-validation procedure, demonstrating that cytokines with no intrafamily sequence similarity can be recognized. The method has been successfully used for the discovery of several new helical cytokines in the human genome.
Collapse
|
57
|
Neuwald AF, Liu JS. Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model. BMC Bioinformatics 2004; 5:157. [PMID: 15504234 PMCID: PMC538276 DOI: 10.1186/1471-2105-5-157] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2004] [Accepted: 10/25/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Certain protein families are highly conserved across distantly related organisms and belong to large and functionally diverse superfamilies. The patterns of conservation present in these protein sequences presumably are due to selective constraints maintaining important but unknown structural mechanisms with some constraints specific to each family and others shared by a larger subset or by the entire superfamily. To exploit these patterns as a source of functional information, we recently devised a statistically based approach called contrast hierarchical alignment and interaction network (CHAIN) analysis, which infers the strengths of various categories of selective constraints from co-conserved patterns in a multiple alignment. The power of this approach strongly depends on the quality of the multiple alignments, which thus motivated development of theoretical concepts and strategies to improve alignment of conserved motifs within large sets of distantly related sequences. RESULTS Here we describe a hidden Markov model (HMM), an algebraic system, and Markov chain Monte Carlo (MCMC) sampling strategies for alignment of multiple sequence motifs. The MCMC sampling strategies are useful both for alignment optimization and for adjusting position specific background amino acid frequencies for alignment uncertainties. Associated statistical formulations provide an objective measure of alignment quality as well as automatic gap penalty optimization. Improved alignments obtained in this way are compared with PSI-BLAST based alignments within the context of CHAIN analysis of three protein families: Gialpha subunits, prolyl oligopeptidases, and transitional endoplasmic reticulum (p97) AAA+ ATPases. CONCLUSION While not entirely replacing PSI-BLAST based alignments, which likewise may be optimized for CHAIN analysis using this approach, these motif-based methods often more accurately align very distantly related sequences and thus can provide a better measure of selective constraints. In some instances, these new approaches also provide a better understanding of family-specific constraints, as we illustrate for p97 ATPases. Programs implementing these procedures and supplementary information are available from the authors.
Collapse
Affiliation(s)
- Andrew F Neuwald
- Cold Spring Harbor Laboratory, 1 Bungtown Road, P.O. Box 100, Cold Spring Harbor, NY 11724, USA
| | - Jun S Liu
- Department of Statistics, Harvard University, 1 Oxford Street, Cambridge MA, 02138, USA
| |
Collapse
|
58
|
Iyer LM, Makarova KS, Koonin EV, Aravind L. Comparative genomics of the FtsK-HerA superfamily of pumping ATPases: implications for the origins of chromosome segregation, cell division and viral capsid packaging. Nucleic Acids Res 2004; 32:5260-79. [PMID: 15466593 PMCID: PMC521647 DOI: 10.1093/nar/gkh828] [Citation(s) in RCA: 246] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Recently, it has been shown that a predicted P-loop ATPase (the HerA or MlaA protein), which is highly conserved in archaea and also present in many bacteria but absent in eukaryotes, has a bidirectional helicase activity and forms hexameric rings similar to those described for the TrwB ATPase. In this study, the FtsK-HerA superfamily of P-loop ATPases, in which the HerA clade comprises one of the major branches, is analyzed in detail. We show that, in addition to the FtsK and HerA clades, this superfamily includes several families of characterized or predicted ATPases which are predominantly involved in extrusion of DNA and peptides through membrane pores. The DNA-packaging ATPases of various bacteriophages and eukaryotic double-stranded DNA viruses also belong to the FtsK-HerA superfamily. The FtsK protein is the essential bacterial ATPase that is responsible for the correct segregation of daughter chromosomes during cell division. The structural and evolutionary relationship between HerA and FtsK and the nearly perfect complementarity of their phyletic distributions suggest that HerA similarly mediates DNA pumping into the progeny cells during archaeal cell division. It appears likely that the HerA and FtsK families diverged concomitantly with the archaeal-bacterial division and that the last universal common ancestor of modern life forms had an ancestral DNA-pumping ATPase that gave rise to these families. Furthermore, the relationship of these cellular proteins with the packaging ATPases of diverse DNA viruses suggests that a common DNA pumping mechanism might be operational in both cellular and viral genome segregation. The herA gene forms a highly conserved operon with the gene for the NurA nuclease and, in many archaea, also with the orthologs of eukaryotic double-strand break repair proteins MRE11 and Rad50. HerA is predicted to function in a complex with these proteins in DNA pumping and repair of double-stranded breaks introduced during this process and, possibly, also during DNA replication. Extensive comparative analysis of the 'genomic context' combined with in-depth sequence analysis led to the prediction of numerous previously unnoticed nucleases of the NurA superfamily, including a specific version that is likely to be the endonuclease component of a novel restriction-modification system. This analysis also led to the identification of previously uncharacterized nucleases, such as a novel predicted nuclease of the Sir2-type Rossmann fold, and phosphatases of the HAD superfamily that are likely to function as partners of the FtsK-HerA superfamily ATPases.
Collapse
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
59
|
Leipe DD, Koonin EV, Aravind L. STAND, a Class of P-Loop NTPases Including Animal and Plant Regulators of Programmed Cell Death: Multiple, Complex Domain Architectures, Unusual Phyletic Patterns, and Evolution by Horizontal Gene Transfer. J Mol Biol 2004; 343:1-28. [PMID: 15381417 DOI: 10.1016/j.jmb.2004.08.023] [Citation(s) in RCA: 325] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2004] [Revised: 07/27/2004] [Accepted: 08/10/2004] [Indexed: 10/26/2022]
Abstract
Using sequence profile analysis and sequence-based structure predictions, we define a previously unrecognized, widespread class of P-loop NTPases. The signal transduction ATPases with numerous domains (STAND) class includes the AP-ATPases (animal apoptosis regulators CED4/Apaf-1, plant disease resistance proteins, and bacterial AfsR-like transcription regulators) and NACHT NTPases (e.g. NAIP, TLP1, Het-E-1) that have been studied extensively in the context of apoptosis, pathogen response in animals and plants, and transcriptional regulation in bacteria. We show that, in addition to these well-characterized protein families, the STAND class includes several other groups of (predicted) NTPase domains from diverse signaling and transcription regulatory proteins from bacteria and eukaryotes, and three Archaea-specific families. We identified the STAND domain in several biologically well-characterized proteins that have not been suspected to have NTPase activity, including soluble adenylyl cyclases, nephrocystin 3 (implicated in polycystic kidney disease), and Rolling pebble (a regulator of muscle development); these findings are expected to facilitate elucidation of the functions of these proteins. The STAND class belongs to the additional strand, catalytic E division of P-loop NTPases together with the AAA+ ATPases, RecA/helicase-related ATPases, ABC-ATPases, and VirD4/PilT-like ATPases. The STAND proteins are distinguished from other P-loop NTPases by the presence of unique sequence motifs associated with the N-terminal helix and the core strand-4, as well as a C-terminal helical bundle that is fused to the NTPase domain. This helical module contains a signature GxP motif in the loop between the two distal helices. With the exception of the archaeal families, almost all STAND NTPases are multidomain proteins containing three or more domains. In addition to the NTPase domain, these proteins typically contain DNA-binding or protein-binding domains, superstructure-forming repeats, such as WD40 and TPR, and enzymatic domains involved in signal transduction, including adenylate cyclases and kinases. By analogy to the AAA+ ATPases, it can be predicted that STAND NTPases use the C-terminal helical bundle as a "lever" to transmit the conformational changes brought about by NTP hydrolysis to effector domains. STAND NTPases represent a novel paradigm in signal transduction, whereby adaptor, regulatory switch, scaffolding, and, in some cases, signal-generating moieties are combined into a single polypeptide. The STAND class consists of 14 distinct families, and the evolutionary history of most of these families is riddled with dramatic instances of lineage-specific expansion and apparent horizontal gene transfer. The STAND NTPases are most abundant in developmentally and organizationally complex prokaryotes and eukaryotes. Transfer of genes for STAND NTPases from bacteria to eukaryotes on several occasions might have played a significant role in the evolution of eukaryotic signaling systems.
Collapse
Affiliation(s)
- Detlef D Leipe
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
60
|
Roesner A, Fuchs C, Hankeln T, Burmester T. A globin gene of ancient evolutionary origin in lower vertebrates: evidence for two distinct globin families in animals. Mol Biol Evol 2004; 22:12-20. [PMID: 15356282 DOI: 10.1093/molbev/msh258] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Hemoglobin, myoglobin, neuroglobin, and cytoglobin are four types of vertebrate globins with distinct tissue distributions and functions. Here, we report the identification of a fifth and novel globin gene from fish and amphibians, which has apparently been lost in the evolution of higher vertebrates (Amniota). Because its function is presently unknown, we tentatively call it globin X (GbX). Globin X sequences were obtained from three fish species, the zebrafish Danio rerio, the goldfish Carassius auratus, and the pufferfish Tetraodon nigroviridis, and the clawed frog Silurana tropicalis. Globin X sequences are distinct from vertebrate hemoglobins, myoglobins, neuroglobins, and cytoglobins. Globin X displays the highest identity scores with neuroglobin (approximately 26% to 35%), although it is not a neuronal protein, as revealed by RT-PCR experiments on goldfish RNA from various tissues. The distal ligand-binding and the proximal heme-binding histidines (E7 and F8), as well as the conserved phenylalanine CD1 are present in the globin X sequences, but because of extensions at the N-terminal and C-terminal, the globin X proteins are longer than the typical eight alpha-helical globins and comprise about 200 amino acids. In addition to the conserved globin introns at helix positions B12.2 and G7.0, the globin X genes contain two introns in E10.2 and H10.0. The intron in E10.2 is shifted by 1 bp in respect to the vertebrate neuroglobin gene (E11.0), providing possible evidence for an intron sliding event. Phylogenetic analyses confirm an ancient evolutionary relationship of globin X with neuroglobin and suggest the existence of two distinct globin types in the last common ancestor of Protostomia and Deuterostomia.
Collapse
Affiliation(s)
- Anja Roesner
- Institute of Zoology, Johannes Gutenberg University, Mainz, Germany
| | | | | | | |
Collapse
|
61
|
Garau G, García-Sáez I, Bebrone C, Anne C, Mercuri P, Galleni M, Frère JM, Dideberg O. Update of the standard numbering scheme for class B beta-lactamases. Antimicrob Agents Chemother 2004; 48:2347-9. [PMID: 15215079 PMCID: PMC434215 DOI: 10.1128/aac.48.7.2347-2349.2004] [Citation(s) in RCA: 231] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Gianpiero Garau
- Institut de Biologie Structurale Jean-Pierre Ebel, CEA-CNRS-UJF, Laboratoire de Cristallographie Macromoléculaire, 41 rue Jules Horowitz, F-38027 Grenoble, France
| | | | | | | | | | | | | | | |
Collapse
|
62
|
Ascenzi P, Bocedi A, de Sanctis D, Pesce A, Bolognesi M, Marden MC, Dewilde S, Moens L, Hankeln T, Burmester T. Neuroglobin and cytoglobin: Two new entries in the hemoglobin superfamily*. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2004; 32:305-313. [PMID: 21706744 DOI: 10.1002/bmb.2004.494032050386] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Neuroglobin (Ngb) and cytoglobin (Cygb) are two newly discovered intracellular members of the vertebrate hemoglobin (Hb) family. Ngb, predominantly expressed in nerve cells, is of ancient evolutionary origin and is homologous to nerve-globins of invertebrates. Cygb, present in many different tissues, shares common ancestry with myoglobin (Mb) and can be traced to early vertebrate evolution. Ngb and Cygb display the classical three-on-three α-helical globin fold and are endowed with a hexa-coordinate heme Fe atom, in both their ferrous and ferric forms, having the heme distal HisE7 residue as the endogenous sixth ligand. Reversible intramolecular hexa- to penta-coordination of the heme Fe atom modulates Ngb and Cygb ligand-binding properties. In Ngb and Cygb, ligand migration to/from the heme distal site may be assisted by protein/matrix tunnel cavity systems. The physiological roles of Ngb and Cygb are poorly understood. Ngb may protect neuronal cells from hypoxic-ischemic insults, may act as oxidative stress-responsive sensor protein, and may sustain NO/O(2) scavenging and/or reactive oxygen species (ROS) detoxification. Cygb, located in the cytoplasm of fibroblasts, chondroblasts, osteoblasts, and hepatic stellate cells, has been hypothesized to be involved in collagen synthesis. In neurons, Cygb, located in both cytoplasm and nucleus, may provide O(2) for enzymatic reactions, and may be involved in a ROS (NO)-signaling pathway(s). Here, we review current knowledge on Ngb and Cygb in terms of their structure, function, and evolutionary links to the well-known human HbA and Mb.
Collapse
Affiliation(s)
- Paolo Ascenzi
- Department of Biology and the Interdepartmental Laboratory for Electron Microscopy, University "Roma Tre," I-00146 Roma, Italy; National Institute for Infectious Diseases IRCSS "Lazzaro Spallanzani," I-00149 Roma, Italy.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
63
|
Okamoto Y, Morishita J, Tsuboi K, Tonai T, Ueda N. Molecular Characterization of a Phospholipase D Generating Anandamide and Its Congeners. J Biol Chem 2004; 279:5298-305. [PMID: 14634025 DOI: 10.1074/jbc.m306642200] [Citation(s) in RCA: 580] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Anandamide (N-arachidonoylethanolamine) is known to be an endogenous ligand of cannabinoid and vanilloid receptors. Its congeners (collectively referred to as N-acylethanolamines) also show a variety of biological activities. These compounds are principally formed from their corresponding N-acyl-phosphatidylethanolamines by a phosphodiesterase of the phospholipase D-type in animal tissues. We purified the enzyme from rat heart, and by the use of the sequences of its internal peptides cloned its complementary DNAs from mouse, rat, and human. The deduced amino acid sequences were composed of 393-396 residues, and showed that the enzyme has no homology with the known phospholipase D enzymes but is classified as a member of the zinc metallohydrolase family of the beta-lactamase fold. As was overexpressed in COS-7 cells, the recombinant enzyme generated anandamide and other N-acylethanolamines from their corresponding N-acyl-phosphatidylethanolamines at comparable rates. In contrast, the enzyme was inactive with phosphatidylcholine and phosphatidylethanolamine. Assays of the enzyme activity and the messenger RNA and protein levels revealed its wide distribution in murine organs with higher contents in the brain, kidney, and testis. These results confirm that a specific phospholipase D is responsible for the generation of N-acylethanolamines including anandamide, strongly suggesting the physiological importance of lipid molecules of this class.
Collapse
Affiliation(s)
- Yasuo Okamoto
- Department of Biochemistry, Kagawa University School of Medicine, 1750-1 Ikenobe, Miki, Kagawa 761-0793, Japan
| | | | | | | | | |
Collapse
|
64
|
de Sanctis D, Dewilde S, Pesce A, Moens L, Ascenzi P, Hankeln T, Burmester T, Bolognesi M. Crystal Structure of Cytoglobin: The Fourth Globin Type Discovered in Man Displays Heme Hexa-coordination. J Mol Biol 2004; 336:917-27. [PMID: 15095869 DOI: 10.1016/j.jmb.2003.12.063] [Citation(s) in RCA: 114] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Cytoglobin is a recently discovered hemeprotein belonging to the globin superfamily together with hemoglobin, myoglobin and neuroglobin. Although distributed in almost all human tissues, cytoglobin has not been ascribed a specific function. Human cytoglobin is composed of 190 amino acid residues. Sequence alignments show that a protein core region (about 150 residues) is structurally related to hemoglobin and myoglobin, being complemented by about 20 extra residues both on the N and C termini. In the absence of exogenous ligands (e.g. O2), the cytoglobin distal HisE7 residue is coordinated to the heme Fe atom, thus decreasing the ligand affinity. The crystal structure of human cytoglobin (2.1 A resolution, 21.3% R-factor) highlights a three-over-three alpha-helical globin fold, covering residues 18-171; the 1-17 N-terminal, and the 172-190 C-terminal residue segments are disordered in both molecules of the crystal asymmetric unit. Heme hexa-coordination is evident in one of the two cytoglobin chains, whereas alternate conformation for the heme distal region, achieving partial heme penta-coordination, is observed in the other. Human cytoglobin displays a large apolar protein matrix cavity, next to the heme, not related to the myoglobin cavities recognized as temporary ligand docking stations. The cavity, which may provide a heme ligand diffusion pathway, is connected to the external space through a narrow tunnel nestled between the globin G and H helices.
Collapse
Affiliation(s)
- Daniele de Sanctis
- Department of Physics-INFM, Centre for Excellence in Biomedical Research, University of Genova, Via Dodecaneso 33, 1-16146 Genova, Italy
| | | | | | | | | | | | | | | |
Collapse
|
65
|
Andersen C. Channel-tunnels: outer membrane components of type I secretion systems and multidrug efflux pumps of Gram-negative bacteria. Rev Physiol Biochem Pharmacol 2003; 147:122-65. [PMID: 12783268 DOI: 10.1007/s10254-003-0008-y] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
For translocation across the cell envelope of Gram-negative bacteria, substances have to overcome two permeability barriers, the inner and outer membrane. Channel-tunnels are outer membrane proteins, which are central to two distinct export systems: the type I secretion system exporting proteins such as toxins or proteases, and efflux pumps discharging antibiotics, dyes, or heavy metals and thus mediating drug resistance. Protein secretion is driven by an inner membrane ATP-binding cassette (ABC) transporter while drug efflux occurs via an inner membrane proton antiporter. Both inner membrane transporters are associated with a periplasmic accessory protein that recruits an outer membrane channel-tunnel to form a functional export complex. Prototypes of these export systems are the hemolysin secretion system and the AcrAB/TolC drug efflux pump of Escherichia coli, which both employ TolC as an outer membrane component. Its remarkable conduit-like structure, protruding 100 A into the periplasmic space, reveals how both systems are capable of transporting substrates across both membranes directly from the cytosol into the external environment. Proteins of the channel-tunnel family are widespread within Gram-negative bacteria. Their involvement in drug resistance and in secretion of pathogenic factors makes them an interesting system for further studies. Understanding the mechanism of the different export apparatus could help to develop new drugs, which block the efflux pumps or the secretion system.
Collapse
Affiliation(s)
- C Andersen
- Department of Biotechnology, University of Würzburg, 97074 Würzburg, Germany.
| |
Collapse
|
66
|
Sitbon E, Pietrokovski S. New types of conserved sequence domains in DNA-binding regions of homing endonucleases. Trends Biochem Sci 2003; 28:473-7. [PMID: 13678957 DOI: 10.1016/s0968-0004(03)00170-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
We have identified four new types of short conserved sequence domains in homing endonucleases and related proteins. These domains are modular, appearing in various combinations. One domain includes a motif known by structure as a novel sequence-specific DNA-binding helix. Sequence similarity shows two other domains to be new types of helix-turn-helix DNA-binding domains. We term the new domains nuclease-associated modular DNA-binding domains (NUMODs).
Collapse
Affiliation(s)
- Einat Sitbon
- Molecular Genetics Department, Weizmann Institute of Science, PO Box 26, Rehovot 76100, Israel
| | | |
Collapse
|
67
|
Anantharaman V, Aravind L. New connections in the prokaryotic toxin-antitoxin network: relationship with the eukaryotic nonsense-mediated RNA decay system. Genome Biol 2003; 4:R81. [PMID: 14659018 PMCID: PMC329420 DOI: 10.1186/gb-2003-4-12-r81] [Citation(s) in RCA: 195] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2003] [Revised: 10/13/2003] [Accepted: 10/10/2003] [Indexed: 11/28/2022] Open
Abstract
Sequence profile analysis of the RelE- and ParE-type post-segregational cell killing (PSK) toxins from diverse bacteria and archaea has unified these proteins into a single superfamily. Further comparative analysis suggests that the core of the eukaryotic nonsense-mediated RNA decay system has probably evolved from a PSK-related system. Background Several prokaryotic plasmids maintain themselves in their hosts by means of diverse post-segregational cell killing systems. Recent findings suggest that chromosomally encoded copies of toxins and antitoxins of post-segregational cell killing systems - such as the RelE system - might function as regulatory switches under stress conditions. The RelE toxin cleaves ribosome-associated transcripts, whereas another post-segregational cell killing toxin, ParE, functions as a gyrase inhibitor. Results Using sequence profile analysis we were able unify the RelE- and ParE-type toxins with several families of small, uncharacterized proteins from diverse bacteria and archaea into a single superfamily. Gene neighborhood analysis showed that the majority of these proteins were encoded by genes in characteristic neighborhoods, in which genes encoding toxins always co-occurred with genes encoding transcription factors that are also antitoxins. The transcription factors accompanying the RelE/ParE superfamily may belong to unrelated or distantly related superfamilies, however. We used this conserved neighborhood template to transitively search genomes and identify novel post-segregational cell killing-related systems. One of these novel systems, observed in several prokaryotes, contained a predicted toxin with a PilT-N terminal (PIN) domain, which is also found in proteins of the eukaryotic nonsense-mediated RNA decay system. These searches also identified novel transcription factors (antitoxins) in post-segregational cell killing systems. Furthermore, the toxin Doc defines a potential metalloenzyme superfamily, with novel representatives in bacteria, archaea and eukaryotes, that probably acts on nucleic acids. Conclusions The tightly maintained gene neighborhoods of post-segregational cell killing-related systems appear to have evolved by in situ displacement of genes for toxins or antitoxins by functionally equivalent but evolutionarily unrelated genes. We predict that the novel post-segregational cell killing-related systems containing a PilT-N terminal domain toxin and the eukaryotic nonsense-mediated RNA decay system are likely to function via a common mechanism, in which the PilT-N terminal domain cleaves ribosome-associated transcripts. The core of the eukaryotic nonsense-mediated RNA decay system has probably evolved from a post-segregational cell killing-related system.
Collapse
Affiliation(s)
- Vivek Anantharaman
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
68
|
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA. The COG database: an updated version includes eukaryotes. BMC Bioinformatics 2003; 4:41. [PMID: 12969510 PMCID: PMC222959 DOI: 10.1186/1471-2105-4-41] [Citation(s) in RCA: 3241] [Impact Index Per Article: 154.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2003] [Accepted: 09/11/2003] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. RESULTS We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. CONCLUSION The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.
Collapse
Affiliation(s)
- Roman L Tatusov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Natalie D Fedorova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - John D Jackson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Aviva R Jacobs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Boris Kiryutin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Dmitri M Krylov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Raja Mazumder
- Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA
| | - Sergei L Mekhedov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Anastasia N Nikolskaya
- Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA
| | - B Sridhar Rao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Sergei Smirnov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Alexander V Sverdlov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Sona Vasudevan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Jodie J Yin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA
| | - Darren A Natale
- Protein Information Resource, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20007, USA
| |
Collapse
|
69
|
Abstract
It is often desired to identify further homologs of a family of biological sequences from the ever-growing sequence databases. Profile hidden Markov models excel at capturing the common statistical features of a group of biological sequences. With these common features, we can search the biological database and find new homologous sequences. Most general profile hidden Markov model methods, however, treat the evolutionary relationships between the sequences in a homologous group in an ad-hoc manner. We hereby introduce a method to incorporate phylogenetic information directly into hidden Markov models, and demonstrate that the resulting model performs better than most of the current multiple sequence-based methods for finding distant homologs.
Collapse
Affiliation(s)
- Bin Qian
- Biophysics Research Division, University of Michigan, Ann Arbor, Michigan, USA
| | | |
Collapse
|
70
|
Jackson DB, Minch E, Munro RE. Bioinformatics. EXS 2003:31-69. [PMID: 12613171 DOI: 10.1007/978-3-0348-7997-2_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/01/2023]
|
71
|
Teufel M, Saudek V, Ledig JP, Bernhardt A, Boularand S, Carreau A, Cairns NJ, Carter C, Cowley DJ, Duverger D, Ganzhorn AJ, Guenet C, Heintzelmann B, Laucher V, Sauvage C, Smirnova T. Sequence identification and characterization of human carnosinase and a closely related non-specific dipeptidase. J Biol Chem 2003; 278:6521-31. [PMID: 12473676 DOI: 10.1074/jbc.m209764200] [Citation(s) in RCA: 241] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Carnosine (beta-alanyl-L-histidine) and homocarnosine (gamma-aminobutyric acid-L-histidine) are two naturally occurring dipeptides with potential neuroprotective and neurotransmitter functions in the brain. Peptidase activities degrading both carnosine and homocarnosine have been described previously, but the genes linked to these activities were unknown. Here we present the identification of two novel cDNAs named CN1 and CN2 coding for two proteins of 56.8 and 52.7 kDa and their classification as members of the M20 metalloprotease family. Whereas human CN1 mRNA and protein are brain-specific, CN2 codes for a ubiquitous protein. In contrast, expression of the mouse and rat CN1 orthologues was detectable only in kidney. The recombinant CN1 and CN2 proteins were expressed in Chinese hamster ovary cells and purified to homogeneity. CN1 was identified as a homodimeric dipeptidase with a narrow substrate specificity for Xaa-His dipeptides including those with Xaa = beta Ala (carnosine, K(m) 1.2 mM), N-methyl beta Ala, Ala, Gly, and gamma-aminobutyric acid (homocarnosine, K(m) 200 microM), an isoelectric point of pH 4.5, and maximal activity at pH 8.5. CN2 protein is a dipeptidase not limited to Xaa-His dipeptides, requires Mn(2+) for full activity, and is sensitive to inhibition by bestatin (IC(50) 7 nM). This enzyme does not degrade homocarnosine and hydrolyzes carnosine only at alkaline pH with an optimum at pH 9.5. Based on their substrate specificity and biophysical and biochemical properties CN1 was identified as human carnosinase (EC ), whereas CN2 corresponds to the cytosolic nonspecific dipeptidase (EC ).
Collapse
Affiliation(s)
- Michael Teufel
- Department of Exploratory Research, Sanofi Synthelabo Recherche, 16 Rue d'Ankara, F-67080 Strasbourg, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
72
|
Iyer LM, Koonin EV, Aravind L. Evolutionary connection between the catalytic subunits of DNA-dependent RNA polymerases and eukaryotic RNA-dependent RNA polymerases and the origin of RNA polymerases. BMC STRUCTURAL BIOLOGY 2003; 3:1. [PMID: 12553882 PMCID: PMC151600 DOI: 10.1186/1472-6807-3-1] [Citation(s) in RCA: 165] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2003] [Accepted: 01/28/2003] [Indexed: 12/02/2022]
Abstract
BACKGROUND The eukaryotic RNA-dependent RNA polymerase (RDRP) is involved in the amplification of regulatory microRNAs during post-transcriptional gene silencing. This enzyme is highly conserved in most eukaryotes but is missing in archaea and bacteria. No evolutionary relationship between RDRP and other polymerases has been reported so far, hence the origin of this eukaryote-specific polymerase remains a mystery. RESULTS Using extensive sequence profile searches, we identified bacteriophage homologs of the eukaryotic RDRP. The comparison of the eukaryotic RDRP and their homologs from bacteriophages led to the delineation of the conserved portion of these enzymes, which is predicted to harbor the catalytic site. Further, detailed sequence comparison, aided by examination of the crystal structure of the DNA-dependent RNA polymerase (DDRP), showed that the RDRP and the beta' subunit of DDRP (and its orthologs in archaea and eukaryotes) contain a conserved double-psi beta-barrel (DPBB) domain. This DPBB domain contains the signature motif DbDGD (b is a bulky residue), which is conserved in all RDRPs and DDRPs and contributes to catalysis via a coordinated divalent cation. Apart from the DPBB domain, no similarity was detected between RDRP and DDRP, which leaves open two scenarios for the origin of RDRP: i) RDRP evolved at the onset of the evolution of eukaryotes via a duplication of the DDRP beta' subunit followed by dramatic divergence that obliterated the sequence similarity outside the core catalytic domain and ii) the primordial RDRP, which consisted primarily of the DPBB domain, evolved from a common ancestor with the DDRP at a very early stage of evolution, during the RNA world era. The latter hypothesis implies that RDRP had been subsequently eliminated from cellular life forms and might have been reintroduced into the eukaryotic genomes through a bacteriophage. Sequence and structure analysis of the DDRP led to further insights into the evolution of RNA polymerases. In addition to the beta' subunit, beta subunit of DDRP also contains a DPBB domain, which is, however, distorted by large inserts and does not harbor a counterpart of the DbDGD motif. The DPBB domains of the two DDRP subunits together form the catalytic cleft, with the domain from the beta' subunit supplying the metal-coordinating DbDGD motif and the one from the beta subunit providing two lysine residues involved in catalysis. Given that the two DPBB domains of DDRP contribute completely different sets of active residues to the catalytic center, it is hypothesized that the ultimate ancestor of RNA polymerases functioned as a homodimer of a generic, RNA-binding DPBB domain. This ancestral protein probably did not have catalytic activity and served as a cofactor for a ribozyme RNA polymerase. Subsequent evolution of DDRP and RDRP involved accretion of distinct sets of additional domains. In the DDRPs, these included a RNA-binding Zn-ribbon, an AT-hook-like module and a sandwich-barrel hybrid motif (SBHM) domain. Further, lineage-specific accretion of SBHM domains and other, DDRP-specific domains is observed in bacterial DDRPs. In contrast, the orthologs of the beta' subunit in archaea and eukaryotes contains a four-stranded alpha + beta domain that is shared with the alpha-subunit of bacterial DDRP, eukaryotic DDRP subunit RBP11, translation factor eIF1 and type II topoisomerases. The additional domains of the RDRPs remain to be characterized. CONCLUSIONS Eukaryotic RNA-dependent RNA polymerases share the catalytic double-psi beta-barrel domain, containing a signature metal-coordinating motif, with the universally conserved beta' subunit of DNA-dependent RNA polymerases. Beyond this core catalytic domain, the two classes of RNA polymerases do not have common domains, suggesting early divergence from a common ancestor, with subsequent independent domain accretion. The beta-subunit of DDRP contains another, highly diverged DPBB domain. The presence of two distinct DPBB domains in two subunits of DDRP is compatible with the hypothesis that the ith the hypothesis that the ultimate ancestor of RNA polymerases was a RNA-binding DPBB domain that had no catalytic activity but rather functioned as a homodimeric cofactor for a ribozyme polymerase.
Collapse
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
73
|
Panchenko AR. Finding weak similarities between proteins by sequence profile comparison. Nucleic Acids Res 2003; 31:683-9. [PMID: 12527777 PMCID: PMC140518 DOI: 10.1093/nar/gkg154] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
To improve the recognition of weak similarities between proteins a method of aligning two sequence profiles is proposed. It is shown that exploring the sequence space in the vicinity of the sequence with unknown properties significantly improves the performance of sequence alignment methods. Consistent with the previous observations the recognition sensitivity and alignment accuracy obtained by a profile-profile alignment method can be as much as 30% higher compared to the sequence-profile alignment method. It is demonstrated that the choice of score function and the diversity of the test profile are very important factors for achieving the maximum performance of the method, whereas the optimum range of these parameters depends on the level of similarity to be recognized.
Collapse
Affiliation(s)
- Anna R Panchenko
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, Room 8N805, 8600 Rockville Pike, Bethesda, MD 20894, USA.
| |
Collapse
|
74
|
Tulin A, Chinenov Y, Spradling A. Regulation of chromatin structure and gene activity by poly(ADP-ribose) polymerases. Curr Top Dev Biol 2003; 56:55-83. [PMID: 14584726 DOI: 10.1016/s0070-2153(03)01007-x] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Alexei Tulin
- HHMI Laboratories, Carnegie Institution of Washington, Baltimore, Maryland 21210, USA
| | | | | |
Collapse
|
75
|
Pesce A, Bolognesi M, Bocedi A, Ascenzi P, Dewilde S, Moens L, Hankeln T, Burmester T. Neuroglobin and cytoglobin. Fresh blood for the vertebrate globin family. EMBO Rep 2002; 3:1146-51. [PMID: 12475928 PMCID: PMC1308314 DOI: 10.1093/embo-reports/kvf248] [Citation(s) in RCA: 232] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Neuroglobin and cytoglobin are two recently discovered members of the vertebrate globin family. Both are intracellular proteins endowed with hexacoordinated heme-Fe atoms, in their ferrous and ferric forms, and display O2 affinities comparable with that of myoglobin. Neuroglobin, which is predominantly expressed in nerve cells, is thought to protect neurons from hypoxic-ischemic injury. It is of ancient evolutionary origin, and is homologous to nerve globins of invertebrates. Cytoglobin is expressed in many different tissues, although at varying levels. It shares common ancestry with myoglobin, and can be traced to early vertebrate evolution. The physiological roles of neuroglobin and cytoglobin are not completely understood. Although supplying cells with O2 is the likely function, it is also possible that both globins act as O2-consuming enzymes or as O2 sensors. Here, we review what is currently known about neuroglobin and cytoglobin in terms of their function, tissue distribution and relatedness to the well-known hemoglobin and myoglobin. Strikingly, the data reveal that O2 metabolism in cells is more complicated than was thought before, requiring unexpected O2-binding proteins with potentially novel functional features.
Collapse
Affiliation(s)
- Alessandra Pesce
- Department of Physics–INFM and Center for Excellence in Biomedical Research, University of Genova, Via Dodecaneso 33, I-16146 Genova, Italy
| | - Martino Bolognesi
- Department of Physics–INFM and Center for Excellence in Biomedical Research, University of Genova, Via Dodecaneso 33, I-16146 Genova, Italy
- Tel/Fax: +39 010 5737 306;
| | - Alessio Bocedi
- Department of Biology, University 'Roma Tre', Viale Guglielmo Marconi 446, I-00146 Roma, Italy
| | - Paolo Ascenzi
- Department of Biology, University 'Roma Tre', Viale Guglielmo Marconi 446, I-00146 Roma, Italy
| | - Sylvia Dewilde
- Department of Biochemistry, University of Antwerp, Universiteitsplein 1, B-2610 Antwerp, Belgium
| | - Luc Moens
- Department of Biochemistry, University of Antwerp, Universiteitsplein 1, B-2610 Antwerp, Belgium
| | - Thomas Hankeln
- Institute of Molecular Genetics, Johannes Gutenberg University of Mainz, Becherweg 32, D-55099 Mainz, Germany
| | - Thorsten Burmester
- Institute of Zoology, Johannes Gutenberg University of Mainz, Müllerweg 6, D-55099 Mainz, Germany
| |
Collapse
|
76
|
Wilk S, Wilk E, Magnusson RP. Identification of histidine residues important in the catalysis and structure of aspartyl aminopeptidase. Arch Biochem Biophys 2002; 407:176-83. [PMID: 12413488 DOI: 10.1016/s0003-9861(02)00494-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Aspartyl aminopeptidase (DAP), a widely distributed and abundant cytosolic enzyme, removes glutamyl or aspartyl residues from N-terminal acidic amino acid-containing peptides. DAP is a member of the M18 family of the MH clan of cocatalytic metallopeptidases. The human and mouse enzymes have been cloned. We have identified 8 highly homologous eukaryotic sequences that are probable aspartyl aminopeptidases. Eight histidine residues of human DAP were sequentially mutated to phenylalanine. Mutation of His94, His170, and His440 abolished enzymatic activity. His94 and His440 are postulated to be involved in binding cocatalytic zinc atoms by homology with other members of the MH clan. Mutation of His352 dramatically reduced enzyme activity. Gel-filtration analysis of the His352 mutant revealed destabilization of the quaternary structure and dissociation of the native 440-kDa enzyme. Mutation of His33 and of histidines residing in a cluster at residues 349, 359, and 363 all decreased k(cat). These studies reveal an important role for histidine residues both in catalysis and in the structural integrity of DAP.
Collapse
Affiliation(s)
- Sherwin Wilk
- Department of Pharmacology, Box 1215, Mount Sinai School of Medicine, One Gustave L Levy Place, New York, NY 10029, USA.
| | | | | |
Collapse
|
77
|
Shaw E, Dordick JS. Predicting amino acid residues responsible for enzyme specificity solely from protein sequences. Biotechnol Bioeng 2002; 79:295-300. [PMID: 12115418 DOI: 10.1002/bit.10289] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
We describe a general, modular method for developing protocols to identify the amino acid residues that most likely define the division of a protein superfamily into two subsets. As one possibility, we use PROBE to gather superfamily members and perform an ungapped alignment. We then use a modified BLOSUM62 substitution matrix to determine the discriminating power of each column of aligned residues. The overall method is particularly useful for predicting amino acids responsible for substrate or binding specificity when no structures are available. We apply our method to three pairs of protein classes in three different superfamilies, and present our results, some of which have been experimentally verified. This approach may accelerate the elucidation of enzymic substrate specificity, which is critical for both mechanistic insights into biocatalysis and ultimate application.
Collapse
Affiliation(s)
- Eudean Shaw
- Department of Chemical Engineering, Rensselaer Polytechnic Institute, 110 8th Street, Troy, New York 12180, USA
| | | |
Collapse
|
78
|
Abstract
Multidrug efflux transporters are normal constituents of bacterial cells. These transporters are major contributors to intrinsic resistance of bacteria to many anti-microbial agents. In clinical settings, exposure to antibiotics promotes the mutational overexpression of active or silent multidrug transporters, leading to increased antibiotic resistance without acquisition of multiple, specific resistance determinants. The paradoxical ability of multidrug transporters to recognize and efficiently expel from cells scores of dissimilar organic compounds has been in the focus of extensive research for many years. Several independent studies implied that the mechanistic basis of such ability lies in a distinctive locus of the transporter-substrate interaction: the multidrug transporters select and bind their substrates within the phospholipid bilayer. The recently reported high-resolution structure of a complete MsbA transporter of Escherichia coli provides a solid structural basis for these studies. Although the majority of multidrug transporters function as single-component pumps, major transporters of Gram-negative bacteria are organized as three-component structures. Special outer membrane channels and periplasmic proteins belonging to the membrane fusion protein family enable drug efflux across a Gram-negative two-membrane envelope, directly into the external medium. This minireview focuses on the current status of research in the field of multidrug efflux mechanisms.
Collapse
Affiliation(s)
- Helen I Zgurskaya
- Department of Chemistry and Biochemistry, University of Oklahoma, 620 Parrington Oval, Room 208, Norman, OK 73019, USA.
| |
Collapse
|
79
|
Aravind L, Anantharaman V, Koonin EV. Monophyly of class I aminoacyl tRNA synthetase, USPA, ETFP, photolyase, and PP-ATPase nucleotide-binding domains: implications for protein evolution in the RNA. Proteins 2002; 48:1-14. [PMID: 12012333 DOI: 10.1002/prot.10064] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Protein sequence and structure comparisons show that the catalytic domains of Class I aminoacyl-tRNA synthetases, a related family of nucleotidyltransferases involved primarily in coenzyme biosynthesis, nucleotide-binding domains related to the UspA protein (USPA domains), photolyases, electron transport flavoproteins, and PP-loop-containing ATPases together comprise a distinct class of alpha/beta domains designated the HUP domain after HIGH-signature proteins, UspA, and PP-ATPase. Several lines of evidence are presented to support the monophyly of the HUP domains, to the exclusion of other three-layered alpha/beta folds with the generic "Rossmann-like" topology. Cladistic analysis, with patterns of structural and sequence similarity used as discrete characters, identified three major evolutionary lineages within the HUP domain class: the PP-ATPases; the HIGH superfamily, which includes class I aaRS and related nucleotidyltransferases containing the HIGH signature in their nucleotide-binding loop; and a previously unrecognized USPA-like group, which includes USPA domains, electron transport flavoproteins, and photolyases. Examination of the patterns of phyletic distribution of distinct families within these three major lineages suggests that the Last Universal Common Ancestor of all modern life forms encoded 15-18 distinct alpha/beta ATPases and nucleotide-binding proteins of the HUP class. This points to an extensive radiation of HUP domains before the last universal common ancestor (LUCA), during which the multiple class I aminoacyl-tRNA synthetases emerged only at a late stage. Thus, substantial evolutionary diversification of protein domains occurred well before the modern version of the protein-dependent translation machinery was established, i.e., still in the RNA world.
Collapse
Affiliation(s)
- L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA.
| | | | | |
Collapse
|
80
|
Abstract
Fold recognition predicts protein three-dimensional structure by establishing relationships between a protein sequence and known protein structures. Most methods explicitly use information derived from the secondary and tertiary structure of the templates. Here we show that rigorous application of a sequence search method (PSI-BLAST) with no reference to secondary or tertiary structure information is able to perform as well as traditional fold recognition methods. Since the method, SENSER, does not require knowledge of the three-dimensional structure, it can be used to infer relationships that are not tractable by methods dependent on structural templates.
Collapse
Affiliation(s)
- Kristin K Koretke
- Microbial Bioinformatics Group, GlaxoSmithKline, Collegeville, Pennsylvania 19426-0989, USA.
| | | | | |
Collapse
|
81
|
Pesce A, Nardini M, Dewilde S, Geuens E, Yamauchi K, Ascenzi P, Riggs AF, Moens L, Bolognesi M. The 109 residue nerve tissue minihemoglobin from Cerebratulus lacteus highlights striking structural plasticity of the alpha-helical globin fold. Structure 2002; 10:725-35. [PMID: 12015154 DOI: 10.1016/s0969-2126(02)00763-3] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
A very short hemoglobin (CerHb; 109 amino acids) binds O(2) cooperatively in the nerve tissue of the nemertean worm Cerebratulus lacteus to sustain neural activity during anoxia. Sequence analysis suggests that CerHb tertiary structure may be unique among the known globin fold evolutionary variants. The X-ray structure of oxygenated CerHb (R factor 15.3%, at 1.5 A resolution) displays deletion of the globin N-terminal A helix, an extended GH region, a very short H helix, and heme solvent shielding based on specific aromatic residues. The heme-bound O(2) is stabilized by hydrogen bonds to the distal TyrB10-GlnE7 pair. Ligand access to heme may take place through a wide protein matrix tunnel connecting the distal site to a surface cleft located between the E and H helices.
Collapse
Affiliation(s)
- Alessandra Pesce
- Department of Physics, INFM, Advanced Biotechnology Centre, University of Genova, Largo Rosanna Benzi 10, I-16146 Genova, Italy
| | | | | | | | | | | | | | | | | |
Collapse
|
82
|
Shaw E, McCue LA, Lawrence CE, Dordick JS. Identification of a novel class in the alpha/beta hydrolase fold superfamily: the N-myc differentiation-related proteins. Proteins 2002; 47:163-8. [PMID: 11933063 DOI: 10.1002/prot.10083] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The alpha/beta hydrolases constitute a large protein superfamily that mainly consists of enzymes that catalyze a diverse range of reactions. These proteins exhibit the alpha/beta hydrolase fold, the essential features of which have recently been delineated: the presence of at least five parallel beta-strands, a catalytic triad in a specific order (nucleophile-acid-histidine), and a nucleophilic elbow. Because of the difficulties experimentally in identifying protein structures, we have used a Bayesian computational algorithm (PROBE) to identify the members of this superfamily based on distant sequence relationships. We found that the presence of five sequence motifs, which contain residues important for substrate binding and stabilization of the fold, are required for membership in this superfamily. The superfamily consists of at least 909 members, including the N-myc downstream regulated proteins, which are believed to be involved in cell differentiation. Unlike most of the other superfamily members, the N-myc downstream regulated proteins have never been proposed to possess the alpha/beta hydrolase fold and do not appear to be hydrolases.
Collapse
Affiliation(s)
- Eudean Shaw
- Department of Chemical Engineering, Rensselaer Polytechnic Institute, Troy, New York, USA
| | | | | | | |
Collapse
|
83
|
Bujnicki JM, Rychlewski L. RNA:(guanine-N2) methyltransferases RsmC/RsmD and their homologs revisited--bioinformatic analysis and prediction of the active site based on the uncharacterized Mj0882 protein structure. BMC Bioinformatics 2002; 3:10. [PMID: 11929612 PMCID: PMC102759 DOI: 10.1186/1471-2105-3-10] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2001] [Accepted: 04/03/2002] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Escherichia coli guanine-N2 (m2G) methyltransferases (MTases) RsmC and RsmD modify nucleosides G1207 and G966 of 16S rRNA. They possess a common MTase domain in the C-terminus and a variable region in the N-terminus. Their C-terminal domain is related to the YbiN family of hypothetical MTases, but nothing is known about the structure or function of the N-terminal domain. RESULTS Using a combination of sequence database searches and fold recognition methods it has been demonstrated that the N-termini of RsmC and RsmD are related to each other and that they represent a "degenerated" version of the C-terminal MTase domain. Novel members of the YbiN family from Archaea and Eukaryota were also indentified. It is inferred that YbiN and both domains of RsmC and RsmD are closely related to a family of putative MTases from Gram-positive bacteria and Archaea, typified by the Mj0882 protein from M. jannaschii (1dus in PDB). Based on the results of sequence analysis and structure prediction, the residues involved in cofactor binding, target recognition and catalysis were identified, and the mechanism of the guanine-N2 methyltransfer reaction was proposed. CONCLUSIONS Using the known Mj0882 structure, a comprehensive analysis of sequence-structure-function relationships in the family of genuine and putative m2G MTases was performed. The results provide novel insight into the mechanism of m2G methylation and will serve as a platform for experimental analysis of numerous uncharacterized N-MTases.
Collapse
Affiliation(s)
- Janusz M Bujnicki
- Bioinformatics Laboratory, International Institute of Cell and Molecular Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland
| | | |
Collapse
|
84
|
Burmester T, Ebner B, Weich B, Hankeln T. Cytoglobin: a novel globin type ubiquitously expressed in vertebrate tissues. Mol Biol Evol 2002; 19:416-21. [PMID: 11919282 DOI: 10.1093/oxfordjournals.molbev.a004096] [Citation(s) in RCA: 364] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Vertebrates possess multiple respiratory globins that differ in terms of structure, function, and tissue distribution. Three types of globins have been described so far: hemoglobin facilitates the transport of oxygen in the blood, myoglobin serves oxygen transport and storage in the muscle, and neuroglobin has a yet unidentified function in nerve cells. Here we report the identification of a fourth and novel type of globin in mouse, man, and zebrafish. It is expressed in apparently all types of human tissue and therefore has been called cytoglobin (CYGB). Mouse and human CYGBs comprise 190 amino acids; the zebrafish CYGB, 174 amino acids. The human CYGB gene is located on chromosome 17q25. The mammalian genes display a unique exon-intron pattern with an additional exon resulting in a C-terminal extension of the protein, which is absent in the fish CYGB. Phylogenetic analyses suggest that the CYGBs had a common ancestor with vertebrate myoglobins. This indicates that the vertebrate myoglobins are in fact a specialized intracellular globin that evolved in adaptation to the special needs of muscle cells.
Collapse
Affiliation(s)
- Thorsten Burmester
- Institute of Zoology, Biosafety Research and Consulting, Johannes Gutenberg University Mainz, Becherweg 32, D-55099 Mainz, Germany.
| | | | | | | |
Collapse
|
85
|
|
86
|
Purkayastha A, McCue LA, McDonough KA. Identification of a Mycobacterium tuberculosis putative classical nitroreductase gene whose expression is coregulated with that of the acr aene within macrophages, in standing versus shaking cultures, and under low oxygen conditions. Infect Immun 2002; 70:1518-29. [PMID: 11854240 PMCID: PMC127740 DOI: 10.1128/iai.70.3.1518-1529.2002] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Tuberculosis remains a leading killer worldwide, and new approaches for its treatment and prevention are urgently needed. This effort will benefit greatly from a better understanding of gene regulation in Mycobacterium tuberculosis, particularly with respect to this pathogen's response to its host environment. We examined the behavior of two promoters from the divergently transcribed M. tuberculosis genes acr/hspX/Rv2031c (alpha-crystallin homolog) and Rv2032/acg (acr-coregulated gene) by using a promoter-GFP fusion assay in Mycobacterium bovis BCG. We found that Rv2032 is a novel macrophage-induced gene whose expression is coregulated with that of acr. Relative levels of intracellular induction for both promoters were significantly affected by shallow standing versus shaking bacterial culture conditions prior to macrophage infection, and both promoters were strongly induced under low oxygen conditions. Deletion analyses showed that DNA sequences within a 43-bp region were required for expression of these promoters under all conditions. Multiple sequence alignment and database searches performed with PROBE indicated that Rv2032 is one of eight M. tuberculosis genes of previously unknown function that belong to an unusual superfamily of classical nitroreductases, which may have a role for bacteria within the host environment. These findings show that mycobacterial culture conditions can greatly influence the results and interpretation of subsequent gene regulation experiments. We propose that these differences might be exploited for dissection of the regulatory factors that affect mycobacterial gene expression within the host.
Collapse
Affiliation(s)
- Anjan Purkayastha
- Department of Biomedical Sciences, University of Albany School of Public Health, Albany, New York 12201-2002, USA
| | | | | |
Collapse
|
87
|
Chinenov Y. A second catalytic domain in the Elp3 histone acetyltransferases: a candidate for histone demethylase activity? Trends Biochem Sci 2002; 27:115-7. [PMID: 11893502 DOI: 10.1016/s0968-0004(02)02058-3] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
A new subfamily of two-domain histone acetyltransferases (HATs) related to Elp3 has been identified. In addition to a HAT domain in the C terminus, these proteins have an N-terminal domain similar to the catalytic domain of S-adenosylmethionine radical enzymes. Two-domain organization is preserved in evolution, suggesting that both enzymatic activities are functionally or mechanistically coupled and directed towards highly conserved substrates. The functional implications of this similarity and a possible role for Elp3-related proteins as histone demethylases are discussed.
Collapse
Affiliation(s)
- Yurii Chinenov
- Howard Hughes Medical Institute, University of Michigan Medical Center, 1150 W. Medical Center Dr., Ann Arbor, MI 48109-0650, USA.
| |
Collapse
|
88
|
Aravind L, Koonin EV. Classification of the caspase-hemoglobinase fold: detection of new families and implications for the origin of the eukaryotic separins. Proteins 2002; 46:355-67. [PMID: 11835511 DOI: 10.1002/prot.10060] [Citation(s) in RCA: 134] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
A comprehensive sequence and structural comparative analysis of the caspase-hemoglobinase protein fold resulted in the delineation of the minimal structural core of the protease domain and the identification of numerous, previously undetected members, including a new protease family typified by the HetF protein from the cyanobacterium Nostoc. The first bacterial homologs of legumains and hemoglobinases were also identified. Most proteins containing this fold are known or predicted to be active proteases, but multiple, independent inactivations were noticed in nearly all lineages. Together with the tendency of caspase-related proteases to form intramolecular or intermolecular dimers, this suggests a widespread regulatory role for the inactive forms. A classification of the caspase-hemoglobinase fold was developed to reflect the inferred evolutionary relationships between the constituent protein families. Proteins containing this domain were so far detected almost exclusively in bacteria and eukaryotes. This analysis indicates that caspase-hemoglobinase-fold proteases and their inactivated derivatives are widespread in diverse bacteria, particularly those with a complex development, such as Streptomyces, Anabaena, Mesorhizobium, and Myxococcus. The eukaryotic separin family was shown to be most closely related to the mainly prokaryotic HetF family. The phyletic patterns and evolutionary relationships between these proteins suggest that they probably were acquired by eukaryotes from bacteria during the primary, promitochondrial endosymbiosis. A similar scenario, supported by phylogenetic analysis, seems to apply to metacaspases and paracaspases, with the latter, perhaps, being acquired in an independent horizontal transfer to the eukaryotes. The acquisition of the caspase-hemoglobinase-fold domains by eukaryotes might have been critical in the evolution of important eukaryotic processes, such as mitosis and programmed cell death.
Collapse
Affiliation(s)
- L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA.
| | | |
Collapse
|
89
|
Panchenko AR, Bryant SH. A comparison of position-specific score matrices based on sequence and structure alignments. Protein Sci 2002; 11:361-70. [PMID: 11790846 PMCID: PMC2373449 DOI: 10.1110/ps.19902] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Abstract
Sequence comparison methods based on position-specific score matrices (PSSMs) have proven a useful tool for recognition of the divergent members of a protein family and for annotation of functional sites. Here we investigate one of the factors that affects overall performance of PSSMs in a PSI-BLAST search, the algorithm used to construct the seed alignment upon which the PSSM is based. We compare PSSMs based on alignments constructed by global sequence similarity (ClustalW and ClustalW-pairwise), local sequence similarity (BLAST), and local structure similarity (VAST). To assess performance with respect to identification of conserved functional or structural sites, we examine the accuracy of the three-dimensional molecular models predicted by PSSM-sequence alignments. Using the known structures of those sequences as the standard of truth, we find that model accuracy varies with the algorithm used for seed alignment construction in the pattern local-structure (VAST) > local-sequence (BLAST) > global-sequence (ClustalW). Using structural similarity of query and database proteins as the standard of truth, we find that PSSM recognition sensitivity depends primarily on the diversity of the sequences included in the alignment, with an optimum around 30-50% average pairwise identity. We discuss these observations, and suggest a strategy for constructing seed alignments that optimize PSSM-sequence alignment accuracy and recognition sensitivity.
Collapse
Affiliation(s)
- Anna R Panchenko
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | | |
Collapse
|
90
|
Bornberg-Bauer E. Randomness, Structural Uniqueness, Modularity and Neutral Evolution in Sequence Space of Model Proteins. ACTA ACUST UNITED AC 2002. [DOI: 10.1524/zpch.2002.216.2.139] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The genotype-phenotype map for short chains of a protein-like hetero-polymer model has been characterised [9, 12]. Hydrophobic-Polar (HP) sequences on a square lattice, their structures and partition functions have been exhaustively enumerated and analysed. Homologous sequences folding uniquely into the same structure are interconnected by point mutations. These
Collapse
|
91
|
Håkansson K, Miller CG. Structure of peptidase T from Salmonella typhimurium. EUROPEAN JOURNAL OF BIOCHEMISTRY 2002; 269:443-50. [PMID: 11856302 DOI: 10.1046/j.0014-2956.2001.02665.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The structure of peptidase T, or tripeptidase, was determined by multiple wavelength anomalous dispersion (MAD) methodology and refined to 2.4 A resolution. Peptidase T comprises two domains; a catalytic domain with an active site containing two metal ions, and a smaller domain formed through a long insertion into the catalytic domain. The two metal ions, presumably zinc, are separated by 3.3 A, and are coordinated by five carboxylate and histidine ligands. The molecular surface of the active site is negatively charged. Peptidase T has the same basic fold as carboxypeptidase G2. When the structures of the two enzymes are superimposed, a number of homologous residues, not evident from the sequence alone, could be identified. Comparison of the active sites of peptidase T, carboxypeptidase G2, Aeromonas proteolytica aminopeptidase, carboxypeptidase A and leucine aminopeptidase reveals a common structural framework with interesting similarities and differences in the active sites and in the zinc coordination. A putative binding site for the C-terminal end of the tripeptide substrate was found at a peptidase T specific fingerprint sequence motif.
Collapse
Affiliation(s)
- Kjell Håkansson
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | |
Collapse
|
92
|
Bujnicki JM. In silico analysis of the tRNA:m1A58 methyltransferase family: homology-based fold prediction and identification of new members from Eubacteria and Archaea. FEBS Lett 2001; 507:123-7. [PMID: 11684083 DOI: 10.1016/s0014-5793(01)02962-3] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The amino acid sequences of Gcd10p and Gcd14p, the two subunits of the tRNA:(1-methyladenosine-58; m(1)A58) methyltransferase (MTase) of Saccharomyces cerevisiae, have been analyzed using iterative sequence database searches and fold recognition programs. The results suggest that the 'catalytic' Gcd14p and 'substrate binding' Gcd10p are related to each other and to a group of prokaryotic open reading frames, which were previously annotated as hypothetical protein isoaspartate MTases in sequence databases. It is predicted that the prokaryotic proteins are genuine tRNA:m(1)A MTases based on similarity of their predicted active site to the Gcd14p family. In addition to the MTase domain, an additional domain was identified in the N-terminus of all these proteins that may be involved in interaction with tRNA. These results suggest that the eukaryotic tRNA:m(1)A58 MTase is a product of gene duplication and divergent evolution of a possibly homodimeric prokaryotic enzyme.
Collapse
Affiliation(s)
- J M Bujnicki
- Bioinformatics Laboratory, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland.
| |
Collapse
|
93
|
Sharff A, Fanutti C, Shi J, Calladine C, Luisi B. The role of the TolC family in protein transport and multidrug efflux. From stereochemical certainty to mechanistic hypothesis. EUROPEAN JOURNAL OF BIOCHEMISTRY 2001; 268:5011-26. [PMID: 11589692 DOI: 10.1046/j.0014-2956.2001.02442.x] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Gram-negative bacteria are enveloped by a system of two membranes, and they use specialized multicomponent, energy-driven pumps to transport molecules directly across this double-layered partition from the cell interior to the extra-cellular environment. One component of these pumps is embedded in the outer-membrane, and the paradigm for its structure and function is the TolC protein from Escherichia coli. A common component of a wide variety of efflux pumps, TolC and its homologues are involved in the export of chemically diverse molecules ranging from large protein toxins, such as alpha-hemolysin, to small toxic compounds, such as antibiotics. TolC family members thus play important roles in conferring pathogenic bacteria with both virulence and multidrug resistance. These pumps assemble reversibly in a transient process that brings together TolC or its homologue, an inner-membrane-associated periplasmic component, an integral inner-membrane translocase and the substrate itself. TolC can associate in this fashion with a variety of different partners to participate in the transport of diverse substrates. We review here the structure and function of TolC and the other components of the efflux/transport pump.
Collapse
Affiliation(s)
- A Sharff
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
| | | | | | | | | |
Collapse
|
94
|
Spencer J, Clarke AR, Walsh TR. Novel mechanism of hydrolysis of therapeutic beta-lactams by Stenotrophomonas maltophilia L1 metallo-beta-lactamase. J Biol Chem 2001; 276:33638-44. [PMID: 11443136 DOI: 10.1074/jbc.m105550200] [Citation(s) in RCA: 82] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Stopped-flow tryptophan fluorescence under single turnover and pseudo-first-order conditions has been used to investigate the kinetic mechanism of beta-lactam hydrolysis by the Stenotrophomonas maltophilia L1 metallo-beta-lactamase. For the cephalosporin substrates nitrocefin and cefaclor and the carbapenem meropenem, a substantial quench of fluorescence is observed on association of substrate with enzyme. We have assigned this to a rearrangement event subsequent to formation of an initial collision complex. For the colorimetric compound nitrocefin, decay of this dark inter- mediate represents the overall rate-determining step for the reaction and is equivalent to decay of a previously observed state in which the beta-lactam amide bond has already been cleaved. For both cefaclor and meropenem, the rate-determining step for hydrolysis is loss of a second, less quenched state, in which, however, the beta-lactam amide bond remains intact. We suggest, therefore, that the mechanism of hydrolysis of nitrocefin by binuclear metallo-beta-lactamases may be atypical and that cleavage of the beta-lactam amide bond is the rate-determining step for breakdown of the majority of beta-lactam substrates by the L1 enzyme.
Collapse
Affiliation(s)
- J Spencer
- Department of Pathology and Microbiology, University of Bristol School of Medical Sciences, University Walk, Bristol BS8 1TD, United Kingdom.
| | | | | |
Collapse
|
95
|
Florczyk MA, McCue LA, Stack RF, Hauer CR, McDonough KA. Identification and characterization of mycobacterial proteins differentially expressed under standing and shaking culture conditions, including Rv2623 from a novel class of putative ATP-binding proteins. Infect Immun 2001; 69:5777-85. [PMID: 11500455 PMCID: PMC98695 DOI: 10.1128/iai.69.9.5777-5785.2001] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The environmental signals that affect gene regulation in Mycobacterium tuberculosis remain largely unknown despite their importance to tuberculosis pathogenesis. Other work has shown that several promoters, including acr (also known as hspX) (alpha-crystallin homolog), are upregulated in shallow standing cultures compared with constantly shaking cultures. Each of these promoters is also induced to a similar extent within macrophages. The present study used two-dimensional gel electrophoresis and mass spectrometry to further characterize differences in mycobacterial protein expression during growth under standing and shaking culture conditions. Metabolic labeling of M. bovis BCG showed that at least 45 proteins were differentially expressed under standing and shaking culture conditions. Rv2623, CysA2-CysA3, Gap, and Acr were identified from each of four spots or gel bands that were specifically increased in bacteria from standing cultures. An additional standing-induced spot contained two comigrating proteins, GlcB and KatG. The greatest induction was observed with Rv2623, a 32-kDa protein of unknown function that was strongly expressed under standing conditions and absent in shaking cultures. Analysis using PROBE, a multiple sequence alignment and database mining tool, classified M. tuberculosis Rv2623 as a member of a novel class of ATP-binding proteins that may be involved in M. tuberculosis's response to environmental signals. These studies demonstrate the power of combined proteomic and computational approaches and demonstrate that subtle differences in bacterial culture conditions may have important implications for the study of gene expression in mycobacteria.
Collapse
Affiliation(s)
- M A Florczyk
- Department of Biomedical Sciences, University at Albany, Albany, New York 12222, USA
| | | | | | | | | |
Collapse
|
96
|
Daiyasu H, Osaka K, Ishino Y, Toh H. Expansion of the zinc metallo-hydrolase family of the beta-lactamase fold. FEBS Lett 2001; 503:1-6. [PMID: 11513844 DOI: 10.1016/s0014-5793(01)02686-2] [Citation(s) in RCA: 258] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Recently, the zinc metallo-hydrolase family of the beta-lactamase fold has grown quite rapidly, accompanied by the accumulation of sequence and structure data. The variety of the biological functions of the family is higher than expected. In addition, the members often have mosaic structures with additional domains. The family includes class B beta-lactamase, glyoxalase II, arylsulfatase, flavoprotein, cyclase/dehydrase, an mRNA 3'-processing protein, a DNA cross-link repair enzyme, a DNA uptake-related protein, an alkylphosphonate uptake-related protein, CMP-N-acetylneuraminate hydroxylase, the romA gene product, alkylsulfatase, and insecticide hydrolases. In this minireview, the functional and structural varieties of the growing protein family are described.
Collapse
Affiliation(s)
- H Daiyasu
- Department of Bioinformatics, Biomolecular Engineering Research Institute, Osaka, Japan
| | | | | | | |
Collapse
|
97
|
Grigoriev IV, Zhang C, Kim SH. Sequence-based detection of distantly related proteins with the same fold. PROTEIN ENGINEERING 2001; 14:455-8. [PMID: 11522917 DOI: 10.1093/protein/14.7.455] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Affiliation(s)
- I V Grigoriev
- Department of Chemistry, University of California, Berkeley, CA 94720, USA
| | | | | |
Collapse
|
98
|
Lewis JA, Hatfull GF. Control of directionality in integrase-mediated recombination: examination of recombination directionality factors (RDFs) including Xis and Cox proteins. Nucleic Acids Res 2001; 29:2205-16. [PMID: 11376138 PMCID: PMC55702 DOI: 10.1093/nar/29.11.2205] [Citation(s) in RCA: 147] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2001] [Revised: 03/28/2001] [Accepted: 04/11/2001] [Indexed: 11/12/2022] Open
Abstract
Similarity between the DNA substrates and products of integrase-mediated site-specific recombination reactions results in a single recombinase enzyme being able to catalyze both the integration and excision reactions. The control of directionality in these reactions is achieved through a class of small accessory factors that favor one reaction while interfering with the other. These proteins, which we will refer to collectively as recombination directionality factors (RDFs), play architectural roles in reactions catalyzed by their cognate recombinases and have been identified in conjunction with both tyrosine and serine integrases. Previously identified RDFs are typically small, basic and have diverse amino acid sequences. A subset of RDFs, the cox genes, also function as transcriptional regulators. We present here a compilation of all the known RDF proteins as well as those identified through database mining that we predict to be involved in conferring recombination directionality. Analysis of this group of proteins shows that they can be grouped into distinct sub-groups based on their sequence similarities and that they are likely to have arisen from several independent evolutionary lineages. This compilation will prove useful in recognizing new proteins that confer directionality upon site-specific recombination reactions encoded by plasmids, transposons, phages and prophages.
Collapse
Affiliation(s)
- J A Lewis
- Pittsburgh Bacteriophage Institute and Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | |
Collapse
|
99
|
Lecompte O, Thompson JD, Plewniak F, Thierry J, Poch O. Multiple alignment of complete sequences (MACS) in the post-genomic era. Gene 2001; 270:17-30. [PMID: 11403999 DOI: 10.1016/s0378-1119(01)00461-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Multiple alignment, since its introduction in the early seventies, has become a cornerstone of modern molecular biology. It has traditionally been used to deduce structure / function by homology, to detect conserved motifs and in phylogenetic studies. There has recently been some renewed interest in the development of multiple alignment techniques, with current opinion moving away from a single all-encompassing algorithm to iterative and / or co-operative strategies. The exploitation of multiple alignments in genome annotation projects represents a qualitative leap in the functional analysis process, opening the way to the study of the co-evolution of validated sets of proteins and to reliable phylogenomic analysis. However, the alignment of the highly complex proteins detected by today's advanced database search methods is a daunting task. In addition, with the explosion of the sequence databases and with the establishment of numerous specialized biological databases, multiple alignment programs must evolve if they are to successfully rise to the new challenges of the post-genomic era. The way forward is clearly an integrated system bringing together sequence data, knowledge-based systems and prediction methods with their inherent unreliability. The incorporation of such heterogeneous, often non-consistent, data will require major changes to the fundamental alignment algorithms used to date. Such an integrated multiple alignment system will provide an ideal workbench for the validation, propagation and presentation of this information in a format that is concise, clear and intuitive.
Collapse
Affiliation(s)
- O Lecompte
- Laboratoire de Biologie et Génomique Structurales, Institut de Génétique et de Biologie Moléculaire et Cellulaire (CNRS/INSERM/ULP), BP 163, 67404 Cedex, Illkirch, France
| | | | | | | | | |
Collapse
|
100
|
Kwak J, McCue LA, Trczianka K, Kendrick KE. Identification and characterization of a developmentally regulated protein, EshA, required for sporogenic hyphal branches in Streptomyces griseus. J Bacteriol 2001; 183:3004-15. [PMID: 11325927 PMCID: PMC95199 DOI: 10.1128/jb.183.10.3004-3015.2001] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
To identify sporulation-specific proteins that might serve as targets of developmental regulatory factors in Streptomyces, we examined total proteins of Streptomyces griseus by two-dimensional gel electrophoresis. Among five proteins that were present at high levels during sporulation but absent from vegetative cells, two of the proteins, P3 and P4, were absent from developmental mutants that undergo aberrant morphogenesis. The deduced amino acid sequence of the gene that encodes P3 (EshA) showed extensive similarity to proteins from mycobacteria and a cyanobacterium, Synechococcus, that are abundant during nutritional stress but whose functions are unknown. Uniquely among these proteins, EshA contains a cyclic nucleotide-binding domain, suggesting that the activity of EshA may be modulated by a cyclic nucleotide. The eshA gene was strongly expressed from a single transcription start site only during sporulation, and accumulation of the eshA transcript depended on a developmental gene, bldA. During submerged sporulation, a null mutant strain that produced no EshA could not extend sporogenic hyphae from new branch points but instead accelerated septation and spore maturation at the preexisting vegetative filaments. These results indicated that EshA is required for the growth of sporogenic hyphae and localization of septation and spore maturation but not for spore viability.
Collapse
Affiliation(s)
- J Kwak
- Department of Microbiology, Ohio State University, Columbus, OH 43210, USA.
| | | | | | | |
Collapse
|