151
|
Xue H, Hang J, Chu R, Xiao Y, Li H, Lee P, Zheng H. Delineation of a membrane-proximal beta-rich domain in the GABAA receptor by progressive deletions. J Mol Biol 1999; 285:55-61. [PMID: 9878387 DOI: 10.1006/jmbi.1998.2317] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The type A gamma-aminobutyric acid (GABAA) receptor plays a major inhibitory role in the central nervous system. Structural elucidation of the GABAA receptor has been impeded by the large size of the receptor. We present here the delineation of a minimal structural domain as the first step of dissecting the receptor structure. This was achieved through prediction-assisted progressive deletions: the prediction of a candidate structural domain rich in beta-strands with no close similarity to known structures was tested by deleting putative secondary structure elements from the ends of the proposed domain, as well as mutations within the terminal secondary structures. Such progressive deletions revealed the limits of an integral domain, spanning Cys180 to Met293 (numbering of human alpha1 subunit). Below these limits the intact domain structure, as indicated by its circular dichroism, collapses. Based on its putative position, this domain is provisionally designated the membrane-proximal beta-rich domain of GABAA receptor. The inclusion of sequences from the first two out of four previously suggested transmembrane segments and one of the two conserved Cys residues in this domain defines important constraints to the receptor structure.
Collapse
Affiliation(s)
- H Xue
- Department of Biochemistry, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong.
| | | | | | | | | | | | | |
Collapse
|
152
|
|
153
|
Aravind L, Koonin EV. The HD domain defines a new superfamily of metal-dependent phosphohydrolases. Trends Biochem Sci 1998; 23:469-72. [PMID: 9868367 DOI: 10.1016/s0968-0004(98)01293-6] [Citation(s) in RCA: 362] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- L Aravind
- NCBI, National Library of Medicine, NIH, Bethesda, MD 20894, USA.
| | | |
Collapse
|
154
|
Pons T, Olmea O, Chinea G, Beldarraín A, Márquez G, Acosta N, Rodríguez L, Valencia A. Structural model for family 32 of glycosyl-hydrolase enzymes. Proteins 1998; 33:383-95. [PMID: 9829697 DOI: 10.1002/(sici)1097-0134(19981115)33:3<383::aid-prot7>3.0.co;2-r] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A structural model is presented for family 32 of the glycosyl-hydrolase enzymes based on the beta-propeller fold. The model is derived from the common prediction of two different threading methods, TOPITS and THREADER. In addition, we used a correlated mutation analysis and prediction of active-site residues to corroborate the proposed model. Physical techniques (circular dichroism and differential scanning calorimetry) confirmed two aspects of the prediction, the proposed all-beta fold and the multi-domain structure. The most reliable three-dimensional model was obtained using the structure of neuraminidase (1nscA) as template. The analysis of the position of the active site residues in this model is compatible with the catalytic mechanism proposed by Reddy and Maley (J. Biol. Chem. 271:13953-13958, 1996), which includes three conserved residues, Asp, Glu, and Cys. Based on this analysis, we propose the participation of one more conserved residue (Asp 162) in the catalytic mechanism. The model will facilitate further studies of the physical and biochemical characteristics of family 32 of the glycosyl-hydrolases.
Collapse
Affiliation(s)
- T Pons
- Centro de Ingeniería Genética y Biotecnología, Havana, Cuba.
| | | | | | | | | | | | | | | |
Collapse
|
155
|
Mahata M, Mahata SK, Parmer RJ, O'Connor DT. Proadrenomedullin N-terminal 20 peptide: minimal active region to regulate nicotinic receptors. Hypertension 1998; 32:907-16. [PMID: 9822452 DOI: 10.1161/01.hyp.32.5.907] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Proadrenomedullin N-terminal 20 peptide (PAMP-[1-20]; ARLDVASEFRKKWNKWALSR-amide) is a potent hypotensive and catecholamine release-inhibitory peptide released from chromaffin cells. We studied the mechanism of PAMP action and how its function is linked to structure. We tested human PAMP-[1-20] on catecholamine secretion in PC12 pheochromocytoma cells and found it to be a potent, dose-dependent (IC50 approximately 350 nmol/L) secretory inhibitor. Inhibition was specific for nicotinic cholinergic stimulation since PAMP-[1-20] failed to inhibit release by agents that bypass the nicotinic receptor. Nicotinic cationic (22Na+,45Ca2+) signal transduction was disrupted by this peptide, and potencies for inhibition of 22Na+ uptake and catecholamine secretion were comparable. Even high-dose nicotine failed to overcome the inhibition, suggesting noncompetitive nicotinic antagonism. N- and C-terminal PAMP truncation peptides indicated a role for the C-terminal amide and refined the minimal active region to the C-terminal 8 amino acids (WNKWALSR-amide), a region likely to be alpha-helical. PAMP also blocked (EC50 approximately 270 nmol/L) nicotinic cholinergic agonist desensitization of catecholamine release, as well as desensitization of nicotinic signal transduction (22Na+ uptake). Thus, PAMP may exert both inhibitory and facilitatory effects on nicotinic signaling, depending on the prior state of nicotinic stimulation. PAMP may therefore contribute to a novel, autocrine, homeostatic (negative-feedback) mechanism controlling catecholamine release.
Collapse
Affiliation(s)
- M Mahata
- Department of Medicine and Center for Molecular Genetics, University of California at San Diego, and San Diego Veterans Affairs Healthcare System, San Diego, CA, USA
| | | | | | | |
Collapse
|
156
|
Mirny LA, Shakhnovich EI. Protein structure prediction by threading. Why it works and why it does not. J Mol Biol 1998; 283:507-26. [PMID: 9769221 DOI: 10.1006/jmbi.1998.2092] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We developed a novel Monte Carlo threading algorithm which allows gaps and insertions both in the template structure and threaded sequence. The algorithm is able to find the optimal sequence-structure alignment and sample suboptimal alignments. Using our algorithm we performed sequence-structure alignments for a number of examples for three protein folds (ubiquitin, immunoglobulin and globin) using both "ideal" set of potentials (optimized to provide the best Z-score for a given protein) and more realistic knowledge-based potentials. Two physically different scenarios emerged. If a template structure is similar to the native one (within 2 A RMS), then (i) the optimal threading alignment is correct and robust with respect to deviations of the potential from the "ideal" one; (ii) suboptimal alignments are very similar to the optimal one; (iii) as Monte Carlo temperature decreases a sharp cooperative transition to the optimal alignment is observed. In contrast, if the template structure is only moderately close to the native structure (RMS greater than 3.5 A), then (i) the optimal alignment changes dramatically when an "ideal" potential is substituted by the real one; (ii) the structures of suboptimal alignments are very different from the optimal one, reducing the reliability of the alignment; (iii) the transition to the apparently optimal alignment is non-cooperative. In the intermediate cases when the RMS between the template and the native conformations is in the range between 2 A and 3.5 A, the success of threading alignment may depend on the quality of potentials used. These results are rationalized in terms of a threading free energy landscape. Possible ways to overcome the fundamental limitations of threading are discussed briefly.
Collapse
Affiliation(s)
- L A Mirny
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, 02138, USA
| | | |
Collapse
|
157
|
Ribas de Pouplana L, Buechter D, Sardesai NY, Schimmel P. Functional analysis of peptide motif for RNA microhelix binding suggests new family of RNA-binding domains. EMBO J 1998; 17:5449-57. [PMID: 9736622 PMCID: PMC1170870 DOI: 10.1093/emboj/17.18.5449] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
RNA microhelices that recreate the acceptor stems of transfer RNAs are charged with specific amino acids. Here we identify a two-helix pair in alanyl-tRNA synthetase that is required for RNA microhelix binding. A single point mutation at an absolutely conserved residue in this motif selectively disrupts RNA binding without perturbation of the catalytic site. These results, and findings of similar motifs in the proximity of the active sites of other tRNA synthetases, suggest that two-helix pairs are widespread and provide a structural framework important for contacts with bound RNA substrates.
Collapse
Affiliation(s)
- L Ribas de Pouplana
- The Skaggs Institute for Chemical Biology and Departments of Molecular Biology and Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | | | |
Collapse
|
158
|
Aravind L, Leipe DD, Koonin EV. Toprim--a conserved catalytic domain in type IA and II topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins. Nucleic Acids Res 1998; 26:4205-13. [PMID: 9722641 PMCID: PMC147817 DOI: 10.1093/nar/26.18.4205] [Citation(s) in RCA: 300] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Iterative profile searches and structural modeling show that bacterial DnaG-type primases, small primase-like proteins from bacteria and archaea, type IA and type II topoisomerases, bacterial and archaeal nucleases of the OLD family and bacterial DNA repair proteins of the RecR/M family contain a common domain, designated Toprim (topoisomerase-primase) domain. The domain consists of approximately 100 amino acids and has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). Examination of the structure of Topo IA and Topo II and modeling of the Toprim domains of the primases reveal a compact beta/alpha fold, with the conserved negatively charged residues juxtaposed, and inserts seen in Topo IA and Topo II. The conserved glutamate may act as a general base in nucleotide polymerization by primases and in strand rejoining by topoisomerases and as a general acid in strand cleavage by topoisomerases and nucleases. The role of this glutamate in catalysis is supported by site-directed mutagenesis data on primases and Topo IA. The DxD motif may coordinate Mg2+that is required for the activity of all Toprim-containing enzymes. The common ancestor of all life forms could encode a prototype Toprim enzyme that might have had both nucleotidyl transferase and polynucleotide cleaving activity.
Collapse
Affiliation(s)
- L Aravind
- Department of Biology, Texas A&M University, College Station, TX 70843, USA, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
159
|
Pascarella S, Angelaccio S, Contestabile R, Delle Fratte S, Di Salvo M, Bossa F. The structure of serine hydroxymethyltransferase as modeled by homology and validated by site-directed mutagenesis. Protein Sci 1998; 7:1976-82. [PMID: 9761478 PMCID: PMC2144154 DOI: 10.1002/pro.5560070913] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
We describe a model for the three-dimensional structure of E. coli serine hydroxymethyltransferase based on its sequence homology with other PLP enzymes of the alpha-family and whose tertiary structures are known. The model suggests that certain amino acid residues at the putative active site of the enzyme can adopt specific roles in the catalytic mechanism. These proposals were supported by analysis of the properties of a number of site-directed mutants. New active site features are also proposed for further experimental testing.
Collapse
Affiliation(s)
- S Pascarella
- Dipartimento di Scienze Biochimiche A. Rossi Fanelli and Centro di Biologia Molecolare del CNR, Università La Sapienza, Roma, Italy
| | | | | | | | | | | |
Collapse
|
160
|
Padilla-Zúñiga AJ, Rojo-Domínguez A. Non-homology knowledge-based prediction of the papain prosegment folding pattern: a description of plausible folding and activation mechanisms. FOLDING & DESIGN 1998; 3:271-84. [PMID: 9710573 DOI: 10.1016/s1359-0278(98)00038-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
BACKGROUND A detailed knowledge of three-dimensional conformations is necessary in order to understand the close relationship between protein structure and function. Among current methodologies, homology modeling is an important tool for obtaining reliable geometries and it provides a direct alternative to X-ray or NMR techniques. In contrast, predictive methods with no three-dimensional template (non-homology) still require further validation and systematization. RESULTS Here, we present a non-homology knowledge-based strategy for the structural prediction of the proregion of a cysteine proteinase zymogen. This method analyzes individual sequences and multiple alignments of homologous sequences, making use of different published algorithms and incorporating all available structure-related information to obtain improved predictions. Our strategy yielded acceptable secondary structure and general three-dimensional assignments when compared with crystallographic data from homologous proteins. CONCLUSIONS We discuss our successes and failures as a contribution to non-homology prediction development. In addition, based on the information analyzed and generated in this work, we propose plausible folding and activation mechanisms for thiol-proteinase precursors that attempt to shed light on the molecular basis of prosegment functions.
Collapse
Affiliation(s)
- A J Padilla-Zúñiga
- Departamento de Química, Universidad Autónoma Metropolitana-Iztapalapa, México, D.F., México.
| | | |
Collapse
|
161
|
Aravind L, Koonin EV. Phosphoesterase domains associated with DNA polymerases of diverse origins. Nucleic Acids Res 1998; 26:3746-52. [PMID: 9685491 PMCID: PMC147763 DOI: 10.1093/nar/26.16.3746] [Citation(s) in RCA: 203] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Computer analysis of DNA polymerase protein sequences revealed previously unidentified conserved domains that belong to two distinct superfamilies of phosphoesterases. The alpha subunits of bacterial DNA polymerase III and two distinct family X DNA polymerases are shown to contain an N-terminal domain that defines a novel enzymatic superfamily, designated PHP, after polymerase and histidinol phosphatase. The predicted catalytic site of the PHP superfamily consists of four motifs containing conserved histidine residues that are likely to be involved in metal-dependent catalysis of phosphoester bond hydrolysis. The PHP domain is highly conserved in all bacterial polymerase III alpha subunits, but in proteobacteria and mycoplasmas, the conserved motifs are distorted, suggesting a loss of the enzymatic activity. Another conserved domain, found in the small subunits of archaeal DNA polymerase II and eukaryotic DNA polymerases alpha and delta, is shown to belong to the superfamily of calcineurin-like phospho-esterases, which unites a variety of phosphatases and nucleases. The conserved motifs required for phospho-esterase activity are intact in the archaeal DNA polymerase subunits, but are disrupted in their eukaryotic orthologs. A hypothesis is proposed that bacterial and archaeal replicative DNA polymerases possess intrinsic phosphatase activity that hydrolyzes the pyrophosphate released during nucleotide polymerization. As proposed previously, pyrophosphate hydrolysis may be necessary to drive the polymerization reaction forward. The phosphoesterase domains with disrupted catalytic motifs may assume an allosteric, regulatory function and/or bind other subunits of DNA polymerase holoenzymes. In these cases, the pyrophosphate may be hydrolyzed by a stand-alone phosphatase, and candidates for such a role were identified among bacterial PHP superfamily members.
Collapse
Affiliation(s)
- L Aravind
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | | |
Collapse
|
162
|
Aravind L, Koonin EV. The HORMA domain: a common structural denominator in mitotic checkpoints, chromosome synapsis and DNA repair. Trends Biochem Sci 1998; 23:284-6. [PMID: 9757827 DOI: 10.1016/s0968-0004(98)01257-2] [Citation(s) in RCA: 158] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- L Aravind
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA.
| | | |
Collapse
|
163
|
Abstract
Genome sequencing projects continue to provide a flood of new protein sequences, and prediction methods remain an important means of adding structural information. Recently, there have been advances in secondary structure prediction, which feed, in turn, into improved fold recognition algorithms. Finally, there have been technical improvements in comparative modelling, and studies of the expected accuracy of three-dimensional structural models built by this method.
Collapse
Affiliation(s)
- D R Westhead
- The European Bioinformatics Institute EMBL Outstation Wellcome Trust Genome Campus Hinxton, Cambridge, CB10 1SD, UK.
| | | |
Collapse
|
164
|
Aravind L, Koonin EV. A colipase fold in the carboxy-terminal domain of the Wnt antagonists--the Dickkopfs. Curr Biol 1998; 8:R477-8. [PMID: 9663378 DOI: 10.1016/s0960-9822(98)70309-4] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
165
|
Abstract
Computational biology exploits the evolutionary connectivity between proteins and protein families to predict structural and functional properties of uncharacterized gene products. In the past year, conceptual and statistical refinements have substantially improved algorithms for the detection of remote homologues. In conjunction with the rapid growth of biological databases, the global organization of proteins into sequence families, functional families and structural families has become both pertinent and feasible.
Collapse
Affiliation(s)
- L Holm
- European Molecular Biology Laboratory-European Bio-informatics Institute, Cambridge, UK
| |
Collapse
|
166
|
de las Alas MM, de Bruin RA, Ten Eyck L, Los G, Howell SB. Prediction-based threading of the hMSH2 DNA mismatch repair protein. FASEB J 1998; 12:653-63. [PMID: 9619444 DOI: 10.1096/fasebj.12.9.653] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Mutations in the genes whose products participate in DNA mismatch repair underlie the increased risk of cancer in families with hereditary nonpolyposis colon carcinoma. Mutations in hMSH2 account for approximately 50% of the mutations found in these families. We sought to predict the 3-dimensional structure of hMSH2 by identifying structural homologues using prediction-based threading and by computer modeling using information from these putative structurally related proteins. Prediction-based threading identified three candidate structural homologues: glycogen phosphorylase (gpb), a 70 kDa soluble lytic transglycosylase, and ribonucleotide reductase protein R1. An independent approach utilizing a potential-based threading program also identified gpb as a structural homologue. The models based on the structures of these proteins suggest that the ATP binding domain and helix-turn-helix domain are exposed on the outside of the protein. All known bacterial MutS and hMSH2 mutations appear to be clustered in similar vicinities in the theoretical models of hMSH2; the major site is within the ATP binding domain and near the carboxyl-terminal end, whereas a smaller number map to the region coding for exon 5 and the amino-terminal domain. All point mutations also appear to affect amino acids that are exposed on the outside surface of the protein.
Collapse
Affiliation(s)
- M M de las Alas
- Department of Medicine and the Cancer Center, University of California, San Diego, La Jolla 92093-0058, USA
| | | | | | | | | |
Collapse
|
167
|
Abstract
There has been a dramatic increase in the number of completely sequenced bacterial genomes during the past two years as a result of the efforts both of public genome agencies and the pharmaceutical industry. The availability of completely sequenced genomes permits more systematic analyses of genes, evolution and genome function than was otherwise possible. Using computational methods - which are used to identify genes and their functions including statistics, sequence similarity, motifs, profiles, protein folds and probabilistic models - it is possible to develop characteristic genome signatures, assign functions to genes, identify pathogenic genes, identify metabolic pathways, develop diagnostic probes and discover potential drug-binding sites. All of these directions are critical to understanding bacterial growth, pathogenicity and host-pathogen interactions.
Collapse
Affiliation(s)
- D L Brutlag
- Department of Biochemistry, Beckman Center, B400 Stanford University, Stanford, California 94305-5307, USA.
| |
Collapse
|
168
|
|
169
|
Hrmova M, MacGregor EA, Biely P, Stewart RJ, Fincher GB. Substrate binding and catalytic mechanism of a barley beta-D-Glucosidase/(1,4)-beta-D-glucan exohydrolase. J Biol Chem 1998; 273:11134-43. [PMID: 9556600 DOI: 10.1074/jbc.273.18.11134] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
A beta-glucosidase, designated isoenzyme betaII, from germinated barley (Hordeum vulgare L.) hydrolyzes aryl-beta-glucosides and shares a high level of amino acid sequence similarity with beta-glucosidases of diverse origin. It releases glucose from the non-reducing termini of cellodextrins with catalytic efficiency factors, kcat/Km, that increase approximately 9-fold as the degree of polymerization of these substrates increases from 2 to 6. Thus, the enzyme has a specificity and action pattern characteristic of both beta-glucosidases (EC 3.2.1.21) and the polysaccharide exohydrolase, (1,4)-beta-glucan glucohydrolase (EC 3.2.1.74). At high concentrations (100 mM) of 4-nitrophenyl beta-glucoside, beta-glucosidase isoenzyme betaII catalyzes glycosyl transfer reactions, which generate 4-nitrophenyl-beta-laminaribioside, -cellobioside, and -gentiobioside. Subsite mapping with cellooligosaccharides indicates that the barley beta-glucosidase isoenzyme betaII has six substrate-binding subsites, each of which binds an individual beta-glucosyl residue. Amino acid residues Glu181 and Glu391 are identified as the probable catalytic acid and catalytic nucleophile, respectively. The enzyme is a family 1 glycoside hydrolase that is likely to adopt a (beta/alpha)8 barrel fold and in which the catalytic amino acid residues appear to be located at the bottom of a funnel-shaped pocket in the enzyme.
Collapse
Affiliation(s)
- M Hrmova
- Department of Plant Science, University of Adelaide, Waite Campus, Glen Osmond, South Australia 5064, Australia
| | | | | | | | | |
Collapse
|
170
|
Aravind L, Galperin MY, Koonin EV. The catalytic domain of the P-type ATPase has the haloacid dehalogenase fold. Trends Biochem Sci 1998; 23:127-9. [PMID: 9584613 DOI: 10.1016/s0968-0004(98)01189-x] [Citation(s) in RCA: 196] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
171
|
Abstract
The exponential growth of sequence data does not necessarily lead to an increase in knowledge about the functions of genes and their products. Prediction of function using comparative sequence analysis is extremely powerful but, if not performed appropriately, may also lead to the creation and propagation of assignment errors. While current homology detection methods can cope with the data flow, the identification, verification and annotation of functional features need to be drastically improved.
Collapse
Affiliation(s)
- P Bork
- EMBL, Heidelberg, Germany.
| | | |
Collapse
|
172
|
Ortiz AR, Kolinski A, Skolnick J. Fold assembly of small proteins using monte carlo simulations driven by restraints derived from multiple sequence alignments. J Mol Biol 1998; 277:419-48. [PMID: 9514747 DOI: 10.1006/jmbi.1997.1595] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The feasibility of predicting the global fold of small proteins by incorporating predicted secondary and tertiary restraints into ab initio folding simulations has been demonstrated on a test set comprised of 20 non-homologous proteins, of which one was a blind prediction of target 42 in the recent CASP2 contest. These proteins contain from 37 to 100 residues and represent all secondary structural classes and a representative variety of global topologies. Secondary structure restraints are provided by the PHD secondary structure prediction algorithm that incorporates multiple sequence information. Predicted tertiary restraints are derived from multiple sequence alignments via a two-step process. First, seed side-chain contacts are identified from correlated mutation analysis, and then a threading-based algorithm is used to expand the number of these seed contacts. A lattice-based reduced protein model and a folding algorithm designed to incorporate these predicted restraints is described. Depending upon fold complexity, it is possible to assemble native-like topologies whose coordinate root-mean-square deviation from native is between 3.0 A and 6.5 A. The requisite level of accuracy in side-chain contact map prediction can be roughly 25% on average, provided that about 60% of the contact predictions are correct within +/-1 residue and 95% of the predictions are correct within +/-4 residues. Precision in tertiary contact prediction is more critical than absolute accuracy. Furthermore, only a subset of the tertiary contacts, on the order of 25% of the total, is sufficient for successful topology assembly. Overall, this study suggests that the use of restraints derived from multiple sequence alignments combined with a fold assembly algorithm holds considerable promise for the prediction of the global topology of small proteins.
Collapse
Affiliation(s)
- A R Ortiz
- TPC-5, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
173
|
Aurora R, Rose GD. Seeking an ancient enzyme in Methanococcus jannaschii using ORF, a program based on predicted secondary structure comparisons. Proc Natl Acad Sci U S A 1998; 95:2818-23. [PMID: 9501173 PMCID: PMC19652 DOI: 10.1073/pnas.95.6.2818] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
We have developed a simple procedure to identify protein homologs in genomic databases. The program, called ORF, is based on comparisons of predicted secondary structure. Protein structure is far better conserved than amino acid sequence, and structure-based methods have been effective in exploiting this fact to find homologs, even among proteins with scant sequence identity. ORF is a secondary structure-based method that operates solely on predictions from sequence and requires no experimentally determined information about the structure. The approach is illustrated by an example: Thymidylate synthase, a highly conserved enzyme essential to thymidine biosynthesis in both prokaryotes and eukaryotes, is thought to be used by Archaea, but a corresponding gene has yet to be identified. Here, a candidate thymidylate synthase is identified as a previously unassigned open reading frame from the genome of Methanococcus jannaschii, viz., MJ0757. Using primary structure information alone, the optimally aligned sequence identity between MJ0757 and Escherichia coli thymidylate synthase is 7%, well below the threshold of sensitivity for detection by sequence-based methods.
Collapse
Affiliation(s)
- R Aurora
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins School of Medicine, 725 North Wolfe Street, Baltimore, MD 21205, USA
| | | |
Collapse
|
174
|
Allaire M, Li Y, MacKenzie RE, Cygler M. The 3-D structure of a folate-dependent dehydrogenase/cyclohydrolase bifunctional enzyme at 1.5 A resolution. Structure 1998; 6:173-82. [PMID: 9519408 DOI: 10.1016/s0969-2126(98)00019-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
BACKGROUND The interconversion of two major folate one-carbon donors occurs through the sequential activities of NAD(P)-dependent methylene[H4]folate dehydrogenase (D) and methenyl[H4]folate cyclohydrolase (C). These activities often coexist as part of a multifunctional enzyme and there are several lines of evidence suggesting that their substrates bind at overlapping sites. Little is known, however, about the nature of this site or the identity of the active-site residues for this enzyme family. RESULTS We have determined, to 1.5 A resolution, the structure of a dimer of the D/C domain of the human trifunctional cytosolic enzyme with bound NADP cofactor, using the MAD technique. The D/C subunit is composed of two alpha/beta domains that assemble to form a wide cleft. The cleft walls are lined with highly conserved residues and NADP is bound along one wall. The NADP-binding domain has a Rossmann fold, characterized by a modified diphosphate-binding loop fingerprint-GXSXXXG. Dimerization occurs by antiparallel interaction of two NADP-binding domains. Superposition of the two subunits indicates domain motion occurs about a well-defined hinge region. CONCLUSIONS Analysis of the structure suggests strongly that folate-binding sites for both activities are within the cleft, providing direct support for the proposed overlapping site model. The orientation of the nicotinamide ring suggests that in the dehydrogenase-catalyzed reaction hydride transfer occurs to the pro-R side of the ring. The identity of the cyclohydrolase active site is not obvious. We propose that a conserved motif-Tyr52-X-X-X-Lys56- and/or a Ser49-Gln100-Pro102 triplet have a role in this activity.
Collapse
Affiliation(s)
- M Allaire
- Biotechnology Research Institute, Montréal, Québec, Canada
| | | | | | | |
Collapse
|
175
|
Aravind L, Koonin EV. A novel family of predicted phosphoesterases includes Drosophila prune protein and bacterial RecJ exonuclease. Trends Biochem Sci 1998; 23:17-9. [PMID: 9478130 DOI: 10.1016/s0968-0004(97)01162-6] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- L Aravind
- Department of Biology, Texas A&M University, College Station 77843, USA
| | | |
Collapse
|
176
|
Turcotte M, Muggleton SH, Sternberg MJE. Application of inductive logic programming to discover rules governing the three-dimensional topology of protein structure. INDUCTIVE LOGIC PROGRAMMING 1998. [DOI: 10.1007/bfb0027310] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
|
177
|
|