51
|
Yu GW, Allen MD, Andreeva A, Fersht AR, Bycroft M. Solution structure of the C4 zinc finger domain of HDM2. Protein Sci 2005; 15:384-9. [PMID: 16385008 PMCID: PMC2242465 DOI: 10.1110/ps.051927306] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
HDM2 is a ubiquitin E3 ligase that is a key negative regulator of the tumor suppressor p53. Here, we report the determination of the solution structure of the C4 zinc finger domain of HDM2 using multidimensional NMR. The HDM2 C4 zinc finger domain has a fold consisting of a 3(10) helix followed by four beta-strands, which shares significant structural similarity to the zinc ribbon protein family. Family based sequence analysis identified two putative binding sites, one of which resembles an RNA binding motif.
Collapse
Affiliation(s)
- Grace W Yu
- Centre for Protein Engineering, Medical Research Council, Hills Road, CB2 2QH, Cambridge, United Kingdom
| | | | | | | | | |
Collapse
|
52
|
Abstract
YAKUSA is a program designed for rapid scanning of a structural database with a query protein structure. It searches for the longest common substructures called SHSPs (structural high-scoring pairs) existing between a query structure and every structure in the structural database. It makes use of protein backbone internal coordinates (alpha angles) in order to describe protein structures as sequences of symbols. The structural similarities are established in 5 steps, the first 3 being analogous to those used in BLAST: (1) building up a deterministic finite automaton describing all patterns identical or similar to those in the query structure; (2) searching for all these patterns in every structure in the database; (3) extending the patterns to longer matching substructures (i.e., SHSPs); (4) selecting compatible SHSPs for each query-database structure pair; and (5) ranking the query-database structure pairs using 3 scores based on SHSP similarity, on SHSP probabilities, and on spatial compatibility of SHSPs. Structural fragment probabilities are estimated according to a mixture transition distribution model, which is an approximation of a high-order Markov chain model. With regard to sensitivity and selectivity of the structural matches, YAKUSA compares well to the best related programs, although it is by far faster: A typical database scan takes about 40 s CPU time on a desktop personal computer. It has also been implemented on a Web server for real-time searches.
Collapse
|
53
|
Neiditch MB, Federle MJ, Miller ST, Bassler BL, Hughson FM. Regulation of LuxPQ receptor activity by the quorum-sensing signal autoinducer-2. Mol Cell 2005; 18:507-18. [PMID: 15916958 DOI: 10.1016/j.molcel.2005.04.020] [Citation(s) in RCA: 161] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2005] [Revised: 04/17/2005] [Accepted: 04/27/2005] [Indexed: 11/28/2022]
Abstract
The extracellular signaling molecule autoinducer-2 (AI-2) mediates quorum-sensing communication in diverse bacterial species. In marine vibrios, binding of AI-2 to the periplasmic receptor LuxP modulates the activity of the inner membrane sensor kinase LuxQ, transducing the AI-2 information into the cytoplasm. Here, we show that Vibrio harveyi LuxP associates with LuxQ in both the presence and absence of AI-2. The 1.9 A X-ray crystal structure of apoLuxP, complexed with the periplasmic domain of LuxQ, reveals that the latter contains two tandem Per/ARNT/Simple-minded (PAS) folds. Thus, although many prokaryotic PAS folds themselves bind ligands, the LuxQ periplasmic PAS folds instead bind LuxP, monitoring its AI-2 occupancy. Mutations that disrupt the apoLuxP:LuxQ interface sensitize V. harveyi to AI-2, implying that AI-2 binding causes the replacement of one set of LuxP:LuxQ contacts with another. These conformational changes switch LuxQ between two opposing enzymatic activities, each of which conveys information to the cytoplasm about the cell density of the surrounding environment.
Collapse
Affiliation(s)
- Matthew B Neiditch
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| | | | | | | | | |
Collapse
|
54
|
Shi HJ, Wu AZ, Santos M, Feng ZM, Huang L, Chen YM, Zhu K, Chen CLC. Cloning and characterization of rat spermatid protein SSP411: a thioredoxin-like protein. ACTA ACUST UNITED AC 2005; 25:479-93. [PMID: 15223837 DOI: 10.1002/j.1939-4640.2004.tb02819.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
In an attempt to identify new sperm-specific genes that are involved in sperm maturation, fertilization, and embryo development, such as the mammalian ortholog of the sperm-supplied protein gene, spe-11, in Caenorhabditis elegans, we cloned and characterized a new spermatid-specific protein gene, ssp411, from adult rat testes. The ssp411 cDNA shared >85% sequence identity with an unnamed human protein, FLJ21347, and an uncharacterized mouse testicular protein called transcript increased in spermiogenesis 78 (TISP78). A 2.8-kb ssp411 mRNA was expressed in a testis-specific and age-dependent manner; the mRNA was evident at 28 days and remained at high levels throughout adulthood. An SSP411 protein of molecular weight 88 000 was detected in testicular extracts by Western blot analysis. Ssp411 mRNA and SSP411 protein, as analyzed by in situ hybridization and immunohistochemistry, were both expressed in a stage-dependent fashion during the cycle of the seminiferous epithelium. The ssp411 mRNA was predominantly localized to round and elongated spermatids, with maximal expression at stages VII-XII. The SSP411 protein was mainly observed in elongated spermatids and reached its highest levels during stages V-VI. A conserved thioredoxin-like domain was detected in the N-terminal region of SSP411 and its orthologs. An analysis of the predicted 3-dimensional structural modeling and folding pattern further suggested that SSP411 is identifiable as a member of thioredoxin family. In summary, we have identified a new rat spermatid protein gene, ssp411, and its orthologs in human and mouse and demonstrated that SSP411 might belong to a testis-specific thioredoxin family. This suggests that SSP411 may play a role in sperm maturation, fertilization, and/or embryo development, as has been shown in thioredoxin family.
Collapse
Affiliation(s)
- Hui-Juan Shi
- Center for Biomedical Research, Population Council, Beijing, P.R. China
| | | | | | | | | | | | | | | |
Collapse
|
55
|
Sierk ML, Kleywegt GJ. Déjà vu all over again: finding and analyzing protein structure similarities. Structure 2005; 12:2103-11. [PMID: 15576025 DOI: 10.1016/j.str.2004.09.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2004] [Revised: 09/07/2004] [Accepted: 09/23/2004] [Indexed: 10/26/2022]
Abstract
Structure comparison is a crucial aspect of structural biology today. The field of structure comparison is developing rapidly, with the development of new algorithms, similarity scores, and statistical scores. The predicted large increase of experimental structures and structural models made possible by high-throughput efforts means that structural comparison and searching of structural databases using automated methods will become increasingly common. This Ways & Means article is meant to guide the structural biologist in the basics of structural alignment, and to provide an overview of the available software tools. The main purpose is to encourage users to gain some understanding of the strengths and limitations of structural alignment, and to take these factors into account when interpreting the results of different programs.
Collapse
Affiliation(s)
- Michael L Sierk
- Department of Biochemistry and Molecular Genetics, University of Virginia, P.O. Box 800733, Charlottesville, VA 22908, USA.
| | | |
Collapse
|
56
|
EvDTree: structure-dependent substitution profiles based on decision tree classification of 3D environments. BMC Bioinformatics 2005; 6:4. [PMID: 15638949 PMCID: PMC545998 DOI: 10.1186/1471-2105-6-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2004] [Accepted: 01/10/2005] [Indexed: 12/04/2022] Open
Abstract
Background Structure-dependent substitution matrices increase the accuracy of sequence alignments when the 3D structure of one sequence is known, and are successful e.g. in fold recognition. We propose a new automated method, EvDTree, based on a decision tree algorithm, for automatic derivation of amino acid substitution probabilities from a set of sequence-structure alignments. The main advantage over other approaches is an unbiased automatic selection of the most informative structural descriptors and associated values or thresholds. This feature allows automatic derivation of structure-dependent substitution scores for any specific set of structures, without the need to empirically determine best descriptors and parameters. Results Decision trees for residue substitutions were constructed for each residue type from sequence-structure alignments extracted from the HOMSTRAD database. For each tree cluster, environment-dependent substitution profiles were derived. The resulting structure-dependent substitution scores were assessed using a criterion based on the mean ranking of observed substitution among all possible substitutions and in sequence-structure alignments. The automatically built EvDTree substitution scores provide significantly better results than conventional matrices and similar or slightly better results than other structure-dependent matrices. EvDTree has been applied to small disulfide-rich proteins as a test case to automatically derive specific substitutions scores providing better results than non-specific substitution scores. Analyses of the decision tree classifications provide useful information on the relative importance of different structural descriptors. Conclusions We propose a fully automatic method for the classification of structural environments and inference of structure-dependent substitution profiles. We show that this approach is more accurate than existing methods for various applications. The easy adaptation of EvDTree to any specific data set opens the way for class-specific structure-dependent substitution scores which can be used in threading-based remote homology searches.
Collapse
|
57
|
Abstract
Comparison of two protein structures often results in not only a global alignment but also a number of distinct local alignments; the latter, referred to as alternative alignments, are however usually ignored in existing protein structure comparison analyses. Here, we used a novel method of protein structure comparison to extensively identify and characterize the alternative alignments obtained for structure pairs of a fold classification database. We showed that all alternative alignments can be classified into one of just a few types, and with which illustrated the potential of using alternative alignments to identify recurring protein substructures, including the internal structural repeats of a protein. Furthermore, we showed that among the alternative alignments obtained, permuted alignments, which included both circular and scrambled permutations, are as prevalent as topological alignments. These results demonstrated that the so far largely unattended alternative alignments of protein structures have implications and applications for research of protein classification and evolution.
Collapse
Affiliation(s)
- Edward S C Shih
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
| | | |
Collapse
|
58
|
Kulczyk AW, Yang JC, Neuhaus D. Solution structure and DNA binding of the zinc-finger domain from DNA ligase IIIalpha. J Mol Biol 2004; 341:723-38. [PMID: 15288782 DOI: 10.1016/j.jmb.2004.06.035] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2004] [Revised: 06/09/2004] [Accepted: 06/11/2004] [Indexed: 11/21/2022]
Abstract
DNA ligase IIIalpha carries out the final ligation step in the base excision repair (BER) and single strand break repair (SSBR) mechanisms of DNA repair. The enzyme recognises single-strand nicks and other damage features in double-stranded DNA, both through the catalytic domain and an N-terminal domain containing a single zinc finger. The latter is homologous to other zinc fingers that recognise damaged DNA, two in the N terminus of poly(adenosine-ribose)polymerase and three in the N terminus of the Arabidopsis thaliana nick-sensing DNA 3'-phosphoesterase. Here, we present the solution structure of the zinc-finger domain of human DNA ligase IIIalpha, the first structure of a finger from this group. It is related to that of the erythroid transcription factor GATA-1, but has an additional N-terminal beta-strand and C-terminal alpha-helix. Chemical shift mapping using a DNA ligand containing a single-stranded break showed that the DNA-binding surface of the DNA-ligase IIIalpha zinc finger is substantially different from that of GATA-1, consistent with the fact that the two proteins recognise very different features in the DNA. Likely implications for DNA binding are discussed.
Collapse
|
59
|
Bertrand T, Eady NAJ, Jones JN, Nagy JM, Jamart-Grégoire B, Raven EL, Brown KA. Crystal Structure of Mycobacterium tuberculosis Catalase-Peroxidase. J Biol Chem 2004; 279:38991-9. [PMID: 15231843 DOI: 10.1074/jbc.m402382200] [Citation(s) in RCA: 144] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The Mycobacterium tuberculosis catalase-peroxidase is a multifunctional heme-dependent enzyme that activates the core anti-tuberculosis drug isoniazid. Numerous studies have been undertaken to elucidate the enzyme-dependent mechanism of isoniazid activation, and it is well documented that mutations that reduce activity or inactivate the catalase-peroxidase lead to increased levels of isoniazid resistance in M. tuberculosis. Interpretation of the catalytic activities and the effects of mutations upon the action of the enzyme to date have been limited due to the lack of a three-dimensional structure for this enzyme. In order to provide a more accurate model of the three-dimensional structure of the M. tuberculosis catalase-peroxidase, we have crystallized the enzyme and now report its crystal structure refined to 2.4-A resolution. The structure reveals new information about dimer assembly and provides information about the location of residues that may play a role in catalysis including candidates for protein-based radical formation. Modeling and computational studies suggest that the binding site for isoniazid is located near the delta-meso heme edge rather than in a surface loop structure as currently proposed. The availability of a crystal structure for the M. tuberculosis catalase-peroxidase also permits structural and functional effects of mutations implicated in causing elevated levels of isoniazid resistance in clinical isolates to be interpreted with improved confidence.
Collapse
Affiliation(s)
- Thomas Bertrand
- Department of Biological Sciences, Centre for Molecular Microbiology and Infection, Flowers Building, Imperial College London, London SW7 2AZ, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
60
|
Maiti R, Van Domselaar GH, Zhang H, Wishart DS. SuperPose: a simple server for sophisticated structural superposition. Nucleic Acids Res 2004; 32:W590-4. [PMID: 15215457 PMCID: PMC441615 DOI: 10.1093/nar/gkh477] [Citation(s) in RCA: 475] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The SuperPose web server rapidly and robustly calculates both pairwise and multiple protein structure superpositions using a modified quaternion eigenvalue approach. SuperPose generates sequence alignments, structure alignments, PDB (Protein Data Bank) coordinates and RMSD statistics, as well as difference distance plots and images (both static and interactive) of the superimposed molecules. SuperPose employs a simple interface that requires only PDB files or accession numbers as input. All other superposition decisions are made by the program. SuperPose is uniquely able to superimpose structures that differ substantially in sequence, size or shape. It is also capable of handling a much larger range of superposition queries and situations than many standalone programs and yields results that are intuitively more in agreement with known biological or structural data. The SuperPose web server is freely accessible at http://wishart.biology.ualberta.ca/SuperPose/.
Collapse
Affiliation(s)
- Rajarshi Maiti
- Departments of Biological Sciences and Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, Canada
| | | | | | | |
Collapse
|
61
|
Miller WL, Wenzel CQ, Daniels C, Larocque S, Brisson JR, Lam JS. Biochemical characterization of WbpA, a UDP-N-acetyl-D-glucosamine 6-dehydrogenase involved in O-antigen biosynthesis in Pseudomonas aeruginosa PAO1. J Biol Chem 2004; 279:37551-8. [PMID: 15226302 DOI: 10.1074/jbc.m404749200] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
WbpA (PA3159) is an enzyme involved in the biosynthesis of unusual di-N-acetyl-d-mannosaminuronic acid-derived sugar nucleotides found in the O antigen of Pseudomonas aeruginosa PAO1 (serotype O5). The wbpA gene that encodes this enzyme was cloned into pET-28a, overexpressed as a histidine-tagged fusion protein, and purified by nickel chelation chromatography. Capillary electrophoresis was used to examine substrate conversion by WbpA, and the data revealed that WbpA is a UDP-N-acetyl-D-glucosamine 6-dehydrogenase (EC 1.1.1.136), which uses NAD(+) as a coenzyme. The enzyme reaction product was purified by HPLC and analyzed using NMR spectroscopy. Our results showed unequivocally that the product of the WbpA reaction is UDP-N-acetyl-d-glucosaminuronic acid. WbpA requires either NH(4)(+) or K(+) for activity and the accompanying anions exert secondary effects on activity consistent with their ranking in the Hofmeister series. Kinetic analysis showed positive cooperativity with respect to UDP-N-acetyl-d-glucosamine binding with a K(0.5) of 94 microM, a k(cat) of 86 min(-1), and a Hill coefficient of 1.8. In addition, WbpA has a K(0.5) for NAD(+) of 220 microM, a k(cat) of 86 min(-1), and a Hill coefficient of 1.1. The oligomerization state of WbpA was analyzed by gel filtration, dynamic light scattering, and analytical ultracentrifugation, with all three techniques indicating that WbpA exists as a trimer in solution. However, tertiary structure predictions suggested a tetramer, which was supported by data from transmission electron microscopy. The electron micrograph of negatively stained WbpA samples revealed structures with 4-fold symmetry.
Collapse
Affiliation(s)
- Wayne L Miller
- Department of Microbiology, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | | | | | | | | | | |
Collapse
|
62
|
Kotlovyi V, Nichols WL, Ten Eyck LF. Protein structural alignment for detection of maximally conserved regions. Biophys Chem 2004; 105:595-608. [PMID: 14499921 DOI: 10.1016/s0301-4622(03)00069-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
An algorithm for comparison of homologous protein structures and for study of conformational changes in proteins, has been developed. The method is based on identification of pieces of the two molecules that have similar shapes, as determined by the local conformation of the polypeptide chain. Pieces that superpose within a specified tolerance are assembled into domains based on similar transformations for superposition. The result is sets of pieces that represent conserved structural elements and conserved spatial relationships between structural elements within the proteins being compared. A similarity criterion based on maximum distance rather than on root mean square deviation reduces bias by outliers. The utility of the method is demonstrated by using examples from the protein kinase family.
Collapse
Affiliation(s)
- Vladimir Kotlovyi
- San Diego Supercomputer Center, University of California, San Diego 0505, 9500 Gilman Drive, La Jolla, CA 92093-0505, USA
| | | | | |
Collapse
|
63
|
Koike R, Kinoshita K, Kidera A. Probabilistic description of protein alignments for sequences and structures. Proteins 2004; 56:157-66. [PMID: 15162495 DOI: 10.1002/prot.20067] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
A number of equally optimal alignments inherently exist in the sequence and structure comparisons among proteins. To represent the sub-optimal alignments systematically, we have developed a method of generating probabilistic alignments for sequences and structures, by which the correspondence between pairs of residues is evaluated in a probabilistic manner. Our method uses the periodic boundary condition to avoid the entropy artifact favoring full-length matches. In the structure comparison, the environmental effects are incorporated by the mean-field approximation. We applied this method in comparisons of two pairs of proteins with internal symmetry; the first set were proteins of TIM-barrel fold and the second were beta-trefoil fold. These pairs are expected to have distinct sub-optimal alignments suitable for probabilistic description with the periodic boundary. It was shown that the sequence and structure alignments are consistent with each other and that the alignments with the highest probability represent circular permutation.
Collapse
Affiliation(s)
- Ryotaro Koike
- Department of Chemistry, Graduate School of Science, Kyoto University, Kitashirakawa-Oiwake-cho, Sakyo-ku, Kyoto 606-8502, Japan
| | | | | |
Collapse
|
64
|
Abstract
The structural comparison of two proteins comes up in many applications in structural biology where it is often necessary to find similarities in very large conformation sets. This work describes techniques to achieve significant speedup in the computation of structural similarity between two given conformations, at the expense of introducing a small error in the similarity measure. Furthermore, the proposed computational scheme allows for a tradeoff between speedup and error. This scheme exploits the fact that the Calpha representation of a protein conformation contains redundant information, due to the chain topology and limited compactness of proteins. This redundancy can be reduced by approximating subchains of a protein by their centers of mass, resulting in a smaller number of points to describe a conformation. A Haar wavelet analysis of random chains and proteins is used to justify this approximated representation. Similarity measures computed with this representation are highly correlated to the measures computed with the original Calpha representation. Therefore, they can be used in applications where small similarity errors can be tolerated or as fast filters in applications that require exact measures. Computational tests have been conducted on two applications, nearest neighbor search and automatic structural classification.
Collapse
Affiliation(s)
- Itay Lotan
- Department of Computer Science, 353 Serra Mall, Stanford University, Stanford, CA 94305, USA.
| | | |
Collapse
|
65
|
Ochagavía ME, Wodak S. Progressive combinatorial algorithm for multiple structural alignments: Application to distantly related proteins. Proteins 2004; 55:436-54. [PMID: 15048834 DOI: 10.1002/prot.10587] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
MALECON is a progressive combinatorial procedure for multiple alignments of protein structures. It searches a library of pairwise alignments for all three-protein alignments in which a specified number of residues is consistently aligned. These alignments are progressively expanded to include additional proteins and more spatially equivalent residues, subject to certain criteria. This action involves superimposing the aligned proteins by their hitherto equivalent residues and searching for additional Calpha atoms that lie close in space. The performance of MALECON is illustrated and compared with several extant multiple structure alignment methods by using as test the globin homologous superfamily, the OB and the Jellyrolls folds. MALECON gives better definitions of the common structural features in the structurally more diverse proteins of the OB and Jellyrolls folds, but it yields comparable results for the more similar globins. When no consistent multiple alignments can be derived for all members of a protein group, our procedure is still capable of automatically generating consistent alignments and common core definitions for subgroups of the members. This finding is illustrated for proteins of the OB fold and SH3 domains, believed to share common structural features, and should be very instrumental in homology modeling and investigations of protein evolution.
Collapse
|
66
|
Venclovas C, Zemla A, Fidelis K, Moult J. Assessment of progress over the CASP experiments. Proteins 2004; 53 Suppl 6:585-95. [PMID: 14579350 DOI: 10.1002/prot.10530] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The quality of structure models produced in the CASP5 experiment has been compared with that in earlier CASPs. The most significant progress is in the fold recognition regime, where the development of meta-servers has allowed more accurate consensus models to be generated. In contrast to this, there is little evidence of progress in producing more accurate comparative models, particularly those based on sequence identities > 30%. For comparative models based on low-sequence identity and for fold recognition models, accuracy depends primarily on the fraction of the target structure that is similar to an available template, and the quality of the alignment. Overall, these results indicate that there are still no effective methods of improving model quality beyond that obtained by successfully copying a template structure. For models of proteins with previously unknown folds, there appears to be a pause in the previous consistent improvement. There is some evidence that more groups are producing top-quality models, however. Although specific progress between successive experiments is sometimes difficulty to identify, over the history of all the CASPs there has been steady, if sometimes slow, progress in all modeling regimes.
Collapse
Affiliation(s)
- Ceslovas Venclovas
- Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California, USA
| | | | | | | |
Collapse
|
67
|
Albrecht M, Domingues FS, Schreiber S, Lengauer T. Structural localization of disease-associated sequence variations in the NACHT and LRR domains of PYPAF1 and NOD2. FEBS Lett 2003; 554:520-8. [PMID: 14623123 DOI: 10.1016/s0014-5793(03)01222-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Several autoinflammatory diseases with distinct clinical manifestations have been associated with sequence variations in the gene products PYPAF1/CIAS1 and NOD2/CARD15. Both proteins belong to the PYD/CARD-containing family of apoptosis regulators and activators of pro-inflammatory caspases. To gain insight into the dysfunctional role of sequence alterations, we assembled a structure-based multiple sequence alignment of family members and related proteins. This allowed us to analyze the putative effect of the alterations on the function of nucleotide-binding (NACHT) and leucine-rich repeat (LRR) domains shared by the family members. In support of this analysis, we carefully selected template structures for the NACHT and LRR domains and mapped the genetic variations onto 3D domain models. Additionally, we propose a model of the NACHT and LRR domain complex. Our study revealed that many of the disease-associated sequence variants are located close to highly conserved sequence regions of functional relevance and are spatially adjacent in the predicted 3D structure. The implications on the domain functions such as NTP-hydrolysis or oligomerization are discussed.
Collapse
Affiliation(s)
- Mario Albrecht
- Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbrücken, Germany.
| | | | | | | |
Collapse
|
68
|
Kinch LN, Wrabl JO, Krishna SS, Majumdar I, Sadreyev RI, Qi Y, Pei J, Cheng H, Grishin NV. CASP5 assessment of fold recognition target predictions. Proteins 2003; 53 Suppl 6:395-409. [PMID: 14579328 DOI: 10.1002/prot.10557] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
We present an overview of the fifth round of Critical Assessment of Protein Structure Prediction (CASP5) fold recognition category. Prediction models were evaluated by using six different structural measures and four different alignment measures, and these scores were compared to those assigned manually over a diverse subset of target domains. Scores were combined to compare overall performance of participating groups and to estimate rank significance. The methods used by a few groups outperformed all other methods in terms of the evaluated criteria and could be considered state-of-the-art in structure prediction. We discuss a few examples of difficult fold recognition targets to highlight the progress of ab initio-type methods on difficult structure analogs and the difficulties of predicting multidomain targets and selecting prediction models. We also compared the results of manual groups to those of automatic servers evaluated in parallel by CAFASP, showing that the top performing automated server structure predictions approached those of the best manual predictors.
Collapse
Affiliation(s)
- Lisa N Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
69
|
Abstract
Recent identification of the prokaryotic genes related to the catalytic oxygenase domain of mammalian nitric oxide synthase (NOS) has led to speculations on the origins of the NO signaling network. NOS activity in eukaryotes relies on the concerted action of the oxygenase domain with an electron-donating reductase domain that is fused to it. A fused reductase domain is, however, absent in prokaryotes. Consequently, we searched bacterial genomes for homologs of the reductase domain and identified candidate genes. On the basis of genomic sequence and protein structural analysis, we here propose that sulfite reductase flavoprotein is a prototype of the mammalian NOS reductase domain and a complementing interaction partner of the bacterial NOS oxygenase protein.
Collapse
Affiliation(s)
- Tomasz Zemojtel
- Department of Bioinformatics, University of Würzburg, Am Hubland, D-97074 Würzburg, Germany.
| | | | | |
Collapse
|
70
|
Abstract
We present the LGA (Local-Global Alignment) method, designed to facilitate the comparison of protein structures or fragments of protein structures in sequence dependent and sequence independent modes. The LGA structure alignment program is available as an online service at http://PredictionCenter.llnl.gov/local/lga. Data generated by LGA can be successfully used in a scoring function to rank the level of similarity between two structures and to allow structure classification when many proteins are being analyzed. LGA also allows the clustering of similar fragments of protein structures.
Collapse
Affiliation(s)
- Adam Zemla
- Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, CA 94550, USA.
| |
Collapse
|
71
|
Ohi MD, Vander Kooi CW, Rosenberg JA, Chazin WJ, Gould KL. Structural insights into the U-box, a domain associated with multi-ubiquitination. Nat Struct Mol Biol 2003; 10:250-5. [PMID: 12627222 PMCID: PMC5881891 DOI: 10.1038/nsb906] [Citation(s) in RCA: 214] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2002] [Accepted: 01/15/2003] [Indexed: 11/08/2022]
Abstract
The structure of the U-box in the essential Saccharomyces cerevisiae pre-mRNA splicing factor Prp19p has been determined by NMR. The conserved zinc-binding sites supporting the cross-brace arrangement in RING-finger domains are replaced by hydrogen-bonding networks in the U-box. These hydrogen-bonding networks are necessary for the structural stabilization and activity of the U-box. A conservative Val-->Ile point mutation in the Prp19p U-box domain leads to pre-mRNA splicing defects in vivo. NMR analysis of this mutant shows that the substitution disrupts structural integrity of the U-box domain. Furthermore, comparison of the Prp19p U-box domain with known RING-E2 complex structures demonstrates that both U-box and RING-fingers contain a conserved interaction surface. Mutagenesis of residues at this interface, while not perturbing the structure of the U-box, abrogates Prp19p function in vivo. These comparative structural and functional analyses imply that the U-box and its associated ubiquitin ligase activity are critical for Prp19p function in vivo.
Collapse
Affiliation(s)
- Melanie D Ohi
- Howard Hughes Medical Institute School of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA
| | | | | | | | | |
Collapse
|
72
|
Van Walle I, Lasters I, Wyns L. Consistency matrices: quantified structure alignments for sets of related proteins. Proteins 2003; 51:1-9. [PMID: 12596259 DOI: 10.1002/prot.10293] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Comparing two remotely similar structures is a difficult problem: more often than not, resulting structure alignments will show ambiguities and a unique answer usually does not even exist. In addition, alignments in general have a limited information content because every aligned residue is considered equally important. To solve these issues to a certain extent, one can take the perspective of a whole group of similar structures and then evaluate common structural features. Here, we describe a consistency approach that, although not actually performing a multiple structure alignment, does produce the information that one would conceivably want from such an experiment: the key structural features of the group, e.g., a fold, which in this case are projected onto either a pair of proteins or a single protein. Both representations are useful for a number of applications, ranging from the detection of (partially) wrong structure alignments to protein structure classification and fold recognition. To demonstrate some of these applications, the procedure was applied to 195 SCOP folds containing a total of 1802 domains sharing very low sequence similarity.
Collapse
Affiliation(s)
- Ivo Van Walle
- Department of Ultrastructure, Vrije Universiteit Brussel, Sint-Genesius Rode, Belgium.
| | | | | |
Collapse
|
73
|
Van Petegem F, Collins T, Meuwis MA, Gerday C, Feller G, Van Beeumen J. The structure of a cold-adapted family 8 xylanase at 1.3 A resolution. Structural adaptations to cold and investgation of the active site. J Biol Chem 2003; 278:7531-9. [PMID: 12475991 DOI: 10.1074/jbc.m206862200] [Citation(s) in RCA: 102] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Enzymes from psychrophilic organisms differ from their mesophilic counterparts in having a lower thermostability and a higher specific activity at low and moderate temperatures. The current consensus is that they have an increased flexibility, enhancing accommodation and transformation of the substrates at low energy costs. Here we describe the structure of the xylanase from the Antarctic bacterium Pseudoalteromonas haloplanktis at 1.3 A resolution. Xylanases are usually grouped into glycosyl hydrolase families 10 and 11, but this enzyme belongs to family 8. The fold differs from that of other known xylanases and can be described as an (alpha/alpha)(6) barrel. Various parameters that may explain the cold-adapted properties were examined and indicated that the protein has a reduced number of salt bridges and an increased exposure of hydrophobic residues. The crystal structures of a complex with xylobiose and of mutant D144N were obtained at 1.2 and 1.5 A resolution, respectively. Analysis of the various substrate binding sites shows that the +3 and -3 subsites are rearranged as compared to those of a family 8 homolog, while the xylobiose complex suggests the existence of a +4 subsite. A decreased acidity of the substrate binding cleft and an increased flexibility of aromatic residues lining the subsites may enhance the rate at which substrate is bound.
Collapse
Affiliation(s)
- Filip Van Petegem
- Laboratorium voor Eiwitbiochemie en Eiwitengineering, Ghent University, Ledeganckstraat 35, B-9000 Gent, Belgium
| | | | | | | | | | | |
Collapse
|
74
|
Abstract
To assess the reliability of fold assignments to protein sequences, we developed a fold recognition method called FROST (Fold Recognition-Oriented Search Tool) based on a series of filters and a database specifically designed as a benchmark for this new method under realistic conditions. This benchmark database consists of proteins for which there exists, at least, another protein with an extensively similar 3D structure in a database of representative 3D structures (i.e., more than 65% of the residues in both proteins can be structurally aligned). Because the testing of our method must be carried out under conditions similar to those of real fold recognition experiments, no protein pair with sequence similarity detectable using standard sequence comparison methods such as FASTA is included in the benchmark database. While using FROST, we achieved a coverage of 60% for a rate of error of 1%. To obtain a baseline for our method, we used PSI-BLAST and 3D-PSSM. Under the same conditions, for a 1% error rate, coverages for PSI-BLAST and 3D-PSSM were 33 and 56%, respectively.
Collapse
Affiliation(s)
- Antoine Marin
- Mathématique, Informatique et Génome, Centre de Recherche de Versailles, INRA, Route de St Cyr, 78026 Versailles, Cedex, France
| | | | | | | |
Collapse
|
75
|
Ortiz AR, Strauss CEM, Olmea O. MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison. Protein Sci 2002; 11:2606-21. [PMID: 12381844 PMCID: PMC2373724 DOI: 10.1110/ps.0215902] [Citation(s) in RCA: 320] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Advances in structural genomics and protein structure prediction require the design of automatic, fast, objective, and well benchmarked methods capable of comparing and assessing the similarity of low-resolution three-dimensional structures, via experimental or theoretical approaches. Here, a new method for sequence-independent structural alignment is presented that allows comparison of an experimental protein structure with an arbitrary low-resolution protein tertiary model. The heuristic algorithm is given and then used to show that it can describe random structural alignments of proteins with different folds with good accuracy by an extreme value distribution. From this observation, a structural similarity score between two proteins or two different conformations of the same protein is derived from the likelihood of obtaining a given structural alignment by chance. The performance of the derived score is then compared with well established, consensus manual-based scores and data sets. We found that the new approach correlates better than other tools with the gold standard provided by a human evaluator. Timings indicate that the algorithm is fast enough for routine use with large databases of protein models. Overall, our results indicate that the new program (MAMMOTH) will be a good tool for protein structure comparisons in structural genomics applications. MAMMOTH is available from our web site at http://physbio.mssm.edu/~ortizg/.
Collapse
Affiliation(s)
- Angel R Ortiz
- Department of Physiology and Biophysics, Mount Sinai School of Medicine, New York University, New York, New York 10029, USA.
| | | | | |
Collapse
|
76
|
Lesk AM, Lo Conte L, Hubbard TJ. Assessment of novel fold targets in CASP4: predictions of three-dimensional structures, secondary structures, and interresidue contacts. Proteins 2002; Suppl 5:98-118. [PMID: 11835487 DOI: 10.1002/prot.10056] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
In the Novel Fold category, three types of predictions were assessed: three-dimensional structures, secondary structures, and residue-residue contacts. For predictions of three-dimensional models, CASP4 targets included 5 domains or structures with novel folds, and 13 on the borderline between Novel Fold and Fold Recognition categories. These elicited 1863 predictions of these and other targets by methods more general than comparative modeling or fold recognition techniques. The group of Bonneau, Tsai, Ruczinski, and Baker stood out as performing well with the greatest consistency. In many cases, several groups were able to predict fragments of the target correctly-often at a level somewhat larger than standard supersecondary structures-but were not able to assemble fragments into a correct global topology. The methods of Bonneau, Tsai, Ruczinski, and Baker have been successful in addressing the fragment assembly problem for many but not all the target structures.
Collapse
Affiliation(s)
- A M Lesk
- Department of Haematology, University of Cambridge Clinical School, Cambridge Institute for Medical Research, Cambridge, United Kingdom.
| | | | | |
Collapse
|
77
|
Sippl MJ, Lackner P, Domingues FS, Prlić A, Malik R, Andreeva A, Wiederstein M. Assessment of the CASP4 fold recognition category. Proteins 2002; Suppl 5:55-67. [PMID: 11835482 DOI: 10.1002/prot.10006] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We present the assessment of the CASP4 fold recognition category. The tasks we had to execute include the splitting of multidomain targets into single domains, the classification of target domains in terms of prediction categories, the numerical evaluation of predictions, the mapping of numerical scores to quality indices, the ranking of predictors, the selection of top-performing groups, and the analysis and critical discussion of the state of the art in this field. The 125 fold recognition groups were assessed by a total score that summarizes their performance over all targets and a quality score reflecting the average quality of the submitted models. Most of the top-performing groups achieved respectable results on both scores simultaneously. Several groups submitted models that were much closer to the respective target structures than any of the known folds in the Protein Data Bank. The CASP4 assessment included the automated servers of the parallel CAFASP experiment. For the total score, the highest rank achieved by a fully automated server is 12. Two thirds of the predictors have rather low scores.
Collapse
Affiliation(s)
- M J Sippl
- Center for Applied Molecular Engineering, Institute for Chemistry and Biochemistry, University of Salzburg, Salzburg, Austria.
| | | | | | | | | | | | | |
Collapse
|
78
|
Novotny J, Rigoutsos I, Coleman D, Shenk T. In silico structural and functional analysis of the human cytomegalovirus (HHV5) genome. J Mol Biol 2001; 310:1151-66. [PMID: 11502002 DOI: 10.1006/jmbi.2001.4798] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The open reading frames of human cytomegalovirus (human herpesvirus-5, HHV5) encode some 213 unique proteins with mostly unknown functions. Using the threading program, ProCeryon, we calculated possible matches between the amino acid sequences of these proteins and the Protein Data Bank library of three-dimensional structures. Thirty-six proteins were fully identified in terms of their structure and, often, function; 65 proteins were recognized as members of narrow structural/functional families (e.g. DNA-binding factors, cytokines, enzymes, signaling particles, cell surface receptors etc.); and 87 proteins were assigned to broad structural classes (e.g. all-beta, 3-layer-alphabetaalpha, multidomain, etc.). Genes encoding proteins with similar folds, or containing identical structural traits (extreme sequence length, runs of unstructured (Pro and/or Gly-rich) residues, transmembrane segments, etc.) often formed tandem clusters throughout the genome. In the course of this work, benchmarks on about 20 known folds were used to optimize adjustable parameters of threading calculations, i.e. gap penalty weights used in sequence/structure alignments; new scores obtained as simple combinations of existing scoring functions; and number of threading runs conducive to meaningful results. An introduction of summed, per-residue-normalized scores has been essential for discovery of subdomains (EGF-like, SH2, SH3) in longer protein sequences, such as the eight "open sandwich" cytokine domains, 60-70 amino acids long and having the 3beta1alpha fold with one or two disulfide bridges, present in otherwise unrelated proteins.
Collapse
Affiliation(s)
- J Novotny
- Victor Chang Cardiac Research Institute, Darlinghurst, NSW, Australia.
| | | | | | | |
Collapse
|