51
|
Raman S, Vernon R, Thompson J, Tyka M, Sadreyev R, Pei J, Kim D, Kellogg E, DiMaio F, Lange O, Kinch L, Sheffler W, Kim BH, Das R, Grishin NV, Baker D. Structure prediction for CASP8 with all-atom refinement using Rosetta. Proteins 2010; 77 Suppl 9:89-99. [PMID: 19701941 DOI: 10.1002/prot.22540] [Citation(s) in RCA: 366] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We describe predictions made using the Rosetta structure prediction methodology for the Eighth Critical Assessment of Techniques for Protein Structure Prediction. Aggressive sampling and all-atom refinement were carried out for nearly all targets. A combination of alignment methodologies was used to generate starting models from a range of templates, and the models were then subjected to Rosetta all atom refinement. For the 64 domains with readily identified templates, the best submitted model was better than the best alignment to the best template in the Protein Data Bank for 24 cases, and improved over the best starting model for 43 cases. For 13 targets where only very distant sequence relationships to proteins of known structure were detected, models were generated using the Rosetta de novo structure prediction methodology followed by all-atom refinement; in several cases the submitted models were better than those based on the available templates. Of the 12 refinement challenges, the best submitted model improved on the starting model in seven cases. These improvements over the starting template-based models and refinement tests demonstrate the power of Rosetta structure refinement in improving model accuracy.
Collapse
Affiliation(s)
- Srivatsan Raman
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
52
|
Abstract
Functional characterization of a protein is often facilitated by its 3D structure. However, the fraction of experimentally known 3D models is currently less than 1% due to the inherently time-consuming and complicated nature of structure determination techniques. Computational approaches are employed to bridge the gap between the number of known sequences and that of 3D models. Template-based protein structure modeling techniques rely on the study of principles that dictate the 3D structure of natural proteins from the theory of evolution viewpoint. Strategies for template-based structure modeling will be discussed with a focus on comparative modeling, by reviewing techniques available for all the major steps involved in the comparative modeling pipeline.
Collapse
Affiliation(s)
- Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
| |
Collapse
|
53
|
Abstract
As the field of protein structure prediction continues to expand at an
exponential rate, the bench-biologist might feel overwhelmed by the sheer
range of available applications. This review presents the three main
approaches in computational structure prediction from a
non-bioinformatician?s point of view and makes a selection of tools and
servers freely available. These tools are evaluated from several aspects,
such as number of citations, ease of usage and quality of the results.
Finally, the applications of models generated by computational structure
prediction are discussed.
Collapse
|
54
|
Lenhart TR, Akins DR. Borrelia burgdorferi locus BB0795 encodes a BamA orthologue required for growth and efficient localization of outer membrane proteins. Mol Microbiol 2009; 75:692-709. [PMID: 20025662 DOI: 10.1111/j.1365-2958.2009.07015.x] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The outer membrane (OM) of the pathogenic diderm spirochete, Borrelia burgdorferi, contains integral beta-barrel outer membrane proteins (OMPs) in addition to its numerous outer surface lipoproteins. Very few OMPs have been identified in B. burgdorferi, and the protein machinery required for OMP assembly and OM localization is currently unknown. Essential OM BamA proteins have recently been characterized in Gram-negative bacteria that are central components of an OM beta-barrel assembly machine and are required for proper localization and insertion of bacterial OMPs. In the present study, we characterized a putative B. burgdorferi BamA orthologue encoded by open reading frame bb0795. Structural model predictions and cellular localization data indicate that the B. burgdorferi BB0795 protein contains an N-terminal periplasmic domain and a C-terminal, surface-exposed beta-barrel domain. Additionally, assays with an IPTG-regulatable bb0795 mutant revealed that BB0795 is required for B. burgdorferi growth. Furthermore, depletion of BB0795 results in decreased amounts of detectable OMPs in the B. burgdorferi OM. Interestingly, a decrease in the levels of surface-exposed lipoproteins was also observed in the mutant OMs. Collectively, our structural, cellular localization and functional data are consistent with the characteristics of other BamA proteins, indicating that BB0795 is a B. burgdorferi BamA orthologue.
Collapse
Affiliation(s)
- Tiffany R Lenhart
- Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA
| | | |
Collapse
|
55
|
Guex N, Peitsch MC, Schwede T. Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: a historical perspective. Electrophoresis 2009; 30 Suppl 1:S162-73. [PMID: 19517507 DOI: 10.1002/elps.200900140] [Citation(s) in RCA: 1304] [Impact Index Per Article: 86.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
SWISS-MODEL pioneered the field of automated modeling as the first protein modeling service on the Internet. In combination with the visualization tool Swiss-PdbViewer, the Internet-based Workspace and the SWISS-MODEL Repository, it provides a fully integrated sequence to structure analysis and modeling platform. This computational environment is made freely available to the scientific community with the aim to hide the computational complexity of structural bioinformatics and encourage bench scientists to make use of the ever-increasing structural information available. Indeed, over the last decade, the availability of structural information has significantly increased for many organisms as a direct consequence of the complementary nature of comparative protein modeling and experimental structure determination. This has a very positive and enabling impact on many different applications in biomedical research as described in this paper.
Collapse
Affiliation(s)
- Nicolas Guex
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | | |
Collapse
|
56
|
Zhou H, Skolnick J. Protein structure prediction by pro-Sp3-TASSER. Biophys J 2009; 96:2119-27. [PMID: 19289038 DOI: 10.1016/j.bpj.2008.12.3898] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2008] [Revised: 11/12/2008] [Accepted: 12/03/2008] [Indexed: 12/29/2022] Open
Abstract
An automated protein structure prediction algorithm, pro-sp3-Threading/ASSEmbly/Refinement (TASSER), is described and benchmarked. Structural templates are identified using five different scoring functions derived from the previously developed threading methods PROSPECTOR_3 and SP(3). Top templates identified by each scoring function are combined to derive contact and distant restraints for subsequent model refinement by short TASSER simulations. For Medium/Hard targets (those with moderate to poor quality templates and/or alignments), alternative template alignments are also generated by parametric alignment and the top models selected by TASSER-QA are included in the contact and distance restraint derivation. Then, multiple short TASSER simulations are used to generate an ensemble of full-length models. Subsequently, the top models are selected from the ensemble by TASSER-QA and used to derive TASSER contacts and distant restraints for another round of full TASSER refinement. The final models are selected from both rounds of TASSER simulations by TASSER-QA. We compare pro-sp3-TASSER with our previously developed MetaTASSER method (enhanced with chunk-TASSER for Medium/Hard targets) on a representative test data set of 723 proteins <250 residues in length. For the 348 proteins classified as easy targets (those templates with good alignments and global structure similarity to the target), the cumulative TM-score of the best of top five models by pro-sp3-TASSER shows a 2.1% improvement over MetaTASSER. For the 155/220 medium/hard targets, the improvements in TM-score are 2.8% and 2.2%, respectively. All improvements are statistically significant. More importantly, the number of foldable targets (those having models whose TM-score to native >0.4 in the top five clusters) increases from 472 to 497 for all targets, and the relative increases for medium and hard targets are 10% and 15%, respectively. A server that implements the above algorithm is available at http://cssb.biology.gatech.edu/skolnick/webservice/pro-sp3-TASSER/. The source code is also available upon request.
Collapse
Affiliation(s)
- Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, USA
| | | |
Collapse
|
57
|
Niv MY, Iida K, Zheng R, Horiguchi A, Shen R, Nanus DM. Rational redesign of neutral endopeptidase binding to merlin and moesin proteins. Protein Sci 2009; 18:1042-50. [PMID: 19388049 DOI: 10.1002/pro.114] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Neutral endopeptidase (NEP) is a 90- to 110-kDa cell-surface peptidase that is normally expressed by numerous tissues but whose expression is lost or reduced in a variety of malignancies. The anti-tumorigenic function of NEP is mediated not only by its catalytic activity but also through direct protein-protein interactions of its cytosolic region with several binding partners, including Lyn kinase, PTEN, and ezrin/radixin/moesin (ERM) proteins. We have previously shown that mutation of the K(19)K(20)K(21) basic cluster in NEPs' cytosolic region to residues QNI disrupts binding to the ERM proteins. Here we show that the ERM-related protein merlin (NF2) does not bind NEP or its cytosolic region. Using experimental data, threading, and sequence analysis, we predicted the involvement of moesin residues E(159)Q(160) in binding to the NEP cytosolic domain. Mutation of these residues to NL (to mimic the corresponding N(159)L(160) residues in the nonbinder merlin) disrupted moesin binding to NEP. Mutation of residues N(159)L(160)Y(161)K(162)M(163) in merlin to the corresponding moesin residues resulted in NEP binding to merlin. This engineered NEP peptide-merlin interaction was diminished by the QNI mutation in NEP, supporting the role of the NEP basic cluster in binding. We thus identified the region of interaction between NEP and moesin, and engineered merlin into a NEP-binding protein. These data form the basis for further exploration of the details of NEP-ERM binding and function.
Collapse
Affiliation(s)
- Masha Y Niv
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, P.O. Box 12, Rehovot 76100, Israel
| | | | | | | | | | | |
Collapse
|
58
|
Szymczyna BR, Taurog RE, Young MJ, Snyder JC, Johnson JE, Williamson JR. Synergy of NMR, computation, and X-ray crystallography for structural biology. Structure 2009; 17:499-507. [PMID: 19368883 PMCID: PMC2705668 DOI: 10.1016/j.str.2009.03.001] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2009] [Revised: 02/22/2009] [Accepted: 03/03/2009] [Indexed: 11/26/2022]
Abstract
NMR spectroscopy and X-ray crystallography are currently the two most widely applied methods for the determination of macromolecular structures at high resolution. More recently, significant advances have been made in algorithms for the de novo prediction of protein structure, and, in favorable cases, the predicted models agree extremely well with experimentally determined structures. Here, we demonstrate a synergistic combination of NMR spectroscopy, de novo structure prediction, and X-ray crystallography in an effective overall strategy for rapidly determining the structure of the coat protein C-terminal domain from the Sulfolobus islandicus rod-shaped virus (SIRV). This approach takes advantage of the most accessible aspects of each structural technique and may be widely applicable for structure determination.
Collapse
Affiliation(s)
- Blair R. Szymczyna
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, 92037
- Department of Chemistry, and the Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA, 92037
| | - Rebecca E. Taurog
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, 92037
| | - Mark J. Young
- Department of Plant Sciences and Plant Pathology, Montana State University-Bozeman, Bozeman, Montana 59717
| | - Jamie C. Snyder
- Department of Plant Sciences and Plant Pathology, Montana State University-Bozeman, Bozeman, Montana 59717
| | - John E. Johnson
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, 92037
| | - James R. Williamson
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, 92037
- Department of Chemistry, and the Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA, 92037
| |
Collapse
|
59
|
Lindert S, Stewart PL, Meiler J. Hybrid approaches: applying computational methods in cryo-electron microscopy. Curr Opin Struct Biol 2009; 19:218-25. [PMID: 19339173 DOI: 10.1016/j.sbi.2009.02.010] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2008] [Accepted: 02/26/2009] [Indexed: 12/20/2022]
Abstract
Recent advances in cryo-electron microscopy have led to an increasing number of high (3-5A) to medium (5-10A) resolution cryoEM density maps. These density maps contain valuable information about the protein structure but frequently require computational algorithms to aid their structural interpretation. It is these hybrid approaches between cryoEM and computational protein structure prediction algorithms that will shape protein structure elucidation from density maps.
Collapse
Affiliation(s)
- Steffen Lindert
- Department of Chemistry, Vanderbilt University, Nashville, TN 37212, USA
| | | | | |
Collapse
|
60
|
Dassa B, London N, Stoddard BL, Schueler-Furman O, Pietrokovski S. Fractured genes: a novel genomic arrangement involving new split inteins and a new homing endonuclease family. Nucleic Acids Res 2009; 37:2560-73. [PMID: 19264795 PMCID: PMC2677866 DOI: 10.1093/nar/gkp095] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Inteins are genetic elements, inserted in-frame into protein-coding genes, whose products catalyze their removal from the protein precursor via a protein-splicing reaction. Intein domains can be split into two fragments and still ligate their flanks by a trans-protein-splicing reaction. A bioinformatic analysis of environmental metagenomic data revealed 26 different loci with a novel genomic arrangement. In each locus, a conserved enzyme coding region is broken in two by a split intein, with a free-standing endonuclease gene inserted in between. Eight types of DNA synthesis and repair enzymes have this ‘fractured’ organization. The new types of naturally split-inteins were analyzed in comparison to known split-inteins. Some loci include apparent gene control elements brought in with the endonuclease gene. A newly predicted homing endonuclease family, related to very-short patch repair (Vsr) endonucleases, was found in half of the loci. These putative homing endonucleases also appear in group-I introns, and as stand-alone inserts in the absence of surrounding intervening sequences. The new fractured genes organization appears to be present mainly in phage, shows how endonucleases can integrate into inteins, and may represent a missing link in the evolution of gene breaking in general, and in the creation of split-inteins in particular.
Collapse
Affiliation(s)
- Bareket Dassa
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | | | | | | | | |
Collapse
|
61
|
Arnold K, Kiefer F, Kopp J, Battey JND, Podvinec M, Westbrook JD, Berman HM, Bordoli L, Schwede T. The Protein Model Portal. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2009; 10:1-8. [PMID: 19037750 PMCID: PMC2704613 DOI: 10.1007/s10969-008-9048-5] [Citation(s) in RCA: 117] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2008] [Accepted: 11/02/2008] [Indexed: 11/28/2022]
Abstract
Structural Genomics has been successful in determining the structures of many unique proteins in a high throughput manner. Still, the number of known protein sequences is much larger than the number of experimentally solved protein structures. Homology (or comparative) modeling methods make use of experimental protein structures to build models for evolutionary related proteins. Thereby, experimental structure determination efforts and homology modeling complement each other in the exploration of the protein structure space. One of the challenges in using model information effectively has been to access all models available for a specific protein in heterogeneous formats at different sites using various incompatible accession code systems. Often, structure models for hundreds of proteins can be derived from a given experimentally determined structure, using a variety of established methods. This has been done by all of the PSI centers, and by various independent modeling groups. The goal of the Protein Model Portal (PMP) is to provide a single portal which gives access to the various models that can be leveraged from PSI targets and other experimental protein structures. A single interface allows all existing pre-computed models across these various sites to be queried simultaneously, and provides links to interactive services for template selection, target-template alignment, model building, and quality assessment. The current release of the portal consists of 7.6 million model structures provided by different partner resources (CSMP, JCSG, MCSG, NESG, NYSGXRC, JCMM, ModBase, SWISS-MODEL Repository). The PMP is available at http://www.proteinmodelportal.org and from the PSI Structural Genomics Knowledgebase.
Collapse
Affiliation(s)
- Konstantin Arnold
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Florian Kiefer
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Jürgen Kopp
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - James N. D. Battey
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Michael Podvinec
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - John D. Westbrook
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854-8087 USA
| | - Helen M. Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854-8087 USA
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| |
Collapse
|
62
|
Zhu J, Fan H, Periole X, Honig B, Mark AE. Refining homology models by combining replica-exchange molecular dynamics and statistical potentials. Proteins 2009; 72:1171-88. [PMID: 18338384 DOI: 10.1002/prot.22005] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
A protocol is presented for the global refinement of homology models of proteins. It combines the advantages of temperature-based replica-exchange molecular dynamics (REMD) for conformational sampling and the use of statistical potentials for model selection. The protocol was tested using 21 models. Of these 14 were models of 10 small proteins for which high-resolution crystal structures were available, the remainder were targets of the recent CASPR exercise. It was found that REMD in combination with currently available force fields could sample near-native conformational states starting from high-quality homology models. Conformations in which the backbone RMSD of secondary structure elements (SSE-RMSD) was lower than the starting value by 0.5-1.0 A were found for 15 out of the 21 cases (average 0.82 A). Furthermore, when a simple scoring function consisting of two statistical potentials was used to rank the structures, one or more structures with SSE-RMSD of at least 0.2 A lower than the starting value was found among the five best ranked structures in 11 out of the 21 cases. The average improvement in SSE-RMSD for the best models was 0.42 A. However, none of the scoring functions tested identified the structures with the lowest SSE-RMSD as the best models although all identified the native conformation as the one with lowest energy. This suggests that while the proposed protocol proved effective for the refinement of high-quality models of small proteins scoring functions remain one of the major limiting factors in structure refinement. This and other aspects by which the methodology could be further improved are discussed.
Collapse
Affiliation(s)
- Jiang Zhu
- Howard Hughes Medical Institute and Columbia University, Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, USA
| | | | | | | | | |
Collapse
|
63
|
Peng J, Xu J. Boosting Protein Threading Accuracy. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY : ... ANNUAL INTERNATIONAL CONFERENCE, RECOMB ... : PROCEEDINGS. RECOMB (CONFERENCE : 2005- ) 2009; 5541:31-45. [PMID: 22506254 DOI: 10.1007/978-3-642-02008-7_3] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Protein threading is one of the most successful protein structure prediction methods. Most protein threading methods use a scoring function linearly combining sequence and structure features to measure the quality of a sequence-template alignment so that a dynamic programming algorithm can be used to optimize the scoring function. However, a linear scoring function cannot fully exploit interdependency among features and thus, limits alignment accuracy.This paper presents a nonlinear scoring function for protein threading, which not only can model interactions among different protein features, but also can be efficiently optimized using a dynamic programming algorithm. We achieve this by modeling the threading problem using a probabilistic graphical model Conditional Random Fields (CRF) and training the model using the gradient tree boosting algorithm. The resultant model is a nonlinear scoring function consisting of a collection of regression trees. Each regression tree models a type of nonlinear relationship among sequence and structure features. Experimental results indicate that this new threading model can effectively leverage weak biological signals and improve both alignment accuracy and fold recognition rate greatly.
Collapse
|
64
|
van Hemert FJ, Zaaijer HL, Berkhout B, Lukashov VV. Occult hepatitis B infection: an evolutionary scenario. Virol J 2008; 5:146. [PMID: 19077239 PMCID: PMC2637267 DOI: 10.1186/1743-422x-5-146] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2008] [Accepted: 12/11/2008] [Indexed: 02/07/2023] Open
Abstract
Background Occult or latent hepatitis B virus (HBV) infection is defined as infection with detectable HBV DNA and undetectable surface antigen (HBsAg) in patients' blood. The cause of an overt HBV infection becoming an occult one is unknown. To gain insight into the mechanism of the development of occult infection, we compared the full-length HBV genome from a blood donor carrying an occult infection (d4) with global genotype D genomes. Results The phylogenetic analysis of polymerase, core and X protein sequences did not distinguish d4 from other genotype D strains. Yet, d4 surface protein formed the evolutionary outgroup relative to all other genotype D strains. Its evolutionary branch was the only one where accumulation of substitutions suggests positive selection (dN/dS = 1.3787). Many of these substitutiions accumulated specifically in regions encoding the core/surface protein interface, as revealed in a 3D-modeled protein complex. We identified a novel RNA splicing event (deleting nucleotides 2986-202) that abolishes surface protein gene expression without affecting polymerase, core and X-protein related functions. Genotype D strains differ in their ability to perform this 2986-202 splicing. Strains prone to 2986-202 splicing constitute a separate clade in a phylogenetic tree of genotype D HBVs. A single substitution (G173T) that is associated with clade membership alters the local RNA secondary structure and is proposed to affect splicing efficiency at the 202 acceptor site. Conclusion We propose an evolutionary scenario for occult HBV infection, in which 2986-202 splicing generates intracellular virus particles devoid of surface protein, which subsequently accumulates mutations due to relaxation of coding constraints. Such viruses are deficient of autonomous propagation and cannot leave the host cell until it is lysed.
Collapse
Affiliation(s)
- Formijn J van Hemert
- Department of Medical Microbiology, Laboratory of Experimental Virology, Center for Infection and Immunity Amsterdam Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
| | | | | | | |
Collapse
|
65
|
Phylogenetic conservation and homology modeling help reveal a novel domain within the budding yeast heterochromatin protein Sir1. Mol Cell Biol 2008; 29:687-702. [PMID: 19029247 DOI: 10.1128/mcb.00202-08] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The yeast Sir1 protein's ability to bind and silence the cryptic mating-type locus HMRa requires a protein-protein interaction between Sir1 and the origin recognition complex (ORC). A domain within the C-terminal half of Sir1, the Sir1 ORC interaction region (Sir1OIR), and the conserved bromo-adjacent homology (BAH) domain within Orc1, the largest subunit of ORC, mediate this interaction. The structure of the Sir1OIR-Orc1BAH complex is known. Sir1OIR and Orc1BAH interacted with a high affinity in vitro, but the Sir1OIR did not inhibit Sir1-dependent silencing when overproduced in vivo, suggesting that other regions of Sir1 helped it bind HMRa. Comparisons of diverged Sir1 proteins revealed two highly conserved regions, N1 and N2, within Sir1's poorly characterized N-terminal half. An N-terminal portion of Sir1 (residues 27 to 149 [Sir1(27-149)]) is similar in sequence to the Sir1OIR; homology modeling predicted a structure for Sir1(27-149) in which N1 formed a submodule similar to the known Orc1BAH-interacting surface on Sir1. Consistent with these findings, two-hybrid assays indicated that the Sir1 N terminus could interact with BAH domains. Amino acid substitutions within or near N1 or N2 reduced full-length Sir1's ability to bind and silence HMRa and to interact with Orc1BAH in a two-hybrid assay. Purified recombinant Sir1 formed a large protease-resistant structure within which the Sir1OIR domain was protected, and Orc1BAH bound Sir1OIR more efficiently than full-length Sir1 in vitro. Thus, the Sir1 N terminus exhibited both positive and negative roles in the formation of a Sir1-ORC silencing complex. This functional duality might contribute to Sir1's selectivity for silencer-bound ORCs in vivo.
Collapse
|
66
|
Nicosia G, Stracquadanio G. Generalized pattern search algorithm for Peptide structure prediction. Biophys J 2008; 95:4988-99. [PMID: 18487293 PMCID: PMC2576383 DOI: 10.1529/biophysj.107.124016] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2007] [Accepted: 03/20/2008] [Indexed: 11/18/2022] Open
Abstract
Finding the near-native structure of a protein is one of the most important open problems in structural biology and biological physics. The problem becomes dramatically more difficult when a given protein has no regular secondary structure or it does not show a fold similar to structures already known. This situation occurs frequently when we need to predict the tertiary structure of small molecules, called peptides. In this research work, we propose a new ab initio algorithm, the generalized pattern search algorithm, based on the well-known class of Search-and-Poll algorithms. We performed an extensive set of simulations over a well-known set of 44 peptides to investigate the robustness and reliability of the proposed algorithm, and we compared the peptide conformation with a state-of-the-art algorithm for peptide structure prediction known as PEPstr. In particular, we tested the algorithm on the instances proposed by the originators of PEPstr, to validate the proposed algorithm; the experimental results confirm that the generalized pattern search algorithm outperforms PEPstr by 21.17% in terms of average root mean-square deviation, RMSD C(alpha).
Collapse
Affiliation(s)
- Giuseppe Nicosia
- Department of Mathematics and Computer Science, University of Catania, Catania, Italy
| | | |
Collapse
|
67
|
Abstract
Advances over the past few years have begun to enable prediction and design of macromolecular structures at near-atomic accuracy. Progress has stemmed from the development of reasonably accurate and efficiently computed all-atom potential functions as well as effective conformational sampling strategies appropriate for searching a highly rugged energy landscape, both driven by feedback from structure prediction and design tests. A unified energetic and kinematic framework in the Rosetta program allows a wide range of molecular modeling problems, from fibril structure prediction to RNA folding to the design of new protein interfaces, to be readily investigated and highlights areas for improvement. The methodology enables the creation of novel molecules with useful functions and holds promise for accelerating experimental structural inference. Emerging connections to crystallographic phasing, NMR modeling, and lower-resolution approaches are described and critically assessed.
Collapse
Affiliation(s)
- Rhiju Das
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
| | | |
Collapse
|
68
|
Dong E, Smith J, Heinze S, Alexander N, Meiler J. BCL::Align-sequence alignment and fold recognition with a custom scoring function online. Gene 2008; 422:41-6. [PMID: 18601985 DOI: 10.1016/j.gene.2008.06.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
BCL::Align is a multiple sequence alignment tool that utilizes the dynamic programming method in combination with a customizable scoring function for sequence alignment and fold recognition. The scoring function is a weighted sum of the traditional PAM and BLOSUM scoring matrices, position-specific scoring matrices output by PSI-BLAST, secondary structure predicted by a variety of methods, chemical properties, and gap penalties. By adjusting the weights, the method can be tailored for fold recognition or sequence alignment tasks at different levels of sequence identity. A Monte Carlo algorithm was used to determine optimized weight sets for sequence alignment and fold recognition that most accurately reproduced the SABmark reference alignment test set. In an evaluation of sequence alignment performance, BCL::Align ranked best in alignment accuracy (Cline score of 22.90 for sequences in the Twilight Zone) when compared with Align-m, ClustalW, T-Coffee, and MUSCLE. ROC curve analysis indicates BCL::Align's ability to correctly recognize protein folds with over 80% accuracy. The flexibility of the program allows it to be optimized for specific classes of proteins (e.g. membrane proteins) or fold families (e.g. TIM-barrel proteins). BCL::Align is free for academic use and available online at http://www.meilerlab.org/.
Collapse
Affiliation(s)
- Elizabeth Dong
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | | | | | | | | |
Collapse
|
69
|
Yang YD, Park C, Kihara D. Threading without optimizing weighting factors for scoring function. Proteins 2008; 73:581-96. [DOI: 10.1002/prot.22082] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
70
|
Vymetal J, Slabý I, Spahr A, Vondrásek J, Lyngstadaas SP. Bioinformatic analysis and molecular modelling of human ameloblastin suggest a two-domain intrinsically unstructured calcium-binding protein. Eur J Oral Sci 2008; 116:124-34. [PMID: 18353005 DOI: 10.1111/j.1600-0722.2008.00526.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Ameloblastin (AMBN) was originally believed to be an enamel-specific extracellular matrix glycoprotein secreted by ameloblasts. Recently, AMBN expression was also detected in developing mesenchymal dental hard tissues, in trauma-induced reparative dentin, and during early craniofacial bone formation. The function and structure of AMBN still remain ambiguous, and there are no known proteins with similar primary sequences. We therefore performed a bio-informatic analysis of AMBN to model ab initio the three-dimensional structure of the molecule. The results suggest that AMBN is a two-domain, intrinsically unstructured protein (IUP). The analysis did not reveal any regions with structural similarity to known receptor-ligand systems, and did not identify any higher-order structures similar to functional regions in other known sequences. The AMBN model predicts 11 defined regions exposed on the surface, internalizing the rest of the molecule including a human-specific insert. Molecular dynamics analysis identified one specific and several non-specific calcium-binding regions, mostly at the C-terminal part of the molecule. The model is supported by previous observations that AMBN is a bipolar calcium-binding molecule and hints at a possible role in protein-protein interactions. The model provides information useful for further studies on the function of AMBN.
Collapse
Affiliation(s)
- Jirí Vymetal
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Flemingovo nam, Prague, Czech Republic
| | | | | | | | | |
Collapse
|
71
|
Tress M, Cheng J, Baldi P, Joo K, Lee J, Seo JH, Lee J, Baker D, Chivian D, Kim D, Ezkurdia I. Assessment of predictions submitted for the CASP7 domain prediction category. Proteins 2008; 69 Suppl 8:137-51. [PMID: 17680686 DOI: 10.1002/prot.21675] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
This paper details the assessment process and evaluation results for the Critical Assessment of Protein Structure Prediction (CASP7) domain prediction category. Domain predictions were assessed using the Normalized Domain Overlap score introduced in CASP6 and the accuracy of prediction of domain break points. The results of the analysis clearly demonstrate that the best methods are able to make consistently reliable predictions when the target has a structural template, although they are less good when the domain break occurs in a region not covered by a template. The conditions of the experiment meant that it was impossible to draw any conclusions about domain prediction for free modeling targets and it was also difficult to draw many distinctions between the best groups. Two thirds of the targets submitted were single domains and hence regarded as easy to predict. Even those targets defined as having multiple domains always had at least one domain with a similar template structure.
Collapse
Affiliation(s)
- Michael Tress
- Structural and Biological Computation Programme, Spanish National Cancer Research Centre, Madrid, Spain.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
72
|
D'hoedt D, Owsianik G, Prenen J, Cuajungco MP, Grimm C, Heller S, Voets T, Nilius B. Stimulus-specific modulation of the cation channel TRPV4 by PACSIN 3. J Biol Chem 2008; 283:6272-80. [PMID: 18174177 DOI: 10.1074/jbc.m706386200] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
TRPV4, a member of the vanilloid subfamily of the transient receptor potential (TRP) channels, is activated by a variety of stimuli, including cell swelling, moderate heat, and chemical compounds such as synthetic 4alpha-phorbol esters. TRPV4 displays a widespread expression in various cells and tissues and has been implicated in diverse physiological processes, including osmotic homeostasis, thermo- and mechanosensation, vasorelaxation, tuning of neuronal excitability, and bladder voiding. The mechanisms that regulate TRPV4 in these different physiological settings are currently poorly understood. We have recently shown that the relative amount of TRPV4 in the plasma membrane is enhanced by interaction with the SH3 domain of PACSIN 3, a member of the PACSIN family of proteins involved in synaptic vesicular membrane trafficking and endocytosis. Here we demonstrate that PACSIN 3 strongly inhibits the basal activity of TRPV4 and its activation by cell swelling and heat, while leaving channel gating induced by the synthetic ligand 4alpha-phorbol 12,13-didecanoate unaffected. A single proline mutation in the SH3 domain of PACSIN 3 abolishes its inhibitory effect on TRPV4, indicating that PACSIN 3 must bind to the channel to modulate its function. In line herewith, mutations at specific proline residues in the N terminus of TRPV4 abolish binding of PACSIN 3 and render the channel insensitive to PACSIN 3-induced inhibition. Taken together, these data suggest that PACSIN 3 acts as an auxiliary protein of TRPV4 channel that not only affects the channel's subcellular localization but also modulates its function in a stimulus-specific manner.
Collapse
Affiliation(s)
- Dieter D'hoedt
- Department of Physiology, Katholieke Universiteit Leuven, Campus Gasthuisberg, O&N 1, Herestraat 49 Bus 802, Leuven, Belgium.
| | | | | | | | | | | | | | | |
Collapse
|
73
|
Qiu J, Sheffler W, Baker D, Noble WS. Ranking predicted protein structures with support vector regression. Proteins 2007; 71:1175-82. [PMID: 18004754 DOI: 10.1002/prot.21809] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Jian Qiu
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | | | | | | |
Collapse
|
74
|
Qian B, Raman S, Das R, Bradley P, McCoy AJ, Read RJ, Baker D. High-resolution structure prediction and the crystallographic phase problem. Nature 2007; 450:259-64. [PMID: 17934447 PMCID: PMC2504711 DOI: 10.1038/nature06249] [Citation(s) in RCA: 258] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2007] [Accepted: 09/13/2007] [Indexed: 11/09/2022]
Abstract
The energy-based refinement of low-resolution protein structure models to atomic-level accuracy is a major challenge for computational structural biology. Here we describe a new approach to refining protein structure models that focuses sampling in regions most likely to contain errors while allowing the whole structure to relax in a physically realistic all-atom force field. In applications to models produced using nuclear magnetic resonance data and to comparative models based on distant structural homologues, the method can significantly improve the accuracy of the structures in terms of both the backbone conformations and the placement of core side chains. Furthermore, the resulting models satisfy a particularly stringent test: they provide significantly better solutions to the X-ray crystallographic phase problem in molecular replacement trials. Finally, we show that all-atom refinement can produce de novo protein structure predictions that reach the high accuracy required for molecular replacement without any experimental phase information and in the absence of templates suitable for molecular replacement from the Protein Data Bank. These results suggest that the combination of high-resolution structure prediction with state-of-the-art phasing tools may be unexpectedly powerful in phasing crystallographic data for which molecular replacement is hindered by the absence of sufficiently accurate previous models.
Collapse
Affiliation(s)
- Bin Qian
- University of Washington, Department of Biochemistry and Howard Hughes Medical Institute, Box 357350, Seattle 98195, USA
| | - Srivatsan Raman
- University of Washington, Department of Biochemistry and Howard Hughes Medical Institute, Box 357350, Seattle 98195, USA
| | - Rhiju Das
- University of Washington, Department of Biochemistry and Howard Hughes Medical Institute, Box 357350, Seattle 98195, USA
| | - Philip Bradley
- University of Washington, Department of Biochemistry and Howard Hughes Medical Institute, Box 357350, Seattle 98195, USA
| | - Airlie J. McCoy
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Hills Road, Cambridge, CB2 0XY
| | - Randy J. Read
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Hills Road, Cambridge, CB2 0XY
| | - David Baker
- University of Washington, Department of Biochemistry and Howard Hughes Medical Institute, Box 357350, Seattle 98195, USA
| |
Collapse
|
75
|
van Hemert FJ, Zaaijer HL, Berkhout B, Lukashov VV. Mosaic amino acid conservation in 3D-structures of surface protein and polymerase of hepatitis B virus. Virology 2007; 370:362-72. [PMID: 17935747 DOI: 10.1016/j.virol.2007.08.036] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2007] [Revised: 07/31/2007] [Accepted: 08/25/2007] [Indexed: 12/17/2022]
Abstract
Surface protein and polymerase of hepatitis B virus provide a striking example of gene overlap. Inclusion of more coding constraints in the phylogenetic analysis forces the tree toward accepted topology. Three-dimensional protein modeling demonstrates that participation in local protein function underlies the observed mosaic patterns of amino acid conservation and variability. Conserved amino acid residues of polymerase were typically clustered at the catalytic core marked by the YMDD motif. The proposed tertiary structure of surface protein displayed the expected transmembrane helices in a 2-domain constellation. Conserved amino acids like, for instance, cysteine residues are involved in the spatial orientation of the two domains, the exposed location of the a-determinant and the dimer formation of surface protein. By means of computational alanine replacement scanning, we demonstrated that the interfaces between domains in monomeric surface protein, between the monomers in dimeric surface protein and in a capsid-surface protein complex mainly consist of relatively well-conserved amino acid residues.
Collapse
Affiliation(s)
- Formijn J van Hemert
- Laboratory of Experimental Virology, Department of Medical Microbiology, Center for Infection and Immunity Amsterdam (CINIMA), Academic Medical Center, University of Amsterdam, Meibergdreef 15, 1105 AZ Amsterdam, The Netherlands.
| | | | | | | |
Collapse
|
76
|
Becker E, Cotillard A, Meyer V, Madaoui H, Guérois R. HMM-Kalign: a tool for generating sub-optimal HMM alignments. ACTA ACUST UNITED AC 2007; 23:3095-7. [PMID: 17921492 DOI: 10.1093/bioinformatics/btm492] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Recent development of strategies using multiple sequence alignments (MSA) or profiles to detect remote homologies between proteins has led to a significant increase in the number of proteins whose structures can be generated by comparative modeling methods. However, prediction of the optimal alignment between these highly divergent homologous proteins remains a difficult issue. We present a tool based on a generalized Viterbi algorithm that generates optimal and sub-optimal alignments between a sequence and a Hidden Markov Model. The tool is implemented as a new function within the HMMER package called hmmkalign.
Collapse
Affiliation(s)
- Emmanuelle Becker
- CEA, iBiTecS, URA 2096, SBSM, Laboratoire de Biologie Structurale et Radiobiologie, Gif sur Yvette, F-91191 France
| | | | | | | | | |
Collapse
|
77
|
Pukatzki S, Ma AT, Revel AT, Sturtevant D, Mekalanos JJ. Type VI secretion system translocates a phage tail spike-like protein into target cells where it cross-links actin. Proc Natl Acad Sci U S A 2007; 104:15508-13. [PMID: 17873062 PMCID: PMC2000545 DOI: 10.1073/pnas.0706532104] [Citation(s) in RCA: 543] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Genes encoding type VI secretion systems (T6SS) are widely distributed in pathogenic Gram-negative bacterial species. In Vibrio cholerae, T6SS have been found to secrete three related proteins extracellularly, VgrG-1, VgrG-2, and VgrG-3. VgrG-1 can covalently cross-link actin in vitro, and this activity was used to demonstrate that V. cholerae can translocate VgrG-1 into macrophages by a T6SS-dependent mechanism. Protein structure search algorithms predict that VgrG-related proteins likely assemble into a trimeric complex that is analogous to that formed by the two trimeric proteins gp27 and gp5 that make up the baseplate "tail spike" of Escherichia coli bacteriophage T4. VgrG-1 was shown to interact with itself, VgrG-2, and VgrG-3, suggesting that such a complex does form. Because the phage tail spike protein complex acts as a membrane-penetrating structure as well as a conduit for the passage of DNA into phage-infected cells, we propose that the VgrG components of the T6SS apparatus may assemble a "cell-puncturing device" analogous to phage tail spikes to deliver effector protein domains through membranes of target host cells.
Collapse
Affiliation(s)
- Stefan Pukatzki
- Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115
| | - Amy T. Ma
- Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115
| | - Andrew T. Revel
- Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115
| | - Derek Sturtevant
- Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115
| | - John J. Mekalanos
- Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115
- *To whom correspondence should be addressed. E-mail:
| |
Collapse
|
78
|
Haft RJF, Gachelet EG, Nguyen T, Toussaint L, Chivian D, Traxler B. In vivo oligomerization of the F conjugative coupling protein TraD. J Bacteriol 2007; 189:6626-34. [PMID: 17631633 PMCID: PMC2045173 DOI: 10.1128/jb.00513-07] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2007] [Accepted: 07/05/2007] [Indexed: 11/20/2022] Open
Abstract
Type IV secretory systems are a group of bacterial transporters responsible for the transport of proteins and nucleic acids directly into recipient cells. Such systems play key roles in the virulence of some pathogenic organisms and in conjugation-mediated horizontal gene transfer. Many type IV systems require conserved "coupling proteins," transmembrane polypeptides that are critical for transporting secreted substrates across the cytoplasmic membrane of the bacterium. In vitro evidence suggests that the functional form of coupling proteins is a homohexameric, ring-shaped complex. Using a library of tagged mutants, we investigated the structural and functional organization of the F plasmid conjugative coupling protein TraD by coimmunoprecipitation, cross-linking, and genetic means. We present direct evidence that coupling proteins form stable oligomeric complexes in the membranes of bacteria and that the formation of some of these complexes requires other F-encoded functions. Our data also show that different regions of TraD play distinct roles in the oligomerization process. We postulate a model for in vivo oligomerization and discuss the probable participation of individual domains of TraD in each step.
Collapse
Affiliation(s)
- Rembrandt J F Haft
- Department of Microbiology, University of Washington, Box 357242, Seattle, WA 98195- 7242, USA
| | | | | | | | | | | |
Collapse
|
79
|
Norman PJ, Abi-Rached L, Gendzekhadze K, Korbel D, Gleimer M, Rowley D, Bruno D, Carrington CVF, Chandanayingyong D, Chang YH, Crespí C, Saruhan-Direskeneli G, Fraser PA, Hameed K, Kamkamidze G, Koram KA, Layrisse Z, Matamoros N, Milà J, Park MH, Pitchappan RM, Ramdath DD, Shiau MY, Stephens HAF, Struik S, Verity DH, Vaughan RW, Tyan D, Davis RW, Riley EM, Ronaghi M, Parham P. Unusual selection on the KIR3DL1/S1 natural killer cell receptor in Africans. Nat Genet 2007; 39:1092-9. [PMID: 17694054 DOI: 10.1038/ng2111] [Citation(s) in RCA: 181] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2007] [Accepted: 07/06/2007] [Indexed: 12/12/2022]
Abstract
Interactions of killer cell immunoglobulin-like receptors (KIRs) with major histocompatibility complex (MHC) class I ligands diversify natural killer cell responses to infection. By analyzing sequence variation in diverse human populations, we show that the KIR3DL1/S1 locus encodes two lineages of polymorphic inhibitory KIR3DL1 allotypes that recognize Bw4 epitopes of protein">HLA-A and HLA-B and one lineage of conserved activating KIR3DS1 allotypes, also implicated in Bw4 recognition. Balancing selection has maintained these three lineages for over 3 million years. Variation was selected at D1 and D2 domain residues that contact HLA class I and at two sites on D0, the domain that enhances the binding of KIR3D to HLA class I. HLA-B variants that gained Bw4 through interallelic microconversion are also products of selection. A worldwide comparison uncovers unusual KIR3DL1/S1 evolution in modern sub-Saharan Africans. Balancing selection is weak and confined to D0, KIR3DS1 is rare and KIR3DL1 allotypes with similar binding sites predominate. Natural killer cells express the dominant KIR3DL1 at a high frequency and with high surface density, providing strong responses to cells perturbed in Bw4 expression.
Collapse
Affiliation(s)
- Paul J Norman
- Department of Structural Biology, Stanford University School of Medicine, Stanford, California 94305, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
80
|
Zhao L, Bonocora RP, Shub DA, Stoddard BL. The restriction fold turns to the dark side: a bacterial homing endonuclease with a PD-(D/E)-XK motif. EMBO J 2007; 26:2432-42. [PMID: 17410205 PMCID: PMC1864971 DOI: 10.1038/sj.emboj.7601672] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2007] [Accepted: 03/09/2007] [Indexed: 11/08/2022] Open
Abstract
The homing endonuclease I-Ssp6803I causes the insertion of a group I intron into a bacterial tRNA gene-the only example of an invasive mobile intron within a bacterial genome. Using a computational fold prediction, mutagenic screen and crystal structure determination, we demonstrate that this protein is a tetrameric PD-(D/E)-XK endonuclease - a fold normally used to protect a bacterial genome from invading DNA through the action of restriction endonucleases. I-Ssp6803I uses its tetrameric assembly to promote recognition of a single long target site, whereas restriction endonuclease tetramers facilitate cooperative binding and cleavage of two short sites. The limited use of the PD-(D/E)-XK nucleases by mobile introns stands in contrast to their frequent use of LAGLIDADG and HNH endonucleases - which in turn, are rarely incorporated into restriction/modification systems.
Collapse
Affiliation(s)
- Lei Zhao
- Graduate Program in Molecular Biophysics, Structure and Design, University of Washington, Seattle, WA, USA
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle WA, USA
| | - Richard P Bonocora
- Department of Biological Sciences and Center for Molecular Genetics, University at Albany, State University of New York, Albany, NY, USA
| | - David A Shub
- Department of Biological Sciences and Center for Molecular Genetics, University at Albany, State University of New York, Albany, NY, USA
| | - Barry L Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle WA, USA
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N. A3-025, Seattle, WA 98109, USA. Tel.: +1 206 667 4031; Fax: +1 206 667 3331; E-mail:
| |
Collapse
|
81
|
Das R, Qian B, Raman S, Vernon R, Thompson J, Bradley P, Khare S, Tyka MD, Bhat D, Chivian D, Kim DE, Sheffler WH, Malmström L, Wollacott AM, Wang C, Andre I, Baker D. Structure prediction for CASP7 targets using extensive all-atom refinement with Rosetta@home. Proteins 2007; 69 Suppl 8:118-28. [PMID: 17894356 DOI: 10.1002/prot.21636] [Citation(s) in RCA: 156] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We describe predictions made using the Rosetta structure prediction methodology for both template-based modeling and free modeling categories in the Seventh Critical Assessment of Techniques for Protein Structure Prediction. For the first time, aggressive sampling and all-atom refinement could be carried out for the majority of targets, an advance enabled by the Rosetta@home distributed computing network. Template-based modeling predictions using an iterative refinement algorithm improved over the best existing templates for the majority of proteins with less than 200 residues. Free modeling methods gave near-atomic accuracy predictions for several targets under 100 residues from all secondary structure classes. These results indicate that refinement with an all-atom energy function, although computationally expensive, is a powerful method for obtaining accurate structure predictions.
Collapse
Affiliation(s)
- Rhiju Das
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|