101
|
Alexander NS, Stein RA, Koteiche HA, Kaufmann KW, Mchaourab HS, Meiler J. RosettaEPR: rotamer library for spin label structure and dynamics. PLoS One 2013; 8:e72851. [PMID: 24039810 PMCID: PMC3764097 DOI: 10.1371/journal.pone.0072851] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2013] [Accepted: 07/15/2013] [Indexed: 11/18/2022] Open
Abstract
An increasingly used parameter in structural biology is the measurement of distances between spin labels bound to a protein. One limitation to these measurements is the unknown position of the spin label relative to the protein backbone. To overcome this drawback, we introduce a rotamer library of the methanethiosulfonate spin label (MTSSL) into the protein modeling program Rosetta. Spin label rotamers were derived from conformations observed in crystal structures of spin labeled T4 lysozyme and previously published molecular dynamics simulations. Rosetta’s ability to accurately recover spin label conformations and EPR measured distance distributions was evaluated against 19 experimentally determined MTSSL labeled structures of T4 lysozyme and the membrane protein LeuT and 73 distance distributions from T4 lysozyme and the membrane protein MsbA. For a site in the core of T4 lysozyme, the correct spin label conformation (Χ1 and Χ2) is recovered in 99.8% of trials. In surface positions 53% of the trajectories agree with crystallized conformations in Χ1 and Χ2. This level of recovery is on par with Rosetta performance for the 20 natural amino acids. In addition, Rosetta predicts the distance between two spin labels with a mean error of 4.4 Å. The width of the experimental distance distribution, which reflects the flexibility of the two spin labels, is predicted with a mean error of 1.3 Å. RosettaEPR makes full-atom spin label modeling available to a wide scientific community in conjunction with the powerful suite of modeling methods within Rosetta.
Collapse
Affiliation(s)
- Nathan S. Alexander
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Richard A. Stein
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Hanane A. Koteiche
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Kristian W. Kaufmann
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Hassane S. Mchaourab
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
102
|
Adolf-Bryfogle J, Dunbrack Jr. RL. The PyRosetta Toolkit: a graphical user interface for the Rosetta software suite. PLoS One 2013; 8:e66856. [PMID: 23874400 PMCID: PMC3706480 DOI: 10.1371/journal.pone.0066856] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 05/11/2013] [Indexed: 01/25/2023] Open
Abstract
The Rosetta Molecular Modeling suite is a command-line-only collection of applications that enable high-resolution modeling and design of proteins and other molecules. Although extremely useful, Rosetta can be difficult to learn for scientists with little computational or programming experience. To that end, we have created a Graphical User Interface (GUI) for Rosetta, called the PyRosetta Toolkit, for creating and running protocols in Rosetta for common molecular modeling and protein design tasks and for analyzing the results of Rosetta calculations. The program is highly extensible so that developers can add new protocols and analysis tools to the PyRosetta Toolkit GUI.
Collapse
Affiliation(s)
- Jared Adolf-Bryfogle
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, Pennsylvania, United States of America
- Drexel University College of Medicine, Program in Molecular and Cell Biology and Genetics, Philadelphia, Pennsylvania, United States of America
| | - Roland L. Dunbrack Jr.
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
103
|
Combs SA, Deluca SL, Deluca SH, Lemmon GH, Nannemann DP, Nguyen ED, Willis JR, Sheehan JH, Meiler J. Small-molecule ligand docking into comparative models with Rosetta. Nat Protoc 2013; 8:1277-98. [PMID: 23744289 DOI: 10.1038/nprot.2013.074] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Structure-based drug design is frequently used to accelerate the development of small-molecule therapeutics. Although substantial progress has been made in X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy, the availability of high-resolution structures is limited owing to the frequent inability to crystallize or obtain sufficient NMR restraints for large or flexible proteins. Computational methods can be used to both predict unknown protein structures and model ligand interactions when experimental data are unavailable. This paper describes a comprehensive and detailed protocol using the Rosetta modeling suite to dock small-molecule ligands into comparative models. In the protocol presented here, we review the comparative modeling process, including sequence alignment, threading and loop building. Next, we cover docking a small-molecule ligand into the protein comparative model. In addition, we discuss criteria that can improve ligand docking into comparative models. Finally, and importantly, we present a strategy for assessing model quality. The entire protocol is presented on a single example selected solely for didactic purposes. The results are therefore not representative and do not replace benchmarks published elsewhere. We also provide an additional tutorial so that the user can gain hands-on experience in using Rosetta. The protocol should take 5-7 h, with additional time allocated for computer generation of models.
Collapse
Affiliation(s)
- Steven A Combs
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
104
|
Stein A, Kortemme T. Improvements to robotics-inspired conformational sampling in rosetta. PLoS One 2013; 8:e63090. [PMID: 23704889 PMCID: PMC3660577 DOI: 10.1371/journal.pone.0063090] [Citation(s) in RCA: 140] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 03/28/2013] [Indexed: 02/04/2023] Open
Abstract
To accurately predict protein conformations in atomic detail, a computational method must be capable of sampling models sufficiently close to the native structure. All-atom sampling is difficult because of the vast number of possible conformations and extremely rugged energy landscapes. Here, we test three sampling strategies to address these difficulties: conformational diversification, intensification of torsion and omega-angle sampling and parameter annealing. We evaluate these strategies in the context of the robotics-based kinematic closure (KIC) method for local conformational sampling in Rosetta on an established benchmark set of 45 12-residue protein segments without regular secondary structure. We quantify performance as the fraction of sub-Angstrom models generated. While improvements with individual strategies are only modest, the combination of intensification and annealing strategies into a new “next-generation KIC” method yields a four-fold increase over standard KIC in the median percentage of sub-Angstrom models across the dataset. Such improvements enable progress on more difficult problems, as demonstrated on longer segments, several of which could not be accurately remodeled with previous methods. Given its improved sampling capability, next-generation KIC should allow advances in other applications such as local conformational remodeling of multiple segments simultaneously, flexible backbone sequence design, and development of more accurate energy functions.
Collapse
Affiliation(s)
- Amelie Stein
- California Institute for Quantitative Biomedical Research and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (AS); (TK)
| | - Tanja Kortemme
- California Institute for Quantitative Biomedical Research and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (AS); (TK)
| |
Collapse
|
105
|
Using the unfolded state as the reference state improves the performance of statistical potentials. Biophys J 2013. [PMID: 23199923 DOI: 10.1016/j.bpj.2012.09.023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Distance-dependent statistical potentials are an important class of energy functions extensively used in modeling protein structures and energetics. These potentials are obtained by statistically analyzing the proximity of atoms in all combinatorial amino-acid pairs in proteins with known structures. In model evaluation, the statistical potential is usually subtracted by the value of a reference state for better selectivity. An ideal reference state should include the general chemical properties of polypeptide chains so that only the unique factors stabilizing the native structures are retained after calibrating on reference state. However, reference states available as of this writing rarely model specific chemical constraints of peptide bonds and therefore poorly reflect the behavior of polypeptide chains. In this work, we proposed a statistical potential based on unfolded state ensemble (SPOUSE), where the reference state is summarized from the unfolded state ensembles of proteins produced according to the statistical coil model. Due to its better representation of the features of polypeptides, SPOUSE outperforms three of the most widely used distance-dependent potentials not only in native conformation identification, but also in the selection of close-to-native models and correlation coefficients between energy and model error. Furthermore, SPOUSE shows promising possibility of further improvement by integration with the orientation-dependent side-chain potentials.
Collapse
|
106
|
Yagi H, Pilla KB, Maleckis A, Graham B, Huber T, Otting G. Three-dimensional protein fold determination from backbone amide pseudocontact shifts generated by lanthanide tags at multiple sites. Structure 2013; 21:883-90. [PMID: 23643949 DOI: 10.1016/j.str.2013.04.001] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Revised: 03/28/2013] [Accepted: 04/01/2013] [Indexed: 01/15/2023]
Abstract
Site-specific attachment of paramagnetic lanthanide ions to a protein generates pseudocontact shifts (PCS) in the nuclear magnetic resonance (NMR) spectra of the protein that are easily measured as changes in chemical shifts. By labeling the protein with lanthanide tags at four different sites, PCSs are observed for most amide protons and accurate information is obtained about their coordinates in three-dimensional space. The approach is demonstrated with the chaperone ERp29, for which large differences have been reported between X-ray and NMR structures of the C-terminal domain, ERp29-C. The results unambiguously show that the structure of rat ERp29-C in solution is similar to the crystal structure of human ERp29-C. PCSs of backbone amides were the only structural restraints required. Because these can be measured for more dilute protein solutions than other NMR restraints, the approach greatly widens the range of proteins amenable to structural studies in solution.
Collapse
Affiliation(s)
- Hiromasa Yagi
- Research School of Chemistry, Australian National University, Canberra, ACT 0200, Australia
| | | | | | | | | | | |
Collapse
|
107
|
Borysik AJ, Hewitt DJ, Robinson CV. Detergent release prolongs the lifetime of native-like membrane protein conformations in the gas-phase. J Am Chem Soc 2013; 135:6078-83. [PMID: 23521660 DOI: 10.1021/ja401736v] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Recent studies have suggested that detergents can protect the structure of membrane proteins during their transition from solution to the gas-phase. Here we provide mechanistic insights into this process by interrogating the structures of membrane protein-detergent assemblies in the gas-phase using ion mobility mass spectrometry. We show a clear correlation between the population of native-like protein conformations and the degree of detergent attachment to the protein in the gas-phase. Interrogation of these protein-detergent assemblies, by tandem mass spectrometry, enables us to define the mechanism by which detergents preserve native-like protein conformations in a solvent free environment. We show that the release of detergent is more central to the survival of these conformations than the physical presence of detergent bound to the protein. We propose that detergent release competes with structural collapse for the internal energy of the ion and permits the observation of transient native-like membrane protein conformations that are otherwise lost to structural rearrangement in the gas-phase.
Collapse
Affiliation(s)
- Antoni J Borysik
- Chemistry Research Laboratory, South Parks Road, University of Oxford, Oxford OX1 3QY, United Kingdom
| | | | | |
Collapse
|
108
|
Druggable protein interaction sites are more predisposed to surface pocket formation than the rest of the protein surface. PLoS Comput Biol 2013; 9:e1002951. [PMID: 23505360 PMCID: PMC3591273 DOI: 10.1371/journal.pcbi.1002951] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Accepted: 01/11/2013] [Indexed: 01/22/2023] Open
Abstract
Despite intense interest and considerable effort via high-throughput screening, there are few examples of small molecules that directly inhibit protein-protein interactions. This suggests that many protein interaction surfaces may not be intrinsically “druggable” by small molecules, and elevates in importance the few successful examples as model systems for improving our fundamental understanding of druggability. Here we describe an approach for exploring protein fluctuations enriched in conformations containing surface pockets suitable for small molecule binding. Starting from a set of seven unbound protein structures, we find that the presence of low-energy pocket-containing conformations is indeed a signature of druggable protein interaction sites and that analogous surface pockets are not formed elsewhere on the protein. We further find that ensembles of conformations generated with this biased approach structurally resemble known inhibitor-bound structures more closely than equivalent ensembles of unbiased conformations. Collectively these results suggest that “druggability” is a property encoded on a protein surface through its propensity to form pockets, and inspire a model in which the crude features of the predisposed pocket(s) restrict the range of complementary ligands; additional smaller conformational changes then respond to details of a particular ligand. We anticipate that the insights described here will prove useful in selecting protein targets for therapeutic intervention. Identifying small-molecule inhibitors of protein interactions has traditionally presented a challenge for modern screening methods, despite interest stemming from the fact that such interactions comprise the underlying mechanisms for cell proliferation, differentiation, and survival. This suggests that many protein interaction surfaces may not be intrinsically “druggable” by small molecules, and elevates in importance the few successful examples as model systems for improving our understanding of factors contributing to druggability. Here we describe a new approach for exploring protein fluctuations leading to surface pockets suitable for small molecule binding. We find that the presence of such pockets is indeed a signature of druggable protein interaction sites, suggesting that “druggability” is a property encoded on a protein surface through its propensity to form pockets. We anticipate that the insights described here will prove useful in selecting protein targets for therapeutic intervention.
Collapse
|
109
|
Adams PD, Baker D, Brunger AT, Das R, DiMaio F, Read RJ, Richardson DC, Richardson JS, Terwilliger TC. Advances, interactions, and future developments in the CNS, Phenix, and Rosetta structural biology software systems. Annu Rev Biophys 2013; 42:265-87. [PMID: 23451892 DOI: 10.1146/annurev-biophys-083012-130253] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Advances in our understanding of macromolecular structure come from experimental methods, such as X-ray crystallography, and also computational analysis of the growing number of atomic models obtained from such experiments. The later analyses have made it possible to develop powerful tools for structure prediction and optimization in the absence of experimental data. In recent years, a synergy between these computational methods for crystallographic structure determination and structure prediction and optimization has begun to be exploited. We review some of the advances in the algorithms used for crystallographic structure determination in the Phenix and Crystallography & NMR System software packages and describe how methods from ab initio structure prediction and refinement in Rosetta have been applied to challenging crystallographic problems. The prospects for future improvement of these methods are discussed.
Collapse
Affiliation(s)
- Paul D Adams
- Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
110
|
Bentzien J, Muegge I, Hamner B, Thompson DC. Crowd computing: using competitive dynamics to develop and refine highly predictive models. Drug Discov Today 2013; 18:472-8. [PMID: 23337388 DOI: 10.1016/j.drudis.2013.01.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2012] [Revised: 12/20/2012] [Accepted: 01/03/2013] [Indexed: 12/16/2022]
Abstract
A recent application of a crowd computing platform to develop highly predictive in silico models for use in the drug discovery process is described. The platform, Kaggle™, exploits a competitive dynamic that results in model optimization as the competition unfolds. Here, this dynamic is described in detail and compared with more-conventional modeling strategies. The complete and full structure of the underlying dataset is disclosed and some thoughts as to the broader utility of such 'gamification' approaches to the field of modeling are offered.
Collapse
Affiliation(s)
- Jörg Bentzien
- Boehringer Ingelheim Pharmaceuticals, 900 Ridgebury Road, Ridgefield, CT 06877, USA
| | | | | | | |
Collapse
|
111
|
Fadouloglou VE, Kapanidou M, Agiomirgianaki A, Arnaouteli S, Bouriotis V, Glykos NM, Kokkinidis M. Structure determination through homology modelling and torsion-angle simulated annealing: application to a polysaccharide deacetylase from Bacillus cereus. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:276-83. [PMID: 23385463 DOI: 10.1107/s0907444912045829] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Accepted: 11/06/2012] [Indexed: 11/10/2022]
Abstract
The structure of BC0361, a polysaccharide deacetylase from Bacillus cereus, has been determined using an unconventional molecular-replacement procedure. Tens of putative models of the C-terminal domain of the protein were constructed using a multitude of homology-modelling algorithms, and these were tested for the presence of signal in molecular-replacement calculations. Of these, only the model calculated by the SAM-T08 server gave a consistent and convincing solution, but the resulting model was too inaccurate to allow phase determination to proceed to completion. The application of slow-cooling torsion-angle simulated annealing (started from a very high temperature) drastically improved this initial model to the point of allowing phasing through cycles of model building and refinement to be initiated. The structure of the protein is presented with emphasis on the presence of a C(α)-modified proline at its active site, which was modelled as an α-hydroxy-L-proline.
Collapse
Affiliation(s)
- Vasiliki E Fadouloglou
- Department of Molecular Biology and Genetics, Democritus University of Thrace, University Campus, 68100 Alexandroupolis, Greece
| | | | | | | | | | | | | |
Collapse
|
112
|
Bhattacharya D, Cheng J. 3Drefine: consistent protein structure refinement by optimizing hydrogen bonding network and atomic-level energy minimization. Proteins 2013; 81:119-31. [PMID: 22927229 PMCID: PMC3634918 DOI: 10.1002/prot.24167] [Citation(s) in RCA: 130] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Revised: 07/26/2012] [Accepted: 08/17/2012] [Indexed: 12/27/2022]
Abstract
One of the major limitations of computational protein structure prediction is the deviation of predicted models from their experimentally derived true, native structures. The limitations often hinder the possibility of applying computational protein structure prediction methods in biochemical assignment and drug design that are very sensitive to structural details. Refinement of these low-resolution predicted models to high-resolution structures close to the native state, however, has proven to be extremely challenging. Thus, protein structure refinement remains a largely unsolved problem. Critical assessment of techniques for protein structure prediction (CASP) specifically indicated that most predictors participating in the refinement category still did not consistently improve model quality. Here, we propose a two-step refinement protocol, called 3Drefine, to consistently bring the initial model closer to the native structure. The first step is based on optimization of hydrogen bonding (HB) network and the second step applies atomic-level energy minimization on the optimized model using a composite physics and knowledge-based force fields. The approach has been evaluated on the CASP benchmark data and it exhibits consistent improvement over the initial structure in both global and local structural quality measures. 3Drefine method is also computationally inexpensive, consuming only few minutes of CPU time to refine a protein of typical length (300 residues). 3Drefine web server is freely available at http://sysbio.rnet.missouri.edu/3Drefine/.
Collapse
Affiliation(s)
| | - Jianlin Cheng
- Department of Computer Science, University of Missouri, Columbia, MO 65211, USA
- Informatics Institute, University of Missouri, Columbia, MO 65211, USA
- Bond Life Science Center, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
113
|
Tyka MD, Jung K, Baker D. Efficient sampling of protein conformational space using fast loop building and batch minimization on highly parallel computers. J Comput Chem 2012; 33:2483-91. [PMID: 22847521 PMCID: PMC3760475 DOI: 10.1002/jcc.23069] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2012] [Revised: 05/30/2012] [Accepted: 06/24/2012] [Indexed: 12/22/2022]
Abstract
All-atom sampling is a critical and compute-intensive end stage to protein structural modeling. Because of the vast size and extreme ruggedness of conformational space, even close to the native structure, the high-resolution sampling problem is almost as difficult as predicting the rough fold of a protein. Here, we present a combination of new algorithms that considerably speed up the exploration of very rugged conformational landscapes and are capable of finding heretofore hidden low-energy states. The algorithm is based on a hierarchical workflow and can be parallelized on supercomputers with up to 128,000 compute cores with near perfect efficiency. Such scaling behavior is notable, as with Moore's law continuing only in the number of cores per chip, parallelizability is a critical property of new algorithms. Using the enhanced sampling power, we have uncovered previously invisible deficiencies in the Rosetta force field and created an extensive decoy training set for optimizing and testing force fields.
Collapse
Affiliation(s)
- Michael D Tyka
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, USA.
| | | | | |
Collapse
|
114
|
Karakaş M, Woetzel N, Staritzbichler R, Alexander N, Weiner BE, Meiler J. BCL::Fold--de novo prediction of complex and large protein topologies by assembly of secondary structure elements. PLoS One 2012; 7:e49240. [PMID: 23173050 PMCID: PMC3500284 DOI: 10.1371/journal.pone.0049240] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Accepted: 10/07/2012] [Indexed: 01/10/2023] Open
Abstract
Computational de novo protein structure prediction is limited to small proteins of simple topology. The present work explores an approach to extend beyond the current limitations through assembling protein topologies from idealized α-helices and β-strands. The algorithm performs a Monte Carlo Metropolis simulated annealing folding simulation. It optimizes a knowledge-based potential that analyzes radius of gyration, β-strand pairing, secondary structure element (SSE) packing, amino acid pair distance, amino acid environment, contact order, secondary structure prediction agreement and loop closure. Discontinuation of the protein chain favors sampling of non-local contacts and thereby creation of complex protein topologies. The folding simulation is accelerated through exclusion of flexible loop regions further reducing the size of the conformational search space. The algorithm is benchmarked on 66 proteins with lengths between 83 and 293 amino acids. For 61 out of these proteins, the best SSE-only models obtained have an RMSD100 below 8.0 Å and recover more than 20% of the native contacts. The algorithm assembles protein topologies with up to 215 residues and a relative contact order of 0.46. The method is tailored to be used in conjunction with low-resolution or sparse experimental data sets which often provide restraints for regions of defined secondary structure.
Collapse
Affiliation(s)
- Mert Karakaş
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Nils Woetzel
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Rene Staritzbichler
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Nathan Alexander
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Brian E. Weiner
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Jens Meiler
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
115
|
Bibby J, Keegan RM, Mayans O, Winn MD, Rigden DJ. AMPLE: a cluster-and-truncate approach to solve the crystal structures of small proteins using rapidly computed ab initio models. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:1622-31. [PMID: 23151627 DOI: 10.1107/s0907444912039194] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2012] [Accepted: 09/13/2012] [Indexed: 11/10/2022]
Abstract
Protein ab initio models predicted from sequence data alone can enable the elucidation of crystal structures by molecular replacement. However, the calculation of such ab initio models is typically computationally expensive. Here, a computational pipeline based on the clustering and truncation of cheaply obtained ab initio models for the preparation of structure ensembles is described. Clustering is used to select models and to quantitatively predict their local accuracy, allowing rational truncation of predicted inaccurate regions. The resulting ensembles, with or without rapidly added side chains, solved 43% of all test cases, with an 80% success rate for all-α proteins. A program implementing this approach, AMPLE, is included in the CCP4 suite of programs. It only requires the input of a FASTA sequence file and a diffraction data file. It carries out the modelling using locally installed Rosetta, creates search ensembles and automatically performs molecular replacement and model rebuilding.
Collapse
Affiliation(s)
- Jaclyn Bibby
- Institute of Integrative Biology, University of Liverpool, Liverpool, England
| | | | | | | | | |
Collapse
|
116
|
Anderson JS, Hernández G, LeMaster DM. Assessing the chemical accuracy of protein structures via peptide acidity. Biophys Chem 2012. [PMID: 23182463 DOI: 10.1016/j.bpc.2012.10.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Although the protein native state is a Boltzmann conformational ensemble, practical applications often require a representative model from the most populated region of that distribution. The acidity of the backbone amides, as reflected in hydrogen exchange rates, is exquisitely sensitive to the surrounding charge and dielectric volume distribution. For each of four proteins, three independently determined X-ray structures of differing crystallographic resolution were used to predict exchange for the static solvent-exposed amide hydrogens. The average correlation coefficients range from 0.74 for ubiquitin to 0.93 for Pyrococcus furiosus rubredoxin, reflecting the larger range of experimental exchange rates exhibited by the latter protein. The exchange prediction errors modestly correlate with the crystallographic resolution. MODELLER 9v6-derived homology models at ~60% sequence identity (36% identity for chymotrypsin inhibitor CI2) yielded correlation coefficients that are ~0.1 smaller than for the cognate X-ray structures. The most recently deposited NOE-based ubiquitin structure and the original NMR structure of CI2 fail to provide statistically significant predictions of hydrogen exchange. However, the more recent RECOORD refinement study of CI2 yielded predictions comparable to the X-ray and homology model-based analyses.
Collapse
Affiliation(s)
- Janet S Anderson
- Department of Chemistry, Union College, Schenectady, New York 12308, USA
| | | | | |
Collapse
|
117
|
Shrestha R, Simoncini D, Zhang KYJ. Error-estimation-guided rebuilding ofde novomodels increases the success rate ofab initiophasing. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:1522-34. [DOI: 10.1107/s0907444912037961] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2012] [Accepted: 09/04/2012] [Indexed: 11/10/2022]
Abstract
Recent advancements in computational methods for protein-structure prediction have made it possible to generate the high-qualityde novomodels required forab initiophasing of crystallographic diffraction data using molecular replacement. Despite those encouraging achievements inab initiophasing usingde novomodels, its success is limited only to those targets for which high-qualityde novomodels can be generated. In order to increase the scope of targets to whichab initiophasing withde novomodels can be successfully applied, it is necessary to reduce the errors in thede novomodels that are used as templates for molecular replacement. Here, an approach is introduced that can identify and rebuild the residues with larger errors, which subsequently reduces the overall Cαroot-mean-square deviation (CA-RMSD) from the native protein structure. The error in a predicted model is estimated from the average pairwise geometric distance per residue computed among selected lowest energy coarse-grained models. This score is subsequently employed to guide a rebuilding process that focuses on more error-prone residues in the coarse-grained models. This rebuilding methodology has been tested on ten protein targets that were unsuccessful using previous methods. The average CA-RMSD of the coarse-grained models was improved from 4.93 to 4.06 Å. For those models with CA-RMSD less than 3.0 Å, the average CA-RMSD was improved from 3.38 to 2.60 Å. These rebuilt coarse-grained models were then converted into all-atom models and refined to produce improvedde novomodels for molecular replacement. Seven diffraction data sets were successfully phased using rebuiltde novomodels, indicating the improved quality of these rebuiltde novomodels and the effectiveness of the rebuilding process. Software implementing this method, calledMORPHEUS, can be downloaded from http://www.riken.jp/zhangiru/software.html.
Collapse
|
118
|
Chaikind B, Kilambi KP, Gray JJ, Ostermeier M. Targeted DNA methylation using an artificially bisected M.HhaI fused to zinc fingers. PLoS One 2012; 7:e44852. [PMID: 22984575 PMCID: PMC3439449 DOI: 10.1371/journal.pone.0044852] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Accepted: 08/08/2012] [Indexed: 11/18/2022] Open
Abstract
Little is known about the effects of single DNA methylation events on gene transcription. The ability to direct the methylation toward a single unique site within a genome would have broad use as a tool to study the effects of specific epigenetic changes on transcription. A targeted enzyme might also be useful in a therapy for diseases with an epigenetic component or as a means to site-specifically label DNA. Previous studies have sought to target methyltransferase activity by fusing DNA binding proteins to methyltransferases. However, the methyltransferase domain remains active even when the DNA binding protein is unbound, resulting in significant off-target methylation. A better strategy would make methyltransferase activity contingent upon the DNA binding protein’s association with its DNA binding site. We have designed targeted methyltransferases by fusing zinc fingers to the fragments of artificially-bisected, assembly-compromised methyltransferases. The zinc fingers’ binding sites flank the desired target site for methylation. Zinc finger binding localizes the two fragments near each other encouraging their assembly only over the desired site. Through a combination of molecular modeling and experimental optimization in E. coli, we created an engineered methyltransferase derived from M.HhaI with 50–60% methylation at a target site and nearly undetectable levels of methylation at a non-target M.HhaI site (1.4±2.4%). Using a restriction digestion assay, we demonstrate that localization of both fragments synergistically increases methylation at the target site, illustrating the promise of our approach.
Collapse
Affiliation(s)
- Brian Chaikind
- Chemistry-Biology Interface Graduate Program, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Krishna Praneeth Kilambi
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|
119
|
Terwilliger TC, Read RJ, Adams PD, Brunger AT, Afonine PV, Grosse-Kunstleve RW, Hung LW. Improved crystallographic models through iterated local density-guided model deformation and reciprocal-space refinement. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:861-70. [PMID: 22751672 PMCID: PMC3388814 DOI: 10.1107/s0907444912015636] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2011] [Accepted: 04/10/2012] [Indexed: 11/14/2022]
Abstract
An approach is presented for addressing the challenge of model rebuilding after molecular replacement in cases where the placed template is very different from the structure to be determined. The approach takes advantage of the observation that a template and target structure may have local structures that can be superimposed much more closely than can their complete structures. A density-guided procedure for deformation of a properly placed template is introduced. A shift in the coordinates of each residue in the structure is calculated based on optimizing the match of model density within a 6 Å radius of the center of that residue with a prime-and-switch electron-density map. The shifts are smoothed and applied to the atoms in each residue, leading to local deformation of the template that improves the match of map and model. The model is then refined to improve the geometry and the fit of model to the structure-factor data. A new map is then calculated and the process is repeated until convergence. The procedure can extend the routine applicability of automated molecular replacement, model building and refinement to search models with over 2 Å r.m.s.d. representing 65-100% of the structure.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- Bioscience Division and Los Alamos Institutes, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
| | | | | | | | | | | | | |
Collapse
|
120
|
Abstract
Much molecular-evolution research is concerned with sequence analysis. Yet these sequences represent real, three-dimensional molecules with complex structure and function. Here I highlight a growing trend in the field to incorporate molecular structure and function into computational molecular-evolution work. I consider three focus areas: reconstruction and analysis of past evolutionary events, such as phylogenetic inference or methods to infer selection pressures; development of toy models and simulations to identify fundamental principles of molecular evolution; and atom-level, highly realistic computational modeling of molecular structure and function aimed at making predictions about possible future evolutionary events.
Collapse
Affiliation(s)
- Claus O Wilke
- Institute of Cell and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America.
| |
Collapse
|
121
|
Abstract
Functional characterization of proteins being one of the major issues in molecular biology is still unsolved due to several resource and technical limitations of experimental structure determination methods. A suitable methodology for accurate prediction of protein confirmations simply from sequence is therefore emerging as the primary modeling goal of researchers today. Global blind protein structure prediction summit, entitled Critical Assessment of Structure Prediction (CASP), critically assesses the modeling methodologies, to track our algorithmic path development. But our success is still impeded by incompetent modeling methodologies and several key technical lacunas. There is still a great need to focus some key issues for bridging the major though considered trivial gaps, in the upcoming CASP to pave and demarcate our correct way of developing a consistently accurate prediction methodology in the near future. Major problems resulting in divergence of our predicted models from their actual native states are thus highlighted with suggested more stringent and reliable assessment considerations in the CASP test.
Collapse
Affiliation(s)
- Ashish Runthala
- Biological Sciences, Faculty Division III, Birla Institute of Technology & Science, Pilani, Rajasthan, India.
| |
Collapse
|
122
|
Determination of solution structures of proteins up to 40 kDa using CS-Rosetta with sparse NMR data from deuterated samples. Proc Natl Acad Sci U S A 2012; 109:10873-8. [PMID: 22733734 DOI: 10.1073/pnas.1203013109] [Citation(s) in RCA: 167] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have developed an approach for determining NMR structures of proteins over 20 kDa that utilizes sparse distance restraints obtained using transverse relaxation optimized spectroscopy experiments on perdeuterated samples to guide RASREC Rosetta NMR structure calculations. The method was tested on 11 proteins ranging from 15 to 40 kDa, seven of which were previously unsolved. The RASREC Rosetta models were in good agreement with models obtained using traditional NMR methods with larger restraint sets. In five cases X-ray structures were determined or were available, allowing comparison of the accuracy of the Rosetta models and conventional NMR models. In all five cases, the Rosetta models were more similar to the X-ray structures over both the backbone and side-chain conformations than the "best effort" structures determined by conventional methods. The incorporation of sparse distance restraints into RASREC Rosetta allows routine determination of high-quality solution NMR structures for proteins up to 40 kDa, and should be broadly useful in structural biology.
Collapse
|
123
|
Zhao F, Xu J. A position-specific distance-dependent statistical potential for protein structure and functional study. Structure 2012; 20:1118-26. [PMID: 22608968 PMCID: PMC3372698 DOI: 10.1016/j.str.2012.04.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2012] [Revised: 04/09/2012] [Accepted: 04/10/2012] [Indexed: 10/28/2022]
Abstract
Although studied extensively, designing highly accurate protein energy potential is still challenging. A lot of knowledge-based statistical potentials are derived from the inverse of the Boltzmann law and consist of two major components: observed atomic interacting probability and reference state. These potentials mainly distinguish themselves in the reference state and use a similar simple counting method to estimate the observed probability, which is usually assumed to correlate with only atom types. This article takes a rather different view on the observed probability and parameterizes it by the protein sequence profile context of the atoms and the radius of the gyration, in addition to atom types. Experiments confirm that our position-specific statistical potential outperforms currently the popular ones in several decoy discrimination tests. Our results imply that, in addition to reference state, the observed probability also makes energy potentials different and evolutionary information greatly boost performance of energy potentials.
Collapse
Affiliation(s)
- Feng Zhao
- Toyota Technological Institute at Chicago, Chicago IL, USA 60637
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, Chicago IL, USA 60637
| |
Collapse
|
124
|
Lange OF, Baker D. Resolution-adapted recombination of structural features significantly improves sampling in restraint-guided structure calculation. Proteins 2012; 80:884-95. [PMID: 22423358 PMCID: PMC3310173 DOI: 10.1002/prot.23245] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
Recent work has shown that NMR structures can be determined by integrating sparse NMR data with structure prediction methods such as Rosetta. The experimental data serve to guide the search for the lowest energy state towards the deep minimum at the native state which is frequently missed in Rosetta de novo structure calculations. However, as the protein size increases, sampling again becomes limiting; for example, the standard Rosetta protocol involving Monte Carlo fragment insertion starting from an extended chain fails to converge for proteins over 150 amino acids even with guidance from chemical shifts (CS-Rosetta) and other NMR data. The primary limitation of this protocol—that every folding trajectory is completely independent of every other—was recently overcome with the development of a new approach involving resolution-adapted structural recombination (RASREC). Here we describe the RASREC approach in detail and compare it to standard CS-Rosetta. We show that the improved sampling of RASREC is essential in obtaining accurate structures over a benchmark set of 11 proteins in the 15-25 kDa size range using chemical shifts, backbone RDCs and HN-HN NOE data; in a number of cases the improved sampling methodology makes a larger contribution than incorporation of additional experimental data. Experimental data are invaluable for guiding sampling to the vicinity of the global energy minimum, but for larger proteins, the standard Rosetta fold-from-extended-chain protocol does not converge on the native minimum even with experimental data and the more powerful RASREC approach is necessary to converge to accurate solutions.
Collapse
Affiliation(s)
- Oliver F Lange
- Department Chemie, Biomolecular NMR and Munich Center for Integrated Protein Science, Technische Universität München, Garching, Germany.
| | | |
Collapse
|
125
|
Raval A, Piana S, Eastwood MP, Dror RO, Shaw DE. Refinement of protein structure homology models via long, all-atom molecular dynamics simulations. Proteins 2012; 80:2071-9. [PMID: 22513870 DOI: 10.1002/prot.24098] [Citation(s) in RCA: 184] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Revised: 04/03/2012] [Accepted: 04/11/2012] [Indexed: 11/07/2022]
Abstract
Accurate computational prediction of protein structure represents a longstanding challenge in molecular biology and structure-based drug design. Although homology modeling techniques are widely used to produce low-resolution models, refining these models to high resolution has proven difficult. With long enough simulations and sufficiently accurate force fields, molecular dynamics (MD) simulations should in principle allow such refinement, but efforts to refine homology models using MD have for the most part yielded disappointing results. It has thus far been unclear whether MD-based refinement is limited primarily by accessible simulation timescales, force field accuracy, or both. Here, we examine MD as a technique for homology model refinement using all-atom simulations, each at least 100 μs long-more than 100 times longer than previous refinement simulations-and a physics-based force field that was recently shown to successfully fold a structurally diverse set of fast-folding proteins. In MD simulations of 24 proteins chosen from the refinement category of recent Critical Assessment of Structure Prediction (CASP) experiments, we find that in most cases, simulations initiated from homology models drift away from the native structure. Comparison with simulations initiated from the native structure suggests that force field accuracy is the primary factor limiting MD-based refinement. This problem can be mitigated to some extent by restricting sampling to the neighborhood of the initial model, leading to structural improvement that, while limited, is roughly comparable to the leading alternative methods.
Collapse
Affiliation(s)
- Alpan Raval
- D E Shaw Research, New York, New York 10036, USA
| | | | | | | | | |
Collapse
|
126
|
Cruz JA, Blanchet MF, Boniecki M, Bujnicki JM, Chen SJ, Cao S, Das R, Ding F, Dokholyan NV, Flores SC, Huang L, Lavender CA, Lisi V, Major F, Mikolajczak K, Patel DJ, Philips A, Puton T, Santalucia J, Sijenyi F, Hermann T, Rother K, Rother M, Serganov A, Skorupski M, Soltysinski T, Sripakdeevong P, Tuszynska I, Weeks KM, Waldsich C, Wildauer M, Leontis NB, Westhof E. RNA-Puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction. RNA (NEW YORK, N.Y.) 2012; 18:610-25. [PMID: 22361291 PMCID: PMC3312550 DOI: 10.1261/rna.031054.111] [Citation(s) in RCA: 189] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises.
Collapse
Affiliation(s)
- José Almeida Cruz
- Architecture et Réactivité de l'ARN, Université de Strasbourg, IBMC-CNRS, F-67084 Strasbourg, France
| | - Marc-Frédérick Blanchet
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec H3C 3J7, Canada
| | - Michal Boniecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Shi-Jie Chen
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, Missouri 65211, USA
| | - Song Cao
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, Missouri 65211, USA
| | - Rhiju Das
- Department of Biochemistry
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Feng Ding
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Nikolay V. Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Samuel Coulbourn Flores
- Computational & Systems Biology Program, Institute for Cell and Molecular Biology, Uppsala University, 751 05 Uppsala, Sweden
| | - Lili Huang
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA
| | - Christopher A. Lavender
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Véronique Lisi
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec H3C 3J7, Canada
| | - François Major
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec H3C 3J7, Canada
| | - Katarzyna Mikolajczak
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Dinshaw J. Patel
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA
| | - Anna Philips
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Tomasz Puton
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - John Santalucia
- Department of Chemistry, Wayne State University, Detroit, Michigan 48202, USA
- DNA Software, Ann Arbor, Michigan 48104, USA
| | | | - Thomas Hermann
- Department of Chemistry and Biochemistry, University of California at San Diego, La Jolla, California 92093, USA
| | - Kristian Rother
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Magdalena Rother
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Alexander Serganov
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA
| | - Marcin Skorupski
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Tomasz Soltysinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Parin Sripakdeevong
- Department of Biochemistry
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Irina Tuszynska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Kevin M. Weeks
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Christina Waldsich
- Max F. Perutz Laboratories, Department of Biochemistry, University of Vienna, Vienna 1030, Austria
| | - Michael Wildauer
- Max F. Perutz Laboratories, Department of Biochemistry, University of Vienna, Vienna 1030, Austria
| | - Neocles B. Leontis
- Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, Ohio 43403, USA
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Université de Strasbourg, IBMC-CNRS, F-67084 Strasbourg, France
- Corresponding author.E-mail .
| |
Collapse
|
127
|
Scott WG. Challenges and surprises that arise with nucleic acids during model building and refinement. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:441-5. [PMID: 22505264 PMCID: PMC3322603 DOI: 10.1107/s0907444912001084] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2011] [Accepted: 01/10/2012] [Indexed: 11/11/2022]
Abstract
The process of building and refining crystal structures of nucleic acids, although similar to that for proteins, has some peculiarities that give rise to both various complications and various benefits. Although conventional isomorphous replacement phasing techniques are typically used to generate an experimental electron-density map for the purposes of determining novel nucleic acid structures, it is also possible to couple the phasing and model-building steps to permit the solution of complex and novel RNA three-dimensional structures without the need for conventional heavy-atom phasing approaches.
Collapse
Affiliation(s)
- William G Scott
- Department of Chemistry and Biochemistry and the Center for the Molecular Biology of RNA, University of California at Santa Cruz, Santa Cruz, CA 95064, USA.
| |
Collapse
|
128
|
Analysis of the conformation and function of the Plasmodium falciparum merozoite proteins MTRAP and PTRAMP. EUKARYOTIC CELL 2012; 11:615-25. [PMID: 22467743 DOI: 10.1128/ec.00039-12] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Thrombospondin repeat (TSR)-like domains are structures involved with cell adhesion. Plasmodium falciparum proteins containing TSR domains play crucial roles in parasite development. In particular, the preerythrocytic P. falciparum circumsporozoite protein is involved in hepatocyte invasion. The importance of these domains in two other malaria proteins, the merozoite-specific thrombospondin-related anonymous protein (MTRAP) and the thrombospondin-related apical membrane protein (PTRAMP), were assessed using near-full-length recombinant proteins composed of the extracellular domains produced in Escherichia coli. MTRAP is thought to be released from invasive organelles identified as micronemes during merozoite invasion to mediate motility and host cell invasion through an interaction with aldolase, an actin binding protein involved in the moving junction. PTRAMP function remains unknown. In this study, the conformation of recombinant MTRAP (rMTRAP) appeared to be a highly extended protein (2 nm by 33 nm, width by length, respectively), whereas rPTRAMP had a less extended structure. Using an erythrocyte binding assay, rMTRAP but not rPTRAMP bound human erythrocytes; rMTRAP binding was mediated through the TSR domain. MTRAP- and in general PTRAMP-specific antibodies failed to inhibit P. falciparum development in vitro. Altogether, MTRAP is a highly extended bifunctional protein that binds to an erythrocyte receptor and the merozoite motor.
Collapse
|
129
|
Gront D, Kmiecik S, Blaszczyk M, Ekonomiuk D, Koliński A. Optimization of protein models. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2012. [DOI: 10.1002/wcms.1090] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Dominik Gront
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Sebastian Kmiecik
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Maciej Blaszczyk
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Dariusz Ekonomiuk
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| | - Andrzej Koliński
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
| |
Collapse
|
130
|
phenix.mr_rosetta: molecular replacement and model rebuilding with Phenix and Rosetta. ACTA ACUST UNITED AC 2012; 13:81-90. [PMID: 22418934 PMCID: PMC3375004 DOI: 10.1007/s10969-012-9129-3] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2011] [Accepted: 02/07/2012] [Indexed: 11/03/2022]
Abstract
The combination of algorithms from the structure-modeling field with those of crystallographic structure determination can broaden the range of templates that are useful for structure determination by the method of molecular replacement. Automated tools in phenix.mr_rosetta simplify the application of these combined approaches by integrating Phenix crystallographic algorithms and Rosetta structure-modeling algorithms and by systematically generating and evaluating models with a combination of these methods. The phenix.mr_rosetta algorithms can be used to automatically determine challenging structures. The approaches used in phenix.mr_rosetta are described along with examples that show roles that structure-modeling can play in molecular replacement.
Collapse
|
131
|
Abstract
Essentially the same steps are required to solve the crystal structure of a serpin as for any other protein: produce and purify protein, grow crystals, collect diffraction data, find estimates of the phase angles, and then refine and validate the structure. For the phasing step, experimental phasing methods involving heavy atom soaks were required for the first few structures, but with the large number of serpin structures now available, molecular replacement has become the method of choice. Two things are special about serpins. First, because of the central role of conformational change in serpin mechanism, it is advisable to consider a variety of molecular replacement models in different conformations and then to allow for rigid-body motions in the initial refinement steps. Second, probably owing to the flexibility of serpins, the average serpin crystal is significantly less well ordered than the average crystal of another protein, which increases the difficulty of solving and refining their structures.
Collapse
|
132
|
Totir M, Echols N, Nanao M, Gee CL, Moskaleva A, Gradia S, Iavarone AT, Berger JM, May AP, Zubieta C, Alber T. Macro-to-micro structural proteomics: native source proteins for high-throughput crystallization. PLoS One 2012; 7:e32498. [PMID: 22393408 PMCID: PMC3290569 DOI: 10.1371/journal.pone.0032498] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2011] [Accepted: 01/27/2012] [Indexed: 01/07/2023] Open
Abstract
Structural biology and structural genomics projects routinely rely on recombinantly expressed proteins, but many proteins and complexes are difficult to obtain by this approach. We investigated native source proteins for high-throughput protein crystallography applications. The Escherichia coli proteome was fractionated, purified, crystallized, and structurally characterized. Macro-scale fermentation and fractionation were used to subdivide the soluble proteome into 408 unique fractions of which 295 fractions yielded crystals in microfluidic crystallization chips. Of the 295 crystals, 152 were selected for optimization, diffraction screening, and data collection. Twenty-three structures were determined, four of which were novel. This study demonstrates the utility of native source proteins for high-throughput crystallography.
Collapse
Affiliation(s)
- Monica Totir
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| | - Nathaniel Echols
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| | - Max Nanao
- European Molecular Biology Laboratory, Grenoble, France
| | - Christine L. Gee
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| | - Alisa Moskaleva
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| | - Scott Gradia
- QB3 Institute, Berkeley, California, United States of America
| | | | - James M. Berger
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| | - Andrew P. May
- Fluidigm Corporation, South San Francisco, California, United States of America
| | - Chloe Zubieta
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| | - Tom Alber
- Department of Molecular and Cell Biology, University of California, Berkeley, California, United States of America
| |
Collapse
|
133
|
Schmitz C, Vernon R, Otting G, Baker D, Huber T. Protein structure determination from pseudocontact shifts using ROSETTA. J Mol Biol 2012; 416:668-77. [PMID: 22285518 DOI: 10.1016/j.jmb.2011.12.056] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2011] [Revised: 12/16/2011] [Accepted: 12/27/2011] [Indexed: 11/25/2022]
Abstract
Paramagnetic metal ions generate pseudocontact shifts (PCSs) in nuclear magnetic resonance spectra that are manifested as easily measurable changes in chemical shifts. Metals can be incorporated into proteins through metal binding tags, and PCS data constitute powerful long-range restraints on the positions of nuclear spins relative to the coordinate system of the magnetic susceptibility anisotropy tensor (Δχ-tensor) of the metal ion. We show that three-dimensional structures of proteins can reliably be determined using PCS data from a single metal binding site combined with backbone chemical shifts. The program PCS-ROSETTA automatically determines the Δχ-tensor and metal position from the PCS data during the structure calculations, without any prior knowledge of the protein structure. The program can determine structures accurately for proteins of up to 150 residues, offering a powerful new approach to protein structure determination that relies exclusively on readily measurable backbone chemical shifts and easily discriminates between correctly and incorrectly folded conformations.
Collapse
Affiliation(s)
- Christophe Schmitz
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia
| | | | | | | | | |
Collapse
|
134
|
Mullins JGL. Structural modelling pipelines in next generation sequencing projects. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2012; 89:117-67. [PMID: 23046884 DOI: 10.1016/b978-0-12-394287-6.00005-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Our capacity to reliably predict protein structure from sequence is steadily improving due to the increased numbers and better targeting of protein structures being experimentally determined by structural genomics projects, along with the development of better modeling methodologies. Template-based (homology) modeling and de novo modeling methods are being combined to fill in remaining gaps in template coverage, and powerful automated structural modeling pipelines are being applied to large data sets of protein sequences. The improved quality of 3D models of proteins has led to their routine use in assessing the functional impact of nonsynonymous single nucleotide polymorphisms (nsSNPs) in specific protein systems, with the development of approaches that may be applied in a predictive fashion to nsSNPs emerging from next-generation sequencing projects. The challenges encountered in deriving functionally meaningful deductions from structural modeling can be quite different for proteins of different protein functional classes. The specific challenges to the assessment of the structural and functional impact of nsSNPs in globular proteins such as binding and regulatory proteins, structural proteins, and enzymes are discussed, as well as membrane transport proteins and ion channels. The mapping of reliable predictions of the structural and functional impact of SNPs, generated from automated modeling pipelines, on to protein-protein interaction networks will facilitate new approaches to understanding complex polygenic disorders and predisposition to disease.
Collapse
Affiliation(s)
- Jonathan G L Mullins
- Genome and Structural Bioinformatics, Institute of Life Science, College of Medicine, Swansea University, Singleton Park, Swansea, Wales, UK.
| |
Collapse
|
135
|
He X, Lobsiger J, Stocker A. Molecular Clues to Bothnia-Type Retinal Dystrophy. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 723:589-94. [DOI: 10.1007/978-1-4614-0631-0_75] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
136
|
Fanelli F, De Benedetti PG. Update 1 of: computational modeling approaches to structure-function analysis of G protein-coupled receptors. Chem Rev 2011; 111:PR438-535. [PMID: 22165845 DOI: 10.1021/cr100437t] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Francesca Fanelli
- Dulbecco Telethon Institute, University of Modena and Reggio Emilia, via Campi 183, 41125 Modena, Italy.
| | | |
Collapse
|
137
|
An enumerative stepwise ansatz enables atomic-accuracy RNA loop modeling. Proc Natl Acad Sci U S A 2011; 108:20573-8. [PMID: 22143768 DOI: 10.1073/pnas.1106516108] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Atomic-accuracy structure prediction of macromolecules should be achievable by optimizing a physically realistic energy function but is presently precluded by incomplete sampling of a biopolymer's many degrees of freedom. We present herein a working hypothesis, called the "stepwise ansatz," for recursively constructing well-packed atomic-detail models in small steps, enumerating several million conformations for each monomer, and covering all build-up paths. By making use of high-performance computing and the Rosetta framework, we provide first tests of this hypothesis on a benchmark of 15 RNA loop-modeling problems drawn from riboswitches, ribozymes, and the ribosome, including 10 cases that are not solvable by current knowledge-based modeling approaches. For each loop problem, this deterministic stepwise assembly method either reaches atomic accuracy or exposes flaws in Rosetta's all-atom energy function, indicating the resolution of the conformational sampling bottleneck. As a further rigorous test, we have carried out a blind all-atom prediction for a noncanonical RNA motif, the C7.2 tetraloop/receptor, and validated this model through nucleotide-resolution chemical mapping experiments. Stepwise assembly is an enumerative, ab initio build-up method that systematically outperforms existing Monte Carlo and knowledge-based methods for 3D structure prediction.
Collapse
|
138
|
Fortenberry C, Bowman EA, Proffitt W, Dorr B, Combs S, Harp J, Mizoue L, Meiler J. Exploring symmetry as an avenue to the computational design of large protein domains. J Am Chem Soc 2011; 133:18026-9. [PMID: 21978247 PMCID: PMC3781211 DOI: 10.1021/ja2051217] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
It has been demonstrated previously that symmetric, homodimeric proteins are energetically favored, which explains their abundance in nature. It has been proposed that such symmetric homodimers underwent gene duplication and fusion to evolve into protein topologies that have a symmetric arrangement of secondary structure elements--"symmetric superfolds". Here, the ROSETTA protein design software was used to computationally engineer a perfectly symmetric variant of imidazole glycerol phosphate synthase and its corresponding symmetric homodimer. The new protein, termed FLR, adopts the symmetric (βα)(8) TIM-barrel superfold. The protein is soluble and monomeric and exhibits two-fold symmetry not only in the arrangement of secondary structure elements but also in sequence and at atomic detail, as verified by crystallography. When cut in half, FLR dimerizes readily to form the symmetric homodimer. The successful computational design of FLR demonstrates progress in our understanding of the underlying principles of protein stability and presents an attractive strategy for the in silico construction of larger protein domains from smaller pieces.
Collapse
Affiliation(s)
| | | | | | | | | | - Joel Harp
- Vanderbilt University, Nashville, TN
| | | | | |
Collapse
|
139
|
Azoitei ML, Ban YEA, Julien JP, Bryson S, Schroeter A, Kalyuzhniy O, Porter JR, Adachi Y, Baker D, Pai EF, Schief WR. Computational design of high-affinity epitope scaffolds by backbone grafting of a linear epitope. J Mol Biol 2011; 415:175-92. [PMID: 22061265 PMCID: PMC7105911 DOI: 10.1016/j.jmb.2011.10.003] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Revised: 10/01/2011] [Accepted: 10/04/2011] [Indexed: 11/23/2022]
Abstract
Computational grafting of functional motifs onto scaffold proteins is a promising way to engineer novel proteins with pre-specified functionalities. Typically, protein grafting involves the transplantation of protein side chains from a functional motif onto structurally homologous regions of scaffold proteins. Using this approach, we previously transplanted the human immunodeficiency virus 2F5 and 4E10 epitopes onto heterologous proteins to design novel “epitope-scaffold” antigens. However, side-chain grafting is limited by the availability of scaffolds with compatible backbone for a given epitope structure and offers no route to modify backbone structure to improve mimicry or binding affinity. To address this, we report here a new and more aggressive computational method—backbone grafting of linear motifs—that transplants the backbone and side chains of linear functional motifs onto scaffold proteins. To test this method, we first used side-chain grafting to design new 2F5 epitope scaffolds with improved biophysical characteristics. We then independently transplanted the 2F5 epitope onto three of the same parent scaffolds using the newly developed backbone grafting procedure. Crystal structures of side-chain and backbone grafting designs showed close agreement with both the computational models and the desired epitope structure. In two cases, backbone grafting scaffolds bound antibody 2F5 with 30- and 9-fold higher affinity than corresponding side-chain grafting designs. These results demonstrate that flexible backbone methods for epitope grafting can significantly improve binding affinities over those achieved by fixed backbone methods alone. Backbone grafting of linear motifs is a general method to transplant functional motifs when backbone remodeling of the target scaffold is necessary.
Collapse
Affiliation(s)
- Mihai L Azoitei
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
140
|
Kryshtafovych A, Fidelis K, Tramontano A. Evaluation of model quality predictions in CASP9. Proteins 2011; 79 Suppl 10:91-106. [PMID: 21997462 DOI: 10.1002/prot.23180] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2011] [Revised: 08/22/2011] [Accepted: 08/24/2011] [Indexed: 12/14/2022]
Abstract
CASP has been assessing the state of the art in the a priori estimation of accuracy of protein structure prediction since 2006. The inclusion of model quality assessment category in CASP contributed to a rapid development of methods in this area. In the last experiment, 46 quality assessment groups tested their approaches to estimate the accuracy of protein models as a whole and/or on a per-residue basis. We assessed the performance of these methods predominantly on the basis of the correlation between the predicted and observed quality of the models on both global and local scales. The ability of the methods to identify the models closest to the best one, to differentiate between good and bad models, and to identify well modeled regions was also analyzed. Our evaluations demonstrate that even though global quality assessment methods seem to approach perfection point (weighted average per-target Pearson's correlation coefficients are as high as 0.97 for the best groups), there is still room for improvement. First, all top-performing methods use consensus approaches to generate quality estimates, and this strategy has its own limitations. Second, the methods that are based on the analysis of individual models lag far behind clustering techniques and need a boost in performance. The methods for estimating per-residue accuracy of models are less accurate than global quality assessment methods, with an average weighted per-model correlation coefficient in the range of 0.63-0.72 for the best 10 groups.
Collapse
Affiliation(s)
- Andriy Kryshtafovych
- Genome Center, University of California-Davis, 451 Health Sciences Drive, Davis, CA 95616, USA.
| | | | | |
Collapse
|
141
|
Mao B, Guan R, Montelione GT. Improved technologies now routinely provide protein NMR structures useful for molecular replacement. Structure 2011; 19:757-66. [PMID: 21645849 DOI: 10.1016/j.str.2011.04.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Revised: 04/07/2011] [Accepted: 04/25/2011] [Indexed: 10/18/2022]
Abstract
Molecular replacement (MR) is widely used for addressing the phase problem in X-ray crystallography. Historically, crystallographers have had limited success using NMR structures as MR search models. Here, we report a comprehensive investigation of the utility of protein NMR ensembles as MR search models, using data for 25 pairs of X-ray and NMR structures solved and refined using modern NMR methods. Starting from NMR ensembles prepared by an improved protocol, FindCore, correct MR solutions were obtained for 22 targets. Based on these solutions, automatic model rebuilding could be done successfully. Rosetta refinement of NMR structures provided MR solutions for another two proteins. We also demonstrate that such properly prepared NMR ensembles and X-ray crystal structures have similar performance when used as MR search models for homologous structures, particularly for targets with sequence identity >40%.
Collapse
Affiliation(s)
- Binchen Mao
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, and Robert Wood Johnson Medical School, UMDNJ, Piscataway, NJ 08854, USA
| | | | | |
Collapse
|
142
|
Altshuler EP, Serebryanaya DV, Katrukha AG. Generation of recombinant antibodies and means for increasing their affinity. BIOCHEMISTRY (MOSCOW) 2011; 75:1584-605. [PMID: 21417996 DOI: 10.1134/s0006297910130067] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Highly specific interaction with foreign molecules is a unique feature of antibodies. Since 1975, when Keller and Milstein proposed the method of hybridoma technology and prepared mouse monoclonal antibodies, many antibodies specific to various antigens have been obtained. Recent development of methods for preparation of recombinant DNA libraries and in silico bioinformatics approaches for protein structure analysis makes possible antibody preparation using gene engineering approaches. The development of gene engineering methods allowed creating recombinant antibodies and improving characteristics of existing antibodies; this significantly extends the applicability of antibodies. By modifying biochemical and immunochemical properties of antibodies by changing their amino acid sequences it is possible to create antibodies with properties optimal for certain tasks. For example, application of recombinant technologies resulted in antibody preparation of high affinity significantly exceeding the initial affinity of natural antibodies. In this review we summarize information about the structure, modes of preparation, and application of recombinant antibodies and their fragments and also consider the main approaches used to increase antibody affinity.
Collapse
Affiliation(s)
- E P Altshuler
- Department of Biochemistry, Faculty of Biology, Lomonosov Moscow State University, Russia
| | | | | |
Collapse
|
143
|
Huang PS, Ban YEA, Richter F, Andre I, Vernon R, Schief WR, Baker D. RosettaRemodel: a generalized framework for flexible backbone protein design. PLoS One 2011; 6:e24109. [PMID: 21909381 PMCID: PMC3166072 DOI: 10.1371/journal.pone.0024109] [Citation(s) in RCA: 241] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2011] [Accepted: 07/29/2011] [Indexed: 12/12/2022] Open
Abstract
We describe RosettaRemodel, a generalized framework for flexible protein design that provides a versatile and convenient interface to the Rosetta modeling suite. RosettaRemodel employs a unified interface, called a blueprint, which allows detailed control over many aspects of flexible backbone protein design calculations. RosettaRemodel allows the construction and elaboration of customized protocols for a wide range of design problems ranging from loop insertion and deletion, disulfide engineering, domain assembly, loop remodeling, motif grafting, symmetrical units, to de novo structure modeling.
Collapse
Affiliation(s)
- Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
| | - Yih-En Andrew Ban
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
| | - Florian Richter
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- Interdisciplinary Program in Biomolecular Structure and Design, University of Washington, Seattle, Washington, United States of America
| | - Ingemar Andre
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Robert Vernon
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - William R. Schief
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- * E-mail: (WRS); (DB)
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington, United States of America
- * E-mail: (WRS); (DB)
| |
Collapse
|
144
|
MacCallum JL, Pérez A, Schnieders MJ, Hua L, Jacobson MP, Dill KA. Assessment of protein structure refinement in CASP9. Proteins 2011; 79 Suppl 10:74-90. [PMID: 22069034 DOI: 10.1002/prot.23131] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2011] [Revised: 06/15/2011] [Accepted: 07/03/2011] [Indexed: 11/06/2022]
Abstract
We assess performance in the structure refinement category in CASP9. Two years after CASP8, the performance of the best groups has not improved. There are few groups that improve any of our assessment scores with statistical significance. Some predictors, however, are able to consistently improve the physicality of the models. Although we cannot identify any clear bottleneck in improving refinement, several points arise: (1) The refinement portion of CASP has too few targets to make many statistically meaningful conclusions. (2) Predictors are usually very conservative, limiting the possibility of large improvements in models. (3) No group is actually able to correctly rank their five submissions-indicating that potentially better models may be discarded. (4) Different sampling strategies work better for different refinement problems; there is no single strategy that works on all targets. In general, conservative strategies do better, while the greatest improvements come from more adventurous sampling-at the cost of consistency. Comparison with experimental data reveals aspects not captured by comparison to a single structure. In particular, we show that improvement in backbone geometry does not always mean better agreement with experimental data. Finally, we demonstrate that even given the current challenges facing refinement, the refined models are useful for solving the crystallographic phase problem through molecular replacement. Proteins 2011;. © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Justin L MacCallum
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA.
| | | | | | | | | | | |
Collapse
|
145
|
Berrondo M, Gray JJ. Computed structures of point deletion mutants and their enzymatic activities. Proteins 2011; 79:2844-60. [PMID: 21905110 DOI: 10.1002/prot.23109] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2010] [Revised: 04/08/2011] [Accepted: 05/13/2011] [Indexed: 11/11/2022]
Abstract
Point deletions in enzymes can vary in effect from negligible to complete loss of activity; however, these effects are not generally predictable. Deletions are widely observed in nature and often result in diseases such as cancer, cystic fibrosis, or osteogenesis imperfecta. Here, we have developed an algorithm to model the perturbed structures of deletion mutants with the ultimate goal of predicting their activities. The algorithm works by deleting the specified residue from the wild-type structure, creating a gap that is closed using a combination of local and global moves that change the backbone torsion angles of the protein structure. On a set of five proteins for which both wild-type and deletion mutant x-ray crystal structures are available, the algorithm produces deep, narrow energy funnels within 1.5 Å of the crystal structure for the deletion mutants. To assess the ability of our algorithm to predict activity from the predicted structures, we tested the correlation of experimental activity with several measures of the predicted structure ensemble using a set of 45 point deletions from ricin. Estimates incorporating likely prevalence of active and inactive deletion sites suggest that activity can be predicted correctly over 60% of the time from the active site root-mean squared deviation of the lowest energy predicted structures. The predictions are stronger than simple sequence organization measures, but more fundamental work is required in structure prediction and enzyme activity determination to allow consistent prediction of activity.
Collapse
Affiliation(s)
- Monica Berrondo
- Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | | |
Collapse
|
146
|
Shrestha R, Berenger F, Zhang KYJ. Acceleratingab initiophasing withde novomodels. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2011; 67:804-12. [DOI: 10.1107/s090744491102779x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2011] [Accepted: 07/11/2011] [Indexed: 11/10/2022]
|
147
|
Adams PD, Afonine PV, Bunkóczi G, Chen VB, Echols N, Headd JJ, Hung LW, Jain S, Kapral GJ, Grosse Kunstleve RW, McCoy AJ, Moriarty NW, Oeffner RD, Read RJ, Richardson DC, Richardson JS, Terwilliger TC, Zwart PH. The Phenix software for automated determination of macromolecular structures. Methods 2011; 55:94-106. [PMID: 21821126 DOI: 10.1016/j.ymeth.2011.07.005] [Citation(s) in RCA: 670] [Impact Index Per Article: 51.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2011] [Revised: 07/14/2011] [Accepted: 07/15/2011] [Indexed: 11/26/2022] Open
Abstract
X-ray crystallography is a critical tool in the study of biological systems. It is able to provide information that has been a prerequisite to understanding the fundamentals of life. It is also a method that is central to the development of new therapeutics for human disease. Significant time and effort are required to determine and optimize many macromolecular structures because of the need for manual interpretation of complex numerical data, often using many different software packages, and the repeated use of interactive three-dimensional graphics. The Phenix software package has been developed to provide a comprehensive system for macromolecular crystallographic structure solution with an emphasis on automation. This has required the development of new algorithms that minimize or eliminate subjective input in favor of built-in expert-systems knowledge, the automation of procedures that are traditionally performed by hand, and the development of a computational framework that allows a tight integration between the algorithms. The application of automated methods is particularly appropriate in the field of structural proteomics, where high throughput is desired. Features in Phenix for the automation of experimental phasing with subsequent model building, molecular replacement, structure refinement and validation are described and examples given of running Phenix from both the command line and graphical user interface.
Collapse
Affiliation(s)
- Paul D Adams
- Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
148
|
Xu M, Yu L, Wan B, Yu L, Huang Q. Predicting inactive conformations of protein kinases using active structures: conformational selection of type-II inhibitors. PLoS One 2011; 6:e22644. [PMID: 21818358 PMCID: PMC3144914 DOI: 10.1371/journal.pone.0022644] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2011] [Accepted: 07/03/2011] [Indexed: 11/19/2022] Open
Abstract
Protein kinases have been found to possess two characteristic conformations in their activation-loops: the active DFG-in conformation and the inactive DFG-out conformation. Recently, it has been very interesting to develop type-II inhibitors which target the DFG-out conformation and are more specific than the type-I inhibitors binding to the active DFG-in conformation. However, solving crystal structures of kinases with the DFG-out conformation remains a challenge, and this seriously hampers the application of the structure-based approaches in development of novel type-II inhibitors. To overcome this limitation, here we present a computational approach for predicting the DFG-out inactive conformation using the DFG-in active structures, and develop related conformational selection protocols for the uses of the predicted DFG-out models in the binding pose prediction and virtual screening of type-II ligands. With the DFG-out models, we predicted the binding poses for known type-II inhibitors, and the results were found in good agreement with the X-ray crystal structures. We also tested the abilities of the DFG-out models to recognize their specific type-II inhibitors by screening a database of small molecules. The AUC (area under curve) results indicated that the predicted DFG-out models were selective toward their specific type-II inhibitors. Therefore, the computational approach and protocols presented in this study are very promising for the structure-based design and screening of novel type-II kinase inhibitors.
Collapse
Affiliation(s)
- Min Xu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Lu Yu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Bo Wan
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Long Yu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Qiang Huang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| |
Collapse
|
149
|
Park H, Ko J, Joo K, Lee J, Seok C, Lee J. Refinement of protein termini in template-based modeling using conformational space annealing. Proteins 2011; 79:2725-34. [PMID: 21755541 DOI: 10.1002/prot.23101] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Revised: 05/17/2011] [Accepted: 05/27/2011] [Indexed: 02/05/2023]
Abstract
The rapid increase in the number of experimentally determined protein structures in recent years enables us to obtain more reliable protein tertiary structure models than ever by template-based modeling. However, refinement of template-based models beyond the limit available from the best templates is still needed for understanding protein function in atomic detail. In this work, we develop a new method for protein terminus modeling that can be applied to refinement of models with unreliable terminus structures. The energy function for terminus modeling consists of both physics-based and knowledge-based potential terms with carefully optimized relative weights. Effective sampling of both the framework and terminus is performed using the conformational space annealing technique. This method has been tested on a set of termini derived from a nonredundant structure database and two sets of termini from the CASP8 targets. The performance of the terminus modeling method is significantly improved over our previous method that does not employ terminus refinement. It is also comparable or superior to the best server methods tested in CASP8. The success of the current approach suggests that similar strategy may be applied to other types of refinement problems such as loop modeling or secondary structure rearrangement.
Collapse
Affiliation(s)
- Hahnbeom Park
- Department of Chemistry, Seoul National University, Seoul 151-747, Republic of Korea
| | | | | | | | | | | |
Collapse
|
150
|
Li W, Li F. Cross-crystal averaging with search models to improve molecular replacement phases. Structure 2011; 19:155-61. [PMID: 21300285 PMCID: PMC3037595 DOI: 10.1016/j.str.2010.12.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2010] [Revised: 11/29/2010] [Accepted: 12/13/2010] [Indexed: 11/25/2022]
Abstract
The application of molecular replacement (MR) in macromolecular crystallography can be limited by the “model bias” problem. Here we propose a strategy to reduce model bias when only part of a new structure is known: after the MR search, structure determination of the unknown part of the new structure can be facilitated by cross-crystal averaging of the known part of the new structure with the search model. This strategy dramatically improves electron density in the unknown part of the new structure. It has enabled us to determine the structures of two coronavirus receptor-binding domains each complexed with their receptor at moderate resolutions. In a test case, it also enabled automated model building when >50% of an antigen-antibody complex was absent. These results suggest that this averaging strategy can be routinely used after MR to enhance the interpretability of electron density associated with missing model.
Collapse
Affiliation(s)
- Weikai Li
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | | |
Collapse
|