51
|
Multiple subunit fitting into a low-resolution density map of a macromolecular complex using a gaussian mixture model. Biophys J 2008; 95:4643-58. [PMID: 18708469 PMCID: PMC2576401 DOI: 10.1529/biophysj.108.137125] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Recently, electron microscopy measurement of single particles has enabled us to reconstruct a low-resolution 3D density map of large biomolecular complexes. If structures of the complex subunits can be solved by x-ray crystallography at atomic resolution, fitting these models into the 3D density map can generate an atomic resolution model of the entire large complex. The fitting of multiple subunits, however, generally requires large computational costs; therefore, development of an efficient algorithm is required. We developed a fast fitting program, “gmfit”, which employs a Gaussian mixture model (GMM) to represent approximated shapes of the 3D density map and the atomic models. A GMM is a distribution function composed by adding together several 3D Gaussian density functions. Because our model analytically provides an integral of a product of two distribution functions, it enables us to quickly calculate the fitness of the density map and the atomic models. Using the integral, two types of potential energy function are introduced: the attraction potential energy between a 3D density map and each subunit, and the repulsion potential energy between subunits. The restraint energy for symmetry is also employed to build symmetrical origomeric complexes. To find the optimal configuration of subunits, we randomly generated initial configurations of subunit models, and performed a steepest-descent method using forces and torques of the three potential energies. Comparison between an original density map and its GMM showed that the required number of Gaussian distribution functions for a given accuracy depended on both resolution and molecular size. We then performed test fitting calculations for simulated low-resolution density maps of atomic models of homodimer, trimer, and hexamer, using different search parameters. The results indicated that our method was able to rebuild atomic models of a complex even for maps of 30 Å resolution if sufficient numbers (eight or more) of Gaussian distribution functions were employed for each subunit, and the symmetric restraints were assigned for complexes with more than three subunits. As a more realistic test, we tried to build an atomic model of the GroEL/ES complex by fitting 21-subunit atomic models into the 3D density map obtained by cryoelectron microscopy using the C7 symmetric restraints. A model with low root mean-square deviations (14.7 Å) was obtained as the lowest-energy model, showing that our fitting method was reasonably accurate. Inclusion of other restraints from biological and biochemical experiments could further enhance the accuracy.
Collapse
|
52
|
Serysheva II, Ludtke SJ, Baker ML, Cong Y, Topf M, Eramian D, Sali A, Hamilton SL, Chiu W. Subnanometer-resolution electron cryomicroscopy-based domain models for the cytoplasmic region of skeletal muscle RyR channel. Proc Natl Acad Sci U S A 2008; 105:9610-5. [PMID: 18621707 PMCID: PMC2474495 DOI: 10.1073/pnas.0803189105] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Indexed: 11/18/2022] Open
Abstract
The skeletal muscle Ca(2+) release channel (RyR1), a homotetramer, regulates the release of Ca(2+) from the sarcoplasmic reticulum to initiate muscle contraction. In this work, we have delineated the RyR1 monomer boundaries in a subnanometer-resolution electron cryomicroscopy (cryo-EM) density map. In the cytoplasmic region of each RyR1 monomer, 36 alpha-helices and 7 beta-sheets can be resolved. A beta-sheet was also identified close to the membrane-spanning region that resembles the cytoplasmic pore structures of inward rectifier K(+) channels. Three structural folds, generated for amino acids 12-565 using comparative modeling and cryo-EM density fitting, localize close to regions implicated in communication with the voltage sensor in the transverse tubules. Eleven of the 15 disease-related residues for these domains are mapped to the surface of these models. Four disease-related residues are found in a basin at the interfaces of these regions, creating a pocket in which the immunophilin FKBP12 can fit. Taken together, these results provide a structural context for both channel gating and the consequences of certain malignant hyperthermia and central core disease-associated mutations in RyR1.
Collapse
Affiliation(s)
- Irina I. Serysheva
- *National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, and
| | - Steven J. Ludtke
- *National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, and
| | - Matthew L. Baker
- *National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, and
| | - Yao Cong
- *National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, and
| | - Maya Topf
- Department of Crystallography, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom; and
| | - David Eramian
- Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, University of California, Mission Bay Byers Hall, 1700 Fourth Street, San Francisco, CA 94158
| | - Andrej Sali
- Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, University of California, Mission Bay Byers Hall, 1700 Fourth Street, San Francisco, CA 94158
| | - Susan L. Hamilton
- Department of Molecular Physiology and Biophysics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
| | - Wah Chiu
- *National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, and
| |
Collapse
|
53
|
Remaut H, Tang C, Henderson NS, Pinkner JS, Wang T, Hultgren SJ, Thanassi DG, Waksman G, Li H. Fiber formation across the bacterial outer membrane by the chaperone/usher pathway. Cell 2008; 133:640-52. [PMID: 18485872 PMCID: PMC3036173 DOI: 10.1016/j.cell.2008.03.033] [Citation(s) in RCA: 180] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2007] [Revised: 02/22/2008] [Accepted: 03/28/2008] [Indexed: 11/19/2022]
Abstract
Gram-negative pathogens commonly exhibit adhesive pili on their surfaces that mediate specific attachment to the host. A major class of pili is assembled via the chaperone/usher pathway. Here, the structural basis for pilus fiber assembly and secretion performed by the outer membrane assembly platform--the usher--is revealed by the crystal structure of the translocation domain of the P pilus usher PapC and single particle cryo-electron microscopy imaging of the FimD usher bound to a translocating type 1 pilus assembly intermediate. These structures provide molecular snapshots of a twinned-pore translocation machinery in action. Unexpectedly, only one pore is used for secretion, while both usher protomers are used for chaperone-subunit complex recruitment. The translocating pore itself comprises 24 beta strands and is occluded by a folded plug domain, likely gated by a conformationally constrained beta-hairpin. These structures capture the secretion of a virulence factor across the outer membrane of gram-negative bacteria.
Collapse
Affiliation(s)
- Han Remaut
- Institute of Structural Molecular Biology, University College London and Birkbeck College, Malet Street, London, WC1E 7HX, United Kingdom
- These authors contributed equally to this work
| | - Chunyan Tang
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
- These authors contributed equally to this work
| | - Nadine S. Henderson
- Center for Infectious Diseases, Department of Molecular Genetics and Microbiology, Stony Brook University, Stony Brook, NY, 11794-5120, USA
| | - Jerome S. Pinkner
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Tao Wang
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Scott J. Hultgren
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - David G. Thanassi
- Center for Infectious Diseases, Department of Molecular Genetics and Microbiology, Stony Brook University, Stony Brook, NY, 11794-5120, USA
| | - Gabriel Waksman
- Institute of Structural Molecular Biology, University College London and Birkbeck College, Malet Street, London, WC1E 7HX, United Kingdom
| | - Huilin Li
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
54
|
Topf M, Lasker K, Webb B, Wolfson H, Chiu W, Sali A. Protein structure fitting and refinement guided by cryo-EM density. Structure 2008; 16:295-307. [PMID: 18275820 PMCID: PMC2409374 DOI: 10.1016/j.str.2007.11.016] [Citation(s) in RCA: 265] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2007] [Revised: 11/20/2007] [Accepted: 11/26/2007] [Indexed: 11/23/2022]
Abstract
For many macromolecular assemblies, both a cryo-electron microscopy map and atomic structures of its component proteins are available. Here we describe a method for fitting and refining a component structure within its map at intermediate resolution (<15 A). The atomic positions are optimized with respect to a scoring function that includes the crosscorrelation coefficient between the structure and the map as well as stereochemical and nonbonded interaction terms. A heuristic optimization that relies on a Monte Carlo search, a conjugate-gradients minimization, and simulated annealing molecular dynamics is applied to a series of subdivisions of the structure into progressively smaller rigid bodies. The method was tested on 15 proteins of known structure with 13 simulated maps and 3 experimentally determined maps. At approximately 10 A resolution, Calpha rmsd between the initial and final structures was reduced on average by approximately 53%. The method is automated and can refine both experimental and predicted atomic structures.
Collapse
Affiliation(s)
- Maya Topf
- School of Crystallography, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom.
| | | | | | | | | | | |
Collapse
|
55
|
Alber F, Förster F, Korkin D, Topf M, Sali A. Integrating diverse data for structure determination of macromolecular assemblies. Annu Rev Biochem 2008; 77:443-77. [PMID: 18318657 DOI: 10.1146/annurev.biochem.77.060407.135530] [Citation(s) in RCA: 185] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
To understand the cell, we need to determine the macromolecular assembly structures, which may consist of tens to hundreds of components. First, we review the varied experimental data that characterize the assemblies at several levels of resolution. We then describe computational methods for generating the structures using these data. To maximize completeness, resolution, accuracy, precision, and efficiency of the structure determination, a computational approach is required that uses spatial information from a variety of experimental methods. We propose such an approach, defined by its three main components: a hierarchical representation of the assembly, a scoring function consisting of spatial restraints derived from experimental data, and an optimization method that generates structures consistent with the data. This approach is illustrated by determining the configuration of the 456 proteins in the nuclear pore complex (NPC) from baker's yeast. With these tools, we are poised to integrate structural information gathered at multiple levels of the biological hierarchy--from atoms to cells--into a common framework.
Collapse
Affiliation(s)
- Frank Alber
- Department of Biopharmaceutical Sciences, and California Institute for Quantitative Biosciences, University of California at San Francisco, CA 94158-2330, USA.
| | | | | | | | | |
Collapse
|
56
|
Dugan JM, Altman RB. Using surface envelopes to constrain molecular modeling. Protein Sci 2007; 16:1266-73. [PMID: 17586766 PMCID: PMC2206696 DOI: 10.1110/ps.062733407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Molecular density information (as measured by electron microscopic reconstructions or crystallographic density maps) can be a powerful source of information for molecular modeling. Molecular density constrains models by specifying where atoms should and should not be. Low-resolution density information can often be obtained relatively quickly, and there is a need for methods that use it effectively. We have previously described a method for scoring molecular models with surface envelopes to discriminate between plausible and implausible fits. We showed that we could successfully filter out models with the wrong shape based on this discrimination power. Ideally, however, surface information should be used during the modeling process to constrain the conformations that are sampled. In this paper, we describe an extension of our method for using shape information during computational modeling. We use the envelope scoring metric as part of an objective function in a global optimization that also optimizes distances and angles while avoiding collisions. We systematically tested surface representations of proteins (using all nonhydrogen heavy atoms) with different abundance of distance information and showed that the root mean square deviation (RMSD) of models built with envelope information is consistently improved, particularly in data sets with relatively small sets of short-range distances.
Collapse
|
57
|
Dror O, Lasker K, Nussinov R, Wolfson H. EMatch: an efficient method for aligning atomic resolution subunits into intermediate-resolution cryo-EM maps of large macromolecular assemblies. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2007; 63:42-9. [PMID: 17164525 PMCID: PMC2483490 DOI: 10.1107/s0907444906041059] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2006] [Accepted: 10/08/2006] [Indexed: 11/22/2022]
Abstract
Structural analysis of biological machines is essential for inferring their function and mechanism. Nevertheless, owing to their large size and instability, deciphering the atomic structure of macromolecular assemblies is still considered as a challenging task that cannot keep up with the rapid advances in the protein-identification process. In contrast, structural data at lower resolution is becoming more and more available owing to recent advances in cryo-electron microscopy (cryo-EM) techniques. Once a cryo-EM map is acquired, one of the basic questions asked is what are the folds of the components in the assembly and what is their configuration. Here, a novel knowledge-based computational method, named EMatch, towards tackling this task for cryo-EM maps at 6-10 A resolution is presented. The method recognizes and locates possible atomic resolution structural homologues of protein domains in the assembly. The strengths of EMatch are demonstrated on a cryo-EM map of native GroEL at 6 A resolution.
Collapse
Affiliation(s)
- Oranit Dror
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Keren Lasker
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Ruth Nussinov
- Department of Human Genetics and Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
- Basic Research Program, SAIC-Frederick, Center for Cancer Research Nanobiology Program, NCI-Frederick, Building 469, Room 151, Frederick, MD 21702 USA
| | - Haim Wolfson
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
58
|
Lasker K, Dror O, Shatsky M, Nussinov R, Wolfson HJ. EMatch: discovery of high resolution structural homologues of protein domains in intermediate resolution cryo-EM maps. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2007; 4:28-39. [PMID: 17277411 DOI: 10.1109/tcbb.2007.1003] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Cryo-EM has become an increasingly powerful technique for elucidating the structure, dynamics, and function of large flexible macromolecule assemblies that cannot be determined at atomic resolution. However, due to the relatively low resolution of cryo-EM data, a major challenge is to identify components of complexes appearing in cryo-EM maps. Here, we describe EMatch, a novel integrated approach for recognizing structural homologues of protein domains present in a 6-10 A resolution cryo-EM map and constructing a quasi-atomic structural model of their assembly. The method is highly efficient and has been successfully validated on various simulated data. The strength of the method is demonstrated by a domain assembly of an experimental cryo-EM map of native GroEL at 6 A resolution.
Collapse
Affiliation(s)
- Keren Lasker
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Israel.
| | | | | | | | | |
Collapse
|
59
|
Shacham E, Sheehan B, Volkmann N. Density-based score for selecting near-native atomic models of unknown structures. J Struct Biol 2006; 158:188-95. [PMID: 17296314 PMCID: PMC2175034 DOI: 10.1016/j.jsb.2006.12.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2006] [Revised: 10/20/2006] [Accepted: 12/07/2006] [Indexed: 11/15/2022]
Abstract
We present a low-resolution density-based scoring scheme for selecting high-quality models from a large pool of lesser quality models. We use pre-configured decoy data sets that contain large numbers of models with different degrees of correctness to evaluate the performance of the strategy. We find that the scoring scheme consistently identifies one of the highest quality models for a wide variety of target structures, resolution ranges, and noise models. Tests with experimental data yield similar results.
Collapse
Affiliation(s)
| | | | - Niels Volkmann
- *Correspondence should be addressed to Niels Volkmann, The Burnham Institute for Medical Research, Bioinformatics and Systems Biology Program, 10901 North Torrey Pines Road, La Jolla, CA 92037, Phone: 858 646 3187, Fax: 858 646 3195, e-mail:
| |
Collapse
|
60
|
Garzón JI, Kovacs J, Abagyan R, Chacón P. ADP_EM: fast exhaustive multi-resolution docking for high-throughput coverage. Bioinformatics 2006; 23:427-33. [PMID: 17150992 DOI: 10.1093/bioinformatics/btl625] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Efficient fitting tools are needed to take advantage of a fast growth of atomic models of protein domains from crystallography or comparative modeling, and low-resolution density maps of larger molecular assemblies. Here, we report a novel fitting algorithm for the exhaustive and fast overlay of partial high-resolution models into a low-resolution density map. The method incorporates a fast rotational search based on spherical harmonics (SH) combined with a simple translational scanning. RESULTS This novel combination makes it possible to accurately dock atomic structures into low-resolution electron-density maps in times ranging from seconds to a few minutes. The high-efficiency achieved with simulated and experimental test cases preserves the exhaustiveness needed in these heterogeneous-resolution merging tools. The results demonstrate its efficiency, robustness and high-throughput coverage. AVAILABILITY http://sbg.cib.csic.es/Software/ADP_EM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- José Ignacio Garzón
- Centro de Investigaciones Biológicas, CSIC Ramiro de Maeztu 9, 28040 Madrid, Spain
| | | | | | | |
Collapse
|
61
|
Velazquez-Muriel JA, Carazo JMA. Flexible fitting in 3D-EM with incomplete data on superfamily variability. J Struct Biol 2006; 158:165-81. [PMID: 17257856 DOI: 10.1016/j.jsb.2006.10.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2006] [Revised: 09/20/2006] [Accepted: 10/13/2006] [Indexed: 11/26/2022]
Abstract
We present a substantial improvement of S-flexfit, our recently proposed method for flexible fitting in three dimensional electron microscopy (3D-EM) at a resolution range of 8-12A, together with a comparison of the method capabilities with Normal Mode Analysis (NMA), application examples and a user's guide. S-flexfit uses the evolutionary information contained in protein domain databases like CATH, by means of the structural alignment of the elements of a protein superfamily. The added development is based on a recent extension of the Singular Value Decomposition (SVD) algorithm specifically designed for situations with missing data: Incremental Singular Value Decomposition (ISVD). ISVD can manage gaps and allows considering more aminoacids in the structural alignment of a superfamily, extending the range of application and producing better models for the fitting step of our methodology. Our previous SVD-based flexible fitting approach can only take into account positions with no gaps in the alignment, being appropriate when the superfamily members are relatively similar and there are few gaps. However, with new data coming from structural proteomics works, the later situation is becoming less likely, making ISVD the technique of choice for further works. We present the results of using ISVD in the process of flexible fitting with both simulated and experimental 3D-EM maps (GroEL and Poliovirus 135S cell entry intermediate).
Collapse
Affiliation(s)
- Javier A Velazquez-Muriel
- Biocomputing Unit, National Center for Biotechnology, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | | |
Collapse
|
62
|
Baker ML, Jiang W, Wedemeyer WJ, Rixon FJ, Baker D, Chiu W. Ab initio modeling of the herpesvirus VP26 core domain assessed by CryoEM density. PLoS Comput Biol 2006; 2:e146. [PMID: 17069457 PMCID: PMC1626159 DOI: 10.1371/journal.pcbi.0020146] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2006] [Accepted: 09/26/2006] [Indexed: 12/22/2022] Open
Abstract
Efforts in structural biology have targeted the systematic determination of all protein structures through experimental determination or modeling. In recent years, 3-D electron cryomicroscopy (cryoEM) has assumed an increasingly important role in determining the structures of these large macromolecular assemblies to intermediate resolutions (6–10 Å). While these structures provide a snapshot of the assembly and its components in well-defined functional states, the resolution limits the ability to build accurate structural models. In contrast, sequence-based modeling techniques are capable of producing relatively robust structural models for isolated proteins or domains. In this work, we developed and applied a hybrid modeling approach, utilizing cryoEM density and ab initio modeling to produce a structural model for the core domain of a herpesvirus structural protein, VP26. Specifically, this method, first tested on simulated data, utilizes the cryoEM density map as a geometrical constraint in identifying the most native-like models from a gallery of models generated by ab initio modeling. The resulting model for the core domain of VP26, based on the 8.5-Å resolution herpes simplex virus type 1 (HSV-1) capsid cryoEM structure and mutational data, exhibited a novel fold. Additionally, the core domain of VP26 appeared to have a complementary interface to the known upper-domain structure of VP5, its cognate binding partner. While this new model provides for a better understanding of the assembly and interactions of VP26 in HSV-1, the approach itself may have broader applications in modeling the components of large macromolecular assemblies. Efforts in structural genomics have targeted the systematic determination of all protein structures primarily using X-ray crystallography and nuclear magnetic resonance. These initiatives have typically focused on domains, single-protein and in some cases small complexes, and as such macromolecular machines are relatively underrepresented. However, in recent years, electron cryomicroscopy (cryoEM) has assumed an increasingly important role in determining the structure of large macromolecular machines in their biologically active states to intermediate resolutions (5–10 Å). Concurrently, modeling techniques, such as comparative and ab initio modeling, have played an increasingly important role in structure determination of small proteins not amenable to other structural techniques. In this work, Baker and colleagues have leveraged ab initio modeling and cryoEM to assess and identify structural models for the macromolecular components within a large complex. Specifically, the cryoEM density can be used to select the most native-like models from a large gallery of potential models. Applied to the smallest herpesvirus capsid protein, VP26 (12 kDa), it was possible to determine its core domain structure (residues 42–105), which helped to elucidate interactions among the structural protein in the virion. Beyond VP26, these techniques potentially provide a new pathway for accurate structure determination of proteins in their biological and functional states.
Collapse
Affiliation(s)
- Matthew L Baker
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, United States of America
| | - Wen Jiang
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - William J Wedemeyer
- Department of Biochemistry, Michigan State University, East Lansing, Michigan, United States of America
| | - Frazer J Rixon
- MRC Virology Unit, Institute of Virology, Glasgow, United Kingdom
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
| | - Wah Chiu
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
63
|
Velazquez-Muriel JA, Valle M, Santamaría-Pang A, Kakadiaris IA, Carazo JM. Flexible Fitting in 3D-EM Guided by the Structural Variability of Protein Superfamilies. Structure 2006; 14:1115-26. [PMID: 16843893 DOI: 10.1016/j.str.2006.05.013] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Revised: 04/27/2006] [Accepted: 05/01/2006] [Indexed: 10/24/2022]
Abstract
A method for flexible fitting of molecular models into three-dimensional electron microscopy (3D-EM) reconstructions at a resolution range of 8-12 A is proposed. The approach uses the evolutionarily related structural variability existing among the protein domains of a given superfamily, according to structural databases such as CATH. A structural alignment of domains belonging to the superfamily, followed by a principal components analysis, is performed, and the first three principal components of the decomposition are explored. Using rigid body transformations for the secondary structure elements (SSEs) plus the cyclic coordinate descent algorithm to close the loops, stereochemically correct models are built for the structure to fit. All of the models are fitted into the 3D-EM map, and the best one is selected based on crosscorrelation measures. This work applies the method to both simulated and experimental data and shows that the flexible fitting was able to produce better results than rigid body fitting.
Collapse
Affiliation(s)
- Javier-Angel Velazquez-Muriel
- Biocomputing Unit, National Center for Biotechnology-CSIC, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | | | | | | | | |
Collapse
|
64
|
Eramian D, Shen MY, Devos D, Melo F, Sali A, Marti-Renom MA. A composite score for predicting errors in protein structure models. Protein Sci 2006; 15:1653-66. [PMID: 16751606 PMCID: PMC2242555 DOI: 10.1110/ps.062095806] [Citation(s) in RCA: 114] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Reliable prediction of model accuracy is an important unsolved problem in protein structure modeling. To address this problem, we studied 24 individual assessment scores, including physics-based energy functions, statistical potentials, and machine learning-based scoring functions. Individual scores were also used to construct approximately 85,000 composite scoring functions using support vector machine (SVM) regression. The scores were tested for their abilities to identify the most native-like models from a set of 6000 comparative models of 20 representative protein structures. Each of the 20 targets was modeled using a template of <30% sequence identity, corresponding to challenging comparative modeling cases. The best SVM score outperformed all individual scores by decreasing the average RMSD difference between the model identified as the best of the set and the model with the lowest RMSD (DeltaRMSD) from 0.63 A to 0.45 A, while having a higher Pearson correlation coefficient to RMSD (r=0.87) than any other tested score. The most accurate score is based on a combination of the DOPE non-hydrogen atom statistical potential; surface, contact, and combined statistical potentials from MODPIPE; and two PSIPRED/DSSP scores. It was implemented in the SVMod program, which can now be applied to select the final model in various modeling problems, including fold assignment, target-template alignment, and loop modeling.
Collapse
Affiliation(s)
- David Eramian
- Graduate Group in Biophysics, Department of Biopharmaceutical Sciences, University of California at San Francisco 94158, USA
| | | | | | | | | | | |
Collapse
|
65
|
Topf M, Baker ML, Marti-Renom MA, Chiu W, Sali A. Refinement of Protein Structures by Iterative Comparative Modeling and CryoEM Density Fitting. J Mol Biol 2006; 357:1655-68. [PMID: 16490207 DOI: 10.1016/j.jmb.2006.01.062] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2005] [Revised: 01/06/2006] [Accepted: 01/17/2006] [Indexed: 11/21/2022]
Abstract
We developed a method for structure characterization of assembly components by iterative comparative protein structure modeling and fitting into cryo-electron microscopy (cryoEM) density maps. Specifically, we calculate a comparative model of a given component by considering many alternative alignments between the target sequence and a related template structure while optimizing the fit of a model into the corresponding density map. The method relies on the previously developed Moulder protocol that iterates over alignment, model building, and model assessment. The protocol was benchmarked using 20 varied target-template pairs of known structures with less than 30% sequence identity and corresponding simulated density maps at resolutions from 5A to 25A. Relative to the models based on the best existing sequence profile alignment methods, the percentage of C(alpha) atoms that are within 5A of the corresponding C(alpha) atoms in the superposed native structure increases on average from 52% to 66%, which is half-way between the starting models and the models from the best possible alignments (82%). The test also reveals that despite the improvements in the accuracy of the fitness function, this function is still the bottleneck in reducing the remaining errors. To demonstrate the usefulness of the protocol, we applied it to the upper domain of the P8 capsid protein of rice dwarf virus that has been studied by cryoEM at 6.8A. The C(alpha) root-mean-square deviation of the model based on the remotely related template, bluetongue virus VP7, improved from 8.7A to 6.0A, while the best possible model has a C(alpha) RMSD value of 5.3A. Moreover, the resulting model fits better into the cryoEM density map than the initial template structure. The method is being implemented in our program MODELLER for protein structure modeling by satisfaction of spatial restraints and will be applicable to the rapidly increasing number of cryoEM density maps of macromolecular assemblies.
Collapse
Affiliation(s)
- Maya Topf
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, QB3, 1700 4th Street, Suite 503B, University of California at San Francisco, San Francisco, CA 94143-2552, USA
| | | | | | | | | |
Collapse
|
66
|
Topf M, Sali A. Combining electron microscopy and comparative protein structure modeling. Curr Opin Struct Biol 2005; 15:578-85. [PMID: 16118050 DOI: 10.1016/j.sbi.2005.08.001] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2005] [Revised: 07/01/2005] [Accepted: 08/10/2005] [Indexed: 10/25/2022]
Abstract
Recently, advances have been made in methods and applications that integrate electron microscopy density maps and comparative modeling to produce atomic structures of macromolecular assemblies. Electron microscopy can benefit from comparative modeling through the fitting of comparative models into electron microscopy density maps. Also, comparative modeling can benefit from electron microscopy through the use of intermediate-resolution density maps in fold recognition, template selection and sequence-structure alignment.
Collapse
Affiliation(s)
- Maya Topf
- Department of Biopharmaceutical Sciences, University of California San Francisco, San Francisco, CA 94143, USA
| | | |
Collapse
|
67
|
Alber F, Kim MF, Sali A. Structural characterization of assemblies from overall shape and subcomplex compositions. Structure 2005; 13:435-45. [PMID: 15766545 DOI: 10.1016/j.str.2005.01.013] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2004] [Revised: 01/12/2005] [Accepted: 01/14/2005] [Indexed: 11/24/2022]
Abstract
We suggest structure characterization of macromolecular assemblies by combining assembly shape determined by electron cyromicroscopy with information about subunit proximity determined by affinity purification. To achieve this aim, structure characterization is expressed as a problem in satisfaction of spatial restraints that (1) represents subunits as spheres, (2) encodes information about the subunit excluded volume, assembly shape, and pulldowns in a scoring function, and (3) finds subunit configurations that satisfy the input restraints by an optimization of the scoring function. Testing of the approach with model systems suggests its feasibility.
Collapse
Affiliation(s)
- Frank Alber
- Department of Biopharmaceutical Sciences and, California Institute for Quantitative Biomedical Research, University of California, San Francisco, San Francisco, California 94143, USA
| | | | | |
Collapse
|
68
|
Chiu W, Baker ML, Jiang W, Dougherty M, Schmid MF. Electron cryomicroscopy of biological machines at subnanometer resolution. Structure 2005; 13:363-72. [PMID: 15766537 DOI: 10.1016/j.str.2004.12.016] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2004] [Revised: 12/22/2004] [Accepted: 12/25/2004] [Indexed: 01/29/2023]
Abstract
Advances in electron cryomicroscopy (cryo-EM) have made possible the structural determination of large biological machines in the resolution range of 6-9 angstroms. Rice dwarf virus and the acrosomal bundle represent two distinct types of machines amenable to cryo-EM investigations at subnanometer resolutions. However, calculating the density map is only the first step, and much analysis remains to extract structural insights and the mechanism of action in these machines. This paper will review the computational and visualization methodologies necessary for analysis (structure mining) of the computed cryo-EM maps of these machines. These steps include component segmentation, averaging based on local symmetry among components, density connectivity trace, incorporation of bioinformatics analysis, and fitting of high-resolution component data, if available. The consequences of these analyses can not only identify accurately some of the secondary structure elements of the molecular components in machines but also suggest structural mechanisms related to their biological functions.
Collapse
Affiliation(s)
- Wah Chiu
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas 77030, USA.
| | | | | | | | | |
Collapse
|