51
|
Tiwari SP, Tama F, Miyashita O. Searching for 3D structural models from a library of biological shapes using a few 2D experimental images. BMC Bioinformatics 2018; 19:320. [PMID: 30208849 PMCID: PMC6134691 DOI: 10.1186/s12859-018-2358-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 09/03/2018] [Indexed: 01/08/2023] Open
Abstract
Background Advancements in biophysical experimental techniques have pushed the limits in terms of the types of phenomena that can be characterized, the amount of data that can be produced and the resolution at which we can visualize them. Single particle techniques such as Electron Microscopy (EM) and X-ray free electron laser (XFEL) scattering require a large number of 2D images collected to resolve three-dimensional (3D) structures. In this study, we propose a quick strategy to retrieve potential 3D shapes, as low-resolution models, from a few 2D experimental images by searching a library of 2D projection images generated from existing 3D structures. Results We developed the protocol to assemble a non-redundant set of 3D shapes for generating the 2D image library, and to retrieve potential match 3D shapes for query images, using EM data as a test. In our strategy, we disregard differences in volume size, giving previously unknown structures and conformations a greater number of 3D biological shapes as possible matches. We tested the strategy using images from three EM models as query images for searches against a library of 22750 2D projection images generated from 250 random EM models. We found that our ability to identify 3D shapes that match the query images depends on how complex the outline of the 2D shapes are and whether they are represented in the search image library. Conclusions Through our computational method, we are able to quickly retrieve a 3D shape from a few 2D projection images. Our approach has the potential for exploring other types of 2D single particle structural data such as from XFEL scattering experiments, for providing a tool to interpret low-resolution data that may be insufficient for 3D reconstruction, and for estimating the mixing of states or conformations that could exist in such experimental data. Electronic supplementary material The online version of this article (10.1186/s12859-018-2358-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sandhya P Tiwari
- Computational Structural Biology Unit, RIKEN Center for Computational Science, Kobe, Japan
| | - Florence Tama
- Computational Structural Biology Unit, RIKEN Center for Computational Science, Kobe, Japan. .,Graduate School of Science, Department of Physics & Institute of Transformative Bio-Molecules (WPI-ITbM), Nagoya University, Nagoya, Japan.
| | - Osamu Miyashita
- Computational Structural Biology Unit, RIKEN Center for Computational Science, Kobe, Japan
| |
Collapse
|
52
|
Kumar A, Zhang KYJ. Advances in the Development of Shape Similarity Methods and Their Application in Drug Discovery. Front Chem 2018; 6:315. [PMID: 30090808 PMCID: PMC6068280 DOI: 10.3389/fchem.2018.00315] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 07/09/2018] [Indexed: 12/21/2022] Open
Abstract
Molecular similarity is a key concept in drug discovery. It is based on the assumption that structurally similar molecules frequently have similar properties. Assessment of similarity between small molecules has been highly effective in the discovery and development of various drugs. Especially, two-dimensional (2D) similarity approaches have been quite popular due to their simplicity, accuracy and efficiency. Recently, the focus has been shifted toward the development of methods involving the representation and comparison of three-dimensional (3D) conformation of small molecules. Among the 3D similarity methods, evaluation of shape similarity is now gaining attention for its application not only in virtual screening but also in molecular target prediction, drug repurposing and scaffold hopping. A wide range of methods have been developed to describe molecular shape and to determine the shape similarity between small molecules. The most widely used methods include atom distance-based methods, surface-based approaches such as spherical harmonics and 3D Zernike descriptors, atom-centered Gaussian overlay based representations. Several of these methods demonstrated excellent virtual screening performance not only retrospectively but also prospectively. In addition to methods assessing the similarity between small molecules, shape similarity approaches have been developed to compare shapes of protein structures and binding pockets. Additionally, shape comparisons between atomic models and 3D density maps allowed the fitting of atomic models into cryo-electron microscopy maps. This review aims to summarize the methodological advances in shape similarity assessment highlighting advantages, disadvantages and their application in drug discovery.
Collapse
Affiliation(s)
| | - Kam Y. J. Zhang
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Yokohama, Japan
| |
Collapse
|
53
|
Tiemann JK, Rose AS, Ismer J, Darvish MD, Hilal T, Spahn CM, Hildebrand PW. FragFit: a web-application for interactive modeling of protein segments into cryo-EM density maps. Nucleic Acids Res 2018; 46:W310-W314. [PMID: 29788317 PMCID: PMC6030921 DOI: 10.1093/nar/gky424] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 05/10/2018] [Indexed: 11/20/2022] Open
Abstract
Cryo-electron microscopy (cryo-EM) is a standard method to determine the three-dimensional structures of molecular complexes. However, easy to use tools for modeling of protein segments into cryo-EM maps are sparse. Here, we present the FragFit web-application, a web server for interactive modeling of segments of up to 35 amino acids length into cryo-EM density maps. The fragments are provided by a regularly updated database containing at the moment about 1 billion entries extracted from PDB structures and can be readily integrated into a protein structure. Fragments are selected based on geometric criteria, sequence similarity and fit into a given cryo-EM density map. Web-based molecular visualization with the NGL Viewer allows interactive selection of fragments. The FragFit web-application, accessible at http://proteinformatics.de/FragFit, is free and open to all users, without any login requirements.
Collapse
Affiliation(s)
- Johanna Ks Tiemann
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany.,Institute of Medical Physics and Biophysics, Medical University Leipzig, Leipzig, Sachsen 04107, Germany
| | - Alexander S Rose
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany
| | - Jochen Ismer
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany
| | - Mitra D Darvish
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany
| | - Tarek Hilal
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany
| | - Christian Mt Spahn
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany
| | - Peter W Hildebrand
- Institute of Medical Physics and Biophysics, Charité University Medicine Berlin, Berlin 10117, Germany.,Institute of Medical Physics and Biophysics, Medical University Leipzig, Leipzig, Sachsen 04107, Germany
| |
Collapse
|
54
|
Abstract
Despite the central role of Nuclear Pore Complexes (NPCs) as gatekeepers of RNA and protein transport between the cytoplasm and nucleoplasm, their large size and dynamic nature have impeded a full structural and functional elucidation. Here, we have determined a subnanometer precision structure for the entire 552-protein yeast NPC by satisfying diverse data including stoichiometry, a cryo-electron tomography map, and chemical cross-links. The structure reveals the NPC’s functional elements in unprecedented detail. The NPC is built of sturdy diagonal columns to which are attached connector cables, imbuing both strength and flexibility, while tying together all other elements of the NPC, including membrane-interacting regions and RNA processing platforms. Inwardly-directed anchors create a high density of transport factor-docking Phe-Gly repeats in the central channel, organized in distinct functional units. Taken together, this integrative structure allows us to rationalize the architecture, transport mechanism, and evolutionary origins of the NPC.
Collapse
|
55
|
Abstract
A new Gaussian mixture model (GMM) has been developed for better representations of both atomic models and electron microscopy 3D density maps. The standard GMM algorithm employs an EM algorithm to determine the parameters. It accepted a set of 3D points with weights, corresponding to voxel or atomic centers. Although the standard algorithm worked reasonably well; however, it had three problems. First, it ignored the size (voxel width or atomic radius) of the input, and thus it could lead to a GMM with a smaller spread than the input. Second, the algorithm had a singularity problem, as it sometimes stopped the iterative procedure due to a Gaussian function with almost zero variance. Third, a map with a large number of voxels required a long computation time for conversion to a GMM. To solve these problems, we have introduced a Gaussian-input GMM algorithm, which considers the input atoms or voxels as a set of Gaussian functions. The standard EM algorithm of GMM was extended to optimize the new GMM. The new GMM has identical radius of gyration to the input, and does not suddenly stop due to the singularity problem. For fast computation, we have introduced a down-sampled Gaussian functions (DSG) by merging neighboring voxels into an anisotropic Gaussian function. It provides a GMM with thousands of Gaussian functions in a short computation time. We also have introduced a DSG-input GMM: the Gaussian-input GMM with the DSG as the input. This new algorithm is much faster than the standard algorithm.
Collapse
Affiliation(s)
- Takeshi Kawabata
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan.
| |
Collapse
|
56
|
Rigid-Body Fitting of Atomic Models on 3D Density Maps of Electron Microscopy. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1105:219-235. [PMID: 30617832 DOI: 10.1007/978-981-13-2200-6_14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Cryo electron microscopy has revolutionarily evolved for the determination of the 3D structure of macromolecular complexes. The modeling procedures on the 3D density maps of electron microscopy are roughly classified into three categories: fitting, de novo modeling and refinement. The registered atomic models from the maps have mostly been hand-built and auto-refined. Several programs aiming at automatic modeling have also been developed using various kinds of molecular representations. Among these three classes of the modeling procedures, the rigid body fitting is reviewed here, because it is the most basic modeling process applied before the other steps. The fitting problems are classified as the fittings of single subunit or multiple subunits, and the fittings on global or local parts of maps. A higher resolution map enables more local fitting. Various molecular representations have been employed in the fitting programs. A point and digital image models are generally used to represent molecules, but new representations, such as the Gaussian mixture model, have been applied recently.
Collapse
|
57
|
Ismer J, Rose AS, Tiemann JKS, Hildebrand PW. A fragment based method for modeling of protein segments into cryo-EM density maps. BMC Bioinformatics 2017; 18:475. [PMID: 29132296 PMCID: PMC5683378 DOI: 10.1186/s12859-017-1904-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2017] [Accepted: 11/01/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Single-particle analysis of electron cryo-microscopy (cryo-EM) is a key technology for elucidation of macromolecular structures. Recent technical advances in hardware and software developments significantly enhanced the resolution of cryo-EM density maps and broadened the applicability and the circle of users. To facilitate modeling of macromolecules into cryo-EM density maps, fast and easy to use methods for modeling are now demanded. RESULTS Here we investigated and benchmarked the suitability of a classical and well established fragment-based approach for modeling of segments into cryo-EM density maps (termed FragFit). FragFit uses a hierarchical strategy to select fragments from a pre-calculated set of billions of fragments derived from structures deposited in the Protein Data Bank, based on sequence similarly, fit of stem atoms and fit to a cryo-EM density map. The user only has to specify the sequence of the segment and the number of the N- and C-terminal stem-residues in the protein. Using a representative data set of protein structures, we show that protein segments can be accurately modeled into cryo-EM density maps of different resolution by FragFit. Prediction quality depends on segment length, the type of secondary structure of the segment and local quality of the map. CONCLUSION Fast and automated calculation of FragFit renders it applicable for implementation of interactive web-applications e.g. to model missing segments, flexible protein parts or hinge-regions into cryo-EM density maps.
Collapse
Affiliation(s)
- Jochen Ismer
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany
| | - Alexander S Rose
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany.,RCSB Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, CA, 92093-0743, USA
| | - Johanna K S Tiemann
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany.,Institute of Medical Physics and Biophysics, University Leipzig, Härtelstraße 16-18, 04107, Leipzig, Germany
| | - Peter W Hildebrand
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany. .,Institute of Medical Physics and Biophysics, University Leipzig, Härtelstraße 16-18, 04107, Leipzig, Germany.
| |
Collapse
|
58
|
Webb B, Viswanath S, Bonomi M, Pellarin R, Greenberg CH, Saltzberg D, Sali A. Integrative structure modeling with the Integrative Modeling Platform. Protein Sci 2017; 27:245-258. [PMID: 28960548 DOI: 10.1002/pro.3311] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Revised: 09/23/2017] [Accepted: 09/25/2017] [Indexed: 11/06/2022]
Abstract
Building models of a biological system that are consistent with the myriad data available is one of the key challenges in biology. Modeling the structure and dynamics of macromolecular assemblies, for example, can give insights into how biological systems work, evolved, might be controlled, and even designed. Integrative structure modeling casts the building of structural models as a computational optimization problem, for which information about the assembly is encoded into a scoring function that evaluates candidate models. Here, we describe our open source software suite for integrative structure modeling, Integrative Modeling Platform (https://integrativemodeling.org), and demonstrate its use.
Collapse
Affiliation(s)
- Benjamin Webb
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | - Shruthi Viswanath
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | | | - Riccardo Pellarin
- Structural Bioinformatics Unit, Institut Pasteur, CNRS UMR 3528, Paris, France
| | - Charles H Greenberg
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | - Daniel Saltzberg
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | - Andrej Sali
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| |
Collapse
|
59
|
Kinjo AR, Bekker GJ, Wako H, Endo S, Tsuchiya Y, Sato H, Nishi H, Kinoshita K, Suzuki H, Kawabata T, Yokochi M, Iwata T, Kobayashi N, Fujiwara T, Kurisu G, Nakamura H. New tools and functions in data-out activities at Protein Data Bank Japan (PDBj). Protein Sci 2017; 27:95-102. [PMID: 28815765 PMCID: PMC5734392 DOI: 10.1002/pro.3273] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 08/14/2017] [Indexed: 11/23/2022]
Abstract
The Protein Data Bank Japan (PDBj), a member of the worldwide Protein Data Bank (wwPDB), accepts and processes the deposited data of experimentally determined biological macromolecular structures. In addition to archiving the PDB data in collaboration with the other wwPDB partners, PDBj also provides a wide range of original and unique services and tools, which are continuously improved and updated. Here, we report the new RDB PDBj Mine 2, the WebGL molecular viewer Molmil, the ProMode‐Elastic server for normal mode analysis, a virtual reality system for the eF‐site protein electrostatic molecular surfaces, the extensions of the Omokage search for molecular shape similarity, and the integration of PDBj and BMRB searches.
Collapse
Affiliation(s)
- Akira R Kinjo
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Gert-Jan Bekker
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Hiroshi Wako
- School of Social Sciences, Waseda University, 1-6-1 Nishi-Waseda, Shinjuku-ku, Tokyo, 169-8050, Japan
| | - Shigeru Endo
- School of Science, Kitasato University, 1-15-1, Kitasato, Minami-ku, Sagamihara, Kanagawa, 252-0373, Japan
| | - Yuko Tsuchiya
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Hiromu Sato
- Graduate School of Information Sciences, Tohoku University, 6-3-09 Aoba, Aramaki-aza Aoba-ku, Sendai, 980-8579, Japan
| | - Hafumi Nishi
- Graduate School of Information Sciences, Tohoku University, 6-3-09 Aoba, Aramaki-aza Aoba-ku, Sendai, 980-8579, Japan
| | - Kengo Kinoshita
- Graduate School of Information Sciences, Tohoku University, 6-3-09 Aoba, Aramaki-aza Aoba-ku, Sendai, 980-8579, Japan
| | - Hirofumi Suzuki
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Takeshi Kawabata
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Masashi Yokochi
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Takeshi Iwata
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Naohiro Kobayashi
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Toshimichi Fujiwara
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Genji Kurisu
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Haruki Nakamura
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| |
Collapse
|
60
|
Chen YL, Habeck M. Data-driven coarse graining of large biomolecular structures. PLoS One 2017; 12:e0183057. [PMID: 28817608 PMCID: PMC5560709 DOI: 10.1371/journal.pone.0183057] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 07/29/2017] [Indexed: 11/18/2022] Open
Abstract
Advances in experimental and computational techniques allow us to study the structure and dynamics of large biomolecular assemblies at increasingly higher resolution. However, with increasing structural detail it can be challenging to unravel the mechanism underlying the function of molecular machines. One reason is that atomistic simulations become computationally prohibitive. Moreover it is difficult to rationalize the functional mechanism of systems composed of tens of thousands to millions of atoms by following each atom’s movements. Coarse graining (CG) allows us to understand biological structures from a hierarchical perspective and to gradually zoom into the adequate level of structural detail. This article introduces a Bayesian approach for coarse graining biomolecular structures. We develop a probabilistic model that aims to represent the shape of an experimental structure as a cloud of bead particles. The particles interact via a pairwise potential whose parameters are estimated along with the bead positions and the CG mapping between atoms and beads. Our model can also be applied to density maps obtained by cryo-electron microscopy. We illustrate our approach on various test systems.
Collapse
Affiliation(s)
- Yi-Ling Chen
- Statistical Inverse Problems in Biophysics, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Göttingen, Germany
- Department of NMR based Structural Biology, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Göttingen, Germany
| | - Michael Habeck
- Statistical Inverse Problems in Biophysics, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Göttingen, Germany
- Felix Bernstein Institute for Mathematical Statistics in the Biosciences, Georg August University Göttingen, Goldschmidtstrasse 7, 37077 Göttingen, Germany
- * E-mail:
| |
Collapse
|
61
|
Joseph AP, Lagerstedt I, Patwardhan A, Topf M, Winn M. Improved metrics for comparing structures of macromolecular assemblies determined by 3D electron-microscopy. J Struct Biol 2017; 199:12-26. [PMID: 28552721 PMCID: PMC5479444 DOI: 10.1016/j.jsb.2017.05.007] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2016] [Revised: 05/19/2017] [Accepted: 05/23/2017] [Indexed: 11/28/2022]
Abstract
Recent developments in 3-dimensional electron microcopy (3D-EM) techniques and a concomitant drive to look at complex molecular structures, have led to a rapid increase in the amount of volume data available for biomolecules. This creates a demand for better methods to analyse the data, including improved scores for comparison, classification and integration of data at different resolutions. To this end, we developed and evaluated a set of scoring functions that compare 3D-EM volumes. To test our scores we used a benchmark set of volume alignments derived from the Electron Microscopy Data Bank. We find that the performance of different scores vary with the map-type, resolution and the extent of overlap between volumes. Importantly, adding the overlap information to the local scoring functions can significantly improve their precision and accuracy in a range of resolutions. A combined score involving the local mutual information and overlap (LMI_OV) performs best overall, irrespective of the map category, resolution or the extent of overlap, and we recommend this score for general use. The local mutual information score itself is found to be more discriminatory than cross-correlation coefficient for intermediate-to-low resolution maps or when the map size and density distribution differ significantly. For comparing map surfaces, we implemented two filters to detect the surface points, including one based on the 'extent of surface exposure'. We show that scores that compare surfaces are useful at low resolutions and for maps with evident surface features. All the scores discussed are implemented in TEMPy (http://tempy.ismb.lon.ac.uk/).
Collapse
Affiliation(s)
- Agnel Praveen Joseph
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom; Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Ingvar Lagerstedt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom; Computational Chemistry and Cheminformatics, Lilly UK, Windlesham GU20 6PH, United Kingdom
| | - Ardan Patwardhan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Maya Topf
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom.
| | - Martyn Winn
- Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom.
| |
Collapse
|
62
|
Miyashita O, Kobayashi C, Mori T, Sugita Y, Tama F. Flexible fitting to cryo-EM density map using ensemble molecular dynamics simulations. J Comput Chem 2017; 38:1447-1461. [PMID: 28370077 DOI: 10.1002/jcc.24785] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Revised: 01/26/2017] [Accepted: 02/22/2017] [Indexed: 12/25/2022]
Abstract
Flexible fitting is a computational algorithm to derive a new conformational model that conforms to low-resolution experimental data by transforming a known structure. A common application is against data from cryo-electron microscopy to obtain conformational models in new functional states. The conventional flexible fitting algorithms cannot derive correct structures in some cases due to the complexity of conformational transitions. In this study, we show the importance of conformational ensemble in the refinement process by performing multiple fittings trials using a variety of different force constants. Application to simulated maps of Ca2+ ATPase and diphtheria toxin as well as experimental data of release factor 2 revealed that for these systems, multiple conformations with similar agreement with the density map exist and a large number of fitting trials are necessary to generate good models. Clustering analysis can be an effective approach to avoid over-fitting models. In addition, we show that an automatic adjustment of the biasing force constants during the fitting process, implemented as replica-exchange scheme, can improve the success rate. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Osamu Miyashita
- Advanced Institute for Computational Science, RIKEN, 6-7-1, Minatojima-minami-machi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
| | - Chigusa Kobayashi
- Advanced Institute for Computational Science, RIKEN, 6-7-1, Minatojima-minami-machi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
| | - Takaharu Mori
- Theoretical Molecular Science Laboratory, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama, 351-0198, Japan.,iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama, 351-0198, Japan
| | - Yuji Sugita
- Advanced Institute for Computational Science, RIKEN, 6-7-1, Minatojima-minami-machi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan.,Theoretical Molecular Science Laboratory, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama, 351-0198, Japan.,iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama, 351-0198, Japan.,Quantitative Biology Center, RIKEN, 6-7-1, Minatojima-minami-machi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
| | - Florence Tama
- Advanced Institute for Computational Science, RIKEN, 6-7-1, Minatojima-minami-machi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan.,Department of Physics and ITbM, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi, 464-8602, Japan
| |
Collapse
|
63
|
Versatility of Approximating Single-Particle Electron Microscopy Density Maps Using Pseudoatoms and Approximation-Accuracy Control. BIOMED RESEARCH INTERNATIONAL 2016; 2016:7060348. [PMID: 28097146 PMCID: PMC5209604 DOI: 10.1155/2016/7060348] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Accepted: 11/03/2016] [Indexed: 01/09/2023]
Abstract
Three-dimensional Gaussian functions have been shown useful in representing electron microscopy (EM) density maps for studying macromolecular structure and dynamics. Methods that require setting a desired number of Gaussian functions or a maximum number of iterations may result in suboptimal representations of the structure. An alternative is to set a desired error of approximation of the given EM map and then optimize the number of Gaussian functions to achieve this approximation error. In this article, we review different applications of such an approach that uses spherical Gaussian functions of fixed standard deviation, referred to as pseudoatoms. Some of these applications use EM-map normal mode analysis (NMA) with elastic network model (ENM) (applications such as predicting conformational changes of macromolecular complexes or exploring actual conformational changes by normal-mode-based analysis of experimental data) while some other do not use NMA (denoising of EM density maps). In applications based on NMA and ENM, the advantage of using pseudoatoms in EM-map coarse-grain models is that the ENM springs are easily assigned among neighboring grains thanks to their spherical shape and uniformed size. EM-map denoising based on the map coarse-graining was so far only shown using pseudoatoms as grains.
Collapse
|
64
|
Kinjo AR, Bekker GJ, Suzuki H, Tsuchiya Y, Kawabata T, Ikegawa Y, Nakamura H. Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures. Nucleic Acids Res 2016; 45:D282-D288. [PMID: 27789697 PMCID: PMC5210648 DOI: 10.1093/nar/gkw962] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Revised: 10/06/2016] [Accepted: 10/11/2016] [Indexed: 11/14/2022] Open
Abstract
The Protein Data Bank Japan (PDBj, http://pdbj.org), a member of the worldwide Protein Data Bank (wwPDB), accepts and processes the deposited data of experimentally determined macromolecular structures. While maintaining the archive in collaboration with other wwPDB partners, PDBj also provides a wide range of services and tools for analyzing structures and functions of proteins. We herein outline the updated web user interfaces together with RESTful web services and the backend relational database that support the former. To enhance the interoperability of the PDB data, we have previously developed PDB/RDF, PDB data in the Resource Description Framework (RDF) format, which is now a wwPDB standard called wwPDB/RDF. We have enhanced the connectivity of the wwPDB/RDF data by incorporating various external data resources. Services for searching, comparing and analyzing the ever-increasing large structures determined by hybrid methods are also described.
Collapse
Affiliation(s)
- Akira R Kinjo
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Gert-Jan Bekker
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Hirofumi Suzuki
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Yuko Tsuchiya
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Takeshi Kawabata
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Yasuyo Ikegawa
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Haruki Nakamura
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| |
Collapse
|
65
|
Kuzu G, Keskin O, Nussinov R, Gursoy A. PRISM-EM: template interface-based modelling of multi-protein complexes guided by cryo-electron microscopy density maps. Acta Crystallogr D Struct Biol 2016; 72:1137-1148. [PMID: 27710935 PMCID: PMC5053140 DOI: 10.1107/s2059798316013541] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 08/23/2016] [Indexed: 12/29/2022] Open
Abstract
The structures of protein assemblies are important for elucidating cellular processes at the molecular level. Three-dimensional electron microscopy (3DEM) is a powerful method to identify the structures of assemblies, especially those that are challenging to study by crystallography. Here, a new approach, PRISM-EM, is reported to computationally generate plausible structural models using a procedure that combines crystallographic structures and density maps obtained from 3DEM. The predictions are validated against seven available structurally different crystallographic complexes. The models display mean deviations in the backbone of <5 Å. PRISM-EM was further tested on different benchmark sets; the accuracy was evaluated with respect to the structure of the complex, and the correlation with EM density maps and interface predictions were evaluated and compared with those obtained using other methods. PRISM-EM was then used to predict the structure of the ternary complex of the HIV-1 envelope glycoprotein trimer, the ligand CD4 and the neutralizing protein m36.
Collapse
Affiliation(s)
- Guray Kuzu
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ozlem Keskin
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, 34450 Istanbul, Turkey
- Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research Inc., National Cancer Institute, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Attila Gursoy
- Computer Engineering, Koc University, 34450 Istanbul, Turkey
| |
Collapse
|
66
|
Use of evolutionary information in the fitting of atomic level protein models in low resolution cryo-EM map of a protein assembly improves the accuracy of the fitting. J Struct Biol 2016; 195:294-305. [PMID: 27444391 DOI: 10.1016/j.jsb.2016.07.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Revised: 07/15/2016] [Accepted: 07/18/2016] [Indexed: 11/22/2022]
Abstract
Protein-protein interface residues, especially those at the core of the interface, exhibit higher conservation than residues in solvent exposed regions. Here, we explore the ability of this differential conservation to evaluate fittings of atomic models in low-resolution cryo-EM maps and select models from the ensemble of solutions that are often proposed by different model fitting techniques. As a prelude, using a non-redundant and high-resolution structural dataset involving 125 permanent and 95 transient complexes, we confirm that core interface residues are conserved significantly better than nearby non-interface residues and this result is used in the cryo-EM map analysis. From the analysis of inter-component interfaces in a set of fitted models associated with low-resolution cryo-EM maps of ribosomes, chaperones and proteasomes we note that a few poorly conserved residues occur at interfaces. Interestingly a few conserved residues are not in the interface, though they are close to the interface. These observations raise the potential requirement of refitting the models in the cryo-EM maps. We show that sampling an ensemble of models and selection of models with high residue conservation at the interface and in good agreement with the density helps in improving the accuracy of the fit. This study indicates that evolutionary information can serve as an additional input to improve and validate fitting of atomic models in cryo-EM density maps.
Collapse
|
67
|
Jonić S, Vargas J, Melero R, Gómez-Blanco J, Carazo JM, Sorzano COS. Denoising of high-resolution single-particle electron-microscopy density maps by their approximation using three-dimensional Gaussian functions. J Struct Biol 2016; 194:423-33. [PMID: 27085420 DOI: 10.1016/j.jsb.2016.04.007] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Revised: 04/12/2016] [Accepted: 04/12/2016] [Indexed: 12/22/2022]
Abstract
Cryo-electron microscopy (cryo-EM) of frozen-hydrated preparations of isolated macromolecular complexes is the method of choice to obtain the structure of complexes that cannot be easily studied by other experimental methods due to their flexibility or large size. An increasing number of macromolecular structures are currently being obtained at subnanometer resolution but the interpretation of structural details in such EM-derived maps is often difficult because of noise at these high-frequency signal components that reduces their contrast. In this paper, we show that the method for EM density-map approximation using Gaussian functions can be used for denoising of single-particle EM maps of high (typically subnanometer) resolution. We show its denoising performance using simulated and experimental EM density maps of several complexes.
Collapse
Affiliation(s)
- S Jonić
- IMPMC, Sorbonne Universités - CNRS UMR 7590, UPMC Univ Paris 6, MNHN, IRD UMR 206, 75005 Paris, France.
| | - J Vargas
- Biocomputing Unit, Centro Nacional de Biotecnología - CSIC, Campus de Cantoblanco, Darwin 3, 28049 Madrid, Spain
| | - R Melero
- Biocomputing Unit, Centro Nacional de Biotecnología - CSIC, Campus de Cantoblanco, Darwin 3, 28049 Madrid, Spain
| | - J Gómez-Blanco
- Biocomputing Unit, Centro Nacional de Biotecnología - CSIC, Campus de Cantoblanco, Darwin 3, 28049 Madrid, Spain
| | - J M Carazo
- Biocomputing Unit, Centro Nacional de Biotecnología - CSIC, Campus de Cantoblanco, Darwin 3, 28049 Madrid, Spain
| | - C O S Sorzano
- Biocomputing Unit, Centro Nacional de Biotecnología - CSIC, Campus de Cantoblanco, Darwin 3, 28049 Madrid, Spain
| |
Collapse
|
68
|
Two distinct trimeric conformations of natively membrane-anchored full-length herpes simplex virus 1 glycoprotein B. Proc Natl Acad Sci U S A 2016; 113:4176-81. [PMID: 27035968 DOI: 10.1073/pnas.1523234113] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Many viruses are enveloped by a lipid bilayer acquired during assembly, which is typically studded with one or two types of glycoproteins. These viral surface proteins act as the primary interface between the virus and the host. Entry of enveloped viruses relies on specialized fusogen proteins to help merge the virus membrane with the host membrane. In the multicomponent herpesvirus fusion machinery, glycoprotein B (gB) acts as this fusogen. Although the structure of the gB ectodomain postfusion conformation has been determined, any other conformations (e.g., prefusion, intermediate conformations) have so far remained elusive, thus restricting efforts to develop antiviral treatments and prophylactic vaccines. Here, we have characterized the full-length herpes simplex virus 1 gB in a native membrane by displaying it on cell-derived vesicles and using electron cryotomography. Alongside the known postfusion conformation, a novel one was identified. Its structure, in the context of the membrane, was determined by subvolume averaging and found to be trimeric like the postfusion conformation, but appeared more condensed. Hierarchical constrained density-fitting of domains unexpectedly revealed the fusion loops in this conformation to be apart and pointing away from the anchoring membrane. This vital observation is a substantial step forward in understanding the complex herpesvirus fusion mechanism, and opens up new opportunities for more targeted intervention of herpesvirus entry.
Collapse
|
69
|
Joubert P, Habeck M. Bayesian inference of initial models in cryo-electron microscopy using pseudo-atoms. Biophys J 2016; 108:1165-75. [PMID: 25762328 DOI: 10.1016/j.bpj.2014.12.054] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Revised: 12/11/2014] [Accepted: 12/23/2014] [Indexed: 11/28/2022] Open
Abstract
Single-particle cryo-electron microscopy is widely used to study the structure of macromolecular assemblies. Tens of thousands of noisy two-dimensional images of the macromolecular assembly viewed from different directions are used to infer its three-dimensional structure. The first step is to estimate a low-resolution initial model and initial image orientations. This is a challenging global optimization problem with many unknowns, including an unknown orientation for each two-dimensional image. Obtaining a good initial model is crucial for the success of the subsequent refinement step. We introduce a probabilistic algorithm for estimating an initial model. The algorithm is fast, has very few algorithmic parameters, and yields information about the precision of estimated model parameters in addition to the parameters themselves. Our algorithm uses a pseudo-atomic model to represent the low-resolution three-dimensional structure, with isotropic Gaussian components as moveable pseudo-atoms. This leads to a significant reduction in the number of parameters needed to represent the three-dimensional structure, and a simplified way of computing two-dimensional projections. It also contributes to the speed of the algorithm. We combine the estimation of the unknown three-dimensional structure and image orientations in a Bayesian framework. This ensures that there are very few parameters to set, and specifies how to combine different types of prior information about the structure with the given data in a systematic way. To estimate the model parameters we use Markov chain Monte Carlo sampling. The advantage is that instead of just obtaining point estimates of model parameters, we obtain an ensemble of models revealing the precision of the estimated parameters. We demonstrate the algorithm on both simulated and real data.
Collapse
Affiliation(s)
- Paul Joubert
- Felix-Bernstein Institute for Mathematical Statistics, Georg-August-Universität Göttingen, Göttingen, Germany.
| | - Michael Habeck
- Felix-Bernstein Institute for Mathematical Statistics, Georg-August-Universität Göttingen, Göttingen, Germany; Max Planck Institute for Biophysical Chemistry, Göttingen, Germany.
| |
Collapse
|
70
|
Rakesh R, Srinivasan N. Improving the Accuracy of Fitted Atomic Models in Cryo-EM Density Maps of Protein Assemblies Using Evolutionary Information from Aligned Homologous Proteins. Methods Mol Biol 2016; 1415:193-209. [PMID: 27115634 DOI: 10.1007/978-1-4939-3572-7_10] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Cryo-Electron Microscopy (cryo-EM) has become an important technique to obtain structural insights into large macromolecular assemblies. However the resolution of the density maps do not allow for its interpretation at atomic level. Hence they are combined with high resolution structures along with information from other experimental or bioinformatics techniques to obtain pseudo-atomic models. Here, we describe the use of evolutionary conservation of residues as obtained from protein structures and alignments of homologous proteins to detect errors in the fitting of atomic structures as well as improve accuracy of the protein-protein interfacial regions in the cryo-EM density maps.
Collapse
Affiliation(s)
- Ramachandran Rakesh
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| | | |
Collapse
|
71
|
Pandurangan AP, Vasishtan D, Alber F, Topf M. γ-TEMPy: Simultaneous Fitting of Components in 3D-EM Maps of Their Assembly Using a Genetic Algorithm. Structure 2015; 23:2365-2376. [PMID: 26655474 PMCID: PMC4671957 DOI: 10.1016/j.str.2015.10.013] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Revised: 09/24/2015] [Accepted: 10/01/2015] [Indexed: 12/02/2022]
Abstract
We have developed a genetic algorithm for building macromolecular complexes using only a 3D-electron microscopy density map and the atomic structures of the relevant components. For efficient sampling the method uses map feature points calculated by vector quantization. The fitness function combines a mutual information score that quantifies the goodness of fit with a penalty score that helps to avoid clashes between components. Testing the method on ten assemblies (containing 3–8 protein components) and simulated density maps at 10, 15, and 20 Å resolution resulted in identification of the correct topology in 90%, 70%, and 60% of the cases, respectively. We further tested it on four assemblies with experimental maps at 7.2–23.5 Å resolution, showing the ability of the method to identify the correct topology in all cases. We have also demonstrated the importance of the map feature-point quality on assembly fitting in the lack of additional experimental information. γ-TEMPy uses a genetic algorithm to fit multiple components into 3D-EM density maps The fitness score is a combination of a Mutual Information score and a clash penalty Efficient sampling is aided by using map feature points from vector quantization Native topologies for assemblies containing up to eight components can be predicted
Collapse
Affiliation(s)
- Arun Prasad Pandurangan
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet Street, London WC1E 7HX, UK
| | - Daven Vasishtan
- Division of Structural Biology, Oxford Particle Imaging Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Frank Alber
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI413E, Los Angeles, CA 90089, USA
| | - Maya Topf
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet Street, London WC1E 7HX, UK.
| |
Collapse
|
72
|
Suzuki H, Kawabata T, Nakamura H. Omokage search: shape similarity search service for biomolecular structures in both the PDB and EMDB. Bioinformatics 2015; 32:619-20. [PMID: 26508754 PMCID: PMC4743628 DOI: 10.1093/bioinformatics/btv614] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 10/17/2015] [Indexed: 12/02/2022] Open
Abstract
Summary:Omokage search is a service to search the global shape similarity of biological macromolecules and their assemblies, in both the Protein Data Bank (PDB) and Electron Microscopy Data Bank (EMDB). The server compares global shapes of assemblies independent of sequence order and number of subunits. As a search query, the user inputs a structure ID (PDB ID or EMDB ID) or uploads an atomic model or 3D density map to the server. The search is performed usually within 1 min, using one-dimensional profiles (incremental distance rank profiles) to characterize the shapes. Using the gmfit (Gaussian mixture model fitting) program, the found structures are fitted onto the query structure and their superimposed structures are displayed on the Web browser. Our service provides new structural perspectives to life science researchers. Availability and implementation:Omokage search is freely accessible at http://pdbj.org/omokage/. Contact:hirofumi@protein.osaka-u.ac.jp Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hirofumi Suzuki
- Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Takeshi Kawabata
- Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Haruki Nakamura
- Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan
| |
Collapse
|
73
|
Bettadapura R, Rasheed M, Vollrath A, Bajaj C. PF2fit: Polar Fast Fourier Matched Alignment of Atomistic Structures with 3D Electron Microscopy Maps. PLoS Comput Biol 2015; 11:e1004289. [PMID: 26469938 PMCID: PMC4607507 DOI: 10.1371/journal.pcbi.1004289] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Accepted: 04/14/2015] [Indexed: 11/30/2022] Open
Abstract
There continue to be increasing occurrences of both atomistic structure models in the PDB (possibly reconstructed from X-ray diffraction or NMR data), and 3D reconstructed cryo-electron microscopy (3D EM) maps (albeit at coarser resolution) of the same or homologous molecule or molecular assembly, deposited in the EMDB. To obtain the best possible structural model of the molecule at the best achievable resolution, and without any missing gaps, one typically aligns (match and fits) the atomistic structure model with the 3D EM map. We discuss a new algorithm and generalized framework, named PF(2) fit (Polar Fast Fourier Fitting) for the best possible structural alignment of atomistic structures with 3D EM. While PF(2) fit enables only a rigid, six dimensional (6D) alignment method, it augments prior work on 6D X-ray structure and 3D EM alignment in multiple ways: Scoring. PF(2) fit includes a new scoring scheme that, in addition to rewarding overlaps between the volumes occupied by the atomistic structure and 3D EM map, rewards overlaps between the volumes complementary to them. We quantitatively demonstrate how this new complementary scoring scheme improves upon existing approaches. PF(2) fit also includes two scoring functions, the non-uniform exterior penalty and the skeleton-secondary structure score, and implements the scattering potential score as an alternative to traditional Gaussian blurring. Search. PF(2) fit utilizes a fast polar Fourier search scheme, whose main advantage is the ability to search over uniformly and adaptively sampled subsets of the space of rigid-body motions. PF(2) fit also implements a new reranking search and scoring methodology that considerably improves alignment metrics in results obtained from the initial search.
Collapse
Affiliation(s)
- Radhakrishna Bettadapura
- Radhakrishna Bettadapura Computational Visualization Center/Department of Mechanical Engineering, University of Texas at Austin, Austin, Texas, United States of America
| | - Muhibur Rasheed
- Muhibur Rasheed Computational Visualization Center/Department of Mechanical Engineering, University of Texas at Austin, Austin, Texas, United States of America
| | - Antje Vollrath
- Antje Vollrath Institut Computational Mathematics, Technische Universität Braunschweig, Braunschweig, Germany
| | - Chandrajit Bajaj
- Chandrajit Bajaj Computational Visualization Center/Institute of Computational Engineering & Sciences/Department of Computer Science, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
74
|
Integrative Modeling of Biomolecular Complexes: HADDOCKing with Cryo-Electron Microscopy Data. Structure 2015; 23:949-960. [DOI: 10.1016/j.str.2015.03.014] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Revised: 03/12/2015] [Accepted: 03/13/2015] [Indexed: 12/13/2022]
|
75
|
López-Blanco JR, Chacón P. Structural modeling from electron microscopy data. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014. [DOI: 10.1002/wcms.1199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| | - Pablo Chacón
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| |
Collapse
|
76
|
Thalassinos K, Pandurangan AP, Xu M, Alber F, Topf M. Conformational States of macromolecular assemblies explored by integrative structure calculation. Structure 2014; 21:1500-8. [PMID: 24010709 PMCID: PMC3988990 DOI: 10.1016/j.str.2013.08.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2013] [Revised: 08/10/2013] [Accepted: 08/12/2013] [Indexed: 12/22/2022]
Abstract
A detailed description of macromolecular assemblies in multiple conformational states can be very valuable for understanding cellular processes. At present, structural determination of most assemblies in different biologically relevant conformations cannot be achieved by a single technique and thus requires an integrative approach that combines information from multiple sources. Different techniques require different computational methods to allow efficient and accurate data processing and analysis. Here, we summarize the latest advances and future challenges in computational methods that help the interpretation of data from two techniques—mass spectrometry and three-dimensional cryo-electron microscopy (with focus on alignment and classification of heterogeneous subtomograms from cryo-electron tomography). We evaluate how new developments in these two broad fields will lead to further integration with atomic structures to broaden our picture of the dynamic behavior of assemblies in their native environment.
Collapse
Affiliation(s)
- Konstantinos Thalassinos
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK
| | | | | | | | | |
Collapse
|
77
|
Esquivel-Rodríguez J, Kihara D. Computational methods for constructing protein structure models from 3D electron microscopy maps. J Struct Biol 2013; 184:93-102. [PMID: 23796504 DOI: 10.1016/j.jsb.2013.06.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/11/2013] [Accepted: 06/13/2013] [Indexed: 12/31/2022]
Abstract
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Collapse
Affiliation(s)
- Juan Esquivel-Rodríguez
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
78
|
de Vries SJ, Zacharias M. ATTRACT-EM: a new method for the computational assembly of large molecular machines using cryo-EM maps. PLoS One 2012; 7:e49733. [PMID: 23251350 PMCID: PMC3522670 DOI: 10.1371/journal.pone.0049733] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 10/17/2012] [Indexed: 11/23/2022] Open
Abstract
Many of the most important functions in the cell are carried out by proteins organized in large molecular machines. Cryo-electron microscopy (cryo-EM) is increasingly being used to obtain low resolution density maps of these large assemblies. A new method, ATTRACT-EM, for the computational assembly of molecular assemblies from their components has been developed. Based on concepts from the protein-protein docking field, it utilizes cryo-EM density maps to assemble molecular subunits at near atomic detail, starting from millions of initial subunit configurations. The search efficiency was further enhanced by recombining partial solutions, the inclusion of symmetry information, and refinement using a molecular force field. The approach was tested on the GroES-GroEL system, using an experimental cryo-EM map at 23.5 Å resolution, and on several smaller complexes. Inclusion of experimental information on the symmetry of the systems and the application of a new gradient vector matching algorithm allowed the efficient identification of docked assemblies in close agreement with experiment. Application to the GroES-GroEL complex resulted in a top ranked model with a deviation of 4.6 Å (and a 2.8 Å model within the top 10) from the GroES-GroEL crystal structure, a significant improvement over existing methods.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- Physik-Department T38, Technische Universität München, Garching, Germany.
| | | |
Collapse
|
79
|
Esquivel-Rodríguez J, Kihara D. Fitting multimeric protein complexes into electron microscopy maps using 3D Zernike descriptors. J Phys Chem B 2012; 116:6854-61. [PMID: 22417139 PMCID: PMC3376205 DOI: 10.1021/jp212612t] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A novel computational method for fitting high-resolution structures of multiple proteins into a cryoelectron microscopy map is presented. The method named EMLZerD generates a pool of candidate multiple protein docking conformations of component proteins, which are later compared with a provided electron microscopy (EM) density map to select the ones that fit well into the EM map. The comparison of docking conformations and the EM map is performed using the 3D Zernike descriptor (3DZD), a mathematical series expansion of three-dimensional functions. The 3DZD provides a unified representation of the surface shape of multimeric protein complex models and EM maps, which allows a convenient, fast quantitative comparison of the three-dimensional structural data. Out of 19 multimeric complexes tested, near native complex structures with a root-mean-square deviation of less than 2.5 Å were obtained for 14 cases while medium range resolution structures with correct topology were computed for the additional 5 cases.
Collapse
Affiliation(s)
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
- Markey Center for Structural Biology, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
80
|
Trinh MH, Odorico M, Pique ME, Teulon JM, Roberts VA, Ten Eyck LF, Getzoff ED, Parot P, Chen SWW, Pellequer JL. Computational reconstruction of multidomain proteins using atomic force microscopy data. Structure 2012; 20:113-20. [PMID: 22244760 DOI: 10.1016/j.str.2011.10.023] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2011] [Revised: 10/05/2011] [Accepted: 10/10/2011] [Indexed: 01/10/2023]
Abstract
Classical structural biology techniques face a great challenge to determine the structure at the atomic level of large and flexible macromolecules. We present a novel methodology that combines high-resolution AFM topographic images with atomic coordinates of proteins to assemble very large macromolecules or particles. Our method uses a two-step protocol: atomic coordinates of individual domains are docked beneath the molecular surface of the large macromolecule, and then each domain is assembled using a combinatorial search. The protocol was validated on three test cases: a simulated system of antibody structures; and two experimentally based test cases: Tobacco mosaic virus, a rod-shaped virus; and Aquaporin Z, a bacterial membrane protein. We have shown that AFM-intermediate resolution topography and partial surface data are useful constraints for building macromolecular assemblies. The protocol is applicable to multicomponent structures connected in the polypeptide chain or as disjoint molecules. The approach effectively increases the resolution of AFM beyond topographical information down to atomic-detail structures.
Collapse
Affiliation(s)
- Minh-Hieu Trinh
- CEA, iBEB, Department of Biochemistry and Nuclear Toxicology, F-30207 Bagnols sur Cèze, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
81
|
Tjioe E, Lasker K, Webb B, Wolfson HJ, Sali A. MultiFit: a web server for fitting multiple protein structures into their electron microscopy density map. Nucleic Acids Res 2011; 39:W167-70. [PMID: 21715383 PMCID: PMC3125811 DOI: 10.1093/nar/gkr490] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Advances in electron microscopy (EM) allow for structure determination of large biological assemblies at increasingly higher resolutions. A key step in this process is fitting multiple component structures into an EM-derived density map of their assembly. Here, we describe a web server for this task. The server takes as input a set of protein structures in the PDB format and an EM density map in the MRC format. The output is an ensemble of models ranked by their quality of fit to the density map. The models can be viewed online or downloaded from the website. The service is available at; http://salilab.org/multifit/ and http://bioinfo3d.cs.tau.ac.il/.
Collapse
Affiliation(s)
- Elina Tjioe
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | | | | | | | | |
Collapse
|
82
|
Beck M, Topf M, Frazier Z, Tjong H, Xu M, Zhang S, Alber F. Exploring the spatial and temporal organization of a cell's proteome. J Struct Biol 2011; 173:483-96. [PMID: 21094684 PMCID: PMC3784337 DOI: 10.1016/j.jsb.2010.11.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2010] [Revised: 11/05/2010] [Accepted: 11/08/2010] [Indexed: 10/18/2022]
Abstract
To increase our current understanding of cellular processes, such as cell signaling and division, knowledge is needed about the spatial and temporal organization of the proteome at different organizational levels. These levels cover a wide range of length and time scales: from the atomic structures of macromolecules for inferring their molecular function, to the quantitative description of their abundance, and spatial distribution in the cell. Emerging new experimental technologies are greatly increasing the availability of such spatial information on the molecular organization in living cells. This review addresses three fields that have significantly contributed to our understanding of the proteome's spatial and temporal organization: first, methods for the structure determination of individual macromolecular assemblies, specifically the fitting of atomic structures into density maps generated from electron microscopy techniques; second, research that visualizes the spatial distributions of these complexes within the cellular context using cryo electron tomography techniques combined with computational image processing; and third, methods for the spatial modeling of the dynamic organization of the proteome, specifically those methods for simulating reaction and diffusion of proteins and complexes in crowded intracellular fluids. The long-term goal is to integrate the varied data about a proteome's organization into a spatially explicit, predictive model of cellular processes.
Collapse
Affiliation(s)
- Martin Beck
- European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Maya Topf
- Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK
| | - Zachary Frazier
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Harianto Tjong
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Min Xu
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Shihua Zhang
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Frank Alber
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| |
Collapse
|
83
|
Sael L, Kihara D. Improved protein surface comparison and application to low-resolution protein structure data. BMC Bioinformatics 2010; 11 Suppl 11:S2. [PMID: 21172052 PMCID: PMC3024873 DOI: 10.1186/1471-2105-11-s11-s2] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Background Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. Results The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Conclusions Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.
Collapse
Affiliation(s)
- Lee Sael
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA.
| | | |
Collapse
|
84
|
Birmanns S, Rusu M, Wriggers W. Using Sculptor and Situs for simultaneous assembly of atomic components into low-resolution shapes. J Struct Biol 2010; 173:428-35. [PMID: 21078392 DOI: 10.1016/j.jsb.2010.11.002] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2010] [Revised: 10/31/2010] [Accepted: 11/04/2010] [Indexed: 10/18/2022]
Abstract
We describe an integrated software system called Sculptor that combines visualization capabilities with molecular modeling algorithms for the analysis of multi-scale data sets. Sculptor features extensive special purpose visualization techniques that are based on modern GPU programming and are capable of representing complex molecular assemblies in real-time. The integration of graphics and modeling offers several advantages. The user interface not only eases the usually steep learning curve of pure algorithmic techniques, but it also permits instant analysis and post-processing of results, as well as the integration of results from external software. Here, we implemented an interactive peak-selection strategy that enables the user to explore a preliminary score landscape generated by the colors tool of Situs. The interactive placement of components, one at a time, is advantageous for low-resolution or ambiguously shaped maps, which are sometimes difficult to interpret by the fully automatic peak selection of colors. For the subsequent refinement of the preliminary models resulting from both interactive and automatic peak selection, we have implemented a novel simultaneous multi-body docking in Sculptor and Situs that softly enforces shape complementarities between components using the normalization of the cross-correlation coefficient. The proposed techniques are freely available in Situs version 2.6 and Sculptor version 2.0.
Collapse
Affiliation(s)
- Stefan Birmanns
- University of Texas School of Biomedical Informatics at Houston, 7000 Fannin St. UCT 600, Houston, TX 77030, USA.
| | | | | |
Collapse
|
85
|
Zhang S, Vasishtan D, Xu M, Topf M, Alber F. A fast mathematical programming procedure for simultaneous fitting of assembly components into cryoEM density maps. Bioinformatics 2010; 26:i261-8. [PMID: 20529915 PMCID: PMC2881386 DOI: 10.1093/bioinformatics/btq201] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
MOTIVATION Single-particle cryo electron microscopy (cryoEM) typically produces density maps of macromolecular assemblies at intermediate to low resolution (approximately 5-30 A). By fitting high-resolution structures of assembly components into these maps, pseudo-atomic models can be obtained. Optimizing the quality-of-fit of all components simultaneously is challenging due to the large search space that makes the exhaustive search over all possible component configurations computationally unfeasible. RESULTS We developed an efficient mathematical programming algorithm that simultaneously fits all component structures into an assembly density map. The fitting is formulated as a point set matching problem involving several point sets that represent component and assembly densities at a reduced complexity level. In contrast to other point matching algorithms, our algorithm is able to match multiple point sets simultaneously and not only based on their geometrical equivalence, but also based on the similarity of the density in the immediate point neighborhood. In addition, we present an efficient refinement method based on the Iterative Closest Point registration algorithm. The integer quadratic programming method generates an assembly configuration in a few seconds. This efficiency allows the generation of an ensemble of candidate solutions that can be assessed by an independent scoring function. We benchmarked the method using simulated density maps of 11 protein assemblies at 20 A, and an experimental cryoEM map at 23.5 A resolution. Our method was able to generate assembly structures with root-mean-square errors <6.5 A, which have been further reduced to <1.8 A by the local refinement procedure. AVAILABILITY The program is available upon request as a Matlab code package. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics Online.
Collapse
Affiliation(s)
- Shihua Zhang
- Program in Molecular and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | | | | | | | | |
Collapse
|
86
|
Lasker K, Topf M, Sali A, Wolfson HJ. Inferential optimization for simultaneous fitting of multiple components into a CryoEM map of their assembly. J Mol Biol 2009; 388:180-94. [PMID: 19233204 DOI: 10.1016/j.jmb.2009.02.031] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2008] [Revised: 12/29/2008] [Accepted: 02/12/2009] [Indexed: 11/24/2022]
Abstract
Models of macromolecular assemblies are essential for a mechanistic description of cellular processes. Such models are increasingly obtained by fitting atomic-resolution structures of components into a density map of the whole assembly. Yet, current density-fitting techniques are frequently insufficient for an unambiguous determination of the positions and orientations of all components. Here, we describe MultiFit, a method used for simultaneously fitting atomic structures of components into their assembly density map at resolutions as low as 25 A. The component positions and orientations are optimized with respect to a scoring function that includes the quality-of-fit of components in the map, the protrusion of components from the map envelope, and the shape complementarity between pairs of components. The scoring function is optimized by our exact inference optimizer DOMINO (Discrete Optimization of Multiple INteracting Objects) that efficiently finds the global minimum in a discrete sampling space. MultiFit was benchmarked on seven assemblies of known structure, consisting of up to seven proteins each. The input atomic structures of the components were obtained from the Protein Data Bank, as well as by comparative modeling based on a 16-99% sequence identity to a template structure. A near-native configuration was usually found as the top-scoring model. Therefore, MultiFit can provide initial configurations for further refinement of many multicomponent assembly structures described by electron microscopy.
Collapse
Affiliation(s)
- Keren Lasker
- Blavatnik School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel-Aviv 69978, Israel.
| | | | | | | |
Collapse
|