1
|
Corum MR, Venkannagari H, Hryc CF, Baker ML. Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure. Biophys J 2024; 123:435-450. [PMID: 38268190 PMCID: PMC10912932 DOI: 10.1016/j.bpj.2024.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Over the last 15 years, structural biology has seen unprecedented development and improvement in two areas: electron cryo-microscopy (cryo-EM) and predictive modeling. Once relegated to low resolutions, single-particle cryo-EM is now capable of achieving near-atomic resolutions of a wide variety of macromolecular complexes. Ushered in by AlphaFold, machine learning has powered the current generation of predictive modeling tools, which can accurately and reliably predict models for proteins and some complexes directly from the sequence alone. Although they offer new opportunities individually, there is an inherent synergy between these techniques, allowing for the construction of large, complex macromolecular models. Here, we give a brief overview of these approaches in addition to illustrating works that combine these techniques for model building. These examples provide insight into model building, assessment, and limitations when integrating predictive modeling with cryo-EM density maps. Together, these approaches offer the potential to greatly accelerate the generation of macromolecular structural insights, particularly when coupled with experimental data.
Collapse
Affiliation(s)
- Michael R Corum
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Harikanth Venkannagari
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas.
| |
Collapse
|
2
|
Beton JG, Cragnolini T, Kaleel M, Mulvaney T, Sweeney A, Topf M. Integrating model simulation tools and
cryo‐electron
microscopy. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Joseph George Beton
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Tristan Cragnolini
- Institute of Structural and Molecular Biology, Birkbeck and University College London London UK
| | - Manaz Kaleel
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Aaron Sweeney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| |
Collapse
|
3
|
Hryc CF, Baker ML. AlphaFold2 and CryoEM: Revisiting CryoEM modeling in near-atomic resolution density maps. iScience 2022; 25:104496. [PMID: 35733789 PMCID: PMC9207676 DOI: 10.1016/j.isci.2022.104496] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 04/07/2022] [Accepted: 05/24/2022] [Indexed: 11/27/2022] Open
Abstract
With the advent of new artificial intelligence and machine learning algorithms, predictive modeling can, in some cases, produce structures on par with experimental methods. The combination of predictive modeling and experimental structure determination by electron cryomicroscopy (cryoEM) offers a tantalizing approach for producing robust atomic models of macromolecular assemblies. Here, we apply AlphaFold2 to a set of community standard data sets and compare the results with the corresponding reference maps and models. Moreover, we present three unique case studies from previously determined cryoEM density maps of viruses. Our results show that AlphaFold2 can not only produce reasonably accurate models for analysis and additional hypotheses testing, but can also potentially yield incorrect structures if not properly validated with experimental data. Whereas we outline numerous shortcomings and potential pitfalls of predictive modeling, the obvious synergy between predictive modeling and cryoEM will undoubtedly result in new computational modeling tools.
Collapse
Affiliation(s)
- Corey F. Hryc
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, McGovern Medical School at The University of Texas Health Science Center at Houston, 6431 Fannin Street, Houston, TX 77030, USA
| | - Matthew L. Baker
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, McGovern Medical School at The University of Texas Health Science Center at Houston, 6431 Fannin Street, Houston, TX 77030, USA
| |
Collapse
|
4
|
Hryc CF, Baker ML. Beyond the Backbone: The Next Generation of Pathwalking Utilities for Model Building in CryoEM Density Maps. Biomolecules 2022; 12:773. [PMID: 35740898 PMCID: PMC9220806 DOI: 10.3390/biom12060773] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 05/25/2022] [Accepted: 05/30/2022] [Indexed: 01/18/2023] Open
Abstract
Single-particle electron cryomicroscopy (cryoEM) has become an indispensable tool for studying structure and function in macromolecular assemblies. As an integral part of the cryoEM structure determination process, computational tools have been developed to build atomic models directly from a density map without structural templates. Nearly a decade ago, we created Pathwalking, a tool for de novo modeling of protein structure in near-atomic resolution cryoEM density maps. Here, we present the latest developments in Pathwalking, including the addition of probabilistic models, as well as a companion tool for modeling waters and ligands. This software was evaluated on the 2021 CryoEM Ligand Challenge density maps, in addition to identifying ligands in three IP3R1 density maps at ~3 Å to 4.1 Å resolution. The results clearly demonstrate that the Pathwalking de novo modeling pipeline can construct accurate protein structures and reliably localize and identify ligand density directly from a near-atomic resolution map.
Collapse
Affiliation(s)
| | - Matthew L. Baker
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, McGovern Medical School, The University of Texas Health Science Center, 6431 Fannin Street, Houston, TX 77030, USA;
| |
Collapse
|
5
|
Zumbado-Corrales M, Esquivel-Rodríguez J. EvoSeg: Automated Electron Microscopy Segmentation through Random Forests and Evolutionary Optimization. Biomimetics (Basel) 2021; 6:biomimetics6020037. [PMID: 34206006 PMCID: PMC8293153 DOI: 10.3390/biomimetics6020037] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 05/17/2021] [Accepted: 05/28/2021] [Indexed: 11/30/2022] Open
Abstract
Electron Microscopy Maps are key in the study of bio-molecular structures, ranging from borderline atomic level to the sub-cellular range. These maps describe the envelopes that cover possibly a very large number of proteins that form molecular machines within the cell. Within those envelopes, we are interested to find what regions correspond to specific proteins so that we can understand how they function, and design drugs that can enhance or suppress a process that they are involved in, along with other experimental purposes. A classic approach by which we can begin the exploration of map regions is to apply a segmentation algorithm. This yields a mask where each voxel in 3D space is assigned an identifier that maps it to a segment; an ideal segmentation would map each segment to one protein unit, which is rarely the case. In this work, we present a method that uses bio-inspired optimization, through an Evolutionary-Optimized Segmentation algorithm, to iteratively improve upon baseline segments obtained from a classical approach, called watershed segmentation. The cost function used by the evolutionary optimization is based on an ideal segmentation classifier trained as part of this development, which uses basic structural information available to scientists, such as the number of expected units, volume and topology. We show that a basic initial segmentation with the additional information allows our evolutionary method to find better segmentation results, compared to the baseline generated by the watershed.
Collapse
|
6
|
Seffernick JT, Lindert S. Hybrid methods for combined experimental and computational determination of protein structure. J Chem Phys 2020; 153:240901. [PMID: 33380110 PMCID: PMC7773420 DOI: 10.1063/5.0026025] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 11/10/2020] [Indexed: 02/04/2023] Open
Abstract
Knowledge of protein structure is paramount to the understanding of biological function, developing new therapeutics, and making detailed mechanistic hypotheses. Therefore, methods to accurately elucidate three-dimensional structures of proteins are in high demand. While there are a few experimental techniques that can routinely provide high-resolution structures, such as x-ray crystallography, nuclear magnetic resonance (NMR), and cryo-EM, which have been developed to determine the structures of proteins, these techniques each have shortcomings and thus cannot be used in all cases. However, additionally, a large number of experimental techniques that provide some structural information, but not enough to assign atomic positions with high certainty have been developed. These methods offer sparse experimental data, which can also be noisy and inaccurate in some instances. In cases where it is not possible to determine the structure of a protein experimentally, computational structure prediction methods can be used as an alternative. Although computational methods can be performed without any experimental data in a large number of studies, inclusion of sparse experimental data into these prediction methods has yielded significant improvement. In this Perspective, we cover many of the successes of integrative modeling, computational modeling with experimental data, specifically for protein folding, protein-protein docking, and molecular dynamics simulations. We describe methods that incorporate sparse data from cryo-EM, NMR, mass spectrometry, electron paramagnetic resonance, small-angle x-ray scattering, Förster resonance energy transfer, and genetic sequence covariation. Finally, we highlight some of the major challenges in the field as well as possible future directions.
Collapse
Affiliation(s)
- Justin T. Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
7
|
Zhang B, Zhang X, Pearce R, Shen HB, Zhang Y. A New Protocol for Atomic-Level Protein Structure Modeling and Refinement Using Low-to-Medium Resolution Cryo-EM Density Maps. J Mol Biol 2020; 432:5365-5377. [PMID: 32771523 DOI: 10.1016/j.jmb.2020.07.027] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Revised: 07/14/2020] [Accepted: 07/31/2020] [Indexed: 12/19/2022]
Abstract
The rapid progress of cryo-electron microscopy (cryo-EM) in structural biology has raised an urgent need for robust methods to create and refine atomic-level structural models using low-resolution EM density maps. We propose a new protocol to create initial models using I-TASSER protein structure prediction, followed by EM density map-based rigid-body structure fitting, flexible fragment adjustment and atomic-level structure refinement simulations. The protocol was tested on a large set of 285 non-homologous proteins and generated structural models with correct folds for 260 proteins, where 28% had RMSDs below 2 Å. Compared to other state-of-the-art methods, the major advantage of the proposed pipeline lies in the uniform structure prediction and refinement protocol, as well as the extensive structural re-assembly simulations, which allow for low-to-medium resolution EM density map-guided structure modeling starting from amino acid sequences. Interestingly, the quality of both the image fitting and subsequent structure refinement was found to be strongly correlated with the correctness of the initial I-TASSER models; this is mainly due to the different correlation patterns observed between force field and structural quality for the models with template modeling score (or TM-score, a metric quantifying the similarity of models to the native) above and below a threshold of 0.5. Overall, the results demonstrate a new avenue that is ready to use for large-scale cryo-EM-based structure modeling and atomic-level density map-guided structure refinement.
Collapse
Affiliation(s)
- Biao Zhang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xi Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China.
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
8
|
Alnabati E, Kihara D. Advances in Structure Modeling Methods for Cryo-Electron Microscopy Maps. Molecules 2019; 25:molecules25010082. [PMID: 31878333 PMCID: PMC6982917 DOI: 10.3390/molecules25010082] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 12/20/2019] [Accepted: 12/20/2019] [Indexed: 01/16/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) has now become a widely used technique for structure determination of macromolecular complexes. For modeling molecular structures from density maps of different resolutions, many algorithms have been developed. These algorithms can be categorized into rigid fitting, flexible fitting, and de novo modeling methods. It is also observed that machine learning (ML) techniques have been increasingly applied following the rapid progress of the ML field. Here, we review these different categories of macromolecule structure modeling methods and discuss their advances over time.
Collapse
Affiliation(s)
- Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Correspondence:
| |
Collapse
|
9
|
Wen C, Zhang J, Duan Y, Zhang H, Ma H. A Mini‐Review on Brewer's Spent Grain Protein: Isolation, Physicochemical Properties, Application of Protein, and Functional Properties of Hydrolysates. J Food Sci 2019; 84:3330-3340. [DOI: 10.1111/1750-3841.14906] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Revised: 08/30/2019] [Accepted: 10/01/2019] [Indexed: 01/17/2023]
Affiliation(s)
- Chaoting Wen
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
| | - Jixian Zhang
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
| | - Yuqing Duan
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
- Inst. of Food Physical ProcessingJiangsu Univ. Zhenjiang 212013 China
| | - Haihui Zhang
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
| | - Haile Ma
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
- Inst. of Food Physical ProcessingJiangsu Univ. Zhenjiang 212013 China
| |
Collapse
|
10
|
Yu Z, Yao Y, Deng H, Yi M. ANDIS: an atomic angle- and distance-dependent statistical potential for protein structure quality assessment. BMC Bioinformatics 2019; 20:299. [PMID: 31159742 PMCID: PMC6547486 DOI: 10.1186/s12859-019-2898-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Accepted: 05/13/2019] [Indexed: 01/05/2023] Open
Abstract
Background The knowledge-based statistical potential has been widely used in protein structure modeling and model quality assessment. They are commonly evaluated based on their abilities of native recognition as well as decoy discrimination. However, these two aspects are found to be mutually exclusive in many statistical potentials. Results We developed an atomic ANgle- and DIStance-dependent (ANDIS) statistical potential for protein structure quality assessment with distance cutoff being a tunable parameter. When distance cutoff is ≤9.0 Å, “effective atomic interaction” is employed to enhance the ability of native recognition. For a distance cutoff of ≥10 Å, the distance-dependent atom-pair potential with random-walk reference state is combined to strengthen the ability of decoy discrimination. Benchmark tests on 632 structural decoy sets from diverse sources demonstrate that ANDIS outperforms other state-of-the-art potentials in both native recognition and decoy discrimination. Conclusions Distance cutoff is a crucial parameter for distance-dependent statistical potentials. A lower distance cutoff is better for native recognition, while a higher one is favorable for decoy discrimination. The ANDIS potential is freely available as a standalone application at http://qbp.hzau.edu.cn/ANDIS/. Electronic supplementary material The online version of this article (10.1186/s12859-019-2898-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zhongwang Yu
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070, China
| | - Yuangen Yao
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070, China
| | - Haiyou Deng
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070, China. .,Institute of Applied Physics, Huazhong Agricultural University, Wuhan, 430070, China.
| | - Ming Yi
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070, China. .,Institute of Applied Physics, Huazhong Agricultural University, Wuhan, 430070, China.
| |
Collapse
|
11
|
Kumar KK, Devi BU, Neeraja P. Molecular activities and ligand-binding specificities of StAR-related lipid transfer domains: exploring integrated in silico methods and ensemble-docking approaches. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2018; 29:483-501. [PMID: 29688061 DOI: 10.1080/1062936x.2018.1462847] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Accepted: 04/05/2018] [Indexed: 06/08/2023]
Abstract
In this study, cholesterol biotransformation gene-set of human steroidogenic acute regulatory protein-related lipid transfer (START) domains were evaluated from high-throughput gene screening approaches. It was shown that STARD1, STARD3 and STARD4 proteins are better effective transporters of cholesterol than STARD5 and STARD6 domains. Docking studies show a strong agreement with gene ontology enrichment data. According to both complementary strategies, it was found that only STARD1, STARD3 and STARD4 are potentially involved in cholesterol biotransformation in mitochondria through Ω1-loop of C-terminal α4-helical domain. Ensemble docking assessment for a set of selected chemicals of protein-chemical networks has shown possible binding probabilities with START domains. Among those, reproductive toxicity evoked drugs (mifepristone), insecticides (rotenone), tobacco pulmonary carcinogens (benzo(a)pyrene) and endocrine disruptor chemicals (EDCs) including perfluorooctanesulfonic acid (PFOS) and aflatoxin B1 (AFB1) potentially bound with novel hotspot residues of the α4-helical domain. Compound representation space and clustering approaches reveal that the START proteins show more sensitivity with these lead scaffolds, so they could provide probable barrier assets in cholesterol and steroidogenic acute regulatory (StAR) binding and leads adverse consequences in steroidogenesis. These findings indicate potential START domains and their binding levels with toxic chemicals; sorted viewpoints could be useful as a promising way to identify chemicals with related steroidogenisis impacts on human health.
Collapse
Affiliation(s)
- K Kranthi Kumar
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| | - B Uma Devi
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| | - P Neeraja
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| |
Collapse
|
12
|
Al Nasr K, Yousef F, Jebril R, Jones C. Analytical Approaches to Improve Accuracy in Solving the Protein Topology Problem. Molecules 2018; 23:E28. [PMID: 29360779 PMCID: PMC6017786 DOI: 10.3390/molecules23020028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 01/19/2018] [Accepted: 01/19/2018] [Indexed: 11/17/2022] Open
Abstract
To take advantage of recent advances in genomics and proteomics it is critical that the three-dimensional physical structure of biological macromolecules be determined. Cryo-Electron Microscopy (cryo-EM) is a promising and improving method for obtaining this data, however resolution is often not sufficient to directly determine the atomic scale structure. Despite this, information for secondary structure locations is detectable. De novo modeling is a computational approach to modeling these macromolecular structures based on cryo-EM derived data. During de novo modeling a mapping between detected secondary structures and the underlying amino acid sequence must be identified. DP-TOSS (Dynamic Programming for determining the Topology Of Secondary Structures) is one tool that attempts to automate the creation of this mapping. By treating the correspondence between the detected structures and the structures predicted from sequence data as a constraint graph problem DP-TOSS achieved good accuracy in its original iteration. In this paper, we propose modifications to the scoring methodology of DP-TOSS to improve its accuracy. Three scoring schemes were applied to DP-TOSS and tested: (i) a skeleton-based scoring function; (ii) a geometry-based analytical function; and (iii) a multi-well potential energy-based function. A test of 25 proteins shows that a combination of these schemes can improve the performance of DP-TOSS to solve the topology determination problem for macromolecule proteins.
Collapse
Affiliation(s)
- Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Feras Yousef
- Department of Mathematics, The University of Jordan, Amman 11942, Jordan.
| | - Ruba Jebril
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Christopher Jones
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| |
Collapse
|
13
|
Al Nasr K, Jones C, Yousef F, Jebril R. PEM-fitter: A Coarse-Grained Method to Validate Protein Candidate Models. J Comput Biol 2017; 25:21-32. [PMID: 29140718 DOI: 10.1089/cmb.2017.0191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The volumetric images produced by Cryo-Electron Microscopy (cryo-EM) technique are used to model macromolecular assemblies and machines. De novo protein modeling uses these images to computationally model the structure of the molecules. Many candidate conformations are usually generated during the intermediate step. Conventionally, each of these candidates is evaluated by time-consuming approaches such as potential energy. We introduce an initial version of a geometrical screening method that uses the skeleton of the cryo-EM images to evaluate candidate structures. The aim of this method is to reduce the number of native-like candidate conformations and, therefore, reduce the time required for structural evaluation by energy calculations. A test of two datasets was performed. The first dataset contains 10 proteins and shows that our method can successfully detect the correct native structure for the given skeleton among a set of different protein structures. The second dataset contains 12 proteins and shows that our method can filter slightly modified decoy conformations of the same protein. The efficiency of the method is also reported.
Collapse
Affiliation(s)
- Kamal Al Nasr
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| | - Christopher Jones
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| | - Feras Yousef
- 2 Department of Mathematics, The University of Jordan , Amman, Jordan
| | - Ruba Jebril
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| |
Collapse
|
14
|
Borkotoky S, Kumar Meena C, Bhalerao GM, Murali A. An in-silico glimpse into the pH dependent structural changes of T7 RNA polymerase: a protein with simplicity. Sci Rep 2017; 7:6290. [PMID: 28740191 PMCID: PMC5524818 DOI: 10.1038/s41598-017-06586-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 06/14/2017] [Indexed: 11/24/2022] Open
Abstract
The capability of performing an array of functions with its single subunit structure makes T7 RNA polymerase (T7RNAP) as one of the simplest yet attractive target for various investigations ranging from structure determinations to several biological tests. In this study, with the help of molecular dynamics (MD) calculations and molecular docking, we investigated the effect of varying pH conditions on conformational flexibility of T7RNAP. We also studied its effect on the interactions with a well established inhibitor (heparin), substrate GTP and T7 promoter of T7RNAP. The simulation studies were validated with the help of three dimensional reconstructions of the polymerase at different pH environments using transmission electron microscopy and single particle analysis. On comparing the simulated structures, it was observed that the structure of T7RNAP changes considerably and interactions with its binding partners also changes as the pH shifts from basic to acidic. Further, it was observed that the C-terminal end plays a vital role in the inefficiency of the polymerase at low pH. Thus, this in-silico study may provide a significant insight into the structural investigations on T7RNAP as well as in designing potent inhibitors against it in varying pH environments.
Collapse
Affiliation(s)
- Subhomoi Borkotoky
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, 605014, India
| | - Chetan Kumar Meena
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, 605014, India
| | - Gopalkrishna M Bhalerao
- UGC-DAE Consortium for Scientific Research Kalpakkam Node, Kokilamedu, Tamilnadu, 603104, India
| | - Ayaluru Murali
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, 605014, India.
| |
Collapse
|
15
|
Computational modeling of protein assemblies. Curr Opin Struct Biol 2017; 44:179-189. [DOI: 10.1016/j.sbi.2017.04.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Revised: 04/07/2017] [Accepted: 04/11/2017] [Indexed: 01/18/2023]
|
16
|
Rakesh R, Joseph AP, Bhaskara RM, Srinivasan N. Structural and mechanistic insights into human splicing factor SF3b complex derived using an integrated approach guided by the cryo-EM density maps. RNA Biol 2016; 13:1025-1040. [PMID: 27618338 DOI: 10.1080/15476286.2016.1218590] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Pre-mRNA splicing in eukaryotes is performed by the spliceosome, a highly complex macromolecular machine. SF3b is a multi-protein complex which recognizes the branch point adenosine of pre-mRNA as part of a larger U2 snRNP or U11/U12 di-snRNP in the dynamic spliceosome machinery. Although a cryo-EM map is available for human SF3b complex, the structure and relative spatial arrangement of all components in the complex are not yet known. We have recognized folds of domains in various proteins in the assembly and generated comparative models. Using an integrative approach involving structural and other experimental data, guided by the available cryo-EM density map, we deciphered a pseudo-atomic model of the closed form of SF3b which is found to be a "fuzzy complex" with highly flexible components and multiplicity of folds. Further, the model provides structural information for 5 proteins (SF3b10, SF3b155, SF3b145, SF3b130 and SF3b14b) and localization information for 4 proteins (SF3b10, SF3b145, SF3b130 and SF3b14b) in the assembly for the first time. Integration of this model with the available U11/U12 di-snRNP cryo-EM map enabled elucidation of an open form. This now provides new insights on the mechanistic features involved in the transition between closed and open forms pivoted by a hinge region in the SF3b155 protein that also harbors cancer causing mutations. Moreover, the open form guided model of the 5' end of U12 snRNA, which includes the branch point duplex, shows that the architecture of SF3b acts as a scaffold for U12 snRNA: pre-mRNA branch point duplex formation with potential implications for branch point adenosine recognition fidelity.
Collapse
Affiliation(s)
- Ramachandran Rakesh
- a Molecular Biophysics Unit, Indian Institute of Science , Bangalore , India
| | - Agnel Praveen Joseph
- b National Center for Biological Sciences, TIFR, GKVK Campus , Bangalore , India
| | - Ramachandra M Bhaskara
- a Molecular Biophysics Unit, Indian Institute of Science , Bangalore , India.,b National Center for Biological Sciences, TIFR, GKVK Campus , Bangalore , India
| | | |
Collapse
|
17
|
Chen M, Baldwin PR, Ludtke SJ, Baker ML. De Novo modeling in cryo-EM density maps with Pathwalking. J Struct Biol 2016; 196:289-298. [PMID: 27436409 DOI: 10.1016/j.jsb.2016.06.004] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 06/06/2016] [Accepted: 06/07/2016] [Indexed: 11/26/2022]
Abstract
As electron cryo-microscopy (cryo-EM) can now frequently achieve near atomic resolution, accurate interpretation of these density maps in terms of atomistic detail has become paramount in deciphering macromolecular structure and function. However, there are few software tools for modeling protein structure from cryo-EM density maps in this resolution range. Here, we present an extension of our original Pathwalking protocol, which can automatically trace a protein backbone directly from a near-atomic resolution (3-6Å) density map. The original Pathwalking approach utilized a Traveling Salesman Problem solver for backbone tracing, but manual adjustment was still required during modeling. In the new version, human intervention is minimized and we provide a more robust approach for backbone modeling. This includes iterative secondary structure identification, termini detection and the ability to model multiple subunits without prior segmentation. Overall, the new Pathwalking procedure provides a more complete and robust tool for annotating protein structure function in near-atomic resolution density maps.
Collapse
Affiliation(s)
- Muyuan Chen
- Program in Structural and Computational Biology and Molecular Biophysics, United States; Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States
| | - Philip R Baldwin
- Department of Psychology, United States; Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States
| | - Steven J Ludtke
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States
| | - Matthew L Baker
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States.
| |
Collapse
|
18
|
Matthew Allen Bullock J, Schwab J, Thalassinos K, Topf M. The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry. Mol Cell Proteomics 2016; 15:2491-500. [PMID: 27150526 PMCID: PMC4937519 DOI: 10.1074/mcp.m116.058560] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Indexed: 11/06/2022] Open
Abstract
Crosslinking mass spectrometry (XL-MS) is becoming an increasingly popular technique for modeling protein monomers and complexes. The distance restraints garnered from these experiments can be used alone or as part of an integrative modeling approach, incorporating data from many sources. However, modeling practices are varied and the difference in their usefulness is not clear. Here, we develop a new scoring procedure for models based on crosslink data—Matched and Nonaccessible Crosslink score (MNXL). We compare its performance with that of other commonly-used scoring functions (Number of Violations and Sum of Violation Distances) on a benchmark of 14 protein domains, each with 300 corresponding models (at various levels of quality) and associated, previously published, experimental crosslinks (XLdb). The distances between crosslinked lysines are calculated either as Euclidean distances or Solvent Accessible Surface Distances (SASD) using a newly-developed method (Jwalk). MNXL takes into account whether a crosslink is nonaccessible, i.e. an experimentally observed crosslink has no corresponding SASD in a model due to buried lysines. This metric alone is shown to have a significant impact on modeling performance and is a concept that is not considered at present if only Euclidean distances are used. Additionally, a comparison between modeling with SASD or Euclidean distance shows that SASD is superior, even when factoring out the effect of the nonaccessible crosslinks. Our benchmarking also shows that MNXL outperforms the other tested scoring functions in terms of precision and correlation to Cα-RMSD from the crystal structure. We finally test the MNXL at different levels of crosslink recovery (i.e. the percentage of crosslinks experimentally observed out of all theoretical ones) and set a target recovery of ∼20% after which the performance plateaus.
Collapse
Affiliation(s)
- Joshua Matthew Allen Bullock
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK
| | - Jannik Schwab
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK; §Gene Center Munich, Ludwig-Maximilians-Universität (LMU) Munich, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | - Konstantinos Thalassinos
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK; ¶Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK
| | - Maya Topf
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK;
| |
Collapse
|
19
|
Deng H, Jia Y, Zhang Y. 3DRobot: automated generation of diverse and well-packed protein structure decoys. Bioinformatics 2015; 32:378-87. [PMID: 26471454 DOI: 10.1093/bioinformatics/btv601] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 10/10/2015] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Computationally generated non-native protein structure conformations (or decoys) are often used for designing protein folding simulation methods and force fields. However, almost all the decoy sets currently used in literature suffer from uneven root mean square deviation (RMSD) distribution with bias to non-protein like hydrogen-bonding and compactness patterns. Meanwhile, most protein decoy sets are pre-calculated and there is a lack of methods for automated generation of high-quality decoys for any target proteins. RESULTS We developed a new algorithm, 3DRobot, to create protein structure decoys by free fragment assembly with enhanced hydrogen-bonding and compactness interactions. The method was benchmarked with three widely used decoy sets from ab initio folding and comparative modeling simulations. The decoys generated by 3DRobot are shown to have significantly enhanced diversity and evenness with a continuous distribution in the RMSD space. The new energy terms introduced in 3DRobot improve the hydrogen-bonding network and compactness of decoys, which eliminates the possibility of native structure recognition by trivial potentials. Algorithms that can automatically create such diverse and well-packed non-native conformations from any protein structure should have a broad impact on the development of advanced protein force field and folding simulation methods. AVAILIABLITY AND IMPLEMENTATION: http://zhanglab.ccmb.med.umich.edu/3DRobot/ CONTACT jiay@phy.ccnu.edu.cn; zhng@umich.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Haiyou Deng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA, Department of Physics and Institute of Biophysics, Central China Normal University, Wuhan 430079, China and
| | - Ya Jia
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA, Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 45108, USA
| |
Collapse
|
20
|
Bettadapura R, Rasheed M, Vollrath A, Bajaj C. PF2fit: Polar Fast Fourier Matched Alignment of Atomistic Structures with 3D Electron Microscopy Maps. PLoS Comput Biol 2015; 11:e1004289. [PMID: 26469938 PMCID: PMC4607507 DOI: 10.1371/journal.pcbi.1004289] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Accepted: 04/14/2015] [Indexed: 11/30/2022] Open
Abstract
There continue to be increasing occurrences of both atomistic structure models in the PDB (possibly reconstructed from X-ray diffraction or NMR data), and 3D reconstructed cryo-electron microscopy (3D EM) maps (albeit at coarser resolution) of the same or homologous molecule or molecular assembly, deposited in the EMDB. To obtain the best possible structural model of the molecule at the best achievable resolution, and without any missing gaps, one typically aligns (match and fits) the atomistic structure model with the 3D EM map. We discuss a new algorithm and generalized framework, named PF(2) fit (Polar Fast Fourier Fitting) for the best possible structural alignment of atomistic structures with 3D EM. While PF(2) fit enables only a rigid, six dimensional (6D) alignment method, it augments prior work on 6D X-ray structure and 3D EM alignment in multiple ways: Scoring. PF(2) fit includes a new scoring scheme that, in addition to rewarding overlaps between the volumes occupied by the atomistic structure and 3D EM map, rewards overlaps between the volumes complementary to them. We quantitatively demonstrate how this new complementary scoring scheme improves upon existing approaches. PF(2) fit also includes two scoring functions, the non-uniform exterior penalty and the skeleton-secondary structure score, and implements the scattering potential score as an alternative to traditional Gaussian blurring. Search. PF(2) fit utilizes a fast polar Fourier search scheme, whose main advantage is the ability to search over uniformly and adaptively sampled subsets of the space of rigid-body motions. PF(2) fit also implements a new reranking search and scoring methodology that considerably improves alignment metrics in results obtained from the initial search.
Collapse
Affiliation(s)
- Radhakrishna Bettadapura
- Radhakrishna Bettadapura Computational Visualization Center/Department of Mechanical Engineering, University of Texas at Austin, Austin, Texas, United States of America
| | - Muhibur Rasheed
- Muhibur Rasheed Computational Visualization Center/Department of Mechanical Engineering, University of Texas at Austin, Austin, Texas, United States of America
| | - Antje Vollrath
- Antje Vollrath Institut Computational Mathematics, Technische Universität Braunschweig, Braunschweig, Germany
| | - Chandrajit Bajaj
- Chandrajit Bajaj Computational Visualization Center/Institute of Computational Engineering & Sciences/Department of Computer Science, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
21
|
Farabella I, Vasishtan D, Joseph AP, Pandurangan AP, Sahota H, Topf M. TEMPy: a Python library for assessment of three-dimensional electron microscopy density fits. J Appl Crystallogr 2015; 48:1314-1323. [PMID: 26306092 PMCID: PMC4520291 DOI: 10.1107/s1600576715010092] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 05/24/2015] [Indexed: 12/21/2022] Open
Abstract
TEMPy is an object-oriented Python library that provides the means to validate density fits in electron microscopy reconstructions. This article highlights several features of particular interest for this purpose and includes some customized examples. Three-dimensional electron microscopy is currently one of the most promising techniques used to study macromolecular assemblies. Rigid and flexible fitting of atomic models into density maps is often essential to gain further insights into the assemblies they represent. Currently, tools that facilitate the assessment of fitted atomic models and maps are needed. TEMPy (template and electron microscopy comparison using Python) is a toolkit designed for this purpose. The library includes a set of methods to assess density fits in intermediate-to-low resolution maps, both globally and locally. It also provides procedures for single-fit assessment, ensemble generation of fits, clustering, and multiple and consensus scoring, as well as plots and output files for visualization purposes to help the user in analysing rigid and flexible fits. The modular nature of TEMPy helps the integration of scoring and assessment of fits into large pipelines, making it a tool suitable for both novice and expert structural biologists.
Collapse
Affiliation(s)
- Irene Farabella
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| | - Daven Vasishtan
- Oxford Particle Imaging Centre, Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford , Oxford OX3 7BN, UK
| | - Agnel Praveen Joseph
- Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell , Didcot, Oxon OX11 0QX, UK
| | - Arun Prasad Pandurangan
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| | - Harpal Sahota
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| | - Maya Topf
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| |
Collapse
|
22
|
López-Blanco JR, Chacón P. Structural modeling from electron microscopy data. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014. [DOI: 10.1002/wcms.1199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| | - Pablo Chacón
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| |
Collapse
|
23
|
gEMfitter: a highly parallel FFT-based 3D density fitting tool with GPU texture memory acceleration. J Struct Biol 2013; 184:348-54. [PMID: 24060989 DOI: 10.1016/j.jsb.2013.09.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2013] [Revised: 09/06/2013] [Accepted: 09/10/2013] [Indexed: 11/24/2022]
Abstract
Fitting high resolution protein structures into low resolution cryo-electron microscopy (cryo-EM) density maps is an important technique for modeling the atomic structures of very large macromolecular assemblies. This article presents "gEMfitter", a highly parallel fast Fourier transform (FFT) EM density fitting program which can exploit the special hardware properties of modern graphics processor units (GPUs) to accelerate both the translational and rotational parts of the correlation search. In particular, by using the GPU's special texture memory hardware to rotate 3D voxel grids, the cost of rotating large 3D density maps is almost completely eliminated. Compared to performing 3D correlations on one core of a contemporary central processor unit (CPU), running gEMfitter on a modern GPU gives up to 26-fold speed-up. Furthermore, using our parallel processing framework, this speed-up increases linearly with the number of CPUs or GPUs used. Thus, it is now possible to use routinely more robust but more expensive 3D correlation techniques. When tested on low resolution experimental cryo-EM data for the GroEL-GroES complex, we demonstrate the satisfactory fitting results that may be achieved by using a locally normalised cross-correlation with a Laplacian pre-filter, while still being up to three orders of magnitude faster than the well-known COLORES program.
Collapse
|
24
|
BAJAJ CHANDRAJIT, BAUER BENEDIKT, BETTADAPURA RADHAKRISHNA, VOLLRATH ANTJE. NONUNIFORM FOURIER TRANSFORMS FOR RIGID-BODY AND MULTI-DIMENSIONAL ROTATIONAL CORRELATIONS. SIAM JOURNAL ON SCIENTIFIC COMPUTING : A PUBLICATION OF THE SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS 2013; 35:10.1137/120892386. [PMID: 24379643 PMCID: PMC3874283 DOI: 10.1137/120892386] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The task of evaluating correlations is central to computational structural biology. The rigid-body correlation problem seeks the rigid-body transformation (R, t), R ∈ SO(3), t ∈ ℝ3 that maximizes the correlation between a pair of input scalar-valued functions representing molecular structures. Exhaustive solutions to the rigid-body correlation problem take advantage of the fast Fourier transform to achieve a speedup either with respect to the sought translation or rotation. We present PFcorr, a new exhaustive solution, based on the non-equispaced SO(3) Fourier transform, to the rigid-body correlation problem; unlike previous solutions, ours achieves a combination of translational and rotational speedups without requiring equispaced grids. PFcorr can be straightforwardly applied to a variety of problems in protein structure prediction and refinement that involve correlations under rigid-body motions of the protein. Additionally, we show how it applies, along with an appropriate flexibility model, to analogs of the above problems in which the flexibility of the protein is relevant.
Collapse
Affiliation(s)
- CHANDRAJIT BAJAJ
- Computational Visualization Center, Department of Computer Sciences and The Institute of Computational Engineering and Sciences, The University of Texas at Austin, 1 University Station C0200, Austin, Texas 78712, USA
| | - BENEDIKT BAUER
- Max Planck Institute for Evolutionary Biology. Plön, Germany
| | - RADHAKRISHNA BETTADAPURA
- Computational Visualization Center, Department of Mechanical Engineering, The University of Texas at Austin, 1 University Station C0200, Austin, Texas 78712, USA
| | - ANTJE VOLLRATH
- Institute of Computational Mathematics, TU Braunschweig, Pockelsstr 14, 38106 Braunschweig, Germany
| |
Collapse
|
25
|
Esquivel-Rodríguez J, Kihara D. Computational methods for constructing protein structure models from 3D electron microscopy maps. J Struct Biol 2013; 184:93-102. [PMID: 23796504 DOI: 10.1016/j.jsb.2013.06.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/11/2013] [Accepted: 06/13/2013] [Indexed: 12/31/2022]
Abstract
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Collapse
Affiliation(s)
- Juan Esquivel-Rodríguez
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
26
|
Consensus among multiple approaches as a reliability measure for flexible fitting into cryo-EM data. J Struct Biol 2013; 182:67-77. [PMID: 23416197 DOI: 10.1016/j.jsb.2013.02.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2012] [Revised: 01/29/2013] [Accepted: 02/01/2013] [Indexed: 12/14/2022]
Abstract
Cryo-electron microscopy (cryo-EM) can provide low-resolution density maps of large macromolecular assemblies. As the number of structures deposited in the Protein Data Bank by fitting a high-resolution structure into a low-resolution cryo-EM map is increasing, there is a need to revise the protocols and improve the measures for fitting. A recent study suggested using a combination of multiple automated flexible fitting approaches to improve the interpretation of cryo-EM data. The current work further explores the use of multiple approaches by validating this "consensus" fitting approach and deriving a local reliability measure. Here four different flexible fitting approaches are applied for fitting an initial structure into a simulated density map of known target structure from a dataset of proteins. It is found that the models produced from different approaches often have a consensus in conformation and are also near to the target structure, whereas cases not showing consensus are away from the target. A high correlation is also observed between the RMSF profiles calculated with respect to the average and the target structures, which indicates that the relation between consensus and accuracy can also be extended to a per-residue level. Therefore, the RMSF among the fitted models is proposed as a local reliability measure, which can be used to assess the reliability of the fit at specific regions. Hence, we encourage the community to use consensus flexible fitting with different methods to report on local reliability of the resulting models and improve the interpretation of cryo-EM data.
Collapse
|
27
|
de Vries SJ, Zacharias M. ATTRACT-EM: a new method for the computational assembly of large molecular machines using cryo-EM maps. PLoS One 2012; 7:e49733. [PMID: 23251350 PMCID: PMC3522670 DOI: 10.1371/journal.pone.0049733] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 10/17/2012] [Indexed: 11/23/2022] Open
Abstract
Many of the most important functions in the cell are carried out by proteins organized in large molecular machines. Cryo-electron microscopy (cryo-EM) is increasingly being used to obtain low resolution density maps of these large assemblies. A new method, ATTRACT-EM, for the computational assembly of molecular assemblies from their components has been developed. Based on concepts from the protein-protein docking field, it utilizes cryo-EM density maps to assemble molecular subunits at near atomic detail, starting from millions of initial subunit configurations. The search efficiency was further enhanced by recombining partial solutions, the inclusion of symmetry information, and refinement using a molecular force field. The approach was tested on the GroES-GroEL system, using an experimental cryo-EM map at 23.5 Å resolution, and on several smaller complexes. Inclusion of experimental information on the symmetry of the systems and the application of a new gradient vector matching algorithm allowed the efficient identification of docked assemblies in close agreement with experiment. Application to the GroES-GroEL complex resulted in a top ranked model with a deviation of 4.6 Å (and a 2.8 Å model within the top 10) from the GroES-GroEL crystal structure, a significant improvement over existing methods.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- Physik-Department T38, Technische Universität München, Garching, Germany.
| | | |
Collapse
|
28
|
Saha M, Morais MC. FOLD-EM: automated fold recognition in medium- and low-resolution (4-15 Å) electron density maps. ACTA ACUST UNITED AC 2012; 28:3265-73. [PMID: 23131460 DOI: 10.1093/bioinformatics/bts616] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Owing to the size and complexity of large multi-component biological assemblies, the most tractable approach to determining their atomic structure is often to fit high-resolution radiographic or nuclear magnetic resonance structures of isolated components into lower resolution electron density maps of the larger assembly obtained using cryo-electron microscopy (cryo-EM). This hybrid approach to structure determination requires that an atomic resolution structure of each component, or a suitable homolog, is available. If neither is available, then the amount of structural information regarding that component is limited by the resolution of the cryo-EM map. However, even if a suitable homolog cannot be identified using sequence analysis, a search for structural homologs should still be performed because structural homology often persists throughout evolution even when sequence homology is undetectable, As macromolecules can often be described as a collection of independently folded domains, one way of searching for structural homologs would be to systematically fit representative domain structures from a protein domain database into the medium/low resolution cryo-EM map and return the best fits. Taken together, the best fitting non-overlapping structures would constitute a 'mosaic' backbone model of the assembly that could aid map interpretation and illuminate biological function. RESULT Using the computational principles of the Scale-Invariant Feature Transform (SIFT), we have developed FOLD-EM-a computational tool that can identify folded macromolecular domains in medium to low resolution (4-15 Å) electron density maps and return a model of the constituent polypeptides in a fully automated fashion. As a by-product, FOLD-EM can also do flexible multi-domain fitting that may provide insight into conformational changes that occur in macromolecular assemblies.
Collapse
Affiliation(s)
- Mitul Saha
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, 301 University Boulevard, Galveston, TX 7555-0647, USA.
| | | |
Collapse
|
29
|
Baker ML, Baker MR, Hryc CF, Ju T, Chiu W. Gorgon and pathwalking: macromolecular modeling tools for subnanometer resolution density maps. Biopolymers 2012; 97:655-68. [PMID: 22696403 PMCID: PMC3899894 DOI: 10.1002/bip.22065] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The complex interplay of proteins and other molecules, often in the form of large transitory assemblies, are critical to cellular function. Today, X-ray crystallography and electron cryo-microscopy (cryo-EM) are routinely used to image these macromolecular complexes, though often at limited resolutions. Despite the rapidly growing number of macromolecular structures, few tools exist for modeling and annotating structures in the range of 3-10 Å resolution. To address this need, we have developed a number of utilities specifically targeting subnanometer resolution density maps. As part of the 2010 Cryo-EM Modeling Challenge, we demonstrated two of our latest de novo modeling tools, Pathwalking and Gorgon, as well as a tool for secondary structure identification (SSEHunter) and a new rigid-body/flexible fitting tool in Gorgon. In total, we submitted 30 structural models from ten different subnanometer resolution data sets in four of the six challenge categories. Each of our utlities produced accurate structural models and annotations across the various density maps. In the end, the utilities that we present here offer users a robust toolkit for analyzing and modeling protein structure in macromolecular assemblies at non-atomic resolutions.
Collapse
Affiliation(s)
- Matthew L Baker
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, National Center for Macromolecular Imaging, Baylor College of Medicine, Houston, TX 77030, USA.
| | | | | | | | | |
Collapse
|
30
|
Lindert S, Alexander N, Wötzel N, Karakaş M, Stewart PL, Meiler J. EM-fold: de novo atomic-detail protein structure determination from medium-resolution density maps. Structure 2012; 20:464-78. [PMID: 22405005 DOI: 10.1016/j.str.2012.01.023] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Revised: 01/23/2012] [Accepted: 01/26/2012] [Indexed: 11/17/2022]
Abstract
Electron density maps of membrane proteins or large macromolecular complexes are frequently only determined at medium resolution between 4 Å and 10 Å, either by cryo-electron microscopy or X-ray crystallography. In these density maps, the general arrangement of secondary structure elements (SSEs) is revealed, whereas their directionality and connectivity remain elusive. We demonstrate that the topology of proteins with up to 250 amino acids can be determined from such density maps when combined with a computational protein folding protocol. Furthermore, we accurately reconstruct atomic detail in loop regions and amino acid side chains not visible in the experimental data. The EM-Fold algorithm assembles the SSEs de novo before atomic detail is added using Rosetta. In a benchmark of 27 proteins, the protocol consistently and reproducibly achieves models with root mean square deviation values <3 Å.
Collapse
Affiliation(s)
- Steffen Lindert
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, TN 37212, USA
| | | | | | | | | | | |
Collapse
|
31
|
Henderson R, Sali A, Baker ML, Carragher B, Devkota B, Downing KH, Egelman EH, Feng Z, Frank J, Grigorieff N, Jiang W, Ludtke SJ, Medalia O, Penczek PA, Rosenthal PB, Rossmann MG, Schmid MF, Schröder GF, Steven AC, Stokes DL, Westbrook JD, Wriggers W, Yang H, Young J, Berman HM, Chiu W, Kleywegt GJ, Lawson CL. Outcome of the first electron microscopy validation task force meeting. Structure 2012; 20:205-14. [PMID: 22325770 PMCID: PMC3328769 DOI: 10.1016/j.str.2011.12.014] [Citation(s) in RCA: 361] [Impact Index Per Article: 30.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2011] [Revised: 12/29/2011] [Accepted: 12/29/2011] [Indexed: 11/10/2022]
Abstract
This Meeting Review describes the proceedings and conclusions from the inaugural meeting of the Electron Microscopy Validation Task Force organized by the Unified Data Resource for 3DEM (http://www.emdatabank.org) and held at Rutgers University in New Brunswick, NJ on September 28 and 29, 2010. At the workshop, a group of scientists involved in collecting electron microscopy data, using the data to determine three-dimensional electron microscopy (3DEM) density maps, and building molecular models into the maps explored how to assess maps, models, and other data that are deposited into the Electron Microscopy Data Bank and Protein Data Bank public data archives. The specific recommendations resulting from the workshop aim to increase the impact of 3DEM in biology and medicine.
Collapse
Affiliation(s)
- Richard Henderson
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Finding rigid bodies in protein structures: Application to flexible fitting into cryoEM maps. J Struct Biol 2012; 177:520-31. [DOI: 10.1016/j.jsb.2011.10.011] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2011] [Revised: 10/22/2011] [Accepted: 10/27/2011] [Indexed: 11/18/2022]
|
33
|
Reconstructing virus structures from nanometer to near-atomic resolutions with cryo-electron microscopy and tomography. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 726:49-90. [PMID: 22297510 DOI: 10.1007/978-1-4614-0980-9_4] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
The past few decades have seen tremendous advances in single-particle electron -cryo-microscopy (cryo-EM). The field has matured to the point that near-atomic resolution density maps can be generated for icosahedral viruses without the need for crystallization. In parallel, substantial progress has been made in determining the structures of nonicosahedrally arranged proteins in viruses by employing either single-particle cryo-EM or cryo-electron tomography (cryo-ET). Implicit in this course have been the availability of a new generation of electron cryo-microscopes and the development of the computational tools that are essential for generating these maps and models. This methodology has enabled structural biologists to analyze structures in increasing detail for virus particles that are in different morphogenetic states. Furthermore, electron imaging of frozen, hydrated cells, in the process of being infected by viruses, has also opened up a new avenue for studying virus structures "in situ". Here we present the common techniques used to acquire and process cryo-EM and cryo-ET data and discuss their implications for structural virology both now and in the future.
Collapse
|
34
|
Lasker K, Velázquez-Muriel JA, Webb BM, Yang Z, Ferrin TE, Sali A. Macromolecular assembly structures by comparative modeling and electron microscopy. Methods Mol Biol 2012; 857:331-350. [PMID: 22323229 DOI: 10.1007/978-1-61779-588-6_15] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Advances in electron microscopy allow for structure determination of large biological machines at increasingly higher resolutions. A key step in this process is fitting component structures into the electron microscopy-derived density map of their assembly. Comparative modeling can contribute by providing atomic models of the components, via fold assignment, sequence-structure alignment, model building, and model assessment. All four stages of comparative modeling can also benefit from consideration of the density map. In this chapter, we describe numerous types of modeling problems restrained by a density map and available protocols for finding solutions. In particular, we provide detailed instructions for density map-guided modeling using the Integrative Modeling Platform (IMP), MODELLER, and UCSF Chimera.
Collapse
Affiliation(s)
- Keren Lasker
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.
| | | | | | | | | | | |
Collapse
|
35
|
AL NASR KAMAL, RANJAN DESH, ZUBAIR MOHAMMAD, HE JING. RANKING VALID TOPOLOGIES OF THE SECONDARY STRUCTURE ELEMENTS USING A CONSTRAINT GRAPH. J Bioinform Comput Biol 2011; 9:415-30. [DOI: 10.1142/s0219720011005604] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Revised: 04/12/2011] [Accepted: 04/17/2011] [Indexed: 11/18/2022]
Abstract
Electron cryo-microscopy is a fast advancing biophysical technique to derive three-dimensional structures of large protein complexes. Using this technique, many density maps have been generated at intermediate resolution such as 6–10 Å resolution. Although it is challenging to derive the backbone of the protein directly from such density maps, secondary structure elements such as helices and β-sheets can be computationally detected. Our work in this paper provides an approach to enumerate the top-ranked possible topologies instead of enumerating the entire population of the topologies. This approach is particularly practical for large proteins. We developed a directed weighted graph, the topology graph, to represent the secondary structure assignment problem. We prove that the problem of finding the valid topology with the minimum cost is NP hard. We developed an O(N2 2N) dynamic programming algorithm to identify the topology with the minimum cost. The test of 15 proteins suggests that our dynamic programming approach is feasible to work with proteins of much larger size than we could before. The largest protein in the test contains 18 helical sticks detected from the density map out of 33 helices in the protein.
Collapse
Affiliation(s)
- KAMAL AL NASR
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - DESH RANJAN
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - MOHAMMAD ZUBAIR
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - JING HE
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
36
|
Tjioe E, Lasker K, Webb B, Wolfson HJ, Sali A. MultiFit: a web server for fitting multiple protein structures into their electron microscopy density map. Nucleic Acids Res 2011; 39:W167-70. [PMID: 21715383 PMCID: PMC3125811 DOI: 10.1093/nar/gkr490] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Advances in electron microscopy (EM) allow for structure determination of large biological assemblies at increasingly higher resolutions. A key step in this process is fitting multiple component structures into an EM-derived density map of their assembly. Here, we describe a web server for this task. The server takes as input a set of protein structures in the PDB format and an EM density map in the MRC format. The output is an ensemble of models ranked by their quality of fit to the density map. The models can be viewed online or downloaded from the website. The service is available at; http://salilab.org/multifit/ and http://bioinfo3d.cs.tau.ac.il/.
Collapse
Affiliation(s)
- Elina Tjioe
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | | | | | | | | |
Collapse
|
37
|
Beck M, Topf M, Frazier Z, Tjong H, Xu M, Zhang S, Alber F. Exploring the spatial and temporal organization of a cell's proteome. J Struct Biol 2011; 173:483-96. [PMID: 21094684 PMCID: PMC3784337 DOI: 10.1016/j.jsb.2010.11.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2010] [Revised: 11/05/2010] [Accepted: 11/08/2010] [Indexed: 10/18/2022]
Abstract
To increase our current understanding of cellular processes, such as cell signaling and division, knowledge is needed about the spatial and temporal organization of the proteome at different organizational levels. These levels cover a wide range of length and time scales: from the atomic structures of macromolecules for inferring their molecular function, to the quantitative description of their abundance, and spatial distribution in the cell. Emerging new experimental technologies are greatly increasing the availability of such spatial information on the molecular organization in living cells. This review addresses three fields that have significantly contributed to our understanding of the proteome's spatial and temporal organization: first, methods for the structure determination of individual macromolecular assemblies, specifically the fitting of atomic structures into density maps generated from electron microscopy techniques; second, research that visualizes the spatial distributions of these complexes within the cellular context using cryo electron tomography techniques combined with computational image processing; and third, methods for the spatial modeling of the dynamic organization of the proteome, specifically those methods for simulating reaction and diffusion of proteins and complexes in crowded intracellular fluids. The long-term goal is to integrate the varied data about a proteome's organization into a spatially explicit, predictive model of cellular processes.
Collapse
Affiliation(s)
- Martin Beck
- European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Maya Topf
- Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK
| | - Zachary Frazier
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Harianto Tjong
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Min Xu
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Shihua Zhang
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Frank Alber
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| |
Collapse
|
38
|
Baker ML, Abeysinghe SS, Schuh S, Coleman RA, Abrams A, Marsh MP, Hryc CF, Ruths T, Chiu W, Ju T. Modeling protein structure at near atomic resolutions with Gorgon. J Struct Biol 2011; 174:360-73. [PMID: 21296162 DOI: 10.1016/j.jsb.2011.01.015] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2010] [Revised: 01/27/2011] [Accepted: 01/31/2011] [Indexed: 11/29/2022]
Abstract
Electron cryo-microscopy (cryo-EM) has played an increasingly important role in elucidating the structure and function of macromolecular assemblies in near native solution conditions. Typically, however, only non-atomic resolution reconstructions have been obtained for these large complexes, necessitating computational tools for integrating and extracting structural details. With recent advances in cryo-EM, maps at near-atomic resolutions have been achieved for several macromolecular assemblies from which models have been manually constructed. In this work, we describe a new interactive modeling toolkit called Gorgon targeted at intermediate to near-atomic resolution density maps (10-3.5 Å), particularly from cryo-EM. Gorgon's de novo modeling procedure couples sequence-based secondary structure prediction with feature detection and geometric modeling techniques to generate initial protein backbone models. Beyond model building, Gorgon is an extensible interactive visualization platform with a variety of computational tools for annotating a wide variety of 3D volumes. Examples from cryo-EM maps of Rotavirus and Rice Dwarf Virus are used to demonstrate its applicability to modeling protein structure.
Collapse
Affiliation(s)
- Matthew L Baker
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Spilman MS, Dearborn AD, Chang JR, Damle PK, Christie GE, Dokland T. A conformational switch involved in maturation of Staphylococcus aureus bacteriophage 80α capsids. J Mol Biol 2010; 405:863-76. [PMID: 21129380 DOI: 10.1016/j.jmb.2010.11.047] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2010] [Revised: 11/22/2010] [Accepted: 11/23/2010] [Indexed: 10/18/2022]
Abstract
Bacteriophages are involved in many aspects of the spread and establishment of virulence factors in Staphylococcus aureus, including the mobilization of genetic elements known as S. aureus pathogenicity islands (SaPIs), which carry genes for superantigen toxins and other virulence factors. SaPIs are packaged into phage-like transducing particles using proteins supplied by the helper phage. We have used cryo-electron microscopy and icosahedral reconstruction to determine the structures of the procapsid and the mature capsid of 80α, a bacteriophage that can mobilize several different SaPIs. The 80α capsid has T=7 icosahedral symmetry with the capsid protein organized into pentameric and hexameric clusters that interact via prominent trimeric densities. The 80α capsid protein was modeled based on the capsid protein fold of bacteriophage HK97 and fitted into the 80α reconstructions. The models show that the trivalent interactions are mediated primarily by a 22-residue β hairpin structure called the P loop that is not found in HK97. Capsid expansion is associated with a conformational switch in the spine helix that is propagated throughout the subunit, unlike the domain rotation mechanism in phage HK97 or P22.
Collapse
Affiliation(s)
- Michael S Spilman
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | | | | | | | | | | |
Collapse
|
40
|
Saha M, Levitt M, Chiu W. MOTIF-EM: an automated computational tool for identifying conserved regions in CryoEM structures. Bioinformatics 2010; 26:i301-9. [PMID: 20529921 PMCID: PMC2881380 DOI: 10.1093/bioinformatics/btq195] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
We present a new, first-of-its-kind, fully automated computational tool MOTIF-EM for identifying regions or domains or motifs in cryoEM maps of large macromolecular assemblies (such as chaperonins, viruses, etc.) that remain conformationally conserved. As a by-product, regions in structures that are not conserved are revealed: this can indicate local molecular flexibility related to biological activity. MOTIF-EM takes cryoEM volumetric maps as inputs. The technique used by MOTIF-EM to detect conserved sub-structures is inspired by a recent breakthrough in 2D object recognition. The technique works by constructing rotationally invariant, low-dimensional representations of local regions in the input cryoEM maps. Correspondences are established between the reduced representations (by comparing them using a simple metric) across the input maps. The correspondences are clustered using hash tables and graph theory is used to retrieve conserved structural domains or motifs. MOTIF-EM has been used to extract conserved domains occurring in large macromolecular assembly maps, including as those of viruses P22 and epsilon 15, Ribosome 70S, GroEL, that remain structurally conserved in different functional states. Our method can also been used to build atomic models for some maps. We also used MOTIF-EM to identify the conserved folds shared among dsDNA bacteriophages HK97, Epsilon 15, and ô29, though they have low-sequence similarity. Contact:mitul@cs.stanford.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mitul Saha
- NIH Center for Biomedical Computation, Stanford University, Stanford, CA 94305, USA.
| | | | | |
Collapse
|
41
|
Parent KN, Khayat R, Tu LH, Suhanovsky MM, Cortines JR, Teschke CM, Johnson JE, Baker TS. P22 coat protein structures reveal a novel mechanism for capsid maturation: stability without auxiliary proteins or chemical crosslinks. Structure 2010; 18:390-401. [PMID: 20223221 DOI: 10.1016/j.str.2009.12.014] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2009] [Revised: 12/15/2009] [Accepted: 12/29/2009] [Indexed: 11/30/2022]
Abstract
Viral capsid assembly and stability in tailed, dsDNA phage and Herpesviridae are achieved by various means including chemical crosslinks (unique to HK97), or auxiliary proteins (lambda, T4, phi29, and herpesviruses). All these viruses have coat proteins (CP) with a conserved, HK97-like core structure. We used a combination of trypsin digestion, gold labeling, cryo-electron microscopy, 3D image reconstruction, and comparative modeling to derive two independent, pseudoatomic models of bacteriophage P22 CP: before and after maturation. P22 capsid stabilization results from intersubunit interactions among N-terminal helices and an extensive "P loop," which obviate the need for crosslinks or auxiliary proteins. P22 CP also has a telokin-like Ig domain that likely stabilizes the monomer fold so that assembly may proceed via individual subunit addition rather than via preformed capsomers as occurs in HK97. Hence, the P22 CP structure may be a paradigm for understanding how monomers assemble in viruses like phi29 and HSV-1.
Collapse
Affiliation(s)
- Kristin N Parent
- Department of Chemistry & Biochemistry, University of California, San Diego, La Jolla, CA 92093, USA
| | | | | | | | | | | | | | | |
Collapse
|
42
|
Zhang S, Vasishtan D, Xu M, Topf M, Alber F. A fast mathematical programming procedure for simultaneous fitting of assembly components into cryoEM density maps. Bioinformatics 2010; 26:i261-8. [PMID: 20529915 PMCID: PMC2881386 DOI: 10.1093/bioinformatics/btq201] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
MOTIVATION Single-particle cryo electron microscopy (cryoEM) typically produces density maps of macromolecular assemblies at intermediate to low resolution (approximately 5-30 A). By fitting high-resolution structures of assembly components into these maps, pseudo-atomic models can be obtained. Optimizing the quality-of-fit of all components simultaneously is challenging due to the large search space that makes the exhaustive search over all possible component configurations computationally unfeasible. RESULTS We developed an efficient mathematical programming algorithm that simultaneously fits all component structures into an assembly density map. The fitting is formulated as a point set matching problem involving several point sets that represent component and assembly densities at a reduced complexity level. In contrast to other point matching algorithms, our algorithm is able to match multiple point sets simultaneously and not only based on their geometrical equivalence, but also based on the similarity of the density in the immediate point neighborhood. In addition, we present an efficient refinement method based on the Iterative Closest Point registration algorithm. The integer quadratic programming method generates an assembly configuration in a few seconds. This efficiency allows the generation of an ensemble of candidate solutions that can be assessed by an independent scoring function. We benchmarked the method using simulated density maps of 11 protein assemblies at 20 A, and an experimental cryoEM map at 23.5 A resolution. Our method was able to generate assembly structures with root-mean-square errors <6.5 A, which have been further reduced to <1.8 A by the local refinement procedure. AVAILABILITY The program is available upon request as a Matlab code package. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics Online.
Collapse
Affiliation(s)
- Shihua Zhang
- Program in Molecular and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | | | | | | | | |
Collapse
|
43
|
Gartmann M, Blau M, Armache JP, Mielke T, Topf M, Beckmann R. Mechanism of eIF6-mediated inhibition of ribosomal subunit joining. J Biol Chem 2010; 285:14848-14851. [PMID: 20356839 PMCID: PMC2865328 DOI: 10.1074/jbc.c109.096057] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2009] [Revised: 02/09/2010] [Indexed: 01/25/2023] Open
Abstract
During the process of ribosomal assembly, the essential eukaryotic translation initiation factor 6 (eIF6) is known to act as a ribosomal anti-association factor. However, a molecular understanding of the anti-association activity of eIF6 is still missing. Here we present the cryo-electron microscopy reconstruction of a complex of the large ribosomal subunit with eukaryotic eIF6 from Saccharomyces cerevisiae. The structure reveals that the eIF6 binding site involves mainly rpL23 (L14p in Escherichia coli). Based on our structural data, we propose that the mechanism of the anti-association activity of eIF6 is based on steric hindrance of intersubunit bridge formation around the dynamic bridge B6.
Collapse
Affiliation(s)
- Marco Gartmann
- Department of Biochemistry, Gene Center and Center for Integrated Protein Science (CiPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | - Michael Blau
- Institut für Biochemie der Charité, Humboldt Universität Berlin, Monbijoustrasse 2, 10117 Berlin, Germany
| | - Jean-Paul Armache
- Department of Biochemistry, Gene Center and Center for Integrated Protein Science (CiPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | - Thorsten Mielke
- UltraStrukturNetzwerk, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin; Institut für Medizinische Physik und Biophysik, Charité, Ziegelstrasse 5-9, 10098 Berlin, Germany
| | - Maya Topf
- Institute of Structural and Molecular Biology, School of Crystallography, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom
| | - Roland Beckmann
- Department of Biochemistry, Gene Center and Center for Integrated Protein Science (CiPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany.
| |
Collapse
|
44
|
Rawi R, Whitmore L, Topf M. CHOYCE: a web server for constrained homology modelling with cryoEM maps. ACTA ACUST UNITED AC 2010; 26:1673-4. [PMID: 20444836 PMCID: PMC2887048 DOI: 10.1093/bioinformatics/btq237] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
SUMMARY CHOYCE is a web server for homology modelling of protein components and the fitting of those components into cryo electron microscopy (cryoEM) maps of their assemblies. It provides an interactive approach to improving the selection of models based on the quality of their fit into the EM map. AVAILABILITY http://choyce.ismb.lon.ac.uk/ CONTACT m.topf@cryst.bbk.ac.uk; reda.rawi@uni-due.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Reda Rawi
- Institute of Structural and Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK.
| | | | | |
Collapse
|
45
|
Abstract
Today, electron cryomicroscopy (cryo-EM) can routinely achieve subnanometer resolutions of complex macromolecular assemblies. From a density map, one can extract key structural and functional information using a variety of computational analysis tools. At subnanometer resolution, these tools make it possible to isolate individual subunits, identify secondary structures, and accurately fit atomic models. With several cryo-EM studies achieving resolutions beyond 5Å, computational modeling and feature recognition tools have been employed to construct backbone and atomic models of the protein components directly from a density map. In this chapter, we describe several common classes of computational tools that can be used to analyze and model subnanometer resolution reconstructions from cryo-EM. A general protocol for analyzing subnanometer resolution density maps is presented along with a full description of steps used in analyzing the 4.3Å resolution structure of Mm-cpn.
Collapse
Affiliation(s)
- Matthew L Baker
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, USA
| | | | | | | |
Collapse
|
46
|
Taylor DJ, Devkota B, Huang AD, Topf M, Narayanan E, Sali A, Harvey SC, Frank J. Comprehensive molecular structure of the eukaryotic ribosome. Structure 2009; 17:1591-1604. [PMID: 20004163 PMCID: PMC2814252 DOI: 10.1016/j.str.2009.09.015] [Citation(s) in RCA: 132] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2009] [Revised: 09/16/2009] [Accepted: 09/18/2009] [Indexed: 01/02/2023]
Abstract
Despite the emergence of a large number of X-ray crystallographic models of the bacterial 70S ribosome over the past decade, an accurate atomic model of the eukaryotic 80S ribosome is still not available. Eukaryotic ribosomes possess more ribosomal proteins and ribosomal RNA than do bacterial ribosomes, which are implicated in extraribosomal functions in the eukaryotic cells. By combining cryo-EM with RNA and protein homology modeling, we obtained an atomic model of the yeast 80S ribosome complete with all ribosomal RNA expansion segments and all ribosomal proteins for which a structural homolog can be identified. Mutation or deletion of 80S ribosomal proteins can abrogate maturation of the ribosome, leading to several human diseases. We have localized one such protein unique to eukaryotes, rpS19e, whose mutations are associated with Diamond-Blackfan anemia in humans. Additionally, we characterize crucial interactions between the dynamic stalk base of the ribosome with eukaryotic elongation factor 2.
Collapse
Affiliation(s)
- Derek J Taylor
- Wadsworth Center, Empire State Plaza, Albany, NY 12201-0509, USA
| | - Batsal Devkota
- School of Biology and Parker H. Petit Institute of Bioengineering and Biosciences, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA
| | - Andrew D Huang
- School of Biology and Parker H. Petit Institute of Bioengineering and Biosciences, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA
| | - Maya Topf
- Department of Bioengineering and Therapeutical Sciences, Department of Pharmaceutical Chemistry, and California Institute of Quantitative Biosciences, Mission Bay Byers Hall, University of California at San Francisco, San Francisco, CA 94158, USA
| | - Eswar Narayanan
- Department of Bioengineering and Therapeutical Sciences, Department of Pharmaceutical Chemistry, and California Institute of Quantitative Biosciences, Mission Bay Byers Hall, University of California at San Francisco, San Francisco, CA 94158, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutical Sciences, Department of Pharmaceutical Chemistry, and California Institute of Quantitative Biosciences, Mission Bay Byers Hall, University of California at San Francisco, San Francisco, CA 94158, USA
| | - Stephen C Harvey
- School of Biology and Parker H. Petit Institute of Bioengineering and Biosciences, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA
| | - Joachim Frank
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biology and Department of Biological Sciences, Columbia University, New York, NY 10032, USA.
| |
Collapse
|
47
|
Lindert S, Stewart PL, Meiler J. Hybrid approaches: applying computational methods in cryo-electron microscopy. Curr Opin Struct Biol 2009; 19:218-25. [PMID: 19339173 DOI: 10.1016/j.sbi.2009.02.010] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2008] [Accepted: 02/26/2009] [Indexed: 12/20/2022]
Abstract
Recent advances in cryo-electron microscopy have led to an increasing number of high (3-5A) to medium (5-10A) resolution cryoEM density maps. These density maps contain valuable information about the protein structure but frequently require computational algorithms to aid their structural interpretation. It is these hybrid approaches between cryoEM and computational protein structure prediction algorithms that will shape protein structure elucidation from density maps.
Collapse
Affiliation(s)
- Steffen Lindert
- Department of Chemistry, Vanderbilt University, Nashville, TN 37212, USA
| | | | | |
Collapse
|
48
|
Lasker K, Topf M, Sali A, Wolfson HJ. Inferential optimization for simultaneous fitting of multiple components into a CryoEM map of their assembly. J Mol Biol 2009; 388:180-94. [PMID: 19233204 DOI: 10.1016/j.jmb.2009.02.031] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2008] [Revised: 12/29/2008] [Accepted: 02/12/2009] [Indexed: 11/24/2022]
Abstract
Models of macromolecular assemblies are essential for a mechanistic description of cellular processes. Such models are increasingly obtained by fitting atomic-resolution structures of components into a density map of the whole assembly. Yet, current density-fitting techniques are frequently insufficient for an unambiguous determination of the positions and orientations of all components. Here, we describe MultiFit, a method used for simultaneously fitting atomic structures of components into their assembly density map at resolutions as low as 25 A. The component positions and orientations are optimized with respect to a scoring function that includes the quality-of-fit of components in the map, the protrusion of components from the map envelope, and the shape complementarity between pairs of components. The scoring function is optimized by our exact inference optimizer DOMINO (Discrete Optimization of Multiple INteracting Objects) that efficiently finds the global minimum in a discrete sampling space. MultiFit was benchmarked on seven assemblies of known structure, consisting of up to seven proteins each. The input atomic structures of the components were obtained from the Protein Data Bank, as well as by comparative modeling based on a 16-99% sequence identity to a template structure. A near-native configuration was usually found as the top-scoring model. Therefore, MultiFit can provide initial configurations for further refinement of many multicomponent assembly structures described by electron microscopy.
Collapse
Affiliation(s)
- Keren Lasker
- Blavatnik School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel-Aviv 69978, Israel.
| | | | | | | |
Collapse
|
49
|
Connell SR, Topf M, Qin Y, Wilson DN, Mielke T, Fucini P, Nierhaus KH, Spahn CMT. A new tRNA intermediate revealed on the ribosome during EF4-mediated back-translocation. Nat Struct Mol Biol 2009; 15:910-5. [PMID: 19172743 DOI: 10.1038/nsmb.1469] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
EF4 (LepA) is an almost universally conserved translational GTPase in eubacteria. It seems to be essential under environmental stress conditions and has previously been shown to back-translocate the tRNAs on the ribosome, thereby reverting the canonical translocation reaction. In the current work, EF4 was directly visualized in the process of back-translocating tRNAs by single-particle cryo-EM. Using flexible fitting methods, we built a model of ribosome-bound EF4 based on the cryo-EM map and a recently published unbound EF4 X-ray structure. The cryo-EM map establishes EF4 as a noncanonical elongation factor that interacts not only with the elongating ribosome, but also with the back-translocated tRNA in the A-site region, which is present in a previously unseen, intermediate state and deviates markedly from the position of a canonical A-tRNA. Our results, therefore, provide insight into the underlying structural principles governing back-translocation.
Collapse
Affiliation(s)
- Sean R Connell
- Institut für Medizinische Physik und Biophysik, Charite-Universitätsmedizin Berlin, Ziegelstrasse 5-9, 10117-Berlin, Germany
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Flexible fitting of high-resolution x-ray structures into cryoelectron microscopy maps using biased molecular dynamics simulations. Biophys J 2008; 95:5692-705. [PMID: 18849406 DOI: 10.1529/biophysj.108.139451] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A methodology for flexible fitting of all-atom high-resolution structures into low-resolution cryoelectron microscopy (cryo-EM) maps is presented. Flexibility of the modeled structure is simulated by classical molecular dynamics and an additional effective potential is introduced to enhance the fitting process. The additional potential is proportional to the correlation coefficient between the experimental cryo-EM map and a synthetic map generated for an all-atom structure being fitted to the map. The additional forces are calculated as a gradient of the correlation coefficient. During the molecular dynamics simulations under the additional forces, the molecule undergoes a conformational transition that maximizes the correlation coefficient, which results in a high-accuracy fit of all-atom structure into a cryo-EM map. Using five test proteins that exhibit structural rearrangement during their biological activity, we demonstrate performance of our method. We also test our method on the experimental cryo-EM of elongation factor G and show that the model obtained is comparable to previous studies. In addition, we show that overfitting can be avoided by assessing the quality of the fitted model in terms of correlation coefficient and secondary structure preservation.
Collapse
|