1
|
Sazzed S. Determining Protein Secondary Structures in Heterogeneous Medium-Resolution Cryo-EM Images Using CryoSSESeg. ACS OMEGA 2024; 9:26409-26416. [PMID: 38911779 PMCID: PMC11191131 DOI: 10.1021/acsomega.4c02608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 05/02/2024] [Accepted: 05/09/2024] [Indexed: 06/25/2024]
Abstract
While the acquisition of cryo-electron microscopy (cryo-EM) at near-atomic resolution is becoming more prevalent, a considerable number of density maps are still resolved only at intermediate resolutions (5-10 Å). Due to the large variation in quality among these medium-resolution density maps, extracting structural information from them remains a challenging task. This study introduces a convolutional neural network (CNN)-based framework, cryoSSESeg, to determine the organization of protein secondary structure elements in medium-resolution cryo-EM images. CryoSSESeg is trained on approximately 1300 protein chains derived from around 500 experimental cryo-EM density maps of varied quality. It demonstrates strong performance with residue-level F 1 scores of 0.76 for helix detection and 0.60 for β-sheet detection on average across a set of testing chains. In comparison to traditional image processing tools like SSETracer, which demand significant manual intervention and preprocessing steps, cryoSSESeg demonstrates comparable or superior performance. Additionally, it demonstrates competitive performance alongside another deep learning-based model, Emap2sec. Furthermore, this study underscores the importance of secondary structure quality, particularly adherence to expected shapes, in detection performance, emphasizing the necessity for careful evaluation of the data quality.
Collapse
|
2
|
Li T, He J, Cao H, Zhang Y, Chen J, Xiao Y, Huang SY. All-atom RNA structure determination from cryo-EM maps. Nat Biotechnol 2024:10.1038/s41587-024-02149-8. [PMID: 38396075 DOI: 10.1038/s41587-024-02149-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Accepted: 01/24/2024] [Indexed: 02/25/2024]
Abstract
Many methods exist for determining protein structures from cryogenic electron microscopy maps, but this remains challenging for RNA structures. Here we developed EMRNA, a method for accurate, automated determination of full-length all-atom RNA structures from cryogenic electron microscopy maps. EMRNA integrates deep learning-based detection of nucleotides, three-dimensional backbone tracing and scoring with consideration of sequence and secondary structure information, and full-atom construction of the RNA structure. We validated EMRNA on 140 diverse RNA maps ranging from 37 to 423 nt at 2.0-6.0 Å resolutions, and compared EMRNA with auto-DRRAFTER, phenix.map_to_model and CryoREAD on a set of 71 cases. EMRNA achieves a median accuracy of 2.36 Å root mean square deviation and 0.86 TM-score for full-length RNA structures, compared with 6.66 Å and 0.58 for auto-DRRAFTER. EMRNA also obtains a high residue coverage and sequence match of 93.30% and 95.30% in the built models, compared with 58.20% and 42.20% for phenix.map_to_model and 56.45% and 52.3% for CryoREAD. EMRNA is fast and can build an RNA structure of 100 nt within 3 min.
Collapse
Affiliation(s)
- Tao Li
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Jiahua He
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Hong Cao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Yi Zhang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Ji Chen
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China
| | - Yi Xiao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China.
| | - Sheng-You Huang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, China.
| |
Collapse
|
3
|
Corum MR, Venkannagari H, Hryc CF, Baker ML. Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure. Biophys J 2024; 123:435-450. [PMID: 38268190 PMCID: PMC10912932 DOI: 10.1016/j.bpj.2024.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Over the last 15 years, structural biology has seen unprecedented development and improvement in two areas: electron cryo-microscopy (cryo-EM) and predictive modeling. Once relegated to low resolutions, single-particle cryo-EM is now capable of achieving near-atomic resolutions of a wide variety of macromolecular complexes. Ushered in by AlphaFold, machine learning has powered the current generation of predictive modeling tools, which can accurately and reliably predict models for proteins and some complexes directly from the sequence alone. Although they offer new opportunities individually, there is an inherent synergy between these techniques, allowing for the construction of large, complex macromolecular models. Here, we give a brief overview of these approaches in addition to illustrating works that combine these techniques for model building. These examples provide insight into model building, assessment, and limitations when integrating predictive modeling with cryo-EM density maps. Together, these approaches offer the potential to greatly accelerate the generation of macromolecular structural insights, particularly when coupled with experimental data.
Collapse
Affiliation(s)
- Michael R Corum
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Harikanth Venkannagari
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas.
| |
Collapse
|
4
|
DiIorio MC, Kulczyk AW. Novel Artificial Intelligence-Based Approaches for Ab Initio Structure Determination and Atomic Model Building for Cryo-Electron Microscopy. MICROMACHINES 2023; 14:1674. [PMID: 37763837 PMCID: PMC10534518 DOI: 10.3390/mi14091674] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 08/21/2023] [Accepted: 08/25/2023] [Indexed: 09/29/2023]
Abstract
Single particle cryo-electron microscopy (cryo-EM) has emerged as the prevailing method for near-atomic structure determination, shedding light on the important molecular mechanisms of biological macromolecules. However, the inherent dynamics and structural variability of biological complexes coupled with the large number of experimental images generated by a cryo-EM experiment make data processing nontrivial. In particular, ab initio reconstruction and atomic model building remain major bottlenecks that demand substantial computational resources and manual intervention. Approaches utilizing recent innovations in artificial intelligence (AI) technology, particularly deep learning, have the potential to overcome the limitations that cannot be adequately addressed by traditional image processing approaches. Here, we review newly proposed AI-based methods for ab initio volume generation, heterogeneous 3D reconstruction, and atomic model building. We highlight the advancements made by the implementation of AI methods, as well as discuss remaining limitations and areas for future development.
Collapse
Affiliation(s)
- Megan C. DiIorio
- Institute for Quantitative Biomedicine, Rutgers University, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
| | - Arkadiusz W. Kulczyk
- Institute for Quantitative Biomedicine, Rutgers University, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
- Department of Biochemistry & Microbiology, Rutgers University, 76 Lipman Drive, New Brunswick, NJ 08901, USA
| |
Collapse
|
5
|
Giri N, Roy RS, Cheng J. Deep learning for reconstructing protein structures from cryo-EM density maps: Recent advances and future directions. Curr Opin Struct Biol 2023; 79:102536. [PMID: 36773336 PMCID: PMC10023387 DOI: 10.1016/j.sbi.2023.102536] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 12/20/2022] [Accepted: 01/03/2023] [Indexed: 02/11/2023]
Abstract
Cryo-Electron Microscopy (cryo-EM) has emerged as a key technology to determine the structure of proteins, particularly large protein complexes and assemblies in recent years. A key challenge in cryo-EM data analysis is to automatically reconstruct accurate protein structures from cryo-EM density maps. In this review, we briefly overview various deep learning methods for building protein structures from cryo-EM density maps, analyze their impact, and discuss the challenges of preparing high-quality data sets for training deep learning models. Looking into the future, more advanced deep learning models of effectively integrating cryo-EM data with other sources of complementary data such as protein sequences and AlphaFold-predicted structures need to be developed to further advance the field.
Collapse
Affiliation(s)
- Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, 65211, Missouri, USA; NextGen Precision Health, University of Missouri, Columbia, 65211, Missouri, USA. https://twitter.com/@nvngiri
| | - Raj S Roy
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, 65211, Missouri, USA. https://twitter.com/@rajshekhorroy
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, 65211, Missouri, USA; NextGen Precision Health, University of Missouri, Columbia, 65211, Missouri, USA.
| |
Collapse
|
6
|
Beton JG, Cragnolini T, Kaleel M, Mulvaney T, Sweeney A, Topf M. Integrating model simulation tools and
cryo‐electron
microscopy. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Joseph George Beton
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Tristan Cragnolini
- Institute of Structural and Molecular Biology, Birkbeck and University College London London UK
| | - Manaz Kaleel
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Aaron Sweeney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| |
Collapse
|
7
|
Zhu Z, Deng Z, Wang Q, Wang Y, Zhang D, Xu R, Guo L, Wen H. Simulation and Machine Learning Methods for Ion-Channel Structure Determination, Mechanistic Studies and Drug Design. Front Pharmacol 2022; 13:939555. [PMID: 35837274 PMCID: PMC9275593 DOI: 10.3389/fphar.2022.939555] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open
Abstract
Ion channels are expressed in almost all living cells, controlling the in-and-out communications, making them ideal drug targets, especially for central nervous system diseases. However, owing to their dynamic nature and the presence of a membrane environment, ion channels remain difficult targets for the past decades. Recent advancement in cryo-electron microscopy and computational methods has shed light on this issue. An explosion in high-resolution ion channel structures paved way for structure-based rational drug design and the state-of-the-art simulation and machine learning techniques dramatically improved the efficiency and effectiveness of computer-aided drug design. Here we present an overview of how simulation and machine learning-based methods fundamentally changed the ion channel-related drug design at different levels, as well as the emerging trends in the field.
Collapse
Affiliation(s)
- Zhengdan Zhu
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Beijing Institute of Big Data Research, Beijing, China
| | - Zhenfeng Deng
- DP Technology, Beijing, China
- School of Pharmaceutical Sciences, Peking University, Beijing, China
| | | | | | - Duo Zhang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- DP Technology, Beijing, China
| | - Ruihan Xu
- DP Technology, Beijing, China
- National Engineering Research Center of Visual Technology, Peking University, Beijing, China
| | | | - Han Wen
- DP Technology, Beijing, China
| |
Collapse
|
8
|
Behkamal B, Naghibzadeh M, Pagnani A, Saberi MR, Al Nasr K. LPTD: a novel linear programming-based topology determination method for cryo-EM maps. Bioinformatics 2022; 38:2734-2741. [PMID: 35561171 PMCID: PMC9306757 DOI: 10.1093/bioinformatics/btac170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 03/01/2022] [Accepted: 03/18/2022] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Topology determination is one of the most important intermediate steps toward building the atomic structure of proteins from their medium-resolution cryo-electron microscopy (cryo-EM) map. The main goal in the topology determination is to identify correct matches (i.e. assignment and direction) between secondary structure elements (SSEs) (α-helices and β-sheets) detected in a protein sequence and cryo-EM density map. Despite many recent advances in molecular biology technologies, the problem remains a challenging issue. To overcome the problem, this article proposes a linear programming-based topology determination (LPTD) method to solve the secondary structure topology problem in three-dimensional geometrical space. Through modeling of the protein's sequence with the aid of extracting highly reliable features and a distance-based scoring function, the secondary structure matching problem is transformed into a complete weighted bipartite graph matching problem. Subsequently, an algorithm based on linear programming is developed as a decision-making strategy to extract the true topology (native topology) between all possible topologies. The proposed automatic framework is verified using 12 experimental and 15 simulated α-β proteins. Results demonstrate that LPTD is highly efficient and extremely fast in such a way that for 77% of cases in the dataset, the native topology has been detected in the first rank topology in <2 s. Besides, this method is able to successfully handle large complex proteins with as many as 65 SSEs. Such a large number of SSEs have never been solved with current tools/methods. AVAILABILITY AND IMPLEMENTATION The LPTD package (source code and data) is publicly available at https://github.com/B-Behkamal/LPTD. Moreover, two test samples as well as the instruction of utilizing the graphical user interface have been provided in the shared readme file. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Andrea Pagnani
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Torino I-10129, Italy
- Italian Institute for Genomic Medicine (IIGM), IRCC-Candiolo, Candiolo (TO) I-10060, Italy
- INFN Sezione di Torino, Torino I-10125, Italy
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
9
|
Behkamal B, Naghibzadeh M, Saberi MR, Tehranizadeh ZA, Pagnani A, Al Nasr K. Three-Dimensional Graph Matching to Identify Secondary Structure Correspondence of Medium-Resolution Cryo-EM Density Maps. Biomolecules 2021; 11:1773. [PMID: 34944417 PMCID: PMC8698881 DOI: 10.3390/biom11121773] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/18/2021] [Accepted: 11/20/2021] [Indexed: 01/15/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) is a structural technique that has played a significant role in protein structure determination in recent years. Compared to the traditional methods of X-ray crystallography and NMR spectroscopy, cryo-EM is capable of producing images of much larger protein complexes. However, cryo-EM reconstructions are limited to medium-resolution (~4-10 Å) for some cases. At this resolution range, a cryo-EM density map can hardly be used to directly determine the structure of proteins at atomic level resolutions, or even at their amino acid residue backbones. At such a resolution, only the position and orientation of secondary structure elements (SSEs) such as α-helices and β-sheets are observable. Consequently, finding the mapping of the secondary structures of the modeled structure (SSEs-A) to the cryo-EM map (SSEs-C) is one of the primary concerns in cryo-EM modeling. To address this issue, this study proposes a novel automatic computational method to identify SSEs correspondence in three-dimensional (3D) space. Initially, through a modeling of the target sequence with the aid of extracting highly reliable features from a generated 3D model and map, the SSEs matching problem is formulated as a 3D vector matching problem. Afterward, the 3D vector matching problem is transformed into a 3D graph matching problem. Finally, a similarity-based voting algorithm combined with the principle of least conflict (PLC) concept is developed to obtain the SSEs correspondence. To evaluate the accuracy of the method, a testing set of 25 experimental and simulated maps with a maximum of 65 SSEs is selected. Comparative studies are also conducted to demonstrate the superiority of the proposed method over some state-of-the-art techniques. The results demonstrate that the method is efficient, robust, and works well in the presence of errors in the predicted secondary structures of the cryo-EM images.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran;
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran;
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran; (M.R.S.); (Z.A.T.)
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Zeinab Amiri Tehranizadeh
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran; (M.R.S.); (Z.A.T.)
| | - Andrea Pagnani
- Politecnico di Torino, Corso Duca degli Abruzzi 24, I-10129 Torino, Italy;
- Italian Institute for Genomic Medicine, IRCCS Candiolo, SP-142, I-10060 Candiolo, Italy
- INFN, Sezione di Torino, I-10125 Torino, Italy
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
10
|
He J, Huang SY. EMNUSS: a deep learning framework for secondary structure annotation in cryo-EM maps. Brief Bioinform 2021; 22:bbab156. [PMID: 33954706 PMCID: PMC8574626 DOI: 10.1093/bib/bbab156] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 03/30/2021] [Accepted: 04/06/2021] [Indexed: 02/06/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) has become one of important experimental methods in structure determination. However, despite the rapid growth in the number of deposited cryo-EM maps motivated by advances in microscopy instruments and image processing algorithms, building accurate structure models for cryo-EM maps remains a challenge. Protein secondary structure information, which can be extracted from EM maps, is beneficial for cryo-EM structure modeling. Here, we present a novel secondary structure annotation framework for cryo-EM maps at both intermediate and high resolutions, named EMNUSS. EMNUSS adopts a three-dimensional (3D) nested U-net architecture to assign secondary structures for EM maps. Tested on three diverse datasets including simulated maps, middle resolution experimental maps, and high-resolution experimental maps, EMNUSS demonstrated its accuracy and robustness in identifying the secondary structures for cyro-EM maps of various resolutions. The EMNUSS program is freely available at http://huanglab.phys.hust.edu.cn/EMNUSS.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
11
|
He J, Huang SY. Full-length de novo protein structure determination from cryo-EM maps using deep learning. Bioinformatics 2021; 37:3480-3490. [PMID: 33978686 DOI: 10.1093/bioinformatics/btab357] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 04/03/2021] [Accepted: 05/08/2021] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION Advances in microscopy instruments and image processing algorithms have led to an increasing number of cryo-EM maps. However, building accurate models for the EM maps at 3-5 Å resolution remains a challenging and time-consuming process. With the rapid growth of deposited EM maps, there is an increasing gap between the maps and reconstructed/modeled 3-dimensional (3D) structures. Therefore, automatic reconstruction of atomic-accuracy full-atomstructures fromEMmaps is pressingly needed. RESULTS We present a semi-automatic de novo structure determination method using a deep learningbased framework, named as DeepMM, which builds atomic-accuracy all-atom models from cryo-EM maps at near-atomic resolution. In our method, the main-chain and Cα positions as well as their amino acid and secondary structure types are predicted in the EM map using Densely Connected Convolutional Networks. DeepMM was extensively validated on 40 simulated maps at 5 Å resolution and 30 experimental maps at 2.6-4.8 Å resolution as well as an EMDB-wide data set of 2931 experimental maps at 2.6-4.9 Å resolution, and compared with state-of-the-art algorithms including RosettaES, MAINMAST, and Phenix. Overall, our DeepMM algorithm obtained a significant improvement over existing methods in terms of both accuracy and coverage in building full-length protein structures on all test sets, demonstrating the efficacy and general applicability of DeepMM. AVAILABILITY http://huanglab.phys.hust.edu.cn/DeepMM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
12
|
Seffernick JT, Lindert S. Hybrid methods for combined experimental and computational determination of protein structure. J Chem Phys 2020; 153:240901. [PMID: 33380110 PMCID: PMC7773420 DOI: 10.1063/5.0026025] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 11/10/2020] [Indexed: 02/04/2023] Open
Abstract
Knowledge of protein structure is paramount to the understanding of biological function, developing new therapeutics, and making detailed mechanistic hypotheses. Therefore, methods to accurately elucidate three-dimensional structures of proteins are in high demand. While there are a few experimental techniques that can routinely provide high-resolution structures, such as x-ray crystallography, nuclear magnetic resonance (NMR), and cryo-EM, which have been developed to determine the structures of proteins, these techniques each have shortcomings and thus cannot be used in all cases. However, additionally, a large number of experimental techniques that provide some structural information, but not enough to assign atomic positions with high certainty have been developed. These methods offer sparse experimental data, which can also be noisy and inaccurate in some instances. In cases where it is not possible to determine the structure of a protein experimentally, computational structure prediction methods can be used as an alternative. Although computational methods can be performed without any experimental data in a large number of studies, inclusion of sparse experimental data into these prediction methods has yielded significant improvement. In this Perspective, we cover many of the successes of integrative modeling, computational modeling with experimental data, specifically for protein folding, protein-protein docking, and molecular dynamics simulations. We describe methods that incorporate sparse data from cryo-EM, NMR, mass spectrometry, electron paramagnetic resonance, small-angle x-ray scattering, Förster resonance energy transfer, and genetic sequence covariation. Finally, we highlight some of the major challenges in the field as well as possible future directions.
Collapse
Affiliation(s)
- Justin T. Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
13
|
Behkamal B, Naghibzadeh M, Pagnani A, Saberi MR, Al Nasr K. Solving the α-helix correspondence problem at medium-resolution Cryo-EM maps through modeling and 3D matching. J Mol Graph Model 2020; 103:107815. [PMID: 33338845 DOI: 10.1016/j.jmgm.2020.107815] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 11/09/2020] [Accepted: 11/18/2020] [Indexed: 11/30/2022]
Abstract
Cryo-electron microscopy (cryo-EM) has recently emerged as a prominent biophysical method for macromolecular structure determination. Many research efforts have been devoted to produce cryo-EM images, density maps, at near-atomic resolution. Despite many advances in technology, the resolution of the generated density maps may not be sufficiently adequate and informative to directly construct the atomic structure of proteins. At medium-resolution (∼4-10 Å), secondary structure elements (α-helices and β-sheets) are discernible, whereas finding the correspondence of secondary structure elements detected in the density map with those on the sequence remains a challenging problem. In this paper, an automatic framework is proposed to solve α-helix correspondence problem in three-dimensional space. Through modeling of the sequence with the aid of a novel strategy, the α-helix correspondence problem is initially transformed into a complete weighted bipartite graph matching problem. An innovative correlation-based scoring function based on a well-known and robust statistical method is proposed for weighting the graph. Moreover, two local optimization algorithms, which are Greedy and Improved Greedy algorithms, have been presented to find α-helix correspondence. A widely used data set including 16 reconstructed and 4 experimental cryo-EM maps were chosen to verify the accuracy and reliability of the proposed automatic method. The experimental results demonstrate that the automatic method is highly efficient (86.25% accuracy), robust (11.3% error rate), fast (∼1.4 s), and works independently from cryo-EM skeleton.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad, 9177948944, Iran.
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad, 9177948944, Iran.
| | - Andrea Pagnani
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino, Italy; Italian Institute for Genomic Medicine (IIGM), IRCC-Candiolo, Candiolo, TO, Italy; INFN Sezione di Torino, Via P. Giuria 1, Torino, Italy
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad, Iran; Bioinformatics Research group, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN, 37209, USA
| |
Collapse
|
14
|
Alshammari M, He J. Combine Cryo-EM Density Map and Residue Contact for Protein Structure Prediction - A Case Study. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2020; 2020:110. [PMID: 35838376 PMCID: PMC9279007 DOI: 10.1145/3388440.3414708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Cryo-electron microscopy is a major structure determination technique for large molecular machines and membrane-associated complexes. Although atomic structures have been determined directly from cryo-EM density maps with high resolutions, current structure determination methods for medium resolution (5 to 10 Å) cryo-EM maps are limited by the availability of structure templates. Secondary structure traces are lines detected from a cryo-EM density map for α-helices and β-strands of a protein. When combined with secondary structure sequence segments predicted from a protein sequence, it is possible to generate a set of likely topologies of α-traces and β-sheet traces. A topology describes the overall folding relationship among secondary structures; it is a critical piece of information for deriving the corresponding atomic structure. We propose a method for protein structure prediction that combines three sources of information: the secondary structure traces detected from the cryo-EM density map, predicted secondary structure sequence segments, and amino acid contact pairs predicted using MULTICOM. A case study shows that using amino acid contact prediction from MULTICOM improves the ranking of the true topology. Our observations convey that using a small set of highly voted secondary structure contact pairs enhances the ranking in all experiments conducted for this case.
Collapse
|
15
|
Alnabati E, Kihara D. Advances in Structure Modeling Methods for Cryo-Electron Microscopy Maps. Molecules 2019; 25:molecules25010082. [PMID: 31878333 PMCID: PMC6982917 DOI: 10.3390/molecules25010082] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 12/20/2019] [Accepted: 12/20/2019] [Indexed: 01/16/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) has now become a widely used technique for structure determination of macromolecular complexes. For modeling molecular structures from density maps of different resolutions, many algorithms have been developed. These algorithms can be categorized into rigid fitting, flexible fitting, and de novo modeling methods. It is also observed that machine learning (ML) techniques have been increasingly applied following the rapid progress of the ML field. Here, we review these different categories of macromolecule structure modeling methods and discuss their advances over time.
Collapse
Affiliation(s)
- Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Correspondence:
| |
Collapse
|
16
|
Terwilliger TC, Adams PD, Afonine PV, Sobolev OV. Cryo-EM map interpretation and protein model-building using iterative map segmentation. Protein Sci 2019; 29:87-99. [PMID: 31599033 PMCID: PMC6933853 DOI: 10.1002/pro.3740] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 09/30/2019] [Accepted: 10/01/2019] [Indexed: 11/17/2022]
Abstract
A procedure for building protein chains into maps produced by single‐particle electron cryo‐microscopy (cryo‐EM) is described. The procedure is similar to the way an experienced structural biologist might analyze a map, focusing first on secondary structure elements such as helices and sheets, then varying the contour level to identify connections between these elements. Since the high density in a map typically follows the main‐chain of the protein, the main‐chain connection between secondary structure elements can often be identified as the unbranched path between them with the highest minimum value along the path. This chain‐tracing procedure is then combined with finding side‐chain positions based on the presence of density extending away from the main path of the chain, allowing generation of a Cα model. The Cα model is converted to an all‐atom model and is refined against the map. We show that this procedure is as effective as other existing methods for interpretation of cryo‐EM maps and that it is considerably faster and produces models with fewer chain breaks than our previous methods that were based on approaches developed for crystallographic maps.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- Los Alamos National Laboratory, Los Alamos, New Mexico.,New Mexico Consortium, Los Alamos, New Mexico
| | - Paul D Adams
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California.,Department of Bioengineering, University of California Berkeley, Berkeley, California
| | - Pavel V Afonine
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Oleg V Sobolev
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California
| |
Collapse
|
17
|
Saltzberg DJ, Hepburn M, Pilla KB, Schriemer DC, Lees-Miller SP, Blundell TL, Sali A. SSEThread: Integrative threading of the DNA-PKcs sequence based on data from chemical cross-linking and hydrogen deuterium exchange. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2019; 147:92-102. [PMID: 31570166 DOI: 10.1016/j.pbiomolbio.2019.09.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 08/09/2019] [Accepted: 09/10/2019] [Indexed: 01/19/2023]
Abstract
X-ray crystallography and electron microscopy maps resolved to 3-8 Å are generally sufficient for tracing the path of the polypeptide chain in space, while often insufficient for unambiguously registering the sequence on the path (i.e., threading). Frequently, however, additional information is available from other biophysical experiments, physical principles, statistical analyses, and other prior models. Here, we formulate an integrative approach for sequence assignment to a partial backbone model as an optimization problem, which requires three main components: the representation of the system, the scoring function, and the optimization method. The method is implemented in the open source Integrative Modeling Platform (IMP) (https://integrativemodeling.org), allowing a number of different terms in the scoring function. We apply this method to localizing the sequence assignment within a 199-residue disordered region of three structured and sequence unassigned helices in the DNA-PKcs crystallographic structure, using chemical crosslinks, hydrogen deuterium exchange, and sequence connectivity. The resulting ensemble of threading models provides two major solutions, one of which suggests that the crucial ABCDE cluster of phosphorylation sites cannot undergo intra-molecular autophosphorylation without a conformational rearrangement. The ensemble of solutions embodies the most accurate and precise sequence threading given the available information.
Collapse
Affiliation(s)
- Daniel J Saltzberg
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA.
| | - Morgan Hepburn
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Canada
| | - Kala Bharath Pilla
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA
| | - David C Schriemer
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Canada
| | - Susan P Lees-Miller
- Department of Biochemistry and Molecular Biology, University of Calgary, Calgary, Canada
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA
| |
Collapse
|
18
|
Terwilliger TC, Adams PD, Afonine PV, Sobolev OV. Map segmentation, automated model-building and their application to the Cryo-EM Model Challenge. J Struct Biol 2018; 204:338-343. [PMID: 30063987 PMCID: PMC6163059 DOI: 10.1016/j.jsb.2018.07.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 07/11/2018] [Accepted: 07/27/2018] [Indexed: 11/27/2022]
Abstract
A recently-developed method for identifying a compact, contiguous region representing the unique part of a density map was applied to 218 Cryo-EM maps with resolutions of 4.5 Å or better. The key elements of the segmentation procedure are (1) identification of all regions of density above a threshold and (2) choice of a unique set of these regions, taking symmetry into consideration, that maximize connectivity and compactness. This segmentation approach was then combined with tools for automated map sharpening and model-building to generate models for the 12 maps in the 2016 Cryo-EM Model Challenge in a fully automated manner. The resulting models have completeness from 24% to 82% and RMS distances from reference interpretations of 0.6 Å-2.1 Å.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA; New Mexico Consortium, Los Alamos, NM 87544, USA.
| | - Paul D Adams
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA; Department of Bioengineering, University of California Berkeley, Berkeley, CA, USA
| | - Pavel V Afonine
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA; Department of Physics and International Centre for Quantum and Molecular Structures, Shanghai University, Shanghai 200444, People's Republic of China
| | - Oleg V Sobolev
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA
| |
Collapse
|
19
|
Cassidy CK, Himes BA, Luthey-Schulten Z, Zhang P. CryoEM-based hybrid modeling approaches for structure determination. Curr Opin Microbiol 2018; 43:14-23. [PMID: 29107896 PMCID: PMC5934336 DOI: 10.1016/j.mib.2017.10.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Revised: 10/04/2017] [Accepted: 10/09/2017] [Indexed: 12/21/2022]
Abstract
Recent advances in cryo-electron microscopy (cryoEM) have dramatically improved the resolutions at which vitrified biological specimens can be studied, revealing new structural and mechanistic insights over a broad range of spatial scales. Bolstered by these advances, much effort has been directed toward the development of hybrid modeling methodologies for the construction and refinement of high-fidelity atomistic models from cryoEM data. In this brief review, we will survey the key elements of cryoEM-based hybrid modeling, providing an overview of available computational tools and strategies as well as several recent applications.
Collapse
Affiliation(s)
- C Keith Cassidy
- Department of Physics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Benjamin A Himes
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Zaida Luthey-Schulten
- Department of Chemistry, Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Peijun Zhang
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA; Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK; Electron Bio-Imaging Centre, Diamond Light Sources, Harwell Science and Innovation Campus, Didcot OX11 0DE, UK.
| |
Collapse
|
20
|
Al Nasr K, Yousef F, Jebril R, Jones C. Analytical Approaches to Improve Accuracy in Solving the Protein Topology Problem. Molecules 2018; 23:E28. [PMID: 29360779 PMCID: PMC6017786 DOI: 10.3390/molecules23020028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 01/19/2018] [Accepted: 01/19/2018] [Indexed: 11/17/2022] Open
Abstract
To take advantage of recent advances in genomics and proteomics it is critical that the three-dimensional physical structure of biological macromolecules be determined. Cryo-Electron Microscopy (cryo-EM) is a promising and improving method for obtaining this data, however resolution is often not sufficient to directly determine the atomic scale structure. Despite this, information for secondary structure locations is detectable. De novo modeling is a computational approach to modeling these macromolecular structures based on cryo-EM derived data. During de novo modeling a mapping between detected secondary structures and the underlying amino acid sequence must be identified. DP-TOSS (Dynamic Programming for determining the Topology Of Secondary Structures) is one tool that attempts to automate the creation of this mapping. By treating the correspondence between the detected structures and the structures predicted from sequence data as a constraint graph problem DP-TOSS achieved good accuracy in its original iteration. In this paper, we propose modifications to the scoring methodology of DP-TOSS to improve its accuracy. Three scoring schemes were applied to DP-TOSS and tested: (i) a skeleton-based scoring function; (ii) a geometry-based analytical function; and (iii) a multi-well potential energy-based function. A test of 25 proteins shows that a combination of these schemes can improve the performance of DP-TOSS to solve the topology determination problem for macromolecule proteins.
Collapse
Affiliation(s)
- Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Feras Yousef
- Department of Mathematics, The University of Jordan, Amman 11942, Jordan.
| | - Ruba Jebril
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Christopher Jones
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| |
Collapse
|
21
|
Al Nasr K, Jones C, Yousef F, Jebril R. PEM-fitter: A Coarse-Grained Method to Validate Protein Candidate Models. J Comput Biol 2017; 25:21-32. [PMID: 29140718 DOI: 10.1089/cmb.2017.0191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The volumetric images produced by Cryo-Electron Microscopy (cryo-EM) technique are used to model macromolecular assemblies and machines. De novo protein modeling uses these images to computationally model the structure of the molecules. Many candidate conformations are usually generated during the intermediate step. Conventionally, each of these candidates is evaluated by time-consuming approaches such as potential energy. We introduce an initial version of a geometrical screening method that uses the skeleton of the cryo-EM images to evaluate candidate structures. The aim of this method is to reduce the number of native-like candidate conformations and, therefore, reduce the time required for structural evaluation by energy calculations. A test of two datasets was performed. The first dataset contains 10 proteins and shows that our method can successfully detect the correct native structure for the given skeleton among a set of different protein structures. The second dataset contains 12 proteins and shows that our method can filter slightly modified decoy conformations of the same protein. The efficiency of the method is also reported.
Collapse
Affiliation(s)
- Kamal Al Nasr
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| | - Christopher Jones
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| | - Feras Yousef
- 2 Department of Mathematics, The University of Jordan , Amman, Jordan
| | - Ruba Jebril
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| |
Collapse
|
22
|
Schilbach S, Hantsche M, Tegunov D, Dienemann C, Wigge C, Urlaub H, Cramer P. Structures of transcription pre-initiation complex with TFIIH and Mediator. Nature 2017; 551:204-209. [PMID: 29088706 PMCID: PMC6078178 DOI: 10.1038/nature24282] [Citation(s) in RCA: 183] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2017] [Accepted: 09/14/2017] [Indexed: 12/18/2022]
Abstract
For the initiation of transcription, RNA polymerase II (Pol II) assembles with general transcription factors on promoter DNA to form the pre-initiation complex (PIC). Here we report cryo-electron microscopy structures of the Saccharomyces cerevisiae PIC and PIC-core Mediator complex at nominal resolutions of 4.7 Å and 5.8 Å, respectively. The structures reveal transcription factor IIH (TFIIH), and suggest how the core and kinase TFIIH modules function in the opening of promoter DNA and the phosphorylation of Pol II, respectively. The TFIIH core subunit Ssl2 (a homologue of human XPB) is positioned on downstream DNA by the 'E-bridge' helix in TFIIE, consistent with TFIIE-stimulated DNA opening. The TFIIH kinase module subunit Tfb3 (MAT1 in human) anchors the kinase Kin28 (CDK7), which is mobile in the PIC but preferentially located between the Mediator hook and shoulder in the PIC-core Mediator complex. Open spaces between the Mediator head and middle modules may allow access of the kinase to its substrate, the C-terminal domain of Pol II.
Collapse
Affiliation(s)
- S Schilbach
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - M Hantsche
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - D Tegunov
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - C Dienemann
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - C Wigge
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - H Urlaub
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
- University Medical Center Göttingen, Institute of Clinical Chemistry, Bioanalytics Group, Robert-Koch-Straße 40, 37075 Göttingen, Germany
| | - P Cramer
- Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Göttingen, Germany
| |
Collapse
|
23
|
Dou H, Burrows DW, Baker ML, Ju T. Flexible Fitting of Atomic Models into Cryo-EM Density Maps Guided by Helix Correspondences. Biophys J 2017. [PMID: 28636906 DOI: 10.1016/j.bpj.2017.04.054] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Although electron cryo-microscopy (cryo-EM) has recently achieved resolutions of better than 3 Å, at which point molecular modeling can be done directly from the density map, analysis and annotation of a cryo-EM density map still primarily rely on fitting atomic or homology models to the density map. In this article, we present, to our knowledge, a new method for flexible fitting of known or modeled protein structures into cryo-EM density maps. Unlike existing methods that are guided by local density gradients, our method is guided by correspondences between the α-helices in the density map and model, and does not require an initial rigid-body fitting step. Compared with current methods on both simulated and experimental density maps, our method not only achieves greater accuracy for proteins with large deformations but also runs as fast or faster than many of the other flexible fitting routines.
Collapse
Affiliation(s)
- Hang Dou
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri.
| | - Derek W Burrows
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas
| | - Tao Ju
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri
| |
Collapse
|
24
|
Abstract
RosettaES, an algorithm that uses a fragment-based sampling strategy, improves macromolecular structure modeling from cryo-EM data at 3–5-Å resolution. Accurate atomic modeling of macromolecular structures into cryo-electron microscopy (cryo-EM) maps is a major challenge, as the moderate resolution makes accurate placement of atoms difficult. We present Rosetta enumerative sampling (RosettaES), an automated tool that uses a fragment-based sampling strategy for de novo model completion of macromolecular structures from cryo-EM density maps at 3–5-Å resolution. On a benchmark set of nine proteins, RosettaES was able to identify near-native conformations in 85% of segments. RosettaES was also used to determine models for three challenging macromolecular structures.
Collapse
|
25
|
Zhou N, Wang H, Wang J. EMBuilder: A Template Matching-based Automatic Model-building Program for High-resolution Cryo-Electron Microscopy Maps. Sci Rep 2017; 7:2664. [PMID: 28572576 PMCID: PMC5453991 DOI: 10.1038/s41598-017-02725-w] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Accepted: 04/18/2017] [Indexed: 01/17/2023] Open
Abstract
The resolution of electron-potential maps in single-particle cryo-electron microscopy (cryoEM) is approaching atomic or near- atomic resolution. However, no program currently exists for de novo cryoEM model building at resolutions exceeding beyond 3.5 Å. Here, we present a program, EMBuilder, based on template matching, to generate cryoEM models at high resolution. The program identifies features in both secondary-structure and Cα stages. In the secondary structure stage, helices and strands are identified with pre-computed templates, and the voxel size of the entire map is then refined to account for microscopic magnification errors. The identified secondary structures are then extended from both ends in the Cα stage via a log-likelihood (LLK) target function, and if possible, the side chains are also assigned. This program can build models of large proteins (~1 MDa) in a reasonable amount of time (~1 day) and thus has the potential to greatly decrease the manual workload required for model building of high-resolution cryoEM maps.
Collapse
Affiliation(s)
- Niyun Zhou
- MOE Key Laboratory of Protein Science, Tsinghua University, Beijing, 100084, China.,School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Hongwei Wang
- MOE Key Laboratory of Protein Science, Tsinghua University, Beijing, 100084, China. .,School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| | - Jiawei Wang
- State Key Laboratory of Membrane Biology, Tsinghua University, Beijing, 100084, China.
| |
Collapse
|
26
|
Biswas A, Ranjan D, Zubair M, Zeil S, Nasr KA, He J. An Effective Computational Method Incorporating Multiple Secondary Structure Predictions in Topology Determination for Cryo-EM Images. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:578-586. [PMID: 27008671 PMCID: PMC5071113 DOI: 10.1109/tcbb.2016.2543721] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
A key idea in de novo modeling of a medium-resolution density image obtained from cryo-electron microscopy is to compute the optimal mapping between the secondary structure traces observed in the density image and those predicted on the protein sequence. When secondary structures are not determined precisely, either from the image or from the amino acid sequence of the protein, the computational problem becomes more complex. We present an efficient method that addresses the secondary structure placement problem in presence of multiple secondary structure predictions and computes the optimal mapping. We tested the method using 12 simulated images from α-proteins and two Cryo-EM images of α-β proteins. We observed that the rank of the true topologies is consistently improved by using multiple secondary structure predictions instead of a single prediction. The results show that the algorithm is robust and works well even when errors/misses in the predicted secondary structures are present in the image or the sequence. The results also show that the algorithm is efficient and is able to handle proteins with as many as 33 helices.
Collapse
Affiliation(s)
- Abhishek Biswas
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Desh Ranjan
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Mohammad Zubair
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Stephanie Zeil
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Kamal Al Nasr
- Dept. of Computer Science, Tennessee State University, Nashville, TN 37209
| | - Jing He
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| |
Collapse
|
27
|
Li R, Si D, Zeng T, Ji S, He J. Deep Convolutional Neural Networks for Detecting Secondary Structures in Protein Density Maps from Cryo-Electron Microscopy. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2017; 2016:41-46. [PMID: 29770260 DOI: 10.1109/bibm.2016.7822490] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The detection of secondary structure of proteins using three dimensional (3D) cryo-electron microscopy (cryo-EM) images is still a challenging task when the spatial resolution of cryo-EM images is at medium level (5-10Å ). Prior researches focused on the usage of local features that may not capture the global information of image objects. In this study, we propose to use deep learning methods to extract high representative global features and then automatically detect secondary structures of proteins. In particular, we build a convolutional neural network (CNN) classifier that predicts the probability of label for every individual voxel in 3D cryo-EM image with respect to the secondary structure elements of proteins such as α-helix, β-sheet and background. To effectively incorporate the 3D spatial information in protein structures, we propose to perform 3D convolutions in the convolutional layers of CNNs. We show that the proposed CNN classifier can outperform existing SVM method on identifying the secondary structure elements of proteins from 3D cryo-EM medium resolution images.
Collapse
Affiliation(s)
- Rongjian Li
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529
| | - Dong Si
- Division of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011
| | - Tao Zeng
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA 99164
| | - Shuiwang Ji
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA 99164
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529
| |
Collapse
|
28
|
Si D, He J. Modeling Beta-Traces for Beta-Barrels from Cryo-EM Density Maps. BIOMED RESEARCH INTERNATIONAL 2017; 2017:1793213. [PMID: 28164115 PMCID: PMC5259677 DOI: 10.1155/2017/1793213] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Accepted: 12/08/2016] [Indexed: 01/09/2023]
Abstract
Cryo-electron microscopy (cryo-EM) has produced density maps of various resolutions. Although α-helices can be detected from density maps at 5-8 Å resolutions, β-strands are challenging to detect at such density maps due to close-spacing of β-strands. The variety of shapes of β-sheets adds the complexity of β-strands detection from density maps. We propose a new approach to model traces of β-strands for β-barrel density regions that are extracted from cryo-EM density maps. In the test containing eight β-barrels extracted from experimental cryo-EM density maps at 5.5 Å-8.25 Å resolution, StrandRoller detected about 74.26% of the amino acids in the β-strands with an overall 2.05 Å 2-way distance between the detected β-traces and the observed ones, if the best of the fifteen detection cases is considered.
Collapse
Affiliation(s)
- Dong Si
- Division of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011, USA
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
29
|
Poteat M, He J. An Iterative Bézier Method for Fitting Beta-sheet Component of a Cryo-EM Density Map. MOLECULAR BASED MATHEMATICAL BIOLOGY 2017; 5:31-39. [PMID: 34350231 PMCID: PMC8329936 DOI: 10.1515/mlbmb-2017-0003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Cryo-electron microscopy (Cryo-EM) is a powerful technique to produce 3-dimensional density maps for large molecular complexes. Although many atomic structures have been solved from cryo-EM density maps, it is challenging to derive atomic structures when the resolution of density maps is not sufficiently high. Geometrical shape representation of secondary structural components in a medium-resolution density map enhances modeling of atomic structures. We compare two methods in producing surface representation of the β-sheet component of a density map. Given a 3-dimensional volume of β-sheet that is segmented from a density map, the performance of a polynomial fitting was compared with that of an iterative Bézier fitting. The results suggest that the iterative Bézier fitting is more suitable for β-sheets, since it provides more accurate representation of the corners that are naturally twisted in a β-sheet.
Collapse
Affiliation(s)
- Michael Poteat
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| |
Collapse
|
30
|
Zeil S, Kovacs J, Wriggers W, He J. Comparing an Atomic Model or Structure to a Corresponding Cryo-electron Microscopy Image at the Central Axis of a Helix. J Comput Biol 2017; 24:52-67. [PMID: 27936925 PMCID: PMC5220566 DOI: 10.1089/cmb.2016.0145] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
Abstract
Three-dimensional density maps of biological specimens from cryo-electron microscopy (cryo-EM) can be interpreted in the form of atomic models that are modeled into the density, or they can be compared to known atomic structures. When the central axis of a helix is detectable in a cryo-EM density map, it is possible to quantify the agreement between this central axis and a central axis calculated from the atomic model or structure. We propose a novel arc-length association method to compare the two axes reliably. This method was applied to 79 helices in simulated density maps and six case studies using cryo-EM maps at 6.4-7.7 Å resolution. The arc-length association method is then compared to three existing measures that evaluate the separation of two helical axes: a two-way distance between point sets, the length difference between two axes, and the individual amino acid detection accuracy. The results show that our proposed method sensitively distinguishes lateral and longitudinal discrepancies between the two axes, which makes the method particularly suitable for the systematic investigation of cryo-EM map-model pairs.
Collapse
Affiliation(s)
- Stephanie Zeil
- Department of Computer Science, Old Dominion University, Norfolk, Virginia
| | - Julio Kovacs
- Department of Mechanical and Aerospace Engineering and Institute of Biomedical Engineering, Old Dominion University, Norfolk, Virginia
| | - Willy Wriggers
- Department of Mechanical and Aerospace Engineering and Institute of Biomedical Engineering, Old Dominion University, Norfolk, Virginia
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, Virginia
| |
Collapse
|
31
|
Haslam D, Zubair M, Ranjan D, Biswas A, He J. CHALLENGES IN MATCHING SECONDARY STRUCTURES IN CRYO-EM: AN EXPLORATION. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2016; 2016:1714-1719. [PMID: 29770261 PMCID: PMC5952047 DOI: 10.1109/bibm.2016.7822776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Cryo-electron microscopy is a fast emerging biophysical technique for structural determination of large protein complexes. While more atomic structures are being determined using this technique, it is still challenging to derive atomic structures from density maps produced at medium resolution when no suitable templates are available. A critical step in structure determination is how a protein chain threads through the 3-dimensional density map. A dynamic programming method was previously developed to generate K best matches of secondary structures between the density map and its protein sequence using shortest paths in a related weighted graph. We discuss challenges associated with the creation of the weighted graph and explore heuristic methods to solve the problem of matching secondary structures.
Collapse
Affiliation(s)
- Devin Haslam
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| | - Mohammad Zubair
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| | - Desh Ranjan
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| | | | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| |
Collapse
|
32
|
Heterodimers as the Structural Unit of the T=1 Capsid of the Fungal Double-Stranded RNA Rosellinia necatrix Quadrivirus 1. J Virol 2016; 90:11220-11230. [PMID: 27707923 DOI: 10.1128/jvi.01013-16] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Accepted: 09/29/2016] [Indexed: 02/07/2023] Open
Abstract
Most double-stranded RNA (dsRNA) viruses are transcribed and replicated in a specialized icosahedral capsid with a T=1 lattice consisting of 60 asymmetric capsid protein (CP) dimers. These capsids help to organize the viral genome and replicative complex(es). They also act as molecular sieves that isolate the virus genome from host defense mechanisms and allow the passage of nucleotides and viral transcripts. Rosellinia necatrix quadrivirus 1 (RnQV1), the type species of the family Quadriviridae, is a dsRNA fungal virus with a multipartite genome consisting of four monocistronic segments (segments 1 to 4). dsRNA-2 and dsRNA-4 encode two CPs (P2 and P4, respectively), which coassemble into ∼450-Å-diameter capsids. We used three-dimensional cryo-electron microscopy combined with complementary biophysical techniques to determine the structures of RnQV1 virion strains W1075 and W1118. RnQV1 has a quadripartite genome, and the capsid is based on a single-shelled T=1 lattice built of P2-P4 dimers. Whereas the RnQV1-W1118 capsid is built of full-length CP, P2 and P4 of RnQV1-W1075 are cleaved into several polypeptides, maintaining the capsid structural organization. RnQV1 heterodimers have a quaternary organization similar to that of homodimers of reoviruses and other dsRNA mycoviruses. The RnQV1 capsid is the first T=1 capsid with a heterodimer as an asymmetric unit reported to date and follows the architectural principle for dsRNA viruses that a 120-subunit capsid is a conserved assembly that supports dsRNA replication and organization. IMPORTANCE Given their importance to health, members of the family Reoviridae are the basis of most structural and functional studies and provide much of our knowledge of dsRNA viruses. Analysis of bacterial, protozoal, and fungal dsRNA viruses has improved our understanding of their structure, function, and evolution, as well. Here, we studied a dsRNA virus that infects the fungus Rosellinia necatrix, an ascomycete that is pathogenic to a wide range of plants. Using three-dimensional cryo-electron microscopy and analytical ultracentrifugation analysis, we determined the structure and stoichiometry of Rosellinia necatrix quadrivirus 1 (RnQV1). The RnQV1 capsid is a T=1 capsid with 60 heterodimers as the asymmetric units. The large amount of genetic information used by RnQV1 to construct a simple T=1 capsid is probably related to the numerous virus-host and virus-virus interactions that it must face in its life cycle, which lacks an extracellular phase.
Collapse
|
33
|
Kuzu G, Keskin O, Nussinov R, Gursoy A. PRISM-EM: template interface-based modelling of multi-protein complexes guided by cryo-electron microscopy density maps. Acta Crystallogr D Struct Biol 2016; 72:1137-1148. [PMID: 27710935 PMCID: PMC5053140 DOI: 10.1107/s2059798316013541] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 08/23/2016] [Indexed: 12/29/2022] Open
Abstract
The structures of protein assemblies are important for elucidating cellular processes at the molecular level. Three-dimensional electron microscopy (3DEM) is a powerful method to identify the structures of assemblies, especially those that are challenging to study by crystallography. Here, a new approach, PRISM-EM, is reported to computationally generate plausible structural models using a procedure that combines crystallographic structures and density maps obtained from 3DEM. The predictions are validated against seven available structurally different crystallographic complexes. The models display mean deviations in the backbone of <5 Å. PRISM-EM was further tested on different benchmark sets; the accuracy was evaluated with respect to the structure of the complex, and the correlation with EM density maps and interface predictions were evaluated and compared with those obtained using other methods. PRISM-EM was then used to predict the structure of the ternary complex of the HIV-1 envelope glycoprotein trimer, the ligand CD4 and the neutralizing protein m36.
Collapse
Affiliation(s)
- Guray Kuzu
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ozlem Keskin
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, 34450 Istanbul, Turkey
- Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research Inc., National Cancer Institute, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Attila Gursoy
- Computer Engineering, Koc University, 34450 Istanbul, Turkey
| |
Collapse
|
34
|
DiMaio F, Chiu W. Tools for Model Building and Optimization into Near-Atomic Resolution Electron Cryo-Microscopy Density Maps. Methods Enzymol 2016; 579:255-76. [PMID: 27572730 PMCID: PMC5103630 DOI: 10.1016/bs.mie.2016.06.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Electron cryo-microscopy (cryoEM) has advanced dramatically to become a viable tool for high-resolution structural biology research. The ultimate outcome of a cryoEM study is an atomic model of a macromolecule or its complex with interacting partners. This chapter describes a variety of algorithms and software to build a de novo model based on the cryoEM 3D density map, to optimize the model with the best stereochemistry restraints and finally to validate the model with proper protocols. The full process of atomic structure determination from a cryoEM map is described. The tools outlined in this chapter should prove extremely valuable in revealing atomic interactions guided by cryoEM data.
Collapse
Affiliation(s)
- F DiMaio
- University of Washington, Seattle, WA, United States; Institute for Protein Design, University of Washington, Seattle, WA, United States.
| | - W Chiu
- National Center for Macromolecular Imaging, Baylor College of Medicine, Houston, TX, United States.
| |
Collapse
|
35
|
Constrained cyclic coordinate descent for cryo-EM images at medium resolutions: beyond the protein loop closure problem. ROBOTICA 2016; 34:1777-1790. [DOI: 10.1017/s0263574716000242] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
SUMMARYThe cyclic coordinate descent (CCD) method is a popular loop closure method in protein structure modeling. It is a robotics algorithm originally developed for inverse kinematic applications. We demonstrate an effective method of building the backbone of protein structure models using the principle of CCD and a guiding trace. For medium-resolution 3-dimensional (3D) images derived using cryo-electron microscopy (cryo-EM), it is possible to obtain guiding traces of secondary structures and their skeleton connections. Our new method, constrained cyclic coordinate descent (CCCD), builds α-helices, β-strands, and loops quickly and fairly accurately along predefined traces. We show that it is possible to build the entire backbone of a protein fairly accurately when the guiding traces are accurate. In a test of 10 proteins, the models constructed using CCCD show an average of 3.91 Å of backbone root mean square deviation (RMSD). When the CCCD method is incorporated in a simulated annealing framework to sample possible shift, translation, and rotation freedom, the models built with the true topology were ranked high on the list, with an average backbone RMSD100 of 3.76 Å. CCCD is an effective method for modeling atomic structures after secondary structure traces and skeletons are extracted from 3D cryo-EM images.
Collapse
|
36
|
Bell JM, Chen M, Baldwin PR, Ludtke SJ. High resolution single particle refinement in EMAN2.1. Methods 2016; 100:25-34. [PMID: 26931650 PMCID: PMC4848122 DOI: 10.1016/j.ymeth.2016.02.018] [Citation(s) in RCA: 116] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 02/25/2016] [Accepted: 02/26/2016] [Indexed: 01/01/2023] Open
Abstract
EMAN2.1 is a complete image processing suite for quantitative analysis of grayscale images, with a primary focus on transmission electron microscopy, with complete workflows for performing high resolution single particle reconstruction, 2-D and 3-D heterogeneity analysis, random conical tilt reconstruction and subtomogram averaging, among other tasks. In this manuscript we provide the first detailed description of the high resolution single particle analysis pipeline and the philosophy behind its approach to the reconstruction problem. High resolution refinement is a fully automated process, and involves an advanced set of heuristics to select optimal algorithms for each specific refinement task. A gold standard FSC is produced automatically as part of refinement, providing a robust resolution estimate for the final map, and this is used to optimally filter the final CTF phase and amplitude corrected structure. Additional methods are in-place to reduce model bias during refinement, and to permit cross-validation using other computational methods.
Collapse
Affiliation(s)
- James M Bell
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
| | - Muyuan Chen
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
| | - Philip R Baldwin
- Department of Psychiatry, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
| | - Steven J Ludtke
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA.
| |
Collapse
|
37
|
He J, Zeil S, Hallak H, McKaig K, Kovacs J, Wriggers W. Comparison of an Atomic Model and Its Cryo-EM Image at the Central Axis of a Helix. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2015; 2015:1253-1259. [PMID: 27280059 PMCID: PMC4894056 DOI: 10.1109/bibm.2015.7359860] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Cryo-electron microscopy (cryo-EM) is an important biophysical technique that produces three-dimensional (3D) density maps at different resolutions. Because more and more models are being produced from cryo-EM density maps, validation of the models is becoming important. We propose a method for measuring local agreement between a model and the density map using the central axis of the helix. This method was tested using 19 helices from cryo-EM density maps between 5.5 Å and 7.2 Å resolution and 94 helices from simulated density maps. This method distinguished most of the well-fitting helices, although challenges exist for shorter helices.
Collapse
Affiliation(s)
- Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Stephanie Zeil
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Hussam Hallak
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Kele McKaig
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Julio Kovacs
- Department of Mechanical & Aerospace Engineering, Old Dominion University, Norfolk, VA, 23529
| | - Willy Wriggers
- Department of Mechanical & Aerospace Engineering, Old Dominion University, Norfolk, VA, 23529
| |
Collapse
|
38
|
Fan G, Baker ML, Wang Z, Baker MR, Sinyagovskiy PA, Chiu W, Ludtke SJ, Serysheva II. Gating machinery of InsP3R channels revealed by electron cryomicroscopy. Nature 2015; 527:336-41. [PMID: 26458101 DOI: 10.1038/nature15249] [Citation(s) in RCA: 168] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Accepted: 07/24/2015] [Indexed: 12/12/2022]
Abstract
Inositol-1,4,5-trisphosphate receptors (InsP3Rs) are ubiquitous ion channels responsible for cytosolic Ca(2+) signalling and essential for a broad array of cellular processes ranging from contraction to secretion, and from proliferation to cell death. Despite decades of research on InsP3Rs, a mechanistic understanding of their structure-function relationship is lacking. Here we present the first, to our knowledge, near-atomic (4.7 Å) resolution electron cryomicroscopy structure of the tetrameric mammalian type 1 InsP3R channel in its apo-state. At this resolution, we are able to trace unambiguously ∼85% of the protein backbone, allowing us to identify the structural elements involved in gating and modulation of this 1.3-megadalton channel. Although the central Ca(2+)-conduction pathway is similar to other ion channels, including the closely related ryanodine receptor, the cytosolic carboxy termini are uniquely arranged in a left-handed α-helical bundle, directly interacting with the amino-terminal domains of adjacent subunits. This configuration suggests a molecular mechanism for allosteric regulation of channel gating by intracellular signals.
Collapse
Affiliation(s)
- Guizhen Fan
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, The University of Texas Medical School at Houston, 6431 Fannin Street, Houston, Texas 77030, USA
| | - Matthew L Baker
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Zhao Wang
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Mariah R Baker
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, The University of Texas Medical School at Houston, 6431 Fannin Street, Houston, Texas 77030, USA
| | - Pavel A Sinyagovskiy
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, The University of Texas Medical School at Houston, 6431 Fannin Street, Houston, Texas 77030, USA
| | - Wah Chiu
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Steven J Ludtke
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Irina I Serysheva
- Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, The University of Texas Medical School at Houston, 6431 Fannin Street, Houston, Texas 77030, USA
| |
Collapse
|
39
|
Biswas A, Ranjan D, Zubair M, He J. A Dynamic Programming Algorithm for Finding the Optimal Placement of a Secondary Structure Topology in Cryo-EM Data. J Comput Biol 2015; 22:837-43. [PMID: 26244416 DOI: 10.1089/cmb.2015.0120] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The determination of secondary structure topology is a critical step in deriving the atomic structures from the protein density maps obtained from electron cryomicroscopy technique. This step often relies on matching the secondary structure traces detected from the protein density map to the secondary structure sequence segments predicted from the amino acid sequence. Due to inaccuracies in both sources of information, a pool of possible secondary structure positions needs to be sampled. One way to approach the problem is to first derive a small number of possible topologies using existing matching algorithms, and then find the optimal placement for each possible topology. We present a dynamic programming method of Θ(Nq(2)h) to find the optimal placement for a secondary structure topology. We show that our algorithm requires significantly less computational time than the brute force method that is in the order of Θ(q(N) h).
Collapse
Affiliation(s)
- Abhishek Biswas
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| | - Desh Ranjan
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| | - Mohammad Zubair
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| | - Jing He
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| |
Collapse
|
40
|
San Martín C. Transmission electron microscopy and the molecular structure of icosahedral viruses. Arch Biochem Biophys 2015; 581:59-67. [PMID: 26072114 DOI: 10.1016/j.abb.2015.06.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Revised: 06/01/2015] [Accepted: 06/04/2015] [Indexed: 11/16/2022]
Abstract
The field of structural virology developed in parallel with methodological advances in X-ray crystallography and cryo-electron microscopy. At the end of the 1970s, crystallography yielded the first high resolution structure of an icosahedral virus, the T=3 tomato bushy stunt virus at 2.9Å. It took longer to reach near-atomic resolution in three-dimensional virus maps derived from electron microscopy data, but this was finally achieved, with the solution of complex icosahedral capsids such as the T=25 human adenovirus at ∼3.5Å. Both techniques now work hand-in-hand to determine those aspects of virus assembly and biology that remain unclear. This review examines the trajectory followed by EM imaging techniques in showing the molecular structure of icosahedral viruses, from the first two-dimensional negative staining images of capsids to the latest sophisticated techniques that provide high resolution three-dimensional data, or snapshots of the conformational changes necessary to complete the infectious cycle.
Collapse
Affiliation(s)
- Carmen San Martín
- Department of Macromolecular Structure and NanoBioMedicine Initiative, Centro Nacional de Biotecnología (CNB-CSIC), Darwin 3, 28049 Madrid, Spain.
| |
Collapse
|
41
|
Wang RYR, Kudryashev M, Li X, Egelman EH, Basler M, Cheng Y, Baker D, DiMaio F. De novo protein structure determination from near-atomic-resolution cryo-EM maps. Nat Methods 2015; 12:335-8. [PMID: 25707029 PMCID: PMC4435692 DOI: 10.1038/nmeth.3287] [Citation(s) in RCA: 130] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Accepted: 12/31/2014] [Indexed: 12/12/2022]
Abstract
We present a de novo model-building approach that combines predicted backbone conformations with side-chain fit to density to accurately assign sequence into density maps. This method yielded accurate models for six of nine experimental maps at 3.3- to 4.8-Å resolution and produced a nearly complete model for an unsolved map containing a 660-residue heterodimeric protein. This method should enable rapid and reliable protein structure determination from near-atomic-resolution cryo-electron microscopy (cryo-EM) maps.
Collapse
Affiliation(s)
- Ray Yu-Ruei Wang
- Graduate program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98195, USA
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Mikhail Kudryashev
- Focal Area Infection Biology, Biozentrum, University of Basel, Switzerland
- Center for Cellular Imaging and NanoAnalytics, Biozentrum, University of Basel, Switzerland
| | - Xueming Li
- The Keck Advanced Microscopy Laboratory, Department of Biochemistry and Biophysics, University of California, San Francisco, CA, USA
| | - Edward H. Egelman
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
| | - Marek Basler
- Focal Area Infection Biology, Biozentrum, University of Basel, Switzerland
| | - Yifan Cheng
- The Keck Advanced Microscopy Laboratory, Department of Biochemistry and Biophysics, University of California, San Francisco, CA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
42
|
Schröder GF. Hybrid methods for macromolecular structure determination: experiment with expectations. Curr Opin Struct Biol 2015; 31:20-7. [DOI: 10.1016/j.sbi.2015.02.016] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Revised: 02/22/2015] [Accepted: 02/26/2015] [Indexed: 12/15/2022]
|
43
|
Si D, He J. Tracing Beta Strands Using StrandTwister from Cryo-EM Density Maps at Medium Resolutions. Structure 2014; 22:1665-76. [DOI: 10.1016/j.str.2014.08.017] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 08/07/2014] [Accepted: 08/08/2014] [Indexed: 10/24/2022]
|
44
|
A newly isolated reovirus has the simplest genomic and structural organization of any reovirus. J Virol 2014; 89:676-87. [PMID: 25355879 DOI: 10.1128/jvi.02264-14] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED A total of 2,691 mosquitoes representing 17 species was collected from eight locations in southwest Cameroon and screened for pathogenic viruses. Ten isolates of a novel reovirus (genus Dinovernavirus) were detected by culturing mosquito pools on Aedes albopictus (C6/36) cell cultures. A virus that caused overt cytopathic effects was isolated, but it did not infect vertebrate cells or produce detectable disease in infant mice after intracerebral inoculation. The virus, tentatively designated Fako virus (FAKV), represents the first 9-segment, double-stranded RNA (dsRNA) virus to be isolated in nature. FAKV appears to have a broad mosquito host range, and its detection in male specimens suggests mosquito-to-mosquito transmission in nature. The structure of the T=1 FAKV virion, determined to subnanometer resolution by cryoelectron microscopy (cryo-EM), showed only four proteins per icosahedral asymmetric unit: a dimer of the major capsid protein, one turret protein, and one clamp protein. While all other turreted reoviruses of known structures have at least two copies of the clamp protein per asymmetric unit, FAKV's clamp protein bound at only one conformer of the major capsid protein. The FAKV capsid architecture and genome organization represent the most simplified reovirus described to date, and phylogenetic analysis suggests that it arose from a more complex ancestor by serial loss-of-function events. IMPORTANCE We describe the detection, genetic, phenotypic, and structural characteristics of a novel Dinovernavirus species isolated from mosquitoes collected in Cameroon. The virus, tentatively designated Fako virus (FAKV), is related to both single-shelled and partially double-shelled viruses. The only other described virus in this genus was isolated from cultured mosquito cells. It was previously unclear whether the phenotypic characteristics of that virus were reflective of this genus in nature or were altered during serial passaging in the chronically infected cell line. FAKV is a naturally occurring single-shelled reovirus with a unique virion architecture that lacks several key structural elements thought to stabilize a single-shelled reovirus virion, suggesting what may be the minimal number of proteins needed to form a viable reovirus particle. FAKV evolved from more complex ancestors by losing a genome segment and several virion proteins.
Collapse
|
45
|
Dlotko P, Specogna R. Topology preserving thinning of cell complexes. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2014; 23:4486-4495. [PMID: 25137728 DOI: 10.1109/tip.2014.2348799] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
A topology preserving skeleton is a synthetic representation of an object that retains its topology and many of its significant morphological properties. The process of obtaining the skeleton, referred to as skeletonization or thinning, is a very active research area. It plays a central role in reducing the amount of information to be processed during image analysis and visualization, computer-aided diagnosis, or by pattern recognition algorithms. This paper introduces a novel topology preserving thinning algorithm, which removes simple cells-a generalization of simple points-of a given cell complex. The test for simple cells is based on acyclicity tables automatically produced in advance with homology computations. Using acyclicity tables render the implementation of thinning algorithms straightforward. Moreover, the fact that tables are automatically filled for all possible configurations allows to rigorously prove the generality of the algorithm and to obtain fool-proof implementations. The novel approach enables, for the first time, according to our knowledge, to thin a general unstructured simplicial complex. Acyclicity tables for cubical and simplicial complexes and an open source implementation of the thinning algorithm are provided as an additional material to allow their immediate use in the vast number of applications arising in medical imaging and beyond.
Collapse
|
46
|
Gipson P, Baker ML, Raytcheva D, Haase-Pettingell C, Piret J, King JA, Chiu W. Protruding knob-like proteins violate local symmetries in an icosahedral marine virus. Nat Commun 2014; 5:4278. [PMID: 24985522 PMCID: PMC4102127 DOI: 10.1038/ncomms5278] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 06/03/2014] [Indexed: 02/02/2023] Open
Abstract
Marine viruses play crucial roles in shaping the dynamics of oceanic microbial communities and in the carbon cycle on Earth. Here we report a 4.7-Å structure of a cyanobacterial virus, Syn5, by electron cryo-microscopy and modelling. A Cα backbone trace of the major capsid protein (gp39) reveals a classic phage protein fold. In addition, two knob-like proteins protruding from the capsid surface are also observed. Using bioinformatics and structure analysis tools, these proteins are identified to correspond to gp55 and gp58 (each with two copies per asymmetric unit). The non 1:1 stoichiometric distribution of gp55/58 to gp39 breaks all expected local symmetries and leads to non-quasi-equivalence of the capsid subunits, suggesting a role in capsid stabilization. Such a structural arrangement has not yet been observed in any known virus structures.
Collapse
Affiliation(s)
- Preeti Gipson
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Matthew L Baker
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Desislava Raytcheva
- 1] Department of Microbiology, Northeastern University, Boston, Massachusetts 02115, USA [2] Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Cameron Haase-Pettingell
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Jacqueline Piret
- Department of Microbiology, Northeastern University, Boston, Massachusetts 02115, USA
| | - Jonathan A King
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Wah Chiu
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
47
|
Cryo-EM near-atomic structure of a dsRNA fungal virus shows ancient structural motifs preserved in the dsRNA viral lineage. Proc Natl Acad Sci U S A 2014; 111:7641-6. [PMID: 24821769 DOI: 10.1073/pnas.1404330111] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Viruses evolve so rapidly that sequence-based comparison is not suitable for detecting relatedness among distant viruses. Structure-based comparisons suggest that evolution led to a small number of viral classes or lineages that can be grouped by capsid protein (CP) folds. Here, we report that the CP structure of the fungal dsRNA Penicillium chrysogenum virus (PcV) shows the progenitor fold of the dsRNA virus lineage and suggests a relationship between lineages. Cryo-EM structure at near-atomic resolution showed that the 982-aa PcV CP is formed by a repeated α-helical core, indicative of gene duplication despite lack of sequence similarity between the two halves. Superimposition of secondary structure elements identified a single "hotspot" at which variation is introduced by insertion of peptide segments. Structural comparison of PcV and other distantly related dsRNA viruses detected preferential insertion sites at which the complexity of the conserved α-helical core, made up of ancestral structural motifs that have acted as a skeleton, might have increased, leading to evolution of the highly varied current structures. Analyses of structural motifs only apparent after systematic structural comparisons indicated that the hallmark fold preserved in the dsRNA virus lineage shares a long (spinal) α-helix tangential to the capsid surface with the head-tailed phage and herpesvirus viral lineage.
Collapse
|
48
|
Al Nasr K, Ranjan D, Zubair M, Chen L, He J. Solving the Secondary Structure Matching Problem in Cryo-EM De Novo Modeling Using a Constrained K-Shortest Path Graph Algorithm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:419-430. [PMID: 26355788 DOI: 10.1109/tcbb.2014.2302803] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Electron cryomicroscopy is becoming a major experimental technique in solving the structures of large molecular assemblies. More and more three-dimensional images have been obtained at the medium resolutions between 5 and 10 Å. At this resolution range, major α-helices can be detected as cylindrical sticks and β-sheets can be detected as plain-like regions. A critical question in de novo modeling from cryo-EM images is to determine the match between the detected secondary structures from the image and those on the protein sequence. We formulate this matching problem into a constrained graph problem and present an O(Δ(2)N(2)2(N)) algorithm to this NP-Hard problem. The algorithm incorporates the dynamic programming approach into a constrained K-shortest path algorithm. Our method, DP-TOSS, has been tested using α-proteins with maximum 33 helices and α-β proteins up to five helices and 12 β-strands. The correct match was ranked within the top 35 for 19 of the 20 α-proteins and all nine α-β proteins tested. The results demonstrate that DP-TOSS improves accuracy, time and memory space in deriving the topologies of the secondary structure elements for proteins with a large number of secondary structures and a complex skeleton.
Collapse
|
49
|
McKnight A, Si D, Al Nasr K, Chernikov A, Chrisochoides N, He J. Estimating loop length from CryoEM images at medium resolutions. BMC STRUCTURAL BIOLOGY 2014; 13 Suppl 1:S5. [PMID: 24565041 PMCID: PMC3953143 DOI: 10.1186/1472-6807-13-s1-s5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Background De novo protein modeling approaches utilize 3-dimensional (3D) images derived from electron cryomicroscopy (CryoEM) experiments. The skeleton connecting two secondary structures such as α-helices represent the loop in the 3D image. The accuracy of the skeleton and of the detected secondary structures are critical in De novo modeling. It is important to measure the length along the skeleton accurately since the length can be used as a constraint in modeling the protein. Results We have developed a novel computational geometric approach to derive a simplified curve in order to estimate the loop length along the skeleton. The method was tested using fifty simulated density images of helix-loop-helix segments of atomic structures and eighteen experimentally derived density data from Electron Microscopy Data Bank (EMDB). The test using simulated density maps shows that it is possible to estimate within 0.5Å of the expected length for 48 of the 50 cases. The experiments, involving eighteen experimentally derived CryoEM images, show that twelve cases have error within 2Å. Conclusions The tests using both simulated and experimentally derived images show that it is possible for our proposed method to estimate the loop length along the skeleton if the secondary structure elements, such as α-helices, can be detected accurately, and there is a continuous skeleton linking the α-helices.
Collapse
|
50
|
DiMaio F, Zhang J, Chiu W, Baker D. Cryo-EM model validation using independent map reconstructions. Protein Sci 2013; 22:865-8. [PMID: 23592445 DOI: 10.1002/pro.2267] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2012] [Revised: 03/29/2013] [Accepted: 04/03/2013] [Indexed: 11/09/2022]
Abstract
An increasing number of cryo-electron microscopy (cryo-EM) density maps are being generated with suitable resolution to trace the protein backbone and guide sidechain placement. Generating and evaluating atomic models based on such maps would be greatly facilitated by independent validation metrics for assessing the fit of the models to the data. We describe such a metric based on the fit of atomic models with independent test maps from single particle reconstructions not used in model refinement. The metric provides a means to determine the proper balance between the fit to the density and model energy and stereochemistry during refinement, and is likely to be useful in determining values of model building and refinement metaparameters quite generally.
Collapse
Affiliation(s)
- Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, USA
| | | | | | | |
Collapse
|