1
|
Sazzed S. Determining Protein Secondary Structures in Heterogeneous Medium-Resolution Cryo-EM Images Using CryoSSESeg. ACS OMEGA 2024; 9:26409-26416. [PMID: 38911779 PMCID: PMC11191131 DOI: 10.1021/acsomega.4c02608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 05/02/2024] [Accepted: 05/09/2024] [Indexed: 06/25/2024]
Abstract
While the acquisition of cryo-electron microscopy (cryo-EM) at near-atomic resolution is becoming more prevalent, a considerable number of density maps are still resolved only at intermediate resolutions (5-10 Å). Due to the large variation in quality among these medium-resolution density maps, extracting structural information from them remains a challenging task. This study introduces a convolutional neural network (CNN)-based framework, cryoSSESeg, to determine the organization of protein secondary structure elements in medium-resolution cryo-EM images. CryoSSESeg is trained on approximately 1300 protein chains derived from around 500 experimental cryo-EM density maps of varied quality. It demonstrates strong performance with residue-level F 1 scores of 0.76 for helix detection and 0.60 for β-sheet detection on average across a set of testing chains. In comparison to traditional image processing tools like SSETracer, which demand significant manual intervention and preprocessing steps, cryoSSESeg demonstrates comparable or superior performance. Additionally, it demonstrates competitive performance alongside another deep learning-based model, Emap2sec. Furthermore, this study underscores the importance of secondary structure quality, particularly adherence to expected shapes, in detection performance, emphasizing the necessity for careful evaluation of the data quality.
Collapse
|
2
|
Corum MR, Venkannagari H, Hryc CF, Baker ML. Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure. Biophys J 2024; 123:435-450. [PMID: 38268190 PMCID: PMC10912932 DOI: 10.1016/j.bpj.2024.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Over the last 15 years, structural biology has seen unprecedented development and improvement in two areas: electron cryo-microscopy (cryo-EM) and predictive modeling. Once relegated to low resolutions, single-particle cryo-EM is now capable of achieving near-atomic resolutions of a wide variety of macromolecular complexes. Ushered in by AlphaFold, machine learning has powered the current generation of predictive modeling tools, which can accurately and reliably predict models for proteins and some complexes directly from the sequence alone. Although they offer new opportunities individually, there is an inherent synergy between these techniques, allowing for the construction of large, complex macromolecular models. Here, we give a brief overview of these approaches in addition to illustrating works that combine these techniques for model building. These examples provide insight into model building, assessment, and limitations when integrating predictive modeling with cryo-EM density maps. Together, these approaches offer the potential to greatly accelerate the generation of macromolecular structural insights, particularly when coupled with experimental data.
Collapse
Affiliation(s)
- Michael R Corum
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Harikanth Venkannagari
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas.
| |
Collapse
|
3
|
Lata K, Charles S, Mangala Prasad V. Advances in computational approaches to structure determination of alphaviruses and flaviviruses using cryo-electron microscopy. J Struct Biol 2023; 215:107993. [PMID: 37414374 DOI: 10.1016/j.jsb.2023.107993] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 05/15/2023] [Accepted: 07/03/2023] [Indexed: 07/08/2023]
Abstract
Advancements in the field of cryo-electron microscopy (cryo-EM) have greatly contributed to our current understanding of virus structures and life cycles. In this review, we discuss the application of single particle cryo-electron microscopy (EM) for the structure elucidation of small enveloped icosahedral viruses, namely, alpha- and flaviviruses. We focus on technical advances in cryo-EM data collection, image processing, three-dimensional reconstruction, and refinement strategies for obtaining high-resolution structures of these viruses. Each of these developments enabled new insights into the alpha- and flavivirus architecture, leading to a better understanding of their biology, pathogenesis, immune response, immunogen design, and therapeutic development.
Collapse
Affiliation(s)
- Kiran Lata
- Molecular Biophysics Unit, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Sylvia Charles
- Molecular Biophysics Unit, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| | - Vidya Mangala Prasad
- Molecular Biophysics Unit, Indian Institute of Science, Bengaluru, Karnataka 560012, India; Center for Infectious Disease Research, Indian Institute of Science, Bengaluru, Karnataka 560012, India
| |
Collapse
|
4
|
Chawla U, Chopra D. Structural Advancement in Shoc2‐MAPK Signaling Pathways in the Treatment of Cancer and Other Diseases. ChemistrySelect 2022. [DOI: 10.1002/slct.202203791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Udeep Chawla
- Innovation and Incubation Centre for Entrepreneurship Indian Institute of Science Education and Research Bhopal Bhopal 462066 Madhya Pradesh India
- The University of Arizona, Department of Chemistry and Biochemistry Tucson AZ85721 United States
| | - Deepak Chopra
- Innovation and Incubation Centre for Entrepreneurship Indian Institute of Science Education and Research Bhopal Bhopal 462066 Madhya Pradesh India
- Department of Chemistry Indian Institute of Science Education and Research Bhopal Bhopal 462066 Madhya Pradesh India
| |
Collapse
|
5
|
Garcia Condado J, Muñoz-Barrutia A, Sorzano COS. Automatic determination of the handedness of single-particle maps of macromolecules solved by CryoEM. J Struct Biol 2022; 214:107915. [PMID: 36341955 DOI: 10.1016/j.jsb.2022.107915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 08/29/2022] [Accepted: 10/25/2022] [Indexed: 12/07/2022]
Abstract
Single-Particle Analysis by Cryo-Electron Microscopy is a well-established technique to elucidate the three-dimensional (3D) structure of biological macromolecules. The orientation of the acquired projection images must be initially estimated without any reference to the final structure. In this step, algorithms may find a mirrored version of all the orientations resulting in a mirrored 3D map. It is as compatible with the acquired images as its unmirrored version from the image processing point of view, only that it is not biologically plausible. In this article, we introduce HaPi (Handedness Pipeline), the first method to automatically determine the hand of electron density maps of macromolecules solved by CryoEM. HaPi is built by training two 3D convolutional neural networks. The first determines α-helices in a map, and the second determines whether the α-helix is left-handed or right-handed. A consensus strategy defines the overall map hand. The pipeline is trained on simulated and experimental data. The handedness can be detected only for maps whose resolution is better than 5 Å. HaPi can identify the hand in 89% of new simulated maps correctly. Moreover, we evaluated all the maps deposited at the Electron Microscopy Data Bank and 11 structures uploaded with the incorrect hand were identified.
Collapse
Affiliation(s)
- J Garcia Condado
- Biocruces Bizkaia Instituto Investigación Sanitaria, Cruces Plaza, 48903 Barakaldo, Bizkaia, Spain; Universidad Carlos III de Madrid, Avda. de la Universidad 30, 28911 Leganés, Madrid, Spain; Centro Nacional de Biotecnologia (CNB-CSIC), Darwin, 3, Campus Universidad Autonoma, 28049 Cantoblanco, Madrid, Spain
| | - A Muñoz-Barrutia
- Universidad Carlos III de Madrid, Avda. de la Universidad 30, 28911 Leganés, Madrid, Spain
| | - C O S Sorzano
- Centro Nacional de Biotecnologia (CNB-CSIC), Darwin, 3, Campus Universidad Autonoma, 28049 Cantoblanco, Madrid, Spain.
| |
Collapse
|
6
|
Beton JG, Cragnolini T, Kaleel M, Mulvaney T, Sweeney A, Topf M. Integrating model simulation tools and
cryo‐electron
microscopy. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Joseph George Beton
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Tristan Cragnolini
- Institute of Structural and Molecular Biology, Birkbeck and University College London London UK
| | - Manaz Kaleel
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Aaron Sweeney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| |
Collapse
|
7
|
Thorn A. Artificial intelligence in the experimental determination and prediction of macromolecular structures. Curr Opin Struct Biol 2022; 74:102368. [DOI: 10.1016/j.sbi.2022.102368] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 02/22/2022] [Accepted: 03/08/2022] [Indexed: 11/26/2022]
|
8
|
Behkamal B, Naghibzadeh M, Pagnani A, Saberi MR, Al Nasr K. LPTD: a novel linear programming-based topology determination method for cryo-EM maps. Bioinformatics 2022; 38:2734-2741. [PMID: 35561171 PMCID: PMC9306757 DOI: 10.1093/bioinformatics/btac170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 03/01/2022] [Accepted: 03/18/2022] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Topology determination is one of the most important intermediate steps toward building the atomic structure of proteins from their medium-resolution cryo-electron microscopy (cryo-EM) map. The main goal in the topology determination is to identify correct matches (i.e. assignment and direction) between secondary structure elements (SSEs) (α-helices and β-sheets) detected in a protein sequence and cryo-EM density map. Despite many recent advances in molecular biology technologies, the problem remains a challenging issue. To overcome the problem, this article proposes a linear programming-based topology determination (LPTD) method to solve the secondary structure topology problem in three-dimensional geometrical space. Through modeling of the protein's sequence with the aid of extracting highly reliable features and a distance-based scoring function, the secondary structure matching problem is transformed into a complete weighted bipartite graph matching problem. Subsequently, an algorithm based on linear programming is developed as a decision-making strategy to extract the true topology (native topology) between all possible topologies. The proposed automatic framework is verified using 12 experimental and 15 simulated α-β proteins. Results demonstrate that LPTD is highly efficient and extremely fast in such a way that for 77% of cases in the dataset, the native topology has been detected in the first rank topology in <2 s. Besides, this method is able to successfully handle large complex proteins with as many as 65 SSEs. Such a large number of SSEs have never been solved with current tools/methods. AVAILABILITY AND IMPLEMENTATION The LPTD package (source code and data) is publicly available at https://github.com/B-Behkamal/LPTD. Moreover, two test samples as well as the instruction of utilizing the graphical user interface have been provided in the shared readme file. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Andrea Pagnani
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Torino I-10129, Italy
- Italian Institute for Genomic Medicine (IIGM), IRCC-Candiolo, Candiolo (TO) I-10060, Italy
- INFN Sezione di Torino, Torino I-10125, Italy
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
9
|
Behkamal B, Naghibzadeh M, Saberi MR, Tehranizadeh ZA, Pagnani A, Al Nasr K. Three-Dimensional Graph Matching to Identify Secondary Structure Correspondence of Medium-Resolution Cryo-EM Density Maps. Biomolecules 2021; 11:1773. [PMID: 34944417 PMCID: PMC8698881 DOI: 10.3390/biom11121773] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/18/2021] [Accepted: 11/20/2021] [Indexed: 01/15/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) is a structural technique that has played a significant role in protein structure determination in recent years. Compared to the traditional methods of X-ray crystallography and NMR spectroscopy, cryo-EM is capable of producing images of much larger protein complexes. However, cryo-EM reconstructions are limited to medium-resolution (~4-10 Å) for some cases. At this resolution range, a cryo-EM density map can hardly be used to directly determine the structure of proteins at atomic level resolutions, or even at their amino acid residue backbones. At such a resolution, only the position and orientation of secondary structure elements (SSEs) such as α-helices and β-sheets are observable. Consequently, finding the mapping of the secondary structures of the modeled structure (SSEs-A) to the cryo-EM map (SSEs-C) is one of the primary concerns in cryo-EM modeling. To address this issue, this study proposes a novel automatic computational method to identify SSEs correspondence in three-dimensional (3D) space. Initially, through a modeling of the target sequence with the aid of extracting highly reliable features from a generated 3D model and map, the SSEs matching problem is formulated as a 3D vector matching problem. Afterward, the 3D vector matching problem is transformed into a 3D graph matching problem. Finally, a similarity-based voting algorithm combined with the principle of least conflict (PLC) concept is developed to obtain the SSEs correspondence. To evaluate the accuracy of the method, a testing set of 25 experimental and simulated maps with a maximum of 65 SSEs is selected. Comparative studies are also conducted to demonstrate the superiority of the proposed method over some state-of-the-art techniques. The results demonstrate that the method is efficient, robust, and works well in the presence of errors in the predicted secondary structures of the cryo-EM images.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran;
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran;
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran; (M.R.S.); (Z.A.T.)
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Zeinab Amiri Tehranizadeh
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran; (M.R.S.); (Z.A.T.)
| | - Andrea Pagnani
- Politecnico di Torino, Corso Duca degli Abruzzi 24, I-10129 Torino, Italy;
- Italian Institute for Genomic Medicine, IRCCS Candiolo, SP-142, I-10060 Candiolo, Italy
- INFN, Sezione di Torino, I-10125 Torino, Italy
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
10
|
Alshammari M, He J. Combining Cryo-EM Density Map and Residue Contact for Protein Secondary Structure Topologies. Molecules 2021; 26:7049. [PMID: 34834140 PMCID: PMC8624718 DOI: 10.3390/molecules26227049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 11/01/2021] [Accepted: 11/15/2021] [Indexed: 11/23/2022] Open
Abstract
Although atomic structures have been determined directly from cryo-EM density maps with high resolutions, current structure determination methods for medium resolution (5 to 10 Å) cryo-EM maps are limited by the availability of structure templates. Secondary structure traces are lines detected from a cryo-EM density map for α-helices and β-strands of a protein. A topology of secondary structures defines the mapping between a set of sequence segments and a set of traces of secondary structures in three-dimensional space. In order to enhance accuracy in ranking secondary structure topologies, we explored a method that combines three sources of information: a set of sequence segments in 1D, a set of amino acid contact pairs in 2D, and a set of traces in 3D at the secondary structure level. A test of fourteen cases shows that the accuracy of predicted secondary structures is critical for deriving topologies. The use of significant long-range contact pairs is most effective at enriching the rank of the maximum-match topology for proteins with a large number of secondary structures, if the secondary structure prediction is fairly accurate. It was observed that the enrichment depends on the quality of initial topology candidates in this approach. We provide detailed analysis in various cases to show the potential and challenge when combining three sources of information.
Collapse
Affiliation(s)
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA;
| |
Collapse
|
11
|
He J, Huang SY. EMNUSS: a deep learning framework for secondary structure annotation in cryo-EM maps. Brief Bioinform 2021; 22:bbab156. [PMID: 33954706 PMCID: PMC8574626 DOI: 10.1093/bib/bbab156] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 03/30/2021] [Accepted: 04/06/2021] [Indexed: 02/06/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) has become one of important experimental methods in structure determination. However, despite the rapid growth in the number of deposited cryo-EM maps motivated by advances in microscopy instruments and image processing algorithms, building accurate structure models for cryo-EM maps remains a challenge. Protein secondary structure information, which can be extracted from EM maps, is beneficial for cryo-EM structure modeling. Here, we present a novel secondary structure annotation framework for cryo-EM maps at both intermediate and high resolutions, named EMNUSS. EMNUSS adopts a three-dimensional (3D) nested U-net architecture to assign secondary structures for EM maps. Tested on three diverse datasets including simulated maps, middle resolution experimental maps, and high-resolution experimental maps, EMNUSS demonstrated its accuracy and robustness in identifying the secondary structures for cyro-EM maps of various resolutions. The EMNUSS program is freely available at http://huanglab.phys.hust.edu.cn/EMNUSS.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
12
|
Mu Y, Sazzed S, Alshammari M, Sun J, He J. A Tool for Segmentation of Secondary Structures in 3D Cryo-EM Density Map Components Using Deep Convolutional Neural Networks. FRONTIERS IN BIOINFORMATICS 2021; 1:710119. [PMID: 36303800 PMCID: PMC9581063 DOI: 10.3389/fbinf.2021.710119] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Accepted: 09/28/2021] [Indexed: 07/20/2023] Open
Abstract
Although cryo-electron microscopy (cryo-EM) has been successfully used to derive atomic structures for many proteins, it is still challenging to derive atomic structures when the resolution of cryo-EM density maps is in the medium resolution range, such as 5-10 Å. Detection of protein secondary structures, such as helices and β-sheets, from cryo-EM density maps provides constraints for deriving atomic structures from such maps. As more deep learning methodologies are being developed for solving various molecular problems, effective tools are needed for users to access them. We have developed an effective software bundle, DeepSSETracer, for the detection of protein secondary structure from cryo-EM component maps in medium resolution. The bundle contains the network architecture and a U-Net model trained with a curriculum and gradient of episodic memory (GEM). The bundle integrates the deep neural network with the visualization capacity provided in ChimeraX. Using a Linux server that is remotely accessed by Windows users, it takes about 6 s on one CPU and one GPU for the trained deep neural network to detect secondary structures in a cryo-EM component map containing 446 amino acids. A test using 28 chain components of cryo-EM maps shows overall residue-level F1 scores of 0.72 and 0.65 to detect helices and β-sheets, respectively. Although deep learning applications are built on software frameworks, such as PyTorch and Tensorflow, our pioneer work here shows that integration of deep learning applications with ChimeraX is a promising and effective approach. Our experiments show that the F1 score measured at the residue level is an effective evaluation of secondary structure detection for individual classes. The test using 28 cryo-EM component maps shows that DeepSSETracer detects β-sheets more accurately than Emap2sec+, with a weighted average residue-level F1 score of 0.65 and 0.42, respectively. It also shows that Emap2sec+ detects helices more accurately than DeepSSETracer with a weighted average residue-level F1 score of 0.77 and 0.72 respectively.
Collapse
Affiliation(s)
| | | | | | | | - Jing He
- *Correspondence: Jing He, ; Jiangwen Sun,
| |
Collapse
|
13
|
Zumbado-Corrales M, Esquivel-Rodríguez J. EvoSeg: Automated Electron Microscopy Segmentation through Random Forests and Evolutionary Optimization. Biomimetics (Basel) 2021; 6:biomimetics6020037. [PMID: 34206006 PMCID: PMC8293153 DOI: 10.3390/biomimetics6020037] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 05/17/2021] [Accepted: 05/28/2021] [Indexed: 11/30/2022] Open
Abstract
Electron Microscopy Maps are key in the study of bio-molecular structures, ranging from borderline atomic level to the sub-cellular range. These maps describe the envelopes that cover possibly a very large number of proteins that form molecular machines within the cell. Within those envelopes, we are interested to find what regions correspond to specific proteins so that we can understand how they function, and design drugs that can enhance or suppress a process that they are involved in, along with other experimental purposes. A classic approach by which we can begin the exploration of map regions is to apply a segmentation algorithm. This yields a mask where each voxel in 3D space is assigned an identifier that maps it to a segment; an ideal segmentation would map each segment to one protein unit, which is rarely the case. In this work, we present a method that uses bio-inspired optimization, through an Evolutionary-Optimized Segmentation algorithm, to iteratively improve upon baseline segments obtained from a classical approach, called watershed segmentation. The cost function used by the evolutionary optimization is based on an ideal segmentation classifier trained as part of this development, which uses basic structural information available to scientists, such as the number of expected units, volume and topology. We show that a basic initial segmentation with the additional information allows our evolutionary method to find better segmentation results, compared to the baseline generated by the watershed.
Collapse
|
14
|
Wang X, Alnabati E, Aderinwale TW, Maddhuri Venkata Subramaniya SR, Terashi G, Kihara D. Detecting protein and DNA/RNA structures in cryo-EM maps of intermediate resolution using deep learning. Nat Commun 2021; 12:2302. [PMID: 33863902 PMCID: PMC8052361 DOI: 10.1038/s41467-021-22577-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 03/19/2021] [Indexed: 12/21/2022] Open
Abstract
An increasing number of density maps of macromolecular structures, including proteins and DNA/RNA complexes, have been determined by cryo-electron microscopy (cryo-EM). Although lately maps at a near-atomic resolution are routinely reported, there are still substantial fractions of maps determined at intermediate or low resolutions, where extracting structure information is not trivial. Here, we report a new computational method, Emap2sec+, which identifies DNA or RNA as well as the secondary structures of proteins in cryo-EM maps of 5 to 10 Å resolution. Emap2sec+ employs the deep Residual convolutional neural network. Emap2sec+ assigns structural labels with associated probabilities at each voxel in a cryo-EM map, which will help structure modeling in an EM map. Emap2sec+ showed stable and high assignment accuracy for nucleotides in low resolution maps and improved performance for protein secondary structure assignments than its earlier version when tested on simulated and experimental maps.
Collapse
Affiliation(s)
- Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Tunde W Aderinwale
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | | | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
15
|
Seffernick JT, Lindert S. Hybrid methods for combined experimental and computational determination of protein structure. J Chem Phys 2020; 153:240901. [PMID: 33380110 PMCID: PMC7773420 DOI: 10.1063/5.0026025] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 11/10/2020] [Indexed: 02/04/2023] Open
Abstract
Knowledge of protein structure is paramount to the understanding of biological function, developing new therapeutics, and making detailed mechanistic hypotheses. Therefore, methods to accurately elucidate three-dimensional structures of proteins are in high demand. While there are a few experimental techniques that can routinely provide high-resolution structures, such as x-ray crystallography, nuclear magnetic resonance (NMR), and cryo-EM, which have been developed to determine the structures of proteins, these techniques each have shortcomings and thus cannot be used in all cases. However, additionally, a large number of experimental techniques that provide some structural information, but not enough to assign atomic positions with high certainty have been developed. These methods offer sparse experimental data, which can also be noisy and inaccurate in some instances. In cases where it is not possible to determine the structure of a protein experimentally, computational structure prediction methods can be used as an alternative. Although computational methods can be performed without any experimental data in a large number of studies, inclusion of sparse experimental data into these prediction methods has yielded significant improvement. In this Perspective, we cover many of the successes of integrative modeling, computational modeling with experimental data, specifically for protein folding, protein-protein docking, and molecular dynamics simulations. We describe methods that incorporate sparse data from cryo-EM, NMR, mass spectrometry, electron paramagnetic resonance, small-angle x-ray scattering, Förster resonance energy transfer, and genetic sequence covariation. Finally, we highlight some of the major challenges in the field as well as possible future directions.
Collapse
Affiliation(s)
- Justin T. Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
16
|
Behkamal B, Naghibzadeh M, Pagnani A, Saberi MR, Al Nasr K. Solving the α-helix correspondence problem at medium-resolution Cryo-EM maps through modeling and 3D matching. J Mol Graph Model 2020; 103:107815. [PMID: 33338845 DOI: 10.1016/j.jmgm.2020.107815] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 11/09/2020] [Accepted: 11/18/2020] [Indexed: 11/30/2022]
Abstract
Cryo-electron microscopy (cryo-EM) has recently emerged as a prominent biophysical method for macromolecular structure determination. Many research efforts have been devoted to produce cryo-EM images, density maps, at near-atomic resolution. Despite many advances in technology, the resolution of the generated density maps may not be sufficiently adequate and informative to directly construct the atomic structure of proteins. At medium-resolution (∼4-10 Å), secondary structure elements (α-helices and β-sheets) are discernible, whereas finding the correspondence of secondary structure elements detected in the density map with those on the sequence remains a challenging problem. In this paper, an automatic framework is proposed to solve α-helix correspondence problem in three-dimensional space. Through modeling of the sequence with the aid of a novel strategy, the α-helix correspondence problem is initially transformed into a complete weighted bipartite graph matching problem. An innovative correlation-based scoring function based on a well-known and robust statistical method is proposed for weighting the graph. Moreover, two local optimization algorithms, which are Greedy and Improved Greedy algorithms, have been presented to find α-helix correspondence. A widely used data set including 16 reconstructed and 4 experimental cryo-EM maps were chosen to verify the accuracy and reliability of the proposed automatic method. The experimental results demonstrate that the automatic method is highly efficient (86.25% accuracy), robust (11.3% error rate), fast (∼1.4 s), and works independently from cryo-EM skeleton.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad, 9177948944, Iran.
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad, 9177948944, Iran.
| | - Andrea Pagnani
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino, Italy; Italian Institute for Genomic Medicine (IIGM), IRCC-Candiolo, Candiolo, TO, Italy; INFN Sezione di Torino, Via P. Giuria 1, Torino, Italy
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad, Iran; Bioinformatics Research group, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN, 37209, USA
| |
Collapse
|
17
|
Alshammari M, He J. Combine Cryo-EM Density Map and Residue Contact for Protein Structure Prediction - A Case Study. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2020; 2020:110. [PMID: 35838376 PMCID: PMC9279007 DOI: 10.1145/3388440.3414708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Cryo-electron microscopy is a major structure determination technique for large molecular machines and membrane-associated complexes. Although atomic structures have been determined directly from cryo-EM density maps with high resolutions, current structure determination methods for medium resolution (5 to 10 Å) cryo-EM maps are limited by the availability of structure templates. Secondary structure traces are lines detected from a cryo-EM density map for α-helices and β-strands of a protein. When combined with secondary structure sequence segments predicted from a protein sequence, it is possible to generate a set of likely topologies of α-traces and β-sheet traces. A topology describes the overall folding relationship among secondary structures; it is a critical piece of information for deriving the corresponding atomic structure. We propose a method for protein structure prediction that combines three sources of information: the secondary structure traces detected from the cryo-EM density map, predicted secondary structure sequence segments, and amino acid contact pairs predicted using MULTICOM. A case study shows that using amino acid contact prediction from MULTICOM improves the ranking of the true topology. Our observations convey that using a small set of highly voted secondary structure contact pairs enhances the ranking in all experiments conducted for this case.
Collapse
|
18
|
Deng Y, Mu Y, Sazzed S, Sun J, He J. Using Curriculum Learning in Pattern Recognition of 3-dimensional Cryo-electron Microscopy Density Maps. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2020; 2020:112. [PMID: 35838357 PMCID: PMC9279008 DOI: 10.1145/3388440.3414710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Although Cryo-electron microscopy (cryo-EM) has been successfully used to derive atomic structures for many proteins, it is still challenging to derive atomic structure when the resolution of cryo-EM density maps is in the medium range, e.g., 5-10 Å. Studies have attempted to utilize machine learning methods, especially deep neural networks to build predictive models for the detection of protein secondary structures from cryo-EM images, which ultimately helps to derive the atomic structure of proteins. However, the large variation in data quality makes it challenging to train a deep neural network with high prediction accuracy. Curriculum learning has been shown as an effective learning paradigm in machine learning. In this paper, we present a study using curriculum learning as a more effective way to utilize cryo-EM density maps with varying quality. We investigated three distinct training curricula that differ in whether/how images used for training in past are reused while the network was continually trained using new images. A total of 1,382 3-dimensional cryo-EM images were extracted from density maps of Electron Microscopy Data Bank in our study. Our results indicate learning with curriculum significantly improves the performance of the final trained network when the forgetting problem is properly addressed.
Collapse
Affiliation(s)
- Yangmei Deng
- Department of Computer Science, Old Dominion University, Norfolk VA USA
| | - Yongcheng Mu
- Department of Computer Science, Old Dominion University, Norfolk VA USA
| | - Salim Sazzed
- Department of Computer Science, Old Dominion University, Norfolk VA USA
| | - Jiangwen Sun
- Department of Computer Science, Old Dominion University, Norfolk VA USA
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk VA USA
| |
Collapse
|
19
|
Mostosi P, Schindelin H, Kollmannsberger P, Thorn A. Haruspex: A Neural Network for the Automatic Identification of Oligonucleotides and Protein Secondary Structure in Cryo-Electron Microscopy Maps. Angew Chem Int Ed Engl 2020; 59:14788-14795. [PMID: 32187813 PMCID: PMC7497202 DOI: 10.1002/anie.202000421] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 03/11/2020] [Indexed: 11/25/2022]
Abstract
In recent years, three-dimensional density maps reconstructed from single particle images obtained by electron cryo-microscopy (cryo-EM) have reached unprecedented resolution. However, map interpretation can be challenging, in particular if the constituting structures require de-novo model building or are very mobile. Herein, we demonstrate the potential of convolutional neural networks for the annotation of cryo-EM maps: our network Haruspex has been trained on a carefully curated set of 293 experimentally derived reconstruction maps to automatically annotate RNA/DNA as well as protein secondary structure elements. It can be straightforwardly applied to newly reconstructed maps in order to support domain placement or as a starting point for main-chain placement. Due to its high recall and precision rates of 95.1 % and 80.3 %, respectively, on an independent test set of 122 maps, it can also be used for validation during model building. The trained network will be available as part of the CCP-EM suite.
Collapse
Affiliation(s)
- Philipp Mostosi
- Institute of Structural BiologyRudolf Virchow Center for Experimental BiomedicineUniversity of WürzburgJosef-Schneider-Str. 297080WürzburgGermany
- Center for Computational and Theoretical BiologyUniversity of WürzburgCampus Hubland Nord 3297074WürzburgGermany
| | - Hermann Schindelin
- Institute of Structural BiologyRudolf Virchow Center for Experimental BiomedicineUniversity of WürzburgJosef-Schneider-Str. 297080WürzburgGermany
| | - Philip Kollmannsberger
- Center for Computational and Theoretical BiologyUniversity of WürzburgCampus Hubland Nord 3297074WürzburgGermany
| | - Andrea Thorn
- Institute of Structural BiologyRudolf Virchow Center for Experimental BiomedicineUniversity of WürzburgJosef-Schneider-Str. 297080WürzburgGermany
| |
Collapse
|
20
|
Leelananda SP, Lindert S. Using NMR Chemical Shifts and Cryo-EM Density Restraints in Iterative Rosetta-MD Protein Structure Refinement. J Chem Inf Model 2020; 60:2522-2532. [PMID: 31872764 PMCID: PMC7262651 DOI: 10.1021/acs.jcim.9b00932] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Cryo-EM has become one of the prime methods for protein structure elucidation, frequently yielding density maps with near-atomic or medium resolution. If protein structures cannot be deduced unambiguously from the density maps, computational structure refinement tools are needed to generate protein structural models. We have previously developed an iterative Rosetta-MDFF protocol that used cryo-EM densities to refine protein structures. Here we show that, in addition to cryo-EM densities, incorporation of other experimental restraints into the Rosetta-MDFF protocol further improved refined structures. We used NMR chemical shift (CS) data integrated with cryo-EM densities in our hybrid protocol in both the Rosetta step and the molecular dynamics (MD) simulations step. In 15 out of 18 cases for all MD rounds, the refinement results obtained when density maps and NMR chemical shift data were used in combination outperformed those of density map-only refinement. Notably, the improvement in refinement was highest when medium and low-resolution density maps were used. With our hybrid method, the RMSDs of final models obtained were always better than the RMSDs obtained by our previous protocol with just density refinement for both medium (6.9 Å) and low (9 Å) resolution maps. For all the six test proteins with medium resolution density maps (6.9 Å), the final refined structure RMSDs were lower for the hybrid method than for the cryo-EM only refinement. The final refined RMSDs were less than 1.5 Å when our hybrid protocol was used with 4 Å density maps. For four out of the six proteins the final RMSDs were even less than 1 Å. This study demonstrates that by using a combination of cryo-EM and NMR restraints, it is possible to refine structures to atomic resolution, outperforming single restraint refinement. This hybrid protocol will be a valuable tool when only low-resolution cryo-EM density data and NMR chemical shift data are available to refine structures.
Collapse
Affiliation(s)
- Sumudu P. Leelananda
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210
| |
Collapse
|
21
|
Sazzed S, Scheible P, Alshammari M, Wriggers W, He J. Cylindrical Similarity Measurement for Helices in Medium-Resolution Cryo-Electron Microscopy Density Maps. J Chem Inf Model 2020; 60:2644-2650. [PMID: 32216344 PMCID: PMC8279803 DOI: 10.1021/acs.jcim.0c00010] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Cryo-electron microscopy (cryo-EM) density maps at medium resolution (5-10 Å) reveal secondary structural features such as α-helices and β-sheets, but they lack the side chain details that would enable a direct structure determination. Among the more than 800 entries in the Electron Microscopy Data Bank (EMDB) of medium-resolution density maps that are associated with atomic models, a wide variety of similarities can be observed between maps and models. To validate such atomic models and to classify structural features, a local similarity criterion, the F1 score, is proposed and evaluated in this study. The F1 score is theoretically normalized to a range from zero to one, providing a local measure of cylindrical agreement between the density and atomic model of a helix. A systematic scan of 30,994 helices (among 3,247 protein chains modeled into medium-resolution density maps) reveals an actual range of observed F1 scores from 0.171 to 0.848, suggesting that the cylindrical fit of the current data is well stratified by the proposed measure. The best (highest) F1 scores tend to be associated with regions that exhibit high and spatially homogeneous local resolution (between 5 Å and 7.5 Å) in the helical density. The proposed F1 scores can be used as a discriminative classifier for validation studies and as a ranking criterion for cryo-EM density features in databases.
Collapse
Affiliation(s)
- Salim Sazzed
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529, United States
| | - Peter Scheible
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529, United States
| | - Maytha Alshammari
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529, United States
| | - Willy Wriggers
- Department of Mechanical and Aerospace Engineering, Old Dominion University, Norfolk, Virginia 23529, United States
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529, United States
| |
Collapse
|
22
|
Mostosi P, Schindelin H, Kollmannsberger P, Thorn A. Haruspex: A Neural Network for the Automatic Identification of Oligonucleotides and Protein Secondary Structure in Cryo‐Electron Microscopy Maps. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.202000421] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Philipp Mostosi
- Institute of Structural Biology Rudolf Virchow Center for Experimental Biomedicine University of Würzburg Josef-Schneider-Str. 2 97080 Würzburg Germany
- Center for Computational and Theoretical Biology University of Würzburg Campus Hubland Nord 32 97074 Würzburg Germany
| | - Hermann Schindelin
- Institute of Structural Biology Rudolf Virchow Center for Experimental Biomedicine University of Würzburg Josef-Schneider-Str. 2 97080 Würzburg Germany
| | - Philip Kollmannsberger
- Center for Computational and Theoretical Biology University of Würzburg Campus Hubland Nord 32 97074 Würzburg Germany
| | - Andrea Thorn
- Institute of Structural Biology Rudolf Virchow Center for Experimental Biomedicine University of Würzburg Josef-Schneider-Str. 2 97080 Würzburg Germany
| |
Collapse
|
23
|
Dodd T, Yan C, Ivanov I. Simulation-Based Methods for Model Building and Refinement in Cryoelectron Microscopy. J Chem Inf Model 2020; 60:2470-2483. [PMID: 32202798 DOI: 10.1021/acs.jcim.0c00087] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Advances in cryoelectron microscopy (cryo-EM) have revolutionized the structural investigation of large macromolecular assemblies. In this review, we first provide a broad overview of modeling methods used for flexible fitting of molecular models into cryo-EM density maps. We give special attention to approaches rooted in molecular simulations-atomistic molecular dynamics and Monte Carlo. Concise descriptions of the methods are given along with discussion of their advantages, limitations, and most popular alternatives. We also describe recent extensions of the widely used molecular dynamics flexible fitting (MDFF) method and discuss how different model-building techniques could be incorporated into new hybrid modeling schemes and simulation workflows. Finally, we provide two illustrative examples of model-building and refinement strategies employing MDFF, cascade MDFF, and RosettaCM. These examples come from recent cryo-EM studies that elucidated transcription preinitiation complexes and shed light on the functional roles of these assemblies in gene expression and gene regulation.
Collapse
Affiliation(s)
- Thomas Dodd
- Department of Chemistry, Georgia State University, Atlanta, Georgia 30302, United States.,Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, Georgia 30302, United States
| | - Chunli Yan
- Department of Chemistry, Georgia State University, Atlanta, Georgia 30302, United States.,Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, Georgia 30302, United States
| | - Ivaylo Ivanov
- Department of Chemistry, Georgia State University, Atlanta, Georgia 30302, United States.,Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, Georgia 30302, United States
| |
Collapse
|
24
|
Terwilliger TC, Adams PD, Afonine PV, Sobolev OV. Cryo-EM map interpretation and protein model-building using iterative map segmentation. Protein Sci 2020; 29:87-99. [PMID: 31599033 PMCID: PMC6933853 DOI: 10.1002/pro.3740] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 09/30/2019] [Accepted: 10/01/2019] [Indexed: 11/17/2022]
Abstract
A procedure for building protein chains into maps produced by single-particle electron cryo-microscopy (cryo-EM) is described. The procedure is similar to the way an experienced structural biologist might analyze a map, focusing first on secondary structure elements such as helices and sheets, then varying the contour level to identify connections between these elements. Since the high density in a map typically follows the main-chain of the protein, the main-chain connection between secondary structure elements can often be identified as the unbranched path between them with the highest minimum value along the path. This chain-tracing procedure is then combined with finding side-chain positions based on the presence of density extending away from the main path of the chain, allowing generation of a Cα model. The Cα model is converted to an all-atom model and is refined against the map. We show that this procedure is as effective as other existing methods for interpretation of cryo-EM maps and that it is considerably faster and produces models with fewer chain breaks than our previous methods that were based on approaches developed for crystallographic maps.
Collapse
Affiliation(s)
- Thomas C. Terwilliger
- Los Alamos National LaboratoryLos AlamosNew Mexico
- New Mexico ConsortiumLos AlamosNew Mexico
| | - Paul D. Adams
- Molecular Biophysics & Integrated Bioimaging DivisionLawrence Berkeley National LaboratoryBerkeleyCalifornia
- Department of BioengineeringUniversity of California BerkeleyBerkeleyCalifornia
| | - Pavel V. Afonine
- Molecular Biophysics & Integrated Bioimaging DivisionLawrence Berkeley National LaboratoryBerkeleyCalifornia
| | - Oleg V. Sobolev
- Molecular Biophysics & Integrated Bioimaging DivisionLawrence Berkeley National LaboratoryBerkeleyCalifornia
| |
Collapse
|
25
|
Gauto DF, Estrozi LF, Schwieters CD, Effantin G, Macek P, Sounier R, Sivertsen AC, Schmidt E, Kerfah R, Mas G, Colletier JP, Güntert P, Favier A, Schoehn G, Schanda P, Boisbouvier J. Integrated NMR and cryo-EM atomic-resolution structure determination of a half-megadalton enzyme complex. Nat Commun 2019; 10:2697. [PMID: 31217444 PMCID: PMC6584647 DOI: 10.1038/s41467-019-10490-9] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 05/10/2019] [Indexed: 12/14/2022] Open
Abstract
Atomic-resolution structure determination is crucial for understanding protein function. Cryo-EM and NMR spectroscopy both provide structural information, but currently cryo-EM does not routinely give access to atomic-level structural data, and, generally, NMR structure determination is restricted to small (<30 kDa) proteins. We introduce an integrated structure determination approach that simultaneously uses NMR and EM data to overcome the limits of each of these methods. The approach enables structure determination of the 468 kDa large dodecameric aminopeptidase TET2 to a precision and accuracy below 1 Å by combining secondary-structure information obtained from near-complete magic-angle-spinning NMR assignments of the 39 kDa-large subunits, distance restraints from backbone amides and ILV methyl groups, and a 4.1 Å resolution EM map. The resulting structure exceeds current standards of NMR and EM structure determination in terms of molecular weight and precision. Importantly, the approach is successful even in cases where only medium-resolution cryo-EM data are available. NMR structure determination is challenging for proteins with a molecular weight above 30 kDa and atomic-resolution structure determination from cryo-EM data is currently not the rule. Here the authors describe an integrated structure determination approach that simultaneously uses NMR and EM data and allows them to determine the structure of the 468 kDa dodecameric aminopeptidase TET2 complex.
Collapse
Affiliation(s)
- Diego F Gauto
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France
| | - Leandro F Estrozi
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.
| | - Charles D Schwieters
- Laboratory of Imaging Sciences, Center for Information Technology, National Institutes of Health, 12 South Drive, MSC 5624, Bethesda, MD, 20892, USA
| | - Gregory Effantin
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France
| | - Pavel Macek
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.,NMR-Bio, 5 Place Robert Schuman, F-38025, Grenoble, France
| | - Remy Sounier
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.,Institut de Génomique Fonctionnelle, CNRS UMR-5203, INSERM U1191, University of Montpellier, F-34000, Montpellier, France
| | - Astrid C Sivertsen
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France
| | - Elena Schmidt
- Institute of Biophysical Chemistry, Goethe University Frankfurt am Main, 60438 Frankfurt am Main, Germany
| | - Rime Kerfah
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.,NMR-Bio, 5 Place Robert Schuman, F-38025, Grenoble, France
| | - Guillaume Mas
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.,Biozentrum University of Basel, Klingelbergstrasse 70, 4056, Basel, Switzerland
| | - Jacques-Philippe Colletier
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France
| | - Peter Güntert
- Institute of Biophysical Chemistry, Goethe University Frankfurt am Main, 60438 Frankfurt am Main, Germany.,Laboratory of Physical Chemistry, ETH Zürich, 8093 Zürich, Switzerland.,Graduate School of Science, Tokyo Metropolitan University, Tokyo 192-0397, Japan
| | - Adrien Favier
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.
| | - Guy Schoehn
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France
| | - Paul Schanda
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France.
| | - Jerome Boisbouvier
- Institut de Biologie Structurale (IBS), CEA, CNRS, Université Grenoble Alpes, 71, Avenue des Martyrs, F-38044, Grenoble, France
| |
Collapse
|
26
|
Terwilliger TC, Adams PD, Afonine PV, Sobolev OV. Map segmentation, automated model-building and their application to the Cryo-EM Model Challenge. J Struct Biol 2018; 204:338-343. [PMID: 30063987 PMCID: PMC6163059 DOI: 10.1016/j.jsb.2018.07.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 07/11/2018] [Accepted: 07/27/2018] [Indexed: 11/27/2022]
Abstract
A recently-developed method for identifying a compact, contiguous region representing the unique part of a density map was applied to 218 Cryo-EM maps with resolutions of 4.5 Å or better. The key elements of the segmentation procedure are (1) identification of all regions of density above a threshold and (2) choice of a unique set of these regions, taking symmetry into consideration, that maximize connectivity and compactness. This segmentation approach was then combined with tools for automated map sharpening and model-building to generate models for the 12 maps in the 2016 Cryo-EM Model Challenge in a fully automated manner. The resulting models have completeness from 24% to 82% and RMS distances from reference interpretations of 0.6 Å-2.1 Å.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA; New Mexico Consortium, Los Alamos, NM 87544, USA.
| | - Paul D Adams
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA; Department of Bioengineering, University of California Berkeley, Berkeley, CA, USA
| | - Pavel V Afonine
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA; Department of Physics and International Centre for Quantum and Molecular Structures, Shanghai University, Shanghai 200444, People's Republic of China
| | - Oleg V Sobolev
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA
| |
Collapse
|
27
|
Richardson JS, Williams CJ, Videau LL, Chen VB, Richardson DC. Assessment of detailed conformations suggests strategies for improving cryoEM models: Helix at lower resolution, ensembles, pre-refinement fixups, and validation at multi-residue length scale. J Struct Biol 2018; 204:301-312. [PMID: 30107233 PMCID: PMC6163098 DOI: 10.1016/j.jsb.2018.08.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Revised: 08/01/2018] [Accepted: 08/08/2018] [Indexed: 11/17/2022]
Abstract
We find that the overall quite good methods used in the CryoEM Model Challenge could still benefit greatly from several strategies for improving local conformations. Our assessments primarily use validation criteria from the MolProbity web service. Those criteria include MolProbity's all-atom contact analysis, updated versions of standard conformational validations for protein and RNA, plus two recent additions: first, flags for cis-nonPro and twisted peptides, and second, the CaBLAM system for diagnosing secondary structure, validating Cα backbone, and validating adjacent peptide CO orientations in the context of the Cα trace. In general, automated ab initio building of starting models is quite good at backbone connectivity but often fails at local conformation or sequence register, especially at poorer than 3.5 Å resolution. However, we show that even if criteria (such as Ramachandran or rotamer) are explicitly restrained to improve refinement behavior and overall validation scores, automated optimization of a deposited structure seldom corrects specific misfittings that start in the wrong local minimum, but just hides them. Therefore, local problems should be identified, and as many as possible corrected, before starting refinement. Secondary structures are confusing at 3-4 Å but can be better recognized at 6-8 Å. In future model challenges, specific steps being tested (such as segmentation) and the required documentation (such as PDB code of starting model) should each be explicitly defined, so competing methods on a given task can be meaningfully compared. Individual local examples are presented here, to understand what local mistakes and corrections look like in 3D, how they probably arise, and what possible improvements to methodology might help avoid them. At these resolutions, both structural biologists and end-users need meaningful estimates of local uncertainty, perhaps through explicit ensembles. Fitting problems can best be diagnosed by validation that spans multiple residues; CaBLAM is such a multi-residue tool, and its effectiveness is demonstrated.
Collapse
Affiliation(s)
| | | | - Lizbeth L Videau
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | - Vincent B Chen
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | | |
Collapse
|
28
|
Haslam D, Zeng T, Li R, He J. Exploratory Studies Detecting Secondary Structures in Medium Resolution 3D Cryo-EM Images Using Deep Convolutional Neural Networks. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2018; 2018:628-632. [PMID: 35838356 PMCID: PMC9279009 DOI: 10.1145/3233547.3233704] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Cryo-electron microscopy (cryo-EM) is an emerging biophysical technique for structural determination of protein complexes. However, accurate detection of secondary structures is still challenging when cryo-EM density maps are at medium resolutions (5-10 Å). Most of existing methods are image processing methods that do not fully utilize available images in the cryo-EM database. In this paper, we present a deep learning approach to segment secondary structure elements as helices and β-sheets from medium-resolution density maps. The proposed 3D convolutional neural network is shown to detect secondary structure locations with an F1 score between 0.79 and 0.88 for six simulated test cases. The architecture was also applied to an experimentally-derived cryo-EM density map with good accuracy.
Collapse
Affiliation(s)
- Devin Haslam
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Tao Zeng
- Department of Computer Science, Washington State University, Pullman, WA 99164
| | | | - Jing He
- Corresponding author: Jing He,
| |
Collapse
|
29
|
Terashi G, Kihara D. De novo main-chain modeling with MAINMAST in 2015/2016 EM Model Challenge. J Struct Biol 2018; 204:351-359. [PMID: 30075190 PMCID: PMC6179447 DOI: 10.1016/j.jsb.2018.07.013] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Revised: 07/13/2018] [Accepted: 07/19/2018] [Indexed: 11/15/2022]
Abstract
Protein tertiary structure modeling is a critical step for the interpretation of three dimensional (3D) election microscopy density. Our group participated the 2015/2016 EM Model Challenge using the MAINMAST software for a de novo main chain modeling. The software generates local dense points using the mean shifting algorithm, and connects them into Cα models by calculating the minimum spanning tree and the longest path. Subsequently, full atom structure models are generated, which are subject to structural refinement. Here, we summarize the qualities of our submitted models and examine successful and unsuccessful models, including 3D models we did not submit to the Challenge. Our protocol using the MAINMAST software was sometimes able to build correct conformations with 3.4–5.1 Å RMSD. Unsuccessful models had failure of chain traces, however, their Cα positions and some local structures were quite correctly built. For evaluate the quality of the models, the MAINMAST software provides a confidence score for each Cα position from the consensus of top 100 scoring models.
Collapse
Affiliation(s)
- Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA; Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA.
| |
Collapse
|
30
|
Aprahamian ML, Chea EE, Jones LM, Lindert S. Rosetta Protein Structure Prediction from Hydroxyl Radical Protein Footprinting Mass Spectrometry Data. Anal Chem 2018; 90:7721-7729. [PMID: 29874044 DOI: 10.1021/acs.analchem.8b01624] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In recent years mass spectrometry-based covalent labeling techniques such as hydroxyl radical footprinting (HRF) have emerged as valuable structural biology techniques, yielding information on protein tertiary structure. These data, however, are not sufficient to predict protein structure unambiguously, as they provide information only on the relative solvent exposure of certain residues. Despite some recent advances, no software currently exists that can utilize covalent labeling mass spectrometry data to predict protein tertiary structure. We have developed the first such tool, which incorporates mass spectrometry derived protection factors from HRF labeling as a new centroid score term for the Rosetta scoring function to improve the prediction of protein tertiary structures. We tested our method on a set of four soluble benchmark proteins with known crystal structures and either published HRF experimental results or internally acquired data. Using the HRF labeling data, we rescored large decoy sets of structures predicted with Rosetta for each of the four benchmark proteins. As a result, the model quality improved for all benchmark proteins as compared to when scored with Rosetta alone. For two of the four proteins we were even able to identify atomic resolution models with the addition of HRF data.
Collapse
Affiliation(s)
- Melanie L Aprahamian
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| | - Emily E Chea
- Department of Pharmaceutical Sciences , University of Maryland , Baltimore , Maryland 21201 , United States
| | - Lisa M Jones
- Department of Pharmaceutical Sciences , University of Maryland , Baltimore , Maryland 21201 , United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| |
Collapse
|
31
|
Haslam D, Sazzed S, Wriggers W, Kovcas J, Song J, Auer M, He J. A Pattern Recognition Tool for Medium-resolution Cryo-EM Density Maps and Low-resolution Cryo-ET Density maps. BIOINFORMATICS RESEARCH AND APPLICATIONS : 14TH INTERNATIONAL SYMPOSIUM, ISBRA 2018, BEIJING, CHINA, JUNE 8-11, 2018, PROCEEDINGS. ISBRA (CONFERENCE) (14TH : 2018 : BEIJING, CHINA) 2018; 10847:233-238. [PMID: 36383494 PMCID: PMC9645795 DOI: 10.1007/978-3-319-94968-0_22] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Cryo-electron microscopy (Cryo-EM) and cryo-electron tomography (cryo-ET) produce 3-D density maps of biological molecules at a range of resolution levels. Pattern recognition tools are important in distinguishing biological components from volumetric maps with the available resolutions. One of the most distinct characters in density maps at medium (5-10 Å) resolution is the visibility of protein secondary structures. Although computational methods have been developed, the accurate detection of helices and β-strands from cryo-EM density maps is still an active research area. We have developed a tool for protein secondary structure detection and evaluation of medium resolution 3-D cryo-EM density maps which combines three computational methods (SSETracer, StrandTwister, and AxisComparison). The program was integrated in UCSF Chimera, a popular visualization software in the cryo-EM community. In related work, we have developed BundleTrac, a computational method to trace filaments in a bundle from lower resolution cryo-ET density maps. It has been applied to actin filament tracing in stereocilia with good accuracy and can be potentially added as a tool in Chimera.
Collapse
Affiliation(s)
- Devin Haslam
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - Salim Sazzed
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - Willy Wriggers
- Department of Mechanical and Aerospace Engineering, Old Dominion University, Norfolk, VA 23529, USA
| | - Julio Kovcas
- Department of Mechanical and Aerospace Engineering, Old Dominion University, Norfolk, VA 23529, USA
| | - Junha Song
- Cell and Tissue Imaging, Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Manfred Auer
- Cell and Tissue Imaging, Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
32
|
Cassidy CK, Himes BA, Luthey-Schulten Z, Zhang P. CryoEM-based hybrid modeling approaches for structure determination. Curr Opin Microbiol 2018; 43:14-23. [PMID: 29107896 PMCID: PMC5934336 DOI: 10.1016/j.mib.2017.10.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Revised: 10/04/2017] [Accepted: 10/09/2017] [Indexed: 12/21/2022]
Abstract
Recent advances in cryo-electron microscopy (cryoEM) have dramatically improved the resolutions at which vitrified biological specimens can be studied, revealing new structural and mechanistic insights over a broad range of spatial scales. Bolstered by these advances, much effort has been directed toward the development of hybrid modeling methodologies for the construction and refinement of high-fidelity atomistic models from cryoEM data. In this brief review, we will survey the key elements of cryoEM-based hybrid modeling, providing an overview of available computational tools and strategies as well as several recent applications.
Collapse
Affiliation(s)
- C Keith Cassidy
- Department of Physics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Benjamin A Himes
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Zaida Luthey-Schulten
- Department of Chemistry, Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Peijun Zhang
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA; Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK; Electron Bio-Imaging Centre, Diamond Light Sources, Harwell Science and Innovation Campus, Didcot OX11 0DE, UK.
| |
Collapse
|
33
|
Tracing Actin Filament Bundles in Three-Dimensional Electron Tomography Density Maps of Hair Cell Stereocilia. Molecules 2018; 23:molecules23040882. [PMID: 29641472 PMCID: PMC6017643 DOI: 10.3390/molecules23040882] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Revised: 03/14/2018] [Accepted: 03/22/2018] [Indexed: 12/20/2022] Open
Abstract
Cryo-electron tomography (cryo-ET) is a powerful method of visualizing the three-dimensional organization of supramolecular complexes, such as the cytoskeleton, in their native cell and tissue contexts. Due to its minimal electron dose and reconstruction artifacts arising from the missing wedge during data collection, cryo-ET typically results in noisy density maps that display anisotropic XY versus Z resolution. Molecular crowding further exacerbates the challenge of automatically detecting supramolecular complexes, such as the actin bundle in hair cell stereocilia. Stereocilia are pivotal to the mechanoelectrical transduction process in inner ear sensory epithelial hair cells. Given the complexity and dense arrangement of actin bundles, traditional approaches to filament detection and tracing have failed in these cases. In this study, we introduce BundleTrac, an effective method to trace hundreds of filaments in a bundle. A comparison between BundleTrac and manually tracing the actin filaments in a stereocilium showed that BundleTrac accurately built 326 of 330 filaments (98.8%), with an overall cross-distance of 1.3 voxels for the 330 filaments. BundleTrac is an effective semi-automatic modeling approach in which a seed point is provided for each filament and the rest of the filament is computationally identified. We also demonstrate the potential of a denoising method that uses a polynomial regression to address the resolution and high-noise anisotropic environment of the density map.
Collapse
|
34
|
Li B, Fooksa M, Heinze S, Meiler J. Finding the needle in the haystack: towards solving the protein-folding problem computationally. Crit Rev Biochem Mol Biol 2018; 53:1-28. [PMID: 28976219 PMCID: PMC6790072 DOI: 10.1080/10409238.2017.1380596] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Revised: 08/22/2017] [Accepted: 09/13/2017] [Indexed: 12/22/2022]
Abstract
Prediction of protein tertiary structures from amino acid sequence and understanding the mechanisms of how proteins fold, collectively known as "the protein folding problem," has been a grand challenge in molecular biology for over half a century. Theories have been developed that provide us with an unprecedented understanding of protein folding mechanisms. However, computational simulation of protein folding is still difficult, and prediction of protein tertiary structure from amino acid sequence is an unsolved problem. Progress toward a satisfying solution has been slow due to challenges in sampling the vast conformational space and deriving sufficiently accurate energy functions. Nevertheless, several techniques and algorithms have been adopted to overcome these challenges, and the last two decades have seen exciting advances in enhanced sampling algorithms, computational power and tertiary structure prediction methodologies. This review aims at summarizing these computational techniques, specifically conformational sampling algorithms and energy approximations that have been frequently used to study protein-folding mechanisms or to de novo predict protein tertiary structures. We hope that this review can serve as an overview on how the protein-folding problem can be studied computationally and, in cases where experimental approaches are prohibitive, help the researcher choose the most relevant computational approach for the problem at hand. We conclude with a summary of current challenges faced and an outlook on potential future directions.
Collapse
Affiliation(s)
- Bian Li
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Michaela Fooksa
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Chemical and Physical Biology Graduate Program, Vanderbilt University, Nashville, TN, USA
| | - Sten Heinze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
35
|
Al Nasr K, Yousef F, Jebril R, Jones C. Analytical Approaches to Improve Accuracy in Solving the Protein Topology Problem. Molecules 2018; 23:E28. [PMID: 29360779 PMCID: PMC6017786 DOI: 10.3390/molecules23020028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 01/19/2018] [Accepted: 01/19/2018] [Indexed: 11/17/2022] Open
Abstract
To take advantage of recent advances in genomics and proteomics it is critical that the three-dimensional physical structure of biological macromolecules be determined. Cryo-Electron Microscopy (cryo-EM) is a promising and improving method for obtaining this data, however resolution is often not sufficient to directly determine the atomic scale structure. Despite this, information for secondary structure locations is detectable. De novo modeling is a computational approach to modeling these macromolecular structures based on cryo-EM derived data. During de novo modeling a mapping between detected secondary structures and the underlying amino acid sequence must be identified. DP-TOSS (Dynamic Programming for determining the Topology Of Secondary Structures) is one tool that attempts to automate the creation of this mapping. By treating the correspondence between the detected structures and the structures predicted from sequence data as a constraint graph problem DP-TOSS achieved good accuracy in its original iteration. In this paper, we propose modifications to the scoring methodology of DP-TOSS to improve its accuracy. Three scoring schemes were applied to DP-TOSS and tested: (i) a skeleton-based scoring function; (ii) a geometry-based analytical function; and (iii) a multi-well potential energy-based function. A test of 25 proteins shows that a combination of these schemes can improve the performance of DP-TOSS to solve the topology determination problem for macromolecule proteins.
Collapse
Affiliation(s)
- Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Feras Yousef
- Department of Mathematics, The University of Jordan, Amman 11942, Jordan.
| | - Ruba Jebril
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Christopher Jones
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| |
Collapse
|
36
|
Islam T, Poteat M, He J. Quantification of Twist from the Central Lines of β-Strands. J Comput Biol 2018; 25:114-120. [PMID: 29313736 DOI: 10.1089/cmb.2017.0174] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Since the discovery of right-handed twist of a β-strand, many studies have been conducted to understand the twist. Given the atomic structure of a protein, twist angles have been defined using atomic positions of the backbone. However, limited study is available to characterize twist when the atomic positions are not available, but the central lines of β-strands are. Recent studies in cryoelectron microscopy show that it is possible to predict the central lines of β-strands from a medium-resolution density map. Accurate measurement of twist angles is important in identification of β-strands from such density maps. We propose an effective method to quantify twist angles from a set of splines. In a data set of 55 pairs of β-strands from 11 β-sheets of 11 proteins, the spline measurement shows comparable results as measured using the discrete method that uses atomic positions directly, particularly in capturing twist angle change along a pair, different levels of twist among different pairs, and the average of twist angles. The proposed method provides an alternative method to characterize twist using the central lines of a β-sheet.
Collapse
Affiliation(s)
- Tunazzina Islam
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| | - Michael Poteat
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| | - Jing He
- Department of Computer Science, Old Dominion University , Norfolk, Virginia
| |
Collapse
|
37
|
Ismer J, Rose AS, Tiemann JKS, Hildebrand PW. A fragment based method for modeling of protein segments into cryo-EM density maps. BMC Bioinformatics 2017; 18:475. [PMID: 29132296 PMCID: PMC5683378 DOI: 10.1186/s12859-017-1904-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2017] [Accepted: 11/01/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Single-particle analysis of electron cryo-microscopy (cryo-EM) is a key technology for elucidation of macromolecular structures. Recent technical advances in hardware and software developments significantly enhanced the resolution of cryo-EM density maps and broadened the applicability and the circle of users. To facilitate modeling of macromolecules into cryo-EM density maps, fast and easy to use methods for modeling are now demanded. RESULTS Here we investigated and benchmarked the suitability of a classical and well established fragment-based approach for modeling of segments into cryo-EM density maps (termed FragFit). FragFit uses a hierarchical strategy to select fragments from a pre-calculated set of billions of fragments derived from structures deposited in the Protein Data Bank, based on sequence similarly, fit of stem atoms and fit to a cryo-EM density map. The user only has to specify the sequence of the segment and the number of the N- and C-terminal stem-residues in the protein. Using a representative data set of protein structures, we show that protein segments can be accurately modeled into cryo-EM density maps of different resolution by FragFit. Prediction quality depends on segment length, the type of secondary structure of the segment and local quality of the map. CONCLUSION Fast and automated calculation of FragFit renders it applicable for implementation of interactive web-applications e.g. to model missing segments, flexible protein parts or hinge-regions into cryo-EM density maps.
Collapse
Affiliation(s)
- Jochen Ismer
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany
| | - Alexander S Rose
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany.,RCSB Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, CA, 92093-0743, USA
| | - Johanna K S Tiemann
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany.,Institute of Medical Physics and Biophysics, University Leipzig, Härtelstraße 16-18, 04107, Leipzig, Germany
| | - Peter W Hildebrand
- Institute of Medical Physics and Biophysics, University Medicine Berlin, Charitéplatz 1, 10117, Berlin, Germany. .,Institute of Medical Physics and Biophysics, University Leipzig, Härtelstraße 16-18, 04107, Leipzig, Germany.
| |
Collapse
|
38
|
Ng A, Si D. Beta-Barrel Detection for Medium Resolution Cryo-Electron Microscopy Density Maps Using Genetic Algorithms and Ray Tracing. J Comput Biol 2017; 25:326-336. [PMID: 29035579 DOI: 10.1089/cmb.2017.0155] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Cryo-electron microscopy (cryo-EM) is a technique that produces three-dimensional density maps of large protein complexes. This allows for the study of the structure of these proteins. Identifying the secondary structures within proteins is vital to understanding the overall structure and function of the protein. The [Formula: see text]-barrel is one such secondary structure, commonly found in lipocalins and membrane proteins. In this article, we present a novel approach that utilizes genetic algorithms, kd-trees, and ray tracing to automatically detect and extract [Formula: see text]-barrels from cryo-EM density maps. This approach was tested on simulated and experimental density maps with zero, one, or multiple barrels in the density map. The results suggest that the proposed approach is capable of performing automatic detection of [Formula: see text]-barrels from medium resolution cryo-EM density maps.
Collapse
Affiliation(s)
- Albert Ng
- 1 Division of Computing and Software Systems, University of Washington Bothell , Bothell, Washington
| | - Dong Si
- 1 Division of Computing and Software Systems, University of Washington Bothell , Bothell, Washington
| |
Collapse
|
39
|
Leelananda SP, Lindert S. Iterative Molecular Dynamics-Rosetta Membrane Protein Structure Refinement Guided by Cryo-EM Densities. J Chem Theory Comput 2017; 13:5131-5145. [PMID: 28949136 DOI: 10.1021/acs.jctc.7b00464] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Knowing atomistic details of proteins is essential not only for the understanding of protein function but also for the development of drugs. Experimental methods such as X-ray crystallography, NMR, and cryo-electron microscopy (cryo-EM) are the preferred forms of protein structure determination and have achieved great success over the most recent decades. Computational methods may be an alternative when experimental techniques fail. However, computational methods are severely limited when it comes to predicting larger macromolecule structures with little sequence similarity to known structures. The incorporation of experimental restraints in computational methods is becoming increasingly important to more reliably predict protein structure. One such experimental input used in structure prediction and refinement is cryo-EM densities. Recent advances in cryo-EM have arguably revolutionized the field of structural biology. Our previously developed cryo-EM-guided Rosetta-MD protocol has shown great promise in the refinement of soluble protein structures. In this study, we extended cryo-EM density-guided iterative Rosetta-MD to membrane proteins. We also improved the methodology in general by picking models based on a combination of their score and fit-to-density during the Rosetta model selection. By doing so, we have been able to pick models superior to those with the previous selection based on Rosetta score only and we have been able to further improve our previously refined models of soluble proteins. The method was tested with five membrane spanning protein structures. By applying density-guided Rosetta-MD iteratively we were able to refine the predicted structures of these membrane proteins to atomic resolutions. We also showed that the resolution of the density maps determines the improvement and quality of the refined models. By incorporating high-resolution density maps (∼4 Å), we were able to more significantly improve the quality of the models than when medium-resolution maps (6.9 Å) were used. Beginning from an average starting structure root mean square deviation (RMSD) to native of 4.66 Å, our protocol was able to refine the structures to bring the average refined structure RMSD to 1.66 Å when 4 Å density maps were used. The protocol also successfully refined the HIV-1 CTD guided by an experimental 5 Å density map.
Collapse
Affiliation(s)
- Sumudu P Leelananda
- Department of Chemistry and Biochemistry, Ohio State University , Columbus, Ohio 43210, United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University , Columbus, Ohio 43210, United States
| |
Collapse
|
40
|
Dou H, Burrows DW, Baker ML, Ju T. Flexible Fitting of Atomic Models into Cryo-EM Density Maps Guided by Helix Correspondences. Biophys J 2017. [PMID: 28636906 DOI: 10.1016/j.bpj.2017.04.054] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Although electron cryo-microscopy (cryo-EM) has recently achieved resolutions of better than 3 Å, at which point molecular modeling can be done directly from the density map, analysis and annotation of a cryo-EM density map still primarily rely on fitting atomic or homology models to the density map. In this article, we present, to our knowledge, a new method for flexible fitting of known or modeled protein structures into cryo-EM density maps. Unlike existing methods that are guided by local density gradients, our method is guided by correspondences between the α-helices in the density map and model, and does not require an initial rigid-body fitting step. Compared with current methods on both simulated and experimental density maps, our method not only achieves greater accuracy for proteins with large deformations but also runs as fast or faster than many of the other flexible fitting routines.
Collapse
Affiliation(s)
- Hang Dou
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri.
| | - Derek W Burrows
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas
| | - Tao Ju
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri
| |
Collapse
|
41
|
Islam T, Poteat M, He J. Analysis of β-strand Twist from the 3-dimensional Image of a Protein. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2017; 2017:650-654. [PMID: 35838360 PMCID: PMC9279011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Electron cryo-microscopy (Cryo-EM) technique produces density maps that are 3-dimensional (3D) images of molecules. It is challenging to derive atomic structures of proteins from 3D images of medium resolutions. Twist of a β-strand has been studied extensively while little of the known information has been directly obtained from the 3D image of a β-sheet. We describe a method to characterize the twist of β-strands from the 3D image of a protein. An analysis of 11 β-sheet images shows that the Averaged Minimum Twist (AMT) angle is larger for a close set than for a far set of β-traces.
Collapse
|
42
|
Biswas A, Ranjan D, Zubair M, Zeil S, Nasr KA, He J. An Effective Computational Method Incorporating Multiple Secondary Structure Predictions in Topology Determination for Cryo-EM Images. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:578-586. [PMID: 27008671 PMCID: PMC5071113 DOI: 10.1109/tcbb.2016.2543721] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
A key idea in de novo modeling of a medium-resolution density image obtained from cryo-electron microscopy is to compute the optimal mapping between the secondary structure traces observed in the density image and those predicted on the protein sequence. When secondary structures are not determined precisely, either from the image or from the amino acid sequence of the protein, the computational problem becomes more complex. We present an efficient method that addresses the secondary structure placement problem in presence of multiple secondary structure predictions and computes the optimal mapping. We tested the method using 12 simulated images from α-proteins and two Cryo-EM images of α-β proteins. We observed that the rank of the true topologies is consistently improved by using multiple secondary structure predictions instead of a single prediction. The results show that the algorithm is robust and works well even when errors/misses in the predicted secondary structures are present in the image or the sequence. The results also show that the algorithm is efficient and is able to handle proteins with as many as 33 helices.
Collapse
Affiliation(s)
- Abhishek Biswas
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Desh Ranjan
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Mohammad Zubair
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Stephanie Zeil
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| | - Kamal Al Nasr
- Dept. of Computer Science, Tennessee State University, Nashville, TN 37209
| | - Jing He
- Dept. of Computer Science, Old Dominion University, Norfolk, VA 23529
| |
Collapse
|
43
|
Li R, Si D, Zeng T, Ji S, He J. Deep Convolutional Neural Networks for Detecting Secondary Structures in Protein Density Maps from Cryo-Electron Microscopy. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2017; 2016:41-46. [PMID: 29770260 DOI: 10.1109/bibm.2016.7822490] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The detection of secondary structure of proteins using three dimensional (3D) cryo-electron microscopy (cryo-EM) images is still a challenging task when the spatial resolution of cryo-EM images is at medium level (5-10Å ). Prior researches focused on the usage of local features that may not capture the global information of image objects. In this study, we propose to use deep learning methods to extract high representative global features and then automatically detect secondary structures of proteins. In particular, we build a convolutional neural network (CNN) classifier that predicts the probability of label for every individual voxel in 3D cryo-EM image with respect to the secondary structure elements of proteins such as α-helix, β-sheet and background. To effectively incorporate the 3D spatial information in protein structures, we propose to perform 3D convolutions in the convolutional layers of CNNs. We show that the proposed CNN classifier can outperform existing SVM method on identifying the secondary structure elements of proteins from 3D cryo-EM medium resolution images.
Collapse
Affiliation(s)
- Rongjian Li
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529
| | - Dong Si
- Division of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011
| | - Tao Zeng
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA 99164
| | - Shuiwang Ji
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA 99164
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, Virginia 23529
| |
Collapse
|
44
|
Si D, He J. Modeling Beta-Traces for Beta-Barrels from Cryo-EM Density Maps. BIOMED RESEARCH INTERNATIONAL 2017; 2017:1793213. [PMID: 28164115 PMCID: PMC5259677 DOI: 10.1155/2017/1793213] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Accepted: 12/08/2016] [Indexed: 01/09/2023]
Abstract
Cryo-electron microscopy (cryo-EM) has produced density maps of various resolutions. Although α-helices can be detected from density maps at 5-8 Å resolutions, β-strands are challenging to detect at such density maps due to close-spacing of β-strands. The variety of shapes of β-sheets adds the complexity of β-strands detection from density maps. We propose a new approach to model traces of β-strands for β-barrel density regions that are extracted from cryo-EM density maps. In the test containing eight β-barrels extracted from experimental cryo-EM density maps at 5.5 Å-8.25 Å resolution, StrandRoller detected about 74.26% of the amino acids in the β-strands with an overall 2.05 Å 2-way distance between the detected β-traces and the observed ones, if the best of the fifteen detection cases is considered.
Collapse
Affiliation(s)
- Dong Si
- Division of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011, USA
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
45
|
Poteat M, He J. An Iterative Bézier Method for Fitting Beta-sheet Component of a Cryo-EM Density Map. MOLECULAR BASED MATHEMATICAL BIOLOGY 2017; 5:31-39. [PMID: 34350231 PMCID: PMC8329936 DOI: 10.1515/mlbmb-2017-0003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Cryo-electron microscopy (Cryo-EM) is a powerful technique to produce 3-dimensional density maps for large molecular complexes. Although many atomic structures have been solved from cryo-EM density maps, it is challenging to derive atomic structures when the resolution of density maps is not sufficiently high. Geometrical shape representation of secondary structural components in a medium-resolution density map enhances modeling of atomic structures. We compare two methods in producing surface representation of the β-sheet component of a density map. Given a 3-dimensional volume of β-sheet that is segmented from a density map, the performance of a polynomial fitting was compared with that of an iterative Bézier fitting. The results suggest that the iterative Bézier fitting is more suitable for β-sheets, since it provides more accurate representation of the corners that are naturally twisted in a β-sheet.
Collapse
Affiliation(s)
- Michael Poteat
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529
| |
Collapse
|
46
|
Zeil S, Kovacs J, Wriggers W, He J. Comparing an Atomic Model or Structure to a Corresponding Cryo-electron Microscopy Image at the Central Axis of a Helix. J Comput Biol 2017; 24:52-67. [PMID: 27936925 PMCID: PMC5220566 DOI: 10.1089/cmb.2016.0145] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
Abstract
Three-dimensional density maps of biological specimens from cryo-electron microscopy (cryo-EM) can be interpreted in the form of atomic models that are modeled into the density, or they can be compared to known atomic structures. When the central axis of a helix is detectable in a cryo-EM density map, it is possible to quantify the agreement between this central axis and a central axis calculated from the atomic model or structure. We propose a novel arc-length association method to compare the two axes reliably. This method was applied to 79 helices in simulated density maps and six case studies using cryo-EM maps at 6.4-7.7 Å resolution. The arc-length association method is then compared to three existing measures that evaluate the separation of two helical axes: a two-way distance between point sets, the length difference between two axes, and the individual amino acid detection accuracy. The results show that our proposed method sensitively distinguishes lateral and longitudinal discrepancies between the two axes, which makes the method particularly suitable for the systematic investigation of cryo-EM map-model pairs.
Collapse
Affiliation(s)
- Stephanie Zeil
- Department of Computer Science, Old Dominion University, Norfolk, Virginia
| | - Julio Kovacs
- Department of Mechanical and Aerospace Engineering and Institute of Biomedical Engineering, Old Dominion University, Norfolk, Virginia
| | - Willy Wriggers
- Department of Mechanical and Aerospace Engineering and Institute of Biomedical Engineering, Old Dominion University, Norfolk, Virginia
| | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk, Virginia
| |
Collapse
|
47
|
Haslam D, Zubair M, Ranjan D, Biswas A, He J. CHALLENGES IN MATCHING SECONDARY STRUCTURES IN CRYO-EM: AN EXPLORATION. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2016; 2016:1714-1719. [PMID: 29770261 PMCID: PMC5952047 DOI: 10.1109/bibm.2016.7822776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Cryo-electron microscopy is a fast emerging biophysical technique for structural determination of large protein complexes. While more atomic structures are being determined using this technique, it is still challenging to derive atomic structures from density maps produced at medium resolution when no suitable templates are available. A critical step in structure determination is how a protein chain threads through the 3-dimensional density map. A dynamic programming method was previously developed to generate K best matches of secondary structures between the density map and its protein sequence using shortest paths in a related weighted graph. We discuss challenges associated with the creation of the weighted graph and explore heuristic methods to solve the problem of matching secondary structures.
Collapse
Affiliation(s)
- Devin Haslam
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| | - Mohammad Zubair
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| | - Desh Ranjan
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| | | | - Jing He
- Department of Computer Science, Old Dominion University, Norfolk VA23529
| |
Collapse
|
48
|
Heterodimers as the Structural Unit of the T=1 Capsid of the Fungal Double-Stranded RNA Rosellinia necatrix Quadrivirus 1. J Virol 2016; 90:11220-11230. [PMID: 27707923 DOI: 10.1128/jvi.01013-16] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Accepted: 09/29/2016] [Indexed: 02/07/2023] Open
Abstract
Most double-stranded RNA (dsRNA) viruses are transcribed and replicated in a specialized icosahedral capsid with a T=1 lattice consisting of 60 asymmetric capsid protein (CP) dimers. These capsids help to organize the viral genome and replicative complex(es). They also act as molecular sieves that isolate the virus genome from host defense mechanisms and allow the passage of nucleotides and viral transcripts. Rosellinia necatrix quadrivirus 1 (RnQV1), the type species of the family Quadriviridae, is a dsRNA fungal virus with a multipartite genome consisting of four monocistronic segments (segments 1 to 4). dsRNA-2 and dsRNA-4 encode two CPs (P2 and P4, respectively), which coassemble into ∼450-Å-diameter capsids. We used three-dimensional cryo-electron microscopy combined with complementary biophysical techniques to determine the structures of RnQV1 virion strains W1075 and W1118. RnQV1 has a quadripartite genome, and the capsid is based on a single-shelled T=1 lattice built of P2-P4 dimers. Whereas the RnQV1-W1118 capsid is built of full-length CP, P2 and P4 of RnQV1-W1075 are cleaved into several polypeptides, maintaining the capsid structural organization. RnQV1 heterodimers have a quaternary organization similar to that of homodimers of reoviruses and other dsRNA mycoviruses. The RnQV1 capsid is the first T=1 capsid with a heterodimer as an asymmetric unit reported to date and follows the architectural principle for dsRNA viruses that a 120-subunit capsid is a conserved assembly that supports dsRNA replication and organization. IMPORTANCE Given their importance to health, members of the family Reoviridae are the basis of most structural and functional studies and provide much of our knowledge of dsRNA viruses. Analysis of bacterial, protozoal, and fungal dsRNA viruses has improved our understanding of their structure, function, and evolution, as well. Here, we studied a dsRNA virus that infects the fungus Rosellinia necatrix, an ascomycete that is pathogenic to a wide range of plants. Using three-dimensional cryo-electron microscopy and analytical ultracentrifugation analysis, we determined the structure and stoichiometry of Rosellinia necatrix quadrivirus 1 (RnQV1). The RnQV1 capsid is a T=1 capsid with 60 heterodimers as the asymmetric units. The large amount of genetic information used by RnQV1 to construct a simple T=1 capsid is probably related to the numerous virus-host and virus-virus interactions that it must face in its life cycle, which lacks an extracellular phase.
Collapse
|
49
|
Leone V, Faraldo-Gómez JD. Structure and mechanism of the ATP synthase membrane motor inferred from quantitative integrative modeling. J Gen Physiol 2016; 148:441-457. [PMID: 27821609 PMCID: PMC5129741 DOI: 10.1085/jgp.201611679] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Accepted: 10/13/2016] [Indexed: 01/31/2023] Open
Abstract
The ATP synthase is a molecular rotor that recycles ADP into ATP. Leone and Faraldo-Gómez use structural modeling to reinterpret and reconcile recent cryo-EM data for its membrane domain with other experimental evidence, gaining insights into its mechanism and the mode of inhibition by oligomycin. Two subunits within the transmembrane domain of the ATP synthase—the c-ring and subunit a—energize the production of 90% of cellular ATP by transducing an electrochemical gradient of H+ or Na+ into rotational motion. The nature of this turbine-like energy conversion mechanism has been elusive for decades, owing to the lack of definitive structural information on subunit a or its c-ring interface. In a recent breakthrough, several structures of this complex were resolved by cryo–electron microscopy (cryo-EM), but the modest resolution of the data has led to divergent interpretations. Moreover, the unexpected architecture of the complex has cast doubts on a wealth of earlier biochemical analyses conducted to probe this structure. Here, we use quantitative molecular-modeling methods to derive a structure of the a–c complex that is not only objectively consistent with the cryo-EM data, but also with correlated mutation analyses of both subunits and with prior cross-linking and cysteine accessibility measurements. This systematic, integrative approach reveals unambiguously the topology of subunit a and its relationship with the c-ring. Mapping of known Cd2+ block sites and conserved protonatable residues onto the structure delineates two noncontiguous pathways across the complex, connecting two adjacent proton-binding sites in the c-ring to the space on either side of the membrane. The location of these binding sites and of a strictly conserved arginine on subunit a, which serves to prevent protons from hopping between them, explains the directionality of the rotary mechanism and its strict coupling to the proton-motive force. Additionally, mapping of mutations conferring resistance to oligomycin unexpectedly reveals that this prototypical inhibitor may bind to two distinct sites at the a–c interface, explaining its ability to block the mechanism of the enzyme irrespective of the direction of rotation of the c-ring. In summary, this study is a stepping stone toward establishing the mechanism of the ATP synthase at the atomic level.
Collapse
Affiliation(s)
- Vanessa Leone
- Theoretical Molecular Biophysics Section, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892
| | - José D Faraldo-Gómez
- Theoretical Molecular Biophysics Section, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892
| |
Collapse
|
50
|
DiMaio F, Chiu W. Tools for Model Building and Optimization into Near-Atomic Resolution Electron Cryo-Microscopy Density Maps. Methods Enzymol 2016; 579:255-76. [PMID: 27572730 PMCID: PMC5103630 DOI: 10.1016/bs.mie.2016.06.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Electron cryo-microscopy (cryoEM) has advanced dramatically to become a viable tool for high-resolution structural biology research. The ultimate outcome of a cryoEM study is an atomic model of a macromolecule or its complex with interacting partners. This chapter describes a variety of algorithms and software to build a de novo model based on the cryoEM 3D density map, to optimize the model with the best stereochemistry restraints and finally to validate the model with proper protocols. The full process of atomic structure determination from a cryoEM map is described. The tools outlined in this chapter should prove extremely valuable in revealing atomic interactions guided by cryoEM data.
Collapse
Affiliation(s)
- F DiMaio
- University of Washington, Seattle, WA, United States; Institute for Protein Design, University of Washington, Seattle, WA, United States.
| | - W Chiu
- National Center for Macromolecular Imaging, Baylor College of Medicine, Houston, TX, United States.
| |
Collapse
|