1
|
Corum MR, Venkannagari H, Hryc CF, Baker ML. Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure. Biophys J 2024; 123:435-450. [PMID: 38268190 PMCID: PMC10912932 DOI: 10.1016/j.bpj.2024.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Over the last 15 years, structural biology has seen unprecedented development and improvement in two areas: electron cryo-microscopy (cryo-EM) and predictive modeling. Once relegated to low resolutions, single-particle cryo-EM is now capable of achieving near-atomic resolutions of a wide variety of macromolecular complexes. Ushered in by AlphaFold, machine learning has powered the current generation of predictive modeling tools, which can accurately and reliably predict models for proteins and some complexes directly from the sequence alone. Although they offer new opportunities individually, there is an inherent synergy between these techniques, allowing for the construction of large, complex macromolecular models. Here, we give a brief overview of these approaches in addition to illustrating works that combine these techniques for model building. These examples provide insight into model building, assessment, and limitations when integrating predictive modeling with cryo-EM density maps. Together, these approaches offer the potential to greatly accelerate the generation of macromolecular structural insights, particularly when coupled with experimental data.
Collapse
Affiliation(s)
- Michael R Corum
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Harikanth Venkannagari
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas.
| |
Collapse
|
2
|
Kelly DF, Jonaid GM, Kaylor L, Solares MJ, Berry S, DiCecco LA, Dearnaley W, Casasanta M. Delineating Conformational Variability in Small Protein Structures Using Combinatorial Refinement Strategies. MICROMACHINES 2023; 14:1869. [PMID: 37893306 PMCID: PMC10609307 DOI: 10.3390/mi14101869] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 09/22/2023] [Accepted: 09/27/2023] [Indexed: 10/29/2023]
Abstract
As small protein assemblies and even small proteins are becoming more amenable to cryo-Electron Microscopy (EM) structural studies, it is important to consider the complementary dynamic information present in the data. Current computational strategies are limited in their ability to resolve minute differences among low molecular weight entities. Here, we demonstrate a new combinatorial approach to delineate flexible conformations among small proteins using real-space refinement applications. We performed a meta-analysis of structural data for the SARS CoV-2 Nucleocapsid (N) protein using a combination of rigid-body refinement and simulated annealing methods. For the N protein monomer, we determined three new flexible conformers with good stereochemistry and quantitative comparisons provided new evidence of their dynamic properties. A similar analysis performed for the N protein dimer showed only minor structural differences among the flexible models. These results suggested a more stable view of the N protein dimer than the monomer structure. Taken together, the new computational strategies can delineate conformational changes in low molecular weight proteins that may go unnoticed by conventional assessments. The results also suggest that small proteins may be further stabilized for structural studies through the use of solution components that limit the movement of external flexible regions.
Collapse
Affiliation(s)
- Deborah F. Kelly
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
| | - G M Jonaid
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
- Bioinformatics and Genomics Graduate Program, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Liam Kaylor
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
- Molecular, Cellular, and Integrative Biosciences Graduate Program, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Maria J. Solares
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
- Molecular, Cellular, and Integrative Biosciences Graduate Program, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Samantha Berry
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
| | - Liza-Anastasia DiCecco
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
| | - William Dearnaley
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
| | - Michael Casasanta
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Center for Structural Oncology, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
3
|
Uddin MR, Mahbub S, Rahman MS, Bayzid MS. SAINT: self-attention augmented inception-inside-inception network improves protein secondary structure prediction. Bioinformatics 2021; 36:4599-4608. [PMID: 32437517 DOI: 10.1093/bioinformatics/btaa531] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2019] [Revised: 05/10/2020] [Accepted: 05/16/2020] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Protein structures provide basic insight into how they can interact with other proteins, their functions and biological roles in an organism. Experimental methods (e.g. X-ray crystallography and nuclear magnetic resonance spectroscopy) for predicting the secondary structure (SS) of proteins are very expensive and time consuming. Therefore, developing efficient computational approaches for predicting the SS of protein is of utmost importance. Advances in developing highly accurate SS prediction methods have mostly been focused on 3-class (Q3) structure prediction. However, 8-class (Q8) resolution of SS contains more useful information and is much more challenging than the Q3 prediction. RESULTS We present SAINT, a highly accurate method for Q8 structure prediction, which incorporates self-attention mechanism (a concept from natural language processing) with the Deep Inception-Inside-Inception network in order to effectively capture both the short- and long-range interactions among the amino acid residues. SAINT offers a more interpretable framework than the typical black-box deep neural network methods. Through an extensive evaluation study, we report the performance of SAINT in comparison with the existing best methods on a collection of benchmark datasets, namely, TEST2016, TEST2018, CASP12 and CASP13. Our results suggest that self-attention mechanism improves the prediction accuracy and outperforms the existing best alternate methods. SAINT is the first of its kind and offers the best known Q8 accuracy. Thus, we believe SAINT represents a major step toward the accurate and reliable prediction of SSs of proteins. AVAILABILITY AND IMPLEMENTATION SAINT is freely available as an open-source project at https://github.com/SAINTProtein/SAINT.
Collapse
Affiliation(s)
- Mostofa Rafid Uddin
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.,Department of Computer Science and Engineering, East West University, Dhaka 1212, Bangladesh
| | - Sazan Mahbub
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh
| | - M Saifur Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh
| |
Collapse
|
4
|
Dhingra S, Sowdhamini R, Cadet F, Offmann B. A glance into the evolution of template-free protein structure prediction methodologies. Biochimie 2020; 175:85-92. [DOI: 10.1016/j.biochi.2020.04.026] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/24/2020] [Accepted: 04/27/2020] [Indexed: 11/26/2022]
|
5
|
Dodd T, Yan C, Ivanov I. Simulation-Based Methods for Model Building and Refinement in Cryoelectron Microscopy. J Chem Inf Model 2020; 60:2470-2483. [PMID: 32202798 DOI: 10.1021/acs.jcim.0c00087] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Advances in cryoelectron microscopy (cryo-EM) have revolutionized the structural investigation of large macromolecular assemblies. In this review, we first provide a broad overview of modeling methods used for flexible fitting of molecular models into cryo-EM density maps. We give special attention to approaches rooted in molecular simulations-atomistic molecular dynamics and Monte Carlo. Concise descriptions of the methods are given along with discussion of their advantages, limitations, and most popular alternatives. We also describe recent extensions of the widely used molecular dynamics flexible fitting (MDFF) method and discuss how different model-building techniques could be incorporated into new hybrid modeling schemes and simulation workflows. Finally, we provide two illustrative examples of model-building and refinement strategies employing MDFF, cascade MDFF, and RosettaCM. These examples come from recent cryo-EM studies that elucidated transcription preinitiation complexes and shed light on the functional roles of these assemblies in gene expression and gene regulation.
Collapse
Affiliation(s)
- Thomas Dodd
- Department of Chemistry, Georgia State University, Atlanta, Georgia 30302, United States.,Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, Georgia 30302, United States
| | - Chunli Yan
- Department of Chemistry, Georgia State University, Atlanta, Georgia 30302, United States.,Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, Georgia 30302, United States
| | - Ivaylo Ivanov
- Department of Chemistry, Georgia State University, Atlanta, Georgia 30302, United States.,Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, Georgia 30302, United States
| |
Collapse
|
6
|
Alnabati E, Kihara D. Advances in Structure Modeling Methods for Cryo-Electron Microscopy Maps. Molecules 2019; 25:molecules25010082. [PMID: 31878333 PMCID: PMC6982917 DOI: 10.3390/molecules25010082] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 12/20/2019] [Accepted: 12/20/2019] [Indexed: 01/16/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) has now become a widely used technique for structure determination of macromolecular complexes. For modeling molecular structures from density maps of different resolutions, many algorithms have been developed. These algorithms can be categorized into rigid fitting, flexible fitting, and de novo modeling methods. It is also observed that machine learning (ML) techniques have been increasingly applied following the rapid progress of the ML field. Here, we review these different categories of macromolecule structure modeling methods and discuss their advances over time.
Collapse
Affiliation(s)
- Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Correspondence:
| |
Collapse
|
7
|
Pintilie G, Chen DH, Tran BN, Jakana J, Wu J, Hew CL, Chiu W. Segmentation and Comparative Modeling in an 8.6-Å Cryo-EM Map of the Singapore Grouper Iridovirus. Structure 2019; 27:1561-1569.e4. [PMID: 31447288 PMCID: PMC6853598 DOI: 10.1016/j.str.2019.08.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2016] [Revised: 03/19/2019] [Accepted: 08/02/2019] [Indexed: 10/26/2022]
Abstract
SGIV, or Singapore grouper iridovirus, is a large double-stranded DNA virus, reaching a diameter of 220 nm and packaging a genome of 140 kb. We present a 3D cryoelectron microscopy (cryo-EM) icosahedral reconstruction of SGIV determined at 8.6-Å resolution. It reveals several layers including a T = 247 icosahedral outer coat, anchor proteins, a lipid bilayer, and the encapsidated DNA. A new segmentation tool, iSeg, was applied to extract these layers from the reconstructed map. The outer coat was further segmented into major and minor capsid proteins. None of the proteins extracted by segmentation have known atomic structures. We generated models for the major coat protein using three comparative modeling tools, and evaluated each model using the cryo-EM map. Our analysis reveals a new architecture in the Iridoviridae family of viruses. It shares similarities with others in the same family, e.g., Chilo iridescent virus, but also shows new features of the major and minor capsid proteins.
Collapse
Affiliation(s)
- Grigore Pintilie
- Department of Bioengineering, and of Microbiology and Immunology, Stanford University, Stanford, CA 94305, USA.
| | - Dong-Hua Chen
- Department of Structural Biology, Stanford University, Stanford, CA 94305, USA
| | - Bich Ngoc Tran
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117558, Singapore
| | - Joanita Jakana
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jinlu Wu
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117558, Singapore
| | - Choy Leong Hew
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117558, Singapore
| | - Wah Chiu
- Department of Bioengineering, and of Microbiology and Immunology, Stanford University, Stanford, CA 94305, USA; Division of CryoEM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, CA 94025, USA
| |
Collapse
|
8
|
Experimental accuracy in protein structure refinement via molecular dynamics simulations. Proc Natl Acad Sci U S A 2018; 115:13276-13281. [PMID: 30530696 DOI: 10.1073/pnas.1811364115] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Refinement is the last step in protein structure prediction pipelines to convert approximate homology models to experimental accuracy. Protocols based on molecular dynamics (MD) simulations have shown promise, but current methods are limited to moderate levels of consistent refinement. To explore the energy landscape between homology models and native structures and analyze the challenges of MD-based refinement, eight test cases were studied via extensive simulations followed by Markov state modeling. In all cases, native states were found very close to the experimental structures and at the lowest free energies, but refinement was hindered by a rough energy landscape. Transitions from the homology model to the native states require the crossing of significant kinetic barriers on at least microsecond time scales. A significant energetic driving force toward the native state was lacking until its immediate vicinity, and there was significant sampling of off-pathway states competing for productive refinement. The role of recent force field improvements is discussed and transition paths are analyzed in detail to inform which key transitions have to be overcome to achieve successful refinement.
Collapse
|
9
|
Chen M, Baker ML. Automation and assessment of de novo modeling with Pathwalking in near atomic resolution cryoEM density maps. J Struct Biol 2018; 204:555-563. [DOI: 10.1016/j.jsb.2018.09.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 08/28/2018] [Accepted: 09/08/2018] [Indexed: 01/30/2023]
|
10
|
Kumar KK, Devi BU, Neeraja P. Molecular activities and ligand-binding specificities of StAR-related lipid transfer domains: exploring integrated in silico methods and ensemble-docking approaches. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2018; 29:483-501. [PMID: 29688061 DOI: 10.1080/1062936x.2018.1462847] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Accepted: 04/05/2018] [Indexed: 06/08/2023]
Abstract
In this study, cholesterol biotransformation gene-set of human steroidogenic acute regulatory protein-related lipid transfer (START) domains were evaluated from high-throughput gene screening approaches. It was shown that STARD1, STARD3 and STARD4 proteins are better effective transporters of cholesterol than STARD5 and STARD6 domains. Docking studies show a strong agreement with gene ontology enrichment data. According to both complementary strategies, it was found that only STARD1, STARD3 and STARD4 are potentially involved in cholesterol biotransformation in mitochondria through Ω1-loop of C-terminal α4-helical domain. Ensemble docking assessment for a set of selected chemicals of protein-chemical networks has shown possible binding probabilities with START domains. Among those, reproductive toxicity evoked drugs (mifepristone), insecticides (rotenone), tobacco pulmonary carcinogens (benzo(a)pyrene) and endocrine disruptor chemicals (EDCs) including perfluorooctanesulfonic acid (PFOS) and aflatoxin B1 (AFB1) potentially bound with novel hotspot residues of the α4-helical domain. Compound representation space and clustering approaches reveal that the START proteins show more sensitivity with these lead scaffolds, so they could provide probable barrier assets in cholesterol and steroidogenic acute regulatory (StAR) binding and leads adverse consequences in steroidogenesis. These findings indicate potential START domains and their binding levels with toxic chemicals; sorted viewpoints could be useful as a promising way to identify chemicals with related steroidogenisis impacts on human health.
Collapse
Affiliation(s)
- K Kranthi Kumar
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| | - B Uma Devi
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| | - P Neeraja
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| |
Collapse
|
11
|
Al Nasr K, Yousef F, Jebril R, Jones C. Analytical Approaches to Improve Accuracy in Solving the Protein Topology Problem. Molecules 2018; 23:E28. [PMID: 29360779 PMCID: PMC6017786 DOI: 10.3390/molecules23020028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 01/19/2018] [Accepted: 01/19/2018] [Indexed: 11/17/2022] Open
Abstract
To take advantage of recent advances in genomics and proteomics it is critical that the three-dimensional physical structure of biological macromolecules be determined. Cryo-Electron Microscopy (cryo-EM) is a promising and improving method for obtaining this data, however resolution is often not sufficient to directly determine the atomic scale structure. Despite this, information for secondary structure locations is detectable. De novo modeling is a computational approach to modeling these macromolecular structures based on cryo-EM derived data. During de novo modeling a mapping between detected secondary structures and the underlying amino acid sequence must be identified. DP-TOSS (Dynamic Programming for determining the Topology Of Secondary Structures) is one tool that attempts to automate the creation of this mapping. By treating the correspondence between the detected structures and the structures predicted from sequence data as a constraint graph problem DP-TOSS achieved good accuracy in its original iteration. In this paper, we propose modifications to the scoring methodology of DP-TOSS to improve its accuracy. Three scoring schemes were applied to DP-TOSS and tested: (i) a skeleton-based scoring function; (ii) a geometry-based analytical function; and (iii) a multi-well potential energy-based function. A test of 25 proteins shows that a combination of these schemes can improve the performance of DP-TOSS to solve the topology determination problem for macromolecule proteins.
Collapse
Affiliation(s)
- Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Feras Yousef
- Department of Mathematics, The University of Jordan, Amman 11942, Jordan.
| | - Ruba Jebril
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| | - Christopher Jones
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA.
| |
Collapse
|
12
|
Al Nasr K, Jones C, Yousef F, Jebril R. PEM-fitter: A Coarse-Grained Method to Validate Protein Candidate Models. J Comput Biol 2017; 25:21-32. [PMID: 29140718 DOI: 10.1089/cmb.2017.0191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The volumetric images produced by Cryo-Electron Microscopy (cryo-EM) technique are used to model macromolecular assemblies and machines. De novo protein modeling uses these images to computationally model the structure of the molecules. Many candidate conformations are usually generated during the intermediate step. Conventionally, each of these candidates is evaluated by time-consuming approaches such as potential energy. We introduce an initial version of a geometrical screening method that uses the skeleton of the cryo-EM images to evaluate candidate structures. The aim of this method is to reduce the number of native-like candidate conformations and, therefore, reduce the time required for structural evaluation by energy calculations. A test of two datasets was performed. The first dataset contains 10 proteins and shows that our method can successfully detect the correct native structure for the given skeleton among a set of different protein structures. The second dataset contains 12 proteins and shows that our method can filter slightly modified decoy conformations of the same protein. The efficiency of the method is also reported.
Collapse
Affiliation(s)
- Kamal Al Nasr
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| | - Christopher Jones
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| | - Feras Yousef
- 2 Department of Mathematics, The University of Jordan , Amman, Jordan
| | - Ruba Jebril
- 1 Department of Computer Science, Tennessee State University , Nashville, Tennessee
| |
Collapse
|
13
|
Feig M. Computational protein structure refinement: Almost there, yet still so far to go. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL MOLECULAR SCIENCE 2017; 7:e1307. [PMID: 30613211 PMCID: PMC6319934 DOI: 10.1002/wcms.1307] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Protein structures are essential in modern biology yet experimental methods are far from being able to catch up with the rapid increase in available genomic data. Computational protein structure prediction methods aim to fill the gap while the role of protein structure refinement is to take approximate initial template-based models and bring them closer to the true native structure. Current methods for computational structure refinement rely on molecular dynamics simulations, related sampling methods, or iterative structure optimization protocols. The best methods are able to achieve moderate degrees of refinement but consistent refinement that can reach near-experimental accuracy remains elusive. Key issues revolve around the accuracy of the energy function, the inability to reliably rank multiple models, and the use of restraints that keep sampling close to the native state but also limit the degree of possible refinement. A different aspect is the question of what exactly the target of high-resolution refinement should be as experimental structures are affected by experimental conditions and different biological questions require varying levels of accuracy. While improvement of the global protein structure is a difficult problem, high-resolution refinement methods that improves local structural quality such as favorable stereochemistry and the avoidance of atomic clashes are much more successful.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, 603 Wilson Rd., Room 218 BCH, East Lansing, MI, USA, ; 517-432-7439
| |
Collapse
|
14
|
DiMaio F, Chiu W. Tools for Model Building and Optimization into Near-Atomic Resolution Electron Cryo-Microscopy Density Maps. Methods Enzymol 2016; 579:255-76. [PMID: 27572730 PMCID: PMC5103630 DOI: 10.1016/bs.mie.2016.06.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Electron cryo-microscopy (cryoEM) has advanced dramatically to become a viable tool for high-resolution structural biology research. The ultimate outcome of a cryoEM study is an atomic model of a macromolecule or its complex with interacting partners. This chapter describes a variety of algorithms and software to build a de novo model based on the cryoEM 3D density map, to optimize the model with the best stereochemistry restraints and finally to validate the model with proper protocols. The full process of atomic structure determination from a cryoEM map is described. The tools outlined in this chapter should prove extremely valuable in revealing atomic interactions guided by cryoEM data.
Collapse
Affiliation(s)
- F DiMaio
- University of Washington, Seattle, WA, United States; Institute for Protein Design, University of Washington, Seattle, WA, United States.
| | - W Chiu
- National Center for Macromolecular Imaging, Baylor College of Medicine, Houston, TX, United States.
| |
Collapse
|
15
|
Chen M, Baldwin PR, Ludtke SJ, Baker ML. De Novo modeling in cryo-EM density maps with Pathwalking. J Struct Biol 2016; 196:289-298. [PMID: 27436409 DOI: 10.1016/j.jsb.2016.06.004] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 06/06/2016] [Accepted: 06/07/2016] [Indexed: 11/26/2022]
Abstract
As electron cryo-microscopy (cryo-EM) can now frequently achieve near atomic resolution, accurate interpretation of these density maps in terms of atomistic detail has become paramount in deciphering macromolecular structure and function. However, there are few software tools for modeling protein structure from cryo-EM density maps in this resolution range. Here, we present an extension of our original Pathwalking protocol, which can automatically trace a protein backbone directly from a near-atomic resolution (3-6Å) density map. The original Pathwalking approach utilized a Traveling Salesman Problem solver for backbone tracing, but manual adjustment was still required during modeling. In the new version, human intervention is minimized and we provide a more robust approach for backbone modeling. This includes iterative secondary structure identification, termini detection and the ability to model multiple subunits without prior segmentation. Overall, the new Pathwalking procedure provides a more complete and robust tool for annotating protein structure function in near-atomic resolution density maps.
Collapse
Affiliation(s)
- Muyuan Chen
- Program in Structural and Computational Biology and Molecular Biophysics, United States; Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States
| | - Philip R Baldwin
- Department of Psychology, United States; Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States
| | - Steven J Ludtke
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States
| | - Matthew L Baker
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, United States.
| |
Collapse
|
16
|
Singharoy A, Teo I, McGreevy R, Stone JE, Zhao J, Schulten K. Molecular dynamics-based refinement and validation for sub-5 Å cryo-electron microscopy maps. eLife 2016; 5. [PMID: 27383269 PMCID: PMC4990421 DOI: 10.7554/elife.16105] [Citation(s) in RCA: 115] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 07/06/2016] [Indexed: 12/12/2022] Open
Abstract
Two structure determination methods, based on the molecular dynamics flexible fitting (MDFF) paradigm, are presented that resolve sub-5 Å cryo-electron microscopy (EM) maps with either single structures or ensembles of such structures. The methods, denoted cascade MDFF and resolution exchange MDFF, sequentially re-refine a search model against a series of maps of progressively higher resolutions, which ends with the original experimental resolution. Application of sequential re-refinement enables MDFF to achieve a radius of convergence of ~25 Å demonstrated with the accurate modeling of β-galactosidase and TRPV1 proteins at 3.2 Å and 3.4 Å resolution, respectively. The MDFF refinements uniquely offer map-model validation and B-factor determination criteria based on the inherent dynamics of the macromolecules studied, captured by means of local root mean square fluctuations. The MDFF tools described are available to researchers through an easy-to-use and cost-effective cloud computing resource on Amazon Web Services. DOI:http://dx.doi.org/10.7554/eLife.16105.001 To understand the roles that proteins and other large molecules play inside cells, it is important to determine their structures. One of the techniques that researchers can use to do this is called cryo-electron microscopy (cryo-EM), which rapidly freezes molecules to fix them in position before imaging them in fine detail. The cryo-EM images are like maps that show the approximate position of atoms. These images must then be processed in order to build a three-dimensional model of the protein that shows how its atoms are arranged relative to each other. One computational approach called Molecular Dynamics Flexible Fitting (MDFF) works by flexibly fitting possible atomic structures into cryo-EM maps. Although this approach works well with relatively undetailed (or ‘low resolution’) cryo-EM images, it struggles to handle the high-resolution cryo-EM maps now being generated. Singharoy, Teo, McGreevy et al. have now developed two MDFF methods – called cascade MDFF and resolution exchange MDFF – that help to resolve atomic models of biological molecules from cryo-EM images. Each method can refine poorly guessed models into ones that are consistent with the high-resolution experimental images. The refinement is achieved by interpreting a range of images that starts with a ‘fuzzy’ image. The contrast of the image is then progressively improved until an image is produced that has a resolution that is good enough to almost distinguish individual atoms. The method works because each cryo-EM image shows not just one, but a collection of atomic structures that the molecule can take on, with the fuzzier parts of the image representing the more flexible parts of the molecule. By taking into account this flexibility, the large-scale features of the protein structure can be determined first from the fuzzier images, and increasing the contrast of the images allows smaller-scale refinements to be made to the structure. The MDFF tools have been designed to be easy to use and are available to researchers at low cost through cloud computing platforms. They can now be used to unravel the structure of many different proteins and protein complexes including those involved in photosynthesis, respiration and protein synthesis. DOI:http://dx.doi.org/10.7554/eLife.16105.002
Collapse
Affiliation(s)
- Abhishek Singharoy
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, United States
| | - Ivan Teo
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, United States.,Department of Physics, University of Illinois at Urbana-Champaign, Urbana, United States
| | - Ryan McGreevy
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, United States
| | - John E Stone
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, United States
| | - Jianhua Zhao
- Department of Biochemistry and Biophysics, University of California San Francisco School of Medicine, San Francisco, United States
| | - Klaus Schulten
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, United States.,Department of Physics, University of Illinois at Urbana-Champaign, Urbana, United States
| |
Collapse
|
17
|
Constrained cyclic coordinate descent for cryo-EM images at medium resolutions: beyond the protein loop closure problem. ROBOTICA 2016; 34:1777-1790. [DOI: 10.1017/s0263574716000242] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
SUMMARYThe cyclic coordinate descent (CCD) method is a popular loop closure method in protein structure modeling. It is a robotics algorithm originally developed for inverse kinematic applications. We demonstrate an effective method of building the backbone of protein structure models using the principle of CCD and a guiding trace. For medium-resolution 3-dimensional (3D) images derived using cryo-electron microscopy (cryo-EM), it is possible to obtain guiding traces of secondary structures and their skeleton connections. Our new method, constrained cyclic coordinate descent (CCCD), builds α-helices, β-strands, and loops quickly and fairly accurately along predefined traces. We show that it is possible to build the entire backbone of a protein fairly accurately when the guiding traces are accurate. In a test of 10 proteins, the models constructed using CCCD show an average of 3.91 Å of backbone root mean square deviation (RMSD). When the CCCD method is incorporated in a simulated annealing framework to sample possible shift, translation, and rotation freedom, the models built with the true topology were ranked high on the list, with an average backbone RMSD100 of 3.76 Å. CCCD is an effective method for modeling atomic structures after secondary structure traces and skeletons are extracted from 3D cryo-EM images.
Collapse
|
18
|
Greenberg CH, Kollman J, Zelter A, Johnson R, MacCoss MJ, Davis TN, Agard DA, Sali A. Structure of γ-tubulin small complex based on a cryo-EM map, chemical cross-links, and a remotely related structure. J Struct Biol 2016; 194:303-10. [PMID: 26968363 DOI: 10.1016/j.jsb.2016.03.006] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2016] [Revised: 03/06/2016] [Accepted: 03/07/2016] [Indexed: 11/26/2022]
Abstract
Modeling protein complex structures based on distantly related homologues can be challenging due to poor sequence and structure conservation. Therefore, utilizing even low-resolution experimental data can significantly increase model precision and accuracy. Here, we present models of the two key functional states of the yeast γ-tubulin small complex (γTuSC): one for the low-activity "open" state and another for the higher-activity "closed" state. Both models were computed based on remotely related template structures and cryo-EM density maps at 6.9Å and 8.0Å resolution, respectively. For each state, extensive sampling of alignments and conformations was guided by the fit to the corresponding cryo-EM density map. The resulting good-scoring models formed a tightly clustered ensemble of conformations in most regions. We found significant structural differences between the two states, primarily in the γ-tubulin subunit regions where the microtubule binds. We also report a set of chemical cross-links that were found to be consistent with equilibrium between the open and closed states. The protocols developed here have been incorporated into our open-source Integrative Modeling Platform (IMP) software package (http://integrativemodeling.org), and can therefore be applied to many other systems.
Collapse
Affiliation(s)
- Charles H Greenberg
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA, USA.
| | - Justin Kollman
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Alex Zelter
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Richard Johnson
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Trisha N Davis
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| | - David A Agard
- Department of Biochemistry and Biophysics, University of California at San Francisco, San Francisco, CA, USA; Howard Hughes Medical Institute, University of California at San Francisco, San Francisco, CA, USA.
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA, USA.
| |
Collapse
|
19
|
Segura J, Sanchez-Garcia R, Tabas-Madrid D, Cuenca-Alba J, Sorzano COS, Carazo JM. 3DIANA: 3D Domain Interaction Analysis: A Toolbox for Quaternary Structure Modeling. Biophys J 2016; 110:766-75. [PMID: 26772592 PMCID: PMC4775853 DOI: 10.1016/j.bpj.2015.11.3519] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Revised: 11/27/2015] [Accepted: 11/30/2015] [Indexed: 11/19/2022] Open
Abstract
Electron microscopy (EM) is experiencing a revolution with the advent of a new generation of Direct Electron Detectors, enabling a broad range of large and flexible structures to be resolved well below 1 nm resolution. Although EM techniques are evolving to the point of directly obtaining structural data at near-atomic resolution, for many molecules the attainable resolution might not be enough to propose high-resolution structural models. However, accessing information on atomic coordinates is a necessary step toward a deeper understanding of the molecular mechanisms that allow proteins to perform specific tasks. For that reason, methods for the integration of EM three-dimensional maps with x-ray and NMR structural data are being developed, a modeling task that is normally referred to as fitting, resulting in the so called hybrid models. In this work, we present a novel application—3DIANA—specially targeted to those cases in which the EM map resolution is medium or low and additional experimental structural information is scarce or even lacking. In this way, 3DIANA statistically evaluates proposed/potential contacts between protein domains, presents a complete catalog of both structurally resolved and predicted interacting regions involving these domains and, finally, suggests structural templates to model the interaction between them. The evaluation of the proposed interactions is computed with DIMERO, a new method that scores physical binding sites based on the topology of protein interaction networks, which has recently shown the capability to increase by 200% the number of domain-domain interactions predicted in interactomes as compared to previous approaches. The new application displays the information at a sequence and structural level and is accessible through a web browser or as a Chimera plugin at http://3diana.cnb.csic.es.
Collapse
Affiliation(s)
- Joan Segura
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain.
| | - Ruben Sanchez-Garcia
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Daniel Tabas-Madrid
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Jesus Cuenca-Alba
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Carlos Oscar S Sorzano
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Jose Maria Carazo
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| |
Collapse
|
20
|
DiMaio F, Song Y, Li X, Brunner MJ, Xu C, Conticello V, Egelman E, Marlovits T, Cheng Y, Baker D. Atomic-accuracy models from 4.5-Å cryo-electron microscopy data with density-guided iterative local refinement. Nat Methods 2015; 12:361-365. [PMID: 25707030 PMCID: PMC4382417 DOI: 10.1038/nmeth.3286] [Citation(s) in RCA: 250] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2014] [Accepted: 12/11/2014] [Indexed: 12/12/2022]
Abstract
We describe a general approach for refining protein structure models on the basis of cryo-electron microscopy maps with near-atomic resolution. The method integrates Monte Carlo sampling with local density-guided optimization, Rosetta all-atom refinement and real-space B-factor fitting. In tests on experimental maps of three different systems with 4.5-Å resolution or better, the method consistently produced models with atomic-level accuracy largely independently of starting-model quality, and it outperformed the molecular dynamics-based MDFF method. Cross-validated model quality statistics correlated with model accuracy over the three test systems.
Collapse
Affiliation(s)
- Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Yifan Song
- Department of Biochemistry, University of Washington, Seattle, WA, USA.,Cyrus Biotechnology, Inc., Seattle, WA, USA
| | - Xueming Li
- Keck Advanced Microscopy Laboratory, Department of Biochemistry and Biophysics, University of California, San Francisco, California, USA
| | - Matthias J Brunner
- Center for Structural Systems Biology (CSSB) University Medical Center Eppendorf-Hamburg (UKE), Hamburg, Germany.,Deutsches Elektronen-Synchrotron (DESY), Hamburg, Germany.,Institute of Molecular Biotechnology GmbH (IMBA), Austrian Academy of Sciences, Vienna, Austria.,Research Institute of Molecular Pathology (IMP), Vienna, Austria
| | - Chunfu Xu
- Department of Chemistry, Emory University, Atlanta, GA 30322
| | | | - Edward Egelman
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
| | - Thomas Marlovits
- Center for Structural Systems Biology (CSSB) University Medical Center Eppendorf-Hamburg (UKE), Hamburg, Germany.,Deutsches Elektronen-Synchrotron (DESY), Hamburg, Germany.,Institute of Molecular Biotechnology GmbH (IMBA), Austrian Academy of Sciences, Vienna, Austria.,Research Institute of Molecular Pathology (IMP), Vienna, Austria
| | - Yifan Cheng
- Keck Advanced Microscopy Laboratory, Department of Biochemistry and Biophysics, University of California, San Francisco, California, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA
| |
Collapse
|
21
|
Si D, He J. Tracing Beta Strands Using StrandTwister from Cryo-EM Density Maps at Medium Resolutions. Structure 2014; 22:1665-76. [DOI: 10.1016/j.str.2014.08.017] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 08/07/2014] [Accepted: 08/08/2014] [Indexed: 10/24/2022]
|
22
|
López-Blanco JR, Chacón P. Structural modeling from electron microscopy data. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014. [DOI: 10.1002/wcms.1199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| | - Pablo Chacón
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| |
Collapse
|
23
|
Yaseen A, Li Y. Template-based C8-SCORPION: a protein 8-state secondary structure prediction method using structural information and context-based features. BMC Bioinformatics 2014; 15 Suppl 8:S3. [PMID: 25080939 PMCID: PMC4120151 DOI: 10.1186/1471-2105-15-s8-s3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Secondary structures prediction of proteins is important to many protein structure modeling applications. Correct prediction of secondary structures can significantly reduce the degrees of freedom in protein tertiary structure modeling and therefore reduces the difficulty of obtaining high resolution 3D models. Methods In this work, we investigate a template-based approach to enhance 8-state secondary structure prediction accuracy. We construct structural templates from known protein structures with certain sequence similarity. The structural templates are then incorporated as features with sequence and evolutionary information to train two-stage neural networks. In case of structural templates absence, heuristic structural information is incorporated instead. Results After applying the template-based 8-state secondary structure prediction method, the 7-fold cross-validated Q8 accuracy is 78.85%. Even templates from structures with only 20%~30% sequence similarity can help improve the 8-state prediction accuracy. More importantly, when good templates are available, the prediction accuracy of less frequent secondary structures, such as 3-10 helices, turns, and bends, are highly improved, which are useful for practical applications. Conclusions Our computational results show that the templates containing structural information are effective features to enhance 8-state secondary structure predictions. Our prediction algorithm is implemented on a web server named "C8-SCORPION" available at: http://hpcr.cs.odu.edu/c8scorpion.
Collapse
|
24
|
Vashisth H, Skiniotis G, Brooks CL. Collective variable approaches for single molecule flexible fitting and enhanced sampling. Chem Rev 2014; 114:3353-65. [PMID: 24446720 PMCID: PMC3983124 DOI: 10.1021/cr4005988] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2013] [Indexed: 12/12/2022]
Affiliation(s)
- Harish Vashisth
- Department
of Chemical Engineering, University of New
Hampshire, Durham, New Hampshire 03824, United States
| | - Georgios Skiniotis
- Life Sciences Institute, Department
of Biological Chemistry, and
Biophysics Program, and Department of Chemistry and Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Charles Lee Brooks
- Life Sciences Institute, Department
of Biological Chemistry, and
Biophysics Program, and Department of Chemistry and Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
25
|
Kuzu G, Keskin O, Nussinov R, Gursoy A. Modeling protein assemblies in the proteome. Mol Cell Proteomics 2014; 13:887-96. [PMID: 24445405 PMCID: PMC3945916 DOI: 10.1074/mcp.m113.031294] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Revised: 12/13/2013] [Indexed: 11/06/2022] Open
Abstract
Most (if not all) proteins function when associated in multimolecular assemblies. Attaining the structures of protein assemblies at the atomic scale is an important aim of structural biology. Experimentally, structures are increasingly available, and computations can help bridge the resolution gap between high- and low-resolution scales. Existing computational methods have made substantial progress toward this aim; however, current approaches are still limited. Some involve manual adjustment of experimental data; some are automated docking methods, which are computationally expensive and not applicable to large-scale proteome studies; and still others exploit the symmetry of the complexes and thus are not applicable to nonsymmetrical complexes. Our study aims to take steps toward overcoming these limitations. We have developed a strategy for the construction of protein assemblies computationally based on binary interactions predicted by a motif-based protein interaction prediction tool, PRISM (Protein Interactions by Structural Matching). Previously, we have shown its power in predicting pairwise interactions. Here we take a step toward multimolecular assemblies, reflecting the more prevalent cellular scenarios. With this method we are able to construct homo-/hetero-complexes and symmetric/asymmetric complexes without a limitation on the number of components. The method considers conformational changes and is applicable to large-scale studies. We also exploit electron microscopy density maps to select a solution from among the predictions. Here we present the method, illustrate its results, and highlight its current limitations.
Collapse
Affiliation(s)
- Guray Kuzu
- From the ‡Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| | - Ozlem Keskin
- From the ‡Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| | - Ruth Nussinov
- §Cancer and Inflammation Program, Leidos Biomedical Research, Inc., National Cancer Institute, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702
- ¶Sackler Institute of Molecular Medicine Department of Human Genetics and Molecular Medicine Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Attila Gursoy
- From the ‡Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| |
Collapse
|
26
|
Chandrakar B, Jain A, Roy S, Gutlapalli VR, Saraf S, Suppahia A, Verma A, Tiwari A, Yadav M, Nayarisseri A. Molecular modeling of Acetyl-CoA carboxylase (ACC) from Jatropha curcas and virtual screening for identification of inhibitors. ACTA ACUST UNITED AC 2013. [DOI: 10.1016/j.jopr.2013.07.032] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
27
|
Esquivel-Rodríguez J, Kihara D. Computational methods for constructing protein structure models from 3D electron microscopy maps. J Struct Biol 2013; 184:93-102. [PMID: 23796504 DOI: 10.1016/j.jsb.2013.06.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/11/2013] [Accepted: 06/13/2013] [Indexed: 12/31/2022]
Abstract
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Collapse
Affiliation(s)
- Juan Esquivel-Rodríguez
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
28
|
Kanda N, Abe F. Structural and functional implications of the yeast high-affinity tryptophan permease Tat2. Biochemistry 2013; 52:4296-307. [PMID: 23768406 DOI: 10.1021/bi4004638] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Tryptophan is hydrophobic, bulky, and the rarest amino acid found in nutrients. Accordingly, the import machinery can be specialized evolutionarily. Our previous study in Saccharomyces cerevisiae demonstrated that tryptophan import by the high-affinity tryptophan permease Tat2 is accompanied by a large volume increase during substrate import. Nevertheless, the mechanisms by which the permease mediates tryptophan recognition and permeation remain to be elucidated. Here we determined amino acid residues essential for Tat2-mediated tryptophan import. By means of random mutagenesis in combination with site-directed mutagenesis based on crystallographic studies of the Escherichia coli arginine/agmatine antiporter AdiC, we identified 15 amino acid residues in the Tat2 transmembrane domains (TMDs) 1, -3, -5, -8, and -10, which are responsible for tryptophan uptake. T98, Y167, and E286 were assumed to form the central cavity in Tat2. G97/T98 and E286 were located within the putative α-helix break in TMD1 and TMD6, respectively, which are highly conserved among yeast amino acid permeases and bacterial solute transporters. Given the conformational change in AdiC upon substrate binding, G97/T98 and E286 of Tat2 were assumed to mediate a structural shift from an outward-open to a tryptophan-bound-occluded structure upon tryptophan binding, and T320, V322, and F324 became stabilized in TMD7. Such dynamic structural changes may account for the large volume increase associated with tryptophan import occurring concomitantly with a movement of water molecules from the tryptophan binding site. We also propose the working hypothesis that E286 mediates the proton influx that is coupled to tryptophan import.
Collapse
Affiliation(s)
- Naoko Kanda
- Department of Chemistry and Biological Science, College of Science and Engineering, Aoyama Gakuin University, Sagamihara, Japan
| | | |
Collapse
|
29
|
Baker ML, Baker MR, Hryc CF, Ju T, Chiu W. Gorgon and pathwalking: macromolecular modeling tools for subnanometer resolution density maps. Biopolymers 2012; 97:655-68. [PMID: 22696403 PMCID: PMC3899894 DOI: 10.1002/bip.22065] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The complex interplay of proteins and other molecules, often in the form of large transitory assemblies, are critical to cellular function. Today, X-ray crystallography and electron cryo-microscopy (cryo-EM) are routinely used to image these macromolecular complexes, though often at limited resolutions. Despite the rapidly growing number of macromolecular structures, few tools exist for modeling and annotating structures in the range of 3-10 Å resolution. To address this need, we have developed a number of utilities specifically targeting subnanometer resolution density maps. As part of the 2010 Cryo-EM Modeling Challenge, we demonstrated two of our latest de novo modeling tools, Pathwalking and Gorgon, as well as a tool for secondary structure identification (SSEHunter) and a new rigid-body/flexible fitting tool in Gorgon. In total, we submitted 30 structural models from ten different subnanometer resolution data sets in four of the six challenge categories. Each of our utlities produced accurate structural models and annotations across the various density maps. In the end, the utilities that we present here offer users a robust toolkit for analyzing and modeling protein structure in macromolecular assemblies at non-atomic resolutions.
Collapse
Affiliation(s)
- Matthew L Baker
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, National Center for Macromolecular Imaging, Baylor College of Medicine, Houston, TX 77030, USA.
| | | | | | | | | |
Collapse
|
30
|
Wang Z, Schröder GF. Real-space refinement with DireX: from global fitting to side-chain improvements. Biopolymers 2012; 97:687-97. [PMID: 22696405 DOI: 10.1002/bip.22046] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Single-particle cryo-electron microscopy (cryo-EM) has become an important tool to determine the structure of large biomolecules and assemblies thereof. However, the achievable resolution varies considerably over a wide range of about 3.5-20 Å. The interpretation of these intermediate- to low-resolution density maps in terms of atomic models is a big challenge and an area of active research. Here, we present our real-space structure refinement program DireX, which was developed primarily for cryo-EM-derived density maps. The basic principle and its main features are described. DireX employs Deformable Elastic Network (DEN) restraints to reduce overfitting by decreasing the effective number of degrees of freedom used in the refinement. Missing or reduced density due to flexible parts of the protein can lead to artifacts in the structure refinement, which is addressed through the concept of restrained grouped occupancy refinement. Furthermore, we describe the performance of DireX in the 2010 Cryo-EM Modeling Challenge, where we chose six density maps of four different proteins provided by the Modeling Challenge exemplifying typical refinement results at a large resolution range from 3 to 23 Å.
Collapse
Affiliation(s)
- Zhe Wang
- Institute of Complex Systems, Forschungszentrum Jülich, Jülich, Germany
| | | |
Collapse
|
31
|
Lindert S, Alexander N, Wötzel N, Karakaş M, Stewart PL, Meiler J. EM-fold: de novo atomic-detail protein structure determination from medium-resolution density maps. Structure 2012; 20:464-78. [PMID: 22405005 DOI: 10.1016/j.str.2012.01.023] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Revised: 01/23/2012] [Accepted: 01/26/2012] [Indexed: 11/17/2022]
Abstract
Electron density maps of membrane proteins or large macromolecular complexes are frequently only determined at medium resolution between 4 Å and 10 Å, either by cryo-electron microscopy or X-ray crystallography. In these density maps, the general arrangement of secondary structure elements (SSEs) is revealed, whereas their directionality and connectivity remain elusive. We demonstrate that the topology of proteins with up to 250 amino acids can be determined from such density maps when combined with a computational protein folding protocol. Furthermore, we accurately reconstruct atomic detail in loop regions and amino acid side chains not visible in the experimental data. The EM-Fold algorithm assembles the SSEs de novo before atomic detail is added using Rosetta. In a benchmark of 27 proteins, the protocol consistently and reproducibly achieves models with root mean square deviation values <3 Å.
Collapse
Affiliation(s)
- Steffen Lindert
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, TN 37212, USA
| | | | | | | | | | | |
Collapse
|
32
|
Finding rigid bodies in protein structures: Application to flexible fitting into cryoEM maps. J Struct Biol 2012; 177:520-31. [DOI: 10.1016/j.jsb.2011.10.011] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2011] [Revised: 10/22/2011] [Accepted: 10/27/2011] [Indexed: 11/18/2022]
|
33
|
Reconstructing virus structures from nanometer to near-atomic resolutions with cryo-electron microscopy and tomography. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 726:49-90. [PMID: 22297510 DOI: 10.1007/978-1-4614-0980-9_4] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
The past few decades have seen tremendous advances in single-particle electron -cryo-microscopy (cryo-EM). The field has matured to the point that near-atomic resolution density maps can be generated for icosahedral viruses without the need for crystallization. In parallel, substantial progress has been made in determining the structures of nonicosahedrally arranged proteins in viruses by employing either single-particle cryo-EM or cryo-electron tomography (cryo-ET). Implicit in this course have been the availability of a new generation of electron cryo-microscopes and the development of the computational tools that are essential for generating these maps and models. This methodology has enabled structural biologists to analyze structures in increasing detail for virus particles that are in different morphogenetic states. Furthermore, electron imaging of frozen, hydrated cells, in the process of being infected by viruses, has also opened up a new avenue for studying virus structures "in situ". Here we present the common techniques used to acquire and process cryo-EM and cryo-ET data and discuss their implications for structural virology both now and in the future.
Collapse
|
34
|
Chan KY, Gumbart J, McGreevy R, Watermeyer JM, Sewell BT, Schulten K. Symmetry-restrained flexible fitting for symmetric EM maps. Structure 2011; 19:1211-8. [PMID: 21893283 DOI: 10.1016/j.str.2011.07.017] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2011] [Revised: 07/21/2011] [Accepted: 07/28/2011] [Indexed: 01/25/2023]
Abstract
Many large biological macromolecules have inherent structural symmetry, being composed of a few distinct subunits, repeated in a symmetric array. These complexes are often not amenable to traditional high-resolution structural determination methods, but can be imaged in functionally relevant states using cryo-electron microscopy (cryo-EM). A number of methods for fitting atomic-scale structures into cryo-EM maps have been developed, including the molecular dynamics flexible fitting (MDFF) method. However, quality and resolution of the cryo-EM map are the major determinants of a method's success. In order to incorporate knowledge of structural symmetry into the fitting procedure, we developed the symmetry-restrained MDFF method. The new method adds to the cryo-EM map-derived potential further restraints on the allowed conformations of a complex during fitting, thereby improving the quality of the resultant structure. The benefit of using symmetry-based restraints during fitting, particularly for medium to low-resolution data, is demonstrated for three different systems.
Collapse
Affiliation(s)
- Kwok-Yan Chan
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | | | | | | | | | |
Collapse
|
35
|
AL NASR KAMAL, RANJAN DESH, ZUBAIR MOHAMMAD, HE JING. RANKING VALID TOPOLOGIES OF THE SECONDARY STRUCTURE ELEMENTS USING A CONSTRAINT GRAPH. J Bioinform Comput Biol 2011; 9:415-30. [DOI: 10.1142/s0219720011005604] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Revised: 04/12/2011] [Accepted: 04/17/2011] [Indexed: 11/18/2022]
Abstract
Electron cryo-microscopy is a fast advancing biophysical technique to derive three-dimensional structures of large protein complexes. Using this technique, many density maps have been generated at intermediate resolution such as 6–10 Å resolution. Although it is challenging to derive the backbone of the protein directly from such density maps, secondary structure elements such as helices and β-sheets can be computationally detected. Our work in this paper provides an approach to enumerate the top-ranked possible topologies instead of enumerating the entire population of the topologies. This approach is particularly practical for large proteins. We developed a directed weighted graph, the topology graph, to represent the secondary structure assignment problem. We prove that the problem of finding the valid topology with the minimum cost is NP hard. We developed an O(N2 2N) dynamic programming algorithm to identify the topology with the minimum cost. The test of 15 proteins suggests that our dynamic programming approach is feasible to work with proteins of much larger size than we could before. The largest protein in the test contains 18 helical sticks detected from the density map out of 33 helices in the protein.
Collapse
Affiliation(s)
- KAMAL AL NASR
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - DESH RANJAN
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - MOHAMMAD ZUBAIR
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - JING HE
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
36
|
LU YONGGANG, HE JING, STRAUSS CHARLIEEM. DERIVING TOPOLOGY AND SEQUENCE ALIGNMENT FOR THE HELIX SKELETON IN LOW-RESOLUTION PROTEIN DENSITY MAPS. J Bioinform Comput Biol 2011; 6:183-201. [DOI: 10.1142/s0219720008003357] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2007] [Revised: 10/07/2007] [Accepted: 10/13/2007] [Indexed: 11/18/2022]
Abstract
Cryoelectron microscopy (cryoEM) is an experimental technique to determine the three-dimensional (3D) structure of large protein complexes. Currently, this technique is able to generate protein density maps at 6–9 Å resolution, at which the skeleton of the structure (which is composed of α-helices and β-sheets) can be visualized. As a step towards predicting the entire backbone of the protein from the protein density map, we developed a method to predict the topology and sequence alignment for the skeleton helices. Our method combines the geometrical information of the skeleton helices with the Rosetta ab initio structure prediction method to derive a consensus topology and sequence alignment for the skeleton helices. We tested the method with 60 proteins. For 45 proteins, the majority of the skeleton helices were assigned a correct topology from one of our top ten predictions. The offsets of the alignment for most of the assigned helices were within ±2 amino acids in the sequence. We also analyzed the use of the skeleton helices as a clustering tool for the decoy structures generated by Rosetta. Our comparison suggests that the topology clustering is a better method than a general overlap clustering method to enrich the ranking of decoys, particularly when the decoy pool is small.
Collapse
Affiliation(s)
- YONGGANG LU
- Department of Computer Science, New Mexico State University, Las Cruces, NM 88003, USA
| | - JING HE
- Department of Computer Science, New Mexico State University, Las Cruces, NM 88003, USA
| | - CHARLIE E. M. STRAUSS
- Bioscience Division, M888, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
37
|
Sun W, He J. From isotropic to anisotropic side chain representations: comparison of three models for residue contact estimation. PLoS One 2011; 6:e19238. [PMID: 21552527 PMCID: PMC3084275 DOI: 10.1371/journal.pone.0019238] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2010] [Accepted: 03/29/2011] [Indexed: 11/19/2022] Open
Abstract
The criterion to determine residue contact is a fundamental problem in deriving knowledge-based mean-force potential energy calculations for protein structures. A frequently used criterion is to require the side chain center-to-center distance or the -to- atom distance to be within a pre-determined cutoff distance. However, the spatially anisotropic nature of the side chain determines that it is challenging to identify the contact pairs. This study compares three side chain contact models: the Atom Distance criteria (ADC) model, the Isotropic Sphere Side chain (ISS) model and the Anisotropic Ellipsoid Side chain (AES) model using 424 high resolution protein structures in the Protein Data Bank. The results indicate that the ADC model is the most accurate and ISS is the worst. The AES model eliminates about 95% of the incorrectly counted contact-pairs in the ISS model. Algorithm analysis shows that AES model is the most computational intensive while ADC model has moderate computational cost. We derived a dataset of the mis-estimated contact pairs by AES model. The most misjudged pairs are Arg-Glu, Arg-Asp and Arg-Tyr. Such a dataset can be useful for developing the improved AES model by incorporating the pair-specific information for the cutoff distance.
Collapse
Affiliation(s)
- Weitao Sun
- Zhou Pei-Yuan Center for Applied Mathematics, Tsinghua University, Beijing, China.
| | | |
Collapse
|
38
|
Accurate flexible fitting of high-resolution protein structures into cryo-electron microscopy maps using coarse-grained pseudo-energy minimization. Biophys J 2011; 100:478-88. [PMID: 21244844 DOI: 10.1016/j.bpj.2010.12.3680] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2010] [Revised: 11/05/2010] [Accepted: 12/02/2010] [Indexed: 11/22/2022] Open
Abstract
Cryo-electron microscopy (cryo-EM) has been widely used to explore conformational states of large biomolecular assemblies. The detailed interpretation of cryo-EM data requires the flexible fitting of a known high-resolution protein structure into a low-resolution cryo-EM map. To this end, we have developed what we believe is a new method based on a two-bead-per-residue protein representation, and a modified form of the elastic network model that allows large-scale conformational changes while maintaining pseudobonds and secondary structures. Our method minimizes a pseudo-energy which linearly combines various terms of the modified elastic network model energy with a cryo-EM-fitting score and a collision energy that penalizes steric collisions. Unlike previous flexible fitting efforts using the lowest few normal modes, our method effectively utilizes all normal modes so that both global and local structural changes can be fully modeled. We have validated our method for a diverse set of 10 pairs of protein structures using simulated cryo-EM maps with a range of resolutions and in the absence/presence of random noise. We have shown that our method is both accurate and efficient compared with alternative techniques, and its performance is robust to the addition of random noise. Our method is also shown to be useful for the flexible fitting of three experimental cryo-EM maps.
Collapse
|
39
|
Chaudhuri BN, Gupta S, Urban VS, Chance MR, D'Mello R, Smith L, Lyons K, Gee J. A combined global and local approach to elucidate spatial organization of the Mycobacterial ParB-parS partition assembly. Biochemistry 2011; 50:1799-807. [PMID: 21142182 PMCID: PMC3081668 DOI: 10.1021/bi1016759] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Combining diverse sets of data at global (size, shape) and local (residue) scales is an emerging trend for elucidating the organization and function of the cellular assemblies. We used such a strategy, combining data from X-ray and neutron scattering with H/D-contrast variation and X-ray footprinting with mass spectrometry, to elucidate the spatial organization of the ParB-parS assembly from Mycobacterium tuberculosis. The ParB-parS participates in plasmid and chromosome segregation and condensation in predivisional bacterial cells. ParB polymerizes around the parS centromere(s) to form a higher-order assembly that serves to recruit cyto-skeletal ParA ATPases and SMC proteins for chromosome segregation. A hybrid model of the ParB-parS was built by combining and correlating computational models with experiment-derived information about size, shape, position of the symmetry axis within the shape, internal topology, DNA-protein interface, exposed surface patches, and prior knowledge. This first view of the ParB-parS leads us to propose how ParB spread on the chromosome to form a larger assembly.
Collapse
|
40
|
Beck M, Topf M, Frazier Z, Tjong H, Xu M, Zhang S, Alber F. Exploring the spatial and temporal organization of a cell's proteome. J Struct Biol 2011; 173:483-96. [PMID: 21094684 PMCID: PMC3784337 DOI: 10.1016/j.jsb.2010.11.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2010] [Revised: 11/05/2010] [Accepted: 11/08/2010] [Indexed: 10/18/2022]
Abstract
To increase our current understanding of cellular processes, such as cell signaling and division, knowledge is needed about the spatial and temporal organization of the proteome at different organizational levels. These levels cover a wide range of length and time scales: from the atomic structures of macromolecules for inferring their molecular function, to the quantitative description of their abundance, and spatial distribution in the cell. Emerging new experimental technologies are greatly increasing the availability of such spatial information on the molecular organization in living cells. This review addresses three fields that have significantly contributed to our understanding of the proteome's spatial and temporal organization: first, methods for the structure determination of individual macromolecular assemblies, specifically the fitting of atomic structures into density maps generated from electron microscopy techniques; second, research that visualizes the spatial distributions of these complexes within the cellular context using cryo electron tomography techniques combined with computational image processing; and third, methods for the spatial modeling of the dynamic organization of the proteome, specifically those methods for simulating reaction and diffusion of proteins and complexes in crowded intracellular fluids. The long-term goal is to integrate the varied data about a proteome's organization into a spatially explicit, predictive model of cellular processes.
Collapse
Affiliation(s)
- Martin Beck
- European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Maya Topf
- Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK
| | - Zachary Frazier
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Harianto Tjong
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Min Xu
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Shihua Zhang
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Frank Alber
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| |
Collapse
|
41
|
Baker ML, Abeysinghe SS, Schuh S, Coleman RA, Abrams A, Marsh MP, Hryc CF, Ruths T, Chiu W, Ju T. Modeling protein structure at near atomic resolutions with Gorgon. J Struct Biol 2011; 174:360-73. [PMID: 21296162 DOI: 10.1016/j.jsb.2011.01.015] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2010] [Revised: 01/27/2011] [Accepted: 01/31/2011] [Indexed: 11/29/2022]
Abstract
Electron cryo-microscopy (cryo-EM) has played an increasingly important role in elucidating the structure and function of macromolecular assemblies in near native solution conditions. Typically, however, only non-atomic resolution reconstructions have been obtained for these large complexes, necessitating computational tools for integrating and extracting structural details. With recent advances in cryo-EM, maps at near-atomic resolutions have been achieved for several macromolecular assemblies from which models have been manually constructed. In this work, we describe a new interactive modeling toolkit called Gorgon targeted at intermediate to near-atomic resolution density maps (10-3.5 Å), particularly from cryo-EM. Gorgon's de novo modeling procedure couples sequence-based secondary structure prediction with feature detection and geometric modeling techniques to generate initial protein backbone models. Beyond model building, Gorgon is an extensible interactive visualization platform with a variety of computational tools for annotating a wide variety of 3D volumes. Examples from cryo-EM maps of Rotavirus and Rice Dwarf Virus are used to demonstrate its applicability to modeling protein structure.
Collapse
Affiliation(s)
- Matthew L Baker
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Yin S, Dokholyan NV. Fingerprint-based structure retrieval using electron density. Proteins 2011; 79:1002-9. [PMID: 21287628 DOI: 10.1002/prot.22941] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2010] [Revised: 10/08/2010] [Accepted: 11/05/2010] [Indexed: 12/14/2022]
Abstract
We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy.
Collapse
Affiliation(s)
- Shuangye Yin
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
| | | |
Collapse
|
43
|
Automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal. Methods Mol Biol 2011; 857:107-36. [PMID: 22323219 DOI: 10.1007/978-1-61779-588-6_5] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Comparative protein structure modeling is a computational approach to build three-dimensional structural models for proteins using experimental structures of related protein family members as templates. Regular blind assessments of modeling accuracy have demonstrated that comparative protein structure modeling is currently the most reliable technique to model protein structures. Homology models are often sufficiently accurate to substitute for experimental structures in a wide variety of applications. Since the usefulness of a model for specific application is determined by its accuracy, model quality estimation is an essential component of protein structure prediction. Comparative protein modeling has become a routine approach in many areas of life science research since fully automated modeling systems allow also nonexperts to build reliable models. In this chapter, we describe practical approaches for automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal.
Collapse
|
44
|
Application of biasing-potential replica-exchange simulations for loop modeling and refinement of proteins in explicit solvent. Proteins 2010; 78:2809-19. [DOI: 10.1002/prot.22796] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
45
|
Lasker K, Phillips JL, Russel D, Velázquez-Muriel J, Schneidman-Duhovny D, Tjioe E, Webb B, Schlessinger A, Sali A. Integrative structure modeling of macromolecular assemblies from proteomics data. Mol Cell Proteomics 2010; 9:1689-702. [PMID: 20507923 DOI: 10.1074/mcp.r110.000067] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Proteomics techniques have been used to generate comprehensive lists of protein interactions in a number of species. However, relatively little is known about how these interactions result in functional multiprotein complexes. This gap can be bridged by combining data from proteomics experiments with data from established structure determination techniques. Correspondingly, integrative computational methods are being developed to provide descriptions of protein complexes at varying levels of accuracy and resolution, ranging from complex compositions to detailed atomic structures.
Collapse
Affiliation(s)
- Keren Lasker
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California 94158, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Gartmann M, Blau M, Armache JP, Mielke T, Topf M, Beckmann R. Mechanism of eIF6-mediated inhibition of ribosomal subunit joining. J Biol Chem 2010; 285:14848-14851. [PMID: 20356839 PMCID: PMC2865328 DOI: 10.1074/jbc.c109.096057] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2009] [Revised: 02/09/2010] [Indexed: 01/25/2023] Open
Abstract
During the process of ribosomal assembly, the essential eukaryotic translation initiation factor 6 (eIF6) is known to act as a ribosomal anti-association factor. However, a molecular understanding of the anti-association activity of eIF6 is still missing. Here we present the cryo-electron microscopy reconstruction of a complex of the large ribosomal subunit with eukaryotic eIF6 from Saccharomyces cerevisiae. The structure reveals that the eIF6 binding site involves mainly rpL23 (L14p in Escherichia coli). Based on our structural data, we propose that the mechanism of the anti-association activity of eIF6 is based on steric hindrance of intersubunit bridge formation around the dynamic bridge B6.
Collapse
Affiliation(s)
- Marco Gartmann
- Department of Biochemistry, Gene Center and Center for Integrated Protein Science (CiPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | - Michael Blau
- Institut für Biochemie der Charité, Humboldt Universität Berlin, Monbijoustrasse 2, 10117 Berlin, Germany
| | - Jean-Paul Armache
- Department of Biochemistry, Gene Center and Center for Integrated Protein Science (CiPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | - Thorsten Mielke
- UltraStrukturNetzwerk, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin; Institut für Medizinische Physik und Biophysik, Charité, Ziegelstrasse 5-9, 10098 Berlin, Germany
| | - Maya Topf
- Institute of Structural and Molecular Biology, School of Crystallography, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom
| | - Roland Beckmann
- Department of Biochemistry, Gene Center and Center for Integrated Protein Science (CiPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany.
| |
Collapse
|
47
|
Rawi R, Whitmore L, Topf M. CHOYCE: a web server for constrained homology modelling with cryoEM maps. ACTA ACUST UNITED AC 2010; 26:1673-4. [PMID: 20444836 PMCID: PMC2887048 DOI: 10.1093/bioinformatics/btq237] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
SUMMARY CHOYCE is a web server for homology modelling of protein components and the fitting of those components into cryo electron microscopy (cryoEM) maps of their assemblies. It provides an interactive approach to improving the selection of models based on the quality of their fit into the EM map. AVAILABILITY http://choyce.ismb.lon.ac.uk/ CONTACT m.topf@cryst.bbk.ac.uk; reda.rawi@uni-due.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Reda Rawi
- Institute of Structural and Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK.
| | | | | |
Collapse
|
48
|
Zhu J, Cheng L, Fang Q, Zhou ZH, Honig B. Building and refining protein models within cryo-electron microscopy density maps based on homology modeling and multiscale structure refinement. J Mol Biol 2010; 397:835-51. [PMID: 20109465 DOI: 10.1016/j.jmb.2010.01.041] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Revised: 01/04/2010] [Accepted: 01/20/2010] [Indexed: 11/16/2022]
Abstract
Automatic modeling methods using cryoelectron microscopy (cryoEM) density maps as constraints are promising approaches to building atomic models of individual proteins or protein domains. However, their application to large macromolecular assemblies has not been possible largely due to computational limitations inherent to such unsupervised methods. Here we describe a new method, EM-IMO (electron microscopy-iterative modular optimization), for building, modifying and refining local structures of protein models using cryoEM maps as a constraint. As a supervised refinement method, EM-IMO allows users to specify parameters derived from inspections so as to guide, and as a consequence, significantly speed up the refinement. An EM-IMO-based refinement protocol is first benchmarked on a data set of 50 homology models using simulated density maps. A multiscale refinement strategy that combines EM-IMO-based and molecular dynamics-based refinement is then applied to build backbone models for the seven conformers of the five capsid proteins in our near-atomic-resolution cryoEM map of the grass carp reovirus virion, a member of the Aquareovirus genus of the Reoviridae family. The refined models allow us to reconstruct a backbone model of the entire grass carp reovirus capsid and provide valuable functional insights that are described in the accompanying publication [Cheng, L., Zhu, J., Hui, W. H., Zhang, X., Honig, B., Fang, Q. & Zhou, Z. H. (2010). Backbone model of an aquareovirus virion by cryo-electron microscopy and bioinformatics. J. Mol. Biol. (this issue). doi:10.1016/j.jmb.2009.12.027.]. Our study demonstrates that the integrated use of homology modeling and a multiscale refinement protocol that combines supervised and automated structure refinement offers a practical strategy for building atomic models based on medium- to high-resolution cryoEM density maps.
Collapse
Affiliation(s)
- Jiang Zhu
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | | | | | | | | |
Collapse
|
49
|
Al Nasr K, Sun W, He J. Structure prediction for the helical skeletons detected from the low resolution protein density map. BMC Bioinformatics 2010; 11 Suppl 1:S44. [PMID: 20122218 PMCID: PMC3009517 DOI: 10.1186/1471-2105-11-s1-s44] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Background The current advances in electron cryo-microscopy technique have made it possible to obtain protein density maps at about 6-10 Å resolution. Although it is hard to derive the protein chain directly from such a low resolution map, the location of the secondary structures such as helices and strands can be computationally detected. It has been demonstrated that such low-resolution map can be used during the protein structure prediction process to enhance the structure prediction. Results We have developed an approach to predict the 3-dimensional structure for the helical skeletons that can be detected from the low resolution protein density map. This approach does not require the construction of the entire chain and distinguishes the structures based on the conformation of the helices. A test with 35 low resolution density maps shows that the highest ranked structure with the correct topology can be found within the top 1% of the list ranked by the effective energy formed by the helices. Conclusion The results in this paper suggest that it is possible to eliminate the great majority of the bad conformations of the helices even without the construction of the entire chain of the protein. For many proteins, the effective contact energy formed by the secondary structures alone can distinguish a small set of likely structures from the pool.
Collapse
Affiliation(s)
- Kamal Al Nasr
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA.
| | | | | |
Collapse
|
50
|
Abstract
Today, electron cryomicroscopy (cryo-EM) can routinely achieve subnanometer resolutions of complex macromolecular assemblies. From a density map, one can extract key structural and functional information using a variety of computational analysis tools. At subnanometer resolution, these tools make it possible to isolate individual subunits, identify secondary structures, and accurately fit atomic models. With several cryo-EM studies achieving resolutions beyond 5Å, computational modeling and feature recognition tools have been employed to construct backbone and atomic models of the protein components directly from a density map. In this chapter, we describe several common classes of computational tools that can be used to analyze and model subnanometer resolution reconstructions from cryo-EM. A general protocol for analyzing subnanometer resolution density maps is presented along with a full description of steps used in analyzing the 4.3Å resolution structure of Mm-cpn.
Collapse
Affiliation(s)
- Matthew L Baker
- National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, USA
| | | | | | | |
Collapse
|