1
|
Chung JM, Durie CL, Lee J. Artificial Intelligence in Cryo-Electron Microscopy. Life (Basel) 2022; 12:1267. [PMID: 36013446 PMCID: PMC9410485 DOI: 10.3390/life12081267] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 08/15/2022] [Accepted: 08/18/2022] [Indexed: 11/17/2022] Open
Abstract
Cryo-electron microscopy (cryo-EM) has become an unrivaled tool for determining the structure of macromolecular complexes. The biological function of macromolecular complexes is inextricably tied to the flexibility of these complexes. Single particle cryo-EM can reveal the conformational heterogeneity of a biochemically pure sample, leading to well-founded mechanistic hypotheses about the roles these complexes play in biology. However, the processing of increasingly large, complex datasets using traditional data processing strategies is exceedingly expensive in both user time and computational resources. Current innovations in data processing capitalize on artificial intelligence (AI) to improve the efficiency of data analysis and validation. Here, we review new tools that use AI to automate the data analysis steps of particle picking, 3D map reconstruction, and local resolution determination. We discuss how the application of AI moves the field forward, and what obstacles remain. We also introduce potential future applications of AI to use cryo-EM in understanding protein communities in cells.
Collapse
Affiliation(s)
- Jeong Min Chung
- Department of Biotechnology, The Catholic University of Korea, Bucheon-si 14662, Gyeonggi, Korea
| | - Clarissa L. Durie
- Department of Biochemistry, University of Missouri, Columbia, MO 65211, USA
| | - Jinseok Lee
- Department of Biomedical Engineering, Kyung Hee University, Yongin-si 17104, Gyeonggi, Korea
| |
Collapse
|
2
|
He J, Lin P, Chen J, Cao H, Huang SY. Model building of protein complexes from intermediate-resolution cryo-EM maps with deep learning-guided automatic assembly. Nat Commun 2022; 13:4066. [PMID: 35831370 PMCID: PMC9279371 DOI: 10.1038/s41467-022-31748-9] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 06/30/2022] [Indexed: 12/29/2022] Open
Abstract
Advances in microscopy instruments and image processing algorithms have led to an increasing number of cryo-electron microscopy (cryo-EM) maps. However, building accurate models into intermediate-resolution EM maps remains challenging and labor-intensive. Here, we propose an automatic model building method of multi-chain protein complexes from intermediate-resolution cryo-EM maps, named EMBuild, by integrating AlphaFold structure prediction, FFT-based global fitting, domain-based semi-flexible refinement, and graph-based iterative assembling on the main-chain probability map predicted by a deep convolutional network. EMBuild is extensively evaluated on diverse test sets of 47 single-particle EM maps at 4.0-8.0 Å resolution and 16 subtomogram averaging maps of cryo-ET data at 3.7-9.3 Å resolution, and compared with state-of-the-art approaches. We demonstrate that EMBuild is able to build high-quality complex structures that are comparably accurate to the manually built PDB structures from the cryo-EM maps. These results demonstrate the accuracy and reliability of EMBuild in automatic model building.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Peicong Lin
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Ji Chen
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Hong Cao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Sheng-You Huang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| |
Collapse
|
3
|
Zhang B, Zhang W, Pearce R, Zhang Y, Shen HB. Fitting Low-Resolution Protein Structures into Cryo-EM Density Maps by Multiobjective Optimization of Global and Local Correlations. J Phys Chem B 2021; 125:528-538. [PMID: 33397114 DOI: 10.1021/acs.jpcb.0c09903] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The rigid-body fitting of predicted structural models into cryo-electron microscopy (cryo-EM) density maps is a necessary procedure for density map-guided protein structure determination and prediction. We proposed a novel multiobjective optimization protocol, MOFIT, which performs a rigid-body density-map fitting based on particle swarm optimization (PSO). MOFIT was tested on a large set of 292 nonhomologous single-domain proteins. Starting from structural models predicted by I-TASSER, MOFIT achieved an average coordinate root-mean-square deviation of 2.46 Å, which was 1.57, 2.79, and 3.95 Å lower than three leading single-objective function-based methods, where the differences were statistically significant with p-values of 1.65 × 10-6, 6.36 × 10-8, and 6.44 × 10-11 calculated using two-tail Student's t tests. Detailed analyses showed that the major advantages of MOFIT lie in the multiobjective protocol and the extensive PSO search simulations guided by the composite objective functions, which integrates complementary correlation coefficients from the global structure, local fragments, and individual residues with the cryo-EM density maps.
Collapse
Affiliation(s)
- Biao Zhang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Wenyi Zhang
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China
| |
Collapse
|
4
|
Abstract
Genome sequencing projects have resulted in a rapid increase in the number of known protein sequences. In contrast, only about one-hundredth of these sequences have been characterized at atomic resolution using experimental structure determination methods. Computational protein structure modeling techniques have the potential to bridge this sequence-structure gap. In the following chapter, we present an example that illustrates the use of MODELLER to construct a comparative model for a protein with unknown structure. Automation of a similar protocol has resulted in models of useful accuracy for domains in more than half of all known protein sequences.
Collapse
|
5
|
Aderinwale T, Christoffer CW, Sarkar D, Alnabati E, Kihara D. Computational structure modeling for diverse categories of macromolecular interactions. Curr Opin Struct Biol 2020; 64:1-8. [PMID: 32599506 PMCID: PMC7665979 DOI: 10.1016/j.sbi.2020.05.017] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 05/06/2020] [Accepted: 05/21/2020] [Indexed: 01/23/2023]
Abstract
Computational protein-protein docking is one of the most intensively studied topics in structural bioinformatics. The field has made substantial progress through over three decades of development. The development began with methods for rigid-body docking of two proteins, which have now been extended in different directions to cover the various macromolecular interactions observed in a cell. Here, we overview the recent developments of the variations of docking methods, including multiple protein docking, peptide-protein docking, and disordered protein docking methods.
Collapse
Affiliation(s)
- Tunde Aderinwale
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA; Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| |
Collapse
|
6
|
Rout MP, Sali A. Principles for Integrative Structural Biology Studies. Cell 2020; 177:1384-1403. [PMID: 31150619 DOI: 10.1016/j.cell.2019.05.016] [Citation(s) in RCA: 182] [Impact Index Per Article: 36.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Revised: 04/24/2019] [Accepted: 05/06/2019] [Indexed: 12/22/2022]
Abstract
Integrative structure determination is a powerful approach to modeling the structures of biological systems based on data produced by multiple experimental and theoretical methods, with implications for our understanding of cellular biology and drug discovery. This Primer introduces the theory and methods of integrative approaches, emphasizing the kinds of data that can be effectively included in developing models and using the nuclear pore complex as an example to illustrate the practice and challenges involved. These guidelines are intended to aid the researcher in understanding and applying integrative structural methods to systems of their interest and thus take advantage of this rapidly evolving field.
Collapse
Affiliation(s)
- Michael P Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY 10065, USA.
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, Byers Hall, 1700 4th Street, Suite 503B, University of California, San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
7
|
Abstract
Many of the biological functions of the cell are driven by protein-protein interactions. However, determining which proteins interact and exactly how they do so to enable their functions, remain major research questions. Functional interactions are dependent on a number of complicated factors; therefore, modeling the three-dimensional structure of protein-protein complexes is still considered a complex endeavor. Nevertheless, the rewards for modeling protein interactions to atomic level detail are substantial, and there are numerous examples of how models can provide useful information for drug design, protein engineering, systems biology, and understanding of the immune system. Here, we provide practical guidelines for docking proteins using the web-server, SwarmDock, a flexible protein-protein docking method. Moreover, we provide an overview of the factors that need to be considered when deciding whether docking is likely to be successful.
Collapse
Affiliation(s)
- Iain H Moal
- European Bioinformatics Institute, Hinxton, UK
| | | | | | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK.
| |
Collapse
|
8
|
Vallotton P, Rajoo S, Wojtynek M, Onischenko E, Kralt A, Derrer CP, Weis K. Mapping the native organization of the yeast nuclear pore complex using nuclear radial intensity measurements. Proc Natl Acad Sci U S A 2019; 116:14606-14613. [PMID: 31262825 PMCID: PMC6642398 DOI: 10.1073/pnas.1903764116] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Selective transport across the nuclear envelope (NE) is mediated by the nuclear pore complex (NPC), a massive ∼100-MDa assembly composed of multiple copies of ∼30 nuclear pore proteins (Nups). Recent advances have shed light on the composition and structure of NPCs, but approaches that could map their organization in live cells are still lacking. Here, we introduce an in vivo method to perform nuclear radial intensity measurements (NuRIM) using fluorescence microscopy to determine the average position of NE-localized proteins along the nucleocytoplasmic transport axis. We apply NuRIM to study the organization of the NPC and the mobile transport machinery in budding yeast. This reveals a unique snapshot of the intact yeast NPC and identifies distinct steady-state localizations for various NE-associated proteins and nuclear transport factors. We find that the NPC architecture is robust against compositional changes and could also confirm that in contrast to Chlamydomonas reinhardtii, the scaffold Y complex is arranged symmetrically in the yeast NPC. Furthermore, NuRIM was applied to probe the orientation of intrinsically disordered FG-repeat segments, providing insight into their roles in selective NPC permeability and structure.
Collapse
Affiliation(s)
- Pascal Vallotton
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland;
| | - Sasikumar Rajoo
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland
| | - Matthias Wojtynek
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland
- Department of Biochemistry, University of Zürich, 8057 Zürich, Switzerland
| | - Evgeny Onischenko
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland
| | - Annemarie Kralt
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland
| | - Carina Patrizia Derrer
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland
| | - Karsten Weis
- Institute of Biochemistry, Department of Biology, Swiss Federal Institute of Technology Zürich (ETH Zürich), 8093 Zürich, Switzerland;
| |
Collapse
|
9
|
Vallat B, Webb B, Westbrook J, Sali A, Berman HM. Archiving and disseminating integrative structure models. JOURNAL OF BIOMOLECULAR NMR 2019; 73:385-398. [PMID: 31278630 PMCID: PMC6692293 DOI: 10.1007/s10858-019-00264-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 06/25/2019] [Indexed: 05/04/2023]
Abstract
Limitations in the applicability, accuracy, and precision of individual structure characterization methods can sometimes be overcome via an integrative modeling approach that relies on information from all available sources, including all available experimental data and prior models. The open-source Integrative Modeling Platform (IMP) is one piece of software that implements all computational aspects of integrative modeling. To maximize the impact of integrative structures, the coordinates should be made publicly available, as is already the case for structures based on X-ray crystallography, NMR spectroscopy, and electron microscopy. Moreover, the associated experimental data and modeling protocols should also be archived, such that the original results can easily be reproduced. Finally, it is essential that the integrative structures are validated as part of their publication and deposition. A number of research groups have already developed software to implement integrative modeling and have generated a number of structures, prompting the formation of an Integrative/Hybrid Methods Task Force. Following the recommendations of this task force, the existing PDBx/mmCIF data representation used for atomic PDB structures has been extended to address the requirements for archiving integrative structural models. This IHM-dictionary adds a flexible model representation, including coarse graining, models in multiple states and/or related by time or other order, and multiple input experimental information sources. A prototype archiving system called PDB-Dev ( https://pdb-dev.wwpdb.org ) has also been created to archive integrative structural models, together with a Python library to facilitate handling of integrative models in PDBx/mmCIF format.
Collapse
Affiliation(s)
- Brinda Vallat
- Institute for Quantitative Biomedicine, Piscataway, USA
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA, 94143, USA
| | - John Westbrook
- Institute for Quantitative Biomedicine, Piscataway, USA
- RCSB Protein Data Bank, Piscataway, USA
| | - Andrej Sali
- RCSB Protein Data Bank, Piscataway, USA.
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA, 94143, USA.
- Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA, 94143, USA.
- Lead Contacts, San Francisco, USA.
| | - Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
- Lead Contacts, Piscataway, USA.
| |
Collapse
|
10
|
Vreven T, Schweppe DK, Chavez JD, Weisbrod CR, Shibata S, Zheng C, Bruce JE, Weng Z. Integrating Cross-Linking Experiments with Ab Initio Protein-Protein Docking. J Mol Biol 2018; 430:1814-1828. [PMID: 29665372 DOI: 10.1016/j.jmb.2018.04.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Revised: 03/19/2018] [Accepted: 04/10/2018] [Indexed: 12/23/2022]
Abstract
Ab initio protein-protein docking algorithms often rely on experimental data to identify the most likely complex structure. We integrated protein-protein docking with the experimental data of chemical cross-linking followed by mass spectrometry. We tested our approach using 19 cases that resulted from an exhaustive search of the Protein Data Bank for protein complexes with cross-links identified in our experiments. We implemented cross-links as constraints based on Euclidean distance or void-volume distance. For most test cases, the rank of the top-scoring near-native prediction was improved by at least twofold compared with docking without the cross-link information, and the success rate for the top 5 predictions nearly tripled. Our results demonstrate the delicate balance between retaining correct predictions and eliminating false positives. Several test cases had multiple components with distinct interfaces, and we present an approach for assigning cross-links to the interfaces. Employing the symmetry information for these cases further improved the performance of complex structure prediction.
Collapse
Affiliation(s)
- Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | - Devin K Schweppe
- Department of Chemistry and Department of Genome Sciences, University of Washington, Seattle, WA 98109, USA
| | - Juan D Chavez
- Department of Chemistry and Department of Genome Sciences, University of Washington, Seattle, WA 98109, USA
| | - Chad R Weisbrod
- Department of Chemistry and Department of Genome Sciences, University of Washington, Seattle, WA 98109, USA
| | - Sayaka Shibata
- Department of Chemistry and Department of Genome Sciences, University of Washington, Seattle, WA 98109, USA
| | - Chunxiang Zheng
- Department of Chemistry and Department of Genome Sciences, University of Washington, Seattle, WA 98109, USA
| | - James E Bruce
- Department of Chemistry and Department of Genome Sciences, University of Washington, Seattle, WA 98109, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| |
Collapse
|
11
|
Stoichiometry and compositional plasticity of the yeast nuclear pore complex revealed by quantitative fluorescence microscopy. Proc Natl Acad Sci U S A 2018; 115:E3969-E3977. [PMID: 29632211 DOI: 10.1073/pnas.1719398115] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The nuclear pore complex (NPC) is an eightfold symmetrical channel providing selective transport of biomolecules across the nuclear envelope. Each NPC consists of ∼30 different nuclear pore proteins (Nups) all present in multiple copies per NPC. Significant progress has recently been made in the characterization of the vertebrate NPC structure. However, because of the estimated size differences between the vertebrate and yeast NPC, it has been unclear whether the NPC architecture is conserved between species. Here, we have developed a quantitative image analysis pipeline, termed nuclear rim intensity measurement (NuRIM), to precisely determine copy numbers for almost all Nups within native NPCs of budding yeast cells. Our analysis demonstrates that the majority of yeast Nups are present at most in 16 copies per NPC. This reveals a dramatic difference to the stoichiometry determined for the human NPC, suggesting that despite a high degree of individual Nup conservation, the yeast and human NPC architecture is significantly different. Furthermore, using NuRIM, we examined the effects of mutations on NPC stoichiometry. We demonstrate for two paralog pairs of key scaffold Nups, Nup170/Nup157 and Nup192/Nup188, that their altered expression leads to significant changes in the NPC stoichiometry inducing either voids in the NPC structure or substitution of one paralog by the other. Thus, our results not only provide accurate stoichiometry information for the intact yeast NPC but also reveal an intriguing compositional plasticity of the NPC architecture, which may explain how differences in NPC composition could arise in the course of evolution.
Collapse
|
12
|
Abstract
A new Gaussian mixture model (GMM) has been developed for better representations of both atomic models and electron microscopy 3D density maps. The standard GMM algorithm employs an EM algorithm to determine the parameters. It accepted a set of 3D points with weights, corresponding to voxel or atomic centers. Although the standard algorithm worked reasonably well; however, it had three problems. First, it ignored the size (voxel width or atomic radius) of the input, and thus it could lead to a GMM with a smaller spread than the input. Second, the algorithm had a singularity problem, as it sometimes stopped the iterative procedure due to a Gaussian function with almost zero variance. Third, a map with a large number of voxels required a long computation time for conversion to a GMM. To solve these problems, we have introduced a Gaussian-input GMM algorithm, which considers the input atoms or voxels as a set of Gaussian functions. The standard EM algorithm of GMM was extended to optimize the new GMM. The new GMM has identical radius of gyration to the input, and does not suddenly stop due to the singularity problem. For fast computation, we have introduced a down-sampled Gaussian functions (DSG) by merging neighboring voxels into an anisotropic Gaussian function. It provides a GMM with thousands of Gaussian functions in a short computation time. We also have introduced a DSG-input GMM: the Gaussian-input GMM with the DSG as the input. This new algorithm is much faster than the standard algorithm.
Collapse
Affiliation(s)
- Takeshi Kawabata
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan.
| |
Collapse
|
13
|
Rigid-Body Fitting of Atomic Models on 3D Density Maps of Electron Microscopy. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1105:219-235. [PMID: 30617832 DOI: 10.1007/978-981-13-2200-6_14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Cryo electron microscopy has revolutionarily evolved for the determination of the 3D structure of macromolecular complexes. The modeling procedures on the 3D density maps of electron microscopy are roughly classified into three categories: fitting, de novo modeling and refinement. The registered atomic models from the maps have mostly been hand-built and auto-refined. Several programs aiming at automatic modeling have also been developed using various kinds of molecular representations. Among these three classes of the modeling procedures, the rigid body fitting is reviewed here, because it is the most basic modeling process applied before the other steps. The fitting problems are classified as the fittings of single subunit or multiple subunits, and the fittings on global or local parts of maps. A higher resolution map enables more local fitting. Various molecular representations have been employed in the fitting programs. A point and digital image models are generally used to represent molecules, but new representations, such as the Gaussian mixture model, have been applied recently.
Collapse
|
14
|
Abstract
The atomic structures of protein complexes can provide useful information for drug design, protein engineering, systems biology, and understanding pathology. Obtaining this information experimentally can be challenging. However, if the structures of the subunits are known, then it is often possible to model the complex computationally. This chapter provide practical guidelines for docking proteins using the SwarmDock flexible protein-protein docking method, providing an overview of the factors that need to be considered when deciding whether docking is likely to be successful, the preparation of structural input, generation of docked poses, analysis and ranking of docked poses, and the validation of models using external data.
Collapse
Affiliation(s)
- Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK.
| | | | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
| |
Collapse
|
15
|
D'Imprima E, Salzer R, Bhaskara RM, Sánchez R, Rose I, Kirchner L, Hummer G, Kühlbrandt W, Vonck J, Averhoff B. Cryo-EM structure of the bifunctional secretin complex of Thermus thermophilus. eLife 2017; 6. [PMID: 29280731 PMCID: PMC5745081 DOI: 10.7554/elife.30483] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 12/13/2017] [Indexed: 11/13/2022] Open
Abstract
Secretins form multimeric channels across the outer membrane of Gram-negative bacteria that mediate the import or export of substrates and/or extrusion of type IV pili. The secretin complex of Thermus thermophilus is an oligomer of the 757-residue PilQ protein, essential for DNA uptake and pilus extrusion. Here, we present the cryo-EM structure of this bifunctional complex at a resolution of ~7 Å using a new reconstruction protocol. Thirteen protomers form a large periplasmic domain of six stacked rings and a secretin domain in the outer membrane. A homology model of the PilQ protein was fitted into the cryo-EM map. A crown-like structure outside the outer membrane capping the secretin was found not to be part of PilQ. Mutations in the secretin domain disrupted the crown and abolished DNA uptake, suggesting a central role of the crown in natural transformation.
Collapse
Affiliation(s)
- Edoardo D'Imprima
- Department of Structural Biology, Max Planck Institute of Biophysics, Frankfurt, Germany
| | - Ralf Salzer
- Molecular Microbiology and Bioenergetics, Institute of Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Ramachandra M Bhaskara
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, Frankfurt, Germany
| | - Ricardo Sánchez
- Sofja Kovalevskaja Group, Max Planck Institute of Biophysics, Frankfurt, Germany
| | - Ilona Rose
- Molecular Microbiology and Bioenergetics, Institute of Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Lennart Kirchner
- Molecular Microbiology and Bioenergetics, Institute of Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Gerhard Hummer
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, Frankfurt, Germany.,Institute of Biophysics, Goethe University Frankfurt, Frankfurt, Germany
| | - Werner Kühlbrandt
- Department of Structural Biology, Max Planck Institute of Biophysics, Frankfurt, Germany
| | - Janet Vonck
- Department of Structural Biology, Max Planck Institute of Biophysics, Frankfurt, Germany
| | - Beate Averhoff
- Molecular Microbiology and Bioenergetics, Institute of Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany
| |
Collapse
|
16
|
Webb B, Viswanath S, Bonomi M, Pellarin R, Greenberg CH, Saltzberg D, Sali A. Integrative structure modeling with the Integrative Modeling Platform. Protein Sci 2017; 27:245-258. [PMID: 28960548 DOI: 10.1002/pro.3311] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Revised: 09/23/2017] [Accepted: 09/25/2017] [Indexed: 11/06/2022]
Abstract
Building models of a biological system that are consistent with the myriad data available is one of the key challenges in biology. Modeling the structure and dynamics of macromolecular assemblies, for example, can give insights into how biological systems work, evolved, might be controlled, and even designed. Integrative structure modeling casts the building of structural models as a computational optimization problem, for which information about the assembly is encoded into a scoring function that evaluates candidate models. Here, we describe our open source software suite for integrative structure modeling, Integrative Modeling Platform (https://integrativemodeling.org), and demonstrate its use.
Collapse
Affiliation(s)
- Benjamin Webb
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | - Shruthi Viswanath
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | | | - Riccardo Pellarin
- Structural Bioinformatics Unit, Institut Pasteur, CNRS UMR 3528, Paris, France
| | - Charles H Greenberg
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | - Daniel Saltzberg
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| | - Andrej Sali
- California Institute for Quantitative Biosciences, University of California, San Francisco, California, 94158
| |
Collapse
|
17
|
Computational modeling of protein assemblies. Curr Opin Struct Biol 2017; 44:179-189. [DOI: 10.1016/j.sbi.2017.04.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Revised: 04/07/2017] [Accepted: 04/11/2017] [Indexed: 01/18/2023]
|
18
|
Tamò G, Maesani A, Träger S, Degiacomi MT, Floreano D, Dal Peraro M. Disentangling constraints using viability evolution principles in integrative modeling of macromolecular assemblies. Sci Rep 2017; 7:235. [PMID: 28331186 PMCID: PMC5427971 DOI: 10.1038/s41598-017-00266-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Accepted: 02/14/2017] [Indexed: 11/22/2022] Open
Abstract
Predicting the structure of large molecular assemblies remains a challenging task in structural biology when using integrative modeling approaches. One of the main issues stems from the treatment of heterogeneous experimental data used to predict the architecture of native complexes. We propose a new method, applied here for the first time to a set of symmetrical complexes, based on evolutionary computation that treats every available experimental input independently, bypassing the need to balance weight components assigned to aggregated fitness functions during optimization.
Collapse
Affiliation(s)
- Giorgio Tamò
- Laboratory for Biomolecular Modeling, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland.,Swiss Institute of Bioinformatics (SIB), Lausanne, CH-1015, Switzerland
| | - Andrea Maesani
- Laboratory of Intelligent Systems, Institute of Microengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
| | - Sylvain Träger
- Laboratory for Biomolecular Modeling, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland.,Swiss Institute of Bioinformatics (SIB), Lausanne, CH-1015, Switzerland
| | - Matteo T Degiacomi
- Chemistry Research Laboratory, Department of Chemistry, University of Oxford, Oxford, UK
| | - Dario Floreano
- Laboratory of Intelligent Systems, Institute of Microengineering, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland.
| | - Matteo Dal Peraro
- Laboratory for Biomolecular Modeling, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland. .,Swiss Institute of Bioinformatics (SIB), Lausanne, CH-1015, Switzerland.
| |
Collapse
|
19
|
Abstract
Genome sequencing projects have resulted in a rapid increase in the number of known protein sequences. In contrast, only about one-hundredth of these sequences have been characterized at atomic resolution using experimental structure determination methods. Computational protein structure modeling techniques have the potential to bridge this sequence-structure gap. In the following chapter, we present an example that illustrates the use of MODELLER to construct a comparative model for a protein with unknown structure. Automation of a similar protocol has resulted in models of useful accuracy for domains in more than half of all known protein sequences.
Collapse
Affiliation(s)
- Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, CA, 94143, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, CA, 94143, USA.
| |
Collapse
|
20
|
Vreven T, Pierce BG, Borrman TM, Weng Z. Performance of ZDOCK and IRAD in CAPRI rounds 28-34. Proteins 2016; 85:408-416. [PMID: 27718275 DOI: 10.1002/prot.25186] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Revised: 09/20/2016] [Accepted: 09/29/2016] [Indexed: 11/11/2022]
Abstract
We report the performance of our protein-protein docking pipeline, including the ZDOCK rigid-body docking algorithm, on 19 targets in CAPRI rounds 28-34. Following the docking step, we reranked the ZDOCK predictions using the IRAD scoring function, pruned redundant predictions, performed energy landscape analysis, and utilized our interface prediction approach RCF. In addition, we applied constraints to the search space based on biological information that we culled from the literature, which increased the chance of making a correct prediction. For all but two targets we were able to find and apply biological information and we found the information to be highly accurate, indicating that effective incorporation of biological information is an important component for protein-protein docking. Proteins 2017; 85:408-416. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| | - Brian G Pierce
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| | - Tyler M Borrman
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| |
Collapse
|
21
|
Kuzu G, Keskin O, Nussinov R, Gursoy A. PRISM-EM: template interface-based modelling of multi-protein complexes guided by cryo-electron microscopy density maps. Acta Crystallogr D Struct Biol 2016; 72:1137-1148. [PMID: 27710935 PMCID: PMC5053140 DOI: 10.1107/s2059798316013541] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 08/23/2016] [Indexed: 12/29/2022] Open
Abstract
The structures of protein assemblies are important for elucidating cellular processes at the molecular level. Three-dimensional electron microscopy (3DEM) is a powerful method to identify the structures of assemblies, especially those that are challenging to study by crystallography. Here, a new approach, PRISM-EM, is reported to computationally generate plausible structural models using a procedure that combines crystallographic structures and density maps obtained from 3DEM. The predictions are validated against seven available structurally different crystallographic complexes. The models display mean deviations in the backbone of <5 Å. PRISM-EM was further tested on different benchmark sets; the accuracy was evaluated with respect to the structure of the complex, and the correlation with EM density maps and interface predictions were evaluated and compared with those obtained using other methods. PRISM-EM was then used to predict the structure of the ternary complex of the HIV-1 envelope glycoprotein trimer, the ligand CD4 and the neutralizing protein m36.
Collapse
Affiliation(s)
- Guray Kuzu
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ozlem Keskin
- Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, 34450 Istanbul, Turkey
- Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research Inc., National Cancer Institute, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Attila Gursoy
- Computer Engineering, Koc University, 34450 Istanbul, Turkey
| |
Collapse
|
22
|
Im W, Liang J, Olson A, Zhou HX, Vajda S, Vakser IA. Challenges in structural approaches to cell modeling. J Mol Biol 2016; 428:2943-64. [PMID: 27255863 PMCID: PMC4976022 DOI: 10.1016/j.jmb.2016.05.024] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2016] [Revised: 05/19/2016] [Accepted: 05/24/2016] [Indexed: 11/17/2022]
Abstract
Computational modeling is essential for structural characterization of biomolecular mechanisms across the broad spectrum of scales. Adequate understanding of biomolecular mechanisms inherently involves our ability to model them. Structural modeling of individual biomolecules and their interactions has been rapidly progressing. However, in terms of the broader picture, the focus is shifting toward larger systems, up to the level of a cell. Such modeling involves a more dynamic and realistic representation of the interactomes in vivo, in a crowded cellular environment, as well as membranes and membrane proteins, and other cellular components. Structural modeling of a cell complements computational approaches to cellular mechanisms based on differential equations, graph models, and other techniques to model biological networks, imaging data, etc. Structural modeling along with other computational and experimental approaches will provide a fundamental understanding of life at the molecular level and lead to important applications to biology and medicine. A cross section of diverse approaches presented in this review illustrates the developing shift from the structural modeling of individual molecules to that of cell biology. Studies in several related areas are covered: biological networks; automated construction of three-dimensional cell models using experimental data; modeling of protein complexes; prediction of non-specific and transient protein interactions; thermodynamic and kinetic effects of crowding; cellular membrane modeling; and modeling of chromosomes. The review presents an expert opinion on the current state-of-the-art in these various aspects of structural modeling in cellular biology, and the prospects of future developments in this emerging field.
Collapse
Affiliation(s)
- Wonpil Im
- Center for Computational Biology and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66047, United States.
| | - Jie Liang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, United States.
| | - Arthur Olson
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, United States.
| | - Huan-Xiang Zhou
- Department of Physics and Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32306, United States.
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, United States.
| | - Ilya A Vakser
- Center for Computational Biology and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66047, United States.
| |
Collapse
|
23
|
Faini M, Stengel F, Aebersold R. The Evolving Contribution of Mass Spectrometry to Integrative Structural Biology. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2016; 27:966-974. [PMID: 27056566 PMCID: PMC4867889 DOI: 10.1007/s13361-016-1382-4] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2016] [Revised: 03/09/2016] [Accepted: 03/10/2016] [Indexed: 06/05/2023]
Abstract
Protein complexes are key catalysts and regulators for the majority of cellular processes. Unveiling their assembly and structure is essential to understanding their function and mechanism of action. Although conventional structural techniques such as X-ray crystallography and NMR have solved the structure of important protein complexes, they cannot consistently deal with dynamic and heterogeneous assemblies, limiting their applications to small scale experiments. A novel methodological paradigm, integrative structural biology, aims at overcoming such limitations by combining complementary data sources into a comprehensive structural model. Recent applications have shown that a range of mass spectrometry (MS) techniques are able to generate interaction and spatial restraints (cross-linking MS) information on native complexes or to study the stoichiometry and connectivity of entire assemblies (native MS) rapidly, reliably, and from small amounts of substrate. Although these techniques by themselves do not solve structures, they do provide invaluable structural information and are thus ideally suited to contribute to integrative modeling efforts. The group of Brian Chait has made seminal contributions in the use of mass spectrometric techniques to study protein complexes. In this perspective, we honor the contributions of the Chait group and discuss concepts and milestones of integrative structural biology. We also review recent examples of integration of structural MS techniques with an emphasis on cross-linking MS. We then speculate on future MS applications that would unravel the dynamic nature of protein complexes upon diverse cellular states. Graphical Abstract ᅟ.
Collapse
Affiliation(s)
- Marco Faini
- Department of Biology, Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland
| | - Florian Stengel
- Department of Biology, University of Konstanz, 78457, Konstanz, Germany
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland.
- Faculty of Science, University of Zürich, Zürich, Switzerland.
| |
Collapse
|
24
|
de Vries SJ, Chauvot de Beauchêne I, Schindler CEM, Zacharias M. Cryo-EM Data Are Superior to Contact and Interface Information in Integrative Modeling. Biophys J 2016; 110:785-97. [PMID: 26846888 DOI: 10.1016/j.bpj.2015.12.038] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Revised: 11/18/2015] [Accepted: 12/14/2015] [Indexed: 12/29/2022] Open
Abstract
Protein-protein interactions carry out a large variety of essential cellular processes. Cryo-electron microscopy (cryo-EM) is a powerful technique for the modeling of protein-protein interactions at a wide range of resolutions, and recent developments have caused a revolution in the field. At low resolution, cryo-EM maps can drive integrative modeling of the interaction, assembling existing structures into the map. Other experimental techniques can provide information on the interface or on the contacts between the monomers in the complex. This inevitably raises the question regarding which type of data is best suited to drive integrative modeling approaches. Systematic comparison of the prediction accuracy and specificity of the different integrative modeling paradigms is unavailable to date. Here, we compare EM-driven, interface-driven, and contact-driven integrative modeling paradigms. Models were generated for the protein docking benchmark using the ATTRACT docking engine and evaluated using the CAPRI two-star criterion. At 20 Å resolution, EM-driven modeling achieved a success rate of 100%, outperforming the other paradigms even with perfect interface and contact information. Therefore, even very low resolution cryo-EM data is superior in predicting heterodimeric and heterotrimeric protein assemblies. Our study demonstrates that a force field is not necessary, cryo-EM data alone is sufficient to accurately guide the monomers into place. The resulting rigid models successfully identify regions of conformational change, opening up perspectives for targeted flexible remodeling.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- Physik-Department T38, Technische Universität München, Garching, Germany.
| | | | - Christina E M Schindler
- Physik-Department T38, Technische Universität München, Garching, Germany; Center for Integrated Protein Science Munich (CIPSM) at the Physics Department, Technische Universität München, Garching, Germany
| | - Martin Zacharias
- Physik-Department T38, Technische Universität München, Garching, Germany; Center for Integrated Protein Science Munich (CIPSM) at the Physics Department, Technische Universität München, Garching, Germany
| |
Collapse
|
25
|
Segura J, Sanchez-Garcia R, Tabas-Madrid D, Cuenca-Alba J, Sorzano COS, Carazo JM. 3DIANA: 3D Domain Interaction Analysis: A Toolbox for Quaternary Structure Modeling. Biophys J 2016; 110:766-75. [PMID: 26772592 PMCID: PMC4775853 DOI: 10.1016/j.bpj.2015.11.3519] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Revised: 11/27/2015] [Accepted: 11/30/2015] [Indexed: 11/19/2022] Open
Abstract
Electron microscopy (EM) is experiencing a revolution with the advent of a new generation of Direct Electron Detectors, enabling a broad range of large and flexible structures to be resolved well below 1 nm resolution. Although EM techniques are evolving to the point of directly obtaining structural data at near-atomic resolution, for many molecules the attainable resolution might not be enough to propose high-resolution structural models. However, accessing information on atomic coordinates is a necessary step toward a deeper understanding of the molecular mechanisms that allow proteins to perform specific tasks. For that reason, methods for the integration of EM three-dimensional maps with x-ray and NMR structural data are being developed, a modeling task that is normally referred to as fitting, resulting in the so called hybrid models. In this work, we present a novel application—3DIANA—specially targeted to those cases in which the EM map resolution is medium or low and additional experimental structural information is scarce or even lacking. In this way, 3DIANA statistically evaluates proposed/potential contacts between protein domains, presents a complete catalog of both structurally resolved and predicted interacting regions involving these domains and, finally, suggests structural templates to model the interaction between them. The evaluation of the proposed interactions is computed with DIMERO, a new method that scores physical binding sites based on the topology of protein interaction networks, which has recently shown the capability to increase by 200% the number of domain-domain interactions predicted in interactomes as compared to previous approaches. The new application displays the information at a sequence and structural level and is accessible through a web browser or as a Chimera plugin at http://3diana.cnb.csic.es.
Collapse
Affiliation(s)
- Joan Segura
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain.
| | - Ruben Sanchez-Garcia
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Daniel Tabas-Madrid
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Jesus Cuenca-Alba
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Carlos Oscar S Sorzano
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| | - Jose Maria Carazo
- GN7, Spanish National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC)/Instruct Image Processing Center, Madrid, Spain
| |
Collapse
|
26
|
Pandurangan AP, Vasishtan D, Alber F, Topf M. γ-TEMPy: Simultaneous Fitting of Components in 3D-EM Maps of Their Assembly Using a Genetic Algorithm. Structure 2015; 23:2365-2376. [PMID: 26655474 PMCID: PMC4671957 DOI: 10.1016/j.str.2015.10.013] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Revised: 09/24/2015] [Accepted: 10/01/2015] [Indexed: 12/02/2022]
Abstract
We have developed a genetic algorithm for building macromolecular complexes using only a 3D-electron microscopy density map and the atomic structures of the relevant components. For efficient sampling the method uses map feature points calculated by vector quantization. The fitness function combines a mutual information score that quantifies the goodness of fit with a penalty score that helps to avoid clashes between components. Testing the method on ten assemblies (containing 3–8 protein components) and simulated density maps at 10, 15, and 20 Å resolution resulted in identification of the correct topology in 90%, 70%, and 60% of the cases, respectively. We further tested it on four assemblies with experimental maps at 7.2–23.5 Å resolution, showing the ability of the method to identify the correct topology in all cases. We have also demonstrated the importance of the map feature-point quality on assembly fitting in the lack of additional experimental information. γ-TEMPy uses a genetic algorithm to fit multiple components into 3D-EM density maps The fitness score is a combination of a Mutual Information score and a clash penalty Efficient sampling is aided by using map feature points from vector quantization Native topologies for assemblies containing up to eight components can be predicted
Collapse
Affiliation(s)
- Arun Prasad Pandurangan
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet Street, London WC1E 7HX, UK
| | - Daven Vasishtan
- Division of Structural Biology, Oxford Particle Imaging Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Frank Alber
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI413E, Los Angeles, CA 90089, USA
| | - Maya Topf
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet Street, London WC1E 7HX, UK.
| |
Collapse
|
27
|
Farabella I, Vasishtan D, Joseph AP, Pandurangan AP, Sahota H, Topf M. TEMPy: a Python library for assessment of three-dimensional electron microscopy density fits. J Appl Crystallogr 2015; 48:1314-1323. [PMID: 26306092 PMCID: PMC4520291 DOI: 10.1107/s1600576715010092] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 05/24/2015] [Indexed: 12/21/2022] Open
Abstract
TEMPy is an object-oriented Python library that provides the means to validate density fits in electron microscopy reconstructions. This article highlights several features of particular interest for this purpose and includes some customized examples. Three-dimensional electron microscopy is currently one of the most promising techniques used to study macromolecular assemblies. Rigid and flexible fitting of atomic models into density maps is often essential to gain further insights into the assemblies they represent. Currently, tools that facilitate the assessment of fitted atomic models and maps are needed. TEMPy (template and electron microscopy comparison using Python) is a toolkit designed for this purpose. The library includes a set of methods to assess density fits in intermediate-to-low resolution maps, both globally and locally. It also provides procedures for single-fit assessment, ensemble generation of fits, clustering, and multiple and consensus scoring, as well as plots and output files for visualization purposes to help the user in analysing rigid and flexible fits. The modular nature of TEMPy helps the integration of scoring and assessment of fits into large pipelines, making it a tool suitable for both novice and expert structural biologists.
Collapse
Affiliation(s)
- Irene Farabella
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| | - Daven Vasishtan
- Oxford Particle Imaging Centre, Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford , Oxford OX3 7BN, UK
| | - Agnel Praveen Joseph
- Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell , Didcot, Oxon OX11 0QX, UK
| | - Arun Prasad Pandurangan
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| | - Harpal Sahota
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| | - Maya Topf
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London , Malet street, London WC1E 7HX, UK
| |
Collapse
|
28
|
Integrative Modeling of Biomolecular Complexes: HADDOCKing with Cryo-Electron Microscopy Data. Structure 2015; 23:949-960. [DOI: 10.1016/j.str.2015.03.014] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Revised: 03/12/2015] [Accepted: 03/13/2015] [Indexed: 12/13/2022]
|
29
|
Janin J, Wodak SJ, Lensink MF, Velankar S. Assessing Structural Predictions of Protein-Protein Recognition: The CAPRI Experiment. REVIEWS IN COMPUTATIONAL CHEMISTRY 2015. [DOI: 10.1002/9781118889886.ch4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
30
|
Erzberger JP, Stengel F, Pellarin R, Zhang S, Schaefer T, Aylett CHS, Cimermančič P, Boehringer D, Sali A, Aebersold R, Ban N. Molecular architecture of the 40S⋅eIF1⋅eIF3 translation initiation complex. Cell 2015; 158:1123-1135. [PMID: 25171412 PMCID: PMC4151992 DOI: 10.1016/j.cell.2014.07.044] [Citation(s) in RCA: 165] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Revised: 05/29/2014] [Accepted: 07/29/2014] [Indexed: 11/25/2022]
Abstract
Eukaryotic translation initiation requires the recruitment of the large, multiprotein eIF3 complex to the 40S ribosomal subunit. We present X-ray structures of all major components of the minimal, six-subunit Saccharomyces cerevisiae eIF3 core. These structures, together with electron microscopy reconstructions, cross-linking coupled to mass spectrometry, and integrative structure modeling, allowed us to position and orient all eIF3 components on the 40S⋅eIF1 complex, revealing an extended, modular arrangement of eIF3 subunits. Yeast eIF3 engages 40S in a clamp-like manner, fully encircling 40S to position key initiation factors on opposite ends of the mRNA channel, providing a platform for the recruitment, assembly, and regulation of the translation initiation machinery. The structures of eIF3 components reported here also have implications for understanding the architecture of the mammalian 43S preinitiation complex and the complex of eIF3, 40S, and the hepatitis C internal ribosomal entry site RNA. X-ray structures of major yeast eIF3 components and subcomplexes Crosslinking coupled to mass-spectrometry analysis of 40S⋅eIF1⋅eIF3 complex Integrative modeling reveals architecture of 40S⋅eIF1⋅eIF3 complex
Collapse
Affiliation(s)
- Jan P Erzberger
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland.
| | - Florian Stengel
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Auguste-Piccard-Hof 1, 8093 Zurich, Switzerland
| | - Riccardo Pellarin
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, CA 94158-2330, USA
| | - Suyang Zhang
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland
| | - Tanja Schaefer
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland
| | - Christopher H S Aylett
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland
| | - Peter Cimermančič
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, CA 94158-2330, USA
| | - Daniel Boehringer
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, CA 94158-2330, USA
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Auguste-Piccard-Hof 1, 8093 Zurich, Switzerland; Faculty of Science, University of Zurich, 8006 Zurich, Switzerland
| | - Nenad Ban
- Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland.
| |
Collapse
|
31
|
Cassioli A, Bardiaux B, Bouvier G, Mucherino A, Alves R, Liberti L, Nilges M, Lavor C, Malliavin TE. An algorithm to enumerate all possible protein conformations verifying a set of distance constraints. BMC Bioinformatics 2015; 16:23. [PMID: 25627244 PMCID: PMC4384350 DOI: 10.1186/s12859-015-0451-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 01/05/2015] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND The determination of protein structures satisfying distance constraints is an important problem in structural biology. Whereas the most common method currently employed is simulated annealing, there have been other methods previously proposed in the literature. Most of them, however, are designed to find one solution only. RESULTS In order to explore exhaustively the feasible conformational space, we propose here an interval Branch-and-Prune algorithm (iBP) to solve the Distance Geometry Problem (DGP) associated to protein structure determination. This algorithm is based on a discretization of the problem obtained by recursively constructing a search space having the structure of a tree, and by verifying whether the generated atomic positions are feasible or not by making use of pruning devices. The pruning devices used here are directly related to features of protein conformations. CONCLUSIONS We described the new algorithm iBP to generate protein conformations satisfying distance constraints, that would potentially allows a systematic exploration of the conformational space. The algorithm iBP has been applied on three α-helical peptides.
Collapse
Affiliation(s)
| | - Benjamin Bardiaux
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | - Guillaume Bouvier
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | | | - Rafael Alves
- LIX, Ecole Polytechnique, Palaiseau, 91128, France.
| | - Leo Liberti
- LIX, Ecole Polytechnique, Palaiseau, 91128, France. .,IBM TJ Watson Research Center, NY Yorktown Heights, 10598, USA.
| | - Michael Nilges
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | - Carlile Lavor
- University of Campinas (IMECC-UNICAMP), Campinas-SP, 13083-859, Brasil.
| | - Thérèse E Malliavin
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| |
Collapse
|
32
|
López-Blanco JR, Chacón P. Structural modeling from electron microscopy data. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014. [DOI: 10.1002/wcms.1199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| | - Pablo Chacón
- Department of Biological Physical Chemistry; Rocasolano Physical Chemistry Institute, CSIC; Madrid Spain
| |
Collapse
|
33
|
Algret R, Fernandez-Martinez J, Shi Y, Kim SJ, Pellarin R, Cimermancic P, Cochet E, Sali A, Chait BT, Rout MP, Dokudovskaya S. Molecular architecture and function of the SEA complex, a modulator of the TORC1 pathway. Mol Cell Proteomics 2014; 13:2855-70. [PMID: 25073740 DOI: 10.1074/mcp.m114.039388] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
The TORC1 signaling pathway plays a major role in the control of cell growth and response to stress. Here we demonstrate that the SEA complex physically interacts with TORC1 and is an important regulator of its activity. During nitrogen starvation, deletions of SEA complex components lead to Tor1 kinase delocalization, defects in autophagy, and vacuolar fragmentation. TORC1 inactivation, via nitrogen deprivation or rapamycin treatment, changes cellular levels of SEA complex members. We used affinity purification and chemical cross-linking to generate the data for an integrative structure modeling approach, which produced a well-defined molecular architecture of the SEA complex and showed that the SEA complex comprises two regions that are structurally and functionally distinct. The SEA complex emerges as a platform that can coordinate both structural and enzymatic activities necessary for the effective functioning of the TORC1 pathway.
Collapse
Affiliation(s)
- Romain Algret
- From the ‡CNRS UMR 8126, Université Paris-Sud 11, Institut Gustave Roussy, 114, rue Edouard Vaillant, 94805, Villejuif, France
| | - Javier Fernandez-Martinez
- §Laboratory of Cellular and Structural Biology, The Rockefeller University, 1230 York Avenue, New York, New York 10065
| | - Yi Shi
- ¶Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, 1230 York Avenue, New York, New York 10065
| | - Seung Joong Kim
- ‖Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, California 94158-2330
| | - Riccardo Pellarin
- ‖Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, California 94158-2330
| | - Peter Cimermancic
- ‖Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, California 94158-2330
| | - Emilie Cochet
- From the ‡CNRS UMR 8126, Université Paris-Sud 11, Institut Gustave Roussy, 114, rue Edouard Vaillant, 94805, Villejuif, France
| | - Andrej Sali
- ‖Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, UCSF MC 2552, Byers Hall Room 503B, 1700 4th Street, San Francisco, California 94158-2330
| | - Brian T Chait
- ¶Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, 1230 York Avenue, New York, New York 10065
| | - Michael P Rout
- §Laboratory of Cellular and Structural Biology, The Rockefeller University, 1230 York Avenue, New York, New York 10065
| | - Svetlana Dokudovskaya
- From the ‡CNRS UMR 8126, Université Paris-Sud 11, Institut Gustave Roussy, 114, rue Edouard Vaillant, 94805, Villejuif, France;
| |
Collapse
|
34
|
Jacobson MP, Kalyanaraman C, Zhao S, Tian B. Leveraging structure for enzyme function prediction: methods, opportunities, and challenges. Trends Biochem Sci 2014; 39:363-71. [PMID: 24998033 DOI: 10.1016/j.tibs.2014.05.006] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Revised: 05/26/2014] [Accepted: 05/29/2014] [Indexed: 02/06/2023]
Abstract
The rapid growth of the number of protein sequences that can be inferred from sequenced genomes presents challenges for function assignment, because only a small fraction (currently <1%) has been experimentally characterized. Bioinformatics tools are commonly used to predict functions of uncharacterized proteins. Recently, there has been significant progress in using protein structures as an additional source of information to infer aspects of enzyme function, which is the focus of this review. Successful application of these approaches has led to the identification of novel metabolites, enzyme activities, and biochemical pathways. We discuss opportunities to elucidate systematically protein domains of unknown function, orphan enzyme activities, dead-end metabolites, and pathways in secondary metabolism.
Collapse
Affiliation(s)
- Matthew P Jacobson
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94158, USA; California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA.
| | - Chakrapani Kalyanaraman
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94158, USA; California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Suwen Zhao
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94158, USA; California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Boxue Tian
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94158, USA; California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
35
|
Kuzu G, Keskin O, Nussinov R, Gursoy A. Modeling protein assemblies in the proteome. Mol Cell Proteomics 2014; 13:887-96. [PMID: 24445405 PMCID: PMC3945916 DOI: 10.1074/mcp.m113.031294] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Revised: 12/13/2013] [Indexed: 11/06/2022] Open
Abstract
Most (if not all) proteins function when associated in multimolecular assemblies. Attaining the structures of protein assemblies at the atomic scale is an important aim of structural biology. Experimentally, structures are increasingly available, and computations can help bridge the resolution gap between high- and low-resolution scales. Existing computational methods have made substantial progress toward this aim; however, current approaches are still limited. Some involve manual adjustment of experimental data; some are automated docking methods, which are computationally expensive and not applicable to large-scale proteome studies; and still others exploit the symmetry of the complexes and thus are not applicable to nonsymmetrical complexes. Our study aims to take steps toward overcoming these limitations. We have developed a strategy for the construction of protein assemblies computationally based on binary interactions predicted by a motif-based protein interaction prediction tool, PRISM (Protein Interactions by Structural Matching). Previously, we have shown its power in predicting pairwise interactions. Here we take a step toward multimolecular assemblies, reflecting the more prevalent cellular scenarios. With this method we are able to construct homo-/hetero-complexes and symmetric/asymmetric complexes without a limitation on the number of components. The method considers conformational changes and is applicable to large-scale studies. We also exploit electron microscopy density maps to select a solution from among the predictions. Here we present the method, illustrate its results, and highlight its current limitations.
Collapse
Affiliation(s)
- Guray Kuzu
- From the ‡Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| | - Ozlem Keskin
- From the ‡Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| | - Ruth Nussinov
- §Cancer and Inflammation Program, Leidos Biomedical Research, Inc., National Cancer Institute, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702
- ¶Sackler Institute of Molecular Medicine Department of Human Genetics and Molecular Medicine Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Attila Gursoy
- From the ‡Center for Computational Biology and Bioinformatics and College of Engineering, Koc University Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| |
Collapse
|
36
|
Webb B, Lasker K, Velázquez-Muriel J, Schneidman-Duhovny D, Pellarin R, Bonomi M, Greenberg C, Raveh B, Tjioe E, Russel D, Sali A. Modeling of proteins and their assemblies with the Integrative Modeling Platform. Methods Mol Biol 2014; 1091:277-95. [PMID: 24203340 DOI: 10.1007/978-1-62703-691-7_20] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
To understand the workings of the living cell, we need to characterize protein assemblies that constitute the cell (for example, the ribosome, 26S proteasome, and the nuclear pore complex). A reliable high-resolution structural characterization of these assemblies is frequently beyond the reach of current experimental methods, such as X-ray crystallography, NMR spectroscopy, electron microscopy, footprinting, chemical cross-linking, FRET spectroscopy, small angle X-ray scattering, and proteomics. However, the information garnered from different methods can be combined and used to build models of the assembly structures that are consistent with all of the available datasets, and therefore more accurate, precise, and complete. Here, we describe a protocol for this integration, whereby the information is converted to a set of spatial restraints and a variety of optimization procedures can be used to generate models that satisfy the restraints as well as possible. These generated models can then potentially inform about the precision and accuracy of structure determination, the accuracy of the input datasets, and further data generation. We also demonstrate the Integrative Modeling Platform (IMP) software, which provides the necessary computational framework to implement this protocol, and several applications for specific use cases.
Collapse
Affiliation(s)
- Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quanstitative Biosciences (QB3), University of California San Francisco, San Francisco, CA, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Abstract
Genome sequencing projects have resulted in a rapid increase in the number of known protein sequences. In contrast, only about one-hundredth of these sequences have been characterized at atomic resolution using experimental structure determination methods. Computational protein structure modeling techniques have the potential to bridge this sequence-structure gap. In this chapter, we present an example that illustrates the use of MODELLER to construct a comparative model for a protein with unknown structure. Automation of a similar protocol has resulted in models of useful accuracy for domains in more than half of all known protein sequences.
Collapse
Affiliation(s)
- Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | | |
Collapse
|
38
|
On the use of knowledge-based potentials for the evaluation of models of protein-protein, protein-DNA, and protein-RNA interactions. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2014; 94:77-120. [PMID: 24629186 DOI: 10.1016/b978-0-12-800168-4.00004-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Proteins are the bricks and mortar of cells, playing structural and functional roles. In order to perform their function, they interact with each other as well as with other biomolecules such as DNA or RNA. Therefore, to fathom the function of a protein, we require knowing its partners and the atomic details of its interactions (i.e., the structure of the complex). However, the amount of protein interactions with an experimentally determined three-dimensional structure is scarce. Therefore, computational techniques such as homology modeling are foremost to fill this gap. Protein interactions can be modeled using as templates the interactions of homologous proteins, if the structure of the complex is known, or using docking methods. In both approaches, the estimation of the quality of models is essential. There are several ways to address this problem. In this review, we focus on the use of knowledge-based potentials for the analysis of protein interactions. We describe the procedure to derive statistical potentials and split them into different energetic terms that can be used for different purposes. We extensively discuss the fields where knowledge-based potentials have been successfully applied to (1) model protein-protein, protein-DNA, and protein-RNA interactions and (2) predict binding sites (in the protein and in the DNA). Moreover, we provide ready-to-use resources for docking and benchmarking protein interactions.
Collapse
|
39
|
Esquivel-Rodríguez J, Kihara D. Computational methods for constructing protein structure models from 3D electron microscopy maps. J Struct Biol 2013; 184:93-102. [PMID: 23796504 DOI: 10.1016/j.jsb.2013.06.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/11/2013] [Accepted: 06/13/2013] [Indexed: 12/31/2022]
Abstract
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Collapse
Affiliation(s)
- Juan Esquivel-Rodríguez
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
40
|
Sun S, Xiang Y, Akahata W, Holdaway H, Pal P, Zhang X, Diamond MS, Nabel GJ, Rossmann MG. Structural analyses at pseudo atomic resolution of Chikungunya virus and antibodies show mechanisms of neutralization. eLife 2013; 2:e00435. [PMID: 23577234 PMCID: PMC3614025 DOI: 10.7554/elife.00435] [Citation(s) in RCA: 115] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2012] [Accepted: 02/18/2013] [Indexed: 01/07/2023] Open
Abstract
A 5.3 Å resolution, cryo-electron microscopy (cryoEM) map of Chikungunya virus-like particles (VLPs) has been interpreted using the previously published crystal structure of the Chikungunya E1-E2 glycoprotein heterodimer. The heterodimer structure was divided into domains to obtain a good fit to the cryoEM density. Differences in the T = 4 quasi-equivalent heterodimer components show their adaptation to different environments. The spikes on the icosahedral 3-fold axes and those in general positions are significantly different, possibly representing different phases during initial generation of fusogenic E1 trimers. CryoEM maps of neutralizing Fab fragments complexed with VLPs have been interpreted using the crystal structures of the Fab fragments and the VLP structure. Based on these analyses the CHK-152 antibody was shown to stabilize the viral surface, hindering the exposure of the fusion-loop, likely neutralizing infection by blocking fusion. The CHK-9, m10 and m242 antibodies surround the receptor-attachment site, probably inhibiting infection by blocking cell attachment. DOI:http://dx.doi.org/10.7554/eLife.00435.001.
Collapse
Affiliation(s)
- Siyang Sun
- Department of Biological Sciences, Purdue University, West Lafayette, United States
| | - Ye Xiang
- Department of Biological Sciences, Purdue University, West Lafayette, United States
| | - Wataru Akahata
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Heather Holdaway
- Department of Biological Sciences, Purdue University, West Lafayette, United States
| | - Pankaj Pal
- Departments of Medicine, Molecular Microbiology, Pathology & Immunology, Washington University School of Medicine, St Louis, United States
| | - Xinzheng Zhang
- Department of Biological Sciences, Purdue University, West Lafayette, United States
| | - Michael S Diamond
- Departments of Medicine, Molecular Microbiology, Pathology & Immunology, Washington University School of Medicine, St Louis, United States
| | - Gary J Nabel
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, United States
| | - Michael G Rossmann
- Department of Biological Sciences, Purdue University, West Lafayette, United States,For correspondence:
| |
Collapse
|
41
|
Karaca E, Bonvin AM. Advances in integrative modeling of biomolecular complexes. Methods 2013; 59:372-81. [DOI: 10.1016/j.ymeth.2012.12.004] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Revised: 11/30/2012] [Accepted: 12/14/2012] [Indexed: 11/25/2022] Open
|
42
|
Atomic modeling of cryo-electron microscopy reconstructions--joint refinement of model and imaging parameters. J Struct Biol 2013; 182:10-21. [PMID: 23376441 DOI: 10.1016/j.jsb.2013.01.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2012] [Revised: 12/20/2012] [Accepted: 01/11/2013] [Indexed: 11/22/2022]
Abstract
When refining the fit of component atomic structures into electron microscopic reconstructions, use of a resolution-dependent atomic density function makes it possible to jointly optimize the atomic model and imaging parameters of the microscope. Atomic density is calculated by one-dimensional Fourier transform of atomic form factors convoluted with a microscope envelope correction and a low-pass filter, allowing refinement of imaging parameters such as resolution, by optimizing the agreement of calculated and experimental maps. A similar approach allows refinement of atomic displacement parameters, providing indications of molecular flexibility even at low resolution. A modest improvement in atomic coordinates is possible following optimization of these additional parameters. Methods have been implemented in a Python program that can be used in stand-alone mode for rigid-group refinement, or embedded in other optimizers for flexible refinement with stereochemical restraints. The approach is demonstrated with refinements of virus and chaperonin structures at resolutions of 9 through 4.5 Å, representing regimes where rigid-group and fully flexible parameterizations are appropriate. Through comparisons to known crystal structures, flexible fitting by RSRef is shown to be an improvement relative to other methods and to generate models with all-atom rms accuracies of 1.5-2.5 Å at resolutions of 4.5-6 Å.
Collapse
|
43
|
Low-resolution structural modeling of protein interactome. Curr Opin Struct Biol 2013; 23:198-205. [PMID: 23294579 DOI: 10.1016/j.sbi.2012.12.003] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Accepted: 12/03/2012] [Indexed: 11/23/2022]
Abstract
Structural characterization of protein-protein interactions across the broad spectrum of scales is key to our understanding of life at the molecular level. Low-resolution approach to protein interactions is needed for modeling large interaction networks, given the significant level of uncertainties in large biomolecular systems and the high-throughput nature of the task. Since only a fraction of protein structures in interactome are determined experimentally, protein docking approaches are increasingly focusing on modeled proteins. Current rapid advancement of template-based modeling of protein-protein complexes is following a long standing trend in structure prediction of individual proteins. Protein-protein templates are already available for almost all interactions of structurally characterized proteins, and about one third of such templates are likely correct.
Collapse
|
44
|
Kulczyk AW, Akabayov B, Lee SJ, Bostina M, Berkowitz SA, Richardson CC. An interaction between DNA polymerase and helicase is essential for the high processivity of the bacteriophage T7 replisome. J Biol Chem 2012; 287:39050-60. [PMID: 22977246 DOI: 10.1074/jbc.m112.410647] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Synthesis of the leading DNA strand requires the coordinated activity of DNA polymerase and DNA helicase, whereas synthesis of the lagging strand involves interactions of these proteins with DNA primase. We present the first structural model of a bacteriophage T7 DNA helicase-DNA polymerase complex using a combination of small angle x-ray scattering, single-molecule, and biochemical methods. We propose that the protein-protein interface stabilizing the leading strand synthesis involves two distinct interactions: a stable binding of the helicase to the palm domain of the polymerase and an electrostatic binding of the carboxyl-terminal tail of the helicase to a basic patch on the polymerase. DNA primase facilitates binding of DNA helicase to ssDNA and contributes to formation of the DNA helicase-DNA polymerase complex by stabilizing DNA helicase.
Collapse
Affiliation(s)
- Arkadiusz W Kulczyk
- Department of Biological Chemistry and Molecular Pharmacology, Harvard University Medical School, Boston, Massachusetts 02115, USA
| | | | | | | | | | | |
Collapse
|
45
|
|
46
|
Esquivel-Rodríguez J, Kihara D. Fitting multimeric protein complexes into electron microscopy maps using 3D Zernike descriptors. J Phys Chem B 2012; 116:6854-61. [PMID: 22417139 PMCID: PMC3376205 DOI: 10.1021/jp212612t] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A novel computational method for fitting high-resolution structures of multiple proteins into a cryoelectron microscopy map is presented. The method named EMLZerD generates a pool of candidate multiple protein docking conformations of component proteins, which are later compared with a provided electron microscopy (EM) density map to select the ones that fit well into the EM map. The comparison of docking conformations and the EM map is performed using the 3D Zernike descriptor (3DZD), a mathematical series expansion of three-dimensional functions. The 3DZD provides a unified representation of the surface shape of multimeric protein complex models and EM maps, which allows a convenient, fast quantitative comparison of the three-dimensional structural data. Out of 19 multimeric complexes tested, near native complex structures with a root-mean-square deviation of less than 2.5 Å were obtained for 14 cases while medium range resolution structures with correct topology were computed for the additional 5 cases.
Collapse
Affiliation(s)
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
- Markey Center for Structural Biology, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
47
|
Sinitskiy AV, Saunders MG, Voth GA. Optimal number of coarse-grained sites in different components of large biomolecular complexes. J Phys Chem B 2012; 116:8363-74. [PMID: 22276676 DOI: 10.1021/jp2108895] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The computational study of large biomolecular complexes (molecular machines, cytoskeletal filaments, etc.) is a formidable challenge facing computational biophysics and biology. To achieve biologically relevant length and time scales, coarse-grained (CG) models of such complexes usually must be built and employed. One of the important early stages in this approach is to determine an optimal number of CG sites in different constituents of a complex. This work presents a systematic approach to this problem. First, a universal scaling law is derived and numerically corroborated for the intensity of the intrasite (intradomain) thermal fluctuations as a function of the number of CG sites. Second, this result is used for derivation of the criterion for the optimal number of CG sites in different parts of a large multibiomolecule complex. In the zeroth-order approximation, this approach validates the empirical rule of taking one CG site per fixed number of atoms or residues in each biomolecule, previously widely used for smaller systems (e.g., individual biomolecules). The first-order corrections to this rule are derived and numerically checked by the case studies of the Escherichia coli ribosome and Arp2/3 actin filament junction. In different ribosomal proteins, the optimal number of amino acids per CG site is shown to differ by a factor of 3.5, and an even wider spread may exist in other large biomolecular complexes. Therefore, the method proposed in this paper is valuable for the optimal construction of CG models of such complexes.
Collapse
Affiliation(s)
- Anton V Sinitskiy
- Department of Chemistry, Institute for Biophysical Dynamics, University of Chicago, Chicago, Illinois 60637, United States
| | | | | |
Collapse
|
48
|
Russel D, Lasker K, Webb B, Velázquez-Muriel J, Tjioe E, Schneidman-Duhovny D, Peterson B, Sali A. Putting the pieces together: integrative modeling platform software for structure determination of macromolecular assemblies. PLoS Biol 2012; 10:e1001244. [PMID: 22272186 PMCID: PMC3260315 DOI: 10.1371/journal.pbio.1001244] [Citation(s) in RCA: 386] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
A set of software tools for building and distributing models of macromolecular assemblies uses an integrative structure modeling approach, which casts the building of models as a computational optimization problem where information is encoded into a scoring function used to evaluate candidate models.
Collapse
Affiliation(s)
- Daniel Russel
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
| | - Keren Lasker
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
- Raymond and Beverly Sackler Faculty of Exact Sciences, Blavatnik School of Computer Science, Tel Aviv University, Tel-Aviv, Israel
| | - Ben Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
| | - Javier Velázquez-Muriel
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
| | - Elina Tjioe
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
| | - Dina Schneidman-Duhovny
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
| | - Bret Peterson
- Google, Mountain View, California, United States of America
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California, United States of America
- * E-mail:
| |
Collapse
|
49
|
Sacan A, Ekins S, Kortagere S. Applications and limitations of in silico models in drug discovery. Methods Mol Biol 2012; 910:87-124. [PMID: 22821594 DOI: 10.1007/978-1-61779-965-5_6] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Drug discovery in the late twentieth and early twenty-first century has witnessed a myriad of changes that were adopted to predict whether a compound is likely to be successful, or conversely enable identification of molecules with liabilities as early as possible. These changes include integration of in silico strategies for lead design and optimization that perform complementary roles to that of the traditional in vitro and in vivo approaches. The in silico models are facilitated by the availability of large datasets associated with high-throughput screening, bioinformatics algorithms to mine and annotate the data from a target perspective, and chemoinformatics methods to integrate chemistry methods into lead design process. This chapter highlights the applications of some of these methods and their limitations. We hope this serves as an introduction to in silico drug discovery.
Collapse
Affiliation(s)
- Ahmet Sacan
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, USA
| | | | | |
Collapse
|
50
|
Lasker K, Velázquez-Muriel JA, Webb BM, Yang Z, Ferrin TE, Sali A. Macromolecular assembly structures by comparative modeling and electron microscopy. Methods Mol Biol 2012; 857:331-350. [PMID: 22323229 DOI: 10.1007/978-1-61779-588-6_15] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Advances in electron microscopy allow for structure determination of large biological machines at increasingly higher resolutions. A key step in this process is fitting component structures into the electron microscopy-derived density map of their assembly. Comparative modeling can contribute by providing atomic models of the components, via fold assignment, sequence-structure alignment, model building, and model assessment. All four stages of comparative modeling can also benefit from consideration of the density map. In this chapter, we describe numerous types of modeling problems restrained by a density map and available protocols for finding solutions. In particular, we provide detailed instructions for density map-guided modeling using the Integrative Modeling Platform (IMP), MODELLER, and UCSF Chimera.
Collapse
Affiliation(s)
- Keren Lasker
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.
| | | | | | | | | | | |
Collapse
|