51
|
Joseph AP, Olek M, Malhotra S, Zhang P, Cowtan K, Burnley T, Winn MD. Atomic model validation using the CCP-EM software suite. Acta Crystallogr D Struct Biol 2022; 78:152-161. [PMID: 35102881 PMCID: PMC8805302 DOI: 10.1107/s205979832101278x] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 12/01/2021] [Indexed: 12/02/2022] Open
Abstract
Recently, there has been a dramatic improvement in the quality and quantity of data derived using cryogenic electron microscopy (cryo-EM). This is also associated with a large increase in the number of atomic models built. Although the best resolutions that are achievable are improving, often the local resolution is variable, and a significant majority of data are still resolved at resolutions worse than 3 Å. Model building and refinement is often challenging at these resolutions, and hence atomic model validation becomes even more crucial to identify less reliable regions of the model. Here, a graphical user interface for atomic model validation, implemented in the CCP-EM software suite, is presented. It is aimed to develop this into a platform where users can access multiple complementary validation metrics that work across a range of resolutions and obtain a summary of evaluations. Based on the validation estimates from atomic models associated with cryo-EM structures from SARS-CoV-2, it was observed that models typically favor adopting the most common conformations over fitting the observations when compared with the model agreement with data. At low resolutions, the stereochemical quality may be favored over data fit, but care should be taken to ensure that the model agrees with the data in terms of resolvable features. It is demonstrated that further re-refinement can lead to improvement of the agreement with data without the loss of geometric quality. This also highlights the need for improved resolution-dependent weight optimization in model refinement and an effective test for overfitting that would help to guide the refinement process.
Collapse
Affiliation(s)
- Agnel Praveen Joseph
- Scientific Computing Department, Science and Technology Facilities Council, Didcot, United Kingdom
| | - Mateusz Olek
- Department of Chemistry, University of York, York, United Kingdom
- Electron BioImaging Center, Diamond Light Source, Rutherford Appleton Laboratory, Didcot, United Kingdom
| | - Sony Malhotra
- Scientific Computing Department, Science and Technology Facilities Council, Didcot, United Kingdom
| | - Peijun Zhang
- Electron BioImaging Center, Diamond Light Source, Rutherford Appleton Laboratory, Didcot, United Kingdom
| | - Kevin Cowtan
- Department of Chemistry, University of York, York, United Kingdom
| | - Tom Burnley
- Scientific Computing Department, Science and Technology Facilities Council, Didcot, United Kingdom
| | - Martyn D. Winn
- Scientific Computing Department, Science and Technology Facilities Council, Didcot, United Kingdom
| |
Collapse
|
52
|
Laroussi M, Bekeschus S, Keidar M, Bogaerts A, Fridman A, Lu XP, Ostrikov KK, Hori M, Stapelmann K, Miller V, Reuter S, Laux C, Mesbah A, Walsh J, Jiang C, Thagard SM, Tanaka H, Liu DW, Yan D, Yusupov M. Low Temperature Plasma for Biology, Hygiene, and Medicine: Perspective and Roadmap. IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES 2022. [DOI: 10.1109/trpms.2021.3135118] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
53
|
Saibil HR. Cryo-EM in molecular and cellular biology. Mol Cell 2022; 82:274-284. [DOI: 10.1016/j.molcel.2021.12.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 12/13/2021] [Accepted: 12/14/2021] [Indexed: 11/16/2022]
|
54
|
Chojnowski G, Simpkin AJ, Leonardo DA, Seifert-Davila W, Vivas-Ruiz DE, Keegan RM, Rigden DJ. findMySequence: a neural-network-based approach for identification of unknown proteins in X-ray crystallography and cryo-EM. IUCRJ 2022; 9:86-97. [PMID: 35059213 PMCID: PMC8733886 DOI: 10.1107/s2052252521011088] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 10/22/2021] [Indexed: 05/15/2023]
Abstract
Although experimental protein-structure determination usually targets known proteins, chains of unknown sequence are often encountered. They can be purified from natural sources, appear as an unexpected fragment of a well characterized protein or appear as a contaminant. Regardless of the source of the problem, the unknown protein always requires characterization. Here, an automated pipeline is presented for the identification of protein sequences from cryo-EM reconstructions and crystallographic data. The method's application to characterize the crystal structure of an unknown protein purified from a snake venom is presented. It is also shown that the approach can be successfully applied to the identification of protein sequences and validation of sequence assignments in cryo-EM protein structures.
Collapse
Affiliation(s)
- Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
- Correspondence e-mail:
| | - Adam J. Simpkin
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Diego A. Leonardo
- São Carlos Institute of Physics, University of São Paulo, Avenida João Dagnone 1100, São Carlos, SP 13563-120, Brazil
| | | | - Dan E. Vivas-Ruiz
- Laboratorio de Biología Molecular, Facultad de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Avenida Venezuela Cdra 34 S/N, Ciudad Universitaria, Lima, Peru
| | - Ronan M. Keegan
- Rutherford Appleton Laboratory, Research Complex at Harwell, UKRI-STFC, Didcot OX11 0FA, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| |
Collapse
|
55
|
Cragnolini T, Kryshtafovych A, Topf M. Cryo-EM targets in CASP14. Proteins 2021; 89:1949-1958. [PMID: 34398978 PMCID: PMC8630773 DOI: 10.1002/prot.26216] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 07/27/2021] [Accepted: 08/06/2021] [Indexed: 11/22/2022]
Abstract
Structures of seven CASP14 targets were determined using cryo-electron microscopy (cryo-EM) technique with resolution between 2.1 and 3.8 Å. We provide an evaluation of the submitted models versus the experimental data (cryo-EM density maps) and experimental reference structures built into the maps. The accuracy of models is measured in terms of coordinate-to-density and coordinate-to-coordinate fit. A-posteriori refinement of the most accurate models in their corresponding cryo-EM density resulted in structures that are close to the reference structure, including some regions with better fit to the density. Regions that were found to be less "refineable" correlate well with regions of high diversity between the CASP models and low goodness-of-fit to density in the reference structure.
Collapse
Affiliation(s)
- Tristan Cragnolini
- Institute of Structural and Molecular Biology, Birkbeck, University College London, London, UK
| | | | - Maya Topf
- Center for Structural Systems Biology, Leibniz-Institut für Experimentelle Virologie and Universitätsklinikum Hamburg-Eppendorf (UKE), Hamburg, Germany
| |
Collapse
|
56
|
Gao M, Lund-Andersen P, Morehead A, Mahmud S, Chen C, Chen X, Giri N, Roy RS, Quadir F, Effler TC, Prout R, Abraham S, Elwasif W, Haas NQ, Skolnick J, Cheng J, Sedova A. High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function. WORKSHOP ON MACHINE LEARNING IN HPC ENVIRONMENTS. WORKSHOP ON MACHINE LEARNING IN HPC ENVIRONMENTS 2021; 2021:46-57. [PMID: 35112110 PMCID: PMC8802329 DOI: 10.1109/mlhpc54614.2021.00010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Computational biology is one of many scientific disciplines ripe for innovation and acceleration with the advent of high-performance computing (HPC). In recent years, the field of machine learning has also seen significant benefits from adopting HPC practices. In this work, we present a novel HPC pipeline that incorporates various machine-learning approaches for structure-based functional annotation of proteins on the scale of whole genomes. Our pipeline makes extensive use of deep learning and provides computational insights into best practices for training advanced deep-learning models for high-throughput data such as proteomics data. We showcase methodologies our pipeline currently supports and detail future tasks for our pipeline to envelop, including large-scale sequence comparison using SAdLSA and prediction of protein tertiary structures using AlphaFold2.
Collapse
Affiliation(s)
- Mu Gao
- Georgia Institute of Technology, Atlanta, GA
| | | | | | | | - Chen Chen
- University of Missouri, Columbia, MO
| | - Xiao Chen
- University of Missouri, Columbia, MO
| | | | | | | | | | - Ryan Prout
- Oak Ridge National Laboratory, Oak Ridge, TN
| | | | | | | | | | | | - Ada Sedova
- Oak Ridge National Laboratory, Oak Ridge, TN
| |
Collapse
|
57
|
Zhang J, Trejo M, Bradford SD, Lei L, Kong W. Electron Diffraction of Ionic Argon Nanoclusters Embedded in Superfluid Helium Droplets. J Phys Chem Lett 2021; 12:9644-9650. [PMID: 34586826 PMCID: PMC8550877 DOI: 10.1021/acs.jpclett.1c02712] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
We report electron diffraction of cationic argon nanoclusters embedded in superfluid helium droplets. Superfluid helium droplets are first doped with neutral argon atoms to form nanoclusters, and then the doped droplets are ionized by electrons. The much lower ionization energy of argon ensures that the positive charge resides on the Ar nanocluster. Using different stagnation temperatures and therefore droplets with different sizes, we have been able to preferentially form a small ionic cluster containing 2-4 Ar atoms and a larger cluster containing 7-11 atoms. The fitting results of the diffraction profiles agree with structures reported from theoretical calculations, containing a cationic trimer core with the remaining atoms largely neutral. This work testifies to the feasibility of performing electron diffraction from ionic species embedded in superfluid helium droplets, dispelling the concern over the particle density in the diffraction region. However, the large number of neutral helium atoms surrounding the cationic nanoclusters poses a challenge for the detection of the helium solvation layer, and the detection of which awaits further technological improvements.
Collapse
Affiliation(s)
| | | | | | | | - Wei Kong
- Corresponding author, , 541-737-6714
| |
Collapse
|
58
|
Shekhar M, Terashi G, Gupta C, Sarkar D, Debussche G, Sisco NJ, Nguyen J, Mondal A, Vant J, Fromme P, Van Horn WD, Tajkhorshid E, Kihara D, Dill K, Perez A, Singharoy A. CryoFold: determining protein structures and data-guided ensembles from cryo-EM density maps. MATTER 2021; 4:3195-3216. [PMID: 35874311 PMCID: PMC9302471 DOI: 10.1016/j.matt.2021.09.004] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Cryo-electron microscopy (EM) requires molecular modeling to refine structural details from data. Ensemble models arrive at low free-energy molecular structures, but are computationally expensive and limited to resolving only small proteins that cannot be resolved by cryo-EM. Here, we introduce CryoFold - a pipeline of molecular dynamics simulations that determines ensembles of protein structures directly from sequence by integrating density data of varying sparsity at 3-5 Å resolution with coarse-grained topological knowledge of the protein folds. We present six examples showing its broad applicability for folding proteins between 72 to 2000 residues, including large membrane and multi-domain systems, and results from two EMDB competitions. Driven by data from a single state, CryoFold discovers ensembles of common low-energy models together with rare low-probability structures that capture the equilibrium distribution of proteins constrained by the density maps. Many of these conformations, unseen by traditional methods, are experimentally validated and functionally relevant. We arrive at a set of best practices for data-guided protein folding that are controlled using a Python GUI.
Collapse
Affiliation(s)
- Mrinal Shekhar
- Center for Biophysics and Quantitative Biology, Department of Biochemistry, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Chitrak Gupta
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Gaspard Debussche
- Department of Mathematics and Computer Sciences, Grenoble INP, 38000 Grenoble, France
| | - Nicholas J Sisco
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Virginia G. Piper Center for Personalized Diagnostics, Arizona State University, Tempe, AZ 85281, USA
| | - Jonathan Nguyen
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Arup Mondal
- Chemistry Department, Quantum Theory Project, University of Florida, Gainesville, Florida, 32611, USA
| | - John Vant
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Petra Fromme
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Wade D Van Horn
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Virginia G. Piper Center for Personalized Diagnostics, Arizona State University, Tempe, AZ 85281, USA
| | - Emad Tajkhorshid
- Center for Biophysics and Quantitative Biology, Department of Biochemistry, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Ken Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Alberto Perez
- Chemistry Department, Quantum Theory Project, University of Florida, Gainesville, Florida, 32611, USA
| | - Abhishek Singharoy
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
59
|
Lobo VR, Warwicker J. Predicted pH-dependent stability of SARS-CoV-2 spike protein trimer from interfacial acidic groups. Comput Struct Biotechnol J 2021; 19:5140-5148. [PMID: 34490059 PMCID: PMC8410215 DOI: 10.1016/j.csbj.2021.08.049] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 08/29/2021] [Accepted: 08/30/2021] [Indexed: 01/10/2023] Open
Abstract
Transition between receptor binding domain (RBD) up and down forms of the SARS-CoV-2 spike protein trimer is coupled to receptor binding and is one route by which variants can alter viral properties. It is becoming apparent that key roles in the transition are played by pH and a more compact closed form, termed locked. Calculations of pH-dependence are made for a large set of spike trimers, including locked form trimer structures that have recently become available. Several acidic sidechains become sufficiently buried in the locked form to give a predicted pH-dependence in the mild acidic range, with stabilisation of the locked form as pH reduces from 7.5 to 5, consistent with emerging characterisation by cryo-electron microscopy. The calculated pH effects in pre-fusion spike trimers are modulated mainly by aspartic acid residues, rather than the more familiar histidine role at mild acidic pH. These acidic sidechains are generally surface located and weakly interacting when not in a locked conformation. According to this model, their replacement (perhaps with asparagine) would remove the pH-dependent destabilisation of locked spike trimer conformations, and increase their recovery at neutral pH. This would provide an alternative or supplement to the insertion of disulphide linkages for stabilising spike protein trimers, with potential relevance for vaccine design.
Collapse
Affiliation(s)
- Vanessa R. Lobo
- School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Institute of Biotechnology, University of Manchester, M1 7DN, UK
| | - Jim Warwicker
- School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Institute of Biotechnology, University of Manchester, M1 7DN, UK
| |
Collapse
|
60
|
Pintilie G, Chiu W. Validation, analysis and annotation of cryo-EM structures. Acta Crystallogr D Struct Biol 2021; 77:1142-1152. [PMID: 34473085 PMCID: PMC8411978 DOI: 10.1107/s2059798321006069] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Accepted: 06/09/2021] [Indexed: 11/08/2023] Open
Abstract
The process of turning 2D micrographs into 3D atomic models of the imaged macromolecules has been under rapid development and scrutiny in the field of cryo-EM. Here, some important methods for validation at several stages in this process are described. Firstly, how Fourier shell correlation of two independent maps and phase randomization beyond a certain frequency address the assessment of map resolution is reviewed. Techniques for local resolution estimation and map sharpening are also touched upon. The topic of validating models which are either built de novo or based on a known atomic structure fitted into a cryo-EM map is then approached. Map-model comparison using Q-scores and Fourier shell correlation plots is used to assure the agreement of the model with the observed map density. The importance of annotating the model with B factors to account for the resolvability of individual atoms in the map is illustrated. Finally, the timely topic of detecting and validating water molecules and metal ions in maps that have surpassed ∼2 Å resolution is described.
Collapse
Affiliation(s)
- Grigore Pintilie
- Department of Bioengineering, James H. Clark Center, Stanford University, Stanford, CA 94305, USA
| | - Wah Chiu
- Department of Bioengineering, James H. Clark Center, Stanford University, Stanford, CA 94305, USA
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, CA 94025, USA
| |
Collapse
|
61
|
Olek M, Joseph AP. Cryo-EM Map-Based Model Validation Using the False Discovery Rate Approach. Front Mol Biosci 2021; 8:652530. [PMID: 34084774 PMCID: PMC8167059 DOI: 10.3389/fmolb.2021.652530] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 04/26/2021] [Indexed: 01/18/2023] Open
Abstract
Significant technological developments and increasing scientific interest in cryogenic electron microscopy (cryo-EM) has resulted in a rapid increase in the amount of data generated by these experiments and the derived atomic models. Robust measures for the validation of 3D reconstructions and atomic models are essential for appropriate interpretation of the data. The resolution of data and availability of software tools that work across a range of resolutions often limit the quality of derived models. Hence, the final atomic model is often incomplete or contains regions where atomic positions are less reliable or incorrectly built. Extensive manual pruning and local adjustments or rebuilding are usually required to address these issues. The presented research introduces a software tool for the validation of the backbone trace of atomic models built in the cryo-EM density maps. In this study, we use the false discovery rate analysis, which can be used to segregate molecular signals from the background. Each atomic position in the model can be associated with an FDR backbone validation score, which can be used to identify potential mistraced residues. We demonstrate that the proposed validation score is complementary to existing validation metrics and is useful especially in cases where the model is built in the maps having varying local resolution. We also discuss the application of the score for automated pruning of atomic models built ab-initio during the iterative model building process in Buccaneer. We have implemented this score in the CCP-EM software suite.
Collapse
Affiliation(s)
- Mateusz Olek
- Department of Chemistry, University of York, York, United Kingdom.,Electron BioImaging Center, Rutherford Appleton Laboratory, Didcot, United Kingdom
| | - Agnel Praveen Joseph
- Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell, Didcot, United Kingdom
| |
Collapse
|
62
|
He J, Huang SY. Full-length de novo protein structure determination from cryo-EM maps using deep learning. Bioinformatics 2021; 37:3480-3490. [PMID: 33978686 DOI: 10.1093/bioinformatics/btab357] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 04/03/2021] [Accepted: 05/08/2021] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION Advances in microscopy instruments and image processing algorithms have led to an increasing number of cryo-EM maps. However, building accurate models for the EM maps at 3-5 Å resolution remains a challenging and time-consuming process. With the rapid growth of deposited EM maps, there is an increasing gap between the maps and reconstructed/modeled 3-dimensional (3D) structures. Therefore, automatic reconstruction of atomic-accuracy full-atomstructures fromEMmaps is pressingly needed. RESULTS We present a semi-automatic de novo structure determination method using a deep learningbased framework, named as DeepMM, which builds atomic-accuracy all-atom models from cryo-EM maps at near-atomic resolution. In our method, the main-chain and Cα positions as well as their amino acid and secondary structure types are predicted in the EM map using Densely Connected Convolutional Networks. DeepMM was extensively validated on 40 simulated maps at 5 Å resolution and 30 experimental maps at 2.6-4.8 Å resolution as well as an EMDB-wide data set of 2931 experimental maps at 2.6-4.9 Å resolution, and compared with state-of-the-art algorithms including RosettaES, MAINMAST, and Phenix. Overall, our DeepMM algorithm obtained a significant improvement over existing methods in terms of both accuracy and coverage in building full-length protein structures on all test sets, demonstrating the efficacy and general applicability of DeepMM. AVAILABILITY http://huanglab.phys.hust.edu.cn/DeepMM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
63
|
Affiliation(s)
- Alexis Rohou
- Department of Structural Biology, Genentech, South San Francisco, CA, USA.
| |
Collapse
|
64
|
Chiu W, Schmid MF, Pintilie GD, Lawson CL. Evolution of standardization and dissemination of cryo-EM structures and data jointly by the community, PDB, and EMDB. J Biol Chem 2021; 296:100560. [PMID: 33744287 PMCID: PMC8050867 DOI: 10.1016/j.jbc.2021.100560] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 02/08/2021] [Accepted: 03/16/2021] [Indexed: 01/04/2023] Open
Abstract
Cryogenic electron microscopy (cryo-EM) methods began to be used in the mid-1970s to study thin and periodic arrays of proteins. Following a half-century of development in cryo-specimen preparation, instrumentation, data collection, data processing, and modeling software, cryo-EM has become a routine method for solving structures from large biological assemblies to small biomolecules at near to true atomic resolution. This review explores the critical roles played by the Protein Data Bank (PDB) and Electron Microscopy Data Bank (EMDB) in partnership with the community to develop the necessary infrastructure to archive cryo-EM maps and associated models. Public access to cryo-EM structure data has in turn facilitated better understanding of structure–function relationships and advancement of image processing and modeling tool development. The partnership between the global cryo-EM community and PDB and EMDB leadership has synergistically shaped the standards for metadata, one-stop deposition of maps and models, and validation metrics to assess the quality of cryo-EM structures. The advent of cryo-electron tomography (cryo-ET) for in situ molecular cell structures at a broad resolution range and their correlations with other imaging data introduce new data archival challenges in terms of data size and complexity in the years to come.
Collapse
Affiliation(s)
- Wah Chiu
- Department of Bioengineering, Stanford University, Stanford, California, USA; Division of CryoEM and Bioimaging, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, California, USA.
| | - Michael F Schmid
- Division of CryoEM and Bioimaging, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, California, USA
| | - Grigore D Pintilie
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Catherine L Lawson
- Institute for Quantitative Biomedicine and Research Collaboratory for Structural Bioinformatics, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| |
Collapse
|
65
|
Croll TI, Williams CJ, Chen VB, Richardson DC, Richardson JS. Improving SARS-CoV-2 structures: Peer review by early coordinate release. Biophys J 2021; 120:1085-1096. [PMID: 33460600 PMCID: PMC7834719 DOI: 10.1016/j.bpj.2020.12.029] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 12/17/2020] [Accepted: 12/22/2020] [Indexed: 01/18/2023] Open
Abstract
This work builds upon the record-breaking speed and generous immediate release of new experimental three-dimensional structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) proteins and complexes, which are crucial to downstream vaccine and drug development. We have surveyed those structures to catch the occasional errors that could be significant for those important uses and for which we were able to provide demonstrably higher-accuracy corrections. This process relied on new validation and correction methods such as CaBLAM and ISOLDE, which are not yet in routine use. We found such important and correctable problems in seven early SARS-CoV-2 structures. Two of the structures were soon superseded by new higher-resolution data, confirming our proposed changes. For the other five, we emailed the depositors a documented and illustrated report and encouraged them to make the model corrections themselves and use the new option at the worldwide Protein Data Bank for depositors to re-version their coordinates without changing the Protein Data Bank code. This quickly and easily makes the better-accuracy coordinates available to anyone who examines or downloads their structure, even before formal publication. The changes have involved sequence misalignments, incorrect RNA conformations near a bound inhibitor, incorrect metal ligands, and cis-trans or peptide flips that prevent good contact at interaction sites. These improvements have propagated into nearly all related structures done afterward. This process constitutes a new form of highly rigorous peer review, which is actually faster and more strict than standard publication review because it has access to coordinates and maps; journal peer review would also be strengthened by such access.
Collapse
Affiliation(s)
| | | | - Vincent B Chen
- Department of Biochemistry, Duke University, Durham, North Carolina
| | | | - Jane S Richardson
- Department of Biochemistry, Duke University, Durham, North Carolina.
| |
Collapse
|