1
|
Toader B, Brubaker MA, Lederman RR. Efficient high-resolution refinement in cryo-EM with stochastic gradient descent. ARXIV 2023:arXiv:2311.16100v1. [PMID: 38076514 PMCID: PMC10705587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Electron cryomicroscopy (cryo-EM) is an imaging technique widely used in structural biology to determine the three-dimensional structure of biological molecules from noisy two-dimensional projections with unknown orientations. As the typical pipeline involves processing large amounts of data, efficient algorithms are crucial for fast and reliable results. The stochastic gradient descent (SGD) algorithm has been used to improve the speed of ab initio reconstruction, which results in a first, low-resolution estimation of the volume representing the molecule of interest, but has yet to be applied successfully in the high-resolution regime, where expectation-maximization algorithms achieve state-of-the-art results, at a high computational cost. In this article, we investigate the conditioning of the optimization problem and show that the large condition number prevents the successful application of gradient descent-based methods at high resolution. Our results include a theoretical analysis of the condition number of the optimization problem in a simplified setting where the individual projection directions are known, an algorithm based on computing a diagonal preconditioner using Hutchinson's diagonal estimator, and numerical experiments showing the improvement in the convergence speed when using the estimated preconditioner with SGD. The preconditioned SGD approach can potentially enable a simple and unified approach to ab initio reconstruction and high-resolution refinement with faster convergence speed and higher flexibility, and our results are a promising step in this direction.
Collapse
Affiliation(s)
- Bogdan Toader
- Department of Statistics and Data Science, Yale University
| | - Marcus A Brubaker
- Department of Electrical Engineering and Computer Science, York University
| | - Roy R Lederman
- Department of Statistics and Data Science, Yale University
| |
Collapse
|
2
|
Gilles MA, Singer A. A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.28.564422. [PMID: 37961393 PMCID: PMC10634927 DOI: 10.1101/2023.10.28.564422] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in non-crystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distribution of conformations from cryo-EM data is challenging. Current methods face issues such as a lack of explainability, overfitting caused by lack of regularization, and a large number of parameters to tune; problems exacerbated by the lack of proper metrics to evaluate or compare heterogeneous reconstructions. To address these challenges, we present RECOVAR, a white-box method based on principal component analysis (PCA) computed via regularized covariance estimation that can resolve intricate heterogeneity with similar expressive power to neural networks with significantly lower computational demands. We extend the ubiquitous Bayesian framework used in homogeneous reconstruction to automatically regularize principal components, overcoming overfitting concerns and removing the need for most parameters. We further exploit the conservation of density and distances endowed by the embedding in PCA space, opening the door to reliable free energy computation. We leverage the predictable uncertainty of image labels to generate high-resolution reconstructions and identify high-density trajectories in latent space. We make the code freely available at https://github.com/ma-gilles/recovar.
Collapse
|
3
|
Vuillemot R, Rouiller I, Jonić S. MDTOMO method for continuous conformational variability analysis in cryo electron subtomograms based on molecular dynamics simulations. Sci Rep 2023; 13:10596. [PMID: 37391578 PMCID: PMC10313669 DOI: 10.1038/s41598-023-37037-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/14/2023] [Indexed: 07/02/2023] Open
Abstract
Cryo electron tomography (cryo-ET) allows observing macromolecular complexes in their native environment. The common routine of subtomogram averaging (STA) allows obtaining the three-dimensional (3D) structure of abundant macromolecular complexes, and can be coupled with discrete classification to reveal conformational heterogeneity of the sample. However, the number of complexes extracted from cryo-ET data is usually small, which restricts the discrete-classification results to a small number of enough populated states and, thus, results in a largely incomplete conformational landscape. Alternative approaches are currently being investigated to explore the continuity of the conformational landscapes that in situ cryo-ET studies could provide. In this article, we present MDTOMO, a method for analyzing continuous conformational variability in cryo-ET subtomograms based on Molecular Dynamics (MD) simulations. MDTOMO allows obtaining an atomic-scale model of conformational variability and the corresponding free-energy landscape, from a given set of cryo-ET subtomograms. The article presents the performance of MDTOMO on a synthetic ABC exporter dataset and an in situ SARS-CoV-2 spike dataset. MDTOMO allows analyzing dynamic properties of molecular complexes to understand their biological functions, which could also be useful for structure-based drug discovery.
Collapse
Affiliation(s)
- Rémi Vuillemot
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, CC 115, 4 Place Jussieu, 75005, Paris, France
- Department of Biochemistry and Pharmacology and Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, 3010, Australia
| | - Isabelle Rouiller
- Department of Biochemistry and Pharmacology and Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, 3010, Australia
- Australian Research Council Centre for Cryo-Electron Microscopy of Membrane Proteins, Parkville, VIC, 3052, Australia
| | - Slavica Jonić
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, CC 115, 4 Place Jussieu, 75005, Paris, France.
| |
Collapse
|
4
|
Punjani A, Fleet DJ. 3DFlex: determining structure and motion of flexible proteins from cryo-EM. Nat Methods 2023:10.1038/s41592-023-01853-8. [PMID: 37169929 DOI: 10.1038/s41592-023-01853-8] [Citation(s) in RCA: 47] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 03/16/2023] [Indexed: 05/13/2023]
Abstract
Modeling flexible macromolecules is one of the foremost challenges in single-particle cryogenic-electron microscopy (cryo-EM), with the potential to illuminate fundamental questions in structural biology. We introduce Three-Dimensional Flexible Refinement (3DFlex), a motion-based neural network model for continuous molecular heterogeneity for cryo-EM data. 3DFlex exploits knowledge that conformational variability of a protein is often the result of physical processes that transport density over space and tend to preserve local geometry. From two-dimensional image data, 3DFlex enables the determination of high-resolution 3D density, and provides an explicit model of a flexible protein's motion over its conformational landscape. Experimentally, for large molecular machines (tri-snRNP spliceosome complex, translocating ribosome) and small flexible proteins (TRPV1 ion channel, αVβ8 integrin, SARS-CoV-2 spike), 3DFlex learns nonrigid molecular motions while resolving details of moving secondary structure elements. 3DFlex can improve 3D density resolution beyond the limits of existing methods because particle images contribute coherent signal over the conformational landscape.
Collapse
Affiliation(s)
- Ali Punjani
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada.
- Structura Biotechnology Inc., Toronto, Ontario, Canada.
| | - David J Fleet
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada.
- Google Research, Toronto, Ontario, Canada.
| |
Collapse
|
5
|
Toader B, Sigworth FJ, Lederman RR. Methods for Cryo-EM Single Particle Reconstruction of Macromolecules Having Continuous Heterogeneity. J Mol Biol 2023; 435:168020. [PMID: 36863660 PMCID: PMC10164696 DOI: 10.1016/j.jmb.2023.168020] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 03/02/2023]
Abstract
Macromolecules change their shape (conformation) in the process of carrying out their functions. The imaging by cryo-electron microscopy of rapidly-frozen, individual copies of macromolecules (single particles) is a powerful and general approach to understanding the motions and energy landscapes of macromolecules. Widely-used computational methods already allow the recovery of a few distinct conformations from heterogeneous single-particle samples, but the treatment of complex forms of heterogeneity such as the continuum of possible transitory states and flexible regions remains largely an open problem. In recent years there has been a surge of new approaches for treating the more general problem of continuous heterogeneity. This paper surveys the current state of the art in this area.
Collapse
Affiliation(s)
- Bogdan Toader
- Department of Statistics and Data Science, Yale University, United States.
| | - Fred J Sigworth
- Department of Cellular and Molecular Physiology, Yale University, United States
| | - Roy R Lederman
- Department of Statistics and Data Science, Yale University, United States. https://twitter.com/roylederman
| |
Collapse
|
6
|
Wu JG, Yan Y, Zhang DX, Liu BW, Zheng QB, Xie XL, Liu SQ, Ge SX, Hou ZG, Xia NS. Machine Learning for Structure Determination in Single-Particle Cryo-Electron Microscopy: A Systematic Review. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:452-472. [PMID: 34932487 DOI: 10.1109/tnnls.2021.3131325] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Recently, single-particle cryo-electron microscopy (cryo-EM) has become an indispensable method for determining macromolecular structures at high resolution to deeply explore the relevant molecular mechanism. Its recent breakthrough is mainly because of the rapid advances in hardware and image processing algorithms, especially machine learning. As an essential support of single-particle cryo-EM, machine learning has powered many aspects of structure determination and greatly promoted its development. In this article, we provide a systematic review of the applications of machine learning in this field. Our review begins with a brief introduction of single-particle cryo-EM, followed by the specific tasks and challenges of its image processing. Then, focusing on the workflow of structure determination, we describe relevant machine learning algorithms and applications at different steps, including particle picking, 2-D clustering, 3-D reconstruction, and other steps. As different tasks exhibit distinct characteristics, we introduce the evaluation metrics for each task and summarize their dynamics of technology development. Finally, we discuss the open issues and potential trends in this promising field.
Collapse
|
7
|
Chang WH, Huang SH, Lin HH, Chung SC, Tu IP. Cryo-EM Analyses Permit Visualization of Structural Polymorphism of Biological Macromolecules. FRONTIERS IN BIOINFORMATICS 2021; 1:788308. [PMID: 36303748 PMCID: PMC9580929 DOI: 10.3389/fbinf.2021.788308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 11/16/2021] [Indexed: 11/13/2022] Open
Abstract
The functions of biological macromolecules are often associated with conformational malleability of the structures. This phenomenon of chemically identical molecules with different structures is coined structural polymorphism. Conventionally, structural polymorphism is observed directly by structural determination at the density map level from X-ray crystal diffraction. Although crystallography approach can report the conformation of a macromolecule with the position of each atom accurately defined in it, the exploration of structural polymorphism and interpreting biological function in terms of crystal structures is largely constrained by the crystal packing. An alternative approach to studying the macromolecule of interest in solution is thus desirable. With the advancement of instrumentation and computational methods for image analysis and reconstruction, cryo-electron microscope (cryo-EM) has been transformed to be able to produce “in solution” structures of macromolecules routinely with resolutions comparable to crystallography but without the need of crystals. Since the sample preparation of single-particle cryo-EM allows for all forms co-existing in solution to be simultaneously frozen, the image data contain rich information as to structural polymorphism. The ensemble of structure information can be subsequently disentangled through three-dimensional (3D) classification analyses. In this review, we highlight important examples of protein structural polymorphism in relation to allostery, subunit cooperativity and function plasticity recently revealed by cryo-EM analyses, and review recent developments in 3D classification algorithms including neural network/deep learning approaches that would enable cryo-EM analyese in this regard. Finally, we brief the frontier of cryo-EM structure determination of RNA molecules where resolving the structural polymorphism is at dawn.
Collapse
Affiliation(s)
- Wei-Hau Chang
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
- *Correspondence: Wei-Hau Chang,
| | | | - Hsin-Hung Lin
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
| | - Szu-Chi Chung
- Department of Applied Mathematics, National Sun Yat-sen University, Kaohsiung, Taiwan
| | - I-Ping Tu
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
8
|
Giraldo-Barreto J, Ortiz S, Thiede EH, Palacio-Rodriguez K, Carpenter B, Barnett AH, Cossio P. A Bayesian approach to extracting free-energy profiles from cryo-electron microscopy experiments. Sci Rep 2021; 11:13657. [PMID: 34211017 PMCID: PMC8249403 DOI: 10.1038/s41598-021-92621-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 06/01/2021] [Indexed: 11/08/2022] Open
Abstract
Cryo-electron microscopy (cryo-EM) extracts single-particle density projections of individual biomolecules. Although cryo-EM is widely used for 3D reconstruction, due to its single-particle nature it has the potential to provide information about a biomolecule's conformational variability and underlying free-energy landscape. However, treating cryo-EM as a single-molecule technique is challenging because of the low signal-to-noise ratio (SNR) in individual particles. In this work, we propose the cryo-BIFE method (cryo-EM Bayesian Inference of Free-Energy profiles), which uses a path collective variable to extract free-energy profiles and their uncertainties from cryo-EM images. We test the framework on several synthetic systems where the imaging parameters and conditions were controlled. We found that for realistic cryo-EM environments and relevant biomolecular systems, it is possible to recover the underlying free energy, with the pose accuracy and SNR as crucial determinants. We then use the method to study the conformational transitions of a calcium-activated channel with real cryo-EM particles. Interestingly, we recover not only the most probable conformation (used to generate a high-resolution reconstruction of the calcium-bound state) but also a metastable state that corresponds to the calcium-unbound conformation. As expected for turnover transitions within the same sample, the activation barriers are on the order of [Formula: see text]. We expect our tool for extracting free-energy profiles from cryo-EM images to enable more complete characterization of the thermodynamic ensemble of biomolecules.
Collapse
Affiliation(s)
- Julian Giraldo-Barreto
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia
- Magnetism and Simulation Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia
| | - Sebastian Ortiz
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia
| | - Erik H Thiede
- Center for Computational Mathematics, Flatiron Institute, New York City, USA
| | - Karen Palacio-Rodriguez
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, Sorbonne Université, Paris, France
| | - Bob Carpenter
- Center for Computational Mathematics, Flatiron Institute, New York City, USA
| | - Alex H Barnett
- Center for Computational Mathematics, Flatiron Institute, New York City, USA
| | - Pilar Cossio
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia.
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438, Frankfurt am Main, Germany.
| |
Collapse
|
9
|
Sorzano COS, Carazo JM. Principal component analysis is limited to low-resolution analysis in cryoEM. Acta Crystallogr D Struct Biol 2021; 77:835-839. [PMID: 34076596 PMCID: PMC8171071 DOI: 10.1107/s2059798321002291] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 02/27/2021] [Indexed: 01/13/2023] Open
Abstract
Principal component analysis (PCA) has been widely proposed to analyze flexibility and heterogeneity in cryo-electron microscopy (cryoEM). In this paper, it is argued that (i) PCA is an excellent technique to describe continuous flexibility at low resolution (but not so much at high resolution) and (ii) PCA components should be analyzed in a concerted manner (and not independently).
Collapse
Affiliation(s)
- Carlos Oscar S. Sorzano
- National Center of Biotechnology (CSIC), Darwin 3, Campus Universidad Autónoma de Madrid, Cantoblanco, 28049 Madrid, Spain
| | - Jose Maria Carazo
- National Center of Biotechnology (CSIC), Darwin 3, Campus Universidad Autónoma de Madrid, Cantoblanco, 28049 Madrid, Spain
| |
Collapse
|
10
|
Harastani M, Eltsov M, Leforestier A, Jonic S. HEMNMA-3D: Cryo Electron Tomography Method Based on Normal Mode Analysis to Study Continuous Conformational Variability of Macromolecular Complexes. Front Mol Biosci 2021; 8:663121. [PMID: 34095222 PMCID: PMC8170028 DOI: 10.3389/fmolb.2021.663121] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 04/09/2021] [Indexed: 12/28/2022] Open
Abstract
Cryogenic electron tomography (cryo-ET) allows structural determination of biomolecules in their native environment (in situ). Its potential of providing information on the dynamics of macromolecular complexes in cells is still largely unexploited, due to the challenges of the data analysis. The crowded cell environment and continuous conformational changes of complexes make difficult disentangling the data heterogeneity. We present HEMNMA-3D, which is, to the best of our knowledge, the first method for analyzing cryo electron subtomograms in terms of continuous conformational changes of complexes. HEMNMA-3D uses a combination of elastic and rigid-body 3D-to-3D iterative alignments of a flexible 3D reference (atomic structure or electron microscopy density map) to match the conformation, orientation, and position of the complex in each subtomogram. The elastic matching combines molecular mechanics simulation (Normal Mode Analysis of the 3D reference) and experimental, subtomogram data analysis. The rigid-body alignment includes compensation for the missing wedge, due to the limited tilt angle of cryo-ET. The conformational parameters (amplitudes of normal modes) of the complexes in subtomograms obtained through the alignment are processed to visualize the distribution of conformations in a space of lower dimension (typically, 2D or 3D) referred to as space of conformations. This allows a visually interpretable insight into the dynamics of the complexes, by calculating 3D averages of subtomograms with similar conformations from selected (densest) regions and by recording movies of the 3D reference's displacement along selected trajectories through the densest regions. We describe HEMNMA-3D and show its validation using synthetic datasets. We apply HEMNMA-3D to an experimental dataset describing in situ nucleosome conformational variability. HEMNMA-3D software is available freely (open-source) as part of ContinuousFlex plugin of Scipion V3.0 (http://scipion.i2pc.es).
Collapse
Affiliation(s)
- Mohamad Harastani
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Mikhail Eltsov
- Department of Integrated Structural Biology, Institute of Genetics and Molecular and Cellular Biology, Illkirch, France
| | - Amélie Leforestier
- Laboratoire de Physique des Solides, UMR 8502 CNRS, Université Paris-Saclay, Paris, France
| | - Slavica Jonic
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| |
Collapse
|
11
|
Punjani A, Fleet DJ. 3D variability analysis: Resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM. J Struct Biol 2021; 213:107702. [PMID: 33582281 DOI: 10.1016/j.jsb.2021.107702] [Citation(s) in RCA: 425] [Impact Index Per Article: 141.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/26/2021] [Indexed: 01/06/2023]
Abstract
Single particle cryo-EM excels in determining static structures of protein molecules, but existing 3D reconstruction methods have been ineffective in modelling flexible proteins. We introduce 3D variability analysis (3DVA), an algorithm that fits a linear subspace model of conformational change to cryo-EM data at high resolution. 3DVA enables the resolution and visualization of detailed molecular motions of both large and small proteins, revealing new biological insight from single particle cryo-EM data. Experimental results demonstrate the ability of 3DVA to resolve multiple flexible motions of α-helices in the sub-50 kDa transmembrane domain of a GPCR complex, bending modes of a sodium ion channel, five types of symmetric and symmetry-breaking flexibility in a proteasome, large motions in a spliceosome complex, and discrete conformational states of a ribosome assembly. 3DVA is implemented in the cryoSPARC software package.
Collapse
Affiliation(s)
- Ali Punjani
- Department of Computer Sciences, University of Toronto M5S 3G4, Canada; Vector Institute, 710-661 University Ave., Toronto M5G 1M1, Canada; Structura Biotechnology Inc., 129-100 College Ave., Toronto M5G 1L5, Canada.
| | - David J Fleet
- Department of Computer Sciences, University of Toronto M5S 3G4, Canada; Vector Institute, 710-661 University Ave., Toronto M5G 1M1, Canada.
| |
Collapse
|
12
|
A set of common movements within GPCR-G-protein complexes from variability analysis of cryo-EM datasets. J Struct Biol 2021; 213:107699. [PMID: 33545352 DOI: 10.1016/j.jsb.2021.107699] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 01/05/2021] [Accepted: 01/22/2021] [Indexed: 12/11/2022]
Abstract
G-protein coupled receptors (GPCRs) are among the most versatile signal transducers in the cell. Once activated, GPCRs sample a large conformational space and couple to G-proteins to initiate distinct signaling pathways. The dynamical behavior of GPCR-G-protein complexes is difficult characterize structurally, and it might hinder obtaining routine high-resolution density maps in single-particle reconstructions. Here, we used variability analysis on the rhodopsin-Gi-Fab16 complex cryo-EM dataset, and the results provide insights into the dynamic nature of the receptor-complex interaction. We compare the outcome of this analysis with recent results obtained on the cannabinoid-Gi- and secretin-Gs-receptor complexes. Despite differences related to the biochemical compositions of the three samples, a set of consensus movements emerges. We anticipate that systematic variability analysis on GPCR-G-protein complexes may provide useful information not only at the biological level, but also for improving the preparation of more stable samples for cryo-EM single-particle analysis.
Collapse
|
13
|
Abstract
Single-particle electron cryomicroscopy (cryo-EM) is an increasingly popular technique for elucidating the three-dimensional structure of proteins and other biologically significant complexes at near-atomic resolution. It is an imaging method that does not require crystallization and can capture molecules in their native states. In single-particle cryo-EM, the three-dimensional molecular structure needs to be determined from many noisy two-dimensional tomographic projections of individual molecules, whose orientations and positions are unknown. The high level of noise and the unknown pose parameters are two key elements that make reconstruction a challenging computational problem. Even more challenging is the inference of structural variability and flexible motions when the individual molecules being imaged are in different conformational states. This review discusses computational methods for structure determination by single-particle cryo-EM and their guiding principles from statistical inference, machine learning, and signal processing that also play a significant role in many other data science applications.
Collapse
Affiliation(s)
- Amit Singer
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ 08544, USA
| | - Fred J Sigworth
- Departments of Cellular and Molecular Physiology, Biomedical Engineering, and Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
14
|
Lederman RR, Andén J, Singer A. Hyper-Molecules: on the Representation and Recovery of Dynamical Structures for Applications in Flexible Macro-Molecules in Cryo-EM. INVERSE PROBLEMS 2020; 36:044005. [PMID: 38304203 PMCID: PMC10831863 DOI: 10.1088/1361-6420/ab5ede] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
Abstract
Cryo-electron microscopy (cryo-EM), the subject of the 2017 Nobel Prize in Chemistry, is a technology for obtaining 3-D reconstructions of macromolecules from many noisy 2-D projections of instances of these macromolecules, whose orientations and positions are unknown. These molecules are not rigid objects, but flexible objects involved in dynamical processes. The different conformations are exhibited by different instances of the macromolecule observed in a cryo-EM experiment, each of which is recorded as a particle image. The range of conformations and the conformation of each particle are not known a priori; one of the great promises of cryo-EM is to map this conformation space. Remarkable progress has been made in reconstructing rigid molecules based on homogeneous samples of molecules in spite of the unknown orientation of each particle image and significant progress has been made in recovering a few distinct states from mixtures of rather distinct conformations, but more complex heterogeneous samples remain a major challenge. We introduce the "hyper-molecule" theoretical framework for modeling structures across different states of heterogeneous molecules, including continuums of states. The key idea behind this framework is representing heterogeneous macromolecules as high-dimensional objects, with the additional dimensions representing the conformation space. This idea is then refined to model properties such as localized heterogeneity. In addition, we introduce an algorithmic framework for reconstructing such heterogeneous objects from experimental data using a Bayesian formulation of the problem and Markov chain Monte Carlo (MCMC) algorithms to address the computational challenges in recovering these high dimensional hyper-molecules. We demonstrate these ideas in a preliminary prototype implementation, applied to synthetic data.
Collapse
Affiliation(s)
- Roy R Lederman
- The Department of Statistics and Data Science, Yale University, New Haven, CT
| | - Joakim Andén
- Center for Computational Mathematics, Flatiron Institute, New York, NY
| | - Amit Singer
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ
| |
Collapse
|
15
|
Abstract
Cross-validation is used to determine the validity of a model on unseen data by assessing if the model is overfitted to noise. It is widely used in many fields, from artificial intelligence to structural biology in X-ray crystallography and nuclear magnetic resonance. Although there are concerns of map overfitting in cryo-electron microscopy (cryo-EM), cross-validation is rarely used. The problem is that establishing a performance metric of the maps over unseen data (given by 2D-projection images) is difficult due to the low signal-to-noise ratios in the individual particles. Here, I present recent advances for cryo-EM map reconstruction. I highlight that the gold-standard procedure can fail to detect map overfitting in certain cases, showing the necessity of assessing the map quality on unbiased data. Finally, I describe the challenges and advantages of developing a robust cross-validation methodology for cryo-EM.
Collapse
Affiliation(s)
- Pilar Cossio
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellin, Colombia.,Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438 Frankfurt am Main, Germany
| |
Collapse
|
16
|
Moscovich A, Halevi A, Andén J, Singer A. Cryo-EM reconstruction of continuous heterogeneity by Laplacian spectral volumes. INVERSE PROBLEMS 2020; 36:024003. [PMID: 32394996 PMCID: PMC7213598 DOI: 10.1088/1361-6420/ab4f55] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Single-particle electron cryomicroscopy is an essential tool for high-resolution 3D reconstruction of proteins and other biological macromolecules. An important challenge in cryo-EM is the reconstruction of non-rigid molecules with parts that move and deform. Traditional reconstruction methods fail in these cases, resulting in smeared reconstructions of the moving parts. This poses a major obstacle for structural biologists, who need high-resolution reconstructions of entire macromolecules, moving parts included. To address this challenge, we present a new method for the reconstruction of macromolecules exhibiting continuous heterogeneity. The proposed method uses projection images from multiple viewing directions to construct a graph Laplacian through which the manifold of three-dimensional conformations is analyzed. The 3D molecular structures are then expanded in a basis of Laplacian eigenvectors, using a novel generalized tomographic reconstruction algorithm to compute the expansion coefficients. These coefficients, which we name spectral volumes, provide a high-resolution visualization of the molecular dynamics. We provide a theoretical analysis and evaluate the method empirically on several simulated data sets.
Collapse
Affiliation(s)
- Amit Moscovich
- Program in Applied & Computational Mathematics, Princeton University, Princeton, NJ
| | - Amit Halevi
- Program in Applied & Computational Mathematics, Princeton University, Princeton, NJ
| | - Joakim Andén
- Center for Computational Mathematics, Flatiron Institute, New York, NY
| | - Amit Singer
- Program in Applied & Computational Mathematics, Princeton University, Princeton, NJ
- Department of Mathematics, Princeton University, Princeton, NJ
| |
Collapse
|
17
|
Harastani M, Sorzano COS, Jonić S. Hybrid Electron Microscopy Normal Mode Analysis with Scipion. Protein Sci 2019; 29:223-236. [PMID: 31693263 DOI: 10.1002/pro.3772] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 11/03/2019] [Accepted: 11/04/2019] [Indexed: 12/12/2022]
Abstract
Hybrid Electron Microscopy Normal Mode Analysis (HEMNMA) method was introduced in 2014. HEMNMA computes normal modes of a reference model (an atomic structure or an electron microscopy map) of a molecular complex and uses this model and its normal modes to analyze single-particle images of the complex to obtain information on its continuous conformational changes, by determining the full distribution of conformational variability from the images. An advantage of HEMNMA is a simultaneous determination of all parameters of each image (particle conformation, orientation, and shift) through their iterative optimization, which allows applications of HEMNMA even when the effects of conformational changes dominate those of orientational changes. HEMNMA was first implemented in Xmipp and was using MATLAB for statistical analysis of obtained conformational distributions and for fitting of underlying trajectories of conformational changes. A HEMNMA implementation independent of MATLAB is now available as part of a plugin of Scipion V2.0 (http://scipion.i2pc.es). This plugin, named ContinuousFlex, can be installed by following the instructions at https://pypi.org/project/scipion-em-continuousflex. In this article, we present this new HEMNMA software, which is user-friendly, totally free, and open-source. STATEMENT FOR A BROADER AUDIENCE: This article presents Hybrid Electron Microscopy Normal Mode Analysis (HEMNMA) software that allows analyzing single-particle images of a complex to obtain information on continuous conformational changes of the complex, by determining the full distribution of conformational variability from the images. The HEMNMA software is user-friendly, totally free, open-source, and available as part of ContinuousFlex plugin (https://pypi.org/project/scipion-em-continuousflex) of Scipion V2.0 (http://scipion.i2pc.es).
Collapse
Affiliation(s)
- Mohamad Harastani
- Sorbonne Université, UMR CNRS 7590, Muséum National d'Histoire Naturelle, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | | | - Slavica Jonić
- Sorbonne Université, UMR CNRS 7590, Muséum National d'Histoire Naturelle, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| |
Collapse
|
18
|
Xu N, Doerschuk PC. Reconstruction of Stochastic 3D Signals With Symmetric Statistics From 2D Projection Images Motivated by Cryo-Electron Microscopy. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:5479-5494. [PMID: 31095482 DOI: 10.1109/tip.2019.2915631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Cryo-electron microscopy provides 2D projection images of the 3D electron scattering intensity of many instances of the particle under study (e.g., a virus). Both symmetry (rotational point groups) and heterogeneity are important aspects of biological particles and both aspects can be combined by describing the electron scattering intensity of the particle as a stochastic process with a symmetric probability law and, therefore, symmetric moments. A maximum likelihood estimator implemented by an expectation-maximization algorithm is described, which estimates the unknown statistics of the electron scattering intensity stochastic process from the images of instances of the particle. The algorithm is demonstrated on the bacteriophage HK97 and the virus [Formula: see text]. The results are contrasted with the existing algorithms, which assume that each instance of the particle has the symmetry rather than the less restrictive assumption that the probability law has the symmetry.
Collapse
|
19
|
Sorzano COS, Jiménez A, Mota J, Vilas JL, Maluenda D, Martínez M, Ramírez-Aportela E, Majtner T, Segura J, Sánchez-García R, Rancel Y, del Caño L, Conesa P, Melero R, Jonic S, Vargas J, Cazals F, Freyberg Z, Krieger J, Bahar I, Marabini R, Carazo JM. Survey of the analysis of continuous conformational variability of biological macromolecules by electron microscopy. Acta Crystallogr F Struct Biol Commun 2019; 75:19-32. [PMID: 30605122 PMCID: PMC6317454 DOI: 10.1107/s2053230x18015108] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 10/26/2018] [Indexed: 11/10/2022] Open
Abstract
Single-particle analysis by electron microscopy is a well established technique for analyzing the three-dimensional structures of biological macromolecules. Besides its ability to produce high-resolution structures, it also provides insights into the dynamic behavior of the structures by elucidating their conformational variability. Here, the different image-processing methods currently available to study continuous conformational changes are reviewed.
Collapse
Affiliation(s)
| | - A. Jiménez
- National Center of Biotechnology (CSIC), Spain
| | - J. Mota
- National Center of Biotechnology (CSIC), Spain
| | - J. L. Vilas
- National Center of Biotechnology (CSIC), Spain
| | - D. Maluenda
- National Center of Biotechnology (CSIC), Spain
| | - M. Martínez
- National Center of Biotechnology (CSIC), Spain
| | | | - T. Majtner
- National Center of Biotechnology (CSIC), Spain
| | - J. Segura
- National Center of Biotechnology (CSIC), Spain
| | | | - Y. Rancel
- National Center of Biotechnology (CSIC), Spain
| | - L. del Caño
- National Center of Biotechnology (CSIC), Spain
| | - P. Conesa
- National Center of Biotechnology (CSIC), Spain
| | - R. Melero
- National Center of Biotechnology (CSIC), Spain
| | - S. Jonic
- Sorbonne Université, UMR CNRS 7590, Muséum National d’Histoire Naturelle, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | | | - F. Cazals
- Inria Sophia Antipolis – Méditerranée, France
| | | | | | | | | | | |
Collapse
|