1
|
Rangan R, Feathers R, Khavnekar S, Lerer A, Johnston JD, Kelley R, Obr M, Kotecha A, Zhong ED. CryoDRGN-ET: deep reconstructing generative networks for visualizing dynamic biomolecules inside cells. Nat Methods 2024:10.1038/s41592-024-02340-4. [PMID: 39025970 DOI: 10.1038/s41592-024-02340-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Accepted: 06/06/2024] [Indexed: 07/20/2024]
Abstract
Advances in cryo-electron tomography (cryo-ET) have produced new opportunities to visualize the structures of dynamic macromolecules in native cellular environments. While cryo-ET can reveal structures at molecular resolution, image processing algorithms remain a bottleneck in resolving the heterogeneity of biomolecular structures in situ. Here, we introduce cryoDRGN-ET for heterogeneous reconstruction of cryo-ET subtomograms. CryoDRGN-ET learns a deep generative model of three-dimensional density maps directly from subtomogram tilt-series images and can capture states diverse in both composition and conformation. We validate this approach by recovering the known translational states in Mycoplasma pneumoniae ribosomes in situ. We then perform cryo-ET on cryogenic focused ion beam-milled Saccharomyces cerevisiae cells. CryoDRGN-ET reveals the structural landscape of S. cerevisiae ribosomes during translation and captures continuous motions of fatty acid synthase complexes inside cells. This method is openly available in the cryoDRGN software.
Collapse
Affiliation(s)
- Ramya Rangan
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Ryan Feathers
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | | | | | - Jake D Johnston
- Physiology and Cellular Biophysics, Columbia University, New York, NY, USA
- Simons Electron Microscopy Center, New York Structural Biology Center, New York, NY, USA
| | - Ron Kelley
- Materials and Structural Analysis Division, Thermo Fisher Scientific, Eindhoven, the Netherlands
| | - Martin Obr
- Materials and Structural Analysis Division, Thermo Fisher Scientific, Eindhoven, the Netherlands
| | - Abhay Kotecha
- Materials and Structural Analysis Division, Thermo Fisher Scientific, Eindhoven, the Netherlands.
| | - Ellen D Zhong
- Department of Computer Science, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
2
|
Watson AJI, Bartesaghi A. Advances in cryo-ET data processing: meeting the demands of visual proteomics. Curr Opin Struct Biol 2024; 87:102861. [PMID: 38889501 DOI: 10.1016/j.sbi.2024.102861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 05/20/2024] [Accepted: 05/21/2024] [Indexed: 06/20/2024]
Abstract
Cryogenic electron tomography (cryo-ET), a method that enables the viewing of biomolecules in near-native environments at high resolution, is rising in accessibility and applicability. Over the past several years, once slow sample preparation and data collection procedures have seen innovations which enable rapid collection of the large datasets required for attaining high resolution structures. Increased data availability has provided a driving force for exciting improvements in cryo-ET data processing methodologies throughout the entire processing pipeline and the development of accessible graphical user interfaces (GUIs) that enable individuals inexperienced in computational fields to convert raw tilt series into 3D structures. These advances in data processing are enabling cryo-ET to attain higher resolution and extending its applicability to more complex samples.
Collapse
Affiliation(s)
- Abigail J I Watson
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Alberto Bartesaghi
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, 27710, USA; Department of Computer Science, Duke University, Durham, NC, 27708, USA; Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27708, USA.
| |
Collapse
|
3
|
Powell BM, Davis JH. Learning structural heterogeneity from cryo-electron sub-tomograms with tomoDRGN. Nat Methods 2024:10.1038/s41592-024-02210-z. [PMID: 38459385 DOI: 10.1038/s41592-024-02210-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 02/13/2024] [Indexed: 03/10/2024]
Abstract
Cryo-electron tomography (cryo-ET) enables observation of macromolecular complexes in their native, spatially contextualized cellular environment. Cryo-ET processing software to visualize such complexes at nanometer resolution via iterative alignment and averaging are well developed but rely upon assumptions of structural homogeneity among the complexes of interest. Recently developed tools allow for some assessment of structural diversity but have limited capacity to represent highly heterogeneous structures, including those undergoing continuous conformational changes. Here we extend the highly expressive cryoDRGN (Deep Reconstructing Generative Networks) deep learning architecture, originally created for single-particle cryo-electron microscopy analysis, to cryo-ET. Our new tool, tomoDRGN, learns a continuous low-dimensional representation of structural heterogeneity in cryo-ET datasets while also learning to reconstruct heterogeneous structural ensembles supported by the underlying data. Using simulated and experimental data, we describe and benchmark architectural choices within tomoDRGN that are uniquely necessitated and enabled by cryo-ET. We additionally illustrate tomoDRGN's efficacy in analyzing diverse datasets, using it to reveal high-level organization of human immunodeficiency virus (HIV) capsid complexes assembled in virus-like particles and to resolve extensive structural heterogeneity among ribosomes imaged in situ.
Collapse
Affiliation(s)
- Barrett M Powell
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Joseph H Davis
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Program in Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
4
|
Zeng X, Ding Y, Zhang Y, Uddin MR, Dabouei A, Xu M. DUAL: deep unsupervised simultaneous simulation and denoising for cryo-electron tomography. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.02.583135. [PMID: 38496657 PMCID: PMC10942334 DOI: 10.1101/2024.03.02.583135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Recent biotechnological developments in cryo-electron tomography allow direct visualization of native sub-cellular structures with unprecedented details and provide essential information on protein functions/dysfunctions. Denoising can enhance the visualization of protein structures and distributions. Automatic annotation via data simulation can ameliorate the time-consuming manual labeling of large-scale datasets. Here, we combine the two major cryo-ET tasks together in DUAL, by a specific cyclic generative adversarial network with novel noise disentanglement. This enables end-to-end unsupervised learning that requires no labeled data for training. The denoising branch outperforms existing works and substantially improves downstream particle picking accuracy on benchmark datasets. The simulation branch provides learning-based cryo-ET simulation for the first time and generates synthetic tomograms indistinguishable from experimental ones. Through comprehensive evaluations, we showcase the effectiveness of DUAL in detecting macromolecular complexes across a wide range of molecular weights in experimental datasets. The versatility of DUAL is expected to empower cryo-ET researchers by improving visual interpretability, enhancing structural detection accuracy, expediting annotation processes, facilitating cross-domain model adaptability, and compensating for missing wedge artifacts. Our work represents a significant advancement in the unsupervised mining of protein structures in cryo-ET, offering a multifaceted tool that facilitates cryo-ET research.
Collapse
Affiliation(s)
- Xiangrui Zeng
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Yizhe Ding
- Department of Statistics, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Yueqian Zhang
- School of Electrical and Electronic Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Mostofa Rafid Uddin
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Ali Dabouei
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Min Xu
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| |
Collapse
|
5
|
Vuillemot R, Harastani M, Hamitouche I, Jonic S. MDSPACE and MDTOMO Software for Extracting Continuous Conformational Landscapes from Datasets of Single Particle Images and Subtomograms Based on Molecular Dynamics Simulations: Latest Developments in ContinuousFlex Software Package. Int J Mol Sci 2023; 25:20. [PMID: 38203192 PMCID: PMC10779004 DOI: 10.3390/ijms25010020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 12/16/2023] [Accepted: 12/17/2023] [Indexed: 01/12/2024] Open
Abstract
Cryo electron microscopy (cryo-EM) instrumentation allows obtaining 3D reconstruction of the structure of biomolecular complexes in vitro (purified complexes studied by single particle analysis) and in situ (complexes studied in cells by cryo electron tomography). Standard cryo-EM approaches allow high-resolution reconstruction of only a few conformational states of a molecular complex, as they rely on data classification into a given number of classes to increase the resolution of the reconstruction from the most populated classes while discarding all other classes. Such discrete classification approaches result in a partial picture of the full conformational variability of the complex, due to continuous conformational transitions with many, uncountable intermediate states. In this article, we present the software with a user-friendly graphical interface for running two recently introduced methods, namely, MDSPACE and MDTOMO, to obtain continuous conformational landscapes of biomolecules by analyzing in vitro and in situ cryo-EM data (single particle images and subtomograms) based on molecular dynamics simulations of an available atomic model of one of the conformations. The MDSPACE and MDTOMO software is part of the open-source ContinuousFlex software package (starting from version 3.4.2 of ContinuousFlex), which can be run as a plugin of the Scipion software package (version 3.1 and later), broadly used in the cryo-EM field.
Collapse
Affiliation(s)
| | | | | | - Slavica Jonic
- IMPMC-UMR 7590 CNRS, Sorbonne Université, MNHN, 75005 Paris, France
| |
Collapse
|
6
|
Vuillemot R, Rouiller I, Jonić S. MDTOMO method for continuous conformational variability analysis in cryo electron subtomograms based on molecular dynamics simulations. Sci Rep 2023; 13:10596. [PMID: 37391578 PMCID: PMC10313669 DOI: 10.1038/s41598-023-37037-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/14/2023] [Indexed: 07/02/2023] Open
Abstract
Cryo electron tomography (cryo-ET) allows observing macromolecular complexes in their native environment. The common routine of subtomogram averaging (STA) allows obtaining the three-dimensional (3D) structure of abundant macromolecular complexes, and can be coupled with discrete classification to reveal conformational heterogeneity of the sample. However, the number of complexes extracted from cryo-ET data is usually small, which restricts the discrete-classification results to a small number of enough populated states and, thus, results in a largely incomplete conformational landscape. Alternative approaches are currently being investigated to explore the continuity of the conformational landscapes that in situ cryo-ET studies could provide. In this article, we present MDTOMO, a method for analyzing continuous conformational variability in cryo-ET subtomograms based on Molecular Dynamics (MD) simulations. MDTOMO allows obtaining an atomic-scale model of conformational variability and the corresponding free-energy landscape, from a given set of cryo-ET subtomograms. The article presents the performance of MDTOMO on a synthetic ABC exporter dataset and an in situ SARS-CoV-2 spike dataset. MDTOMO allows analyzing dynamic properties of molecular complexes to understand their biological functions, which could also be useful for structure-based drug discovery.
Collapse
Affiliation(s)
- Rémi Vuillemot
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, CC 115, 4 Place Jussieu, 75005, Paris, France
- Department of Biochemistry and Pharmacology and Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, 3010, Australia
| | - Isabelle Rouiller
- Department of Biochemistry and Pharmacology and Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, 3010, Australia
- Australian Research Council Centre for Cryo-Electron Microscopy of Membrane Proteins, Parkville, VIC, 3052, Australia
| | - Slavica Jonić
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, CC 115, 4 Place Jussieu, 75005, Paris, France.
| |
Collapse
|
7
|
Powell BM, Davis JH. Learning structural heterogeneity from cryo-electron sub-tomograms with tomoDRGN. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.31.542975. [PMID: 37398315 PMCID: PMC10312494 DOI: 10.1101/2023.05.31.542975] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Cryo-electron tomography (cryo-ET) allows one to observe macromolecular complexes in their native, spatially contextualized environment. Tools to visualize such complexes at nanometer resolution via iterative alignment and averaging are well-developed but rely on assumptions of structural homogeneity among the complexes under consideration. Recently developed downstream analysis tools allow for some assessment of macromolecular diversity but have limited capacity to represent highly heterogeneous macromolecules, including those undergoing continuous conformational changes. Here, we extend the highly expressive cryoDRGN deep learning architecture, originally created for cryo-electron microscopy single particle analysis, to sub-tomograms. Our new tool, tomoDRGN, learns a continuous low-dimensional representation of structural heterogeneity in cryo-ET datasets while also learning to reconstruct a large, heterogeneous ensemble of structures supported by the underlying data. Using simulated and experimental data, we describe and benchmark architectural choices within tomoDRGN that are uniquely necessitated and enabled by cryo-ET data. We additionally illustrate tomoDRGN's efficacy in analyzing an exemplar dataset, using it to reveal extensive structural heterogeneity among ribosomes imaged in situ.
Collapse
Affiliation(s)
- Barrett M. Powell
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Joseph H. Davis
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
- Program in Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| |
Collapse
|
8
|
Vuillemot R, Mirzaei A, Harastani M, Hamitouche I, Fréchin L, Klaholz BP, Miyashita O, Tama F, Rouiller I, Jonic S. MDSPACE: Extracting Continuous Conformational Landscapes from Cryo-EM Single Particle Datasets Using 3D-to-2D Flexible Fitting based on Molecular Dynamics Simulation. J Mol Biol 2023; 435:167951. [PMID: 36638910 DOI: 10.1016/j.jmb.2023.167951] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Revised: 12/08/2022] [Accepted: 01/03/2023] [Indexed: 01/12/2023]
Abstract
This article presents an original approach for extracting atomic-resolution landscapes of continuous conformational variability of biomolecular complexes from cryo electron microscopy (cryo-EM) single particle images. This approach is based on a new 3D-to-2D flexible fitting method, which uses molecular dynamics (MD) simulation and is embedded in an iterative conformational-landscape refinement scheme. This new approach is referred to as MDSPACE, which stands for Molecular Dynamics simulation for Single Particle Analysis of Continuous Conformational hEterogeneity. The article describes the MDSPACE approach and shows its performance using synthetic and experimental datasets.
Collapse
Affiliation(s)
- Rémi Vuillemot
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France; Department of Biochemistry & Pharmacology and Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Victoria, Australia
| | - Alex Mirzaei
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Mohamad Harastani
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Ilyes Hamitouche
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Léo Fréchin
- Centre for Integrative Biology, Department of Integrated Structural Biology, IGBMC-UMR 7104 CNRS, U964 Inserm, Université de Strasbourg, Strasbourg, France
| | - Bruno P Klaholz
- Centre for Integrative Biology, Department of Integrated Structural Biology, IGBMC-UMR 7104 CNRS, U964 Inserm, Université de Strasbourg, Strasbourg, France
| | | | - Florence Tama
- RIKEN Center for Computational Science, Kobe, Japan; Institute of Transformative Biomolecules, Graduate School of Science, Nagoya University, Nagoya, Japan; Department of Physics, Graduate School of Science, Nagoya University, Nagoya, Japan
| | - Isabelle Rouiller
- Department of Biochemistry & Pharmacology and Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Victoria, Australia
| | - Slavica Jonic
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France.
| |
Collapse
|
9
|
Harastani M, Vuillemot R, Hamitouche I, Moghadam NB, Jonic S. ContinuousFlex: Software package for analyzing continuous conformational variability of macromolecules in cryo electron microscopy and tomography data. J Struct Biol 2022; 214:107906. [PMID: 36244611 DOI: 10.1016/j.jsb.2022.107906] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 09/02/2022] [Accepted: 10/07/2022] [Indexed: 11/06/2022]
Abstract
ContinuousFlex is a user-friendly open-source software package for analyzing continuous conformational variability of macromolecules in cryo electron microscopy (cryo-EM) and cryo electron tomography (cryo-ET) data. In 2019, ContinuousFlex became available as a plugin for Scipion, an image processing software package extensively used in the cryo-EM field. Currently, ContinuousFlex contains software for running (1) recently published methods HEMNMA-3D, TomoFlow, and NMMD; (2) earlier published methods HEMNMA and StructMap; and (3) methods for simulating cryo-EM and cryo-ET data with conformational variability and methods for data preprocessing. It also includes external software for molecular dynamics simulation (GENESIS) and normal mode analysis (ElNemo), used in some of the mentioned methods. The HEMNMA software has been presented in the past, but not the software of other methods. Besides, ContinuousFlex currently also offers a deep learning extension of HEMNMA, named DeepHEMNMA. In this article, we review these methods in the context of the ContinuousFlex package, developed to facilitate their use by the community.
Collapse
Affiliation(s)
- Mohamad Harastani
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Rémi Vuillemot
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Ilyes Hamitouche
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Nima Barati Moghadam
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France
| | - Slavica Jonic
- IMPMC-UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle, Paris, France.
| |
Collapse
|
10
|
Uddin MR, Howe G, Zeng X, Xu M. Harmony: A Generic Unsupervised Approach for Disentangling Semantic Content from Parameterized Transformations. PROCEEDINGS. IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION 2022; 2022:20614-20623. [PMID: 36188422 PMCID: PMC9521798 DOI: 10.1109/cvpr52688.2022.01999] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In many real-life image analysis applications, particularly in biomedical research domains, the objects of interest undergo multiple transformations that alters their visual properties while keeping the semantic content unchanged. Disentangling images into semantic content factors and transformations can provide significant benefits into many domain-specific image analysis tasks. To this end, we propose a generic unsupervised framework, Harmony, that simultaneously and explicitly disentangles semantic content from multiple parameterized transformations. Harmony leverages a simple cross-contrastive learning framework with multiple explicitly parameterized latent representations to disentangle content from transformations. To demonstrate the efficacy of Harmony, we apply it to disentangle image semantic content from several parameterized transformations (rotation, translation, scaling, and contrast). Harmony achieves significantly improved disentanglement over the baseline models on several image datasets of diverse domains. With such disentanglement, Harmony is demonstrated to incentivize bioimage analysis research by modeling structural heterogeneity of macromolecules from cryo-ET images and learning transformation-invariant representations of protein particles from single-particle cryo-EM images. Harmony also performs very well in disentangling content from 3D transformations and can perform coarse and fast alignment of 3D cryo-ET subtomograms. Therefore, Harmony is generalizable to many other imaging domains and can potentially be extended to domains beyond imaging as well.
Collapse
Affiliation(s)
- Mostofa Rafid Uddin
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Gregory Howe
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
11
|
Jiménez de la Morena J, Conesa P, Fonseca YC, de Isidro-Gómez FP, Herreros D, Fernández-Giménez E, Strelak D, Moebel E, Buchholz TO, Jug F, Martinez-Sanchez A, Harastani M, Jonic S, Conesa JJ, Cuervo A, Losana P, Sánchez I, Iceta M, Del Cano L, Gragera M, Melero R, Sharov G, Castaño-Díez D, Koster A, Piccirillo JG, Vilas JL, Otón J, Marabini R, Sorzano COS, Carazo JM. ScipionTomo: Towards cryo-electron tomography software integration, reproducibility, and validation. J Struct Biol 2022; 214:107872. [PMID: 35660516 PMCID: PMC7613607 DOI: 10.1016/j.jsb.2022.107872] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 05/26/2022] [Accepted: 05/28/2022] [Indexed: 11/25/2022]
Abstract
Image processing in cryogenic electron tomography (cryoET) is currently at a similar state as Single Particle Analysis (SPA) in cryogenic electron microscopy (cryoEM) was a few years ago. Its data processing workflows are far from being well defined and the user experience is still not smooth. Moreover, file formats of different software packages and their associated metadata are not standardized, mainly since different packages are developed by different groups, focusing on different steps of the data processing pipeline. The Scipion framework, originally developed for SPA (de la Rosa-Trevín et al., 2016), has a generic python workflow engine that gives it the versatility to be extended to other fields, as demonstrated for model building (Martínez et al., 2020). In this article, we provide an extension of Scipion based on a set of tomography plugins (referred to as ScipionTomo hereafter), with a similar purpose: to allow users to be focused on the data processing and analysis instead of having to deal with multiple software installation issues and the inconvenience of switching from one to another, converting metadata files, managing possible incompatibilities, scripting (writing a simple program in a language that the computer must convert to machine language each time the program is run), etcetera. Additionally, having all the software available in an integrated platform allows comparing the results of different algorithms trying to solve the same problem. In this way, the commonalities and differences between estimated parameters shed light on which results can be more trusted than others. ScipionTomo is developed by a collaborative multidisciplinary team composed of Scipion team engineers, structural biologists, and in some cases, the developers whose software packages have been integrated. It is open to anyone in the field willing to contribute to this project. The result is a framework extension that combines the acquired knowledge of Scipion developers in close collaboration with third-party developers, and the on-demand design of functionalities requested by beta testers applying this solution to actual biological problems.
Collapse
Affiliation(s)
| | - P Conesa
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - Y C Fonseca
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | | | - D Herreros
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | | | - D Strelak
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain; Masaryk University, Brno, Czech Republic
| | - E Moebel
- Inria Rennes - Bretagne Atlantique, Rennes
| | - T O Buchholz
- Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG), Germany; Center for Systems Biology Dresden (CSBD), Germany
| | - F Jug
- Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG), Germany; Fondazione Human Technopole, Milan, Italy
| | - A Martinez-Sanchez
- University of Oviedo, Department of Computer Sciences, Oviedo, Spain; Health Research Institute of Asturias (ISPA), Oviedo, Spain
| | - M Harastani
- IMPMC-UMR 7590 CNRS, Sorbonne Université, MNHN, Paris, France
| | - S Jonic
- IMPMC-UMR 7590 CNRS, Sorbonne Université, MNHN, Paris, France
| | - J J Conesa
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - A Cuervo
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - P Losana
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - I Sánchez
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - M Iceta
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - L Del Cano
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - M Gragera
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - R Melero
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - G Sharov
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - D Castaño-Díez
- BioEM Lab, Biozentrum, University of Basel, Basel, Switzerland
| | - A Koster
- University of Leiden, Ultrastructural and molecular imaging, Leiden, The Netherlands
| | - J G Piccirillo
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - J L Vilas
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - J Otón
- Alba Synchrotron - CELLS (ICTS), Barcelona, Spain
| | - R Marabini
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain; Superior Polytechnic School. Univ. Autónoma of Madrid. Madrid, Spain
| | - C O S Sorzano
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | - J M Carazo
- National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| |
Collapse
|