1
|
Dahmani Z, Scott AL, Vénien-Bryan C, Perahia D, Costa MG. MDFF_NM: Improved Molecular Dynamics Flexible Fitting into Cryo-EM Density Maps with a Multireplica Normal Mode-Based Search. J Chem Inf Model 2024; 64:5151-5160. [PMID: 38907694 PMCID: PMC11234365 DOI: 10.1021/acs.jcim.3c02007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 06/17/2024] [Accepted: 06/17/2024] [Indexed: 06/24/2024]
Abstract
Molecular Dynamics Flexible Fitting (MDFF) is a widely used tool to refine high-resolution structures into cryo-EM density maps. Despite many successful applications, MDFF is still limited by its high computational cost, overfitting, accuracy, and performance issues due to entrapment within wrong local minima. Modern ensemble-based MDFF tools have generated promising results in the past decade. In line with these studies, we present MDFF_NM, a stochastic hybrid flexible fitting algorithm combining Normal Mode Analysis (NMA) and simulation-based flexible fitting. Initial tests reveal that, besides accelerating the fitting process, MDFF_NM increases the diversity of fitting routes leading to the target, uncovering ensembles of conformations in closer agreement with experimental data. The potential integration of MDFF_NM with other existing methods and integrative modeling pipelines is also discussed.
Collapse
Affiliation(s)
- Zakaria
L. Dahmani
- School
of Medicine, Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch I Bldg, 3420 Forbes Avenue, Pittsburgh, Pennsylvania 15260, United States
- UMR
7590, CNRS, Museum National d’Histoire Naturelle, Institut
de Minéralogie, Physique des Matériaux et Cosmochimie,
IMPMC, Sorbonne Université, 4 place Jussieu, Paris 75005, France
| | - Ana Ligia Scott
- CMCC,
Computational Biophysics and Biology, Universidade Federal do ABC, Avenida dos Estados 5001, São Paulo, Santo André 09210-580, Brazil
- Université
de Strasbourg—IGBMC—Departament de Biologie structurale
integrative, 1 rue Laurent
Fries BP, Illkirch 10142
67404, CEDEX, France
| | - Catherine Vénien-Bryan
- UMR
7590, CNRS, Museum National d’Histoire Naturelle, Institut
de Minéralogie, Physique des Matériaux et Cosmochimie,
IMPMC, Sorbonne Université, 4 place Jussieu, Paris 75005, France
| | - David Perahia
- Laboratoire
de Biologie et Pharmacologie Appliquée, UMR 8113, École
Normale Supérieure Paris-Saclay, Gif-sur-Yvette 91190, France
| | - Mauricio G.S Costa
- UMR
7590, CNRS, Museum National d’Histoire Naturelle, Institut
de Minéralogie, Physique des Matériaux et Cosmochimie,
IMPMC, Sorbonne Université, 4 place Jussieu, Paris 75005, France
- Laboratoire
de Biologie et Pharmacologie Appliquée, UMR 8113, École
Normale Supérieure Paris-Saclay, Gif-sur-Yvette 91190, France
- Programa de Computação Científica,
Vice-Presidência de Educação, Informação
e Comunicação, Fundação Oswaldo Cruz, Av.Brasil 4365, Residência
Oficial, Manguinhos, Rio de Janeiro 21040-900, Brazil
| |
Collapse
|
2
|
Shekhar M, Terashi G, Gupta C, Sarkar D, Debussche G, Sisco NJ, Nguyen J, Mondal A, Vant J, Fromme P, Van Horn WD, Tajkhorshid E, Kihara D, Dill K, Perez A, Singharoy A. CryoFold: determining protein structures and data-guided ensembles from cryo-EM density maps. MATTER 2021; 4:3195-3216. [PMID: 35874311 PMCID: PMC9302471 DOI: 10.1016/j.matt.2021.09.004] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Cryo-electron microscopy (EM) requires molecular modeling to refine structural details from data. Ensemble models arrive at low free-energy molecular structures, but are computationally expensive and limited to resolving only small proteins that cannot be resolved by cryo-EM. Here, we introduce CryoFold - a pipeline of molecular dynamics simulations that determines ensembles of protein structures directly from sequence by integrating density data of varying sparsity at 3-5 Å resolution with coarse-grained topological knowledge of the protein folds. We present six examples showing its broad applicability for folding proteins between 72 to 2000 residues, including large membrane and multi-domain systems, and results from two EMDB competitions. Driven by data from a single state, CryoFold discovers ensembles of common low-energy models together with rare low-probability structures that capture the equilibrium distribution of proteins constrained by the density maps. Many of these conformations, unseen by traditional methods, are experimentally validated and functionally relevant. We arrive at a set of best practices for data-guided protein folding that are controlled using a Python GUI.
Collapse
Affiliation(s)
- Mrinal Shekhar
- Center for Biophysics and Quantitative Biology, Department of Biochemistry, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Chitrak Gupta
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Gaspard Debussche
- Department of Mathematics and Computer Sciences, Grenoble INP, 38000 Grenoble, France
| | - Nicholas J Sisco
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Virginia G. Piper Center for Personalized Diagnostics, Arizona State University, Tempe, AZ 85281, USA
| | - Jonathan Nguyen
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Arup Mondal
- Chemistry Department, Quantum Theory Project, University of Florida, Gainesville, Florida, 32611, USA
| | - John Vant
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Petra Fromme
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| | - Wade D Van Horn
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Virginia G. Piper Center for Personalized Diagnostics, Arizona State University, Tempe, AZ 85281, USA
| | - Emad Tajkhorshid
- Center for Biophysics and Quantitative Biology, Department of Biochemistry, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Ken Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Alberto Perez
- Chemistry Department, Quantum Theory Project, University of Florida, Gainesville, Florida, 32611, USA
| | - Abhishek Singharoy
- The School of Molecular Sciences, Arizona State University, Tempe, AZ 85287, USA
- The Biodesign Institute Center for Structural Discovery, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
3
|
Vant JW, Sarkar D, Gupta C, Shekhar MS, Mittal S, Singharoy A. Molecular Dynamics Flexible Fitting: All You Want to Know About Resolution Exchange. Methods Mol Biol 2021; 2165:301-315. [PMID: 32621233 DOI: 10.1007/978-1-0716-0708-4_18] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
In recent years, owing to the advances in instrumentation, cryo-EM has emerged as the go-to tool for obtaining high-resolution structures of biomolecular systems. However, building three-dimensional atomic structures of biomolecules from these high-resolution maps remains a concern for the traditional map-guided structure-determination schemes. Recently, we developed a computational tool, Resolution Exchange Molecular Dynamics Flexible Fitting (ReMDFF) to address this problem by re-refining a search model against a series of maps of progressively higher resolutions, which ends with the original experimental resolution (Wang et al., J Struct Biol 204(2):319-328, 2018). In this chapter, we present a step-by-step outline for preparing, executing, and analyzing ReMDFF refinements of simple proteins and multimeric complexes. The structure determination of carbon monoxide dehydrogenase and Mg2+-channel CorA are employed as case studies. All scripts are provided via GitHub (Vant, Resolution exchange molecular dynamics flexible fitting (ReMDFF) all you want to know about flexible fitting, 2019, https://github.com/jvant/ReMDFF_Singharoy_Group.git ).
Collapse
Affiliation(s)
- John W Vant
- The School of Molecular Sciences, Arizona State University, Tempe, AZ, USA
| | - Daipayan Sarkar
- The School of Molecular Sciences, Arizona State University, Tempe, AZ, USA
| | - Chitrak Gupta
- The School of Molecular Sciences, Arizona State University, Tempe, AZ, USA
| | - Mrinal S Shekhar
- Oncology, IMED Biotech Unit, AstraZeneca R&D Boston, Waltham, MA, USA
| | - Sumit Mittal
- Department of Chemistry, VIT Bhopal University, Bhopal, India
| | - Abhishek Singharoy
- The School of Molecular Sciences, Arizona State University, Tempe, AZ, USA.
| |
Collapse
|
4
|
Cryo-EM structures of a lipid-sensitive pentameric ligand-gated ion channel embedded in a phosphatidylcholine-only bilayer. Proc Natl Acad Sci U S A 2020; 117:1788-1798. [PMID: 31911476 PMCID: PMC6983364 DOI: 10.1073/pnas.1906823117] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The lipid dependence of the nicotinic acetylcholine receptor from the Torpedo electric organ has long been recognized, and one of the most consistent experimental observations is that, when reconstituted in membranes formed by zwitterionic phospholipids alone, exposure to agonist fails to elicit ion-flux activity. More recently, it has been suggested that the bacterial homolog ELIC (Erwinia chrysanthemi ligand-gated ion channel) has a similar lipid sensitivity. As a first step toward the elucidation of the structural basis of this phenomenon, we solved the structures of ELIC embedded in palmitoyl-oleoyl-phosphatidylcholine- (POPC-) only nanodiscs in both the unliganded (4.1-Å resolution) and agonist-bound (3.3 Å) states using single-particle cryoelectron microscopy. Comparison of the two structural models revealed that the largest differences occur at the level of loop C-at the agonist-binding sites-and the loops at the interface between the extracellular and transmembrane domains (ECD and TMD, respectively). On the other hand, the transmembrane pore is occluded in a remarkably similar manner in both structures. A straightforward interpretation of these findings is that POPC-only membranes frustrate the ECD-TMD coupling in such a way that the "conformational wave" of liganded-receptor gating takes place in the ECD and the interfacial M2-M3 linker but fails to penetrate the membrane and propagate into the TMD. Furthermore, analysis of the structural models and molecular simulations suggested that the higher affinity for agonists characteristic of the open- and desensitized-channel conformations results, at least in part, from the tighter confinement of the ligand to its binding site; this limits the ligand's fluctuations, and thus delays its escape into bulk solvent.
Collapse
|
5
|
Igaev M, Kutzner C, Bock LV, Vaiana AC, Grubmüller H. Automated cryo-EM structure refinement using correlation-driven molecular dynamics. eLife 2019; 8:e43542. [PMID: 30829573 PMCID: PMC6424565 DOI: 10.7554/elife.43542] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 03/03/2019] [Indexed: 12/22/2022] Open
Abstract
We present a correlation-driven molecular dynamics (CDMD) method for automated refinement of atomistic models into cryo-electron microscopy (cryo-EM) maps at resolutions ranging from near-atomic to subnanometer. It utilizes a chemically accurate force field and thermodynamic sampling to improve the real-space correlation between the modeled structure and the cryo-EM map. Our framework employs a gradual increase in resolution and map-model agreement as well as simulated annealing, and allows fully automated refinement without manual intervention or any additional rotamer- and backbone-specific restraints. Using multiple challenging systems covering a wide range of map resolutions, system sizes, starting model geometries and distances from the target state, we assess the quality of generated models in terms of both model accuracy and potential of overfitting. To provide an objective comparison, we apply several well-established methods across all examples and demonstrate that CDMD performs best in most cases.
Collapse
Affiliation(s)
- Maxim Igaev
- Department of Theoretical and Computational BiophysicsMax Planck Institute for Biophysical ChemistryGöttingenGermany
| | - Carsten Kutzner
- Department of Theoretical and Computational BiophysicsMax Planck Institute for Biophysical ChemistryGöttingenGermany
| | - Lars V Bock
- Department of Theoretical and Computational BiophysicsMax Planck Institute for Biophysical ChemistryGöttingenGermany
| | - Andrea C Vaiana
- Department of Theoretical and Computational BiophysicsMax Planck Institute for Biophysical ChemistryGöttingenGermany
| | - Helmut Grubmüller
- Department of Theoretical and Computational BiophysicsMax Planck Institute for Biophysical ChemistryGöttingenGermany
| |
Collapse
|
6
|
Richardson JS, Williams CJ, Videau LL, Chen VB, Richardson DC. Assessment of detailed conformations suggests strategies for improving cryoEM models: Helix at lower resolution, ensembles, pre-refinement fixups, and validation at multi-residue length scale. J Struct Biol 2018; 204:301-312. [PMID: 30107233 PMCID: PMC6163098 DOI: 10.1016/j.jsb.2018.08.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Revised: 08/01/2018] [Accepted: 08/08/2018] [Indexed: 11/17/2022]
Abstract
We find that the overall quite good methods used in the CryoEM Model Challenge could still benefit greatly from several strategies for improving local conformations. Our assessments primarily use validation criteria from the MolProbity web service. Those criteria include MolProbity's all-atom contact analysis, updated versions of standard conformational validations for protein and RNA, plus two recent additions: first, flags for cis-nonPro and twisted peptides, and second, the CaBLAM system for diagnosing secondary structure, validating Cα backbone, and validating adjacent peptide CO orientations in the context of the Cα trace. In general, automated ab initio building of starting models is quite good at backbone connectivity but often fails at local conformation or sequence register, especially at poorer than 3.5 Å resolution. However, we show that even if criteria (such as Ramachandran or rotamer) are explicitly restrained to improve refinement behavior and overall validation scores, automated optimization of a deposited structure seldom corrects specific misfittings that start in the wrong local minimum, but just hides them. Therefore, local problems should be identified, and as many as possible corrected, before starting refinement. Secondary structures are confusing at 3-4 Å but can be better recognized at 6-8 Å. In future model challenges, specific steps being tested (such as segmentation) and the required documentation (such as PDB code of starting model) should each be explicitly defined, so competing methods on a given task can be meaningfully compared. Individual local examples are presented here, to understand what local mistakes and corrections look like in 3D, how they probably arise, and what possible improvements to methodology might help avoid them. At these resolutions, both structural biologists and end-users need meaningful estimates of local uncertainty, perhaps through explicit ensembles. Fitting problems can best be diagnosed by validation that spans multiple residues; CaBLAM is such a multi-residue tool, and its effectiveness is demonstrated.
Collapse
Affiliation(s)
| | | | - Lizbeth L Videau
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | - Vincent B Chen
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | | |
Collapse
|
7
|
Affiliation(s)
- Catherine L Lawson
- Institute for Quantitative Biomedicine, Rutgers University, Piscataway, NJ 08854, USA.
| | - Wah Chiu
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, CA 94025, USA; Department of Bioengineering, Department of Microbiology and Immunology, James H. Clark Center, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|