1
|
Parui S, Brini E, Dill KA. Computing Free Energies of Fold-Switching Proteins Using MELD x MD. J Chem Theory Comput 2023; 19:6839-6847. [PMID: 37725050 DOI: 10.1021/acs.jctc.3c00679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/21/2023]
Abstract
Some proteins are conformational switches, able to transition between relatively different conformations. To understand what drives them requires computing the free-energy difference ΔGAB between their stable states, A and B. Molecular dynamics (MD) simulations alone are often slow because they require a reaction coordinate and must sample many transitions in between. Here, we show that modeling employing limited data (MELD) x MD on known endstates A and B is accurate and efficient because it does not require passing over barriers or knowing reaction coordinates. We validate this method on two problems: (1) it gives correct relative populations of α and β conformers for small designed chameleon sequences of protein G; and (2) it correctly predicts the conformations of the C-terminal domain (CTD) of RfaH. Free-energy methods like MELD x MD can often resolve structures that confuse machine-learning (ML) methods.
Collapse
Affiliation(s)
- Sridip Parui
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Emiliano Brini
- School of Chemistry and Materials Science, 85 Lomb Memorial Drive, Rochester, New York 14623, United States
| | - Ken A Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
2
|
Kosenko M, Onkhonova G, Susloparov I, Ryzhikov A. SARS-CoV-2 proteins structural studies using synchrotron radiation. Biophys Rev 2023; 15:1185-1194. [PMID: 37974992 PMCID: PMC10643813 DOI: 10.1007/s12551-023-01153-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 09/20/2023] [Indexed: 11/19/2023] Open
Abstract
In the process of the development of structural biology, both the size and the complexity of the determined macromolecular structures have grown significantly. As a result, the range of application areas for the results of structural studies of biological macromolecules has expanded. Significant progress in the development of structural biology methods has been largely achieved through the use of synchrotron radiation. Modern sources of synchrotron radiation allow to conduct high-performance structural studies with high temporal and spatial resolution. Thus, modern techniques make it possible to obtain not only static structures, but also to study dynamic processes, which play a key role in understanding biological mechanisms. One of the key directions in the development of structural research is the drug design based on the structures of biomolecules. Synchrotron radiation offers insights into the three-dimensional time-resolved structure of individual viral proteins and their complexes at atomic resolution. The rapid and accurate determination of protein structures is crucial for understanding viral pathogenicity and designing targeted therapeutics. Through the application of experimental techniques, including X-ray crystallography and small-angle X-ray scattering (SAXS), it is possible to elucidate the structural details of SARS-CoV-2 virion containing 4 structural, 16 nonstructural proteins (nsp), and several accessory proteins. The most studied potential targets for vaccines and drugs are the structural spike (S) protein, which is responsible for entering the host cell, as well as nonstructural proteins essential for replication and transcription, such as main protease (Mpro), papain-like protease (PLpro), and RNA-dependent RNA polymerase (RdRp). This article provides a brief overview of structural analysis techniques, with focus on synchrotron radiation-based methods applied to the analysis of SARS-CoV-2 proteins.
Collapse
Affiliation(s)
- Maksim Kosenko
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| | - Galina Onkhonova
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| | - Ivan Susloparov
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| | - Alexander Ryzhikov
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| |
Collapse
|
3
|
Parui S, Robertson JC, Somani S, Tresadern G, Liu C, Dill KA. MELD-Bracket Ranks Binding Affinities of Diverse Sets of Ligands. J Chem Inf Model 2023; 63:2857-2865. [PMID: 37093848 DOI: 10.1021/acs.jcim.3c00243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Affinity ranking of structurally diverse small-molecule ligands is a challenging problem with important applications in structure-based drug discovery. Absolute binding free energy methods can model diverse ligands, but the high computational cost of the current methods limits application to data sets with few ligands. We recently developed MELD-Bracket, a Molecular Dynamics method for efficient affinity ranking of ligands [ JCTC 2022, 18 (1), 374-379]. It utilizes a Bayesian framework to guide sampling to relevant regions of phase space, and it couples this with a bracket-like competition on a pool of ligands. Here we find that 6-competitor MELD-Bracket can rank dozens of diverse ligands that have low structural similarity and different net charges. We benchmark it on four protein systems─PTB1B, Tyk2, BACE, and JAK3─having varied modes of interactions. We also validated 8-competitor and 12-competitor protocols. The MELD-Bracket protocols presented here may have the appropriate balance of accuracy and computational efficiency to be suitable for ranking diverse ligands from typical drug discovery campaigns.
Collapse
Affiliation(s)
- Sridip Parui
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - James C Robertson
- Janssen Research and Development, Spring House, Pennsylvania 19477, United States
| | - Sandeep Somani
- Janssen Research and Development, Spring House, Pennsylvania 19477, United States
| | - Gary Tresadern
- Janssen Research and Development, Turnhoutseweg 30, Beerse B-2340, Belgium
| | - Cong Liu
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| | - Ken A Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
4
|
Hu K, Lee W, Montelione GT, Sgourakis NG, Vögeli B. Editorial: Computational approaches for interpreting experimental data and understanding protein structure, dynamics and function relationships. Front Mol Biosci 2022; 9:1018149. [PMID: 36262477 PMCID: PMC9576191 DOI: 10.3389/fmolb.2022.1018149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 09/15/2022] [Indexed: 11/16/2022] Open
Affiliation(s)
- Kaifeng Hu
- Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan, China
| | - Woonghee Lee
- Department of Chemistry, University of Colorado Denver, Denver, CO, United States
- *Correspondence: Woonghee Lee,
| | - Gaetano T. Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States
| | - Nikolaos G. Sgourakis
- Department of Pathology and Laboratory Medicine, The Children’s Hospital of Philadelphia, and Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Beat Vögeli
- Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| |
Collapse
|
5
|
Lam K, Kasavajhala K, Gunasekera S, Simmerling C. Accelerating the Ensemble Convergence of RNA Hairpin Simulations with a Replica Exchange Structure Reservoir. J Chem Theory Comput 2022; 18:3930-3947. [PMID: 35502992 PMCID: PMC10658646 DOI: 10.1021/acs.jctc.2c00065] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
RNA is a key participant in many biological processes, but studies of RNA using computer simulations lag behind those of proteins, largely due to less-developed force fields and the slow dynamics of RNA. Generating converged RNA ensembles for force field development and other studies remains a challenge. In this study, we explore the ability of replica exchange molecular dynamics to obtain well-converged conformational ensembles for two RNA hairpin systems in an implicit solvent. Even for these small model systems, standard REMD remains computationally costly, but coupling to a pre-generated structure library using the reservoir REMD approach provides a dramatic acceleration of ensemble convergence for both model systems. Such precise ensembles could facilitate RNA force field development and validation and applications of simulation to more complex RNA systems. The advantages and remaining challenges of applying R-REMD to RNA are investigated in detail.
Collapse
Affiliation(s)
- Kenneth Lam
- Molecular and Cellular Biology, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Koushik Kasavajhala
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Sarah Gunasekera
- Program in Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Carlos Simmerling
- Molecular and Cellular Biology, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
6
|
Tejero R, Huang YJ, Ramelot TA, Montelione GT. AlphaFold Models of Small Proteins Rival the Accuracy of Solution NMR Structures. Front Mol Biosci 2022; 9:877000. [PMID: 35769913 PMCID: PMC9234698 DOI: 10.3389/fmolb.2022.877000] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 04/25/2022] [Indexed: 11/13/2022] Open
Abstract
Recent advances in molecular modeling using deep learning have the potential to revolutionize the field of structural biology. In particular, AlphaFold has been observed to provide models of protein structures with accuracies rivaling medium-resolution X-ray crystal structures, and with excellent atomic coordinate matches to experimental protein NMR and cryo-electron microscopy structures. Here we assess the hypothesis that AlphaFold models of small, relatively rigid proteins have accuracies (based on comparison against experimental data) similar to experimental solution NMR structures. We selected six representative small proteins with structures determined by both NMR and X-ray crystallography, and modeled each of them using AlphaFold. Using several structure validation tools integrated under the Protein Structure Validation Software suite (PSVS), we then assessed how well these models fit to experimental NMR data, including NOESY peak lists (RPF-DP scores), comparisons between predicted rigidity and chemical shift data (ANSURR scores), and 15N-1H residual dipolar coupling data (RDC Q factors) analyzed by software tools integrated in the PSVS suite. Remarkably, the fits to NMR data for the protein structure models predicted with AlphaFold are generally similar, or better, than for the corresponding experimental NMR or X-ray crystal structures. Similar conclusions were reached in comparing AlphaFold2 predictions and NMR structures for three targets from the Critical Assessment of Protein Structure Prediction (CASP). These results contradict the widely held misperception that AlphaFold cannot accurately model solution NMR structures. They also document the value of PSVS for model vs. data assessment of protein NMR structures, and the potential for using AlphaFold models for guiding analysis of experimental NMR data and more generally in structural biology.
Collapse
Affiliation(s)
- Roberto Tejero
- Departamento de Química Física, Universidad de Valencia, Valencia, Spain
- *Correspondence: Roberto Tejero, ; Gaetano T. Montelione,
| | - Yuanpeng Janet Huang
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States
| | - Theresa A. Ramelot
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States
| | - Gaetano T. Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States
- *Correspondence: Roberto Tejero, ; Gaetano T. Montelione,
| |
Collapse
|
7
|
Hou Q, Pucci F, Pan F, Xue F, Rooman M, Feng Q. Using metagenomic data to boost protein structure prediction and discovery. Comput Struct Biotechnol J 2022; 20:434-442. [PMID: 35070166 PMCID: PMC8760478 DOI: 10.1016/j.csbj.2021.12.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 12/17/2021] [Accepted: 12/21/2021] [Indexed: 11/19/2022] Open
Abstract
Over the past decade, metagenomic sequencing approaches have been providing an ever-increasing amount of protein sequence data at an astonishing rate. These constitute an invaluable source of information which has been exploited in various research fields such as the study of the role of the gut microbiota in human diseases and aging. However, only a small fraction of all metagenomic sequences collected have been functionally or structurally characterized, leaving much of them completely unexplored. Here, we review how this information has been used in protein structure prediction and protein discovery. We begin by presenting some widely used metagenomic databases and analyze in detail how metagenomic data has contributed to the impressive improvement in the accuracy of structure prediction methods in recent years. We then examine how metagenomic information can be exploited to annotate protein sequences. More specifically, we focus on the role of metagenomes in the discovery of enzymes and new CRISPR-Cas systems, and in the identification of antibiotic resistance genes. With this review, we provide an overview of how metagenomic data is currently revolutionizing our understanding of protein science.
Collapse
Affiliation(s)
- Qingzhen Hou
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Shandong 250012, China
- National Institute of Health Data Science of China, Shandong University, Shandong 250002, China
| | - Fabrizio Pucci
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, 1050 Brussels, Belgium
| | - Fengming Pan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Shandong 250012, China
- National Institute of Health Data Science of China, Shandong University, Shandong 250002, China
| | - Fuzhong Xue
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Shandong 250012, China
- National Institute of Health Data Science of China, Shandong University, Shandong 250002, China
| | - Marianne Rooman
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, 1050 Brussels, Belgium
| | - Qiang Feng
- Shandong Provincial Key Laboratory of Oral Tissue Regeneration & Shandong Engineering Laboratory for Dental Materials and Oral Tissue Regeneration, Department of Human Microbiome, School of Stomatology, Shandong University, Jinan, Shandong Province 250012, China
- State Key Laboratory of Microbial Technology, Shandong University, Qingdao, Shandong Province 266237, China
| |
Collapse
|
8
|
Mondal A, Perez A. Simultaneous Assignment and Structure Determination of Proteins From Sparsely Labeled NMR Datasets. Front Mol Biosci 2021; 8:774394. [PMID: 34912846 PMCID: PMC8667806 DOI: 10.3389/fmolb.2021.774394] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Accepted: 10/25/2021] [Indexed: 11/29/2022] Open
Abstract
Sparsely labeled NMR samples provide opportunities to study larger biomolecular assemblies than is traditionally done by NMR. This requires new computational tools that can handle the sparsity and ambiguity in the NMR datasets. The MELD (modeling employing limited data) Bayesian approach was assessed to be the best performing in predicting structures from sparsely labeled NMR data in the 13th edition of the Critical Assessment of Structure Prediction (CASP) event—and limitations of the methodology were also noted. In this report, we evaluate the nature and difficulty in modeling unassigned sparsely labeled NMR datasets and report on an improved methodological pipeline leading to higher-accuracy predictions. We benchmark our methodology against the NMR datasets provided by CASP 13.
Collapse
Affiliation(s)
- Arup Mondal
- The Quantum Theory Project, Department of Chemistry, University of Florida, Gainesville, FL, United States
| | - Alberto Perez
- The Quantum Theory Project, Department of Chemistry, University of Florida, Gainesville, FL, United States
| |
Collapse
|
9
|
Liu C, Brini E, Dill KA. Accelerating Molecular Dynamics Enrichments of High-Affinity Ligands for Proteins. J Chem Theory Comput 2021; 18:374-379. [PMID: 34877865 DOI: 10.1021/acs.jctc.1c00855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Molecular docking algorithms are used to seek the most active compounds from a pool of ligands. In principle, molecular dynamics (MD) simulations with accurate physical potentials and sampling could yield better enrichments, but they are computationally expensive. Here, we describe a method called MELD-Bracket that utilizes biased replica exchange ladders in MD in order to compete different ligands against each other within a fast bracket style "binding tournament". MELD-Bracket finds best-binders rapidly when ligands are well separated in their binding affinities.
Collapse
Affiliation(s)
- Cong Liu
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States.,Department of Chemistry, Stony Brook University, Stony Brook, New York 11790, United States
| | - Emiliano Brini
- School of Chemistry and Materials Science, Rochester Institute of Technology, Rochester, New York 14623, United States
| | - Ken A Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States.,Department of Chemistry, Stony Brook University, Stony Brook, New York 11790, United States.,Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11790, United States
| |
Collapse
|
10
|
Sharma B, Dill KA. MELD-accelerated molecular dynamics help determine amyloid fibril structures. Commun Biol 2021; 4:942. [PMID: 34354239 PMCID: PMC8342454 DOI: 10.1038/s42003-021-02461-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 07/15/2021] [Indexed: 02/07/2023] Open
Abstract
It is challenging to determine the structures of protein fibrils such as amyloids. In principle, Molecular Dynamics (MD) modeling can aid experiments, but normal MD has been impractical for these large multi-molecules. Here, we show that MELD accelerated MD (MELD x MD) can give amyloid structures from limited data. Five long-chain fibril structures are accurately predicted from NMR and Solid State NMR (SSNMR) data. Ten short-chain fibril structures are accurately predicted from more limited restraints information derived from the knowledge of strand directions. Although the present study only tests against structure predictions - which are the most detailed form of validation currently available - the main promise of this physical approach is ultimately in going beyond structures to also give mechanical properties, conformational ensembles, and relative stabilities.
Collapse
Affiliation(s)
- Bhanita Sharma
- grid.36425.360000 0001 2216 9681Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY USA
| | - Ken A. Dill
- grid.36425.360000 0001 2216 9681Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY USA ,grid.36425.360000 0001 2216 9681Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY USA ,grid.36425.360000 0001 2216 9681Departments of Chemistry and Physics, Stony Brook University, Stony Brook, NY USA
| |
Collapse
|
11
|
Seffernick JT, Canfield SM, Harvey SR, Wysocki VH, Lindert S. Prediction of Protein Complex Structure Using Surface-Induced Dissociation and Cryo-Electron Microscopy. Anal Chem 2021; 93:7596-7605. [PMID: 33999617 DOI: 10.1021/acs.analchem.0c05468] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
A variety of techniques involving the use of mass spectrometry (MS) have been developed to obtain structural information on proteins and protein complexes. One example of these techniques, surface-induced dissociation (SID), has been used to study the oligomeric state and connectivity of protein complexes. Recently, we demonstrated that appearance energies (AE) could be extracted from SID experiments and that they correlate with structural features of specific protein-protein interfaces. While SID AE provides some structural information, the AE data alone are not sufficient to determine the structures of the complexes. For this reason, we sought to supplement the data with computational modeling, through protein-protein docking. In a previous study, we demonstrated that the scoring of structures generated from protein-protein docking could be improved with the inclusion of SID data; however, this work relied on knowledge of the correct tertiary structure and only built full complexes for a few cases. Here, we performed docking using input structures that require less prior knowledge, using homology models, unbound crystal structures, and bound+perturbed crystal structures. Using flexible ensemble docking (to build primarily subcomplexes from an ensemble of backbone structures), the RMSD100 of all (15/15) predicted structures using the combined Rosetta, cryo-electron microscopy (cryo-EM), and SID score was less than 4 Å, compared to only 7/15 without SID and cryo-EM. Symmetric docking (which used symmetry to build full complexes) resulted in predicted structures with RMSD100 less than 4 Å for 14/15 cases with experimental data, compared to only 5/15 without SID and cryo-EM. Finally, we also developed a confidence metric for which all (26/26) proteins flagged as high confidence were accurately predicted.
Collapse
Affiliation(s)
- Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 West 18th Avenue, Columbus, Ohio 43210, United States
| | - Shane M Canfield
- Department of Chemistry, Kenyon College, Gambier, Ohio 43022, United States
| | - Sophie R Harvey
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 West 18th Avenue, Columbus, Ohio 43210, United States
| | - Vicki H Wysocki
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 West 18th Avenue, Columbus, Ohio 43210, United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 West 18th Avenue, Columbus, Ohio 43210, United States
| |
Collapse
|
12
|
Abstract
Every protein has a story-how it folds, what it binds, its biological actions, and how it misbehaves in aging or disease. Stories are often inferred from a protein's shape (i.e., its structure). But increasingly, stories are told using computational molecular physics (CMP). CMP is rooted in the principled physics of driving forces and reveals granular detail of conformational populations in space and time. Recent advances are accessing longer time scales, larger actions, and blind testing, enabling more of biology's stories to be told in the language of atomistic physics.
Collapse
Affiliation(s)
- Emiliano Brini
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA
| | - Carlos Simmerling
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA.,Department of Chemistry, Stony Brook University, Stony Brook, NY 11794, USA
| | - Ken Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA. .,Department of Chemistry, Stony Brook University, Stony Brook, NY 11794, USA.,Department of Physics and Astronomy, Stony Brook University, Stony Brook, New NY 11794, USA
| |
Collapse
|
13
|
Seffernick JT, Lindert S. Hybrid methods for combined experimental and computational determination of protein structure. J Chem Phys 2020; 153:240901. [PMID: 33380110 PMCID: PMC7773420 DOI: 10.1063/5.0026025] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 11/10/2020] [Indexed: 02/04/2023] Open
Abstract
Knowledge of protein structure is paramount to the understanding of biological function, developing new therapeutics, and making detailed mechanistic hypotheses. Therefore, methods to accurately elucidate three-dimensional structures of proteins are in high demand. While there are a few experimental techniques that can routinely provide high-resolution structures, such as x-ray crystallography, nuclear magnetic resonance (NMR), and cryo-EM, which have been developed to determine the structures of proteins, these techniques each have shortcomings and thus cannot be used in all cases. However, additionally, a large number of experimental techniques that provide some structural information, but not enough to assign atomic positions with high certainty have been developed. These methods offer sparse experimental data, which can also be noisy and inaccurate in some instances. In cases where it is not possible to determine the structure of a protein experimentally, computational structure prediction methods can be used as an alternative. Although computational methods can be performed without any experimental data in a large number of studies, inclusion of sparse experimental data into these prediction methods has yielded significant improvement. In this Perspective, we cover many of the successes of integrative modeling, computational modeling with experimental data, specifically for protein folding, protein-protein docking, and molecular dynamics simulations. We describe methods that incorporate sparse data from cryo-EM, NMR, mass spectrometry, electron paramagnetic resonance, small-angle x-ray scattering, Förster resonance energy transfer, and genetic sequence covariation. Finally, we highlight some of the major challenges in the field as well as possible future directions.
Collapse
Affiliation(s)
- Justin T. Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
14
|
Khramushin A, Marcu O, Alam N, Shimony O, Padhorny D, Brini E, Dill KA, Vajda S, Kozakov D, Schueler-Furman O. Modeling beta-sheet peptide-protein interactions: Rosetta FlexPepDock in CAPRI rounds 38-45. Proteins 2020; 88:1037-1049. [PMID: 31891416 PMCID: PMC7539656 DOI: 10.1002/prot.25871] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 12/17/2019] [Accepted: 12/26/2019] [Indexed: 01/09/2023]
Abstract
Peptide-protein docking is challenging due to the considerable conformational freedom of the peptide. CAPRI rounds 38-45 included two peptide-protein interactions, both characterized by a peptide forming an additional beta strand of a beta sheet in the receptor. Using the Rosetta FlexPepDock peptide docking protocol we generated top-performing, high-accuracy models for targets 134 and 135, involving an interaction between a peptide derived from L-MAG with DLC8. In addition, we were able to generate the only medium-accuracy models for a particularly challenging target, T121. In contrast to the classical peptide-mediated interaction, in which receptor side chains contact both peptide backbone and side chains, beta-sheet complementation involves a major contribution to binding by hydrogen bonds between main chain atoms. To establish how binding affinity and specificity are established in this special class of peptide-protein interactions, we extracted PeptiDBeta, a benchmark of solved structures of different protein domains that are bound by peptides via beta-sheet complementation, and tested our protocol for global peptide-docking PIPER-FlexPepDock on this dataset. We find that the beta-strand part of the peptide is sufficient to generate approximate and even high resolution models of many interactions, but inclusion of adjacent motif residues often provides additional information necessary to achieve high resolution model quality.
Collapse
Affiliation(s)
- Alisa Khramushin
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Orly Marcu
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Nawsad Alam
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Orly Shimony
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony
Brook University, New York, New York
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
| | - Emiliano Brini
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
- Department of Physics and Astronomy, Stony Brook
University, New York, New York
- Department of Chemistry, Stony Brook University, New York,
New York
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University,
Boston, Massachusetts
- Department of Chemistry, Boston University, Boston,
Massachusetts
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony
Brook University, New York, New York
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
| | - Ora Schueler-Furman
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| |
Collapse
|
15
|
Kryshtafovych A, Schwede T, Topf M, Fidelis K, Moult J. Critical assessment of methods of protein structure prediction (CASP)-Round XIII. Proteins 2019; 87:1011-1020. [PMID: 31589781 DOI: 10.1002/prot.25823] [Citation(s) in RCA: 281] [Impact Index Per Article: 56.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 09/25/2019] [Accepted: 09/27/2019] [Indexed: 12/24/2022]
Abstract
CASP (critical assessment of structure prediction) assesses the state of the art in modeling protein structure from amino acid sequence. The most recent experiment (CASP13 held in 2018) saw dramatic progress in structure modeling without use of structural templates (historically "ab initio" modeling). Progress was driven by the successful application of deep learning techniques to predict inter-residue distances. In turn, these results drove dramatic improvements in three-dimensional structure accuracy: With the proviso that there are an adequate number of sequences known for the protein family, the new methods essentially solve the long-standing problem of predicting the fold topology of monomeric proteins. Further, the number of sequences required in the alignment has fallen substantially. There is also substantial improvement in the accuracy of template-based models. Other areas-model refinement, accuracy estimation, and the structure of protein assemblies-have again yielded interesting results. CASP13 placed increased emphasis on the use of sparse data together with modeling and chemical crosslinking, SAXS, and NMR all yielded more mature results. This paper summarizes the key outcomes of CASP13. The special issue of PROTEINS contains papers describing the CASP13 assessments in each modeling category and contributions from the participants.
Collapse
Affiliation(s)
| | - Torsten Schwede
- Biozentrum & SIB Swiss Institute of Bioinformatics, University of Basel, Basel, Switzerland
| | - Maya Topf
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, UK
| | | | - John Moult
- Institute for Bioscience and Biotechnology Research, Rockville, Maryland.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
| |
Collapse
|
16
|
Sala D, Huang YJ, Cole CA, Snyder DA, Liu G, Ishida Y, Swapna GVT, Brock KP, Sander C, Fidelis K, Kryshtafovych A, Inouye M, Tejero R, Valafar H, Rosato A, Montelione GT. Protein structure prediction assisted with sparse NMR data in CASP13. Proteins 2019; 87:1315-1332. [PMID: 31603581 DOI: 10.1002/prot.25837] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 01/05/2023]
Abstract
CASP13 has investigated the impact of sparse NMR data on the accuracy of protein structure prediction. NOESY and 15 N-1 H residual dipolar coupling data, typical of that obtained for 15 N,13 C-enriched, perdeuterated proteins up to about 40 kDa, were simulated for 11 CASP13 targets ranging in size from 80 to 326 residues. For several targets, two prediction groups generated models that are more accurate than those produced using baseline methods. Real NMR data collected for a de novo designed protein were also provided to predictors, including one data set in which only backbone resonance assignments were available. Some NMR-assisted prediction groups also did very well with these data. CASP13 also assessed whether incorporation of sparse NMR data improves the accuracy of protein structure prediction relative to nonassisted regular methods. In most cases, incorporation of sparse, noisy NMR data results in models with higher accuracy. The best NMR-assisted models were also compared with the best regular predictions of any CASP13 group for the same target. For six of 13 targets, the most accurate model provided by any NMR-assisted prediction group was more accurate than the most accurate model provided by any regular prediction group; however, for the remaining seven targets, one or more regular prediction method provided a more accurate model than even the best NMR-assisted model. These results suggest a novel approach for protein structure determination, in which advanced prediction methods are first used to generate structural models, and sparse NMR data is then used to validate and/or refine these models.
Collapse
Affiliation(s)
- Davide Sala
- Magnetic Resonance Center, University of Florence, Sesto Fiorentino, Italy.,Department of Chemistry, University of Florence, Sesto Fiorentino, Italy
| | - Yuanpeng Janet Huang
- Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey.,Department of Chemistry and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York
| | - Casey A Cole
- Department of Computer Science & Engineering, University of South Carolina, Columbia, South Carolina
| | - David A Snyder
- Department of Chemistry, College of Science and Health, William Paterson University, Wayne, New Jersey
| | - Gaohua Liu
- Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey.,Nexomics Biosciences, Bordentown, New Jersey
| | - Yojiro Ishida
- Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey.,Department of Biochemistry and Molecular Biology, The Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, New Jersey
| | - G V T Swapna
- Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey
| | - Kelly P Brock
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts
| | - Chris Sander
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts.,cBio Center, Dana-Farber Cancer Institute, Boston, Massachusetts
| | | | | | - Masayori Inouye
- Department of Biochemistry and Molecular Biology, The Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, New Jersey
| | - Roberto Tejero
- Departamento de Quimica Fisica, Universidad de Valencia, Valencia, Spain
| | - Homayoun Valafar
- Department of Computer Science & Engineering, University of South Carolina, Columbia, South Carolina
| | - Antonio Rosato
- Magnetic Resonance Center, University of Florence, Sesto Fiorentino, Italy.,Department of Chemistry, University of Florence, Sesto Fiorentino, Italy
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey.,Department of Chemistry and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York.,Department of Biochemistry and Molecular Biology, The Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, New Jersey
| |
Collapse
|