1
|
Turzo SMBA, Seffernick JT, Lyskov S, Lindert S. Predicting ion mobility collision cross sections using projection approximation with ROSIE-PARCS webserver. Brief Bioinform 2023; 24:bbad308. [PMID: 37609950 PMCID: PMC10516336 DOI: 10.1093/bib/bbad308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 07/03/2023] [Accepted: 08/08/2023] [Indexed: 08/24/2023] Open
Abstract
Ion mobility coupled to mass spectrometry informs on the shape and size of protein structures in the form of a collision cross section (CCSIM). Although there are several computational methods for predicting CCSIM based on protein structures, including our previously developed projection approximation using rough circular shapes (PARCS), the process usually requires prior experience with the command-line interface. To overcome this challenge, here we present a web application on the Rosetta Online Server that Includes Everyone (ROSIE) webserver to predict CCSIM from protein structure using projection approximation with PARCS. In this web interface, the user is only required to provide one or more PDB files as input. Results from our case studies suggest that CCSIM predictions (with ROSIE-PARCS) are highly accurate with an average error of 6.12%. Furthermore, the absolute difference between CCSIM and CCSPARCS can help in distinguishing accurate from inaccurate AlphaFold2 protein structure predictions. ROSIE-PARCS is designed with a user-friendly interface, is available publicly and is free to use. The ROSIE-PARCS web interface is supported by all major web browsers and can be accessed via this link (https://rosie.graylab.jhu.edu).
Collapse
Affiliation(s)
- S M Bargeen Alam Turzo
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH 43210, USA
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH 43210, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
2
|
Koehler Leman J, Künze G. Recent Advances in NMR Protein Structure Prediction with ROSETTA. Int J Mol Sci 2023; 24:ijms24097835. [PMID: 37175539 PMCID: PMC10178863 DOI: 10.3390/ijms24097835] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/15/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023] Open
Abstract
Nuclear magnetic resonance (NMR) spectroscopy is a powerful method for studying the structure and dynamics of proteins in their native state. For high-resolution NMR structure determination, the collection of a rich restraint dataset is necessary. This can be difficult to achieve for proteins with high molecular weight or a complex architecture. Computational modeling techniques can complement sparse NMR datasets (<1 restraint per residue) with additional structural information to elucidate protein structures in these difficult cases. The Rosetta software for protein structure modeling and design is used by structural biologists for structure determination tasks in which limited experimental data is available. This review gives an overview of the computational protocols available in the Rosetta framework for modeling protein structures from NMR data. We explain the computational algorithms used for the integration of different NMR data types in Rosetta. We also highlight new developments, including modeling tools for data from paramagnetic NMR and hydrogen-deuterium exchange, as well as chemical shifts in CS-Rosetta. Furthermore, strategies are discussed to complement and improve structure predictions made by the current state-of-the-art AlphaFold2 program using NMR-guided Rosetta modeling.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY 10010, USA
| | - Georg Künze
- Institute for Drug Discovery, Medical Faculty, University of Leipzig, Brüderstr. 34, D-04103 Leipzig, Germany
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstr. 16-18, D-04107 Leipzig, Germany
| |
Collapse
|
3
|
He J, Turzo SBA, Seffernick JT, Kim SS, Lindert S. Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2. J Phys Chem B 2022; 126:8439-8446. [PMID: 36251522 DOI: 10.1021/acs.jpcb.2c05508] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
The combination of deep learning and sequence data has transformed protein structure prediction and modeling, evidenced in the success of AlphaFold (AF). For this reason, many methods have been developed to take advantage of this success in areas where inaccurate structural modeling may limit computational predictiveness. For example, many methods have been developed to predict protein intrinsic disorder from sequence, including our Rosetta ResidueDisorder (RRD) approach. Intrinsically disordered regions in proteins are parts of the sequence that do not form ordered, folded structures under typical physiological conditions. In the original implementation of RRD, Rosetta ab initio models were generated, and disordered regions were predicted based on residue scores (disordered residues typically exist in regions of unfavorable scores). In this work, we show that by (i) replacing the ab initio modeling with AF (using the same scoring and disorder assignment approach) and (ii) updating the score function, the predictiveness improved significantly. Residues were better ranked by the order/disorder, evidenced by an improvement in receiver operating characteristic area-under-the-curve from 0.69 to 0.78 on a large (229 protein) and balanced data set (relatively even ordered versus disordered residues). Finally, the binary prediction accuracy also improved from 62% to 74% on the same data set. Our results show that the combined AF-RRD approach was as good as or better than all existing methods by these metrics (AF-RRD had the highest prediction accuracy).
Collapse
Affiliation(s)
- Jiadi He
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Sm Bargeen Alam Turzo
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Stephanie S Kim
- School of Biological Sciences, Seoul National University, Seoul 08826, South Korea
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
4
|
Turzo SMBA, Seffernick JT, Rolland AD, Donor MT, Heinze S, Prell JS, Wysocki VH, Lindert S. Protein shape sampled by ion mobility mass spectrometry consistently improves protein structure prediction. Nat Commun 2022; 13:4377. [PMID: 35902583 PMCID: PMC9334640 DOI: 10.1038/s41467-022-32075-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 07/14/2022] [Indexed: 11/09/2022] Open
Abstract
Ion mobility (IM) mass spectrometry provides structural information about protein shape and size in the form of an orientationally-averaged collision cross-section (CCSIM). While IM data have been used with various computational methods, they have not yet been utilized to predict monomeric protein structure from sequence. Here, we show that IM data can significantly improve protein structure determination using the modelling suite Rosetta. We develop the Rosetta Projection Approximation using Rough Circular Shapes (PARCS) algorithm that allows for fast and accurate prediction of CCSIM from structure. Following successful testing of the PARCS algorithm, we use an integrative modelling approach to utilize IM data for protein structure prediction. Additionally, we propose a confidence metric that identifies near native models in the absence of a known structure. The results of this study demonstrate the ability of IM data to consistently improve protein structure prediction. Collision cross sections (CCS) from ion mobility mass spectrometry provide information about protein shape and size. Here, the authors develop an algorithm to predict CCS and integrate experimental ion mobility data into Rosetta-based molecular modelling to predict protein structures from sequence.
Collapse
Affiliation(s)
- S M Bargeen Alam Turzo
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - Amber D Rolland
- Department of Chemistry and Biochemistry and Materials Science Institute, University of Oregon, Eugene, OR, 97403, USA
| | - Micah T Donor
- Department of Chemistry and Biochemistry and Materials Science Institute, University of Oregon, Eugene, OR, 97403, USA
| | - Sten Heinze
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - James S Prell
- Department of Chemistry and Biochemistry and Materials Science Institute, University of Oregon, Eugene, OR, 97403, USA
| | - Vicki H Wysocki
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
5
|
Tran MH, Schoeder CT, Schey KL, Meiler J. Computational Structure Prediction for Antibody-Antigen Complexes From Hydrogen-Deuterium Exchange Mass Spectrometry: Challenges and Outlook. Front Immunol 2022; 13:859964. [PMID: 35720345 PMCID: PMC9204306 DOI: 10.3389/fimmu.2022.859964] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Accepted: 04/22/2022] [Indexed: 11/21/2022] Open
Abstract
Although computational structure prediction has had great successes in recent years, it regularly fails to predict the interactions of large protein complexes with residue-level accuracy, or even the correct orientation of the protein partners. The performance of computational docking can be notably enhanced by incorporating experimental data from structural biology techniques. A rapid method to probe protein-protein interactions is hydrogen-deuterium exchange mass spectrometry (HDX-MS). HDX-MS has been increasingly used for epitope-mapping of antibodies (Abs) to their respective antigens (Ags) in the past few years. In this paper, we review the current state of HDX-MS in studying protein interactions, specifically Ab-Ag interactions, and how it has been used to inform computational structure prediction calculations. Particularly, we address the limitations of HDX-MS in epitope mapping and techniques and protocols applied to overcome these barriers. Furthermore, we explore computational methods that leverage HDX-MS to aid structure prediction, including the computational simulation of HDX-MS data and the combination of HDX-MS and protein docking. We point out challenges in interpreting and incorporating HDX-MS data into Ab-Ag complex docking and highlight the opportunities they provide to build towards a more optimized hybrid method, allowing for more reliable, high throughput epitope identification.
Collapse
Affiliation(s)
- Minh H. Tran
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, United States
- Center of Structural Biology, Vanderbilt University, Nashville, TN, United States
- Mass Spectrometry Research Center, Department of Biochemistry, Vanderbilt University, Nashville, TN, United States
| | - Clara T. Schoeder
- Center of Structural Biology, Vanderbilt University, Nashville, TN, United States
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States
- Institute for Drug Discovery, University Leipzig Medical School, Leipzig, Germany
| | - Kevin L. Schey
- Mass Spectrometry Research Center, Department of Biochemistry, Vanderbilt University, Nashville, TN, United States
| | - Jens Meiler
- Center of Structural Biology, Vanderbilt University, Nashville, TN, United States
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States
- Institute for Drug Discovery, University Leipzig Medical School, Leipzig, Germany
| |
Collapse
|
6
|
Tamburrini KC, Pesce G, Nilsson J, Gondelaud F, Kajava AV, Berrin JG, Longhi S. Predicting Protein Conformational Disorder and Disordered Binding Sites. Methods Mol Biol 2022; 2449:95-147. [PMID: 35507260 DOI: 10.1007/978-1-0716-2095-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In the last two decades it has become increasingly evident that a large number of proteins adopt either a fully or a partially disordered conformation. Intrinsically disordered proteins are ubiquitous proteins that fulfill essential biological functions while lacking a stable 3D structure. Their conformational heterogeneity is encoded by the amino acid sequence, thereby allowing intrinsically disordered proteins or regions to be recognized based on their sequence properties. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to crystallization. This chapter focuses on the methods currently employed for predicting protein disorder and identifying intrinsically disordered binding sites.
Collapse
Affiliation(s)
- Ketty C Tamburrini
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
- INRAE, Aix Marseille Univ, Biodiversité et Biotechnologie Fongiques (BBF), UMR 1163, Marseille, France
| | - Giulia Pesce
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
| | - Juliet Nilsson
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
| | - Frank Gondelaud
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier, UMR 5237, CNRS, Université Montpellier, Montpellier, France
| | - Jean-Guy Berrin
- INRAE, Aix Marseille Univ, Biodiversité et Biotechnologie Fongiques (BBF), UMR 1163, Marseille, France
| | - Sonia Longhi
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France.
| |
Collapse
|
7
|
Nguyen TT, Marzolf DR, Seffernick JT, Heinze S, Lindert S. Protein structure prediction using residue-resolved protection factors from hydrogen-deuterium exchange NMR. Structure 2021; 30:313-320.e3. [PMID: 34739840 DOI: 10.1016/j.str.2021.10.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 08/04/2021] [Accepted: 10/15/2021] [Indexed: 11/17/2022]
Abstract
Hydrogen-deuterium exchange (HDX) measured by nuclear magnetic resonance (NMR) provides structural information for proteins relating to solvent accessibility and flexibility. While this structural information is beneficial, the data cannot be used exclusively to elucidate structures. However, the structural information provided by the HDX-NMR data can be supplemented by computational methods. In previous work, we developed an algorithm in Rosetta to predict structures using qualitative HDX-NMR data (categories of exchange rate). Here we expand on the effort, and utilize quantitative protection factors (PFs) from HDX-NMR for structure prediction. From observed correlations between PFs and solvent accessibility/flexibility measures, we present a scoring function to quantify the agreement with HDX data. Using a benchmark set of 10 proteins, an average improvement of 5.13 Å in root-mean-square deviation (RMSD) is observed for cases of inaccurate Rosetta predictions. Ultimately, seven out of 10 predictions are accurate without including HDX data, and nine out of 10 are accurate when using our PF-based HDX score.
Collapse
Affiliation(s)
- Tung T Nguyen
- Department of Chemistry and Biochemistry, Denison University, Granville, OH 43023, USA
| | - Daniel R Marzolf
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 W. 18(th) Avenue, Columbus, OH 43210, USA
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 W. 18(th) Avenue, Columbus, OH 43210, USA
| | - Sten Heinze
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 W. 18(th) Avenue, Columbus, OH 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, 2114 Newman & Wolfrom Laboratory, 100 W. 18(th) Avenue, Columbus, OH 43210, USA.
| |
Collapse
|
8
|
Marzolf DR, Seffernick JT, Lindert S. Protein Structure Prediction from NMR Hydrogen-Deuterium Exchange Data. J Chem Theory Comput 2021; 17:2619-2629. [PMID: 33780620 DOI: 10.1021/acs.jctc.1c00077] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Amide hydrogen-deuterium exchange (HDX) has long been used to determine regional flexibility and binding sites in proteins; however, the data are too sparse for full structural characterization. Experiments that measure HDX rates, such as HDX-NMR, have far higher throughput compared to structure determination via X-ray crystallography, cryo-EM, or a full suite of NMR experiments. Data from HDX-NMR experiments encode information on the protein structure, making HDX a prime candidate to be supplemented by computational algorithms for protein structure prediction. We have developed a methodology to incorporate HDX-NMR data into ab initio protein structure prediction using the Rosetta software framework to predict structures based on experimental agreement. To demonstrate the efficacy of our algorithm, we examined 38 proteins with HDX-NMR data available, comparing the predicted model with and without the incorporation of HDX data into scoring. The root-mean-square deviation (rmsd, a measure of the average atomic distance between superimposed models) of the predicted model improved by 1.42 Å on average after incorporating the HDX-NMR data into scoring. The average rmsd improvement for the proteins where the selected model rmsd changed after incorporating HDX data was 3.63 Å, including one improvement of more than 11 Å and seven proteins improving by greater than 4 Å, with 12/15 proteins improving overall. Additionally, for independent verification, two proteins that were not part of the original benchmark were scored including HDX data, with a dramatic improvement of the selected model rmsd of nearly 9 Å for one of the proteins. Moreover, we have developed a confidence metric allowing us to successfully identify near-native models in the absence of a native structure. Improvement in model selection with a strong confidence measure demonstrates that protein structure prediction with HDX-NMR is a powerful tool which can be performed with minimal additional computational strain and expense.
Collapse
Affiliation(s)
- Daniel R Marzolf
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
9
|
Ferrie JJ, Petersson EJ. A Unified De Novo Approach for Predicting the Structures of Ordered and Disordered Proteins. J Phys Chem B 2020; 124:5538-5548. [PMID: 32525675 DOI: 10.1021/acs.jpcb.0c02924] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
As recognition of the abundance and relevance of intrinsically disordered proteins (IDPs) continues to grow, demand increases for methods that can rapidly predict the conformational ensembles populated by these proteins. To date, IDP simulations have largely been dominated by molecular dynamics (MD) simulations, which require significant compute times and/or complex hardware. Recent developments in MD have afforded methods capable of simulating both ordered and disordered proteins, yet to date, accurate fold prediction from a sequence has been dominated by Monte Carlo (MC)-based methods such as Rosetta. To overcome the limitations of current approaches in IDP simulation using Rosetta while maintaining its utility for modeling folded domains, we developed PyRosetta-based algorithms that allow for the accurate de novo prediction of proteins across all degrees of foldedness along with structural ensembles of disordered proteins. Our simulations have accuracy comparable to state-of-the-art MD with vastly reduced computational demands.
Collapse
Affiliation(s)
- John J Ferrie
- Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, Pennsylvania 19104-6323, United States
| | - E James Petersson
- Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, Pennsylvania 19104-6323, United States
| |
Collapse
|