1
|
Sun J, Kulandaisamy A, Ru J, Gromiha MM, Cribbs AP. TMKit: a Python interface for computational analysis of transmembrane proteins. Brief Bioinform 2023; 24:bbad288. [PMID: 37594311 PMCID: PMC10516361 DOI: 10.1093/bib/bbad288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 07/07/2023] [Accepted: 07/18/2023] [Indexed: 08/19/2023] Open
Abstract
Transmembrane proteins are receptors, enzymes, transporters and ion channels that are instrumental in regulating a variety of cellular activities, such as signal transduction and cell communication. Despite tremendous progress in computational capacities to support protein research, there is still a significant gap in the availability of specialized computational analysis toolkits for transmembrane protein research. Here, we introduce TMKit, an open-source Python programming interface that is modular, scalable and specifically designed for processing transmembrane protein data. TMKit is a one-stop computational analysis tool for transmembrane proteins, enabling users to perform database wrangling, engineer features at the mutational, domain and topological levels, and visualize protein-protein interaction interfaces. In addition, TMKit includes seqNetRR, a high-performance computing library that allows customized construction of a large number of residue connections. This library is particularly well suited for assigning correlation matrix-based features at a fast speed. TMKit should serve as a useful tool for researchers in assisting the study of transmembrane protein sequences and structures. TMKit is publicly available through https://github.com/2003100127/tmkit and https://tmkit-guide.herokuapp.com/doc/overview.
Collapse
Affiliation(s)
- Jianfeng Sun
- Nuffield Department of Orthopedics, Rheumatology, and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Headington, Oxford OX3 7LD, UK
| | - Arulsamy Kulandaisamy
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Jinlong Ru
- Chair of Prevention of Microbial Diseases, School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Adam P Cribbs
- Nuffield Department of Orthopedics, Rheumatology, and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Headington, Oxford OX3 7LD, UK
| |
Collapse
|
2
|
Turzo SMBA, Seffernick JT, Lyskov S, Lindert S. Predicting ion mobility collision cross sections using projection approximation with ROSIE-PARCS webserver. Brief Bioinform 2023; 24:bbad308. [PMID: 37609950 PMCID: PMC10516336 DOI: 10.1093/bib/bbad308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 07/03/2023] [Accepted: 08/08/2023] [Indexed: 08/24/2023] Open
Abstract
Ion mobility coupled to mass spectrometry informs on the shape and size of protein structures in the form of a collision cross section (CCSIM). Although there are several computational methods for predicting CCSIM based on protein structures, including our previously developed projection approximation using rough circular shapes (PARCS), the process usually requires prior experience with the command-line interface. To overcome this challenge, here we present a web application on the Rosetta Online Server that Includes Everyone (ROSIE) webserver to predict CCSIM from protein structure using projection approximation with PARCS. In this web interface, the user is only required to provide one or more PDB files as input. Results from our case studies suggest that CCSIM predictions (with ROSIE-PARCS) are highly accurate with an average error of 6.12%. Furthermore, the absolute difference between CCSIM and CCSPARCS can help in distinguishing accurate from inaccurate AlphaFold2 protein structure predictions. ROSIE-PARCS is designed with a user-friendly interface, is available publicly and is free to use. The ROSIE-PARCS web interface is supported by all major web browsers and can be accessed via this link (https://rosie.graylab.jhu.edu).
Collapse
Affiliation(s)
- S M Bargeen Alam Turzo
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH 43210, USA
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH 43210, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
3
|
Wells NGM, Smith CA. Predicting binding affinity changes from long-distance mutations using molecular dynamics simulations and Rosetta. Proteins 2023. [PMID: 36757060 DOI: 10.1002/prot.26477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 01/20/2023] [Accepted: 02/07/2023] [Indexed: 02/10/2023]
Abstract
Computationally modeling how mutations affect protein-protein binding not only helps uncover the biophysics of protein interfaces, but also enables the redesign and optimization of protein interactions. Traditional high-throughput methods for estimating binding free energy changes are currently limited to mutations directly at the interface due to difficulties in accurately modeling how long-distance mutations propagate their effects through the protein structure. However, the modeling and design of such mutations is of substantial interest as it allows for greater control and flexibility in protein design applications. We have developed a method that combines high-throughput Rosetta-based side-chain optimization with conformational sampling using classical molecular dynamics simulations, finding significant improvements in our ability to accurately predict long-distance mutational perturbations to protein binding. Our approach uses an analytical framework grounded in alchemical free energy calculations while enabling exploration of a vastly larger sequence space. When comparing to experimental data, we find that our method can predict internal long-distance mutational perturbations with a level of accuracy similar to that of traditional methods in predicting the effects of mutations at the protein-protein interface. This work represents a new and generalizable approach to optimize protein free energy landscapes for desired biological functions.
Collapse
Affiliation(s)
- Nicholas G M Wells
- Department of Chemistry, Wesleyan University, Middletown, Connecticut, USA
| | - Colin A Smith
- Department of Chemistry, Wesleyan University, Middletown, Connecticut, USA
| |
Collapse
|
4
|
Llanos MA, Alberca LN, Ruiz MD, Sbaraglini ML, Miranda C, Pino-Martinez A, Fraccaroli L, Carrillo C, Alba Soto CD, Gavernet L, Talevi A. A combined ligand and target-based virtual screening strategy to repurpose drugs as putrescine uptake inhibitors with trypanocidal activity. J Comput Aided Mol Des 2023; 37:75-90. [PMID: 36494599 DOI: 10.1007/s10822-022-00491-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 11/17/2022] [Indexed: 12/13/2022]
Abstract
Chagas disease, also known as American trypanosomiasis, is a neglected tropical disease caused by the protozoa Trypanosoma cruzi, affecting nearly 7 million people only in the Americas. Polyamines are essential compounds for parasite growth, survival, and differentiation. However, because trypanosomatids are auxotrophic for polyamines, they must be obtained from the host by specific transporters. In this investigation, an ensemble of QSAR classifiers able to identify polyamine analogs with trypanocidal activity was developed. Then, a multi-template homology model of the dimeric polyamine transporter of T. cruzi, TcPAT12, was created with Rosetta, and then refined by enhanced sampling molecular dynamics simulations. Using representative snapshots extracted from the trajectory, a docking model able to discriminate between active and inactive compounds was developed and validated. Both models were applied in a parallel virtual screening campaign to repurpose known drugs as anti-trypanosomal compounds inhibiting polyamine transport in T. cruzi. Montelukast, Quinestrol, Danazol, and Dutasteride were selected for in vitro testing, and all of them inhibited putrescine uptake in biochemical assays, confirming the predictive ability of the computational models. Furthermore, all the confirmed hits proved to inhibit epimastigote proliferation, and Quinestrol and Danazol were able to inhibit, in the low micromolar range, the viability of trypomastigotes and the intracellular growth of amastigotes.
Collapse
Affiliation(s)
- Manuel A Llanos
- Departamento de Ciencias Biológicas and Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Facultad de Ciencias Exactas, Universidad Nacional de La Plata. La Plata (B1900ADU), Buenos Aires, Argentina
| | - Lucas N Alberca
- Departamento de Ciencias Biológicas and Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Facultad de Ciencias Exactas, Universidad Nacional de La Plata. La Plata (B1900ADU), Buenos Aires, Argentina
| | - María D Ruiz
- Laboratorio de Biología Molecular y Bioquímica en Trypanosoma cruzi y otros agentes infecciosos, Instituto de Ciencia y Tecnología (ICT) Milstein - Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Buenos Aires, Argentina
| | - María L Sbaraglini
- Departamento de Ciencias Biológicas and Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Facultad de Ciencias Exactas, Universidad Nacional de La Plata. La Plata (B1900ADU), Buenos Aires, Argentina
| | - Cristian Miranda
- Departamento de Microbiología, Parasitología e Inmunología, Facultad de Medicina, Instituto de Investigaciones en Microbiología y Parasitología Médica (IMPaM, UBA-CONICET), Universidad de Buenos Aires., Buenos Aires, Argentina
| | - Agustina Pino-Martinez
- Departamento de Microbiología, Parasitología e Inmunología, Facultad de Medicina, Instituto de Investigaciones en Microbiología y Parasitología Médica (IMPaM, UBA-CONICET), Universidad de Buenos Aires., Buenos Aires, Argentina
| | - Laura Fraccaroli
- Laboratorio de Biología Molecular y Bioquímica en Trypanosoma cruzi y otros agentes infecciosos, Instituto de Ciencia y Tecnología (ICT) Milstein - Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Buenos Aires, Argentina
| | - Carolina Carrillo
- Laboratorio de Biología Molecular y Bioquímica en Trypanosoma cruzi y otros agentes infecciosos, Instituto de Ciencia y Tecnología (ICT) Milstein - Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Buenos Aires, Argentina
| | - Catalina D Alba Soto
- Departamento de Microbiología, Parasitología e Inmunología, Facultad de Medicina, Instituto de Investigaciones en Microbiología y Parasitología Médica (IMPaM, UBA-CONICET), Universidad de Buenos Aires., Buenos Aires, Argentina
| | - Luciana Gavernet
- Departamento de Ciencias Biológicas and Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Facultad de Ciencias Exactas, Universidad Nacional de La Plata. La Plata (B1900ADU), Buenos Aires, Argentina.
| | - Alan Talevi
- Departamento de Ciencias Biológicas and Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Facultad de Ciencias Exactas, Universidad Nacional de La Plata. La Plata (B1900ADU), Buenos Aires, Argentina
| |
Collapse
|
5
|
Drake ZC, Seffernick JT, Lindert S. Protein complex prediction using Rosetta, AlphaFold, and mass spectrometry covalent labeling. Nat Commun 2022; 13:7846. [PMID: 36543826 PMCID: PMC9772387 DOI: 10.1038/s41467-022-35593-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 12/09/2022] [Indexed: 12/24/2022] Open
Abstract
Covalent labeling (CL) in combination with mass spectrometry can be used as an analytical tool to study and determine structural properties of protein-protein complexes. However, data from these experiments is sparse and does not unambiguously elucidate protein structure. Thus, computational algorithms are needed to deduce structure from the CL data. In this work, we present a hybrid method that combines models of protein complex subunits generated with AlphaFold with differential CL data via a CL-guided protein-protein docking in Rosetta. In a benchmark set, the RMSD (root-mean-square deviation) of the best-scoring models was below 3.6 Å for 5/5 complexes with inclusion of CL data, whereas the same quality was only achieved for 1/5 complexes without CL data. This study suggests that our integrated approach can successfully use data obtained from CL experiments to distinguish between nativelike and non-nativelike models.
Collapse
Affiliation(s)
- Zachary C Drake
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, US
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, US
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, US.
| |
Collapse
|
6
|
Turzo SMBA, Seffernick JT, Rolland AD, Donor MT, Heinze S, Prell JS, Wysocki VH, Lindert S. Protein shape sampled by ion mobility mass spectrometry consistently improves protein structure prediction. Nat Commun 2022; 13:4377. [PMID: 35902583 PMCID: PMC9334640 DOI: 10.1038/s41467-022-32075-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 07/14/2022] [Indexed: 11/09/2022] Open
Abstract
Ion mobility (IM) mass spectrometry provides structural information about protein shape and size in the form of an orientationally-averaged collision cross-section (CCSIM). While IM data have been used with various computational methods, they have not yet been utilized to predict monomeric protein structure from sequence. Here, we show that IM data can significantly improve protein structure determination using the modelling suite Rosetta. We develop the Rosetta Projection Approximation using Rough Circular Shapes (PARCS) algorithm that allows for fast and accurate prediction of CCSIM from structure. Following successful testing of the PARCS algorithm, we use an integrative modelling approach to utilize IM data for protein structure prediction. Additionally, we propose a confidence metric that identifies near native models in the absence of a known structure. The results of this study demonstrate the ability of IM data to consistently improve protein structure prediction. Collision cross sections (CCS) from ion mobility mass spectrometry provide information about protein shape and size. Here, the authors develop an algorithm to predict CCS and integrate experimental ion mobility data into Rosetta-based molecular modelling to predict protein structures from sequence.
Collapse
Affiliation(s)
- S M Bargeen Alam Turzo
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - Amber D Rolland
- Department of Chemistry and Biochemistry and Materials Science Institute, University of Oregon, Eugene, OR, 97403, USA
| | - Micah T Donor
- Department of Chemistry and Biochemistry and Materials Science Institute, University of Oregon, Eugene, OR, 97403, USA
| | - Sten Heinze
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - James S Prell
- Department of Chemistry and Biochemistry and Materials Science Institute, University of Oregon, Eugene, OR, 97403, USA
| | - Vicki H Wysocki
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry and Resource for Native Mass Spectrometry Guided Structural Biology, Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
7
|
He J, Lin P, Chen J, Cao H, Huang SY. Model building of protein complexes from intermediate-resolution cryo-EM maps with deep learning-guided automatic assembly. Nat Commun 2022; 13:4066. [PMID: 35831370 PMCID: PMC9279371 DOI: 10.1038/s41467-022-31748-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 06/30/2022] [Indexed: 12/29/2022] Open
Abstract
Advances in microscopy instruments and image processing algorithms have led to an increasing number of cryo-electron microscopy (cryo-EM) maps. However, building accurate models into intermediate-resolution EM maps remains challenging and labor-intensive. Here, we propose an automatic model building method of multi-chain protein complexes from intermediate-resolution cryo-EM maps, named EMBuild, by integrating AlphaFold structure prediction, FFT-based global fitting, domain-based semi-flexible refinement, and graph-based iterative assembling on the main-chain probability map predicted by a deep convolutional network. EMBuild is extensively evaluated on diverse test sets of 47 single-particle EM maps at 4.0-8.0 Å resolution and 16 subtomogram averaging maps of cryo-ET data at 3.7-9.3 Å resolution, and compared with state-of-the-art approaches. We demonstrate that EMBuild is able to build high-quality complex structures that are comparably accurate to the manually built PDB structures from the cryo-EM maps. These results demonstrate the accuracy and reliability of EMBuild in automatic model building.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Peicong Lin
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Ji Chen
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Hong Cao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Sheng-You Huang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| |
Collapse
|
8
|
Puthenveetil R, Christenson ET, Vinogradova O. New Horizons in Structural Biology of Membrane Proteins: Experimental Evaluation of the Role of Conformational Dynamics and Intrinsic Flexibility. MEMBRANES 2022; 12:227. [PMID: 35207148 PMCID: PMC8877495 DOI: 10.3390/membranes12020227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 02/13/2022] [Accepted: 02/14/2022] [Indexed: 02/08/2023]
Abstract
A plethora of membrane proteins are found along the cell surface and on the convoluted labyrinth of membranes surrounding organelles. Since the advent of various structural biology techniques, a sub-population of these proteins has become accessible to investigation at near-atomic resolutions. The predominant bona fide methods for structure solution, X-ray crystallography and cryo-EM, provide high resolution in three-dimensional space at the cost of neglecting protein motions through time. Though structures provide various rigid snapshots, only an amorphous mechanistic understanding can be inferred from interpolations between these different static states. In this review, we discuss various techniques that have been utilized in observing dynamic conformational intermediaries that remain elusive from rigid structures. More specifically we discuss the application of structural techniques such as NMR, cryo-EM and X-ray crystallography in studying protein dynamics along with complementation by conformational trapping by specific binders such as antibodies. We finally showcase the strength of various biophysical techniques including FRET, EPR and computational approaches using a multitude of succinct examples from GPCRs, transporters and ion channels.
Collapse
Affiliation(s)
- Robbins Puthenveetil
- Section on Structural and Chemical Biology of Membrane Proteins, Neurosciences and Cellular and Structural Biology Division, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, 35A Convent Dr., Bethesda, MD 20892, USA
| | | | - Olga Vinogradova
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
9
|
Zhang C, Zhao DX, Feng Y, Wang J, Yang ZZ. Energetics and J-coupling constants for Ala, Gly, and Val peptides demonstrated using ABEEM polarizable force field in vacuo and an aqueous solution. Phys Chem Chem Phys 2022; 24:4232-4250. [DOI: 10.1039/d1cp05676j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The development of an atom-bond electronegativity equalisation method at the σπ-level (ABEEM) polarisable force field (PFF) for peptides is presented. ABEEM PFF utilises a fluctuating charge model to explicitly describe...
Collapse
|
10
|
Behkamal B, Naghibzadeh M, Saberi MR, Tehranizadeh ZA, Pagnani A, Al Nasr K. Three-Dimensional Graph Matching to Identify Secondary Structure Correspondence of Medium-Resolution Cryo-EM Density Maps. Biomolecules 2021; 11:1773. [PMID: 34944417 PMCID: PMC8698881 DOI: 10.3390/biom11121773] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/18/2021] [Accepted: 11/20/2021] [Indexed: 01/15/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) is a structural technique that has played a significant role in protein structure determination in recent years. Compared to the traditional methods of X-ray crystallography and NMR spectroscopy, cryo-EM is capable of producing images of much larger protein complexes. However, cryo-EM reconstructions are limited to medium-resolution (~4-10 Å) for some cases. At this resolution range, a cryo-EM density map can hardly be used to directly determine the structure of proteins at atomic level resolutions, or even at their amino acid residue backbones. At such a resolution, only the position and orientation of secondary structure elements (SSEs) such as α-helices and β-sheets are observable. Consequently, finding the mapping of the secondary structures of the modeled structure (SSEs-A) to the cryo-EM map (SSEs-C) is one of the primary concerns in cryo-EM modeling. To address this issue, this study proposes a novel automatic computational method to identify SSEs correspondence in three-dimensional (3D) space. Initially, through a modeling of the target sequence with the aid of extracting highly reliable features from a generated 3D model and map, the SSEs matching problem is formulated as a 3D vector matching problem. Afterward, the 3D vector matching problem is transformed into a 3D graph matching problem. Finally, a similarity-based voting algorithm combined with the principle of least conflict (PLC) concept is developed to obtain the SSEs correspondence. To evaluate the accuracy of the method, a testing set of 25 experimental and simulated maps with a maximum of 65 SSEs is selected. Comparative studies are also conducted to demonstrate the superiority of the proposed method over some state-of-the-art techniques. The results demonstrate that the method is efficient, robust, and works well in the presence of errors in the predicted secondary structures of the cryo-EM images.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran;
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran;
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran; (M.R.S.); (Z.A.T.)
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Zeinab Amiri Tehranizadeh
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran; (M.R.S.); (Z.A.T.)
| | - Andrea Pagnani
- Politecnico di Torino, Corso Duca degli Abruzzi 24, I-10129 Torino, Italy;
- Italian Institute for Genomic Medicine, IRCCS Candiolo, SP-142, I-10060 Candiolo, Italy
- INFN, Sezione di Torino, I-10125 Torino, Italy
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
11
|
Mori T, Terashi G, Matsuoka D, Kihara D, Sugita Y. Efficient Flexible Fitting Refinement with Automatic Error Fixing for De Novo Structure Modeling from Cryo-EM Density Maps. J Chem Inf Model 2021; 61:3516-3528. [PMID: 34142833 PMCID: PMC9282639 DOI: 10.1021/acs.jcim.1c00230] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Structural modeling of proteins from cryo-electron microscopy (cryo-EM) density maps is one of the challenging issues in structural biology. De novo modeling combined with flexible fitting refinement (FFR) has been widely used to build a structure of new proteins. In de novo prediction, artificial conformations containing local structural errors such as chirality errors, cis peptide bonds, and ring penetrations are frequently generated and cannot be easily removed in the subsequent FFR. Moreover, refinement can be significantly suppressed due to the low mobility of atoms inside the protein. To overcome these problems, we propose an efficient scheme for FFR, in which the local structural errors are fixed first, followed by FFR using an iterative simulated annealing (SA) molecular dynamics protocol with the united atom (UA) model in an implicit solvent model; we call this scheme "SAUA-FFR". The best model is selected from multiple flexible fitting runs with various biasing force constants to reduce overfitting. We apply our scheme to the decoys obtained from MAINMAST and demonstrate an improvement of the best model of eight selected proteins in terms of the root-mean-square deviation, MolProbity score, and RWplus score compared to the original scheme of MAINMAST. Fixing the local structural errors can enhance the formation of secondary structures, and the UA model enables progressive refinement compared to the all-atom model owing to its high mobility in the implicit solvent. The SAUA-FFR scheme realizes efficient and accurate protein structure modeling from medium-resolution maps with less overfitting.
Collapse
Affiliation(s)
- Takaharu Mori
- RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, United States
| | - Daisuke Matsuoka
- RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, United States.,Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, United States
| | - Yuji Sugita
- RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan.,RIKEN Center for Computational Science, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.,RIKEN Center for Biosystems Dynamics Research, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan
| |
Collapse
|
12
|
Marzolf DR, Seffernick JT, Lindert S. Protein Structure Prediction from NMR Hydrogen-Deuterium Exchange Data. J Chem Theory Comput 2021; 17:2619-2629. [PMID: 33780620 DOI: 10.1021/acs.jctc.1c00077] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Amide hydrogen-deuterium exchange (HDX) has long been used to determine regional flexibility and binding sites in proteins; however, the data are too sparse for full structural characterization. Experiments that measure HDX rates, such as HDX-NMR, have far higher throughput compared to structure determination via X-ray crystallography, cryo-EM, or a full suite of NMR experiments. Data from HDX-NMR experiments encode information on the protein structure, making HDX a prime candidate to be supplemented by computational algorithms for protein structure prediction. We have developed a methodology to incorporate HDX-NMR data into ab initio protein structure prediction using the Rosetta software framework to predict structures based on experimental agreement. To demonstrate the efficacy of our algorithm, we examined 38 proteins with HDX-NMR data available, comparing the predicted model with and without the incorporation of HDX data into scoring. The root-mean-square deviation (rmsd, a measure of the average atomic distance between superimposed models) of the predicted model improved by 1.42 Å on average after incorporating the HDX-NMR data into scoring. The average rmsd improvement for the proteins where the selected model rmsd changed after incorporating HDX data was 3.63 Å, including one improvement of more than 11 Å and seven proteins improving by greater than 4 Å, with 12/15 proteins improving overall. Additionally, for independent verification, two proteins that were not part of the original benchmark were scored including HDX data, with a dramatic improvement of the selected model rmsd of nearly 9 Å for one of the proteins. Moreover, we have developed a confidence metric allowing us to successfully identify near-native models in the absence of a native structure. Improvement in model selection with a strong confidence measure demonstrates that protein structure prediction with HDX-NMR is a powerful tool which can be performed with minimal additional computational strain and expense.
Collapse
Affiliation(s)
- Daniel R Marzolf
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
13
|
Biehn SE, Lindert S. Accurate protein structure prediction with hydroxyl radical protein footprinting data. Nat Commun 2021; 12:341. [PMID: 33436604 PMCID: PMC7804018 DOI: 10.1038/s41467-020-20549-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 12/08/2020] [Indexed: 01/10/2023] Open
Abstract
Hydroxyl radical protein footprinting (HRPF) in combination with mass spectrometry reveals the relative solvent exposure of labeled residues within a protein, thereby providing insight into protein tertiary structure. HRPF labels nineteen residues with varying degrees of reliability and reactivity. Here, we are presenting a dynamics-driven HRPF-guided algorithm for protein structure prediction. In a benchmark test of our algorithm, usage of the dynamics data in a score term resulted in notable improvement of the root-mean-square deviations of the lowest-scoring ab initio models and improved the funnel-like metric Pnear for all benchmark proteins. We identified models with accurate atomic detail for three of the four benchmark proteins. This work suggests that HRPF data along with side chain dynamics sampled by a Rosetta mover ensemble can be used to accurately predict protein structure.
Collapse
Affiliation(s)
- Sarah E Biehn
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
14
|
Rosário-Ferreira N, Marques-Pereira C, Gouveia RP, Mourão J, Moreira IS. Guardians of the Cell: State-of-the-Art of Membrane Proteins from a Computational Point-of-View. Methods Mol Biol 2021; 2315:3-28. [PMID: 34302667 DOI: 10.1007/978-1-0716-1468-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Membrane proteins (MPs) encompass a large family of proteins with distinct cellular functions, and although representing over 50% of existing pharmaceutical drug targets, their structural and functional information is still very scarce. Over the last years, in silico analysis and algorithm development were essential to characterize MPs and overcome some limitations of experimental approaches. The optimization and improvement of these methods remain an ongoing process, with key advances in MPs' structure, folding, and interface prediction being continuously tackled. Herein, we discuss the latest trends in computational methods toward a deeper understanding of the atomistic and mechanistic details of MPs.
Collapse
Affiliation(s)
- Nícia Rosário-Ferreira
- Coimbra Chemistry Center, Department of Chemistry, University of Coimbra, Coimbra, Portugal.,Center for Neuroscience and Cell Biology, Center for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal
| | - Catarina Marques-Pereira
- Center for Neuroscience and Cell Biology, Center for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal.,PhD Programme in Experimental Biology and Biomedicine, Institute for Interdisciplinary Research (IIIUC), University of Coimbra, Coimbra, Portugal
| | - Raquel P Gouveia
- Center for Neuroscience and Cell Biology, Center for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal
| | - Joana Mourão
- Center for Neuroscience and Cell Biology, University of Coimbra, Coimbra, Portugal
| | - Irina S Moreira
- Department of Life Sciences, University of Coimbra, Coimbra, Portugal.
| |
Collapse
|
15
|
Seffernick JT, Lindert S. Hybrid methods for combined experimental and computational determination of protein structure. J Chem Phys 2020; 153:240901. [PMID: 33380110 PMCID: PMC7773420 DOI: 10.1063/5.0026025] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 11/10/2020] [Indexed: 02/04/2023] Open
Abstract
Knowledge of protein structure is paramount to the understanding of biological function, developing new therapeutics, and making detailed mechanistic hypotheses. Therefore, methods to accurately elucidate three-dimensional structures of proteins are in high demand. While there are a few experimental techniques that can routinely provide high-resolution structures, such as x-ray crystallography, nuclear magnetic resonance (NMR), and cryo-EM, which have been developed to determine the structures of proteins, these techniques each have shortcomings and thus cannot be used in all cases. However, additionally, a large number of experimental techniques that provide some structural information, but not enough to assign atomic positions with high certainty have been developed. These methods offer sparse experimental data, which can also be noisy and inaccurate in some instances. In cases where it is not possible to determine the structure of a protein experimentally, computational structure prediction methods can be used as an alternative. Although computational methods can be performed without any experimental data in a large number of studies, inclusion of sparse experimental data into these prediction methods has yielded significant improvement. In this Perspective, we cover many of the successes of integrative modeling, computational modeling with experimental data, specifically for protein folding, protein-protein docking, and molecular dynamics simulations. We describe methods that incorporate sparse data from cryo-EM, NMR, mass spectrometry, electron paramagnetic resonance, small-angle x-ray scattering, Förster resonance energy transfer, and genetic sequence covariation. Finally, we highlight some of the major challenges in the field as well as possible future directions.
Collapse
Affiliation(s)
- Justin T. Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
16
|
Kim DN, Gront D, Sanbonmatsu KY. Practical Considerations for Atomistic Structure Modeling with Cryo-EM Maps. J Chem Inf Model 2020; 60:2436-2442. [PMID: 32422044 PMCID: PMC7891309 DOI: 10.1021/acs.jcim.0c00090] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We describe common approaches to atomistic structure modeling with single particle analysis derived cryo-EM maps. Several strategies for atomistic model building and atomistic model fitting methods are discussed, including selection criteria and implementation procedures. In covering basic concepts and caveats, this short perspective aims to help facilitate active discussion between scientists at different levels with diverse backgrounds.
Collapse
Affiliation(s)
- Doo Nam Kim
- Computational Biology Team, Biological Science Division, Pacific Northwest National Laboratory, Richland, Washington, 99354, United States
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Karissa Y. Sanbonmatsu
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, 87545, United States
- New Mexico Consortium, Los Alamos, New Mexico, 87544, United States
| |
Collapse
|
17
|
Leelananda SP, Lindert S. Using NMR Chemical Shifts and Cryo-EM Density Restraints in Iterative Rosetta-MD Protein Structure Refinement. J Chem Inf Model 2020; 60:2522-2532. [PMID: 31872764 PMCID: PMC7262651 DOI: 10.1021/acs.jcim.9b00932] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Cryo-EM has become one of the prime methods for protein structure elucidation, frequently yielding density maps with near-atomic or medium resolution. If protein structures cannot be deduced unambiguously from the density maps, computational structure refinement tools are needed to generate protein structural models. We have previously developed an iterative Rosetta-MDFF protocol that used cryo-EM densities to refine protein structures. Here we show that, in addition to cryo-EM densities, incorporation of other experimental restraints into the Rosetta-MDFF protocol further improved refined structures. We used NMR chemical shift (CS) data integrated with cryo-EM densities in our hybrid protocol in both the Rosetta step and the molecular dynamics (MD) simulations step. In 15 out of 18 cases for all MD rounds, the refinement results obtained when density maps and NMR chemical shift data were used in combination outperformed those of density map-only refinement. Notably, the improvement in refinement was highest when medium and low-resolution density maps were used. With our hybrid method, the RMSDs of final models obtained were always better than the RMSDs obtained by our previous protocol with just density refinement for both medium (6.9 Å) and low (9 Å) resolution maps. For all the six test proteins with medium resolution density maps (6.9 Å), the final refined structure RMSDs were lower for the hybrid method than for the cryo-EM only refinement. The final refined RMSDs were less than 1.5 Å when our hybrid protocol was used with 4 Å density maps. For four out of the six proteins the final RMSDs were even less than 1 Å. This study demonstrates that by using a combination of cryo-EM and NMR restraints, it is possible to refine structures to atomic resolution, outperforming single restraint refinement. This hybrid protocol will be a valuable tool when only low-resolution cryo-EM density data and NMR chemical shift data are available to refine structures.
Collapse
Affiliation(s)
- Sumudu P. Leelananda
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210
| |
Collapse
|
18
|
Gulsevin A, Papke RL, Horenstein N. In Silico Modeling of the α7 Nicotinic Acetylcholine Receptor: New Pharmacological Challenges Associated with Multiple Modes of Signaling. Mini Rev Med Chem 2020; 20:841-864. [PMID: 32000651 PMCID: PMC8719523 DOI: 10.2174/1389557520666200130105256] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 11/06/2019] [Accepted: 11/07/2019] [Indexed: 12/13/2022]
Abstract
The α7 nicotinic acetylcholine receptor is a homopentameric ion-channel of the Cys-loop superfamily characterized by its low probability of opening, high calcium permeability, and rapid desensitization. The α7 receptor has been targeted for the treatment of the cognitive symptoms of schizophrenia, depression, and Alzheimer's disease, but it is also involved in inflammatory modulation as a part of the cholinergic anti-inflammatory pathway. Despite its functional importance, in silico studies of the α7 receptor cannot produce a general model explaining the structural features of receptor activation, nor predict the mode of action for various ligand classes. Two particular problems in modeling the α7 nAChR are the absence of a high-resolution structure and the presence of five potentially nonequivalent orthosteric ligand binding sites. There is wide variability regarding the templates used for homology modeling, types of ligands investigated, simulation methods, and simulation times. However, a systematic survey focusing on the methodological similarities and differences in modeling α7 has not been done. In this work, we make a critical analysis of the modeling literature of α7 nAChR by comparing the findings of computational studies with each other and with experimental studies under the main topics of structural studies, ligand binding studies, and comparisons with other nAChR. In light of our findings, we also summarize current problems in the field and make suggestions for future studies concerning modeling of the α7 receptor.
Collapse
Affiliation(s)
- Alican Gulsevin
- Department of Chemistry, University of Florida, P.O. Box 117200, Gainesville, FL, 32611-7200, United States
| | - Roger L Papke
- Department of Pharmacology and Therapeutics, University of Florida, P.O. Box 100267, Gainesville, FL 32610, United States
| | - Nicole Horenstein
- Department of Chemistry, University of Florida, P.O. Box 117200, Gainesville, FL, 32611-7200, United States
| |
Collapse
|
19
|
Kim DN, Moriarty NW, Kirmizialtin S, Afonine PV, Poon B, Sobolev OV, Adams PD, Sanbonmatsu K. Cryo_fit: Democratization of flexible fitting for cryo-EM. J Struct Biol 2019; 208:1-6. [PMID: 31279069 PMCID: PMC7112765 DOI: 10.1016/j.jsb.2019.05.012] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 05/31/2019] [Indexed: 12/18/2022]
Abstract
Cryo-electron microscopy (cryo-EM) is becoming a method of choice for describing native conformations of biomolecular complexes at high resolution. The rapid growth of cryo-EM in recent years has created a high demand for automated solutions, both in hardware and software. Flexible fitting of atomic models to three-dimensional (3D) cryo-EM reconstructions by molecular dynamics (MD) simulation is a popular technique but often requires technical expertise in computer simulation. This work introduces cryo_fit, a package for the automatic flexible fitting of atomic models in cryo-EM maps using MD simulation. The package is integrated with the Phenix software suite. The module was designed to automate the multiple steps of MD simulation in a reproducible manner, as well as facilitate refinement and validation through Phenix. Through the use of cryo_fit, scientists with little experience in MD simulation can produce high quality atomic models automatically and better exploit the potential of cryo-EM.
Collapse
Affiliation(s)
- Doo Nam Kim
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Nigel W Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Serdal Kirmizialtin
- Chemistry Program, Science Division, New York University, Abu Dhabi, United Arab Emirates
| | - Pavel V Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Billy Poon
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Oleg V Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Paul D Adams
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA; Department of Bioengineering, University of California Berkeley, Berkeley, CA, USA
| | - Karissa Sanbonmatsu
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, NM, USA; New Mexico Consortium, Los Alamos, NM, USA.
| |
Collapse
|
20
|
Barros EP, Schiffer JM, Vorobieva A, Dou J, Baker D, Amaro RE. Improving the Efficiency of Ligand-Binding Protein Design with Molecular Dynamics Simulations. J Chem Theory Comput 2019; 15:5703-5715. [PMID: 31442033 DOI: 10.1021/acs.jctc.9b00483] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Custom-designed ligand-binding proteins represent a promising class of macromolecules with exciting applications toward the design of new enzymes or the engineering of antibodies and small-molecule recruited proteins for therapeutic interventions. However, several challenges remain in designing a protein sequence such that the binding site organization results in high affinity interaction with a bound ligand. Here, we study the dynamics of explicitly solvated designed proteins through all-atom molecular dynamics (MD) simulations to gain insight into the causes that lead to the low affinity or instability of most of these designs, despite the prediction of their success by the computational design methodology. Simulations ranging from 500 to 1000 ns per replicate were conducted on 37 designed protein variants encompassing two distinct folds and a range of ligand affinities, resulting in more than 180 μs of combined sampling. The simulations provide retrospective insights into the properties affecting ligand affinity that can prove useful in guiding further steps of design optimization. Features indicate that entropic components are particularly important for affinity, which are not easily incorporated in the empirical models often used in design protocols. Additionally, we demonstrate that the application of machine learning approaches built upon the output from the simulations can help discriminate between successful and failed binders, such that MD could act as a screening step in protein design, resulting in a more efficient process.
Collapse
Affiliation(s)
| | - Jamie M Schiffer
- Janssen Pharmaceuticals, Inc. , San Diego , California 92121 , United States
| | | | | | | | | |
Collapse
|
21
|
Seffernick J, Harvey SR, Wysocki VH, Lindert S. Predicting Protein Complex Structure from Surface-Induced Dissociation Mass Spectrometry Data. ACS CENTRAL SCIENCE 2019; 5:1330-1341. [PMID: 31482115 PMCID: PMC6716128 DOI: 10.1021/acscentsci.8b00912] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Indexed: 05/23/2023]
Abstract
Recently, mass spectrometry (MS) has become a viable method for elucidation of protein structure. Surface-induced dissociation (SID), colliding multiply charged protein complexes or other ions with a surface, has been paired with native MS to provide useful structural information such as connectivity and topology for many different protein complexes. We recently showed that SID gives information not only on connectivity and topology but also on relative interface strengths. However, SID has not yet been coupled with computational structure prediction methods that could use the sparse information from SID to improve the prediction of quaternary structures, i.e., how protein subunits interact with each other to form complexes. Protein-protein docking, a computational method to predict the quaternary structure of protein complexes, can be used in combination with subunit structures from X-ray crystallography and NMR in situations where it is difficult to obtain an experimental structure of an entire complex. While de novo structure prediction can be successful, many studies have shown that inclusion of experimental data can greatly increase prediction accuracy. In this study, we show that the appearance energy (AE, defined as 10% fragmentation) extracted from SID can be used in combination with Rosetta to successfully evaluate protein-protein docking poses. We developed an improved model to predict measured SID AEs and incorporated this model into a scoring function that combines the RosettaDock scoring function with a novel SID scoring term, which quantifies agreement between experiments and structures generated from RosettaDock. As a proof of principle, we tested the effectiveness of these restraints on 57 systems using ideal SID AE data (AE determined from crystal structures using the predictive model). When theoretical AEs were used, the RMSD of the selected structure improved or stayed the same in 95% of cases. When experimental SID data were incorporated on a different set of systems, the method predicted near-native structures (less than 2 Å root-mean-square deviation, RMSD, from native) for 6/9 tested cases, while unrestrained RosettaDock (without SID data) only predicted 3/9 such cases. Score versus RMSD funnel profiles were also improved when SID data were included. Additionally, we developed a confidence measure to evaluate predicted model quality in the absence of a crystal structure.
Collapse
|
22
|
Seffernick JT, Ren H, Kim SS, Lindert S. Measuring Intrinsic Disorder and Tracking Conformational Transitions Using Rosetta ResidueDisorder. J Phys Chem B 2019; 123:7103-7112. [PMID: 31411026 DOI: 10.1021/acs.jpcb.9b04333] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Many proteins contain regions of intrinsic disorder, not folding into unique, stable conformations. Numerous experimental methods have been developed to measure the disorder of all or select residues. In the absence of experimental data, computational methods are often utilized to identify these disordered regions and thus gain a better understanding of both structure and function. Many freely available computational methods have been developed to predict regions of intrinsic disorder from the primary sequence of a protein, including our recently developed Rosetta ResidueDisorder. While these methods are very useful, they are only designed to predict intrinsic disorder from the sequence. However, it would be useful to have a method that could also measure intrinsic disorder directly from structure. Such a method might also be used to identify changes in the structure of systems that can transition from folded to unfolded or vice versa, even systems that are not intrinsically disordered. Here we extended the capabilities of Rosetta ResidueDisorder to measure the intrinsic disorder from the coordinates of a single conformation of a protein. As a proof of principle, we show that ResidueDisorder can measure the intrinsic disorder from the coordinates with a higher accuracy (69.2%) than when predicted from sequence (65.4%) using a benchmark set of 229 proteins, showing that intrinsic disorder can be measured accurately from single structures over a large range of intrinsic disorder (0-100%). Additionally, we used ResidueDisorder to analyze unfolding trajectories of 12 fast-folding, nonintrinsically disordered proteins generated using molecular dynamics (MD), specifically steered MD (SMD), high-temperature MD, and accelerated MD (aMD) as well as long-time scale folding/unfolding trajectories. Using ResidueDisorder, a clear correlation between RMSD with respect to the native structure and measured fraction of denatured residues was observed. Finally, we introduced methods to predict folding/unfolding transitions as well as a native-like structure in the absence of a crystal structure from folding/unfolding MD trajectories. Rosetta ResidueDisorder is available as an application in the Rosetta software suite with the addition of new capabilities for directly identifying denatured regions and predicting events.
Collapse
Affiliation(s)
- Justin T Seffernick
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| | - He Ren
- Department of Chemistry/Biochemistry , Oberlin College , Oberlin , Ohio 44074 , United States
| | - Stephanie S Kim
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| |
Collapse
|
23
|
Dutagaci B, Heo L, Feig M. Structure refinement of membrane proteins via molecular dynamics simulations. Proteins 2018; 86:738-750. [PMID: 29675899 PMCID: PMC6013386 DOI: 10.1002/prot.25508] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Revised: 04/09/2018] [Accepted: 04/14/2018] [Indexed: 12/12/2022]
Abstract
A refinement protocol based on physics-based techniques established for water soluble proteins is tested for membrane protein structures. Initial structures were generated by homology modeling and sampled via molecular dynamics simulations in explicit lipid bilayer and aqueous solvent systems. Snapshots from the simulations were selected based on scoring with either knowledge-based or implicit membrane-based scoring functions and averaged to obtain refined models. The protocol resulted in consistent and significant refinement of the membrane protein structures similar to the performance of refinement methods for soluble proteins. Refinement success was similar between sampling in the presence of lipid bilayers and aqueous solvent but the presence of lipid bilayers may benefit the improvement of lipid-facing residues. Scoring with knowledge-based functions (DFIRE and RWplus) was found to be as good as scoring using implicit membrane-based scoring functions suggesting that differences in internal packing is more important than orientations relative to the membrane during the refinement of membrane protein homology models.
Collapse
Affiliation(s)
- Bercem Dutagaci
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Lim Heo
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
24
|
Aprahamian ML, Chea EE, Jones LM, Lindert S. Rosetta Protein Structure Prediction from Hydroxyl Radical Protein Footprinting Mass Spectrometry Data. Anal Chem 2018; 90:7721-7729. [PMID: 29874044 DOI: 10.1021/acs.analchem.8b01624] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In recent years mass spectrometry-based covalent labeling techniques such as hydroxyl radical footprinting (HRF) have emerged as valuable structural biology techniques, yielding information on protein tertiary structure. These data, however, are not sufficient to predict protein structure unambiguously, as they provide information only on the relative solvent exposure of certain residues. Despite some recent advances, no software currently exists that can utilize covalent labeling mass spectrometry data to predict protein tertiary structure. We have developed the first such tool, which incorporates mass spectrometry derived protection factors from HRF labeling as a new centroid score term for the Rosetta scoring function to improve the prediction of protein tertiary structures. We tested our method on a set of four soluble benchmark proteins with known crystal structures and either published HRF experimental results or internally acquired data. Using the HRF labeling data, we rescored large decoy sets of structures predicted with Rosetta for each of the four benchmark proteins. As a result, the model quality improved for all benchmark proteins as compared to when scored with Rosetta alone. For two of the four proteins we were even able to identify atomic resolution models with the addition of HRF data.
Collapse
Affiliation(s)
- Melanie L Aprahamian
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| | - Emily E Chea
- Department of Pharmaceutical Sciences , University of Maryland , Baltimore , Maryland 21201 , United States
| | - Lisa M Jones
- Department of Pharmaceutical Sciences , University of Maryland , Baltimore , Maryland 21201 , United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| |
Collapse
|
25
|
Kim SS, Seffernick JT, Lindert S. Accurately Predicting Disordered Regions of Proteins Using Rosetta ResidueDisorder Application. J Phys Chem B 2018; 122:3920-3930. [PMID: 29595057 DOI: 10.1021/acs.jpcb.8b01763] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Although many proteins necessitate well-folded structures to properly instigate their biological functions, a large fraction of functioning proteins contain regions-known as intrinsically disordered protein regions-where stable structures are not likely to form. Notable functional roles of intrinsically disordered proteins are in transcriptional regulation, translation, and cellular signal transduction. Moreover, intrinsically disordered protein regions are highly abundant in many proteins associated with various human diseases, therefore these segments have become attractive drug targets for potential therapeutics. Over the past decades, numerous computational methods have been developed to accurately predict disordered regions of proteins. Here we introduce a user-friendly and reliable approach for the prediction of disordered protein regions using the structure prediction software Rosetta. Using 245 proteins from a benchmark data set (16 DisProt database proteins) and a test data set (229 proteins with NMR data), we use Rosetta to predict the global protein structures and then show that there is a statistically significant difference between Rosetta scores in disordered and ordered regions, with scores being less favorable in disordered regions. Furthermore, the difference in scores between ordered and disordered protein regions is sufficient to accurately identify disordered protein regions. As a result, our Rosetta ResidueDisorder method (benchmark data set prediction accuracy of 71.77% and independent test data set prediction accuracy of 65.37%) outperformed other established disorder prediction tools and did not exhibit a biased prediction toward either ordered or disordered regions. To facilitate usage, a Rosetta application has been developed for the Rosetta ResidueDisorder method.
Collapse
Affiliation(s)
- Stephanie S Kim
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry , Ohio State University , Columbus , Ohio 43210 , United States
| |
Collapse
|
26
|
Ludwiczak J, Jarmula A, Dunin-Horkawicz S. Combining Rosetta with molecular dynamics (MD): A benchmark of the MD-based ensemble protein design. J Struct Biol 2018; 203:54-61. [PMID: 29454111 DOI: 10.1016/j.jsb.2018.02.004] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2017] [Revised: 01/25/2018] [Accepted: 02/13/2018] [Indexed: 01/15/2023]
Abstract
Computational protein design is a set of procedures for computing amino acid sequences that will fold into a specified structure. Rosetta Design, a commonly used software for protein design, allows for the effective identification of sequences compatible with a given backbone structure, while molecular dynamics (MD) simulations can thoroughly sample near-native conformations. We benchmarked a procedure in which Rosetta design is started on MD-derived structural ensembles and showed that such a combined approach generates 20-30% more diverse sequences than currently available methods with only a slight increase in computation time. Importantly, the increase in diversity is achieved without a loss in the quality of the designed sequences assessed by their resemblance to natural sequences. We demonstrate that the MD-based procedure is also applicable to de novo design tasks started from backbone structures without any sequence information. In addition, we implemented a protocol that can be used to assess the stability of designed models and to select the best candidates for experimental validation. In sum our results demonstrate that the MD ensemble-based flexible backbone design can be a viable method for protein design, especially for tasks that require a large pool of diverse sequences.
Collapse
Affiliation(s)
- Jan Ludwiczak
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland; Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, Pasteura 3, 02-093 Warsaw, Poland
| | - Adam Jarmula
- Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, Pasteura 3, 02-093 Warsaw, Poland
| | - Stanislaw Dunin-Horkawicz
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland.
| |
Collapse
|