1
|
Tessmer MH, Stoll S. A novel approach to modeling side chain ensembles of the bifunctional spin label RX. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.24.542139. [PMID: 37292623 PMCID: PMC10245940 DOI: 10.1101/2023.05.24.542139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
We introduce a novel approach to modeling side chain ensembles of bifunctional spin labels. This approach utilizes rotamer libraries to generate side chain conformational ensembles. Because the bifunctional label is constrained by two attachment sites, the label is split into two monofunctional rotamers which are first attached to their respective sites, then rejoined by a local optimization in dihedral space. We validate this method against a set of previously published experimental data using the bifunctional spin label, RX. This method is relatively fast and can readily be used for both experimental analysis and protein modeling, providing significant advantages over modeling bifunctional labels with molecular dynamics simulations. Use of bifunctional labels for site directed spin labeling (SDSL) electron paramagnetic resonance (EPR) spectroscopy dramatically reduces label mobility, which can significantly improve resolution of small changes in protein backbone structure and dynamics. Coupling the use of bifunctional labels with side chain modeling methods allows for improved quantitative application of experimental SDSL EPR data to protein modeling.
Collapse
Affiliation(s)
- Maxx H. Tessmer
- Department of Chemistry, University of Washington, Seattle, WA 98103, United States
| | - Stefan Stoll
- Department of Chemistry, University of Washington, Seattle, WA 98103, United States
| |
Collapse
|
2
|
Childers MC, Daggett V. Molecular Dynamics Methods for Antibody Design. Methods Mol Biol 2023; 2552:109-124. [PMID: 36346588 DOI: 10.1007/978-1-0716-2609-2_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Complex and coordinated dynamics are closely connected with protein functions, including the binding of antibodies to antigens. Knowledge of such dynamics could improve the design of antibodies. Molecular dynamics (MD) simulations provide a "computational microscope" that can resolve atomic motions and inform antibody design efforts.
Collapse
Affiliation(s)
| | - Valerie Daggett
- Department of Bioengineering, University of Washington, Seattle, WA, USA.
| |
Collapse
|
3
|
López-Blanco JR, Dehouck Y, Bastolla U, Chacón P. Local Normal Mode Analysis for Fast Loop Conformational Sampling. J Chem Inf Model 2022; 62:4561-4568. [PMID: 36099639 PMCID: PMC9516680 DOI: 10.1021/acs.jcim.2c00870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
We propose and validate
a novel method to efficiently
explore local
protein loop conformations based on a new formalism for constrained
normal mode analysis (NMA) in internal coordinates. The manifold of
possible loop configurations imposed by the position and orientation
of the fixed loop ends is reduced to an orthogonal set of motions
(or modes) encoding concerted rotations of all the backbone dihedral
angles. We validate the sampling power on a set of protein loops with
highly variable experimental structures and demonstrate that our approach
can efficiently explore the conformational space of closed loops.
We also show an acceptable resemblance of the ensembles around equilibrium
conformations generated by long molecular simulations and constrained
NMA on a set of exposed and diverse loops. In comparison with other
methods, the main advantage is the lack of restrictions on the number
of dihedrals that can be altered simultaneously. Furthermore, the
method is computationally efficient since it only requires the diagonalization
of a tiny matrix, and the modes of motions are energetically contextualized
by the elastic network model, which includes both the loop and the
neighboring residues.
Collapse
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Physical Chemistry, Rocasolano Institute of Physical Chemistry, CSIC, Serrano 119, 28006 Madrid, Spain
| | - Yves Dehouck
- Centro de Biología Molecular "Severo Ochoa," CSIC-UAM, Cantoblanco, 28049 Madrid, Spain
| | - Ugo Bastolla
- Centro de Biología Molecular "Severo Ochoa," CSIC-UAM, Cantoblanco, 28049 Madrid, Spain
| | - Pablo Chacón
- Department of Biological Physical Chemistry, Rocasolano Institute of Physical Chemistry, CSIC, Serrano 119, 28006 Madrid, Spain
| |
Collapse
|
4
|
Varghese PM, Mukherjee S, Al-Mohanna FA, Saleh SM, Almajhdi FN, Beirag N, Alkahtani SH, Rajkumari R, Nal Rogier B, Sim RB, Idicula-Thomas S, Madan T, Murugaiah V, Kishore U. Human Properdin Released By Infiltrating Neutrophils Can Modulate Influenza A Virus Infection. Front Immunol 2021; 12:747654. [PMID: 34956182 PMCID: PMC8695448 DOI: 10.3389/fimmu.2021.747654] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 11/08/2021] [Indexed: 11/13/2022] Open
Abstract
The complement system is designed to recognise and eliminate invading pathogens via activation of classical, alternative and lectin pathways. Human properdin stabilises the alternative pathway C3 convertase, resulting in an amplification loop that leads to the formation of C5 convertase, thereby acting as a positive regulator of the alternative pathway. It has been noted that human properdin on its own can operate as a pattern recognition receptor and exert immune functions outside its involvement in complement activation. Properdin can bind directly to microbial targets via DNA, sulfatides and glycosaminoglycans, apoptotic cells, nanoparticles, and well-known viral virulence factors. This study was aimed at investigating the complement-independent role of properdin against Influenza A virus infection. As one of the first immune cells to arrive at the site of IAV infection, we show here that IAV challenged neutrophils released properdin in a time-dependent manner. Properdin was found to directly interact with haemagglutinin, neuraminidase and matrix 1 protein Influenza A virus proteins in ELISA and western blot. Furthermore, modelling studies revealed that properdin could bind HA and NA of the H1N1 subtype with higher affinity compared to that of H3N2 due to the presence of an HA cleavage site in H1N1. In an infection assay using A549 cells, properdin suppressed viral replication in pH1N1 subtype while promoting replication of H3N2 subtype, as revealed by qPCR analysis of M1 transcripts. Properdin treatment triggered an anti-inflammatory response in H1N1-challenged A549 cells and a pro-inflammatory response in H3N2-infected cells, as evident from differential mRNA expression of TNF-α, NF-κB, IFN-α, IFN-β, IL-6, IL-12 and RANTES. Properdin treatment also reduced luciferase reporter activity in MDCK cells transduced with H1N1 pseudotyped lentiviral particles; however, it was increased in the case of pseudotyped H3N2 particles. Collectively, we conclude that infiltrating neutrophils at the site of IAV infection can release properdin, which then acts as an entry inhibitor for pandemic H1N1 subtype while suppressing viral replication and inducing an anti-inflammatory response. H3N2 subtype can escape this immune restriction due to altered haemagglutinin and neuraminindase, leading to enhanced viral entry, replication and pro-inflammatory response. Thus, depending on the subtype, properdin can either limit or aggravate IAV infection in the host.
Collapse
Affiliation(s)
- Praveen M Varghese
- Biosciences, College of Health, Medicine and Life Sciences, Brunel University London, Uxbridge, United Kingdom.,School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Shuvechha Mukherjee
- Biomedical Informatics Centre, Indian Council of Medical Research (ICMR)-National Institute for Research in Reproductive Health, Mumbai, India
| | - Futwan A Al-Mohanna
- Department of Cell Biology, King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia
| | - Souad M Saleh
- Department of Cell Biology, King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia
| | - Fahad N Almajhdi
- Department of Botany and Microbiology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Nazar Beirag
- Biosciences, College of Health, Medicine and Life Sciences, Brunel University London, Uxbridge, United Kingdom
| | - Saad H Alkahtani
- Department of Zoology, College of Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Reena Rajkumari
- School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Beatrice Nal Rogier
- INSERM U1104 Centre d'immunologie de Marseille-Luminy (CIML), Marseille, France
| | - Robert B Sim
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | - Susan Idicula-Thomas
- Biomedical Informatics Centre, Indian Council of Medical Research (ICMR)-National Institute for Research in Reproductive Health, Mumbai, India
| | - Taruna Madan
- Department of Innate Immunity, Indian Council of Medical Research (ICMR)-National Institute for Research in Reproductive Health, Mumbai, India
| | - Valarmathy Murugaiah
- Biosciences, College of Health, Medicine and Life Sciences, Brunel University London, Uxbridge, United Kingdom
| | - Uday Kishore
- Biosciences, College of Health, Medicine and Life Sciences, Brunel University London, Uxbridge, United Kingdom
| |
Collapse
|
5
|
Del Alamo D, Fischer AW, Moretti R, Alexander NS, Mendenhall J, Hyman NJ, Meiler J. Efficient Sampling of Protein Loop Regions Using Conformational Hashing Complemented with Random Coordinate Descent. J Chem Theory Comput 2021; 17:560-570. [PMID: 33373213 DOI: 10.1021/acs.jctc.0c00836] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
De novo construction of loop regions is an important problem in computational structural biology. Compared to regions with well-defined secondary structure, loops tend to exhibit significant conformational heterogeneity. As a result, their structures are often ambiguous when determined using experimental data obtained by crystallography, cryo-EM, or NMR. Although structurally diverse models could provide a more relevant representation of proteins in their native states, obtaining large numbers of biophysically realistic and physiologically relevant loop conformations is a resource-consuming task. To address this need, we developed a novel loop construction algorithm, Hash/RCD, that combines knowledge-based conformational hashing with random coordinate descent (RCD). This hybrid approach achieved a closure rate of 100% on a benchmark set of 195 loops in 29 proteins that range from 3 to 31 residues. More importantly, the use of templates allows Hash/RCD to maintain the accuracy of state-of-the-art coordinate descent methods while reducing sampling time from over 400 to 141 ms. These results highlight how the integration of coordinate descent with knowledge-based sampling overcomes barriers inherent to either approach in isolation. This method may facilitate the identification of native-like loop conformations using experimental data or full-atom scoring functions by allowing rapid sampling of large numbers of loops. In this manuscript, we investigate and discuss the advantages, bottlenecks, and limitations of combining conformational hashing with RCD. By providing a detailed technical description of the Hash/RCD algorithm, we hope to facilitate its implementation by other researchers.
Collapse
Affiliation(s)
- Diego Del Alamo
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States
| | - Axel W Fischer
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States
| | - Rocco Moretti
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States
| | - Nathan S Alexander
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States
| | - Jeffrey Mendenhall
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States
| | - Nicholas J Hyman
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States
| | - Jens Meiler
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, 37235 Tennessee, United States.,Institut for Drug Discovery, Leipzig University, Leipzig SAC 04103, Germany
| |
Collapse
|
6
|
Barozet A, Molloy K, Vaisset M, Siméon T, Cortés J. A reinforcement-learning-based approach to enhance exhaustive protein loop sampling. Bioinformatics 2020; 36:1099-1106. [PMID: 31504192 DOI: 10.1093/bioinformatics/btz684] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 08/23/2019] [Accepted: 08/28/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Loop portions in proteins are involved in many molecular interaction processes. They often exhibit a high degree of flexibility, which can be essential for their function. However, molecular modeling approaches usually represent loops using a single conformation. Although this conformation may correspond to a (meta-)stable state, it does not always provide a realistic representation. RESULTS In this paper, we propose a method to exhaustively sample the conformational space of protein loops. It exploits structural information encoded in a large library of three-residue fragments, and enforces loop-closure using a closed-form inverse kinematics solver. A novel reinforcement-learning-based approach is applied to accelerate sampling while preserving diversity. The performance of our method is showcased on benchmark datasets involving 9-, 12- and 15-residue loops. In addition, more detailed results presented for streptavidin illustrate the ability of the method to exhaustively sample the conformational space of loops presenting several meta-stable conformations. AVAILABILITY AND IMPLEMENTATION We are developing a software package called MoMA (for Molecular Motion Algorithms), which includes modeling tools and algorithms to sample conformations and transition paths of biomolecules, including the application described in this work. The binaries can be provided upon request and a web application will also be implemented in the short future. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Amélie Barozet
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse 31400, France.,Sanofi Recherche & Développement, Integrated Drug Discovery, Molecular Design Sciences, Vitry-sur-Seine Cedex 94403, France
| | - Kevin Molloy
- Department of Computer Science, Department of Biology, James Madison University, Harrisonburg, VA 22807, USA
| | - Marc Vaisset
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse 31400, France
| | - Thierry Siméon
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse 31400, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse 31400, France
| |
Collapse
|
7
|
A comparative multivariate analysis of nitrilase enzymes: An ensemble based computational approach. Comput Biol Chem 2019; 83:107095. [DOI: 10.1016/j.compbiolchem.2019.107095] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 06/20/2019] [Accepted: 07/11/2019] [Indexed: 12/20/2022]
|
8
|
Riziotis IG, Glykos NM. On the presence of short‐range periodicities in protein structures that are not related to established secondary structure elements. Proteins 2019; 87:966-978. [DOI: 10.1002/prot.25758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Revised: 05/21/2019] [Accepted: 06/07/2019] [Indexed: 11/06/2022]
Affiliation(s)
- Ioannis G. Riziotis
- Department of Molecular Biology and GeneticsDemocritus University of Thrace, University campus Alexandroupolis Greece
| | - Nicholas M. Glykos
- Department of Molecular Biology and GeneticsDemocritus University of Thrace, University campus Alexandroupolis Greece
| |
Collapse
|
9
|
Nguyen SP, Li Z, Xu D, Shang Y. New Deep Learning Methods for Protein Loop Modeling. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:596-606. [PMID: 29990046 PMCID: PMC6580050 DOI: 10.1109/tcbb.2017.2784434] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Computational protein structure prediction is a long-standing challenge in bioinformatics. In the process of predicting protein 3D structures, it is common that parts of an experimental structure are missing or parts of a predicted structure need to be remodeled. The process of predicting local protein structures of particular regions is called loop modeling. In this paper, five new loop modeling methods based on machine learning techniques, called NearLooper, ConLooper, ResLooper, HyLooper1, and HyLooper2 are proposed. NearLooper is based on the nearest neighbor technique. ConLooper applies deep convolutional neural networks to predict ${\mathrm{C}}_{{{\alpha }}}$Cα atoms distance matrix as an orientation-independent representation of protein structure. ResLooper uses residual neural networks instead of deep convolutional neural networks. HyLooper1 combines the results of NearLooper and ConLooper while HyLooper2 combines NearLooper and ResLooper. Three commonly used benchmarks for loop modeling are used to compare the performance between these methods and existing state-of-the-art methods. The experiment results show promising performance in which our best method improves existing state-of-the-art methods by 28 and 54 percent of average RMSD on two datasets while being comparable on the other one.
Collapse
|
10
|
Kundert K, Kortemme T. Computational design of structured loops for new protein functions. Biol Chem 2019; 400:275-288. [PMID: 30676995 PMCID: PMC6530579 DOI: 10.1515/hsz-2018-0348] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 12/18/2018] [Indexed: 12/20/2022]
Abstract
The ability to engineer the precise geometries, fine-tuned energetics and subtle dynamics that are characteristic of functional proteins is a major unsolved challenge in the field of computational protein design. In natural proteins, functional sites exhibiting these properties often feature structured loops. However, unlike the elements of secondary structures that comprise idealized protein folds, structured loops have been difficult to design computationally. Addressing this shortcoming in a general way is a necessary first step towards the routine design of protein function. In this perspective, we will describe the progress that has been made on this problem and discuss how recent advances in the field of loop structure prediction can be harnessed and applied to the inverse problem of computational loop design.
Collapse
Affiliation(s)
- Kale Kundert
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
| | - Tanja Kortemme
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| |
Collapse
|
11
|
Bansal N, Zheng Z, Song LF, Pei J, Merz KM. The Role of the Active Site Flap in Streptavidin/Biotin Complex Formation. J Am Chem Soc 2018; 140:5434-5446. [PMID: 29607642 DOI: 10.1021/jacs.8b00743] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Obtaining a detailed description of how active site flap motion affects substrate or ligand binding will advance structure-based drug design (SBDD) efforts on systems including the kinases, HSP90, HIV protease, ureases, etc. Through this understanding, we will be able to design better inhibitors and better proteins that have desired functions. Herein we address this issue by generating the relevant configurational states of a protein flap on the molecular energy landscape using an approach we call MTFlex-b and then following this with a procedure to estimate the free energy associated with the motion of the flap region. To illustrate our overall workflow, we explored the free energy changes in the streptavidin/biotin system upon introducing conformational flexibility in loop3-4 in the biotin unbound ( apo) and bound ( holo) state. The free energy surfaces were created using the Movable Type free energy method, and for further validation, we compared them to potential of mean force (PMF) generated free energy surfaces using MD simulations employing the FF99SBILDN and FF14SB force fields. We also estimated the free energy thermodynamic cycle using an ensemble of closed-like and open-like end states for the ligand unbound and bound states and estimated the binding free energy to be approximately -16.2 kcal/mol (experimental -18.3 kcal/mol). The good agreement between MTFlex-b in combination with the MT method with experiment and MD simulations supports the effectiveness of our strategy in obtaining unique insights into the motions in proteins that can then be used in a range of biological and biomedical applications.
Collapse
Affiliation(s)
- Nupur Bansal
- Department of Chemistry and Department of Biochemistry and Molecular Biology , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Zheng Zheng
- Department of Chemistry and Department of Biochemistry and Molecular Biology , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Lin Frank Song
- Department of Chemistry and Department of Biochemistry and Molecular Biology , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Jun Pei
- Department of Chemistry and Department of Biochemistry and Molecular Biology , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Kenneth M Merz
- Department of Chemistry and Department of Biochemistry and Molecular Biology , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States.,Institute for Cyber Enabled Research , Michigan State University , 567 Wilson Road , East Lansing , Michigan 48824 , United States
| |
Collapse
|
12
|
Marks C, Nowak J, Klostermann S, Georges G, Dunbar J, Shi J, Kelm S, Deane CM. Sphinx: merging knowledge-based and ab initio approaches to improve protein loop prediction. Bioinformatics 2018; 33:1346-1353. [PMID: 28453681 PMCID: PMC5408792 DOI: 10.1093/bioinformatics/btw823] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Accepted: 01/09/2017] [Indexed: 01/31/2023] Open
Abstract
Motivation Loops are often vital for protein function, however, their irregular structures make them difficult to model accurately. Current loop modelling algorithms can mostly be divided into two categories: knowledge-based, where databases of fragments are searched to find suitable conformations and ab initio, where conformations are generated computationally. Existing knowledge-based methods only use fragments that are the same length as the target, even though loops of slightly different lengths may adopt similar conformations. Here, we present a novel method, Sphinx, which combines ab initio techniques with the potential extra structural information contained within loops of a different length to improve structure prediction. Results We show that Sphinx is able to generate high-accuracy predictions and decoy sets enriched with near-native loop conformations, performing better than the ab initio algorithm on which it is based. In addition, it is able to provide predictions for every target, unlike some knowledge-based methods. Sphinx can be used successfully for the difficult problem of antibody H3 prediction, outperforming RosettaAntibody, one of the leading H3-specific ab initio methods, both in accuracy and speed. Availability and Implementation Sphinx is available at http://opig.stats.ox.ac.uk/webapps/sphinx. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Claire Marks
- Department of Statistics, University of Oxford, Oxford, UK
| | - Jaroslaw Nowak
- Department of Statistics, University of Oxford, Oxford, UK
| | | | - Guy Georges
- Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Penzberg, DE, Germany
| | - James Dunbar
- Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Penzberg, DE, Germany
| | - Jiye Shi
- Department of Informatics, UCB Pharma, Slough, UK
| | | | | |
Collapse
|
13
|
Wong SWK, Liu JS, Kou SC. Fast de novo discovery of low-energy protein loop conformations. Proteins 2017; 85:1402-1412. [PMID: 28378911 DOI: 10.1002/prot.25300] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2017] [Revised: 03/19/2017] [Accepted: 03/27/2017] [Indexed: 12/25/2022]
Abstract
In the prediction of protein structure from amino acid sequence, loops are challenging regions for computational methods. Since loops are often located on the protein surface, they can have significant roles in determining protein functions and binding properties. Loop prediction without the aid of a structural template requires extensive conformational sampling and energy minimization, which are computationally difficult. In this article we present a new de novo loop sampling method, the Parallely filtered Energy Targeted All-atom Loop Sampler (PETALS) to rapidly locate low energy conformations. PETALS explores both backbone and side-chain positions of the loop region simultaneously according to the energy function selected by the user, and constructs a nonredundant ensemble of low energy loop conformations using filtering criteria. The method is illustrated with the DFIRE potential and DiSGro energy function for loops, and shown to be highly effective at discovering conformations with near-native (or better) energy. Using the same energy function as the DiSGro algorithm, PETALS samples conformations with both lower RMSDs and lower energies. PETALS is also useful for assessing the accuracy of different energy functions. PETALS runs rapidly, requiring an average time cost of 10 minutes for a length 12 loop on a single 3.2 GHz processor core, comparable to the fastest existing de novo methods for generating an ensemble of conformations. Proteins 2017; 85:1402-1412. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Samuel W K Wong
- Department of Statistics, University of Florida, Gainesville, Florida, 32611
| | - Jun S Liu
- Department of Statistics, Harvard University, Cambridge, Massachusetts, 02138
| | - S C Kou
- Department of Statistics, Harvard University, Cambridge, Massachusetts, 02138
| |
Collapse
|
14
|
Abstract
Comparative protein structure modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and how to use the ModBase database of such models, and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. © 2016 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Benjamin Webb
- University of California at San Francisco, San Francisco, California
| | - Andrej Sali
- University of California at San Francisco, San Francisco, California
| |
Collapse
|
15
|
Lubbe L, Sewell BT, Sturrock ED. The influence of angiotensin converting enzyme mutations on the kinetics and dynamics of N-domain selective inhibition. FEBS J 2016; 283:3941-3961. [PMID: 27636235 DOI: 10.1111/febs.13900] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Revised: 09/06/2016] [Accepted: 09/14/2016] [Indexed: 11/29/2022]
Abstract
Angiotensin-1-converting enzyme (ACE) is a zinc metalloprotease that plays a major role in blood pressure regulation via the renin-angiotensin-aldosterone system. ACE consists of two domains with differences in inhibitor binding affinities despite their 90% active site identity. While the C-domain primarily controls blood pressure, the N-domain is selective for cleavage of the antifibrotic N-acetyl-Ser-Asp-Lys-Pro. Inhibitors, such as 33RE, that selectively bind to the N-domain thus show potential for treating fibrosis without affecting blood pressure. The aim of this study was to elucidate the molecular mechanism of this selectivity. ACE inhibition by 33RE was characterized using a continuous kinetic assay with fluorogenic substrate. The N-domain displayed nanomolar (Ki = 11.21 ± 0.74 nm) and the C-domain micromolar (Ki = 11 278 ± 410 nm) inhibition, thus 1000-fold selectivity. Residues predicted to contribute to selectivity based on the N-domain-33RE co-crystal structure were subsequently mutated to their C-domain counterparts. S2 subsite mutation with resulting loss of a hydrogen bond drastically decreased the affinity (Ki = 2 794 ± 156 nm), yet did not entirely account for selectivity. Additional substitution of all unique S2 ' residues, however, completely abolished selectivity (Ki = 10 009 ± 157 nm). Interestingly, these residues do not directly bind 33RE. All mutants were therefore subjected to molecular dynamics simulations in the presence and absence of 33RE. Trajectory analyses highlighted the importance of these S2 ' residues in formation of a favourable interface between the ACE subdomains and thus a closed, ligand-bound complex. This study provides a molecular basis for the intersubsite synergism governing 33RE's 1000-fold N-selectivity and aids the future design of novel inhibitors for fibrosis treatment. ENZYMES Angiotensin converting enzyme (ACE, EC 3.4.15.1).
Collapse
Affiliation(s)
- Lizelle Lubbe
- Department of Integrative Biomedical Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, South Africa
| | - Brian T Sewell
- Department of Integrative Biomedical Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, South Africa.,Structural Biology Research Unit, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, South Africa
| | - Edward D Sturrock
- Department of Integrative Biomedical Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, South Africa
| |
Collapse
|
16
|
Kolodny R, Guibas L, Levitt M, Koehl P. Inverse Kinematics in Biology: The Protein Loop Closure Problem. Int J Rob Res 2016. [DOI: 10.1177/0278364905050352] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Assembling fragments from known protein structures is a widely used approach to construct structural models for new proteins. We describe an application of this idea to an important inverse kinematics problem in structural biology: the loop closure problem. We have developed an algorithm for generating the conformations of candidate loops that fit in a gap of given length in a protein structure framework. Our method proceeds by concatenating small fragments of protein chosen from small libraries of representative fragments. Our approach has the advantages of ab initio methods since we are able to enumerate all candidate loops in the discrete approximation of the conformational space accessible to the loop, as well as the advantages of database search approach since the use of fragments of known protein structures guarantees that the backbone conformations are physically reasonable. We test our approach on a set of 427 loops, varying in length from four residues to 14 residues. The quality of the candidate loops is evaluated in terms of global coordinate root mean square (cRMS). The top predictions vary between 0.3 and 4.2 Å for four-residue loops and between 1.5 and 3.1 Å for 14-residue loops, respectively.
Collapse
Affiliation(s)
- Rachel Kolodny
- Department of Structural Biology and Computer Science Department, Stanford University, Stanford, CA 94305, USA,
| | - Leonidas Guibas
- Computer Science Department, Stanford University, Stanford, CA 94305, USA
| | - Michael Levitt
- Department of Structural Biology, Stanford University, Stanford, CA 94305, USA
| | - Patrice Koehl
- Department of Structural Biology, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
17
|
Webb B, Sali A. Comparative Protein Structure Modeling Using MODELLER. CURRENT PROTOCOLS IN BIOINFORMATICS 2016; 54:5.6.1-5.6.37. [PMID: 27322406 PMCID: PMC5031415 DOI: 10.1002/cpbi.3] [Citation(s) in RCA: 1908] [Impact Index Per Article: 238.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Comparative protein structure modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and how to use the ModBase database of such models, and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. © 2016 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Benjamin Webb
- University of California at San Francisco, San Francisco, California
| | - Andrej Sali
- University of California at San Francisco, San Francisco, California
| |
Collapse
|
18
|
Constrained cyclic coordinate descent for cryo-EM images at medium resolutions: beyond the protein loop closure problem. ROBOTICA 2016; 34:1777-1790. [DOI: 10.1017/s0263574716000242] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
SUMMARYThe cyclic coordinate descent (CCD) method is a popular loop closure method in protein structure modeling. It is a robotics algorithm originally developed for inverse kinematic applications. We demonstrate an effective method of building the backbone of protein structure models using the principle of CCD and a guiding trace. For medium-resolution 3-dimensional (3D) images derived using cryo-electron microscopy (cryo-EM), it is possible to obtain guiding traces of secondary structures and their skeleton connections. Our new method, constrained cyclic coordinate descent (CCCD), builds α-helices, β-strands, and loops quickly and fairly accurately along predefined traces. We show that it is possible to build the entire backbone of a protein fairly accurately when the guiding traces are accurate. In a test of 10 proteins, the models constructed using CCCD show an average of 3.91 Å of backbone root mean square deviation (RMSD). When the CCCD method is incorporated in a simulated annealing framework to sample possible shift, translation, and rotation freedom, the models built with the true topology were ranked high on the list, with an average backbone RMSD100 of 3.76 Å. CCCD is an effective method for modeling atomic structures after secondary structure traces and skeletons are extracted from 3D cryo-EM images.
Collapse
|
19
|
Arora B, Coudrat T, Wootten D, Christopoulos A, Noronha SB, Sexton PM. Prediction of Loops in G Protein-Coupled Receptor Homology Models: Effect of Imprecise Surroundings and Constraints. J Chem Inf Model 2016; 56:671-86. [DOI: 10.1021/acs.jcim.5b00554] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Bhumika Arora
- Department
of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400076, India
- Department
of Pharmacology, Monash University, Clayton, Victoria 3800, Australia
- IITB−Monash
Research Academy, IIT Bombay, Mumbai 400076, India
| | - Thomas Coudrat
- Drug
Discovery Biology, Monash Institute of Pharmaceutical Sciences, and
Department of Pharmacology, Monash University, Parkville, Victoria 3052, Australia
| | - Denise Wootten
- Drug
Discovery Biology, Monash Institute of Pharmaceutical Sciences, and
Department of Pharmacology, Monash University, Parkville, Victoria 3052, Australia
| | - Arthur Christopoulos
- Drug
Discovery Biology, Monash Institute of Pharmaceutical Sciences, and
Department of Pharmacology, Monash University, Parkville, Victoria 3052, Australia
| | - Santosh B. Noronha
- Department
of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400076, India
| | - Patrick M. Sexton
- Drug
Discovery Biology, Monash Institute of Pharmaceutical Sciences, and
Department of Pharmacology, Monash University, Parkville, Victoria 3052, Australia
| |
Collapse
|
20
|
Li H, Sun L, Luo S, Xia X, Lyu Q. Modeling Protein Loop Structure by Cyclic Coordinate Descent-based Approach. J Comput Biol 2016; 23:123-136. [DOI: 10.1089/cmb.2015.0145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Haiou Li
- School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu, China
| | - Lu Sun
- School of Electronical Engineering and Computer Science, University of Ottawa, Ottawa, Ontario, Canada
| | - Sheng Luo
- School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu, China
| | - Xiaoyan Xia
- School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu, China
- Jiangsu Provincial Key Lab for Information Processing Technologies, Suzhou, Jiangsu, China
| | - Qiang Lyu
- School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu, China
- Jiangsu Provincial Key Lab for Information Processing Technologies, Suzhou, Jiangsu, China
| |
Collapse
|
21
|
Wang DD, Ma L, Wong MP, Lee VHF, Yan H. Contribution of EGFR and ErbB-3 Heterodimerization to the EGFR Mutation-Induced Gefitinib- and Erlotinib-Resistance in Non-Small-Cell Lung Carcinoma Treatments. PLoS One 2015; 10:e0128360. [PMID: 25993617 PMCID: PMC4439022 DOI: 10.1371/journal.pone.0128360] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 04/25/2015] [Indexed: 12/02/2022] Open
Abstract
EGFR mutation-induced drug resistance has become a major threat to the treatment of non-small-cell lung carcinoma. Essentially, the resistance mechanism involves modifications of the intracellular signaling pathways. In our work, we separately investigated the EGFR and ErbB-3 heterodimerization, regarded as the origin of intracellular signaling pathways. On one hand, we combined the molecular interaction in EGFR heterodimerization with that between the EGFR tyrosine kinase and its inhibitor. For 168 clinical subjects, we characterized their corresponding EGFR mutations using molecular interactions, with three potential dimerization partners (ErbB-2, IGF-1R and c-Met) of EGFR and two of its small molecule inhibitors (gefitinib and erlotinib). Based on molecular dynamics simulations and structural analysis, we modeled these mutant-partner or mutant-inhibitor interactions using binding free energy and its components. As a consequence, the mutant-partner interactions are amplified for mutants L858R and L858R_T790M, compared to the wild type EGFR. Mutant delL747_P753insS represents the largest difference between the mutant-IGF-1R interaction and the mutant-inhibitor interaction, which explains the shorter progression-free survival of an inhibitor to this mutant type. Besides, feature sets including different energy components were constructed, and efficient regression trees were applied to map these features to the progression-free survival of an inhibitor. On the other hand, we comparably examined the interactions between ErbB-3 and its partners (EGFR mutants, IGF-1R, ErbB-2 and c-Met). Compared to others, c-Met shows a remarkably-strong binding with ErbB-3, implying its significant role in regulating ErbB-3 signaling. Moreover, EGFR mutants corresponding to poor clinical outcomes, such as L858R_T790M, possess lower binding affinities with ErbB-3 than c-Met does. This may promote the communication between ErbB-3 and c-Met in these cancer cells. The analysis verified the important contribution of IGF-1R or c-Met in the drug resistance mechanism developed in lung cancer treatments, which may bring many benefits to specialized therapy design and innovative drug discovery.
Collapse
Affiliation(s)
- Debby D. Wang
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong
- * E-mail:
| | - Lichun Ma
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong
| | - Maria P. Wong
- Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | - Victor H. F. Lee
- Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | - Hong Yan
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong
| |
Collapse
|
22
|
Abstract
Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.
Collapse
Affiliation(s)
- Benjamin Webb
- University of California at San Francisco, San Francisco, California
| | | |
Collapse
|
23
|
Tang K, Zhang J, Liang J. Fast protein loop sampling and structure prediction using distance-guided sequential chain-growth Monte Carlo method. PLoS Comput Biol 2014; 10:e1003539. [PMID: 24763317 PMCID: PMC3998890 DOI: 10.1371/journal.pcbi.1003539] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2013] [Accepted: 02/01/2014] [Indexed: 11/18/2022] Open
Abstract
Loops in proteins are flexible regions connecting regular secondary structures. They are often involved in protein functions through interacting with other molecules. The irregularity and flexibility of loops make their structures difficult to determine experimentally and challenging to model computationally. Conformation sampling and energy evaluation are the two key components in loop modeling. We have developed a new method for loop conformation sampling and prediction based on a chain growth sequential Monte Carlo sampling strategy, called Distance-guided Sequential chain-Growth Monte Carlo (DISGRO). With an energy function designed specifically for loops, our method can efficiently generate high quality loop conformations with low energy that are enriched with near-native loop structures. The average minimum global backbone RMSD for 1,000 conformations of 12-residue loops is 1:53 A° , with a lowest energy RMSD of 2:99 A° , and an average ensembleRMSD of 5:23 A° . A novel geometric criterion is applied to speed up calculations. The computational cost of generating 1,000 conformations for each of the x loops in a benchmark dataset is only about 10 cpu minutes for 12-residue loops, compared to ca 180 cpu minutes using the FALCm method. Test results on benchmark datasets show that DISGRO performs comparably or better than previous successful methods, while requiring far less computing time. DISGRO is especially effective in modeling longer loops (10-17 residues).
Collapse
Affiliation(s)
- Ke Tang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
- * E-mail: (JZ); (JL)
| | - Jie Liang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America
- * E-mail: (JZ); (JL)
| |
Collapse
|
24
|
Holtby D, Li SC, Li M. LoopWeaver: loop modeling by the weighted scaling of verified proteins. J Comput Biol 2014; 20:212-23. [PMID: 23461572 DOI: 10.1089/cmb.2012.0078] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Modeling loops is a necessary step in protein structure determination, even with experimental nuclear magnetic resonance (NMR) data, it is widely known to be difficult. Database techniques have the advantage of producing a higher proportion of predictions with subangstrom accuracy when compared with ab initio techniques, but the disadvantage of also producing a higher proportion of clashing or highly inaccurate predictions. We introduce LoopWeaver, a database method that uses multidimensional scaling to achieve better, clash-free placement of loops obtained from a database of protein structures. This allows us to maintain the above-mentioned advantage while avoiding the disadvantage. Test results show that we achieve significantly better results than all other methods, including Modeler, Loopy, SuperLooper, and Rapper, before refinement. With refinement, our results (LoopWeaver and Loopy consensus) are better than ROSETTA, with 0.42 Å RMSD on average for 206 length 6 loops, 0.64 Å local RMSD for 168 length 7 loops, 0.81Å RMSD for 117 length 8 loops, and 0.98 Å RMSD for length 9 loops, while ROSETTA has 0.55, 0.79, 1.16, 1.42, respectively, at the same average time limit (3 hours). When we allow ROSETTA to run for over a week, it approaches, but does not surpass, our accuracy.
Collapse
Affiliation(s)
- Daniel Holtby
- David R. Chariton School of Computer Science, University of Waterloo, Waterloo, Canada.
| | | | | |
Collapse
|
25
|
Abstract
Structural proteomics aims to understand the structural basis of protein interactions and functions. A prerequisite for this is the availability of 3D protein structures that mediate the biochemical interactions. The explosion in the number of available gene sequences set the stage for the next step in genome-scale projects -- to obtain 3D structures for each protein. To achieve this ambitious goal, the slow and costly structure determination experiments are supplemented with theoretical approaches. The current state and recent advances in structure modeling approaches are reviewed here, with special emphasis on comparative protein structure modeling techniques.
Collapse
Affiliation(s)
- András Fiser
- Department of Biochemistry, Seaver Foundation Center for Bioinformatics, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY 10461, USA.
| |
Collapse
|
26
|
Webb B, Eswar N, Fan H, Khuri N, Pieper U, Dong G, Sali A. Comparative Modeling of Drug Target Proteins☆. REFERENCE MODULE IN CHEMISTRY, MOLECULAR SCIENCES AND CHEMICAL ENGINEERING 2014. [PMCID: PMC7157477 DOI: 10.1016/b978-0-12-409547-2.11133-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In this perspective, we begin by describing the comparative protein structure modeling technique and the accuracy of the corresponding models. We then discuss the significant role that comparative prediction plays in drug discovery. We focus on virtual ligand screening against comparative models and illustrate the state-of-the-art by a number of specific examples.
Collapse
|
27
|
|
28
|
Kelm S, Vangone A, Choi Y, Ebejer JP, Shi J, Deane CM. Fragment-based modeling of membrane protein loops: successes, failures, and prospects for the future. Proteins 2013; 82:175-86. [PMID: 23589399 DOI: 10.1002/prot.24299] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2012] [Revised: 02/22/2013] [Accepted: 03/26/2013] [Indexed: 11/12/2022]
Abstract
Membrane proteins (MPs) have become a major focus in structure prediction, due to their medical importance. There is, however, a lack of fast and reliable methods that specialize in the modeling of MP loops. Often methods designed for soluble proteins (SPs) are applied directly to MPs. In this article, we investigate the validity of such an approach in the realm of fragment-based methods. We also examined the differences in membrane and soluble protein loops that might affect accuracy. We test our ability to predict soluble and MP loops with the previously published method FREAD. We show that it is possible to predict accurately the structure of MP loops using a database of MP fragments (0.5-1 Å median root-mean-square deviation). The presence of homologous proteins in the database helps prediction accuracy. However, even when homologues are removed better results are still achieved using fragments of MPs (0.8-1.6 Å) rather than SPs (1-4 Å) to model MP loops. We find that many fragments of SPs have shapes similar to their MP counterparts but have very different sequences; however, they do not appear to differ in their substitution patterns. Our findings may allow further improvements to fragment-based loop modeling algorithms for MPs. The current version of our proof-of-concept loop modeling protocol produces high-accuracy loop models for MPs and is available as a web server at http://medeller.info/fread.
Collapse
Affiliation(s)
- Sebastian Kelm
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | | | | | | | | | | |
Collapse
|
29
|
Wang DD, Zhou W, Yan H, Wong M, Lee V. Personalized prediction of EGFR mutation-induced drug resistance in lung cancer. Sci Rep 2013; 3:2855. [PMID: 24092472 PMCID: PMC3790204 DOI: 10.1038/srep02855] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 09/17/2013] [Indexed: 12/21/2022] Open
Abstract
EGFR mutation-induced drug resistance has significantly impaired the potency of small molecule tyrosine kinase inhibitors in lung cancer treatment. Computational approaches can provide powerful and efficient techniques in the investigation of drug resistance. In our work, the EGFR mutation feature is characterized by the energy components of binding free energy (concerning the mutant-inhibitor complex), and we combine it with specific personal features for 168 clinical subjects to construct a personalized drug resistance prediction model. The 3D structure of an EGFR mutant is computationally predicted from its protein sequence, after which the dynamics of the bound mutant-inhibitor complex is simulated via AMBER and the binding free energy of the complex is calculated based on the dynamics. The utilization of extreme learning machines and leave-one-out cross-validation promises a successful identification of resistant subjects with high accuracy. Overall, our study demonstrates advantages in the development of personalized medicine/therapy design and innovative drug discovery.
Collapse
Affiliation(s)
- Debby D Wang
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong
| | | | | | | | | |
Collapse
|
30
|
Su YC, Liu HL. The Effects of Salt and pH on the Spermatozoa Agglutinating Activity of Carp Ovum Cystatin by Molecular Dynamics Simulations. J CHIN CHEM SOC-TAIP 2013. [DOI: 10.1002/jccs.200600094] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
31
|
Liu HL, Lin JC, Ho Y, Hsieh WC, Chen CW, Su YC. Homology Models and Molecular Dynamics Simulations of Main Proteinase from Coronavirus Associated with Severe Acute Respiratory Syndrome (SARS). J CHIN CHEM SOC-TAIP 2013; 51:889-900. [PMID: 32336761 PMCID: PMC7167048 DOI: 10.1002/jccs.200400134] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2004] [Indexed: 11/29/2022]
Abstract
In this study, two structural models (denoted as MproST and MproSH) of the main proteinase (Mpro) from the novel coronavirus associated with severe acute respiratory syndrome (SARS‐CoV) were constructed based on the crystallographic structures of Mpro from transmissible gastroenteritis coronavirus (TGEV) (MproT) and human coronavirus HcoV‐229E (MproH), respectively. Various 200 ps molecular dynamics simulations were subsequently performed to investigate the dynamics behaviors of several structural features. Both MproST and MproSH exhibit similar folds as their respective template proteins. These structural models reveal three distinct functional domains as well as an intervening loop connecting domains II and III as found in both template proteins. In addition, domain III of these structures exhibits the least secondary structural conservation. A catalytic cleft containing the substrate binding subsites S1 and the S2 between domains I and II are also observed in these structural models. Although these structures share many common features, the most significant difference occurs at the S2 subsite, where the amino acid residues lining up this subsite are least conserved. It may be a critical challenge for designing anti‐SARS drugs by simply screening the known database of proteinase inhibitors.
Collapse
Affiliation(s)
- Hsuan-Liang Liu
- Department of Chemical Engineering and Graduate Institute of Biotechnology, National Taipei University of Technology, Taipei 10608, Taiwan, R.O.C
| | - Jin-Chung Lin
- Department of Chemical Engineering and Graduate Institute of Biotechnology, National Taipei University of Technology, Taipei 10608, Taiwan, R.O.C
| | - Yih Ho
- School of Pharmacy, Taipei Medical University, Taipei 110, Taiwan, R.O.C
| | - Wei-Chan Hsieh
- Department of Chemical Engineering and Graduate Institute of Biotechnology, National Taipei University of Technology, Taipei 10608, Taiwan, R.O.C
| | - Chin-Wen Chen
- Department of Chemical Engineering and Graduate Institute of Biotechnology, National Taipei University of Technology, Taipei 10608, Taiwan, R.O.C
| | - Yuan-Chen Su
- Department of Chemical Engineering and Graduate Institute of Biotechnology, National Taipei University of Technology, Taipei 10608, Taiwan, R.O.C
| |
Collapse
|
32
|
Mishra S, Saxena A, Sangwan RS. Fundamentals of Homology Modeling Steps and Comparison among Important Bioinformatics Tools: An Overview. ACTA ACUST UNITED AC 2013. [DOI: 10.17311/sciintl.2013.237.252] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
33
|
Computational methods for high resolution prediction and refinement of protein structures. Curr Opin Struct Biol 2013; 23:177-84. [DOI: 10.1016/j.sbi.2013.01.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2012] [Revised: 01/22/2013] [Accepted: 01/24/2013] [Indexed: 01/29/2023]
|
34
|
Chys P, Chacón P. Random Coordinate Descent with Spinor-matrices and Geometric Filters for Efficient Loop Closure. J Chem Theory Comput 2013; 9:1821-9. [PMID: 26587638 DOI: 10.1021/ct300977f] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Protein loop closure constitutes a critical step in loop and protein modeling whereby geometrically feasible loops must be found between two given anchor residues. Here, a new analytic/iterative algorithm denoted random coordinate descent (RCD) to perform protein loop closure is described. The algorithm solves loop closure through minimization as in cyclic coordinate descent but selects bonds for optimization randomly, updates loop conformations by spinor-matrices, performs loop closure in both chain directions, and uses a set of geometric filters to yield efficient conformational sampling. Geometric filters allow one to detect clashes and constrain dihedral angles on the fly. The RCD algorithm is at least comparable to state of the art loop closure algorithms due to an excellent balance between efficiency and intrinsic sampling capability. Furthermore, its efficiency allows one to improve conformational sampling by increasing the sampling number without much penalty. Overall, RCD turns out to be accurate, fast, robust, and applicable over a wide range of loop lengths. Because of the versatility of RCD, it is a solid alternative for integration with current loop modeling strategies.
Collapse
Affiliation(s)
- Pieter Chys
- Structural Bioinformatics Group, Biological Chemical Physics Department, Institute of Physical Chemistry Rocasolano (IQFR), Consejo Superior de Investigaciones Cientı́ficas (CSIC), Calle de Serrano 119, Madrid 28006, Spain
| | - Pablo Chacón
- Structural Bioinformatics Group, Biological Chemical Physics Department, Institute of Physical Chemistry Rocasolano (IQFR), Consejo Superior de Investigaciones Cientı́ficas (CSIC), Calle de Serrano 119, Madrid 28006, Spain
| |
Collapse
|
35
|
Li Y. Conformational sampling in template-free protein loop structure modeling: an overview. Comput Struct Biotechnol J 2013; 5:e201302003. [PMID: 24688696 PMCID: PMC3962101 DOI: 10.5936/csbj.201302003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Revised: 01/23/2013] [Accepted: 01/28/2013] [Indexed: 01/04/2023] Open
Abstract
Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a "mini protein folding problem" under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized.
Collapse
Affiliation(s)
- Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
36
|
Vyas VK, Ukawala RD, Ghate M, Chintha C. Homology modeling a fast tool for drug discovery: current perspectives. Indian J Pharm Sci 2012. [PMID: 23204616 PMCID: PMC3507339 DOI: 10.4103/0250-474x.102537] [Citation(s) in RCA: 139] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Major goal of structural biology involve formation of protein-ligand complexes; in which the protein molecules act energetically in the course of binding. Therefore, perceptive of protein-ligand interaction will be very important for structure based drug design. Lack of knowledge of 3D structures has hindered efforts to understand the binding specificities of ligands with protein. With increasing in modeling software and the growing number of known protein structures, homology modeling is rapidly becoming the method of choice for obtaining 3D coordinates of proteins. Homology modeling is a representation of the similarity of environmental residues at topologically corresponding positions in the reference proteins. In the absence of experimental data, model building on the basis of a known 3D structure of a homologous protein is at present the only reliable method to obtain the structural information. Knowledge of the 3D structures of proteins provides invaluable insights into the molecular basis of their functions. The recent advances in homology modeling, particularly in detecting and aligning sequences with template structures, distant homologues, modeling of loops and side chains as well as detecting errors in a model contributed to consistent prediction of protein structure, which was not possible even several years ago. This review focused on the features and a role of homology modeling in predicting protein structure and described current developments in this field with victorious applications at the different stages of the drug design and discovery.
Collapse
Affiliation(s)
- V K Vyas
- Department of Pharmaceutical Chemistry, Institute of Pharmacy, Nirma University, Ahmedabad-382 481, India
| | | | | | | |
Collapse
|
37
|
Liang S, Zhang C, Sarmiento J, Standley DM. Protein Loop Modeling with Optimized Backbone Potential Functions. J Chem Theory Comput 2012; 8:1820-7. [PMID: 26593673 DOI: 10.1021/ct300131p] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We represented protein backbone potential as a Fourier series. The parameters of the backbone dihedral potential were initialized to random values and optimized by Monte Carlo simulations so that generated native-like loop decoys had a lower energy than non-native decoys. The low energy regions of the optimized backbone potential were consistent with observed Ramachandran plots derived from crystal structures. The backbone potential was then used for the prediction of loop conformations (OSCAR-loop) combining with the previously described OSCAR force field, which has been shown to be very accurate in side chain modeling. As a result, the accuracy of OSCAR-loop was improved by local energy minimization based on the complete force field. The average accuracies were 0.40, 0.70, 1.10, 2.08, and 3.58 Å for 4, 6, 8, 10, and 12-residue loops, respectively, with each size being represented by 325 to 2809 targets. The accuracy was better than that of other loop modeling algorithms for short loops (<10 residues). For longer loops, the prediction accuracy was improved by concurrently sampling with a fragment-based method, Spanner. OSCAR-loop is available for download at http://sysimm.ifrec.osaka-u.ac.jp/OSCAR/ .
Collapse
Affiliation(s)
- Shide Liang
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University , Suita, Osaka, 565-0871, Japan
| | - Chi Zhang
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska , Lincoln, Nebraska 68588, United States
| | - Jamica Sarmiento
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University , Suita, Osaka, 565-0871, Japan
| | - Daron M Standley
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University , Suita, Osaka, 565-0871, Japan
| |
Collapse
|
38
|
|
39
|
Tripathy C, Zeng J, Zhou P, Donald BR. Protein loop closure using orientational restraints from NMR data. Proteins 2012; 80:433-53. [PMID: 22161780 PMCID: PMC3305838 DOI: 10.1002/prot.23207] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2011] [Revised: 08/23/2011] [Accepted: 09/06/2011] [Indexed: 11/12/2022]
Abstract
Protein loops often play important roles in biological functions. Modeling loops accurately is crucial to determining the functional specificity of a protein. Despite the recent progress in loop prediction approaches, which led to a number of algorithms over the past decade, few rigorous algorithmic approaches exist to model protein loops using global orientational restraints, such as those obtained from residual dipolar coupling (RDC) data in solution nuclear magnetic resonance (NMR) spectroscopy. In this article, we present a novel, sparse data, RDC-based algorithm, which exploits the mathematical interplay between RDC-derived sphero-conics and protein kinematics, and formulates the loop structure determination problem as a system of low-degree polynomial equations that can be solved exactly, in closed-form. The polynomial roots, which encode the candidate conformations, are searched systematically, using provable pruning strategies that triage the vast majority of conformations, to enumerate or prune all possible loop conformations consistent with the data; therefore, completeness is ensured. Results on experimental RDC datasets for four proteins, including human ubiquitin, FF2, DinI, and GB3, demonstrate that our algorithm can compute loops with higher accuracy, a three- to six-fold improvement in backbone RMSD, versus those obtained by traditional structure determination protocols on the same data. Excellent results were also obtained on synthetic RDC datasets for protein loops of length 4, 8, and 12 used in previous studies. These results suggest that our algorithm can be successfully applied to determine protein loop conformations, and hence, will be useful in high-resolution protein backbone structure determination, including loops, from sparse NMR data. Proteins 2012. © 2011 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
| | - Jianyang Zeng
- Department of Computer Science, Duke University, Durham, NC 27708, USA
| | - Pei Zhou
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA
| | - Bruce Randall Donald
- Department of Computer Science, Duke University, Durham, NC 27708, USA
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA
| |
Collapse
|
40
|
Joo H, Chavan AG, Day R, Lennox KP, Sukhanov P, Dahl DB, Vannucci M, Tsai J. Near-native protein loop sampling using nonparametric density estimation accommodating sparcity. PLoS Comput Biol 2011; 7:e1002234. [PMID: 22028638 PMCID: PMC3197639 DOI: 10.1371/journal.pcbi.1002234] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2011] [Accepted: 09/01/2011] [Indexed: 11/29/2022] Open
Abstract
Unlike the core structural elements of a protein like regular secondary structure, template based modeling (TBM) has difficulty with loop regions due to their variability in sequence and structure as well as the sparse sampling from a limited number of homologous templates. We present a novel, knowledge-based method for loop sampling that leverages homologous torsion angle information to estimate a continuous joint backbone dihedral angle density at each loop position. The φ,ψ distributions are estimated via a Dirichlet process mixture of hidden Markov models (DPM-HMM). Models are quickly generated based on samples from these distributions and were enriched using an end-to-end distance filter. The performance of the DPM-HMM method was evaluated against a diverse test set in a leave-one-out approach. Candidates as low as 0.45 Å RMSD and with a worst case of 3.66 Å were produced. For the canonical loops like the immunoglobulin complementarity-determining regions (mean RMSD <2.0 Å), the DPM-HMM method performs as well or better than the best templates, demonstrating that our automated method recaptures these canonical loops without inclusion of any IgG specific terms or manual intervention. In cases with poor or few good templates (mean RMSD >7.0 Å), this sampling method produces a population of loop structures to around 3.66 Å for loops up to 17 residues. In a direct test of sampling to the Loopy algorithm, our method demonstrates the ability to sample nearer native structures for both the canonical CDRH1 and non-canonical CDRH3 loops. Lastly, in the realistic test conditions of the CASP9 experiment, successful application of DPM-HMM for 90 loops from 45 TBM targets shows the general applicability of our sampling method in loop modeling problem. These results demonstrate that our DPM-HMM produces an advantage by consistently sampling near native loop structure. The software used in this analysis is available for download at http://www.stat.tamu.edu/~dahl/software/cortorgles/. A protein's structure consists of elements of regular secondary structure connected by less regular stretches of loop segments. The irregularity of the loop structure makes loop modeling quite challenging. More accurate sampling of these loop conformations has a direct impact on protein modeling, design, function classification, as well as protein interactions. A method has been developed that extends a more comprehensive knowledge-based approach to producing models of the loop regions of protein structure. Most physical models cannot adequately sample the large conformational space, while the more discrete knowledge based libraries are conformationally limited. To address both of these problems, we introduce a novel statistical method that produces a continuous yet weighted estimation of loop conformational space from a discrete library of structures by using a Dirichlet process mixture of hidden Markov models (DPM-HMM). Applied to loop structure sampling, the results of a number of tests demonstrate that our approach quickly generates large numbers of candidates with near native loop conformations. Most significantly, in the cases where the template sampling is sparse and/or far from native conformations, the DPM-HMM method samples close to the native space and produces a population of accurate loop structures.
Collapse
Affiliation(s)
- Hyun Joo
- Department of Chemistry, University of the Pacific, Stockton, California, United States of America
| | - Archana G. Chavan
- Department of Chemistry, University of the Pacific, Stockton, California, United States of America
| | - Ryan Day
- Department of Chemistry, University of the Pacific, Stockton, California, United States of America
| | - Kristin P. Lennox
- Department of Statistics, Texas A&M University, College Station, Texas, United States of America
| | - Paul Sukhanov
- Department of Chemistry, University of the Pacific, Stockton, California, United States of America
| | - David B. Dahl
- Department of Statistics, Texas A&M University, College Station, Texas, United States of America
| | - Marina Vannucci
- Department of Statistics, Rice University, Houston, Texas, United States of America
| | - Jerry Tsai
- Department of Chemistry, University of the Pacific, Stockton, California, United States of America
- * E-mail:
| |
Collapse
|
41
|
Li Y, Rata I, Jakobsson E. Sampling multiple scoring functions can improve protein loop structure prediction accuracy. J Chem Inf Model 2011; 51:1656-66. [PMID: 21702492 PMCID: PMC3211142 DOI: 10.1021/ci200143u] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Accurately predicting loop structures is important for understanding functions of many proteins. In order to obtain loop models with high accuracy, efficiently sampling the loop conformation space to discover reasonable structures is a critical step. In loop conformation sampling, coarse-grain energy (scoring) functions coupling with reduced protein representations are often used to reduce the number of degrees of freedom as well as sampling computational time. However, due to implicitly considering many factors by reduced representations, the coarse-grain scoring functions may have potential insensitivity and inaccuracy, which can mislead the sampling process and consequently ignore important loop conformations. In this paper, we present a new computational sampling approach to obtain reasonable loop backbone models, so-called the Pareto optimal sampling (POS) method. The rationale of the POS method is to sample the function space of multiple, carefully selected scoring functions to discover an ensemble of diversified structures yielding Pareto optimality to all sampled conformations. The POS method can efficiently tolerate insensitivity and inaccuracy in individual scoring functions and thereby lead to significant accuracy improvement in loop structure prediction. We apply the POS method to a set of 4-12-residue loop targets using a function space composed of backbone-only Rosetta and distance-scale finite ideal-gas reference (DFIRE) and a triplet backbone dihedral potential developed in our lab. Our computational results show that in 501 out of 502 targets, the model sets generated by POS contain structure models are within subangstrom resolution. Moreover, the top-ranked models have a root mean square deviation (rmsd) less than 1 A in 96.8, 84.1, and 72.2% of the short (4-6 residues), medium (7-9 residues), and long (10-12 residues) targets, respectively, when the all-atom models are generated by local optimization from the backbone models and are ranked by our recently developed Pareto optimal consensus (POC) method. Similar sampling effectiveness can also be found in a set of 13-residue loop targets.
Collapse
Affiliation(s)
- Yaohang Li
- Department of Computer Science, Old Dominion University
| | - Ionel Rata
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign
| | - Eric Jakobsson
- Department of Molecular and Integrative Physiology, Beckman Institute, and National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign
| |
Collapse
|
42
|
Liang S, Zhang C, Standley DM. Protein loop selection using orientation-dependent force fields derived by parameter optimization. Proteins 2011; 79:2260-7. [DOI: 10.1002/prot.23051] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2011] [Revised: 03/21/2011] [Accepted: 03/31/2011] [Indexed: 12/25/2022]
|
43
|
Arnautova YA, Abagyan RA, Totrov M. Development of a new physics-based internal coordinate mechanics force field and its application to protein loop modeling. Proteins 2011; 79:477-98. [PMID: 21069716 PMCID: PMC3057902 DOI: 10.1002/prot.22896] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
We report the development of internal coordinate mechanics force field (ICMFF), new force field parameterized using a combination of experimental data for crystals of small molecules and quantum mechanics calculations. The main features of ICMFF include: (a) parameterization for the dielectric constant relevant to the condensed state (ε = 2) instead of vacuum, (b) an improved description of hydrogen-bond interactions using duplicate sets of van der Waals parameters for heavy atom-hydrogen interactions, and (c) improved backbone covalent geometry and energetics achieved using novel backbone torsional potentials and inclusion of the bond angles at the C(α) atoms into the internal variable set. The performance of ICMFF was evaluated through loop modeling simulations for 4-13 residue loops. ICMFF was combined with a solvent-accessible surface area solvation model optimized using a large set of loop decoys. Conformational sampling was carried out using the biased probability Monte Carlo method. Average/median backbone root-mean-square deviations of the lowest energy conformations from the native structures were 0.25/0.21 Å for four residues loops, 0.84/0.46 Å for eight residue loops, and 1.16/0.73 Å for 12 residue loops. To our knowledge, these results are significantly better than or comparable with those reported to date for any loop modeling method that does not take crystal packing into account. Moreover, the accuracy of our method is on par with the best previously reported results obtained considering the crystal environment. We attribute this success to the high accuracy of the new ICM force field achieved by meticulous parameterization, to the optimized solvent model, and the efficiency of the search method.
Collapse
Affiliation(s)
- Yelena A Arnautova
- Molsoft LLC, 3366 North Torrey Pines Court, Suite 300, La Jolla, California 92037, USA
| | | | | |
Collapse
|
44
|
Abstract
Loop modeling is crucial for high-quality homology model construction outside conserved secondary structure elements. Dozens of loop modeling protocols involving a range of database and ab initio search algorithms and a variety of scoring functions have been proposed. Knowledge-based loop modeling methods are very fast and some can successfully and reliably predict loops up to about eight residues long. Several recent ab initio loop simulation methods can be used to construct accurate models of loops up to 12-13 residues long, albeit at a substantial computational cost. Major current challenges are the simulations of loops longer than 12-13 residues, the modeling of multiple interacting flexible loops, and the sensitivity of the loop predictions to the accuracy of the loop environment.
Collapse
|
45
|
Arsenault J, Cabana J, Fillion D, Leduc R, Guillemette G, Lavigne P, Escher E. Temperature dependent photolabeling of the human angiotensin II type 1 receptor reveals insights into its conformational landscape and its activation mechanism. Biochem Pharmacol 2010; 80:990-9. [DOI: 10.1016/j.bcp.2010.06.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2010] [Revised: 06/07/2010] [Accepted: 06/07/2010] [Indexed: 11/15/2022]
|
46
|
Hayward S, Kitao A. The effect of end constraints on protein loop kinematics. Biophys J 2010; 98:1976-85. [PMID: 20441762 DOI: 10.1016/j.bpj.2010.01.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Revised: 01/08/2010] [Accepted: 01/11/2010] [Indexed: 11/17/2022] Open
Abstract
Despite the prevalent involvement of loops in function little is known about how the constraining of end groups influences their kinematics. Using a linear inverse-kinematics approach and assuming fixed bond lengths, bond angles, and peptide bond torsions, as well as ignoring molecular interactions to assess the effect of the end-constraint only, it is shown that the constraint creates a closed surface in torsion angle space. For pentapeptides, the constraint gives rise to inaccessible regions in a Ramachandran plot. This complex and tightly curved surface produces interesting effects that may play a functional role. For example, a small change in one torsion angle can radically change the behavior of the whole loop. The constraint also produces long-range correlations, and structures exist where the correlation coefficient is 1.0 or -1.0 between rotations about bonds separated by >30 A. Another application allows some torsion angles to be targeted to specified values while others are constrained. When this application was used on key torsions in lactate dehydrogenase, it was found that the functional loop first folds forward and then moves sideways. For horse liver alcohol dehydrogenase, it was confirmed that the functional loop's Pro-Pro motif creates a rigid arm in an NAD-activated switch for domain closure.
Collapse
Affiliation(s)
- Steven Hayward
- D'Arcy Thompson Centre for Computational Biology, School of Computing Sciences, University of East Anglia, Norwich, United Kingdom.
| | | |
Collapse
|
47
|
Danielson ML, Lill MA. New computational method for prediction of interacting protein loop regions. Proteins 2010; 78:1748-59. [PMID: 20186974 DOI: 10.1002/prot.22690] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Flexible loop regions of proteins play a crucial role in many biological functions such as protein-ligand recognition, enzymatic catalysis, and protein-protein association. To date, most computational methods that predict the conformational states of loops only focus on individual loop regions. However, loop regions are often spatially in close proximity to one another and their mutual interactions stabilize their conformations. We have developed a new method, titled CorLps, capable of simultaneously predicting such interacting loop regions. First, an ensemble of individual loop conformations is generated for each loop region. The members of the individual ensembles are combined and are accepted or rejected based on a steric clash filter. After a subsequent side-chain optimization step, the resulting conformations of the interacting loops are ranked by the statistical scoring function DFIRE that originated from protein structure prediction. Our results show that predicting interacting loops with CorLps is superior to sequential prediction of the two interacting loop regions, and our method is comparable in accuracy to single loop predictions. Furthermore, improved predictive accuracy of the top-ranked solution is achieved for 12-residue length loop regions by diversifying the initial pool of individual loop conformations using a quality threshold clustering algorithm.
Collapse
Affiliation(s)
- Matthew L Danielson
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana 47907, USA
| | | |
Collapse
|
48
|
Abstract
Functional characterization of a protein is often facilitated by its 3D structure. However, the fraction of experimentally known 3D models is currently less than 1% due to the inherently time-consuming and complicated nature of structure determination techniques. Computational approaches are employed to bridge the gap between the number of known sequences and that of 3D models. Template-based protein structure modeling techniques rely on the study of principles that dictate the 3D structure of natural proteins from the theory of evolution viewpoint. Strategies for template-based structure modeling will be discussed with a focus on comparative modeling, by reviewing techniques available for all the major steps involved in the comparative modeling pipeline.
Collapse
Affiliation(s)
- Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
| |
Collapse
|
49
|
Muppirala UK, Desensi S, Lybrand TP, Hazelbauer GL, Li Z. Molecular modeling of flexible arm-mediated interactions between bacterial chemoreceptors and their modification enzyme. Protein Sci 2009; 18:1702-14. [PMID: 19606502 DOI: 10.1002/pro.170] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Sensory adaptation in bacterial chemotaxis is mediated by methylation and demethylation of specific glutamyl residues in the cytoplasmic domain of chemoreceptors. Methylation is catalyzed by methyltransferase CheR. In E. coli and related organisms, methylation sufficiently rapid to be physiologically effective requires a carboxyl terminal pentapeptide sequence on the receptor being modified or, via adaptational assistance, on a neighboring homodimer in a receptor cluster. Pentapeptide-enhanced methylation is thought to be mediated by a approximately 30 residue, potentially disordered sequence that serves as a flexible arm connecting the receptor body and pentapeptide-bound methyltransferase, thus allowing diffusionally restricted enzyme to reach methyl-accepting sites. However, it was not known how many or which sites on the same or neighboring receptors were accessible to the tethered enzyme. We investigated using molecular modeling and found that, in a hexagonal array of trimers of receptor dimers, CheR tethered to a dimer of chemoreceptor Tar by its native 30-residue flexible-arm sequence could reach all methyl-accepting sites on the dimer to which it was tethered plus 48 methyl-accepting sites distributed among nine neighboring dimers, equivalent to the total sites carried by six receptors. This modeling-determined methylation neighborhood of one enzyme-binding dimer and six neighbors corresponds precisely with the experimentally identified neighborhood of seven. Thus, the experimentally observed adaptational assistance can occur by docking of pentapeptide-bound, diffusionally restricted enzyme to methyl-accepting sites on neighboring receptors. Our analysis introduces the notion that physiologically relevant adaptational assistance could occur even if only a subset of sites on a particular receptor are within reach.
Collapse
Affiliation(s)
- Usha K Muppirala
- Department of Bioinformatics and Computer Science, University of the Sciences in Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | | | | | | | | |
Collapse
|
50
|
Clément M, Cabana J, Holleran BJ, Leduc R, Guillemette G, Lavigne P, Escher E. Activation induces structural changes in the liganded angiotensin II type 1 receptor. J Biol Chem 2009; 284:26603-12. [PMID: 19635801 DOI: 10.1074/jbc.m109.012922] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The octapeptide hormone angiotensin II (AngII) binds to and activates the human angiotensin II type 1 receptor (hAT(1)) of the G protein-coupled receptor class A family. Several activation mechanisms have been proposed for this family, but they have not yet been experimentally validated. We previously used the methionine proximity assay to show that 11 residues in transmembrane domain (TMD) III, VI, and VII of the hAT(1) receptor reside in close proximity to the C-terminal residue of AngII. With the exception of a single change in TMD VI, the same contacts are present on N111G-hAT(1), a constitutively active mutant; this N111G-hAT(1) is a model for the active form of the receptor. In this study, two series of 53 individual methionine mutations were constructed in TMD I, II, IV, and V on both receptor forms. The mutants were photolabeled with a neutral antagonist, (125)I-[Sar(1),p-benzoyl-L-Phe(8)]AngII, and the resulting complexes were digested with cyanogen bromide. Although no new contacts were found for the hAT(1) mutants, two were found in the constitutively active mutants, Phe-77 in TMD II and Asn-200 in TMD V. To our knowledge, this is the first time that a direct ligand contact with TMD II and TMD V has been reported. These contact point differences were used to identify the structural changes between the WT-hAT(1) and N111G-hAT(1) complexes through homology-based modeling and restrained molecular dynamics. The model generated revealed an important structural rearrangement of several TMDs from the basal to the activated form in the WT-hAT(1) receptor.
Collapse
Affiliation(s)
- Martin Clément
- Department of Pharmacology, Faculty of Medicine, Université de Sherbrooke, Sherbrooke, Quebec J1H 5N4, Canada
| | | | | | | | | | | | | |
Collapse
|