1
|
Roterman I, Stapor K, Konieczny L. Role of environmental specificity in CASP results. BMC Bioinformatics 2023; 24:425. [PMID: 37950210 PMCID: PMC10638730 DOI: 10.1186/s12859-023-05559-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 11/06/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND Recently, significant progress has been made in the field of protein structure prediction by the application of artificial intelligence techniques, as shown by the results of the CASP13 and CASP14 (Critical Assessment of Structure Prediction) competition. However, the question of the mechanism behind the protein folding process itself remains unanswered. Correctly predicting the structure also does not solve the problem of, for example, amyloid proteins, where a polypeptide chain with an unaltered sequence adopts a different 3D structure. RESULTS This work was an attempt at explaining the structural variation by considering the contribution of the environment to protein structuring. The application of the fuzzy oil drop (FOD) model to assess the validity of the selected models provided in the CASP13, CASP14 and CASP15 projects reveals the need for an environmental factor to determine the 3D structure of proteins. Consideration of the external force field in the form of polar water (Fuzzy Oil Drop) and a version modified by the presence of the hydrophobic compounds, FOD-M (FOD-Modified) reveals that the protein folding process is environmentally dependent. An analysis of selected models from the CASP competitions indicates the need for structure prediction as dependent on the consideration of the protein folding environment. CONCLUSIONS The conditions governed by the environment direct the protein folding process occurring in a certain environment. Therefore, the variation of the external force field should be taken into account in the models used in protein structure prediction.
Collapse
Affiliation(s)
- Irena Roterman
- Department of Bioinformatics and Telemedicine, Jagiellonian University - Medical College, Medyczna 7, 30-688, Krakow, Poland.
| | - Katarzyna Stapor
- Faculty of Automatic, Electronics and Computer Science, Department of Applied, Informatics, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland
| | - Leszek Konieczny
- Jagiellonian University - Medical College, Kopernika 7, 31-034, Krakow, Poland
| |
Collapse
|
2
|
The Possible Mechanism of Amyloid Transformation Based on the Geometrical Parameters of Early-Stage Intermediate in Silico Model for Protein Folding. Int J Mol Sci 2022; 23:ijms23169502. [PMID: 36012765 PMCID: PMC9409474 DOI: 10.3390/ijms23169502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 08/09/2022] [Accepted: 08/19/2022] [Indexed: 12/03/2022] Open
Abstract
The specificity of the available experimentally determined structures of amyloid forms is expressed primarily by the two- and not three-dimensional forms of a single polypeptide chain. Such a flat structure is possible due to the β structure, which occurs predominantly. The stabilization of the fibril in this structure is achieved due to the presence of the numerous hydrogen bonds between the adjacent chains. Together with the different forms of twists created by the single R- or L-handed α-helices, they form the hydrogen bond network. The specificity of the arrangement of these hydrogen bonds lies in their joint orientation in a system perpendicular to the plane formed by the chain and parallel to the fibril axis. The present work proposes the possible mechanism for obtaining such a structure based on the geometric characterization of the polypeptide chain constituting the basis of our early intermediate model for protein folding introduced formerly. This model, being the conformational subspace of Ramachandran plot (the ellipse path), was developed on the basis of the backbone conformation, with the side-chain interactions excluded. Our proposal is also based on the results from molecular dynamics available in the literature leading to the unfolding of α-helical sections, resulting in the β-structural forms. Both techniques used provide a similar suggestion in a search for a mechanism of conformational changes leading to a formation of the amyloid form. The potential mechanism of amyloid transformation is presented here using the fragment of the transthyretin as well as amyloid Aβ.
Collapse
|
3
|
CavitySpace: A Database of Potential Ligand Binding Sites in the Human Proteome. Biomolecules 2022; 12:biom12070967. [PMID: 35883523 PMCID: PMC9312471 DOI: 10.3390/biom12070967] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Revised: 07/07/2022] [Accepted: 07/08/2022] [Indexed: 02/01/2023] Open
Abstract
Location and properties of ligand binding sites provide important information to uncover protein functions and to direct structure-based drug design approaches. However, as binding site detection depends on the three-dimensional (3D) structural data of proteins, functional analysis based on protein ligand binding sites is formidable for proteins without structural information. Recent developments in protein structure prediction and the 3D structures built by AlphaFold provide an unprecedented opportunity for analyzing ligand binding sites in human proteins. Here, we constructed the CavitySpace database, the first pocket library for all the proteins in the human proteome, using a widely-applied ligand binding site detection program CAVITY. Our analysis showed that known ligand binding sites could be well recovered. We grouped the predicted binding sites according to their similarity which can be used in protein function prediction and drug repurposing studies. Novel binding sites in highly reliable predicted structure regions provide new opportunities for drug discovery. Our CavitySpace is freely available and provides a valuable tool for drug discovery and protein function studies.
Collapse
|
4
|
Su H, Wang W, Du Z, Peng Z, Gao S, Cheng M, Yang J. Improved Protein Structure Prediction Using a New Multi-Scale Network and Homologous Templates. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2021; 8:e2102592. [PMID: 34719864 PMCID: PMC8693034 DOI: 10.1002/advs.202102592] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 09/12/2021] [Indexed: 06/04/2023]
Abstract
The accuracy of de novo protein structure prediction has been improved considerably in recent years, mostly due to the introduction of deep learning techniques. In this work, trRosettaX, an improved version of trRosetta for protein structure prediction is presented. The major improvement over trRosetta consists of two folds. The first is the application of a new multi-scale network, i.e., Res2Net, for improved prediction of inter-residue geometries, including distance and orientations. The second is an attention-based module to exploit multiple homologous templates to increase the accuracy further. Compared with trRosetta, trRosettaX improves the contact precision by 6% and 8% on the free modeling targets of CASP13 and CASP14, respectively. A preliminary version of trRosettaX is ranked as one of the top server groups in CASP14's blind test. Additional benchmark test on 161 targets from CAMEO (between Jun and Sep 2020) shows that trRosettaX achieves an average TM-score ≈0.8, outperforming the top groups in CAMEO. These data suggest the effectiveness of using the multi-scale network and the benefit of incorporating homologous templates into the network. The trRosettaX algorithm is incorporated into the trRosetta server since Nov 2020. The web server, the training and inference codes are available at: https://yanglab.nankai.edu.cn/trRosetta/.
Collapse
Affiliation(s)
- Hong Su
- School of Mathematical SciencesNankai UniversityTianjin300071China
| | - Wenkai Wang
- School of Mathematical SciencesNankai UniversityTianjin300071China
| | - Zongyang Du
- School of Mathematical SciencesNankai UniversityTianjin300071China
| | - Zhenling Peng
- Research Center for Mathematics and Interdisciplinary SciencesShandong UniversityQingdao266237China
| | - Shang‐Hua Gao
- College of Computer ScienceNankai UniversityTianjin300071China
| | - Ming‐Ming Cheng
- College of Computer ScienceNankai UniversityTianjin300071China
| | - Jianyi Yang
- Research Center for Mathematics and Interdisciplinary SciencesShandong UniversityQingdao266237China
| |
Collapse
|
5
|
Applying Bioinformatic Platforms, In Vitro, and In Vivo Functional Assays in the Characterization of Genetic Variants in the GH/IGF Pathway Affecting Growth and Development. Cells 2021; 10:cells10082063. [PMID: 34440832 PMCID: PMC8392544 DOI: 10.3390/cells10082063] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 08/06/2021] [Accepted: 08/09/2021] [Indexed: 02/07/2023] Open
Abstract
Heritability accounts for over 80% of adult human height, indicating that genetic variability is the main determinant of stature. The rapid technological development of Next-Generation Sequencing (NGS), particularly Whole Exome Sequencing (WES), has resulted in the characterization of several genetic conditions affecting growth and development. The greatest challenge of NGS remains the high number of candidate variants identified. In silico bioinformatic tools represent the first approach for classifying these variants. However, solving the complicated problem of variant interpretation requires the use of experimental approaches such as in vitro and, when needed, in vivo functional assays. In this review, we will discuss a rational approach to apply to the gene variants identified in children with growth and developmental defects including: (i) bioinformatic tools; (ii) in silico modeling tools; (iii) in vitro functional assays; and (iv) the development of in vivo models. While bioinformatic tools are useful for a preliminary selection of potentially pathogenic variants, in vitro—and sometimes also in vivo—functional assays are further required to unequivocally determine the pathogenicity of a novel genetic variant. This long, time-consuming, and expensive process is the only scientifically proven method to determine causality between a genetic variant and a human genetic disease.
Collapse
|
6
|
Zhang R, Stahr MC, Kennedy MA. Introduction of a new scheme for classifying β-turns in protein structures. Proteins 2021; 90:110-122. [PMID: 34322903 DOI: 10.1002/prot.26190] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 07/11/2021] [Indexed: 11/09/2022]
Abstract
Protein β-turn classification remains an area of ongoing development in structural biology research. While the commonly used nomenclature defining type I, type II and type IV β-turns was introduced in the 1970s and 1980s, refinements of β-turn type definitions have been introduced as recently as 2019 by Dunbrack, Jr and co-workers who expanded the number of β-turn types to 18 (Shapovalov et al, PLOS Computat. Biol., 15, e1006844, 2019). Based on their analysis of 13 030 turns from 1074 ultrahigh resolution (≤1.2 Å) protein structures, they used a new clustering algorithm to expand the definitions used to classify protein β-turns and introduced a new nomenclature system. We recently encountered a specific problem when classifying β-turns in crystal structures of pentapeptide repeat proteins (PRPs) determined in our lab that are largely composed of β-turns that often lie close to, but just outside of, canonical β-turn regions. To address this problem, we devised a new scheme that merges the Klyne-Prelog stereochemistry nomenclature and definitions with the Ramachandran plot. The resulting Klyne-Prelog-modified Ramachandran plot scheme defines 1296 distinct potential β-turn classifications that cover all possible protein β-turn space with a nomenclature that indicates the stereochemistry of i + 1 and i + 2 backbone dihedral angles. The utility of the new classification scheme was illustrated by re-classification of the β-turns in all known protein structures in the PRP superfamily and further assessed using a database of 16 657 high-resolution protein structures (≤1.5 Å) from which 522 776 β-turns were identified and classified.
Collapse
Affiliation(s)
- Ruojing Zhang
- Department of Chemistry and Biochemistry, Miami University, Oxford, Ohio, USA
| | - Michael C Stahr
- Department of Computer Science and Software Engineering, Miami University, Oxford, Ohio, USA
| | - Michael A Kennedy
- Department of Chemistry and Biochemistry, Miami University, Oxford, Ohio, USA
| |
Collapse
|
7
|
María Hernández-Domínguez E, Sofía Castillo-Ortega L, García-Esquivel Y, Mandujano-González V, Díaz-Godínez G, Álvarez-Cervantes J. Bioinformatics as a Tool for the Structural and Evolutionary Analysis of Proteins. Comput Biol Chem 2020. [DOI: 10.5772/intechopen.89594] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
This chapter deals with the topic of bioinformatics, computational, mathematics, and statistics tools applied to biology, essential for the analysis and characterization of biological molecules, in particular proteins, which play an important role in all cellular and evolutionary processes of the organisms. In recent decades, with the next generation sequencing technologies and bioinformatics, it has facilitated the collection and analysis of a large amount of genomic, transcriptomic, proteomic, and metabolomic data from different organisms that have allowed predictions on the regulation of expression, transcription, translation, structure, and mechanisms of action of proteins as well as homology, mutations, and evolutionary processes that generate structural and functional changes over time. Although the information in the databases is greater every day, all bioinformatics tools continue to be constantly modified to improve performance that leads to more accurate predictions regarding protein functionality, which is why bioinformatics research remains a great challenge.
Collapse
|
8
|
Dhingra S, Sowdhamini R, Cadet F, Offmann B. A glance into the evolution of template-free protein structure prediction methodologies. Biochimie 2020; 175:85-92. [DOI: 10.1016/j.biochi.2020.04.026] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/24/2020] [Accepted: 04/27/2020] [Indexed: 11/26/2022]
|
9
|
Zhang L, Ma H, Qian W, Li H. Protein structure optimization using improved simulated annealing algorithm on a three-dimensional AB off-lattice model. Comput Biol Chem 2020; 85:107237. [PMID: 32109854 DOI: 10.1016/j.compbiolchem.2020.107237] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Revised: 02/11/2020] [Accepted: 02/15/2020] [Indexed: 01/01/2023]
Abstract
This paper proposed an improved simulated annealing (ISA) algorithm for protein structure optimization based on a three-dimensional AB off-lattice model. In the algorithm, we provided a general formula used for producing initial solution, and designed a multivariable disturbance term, relating to the parameters of simulated annealing and a tuned constant, to generate neighborhood solution. To avoid missing optimal solution, storage operation was performed in searching process. We applied the algorithm to test artificial protein sequences from literature and constructed a benchmark dataset consisting of 10 real protein sequences from the Protein Data Bank (PDB). Otherwise, we generated Cα space-filling model to represent protein folding conformation. The results indicate our algorithm outperforms the five methods before in searching lower energies of artificial protein sequences. In the testing on real proteins, our method can achieve the energy conformations with Cα-RMSD less than 3.0 Å from the PDB structures. Moreover, Cα space-filling model may simulate dynamic change of protein folding conformation at atomic level.
Collapse
Affiliation(s)
- Lizhong Zhang
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China; College of Computer Science and Technology, Shenyang University of Chemical Technology, Shenyang 110142, China
| | - He Ma
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China; Key Laboratory of Medical Image Computing (Northeastern University), Ministry of Education, Shenyang 110169, China.
| | - Wei Qian
- Department of Electrical and Computer Engineering, College of Engineering, University of Texas, El Paso TX 79968, USA
| | - Haiyan Li
- College of Pharmaceutical and Bioengineering, Shenyang University of Chemical Technology, Shenyang 110142, China
| |
Collapse
|
10
|
Abstract
A class of secondary structure prediction algorithms use the information from the statistics of the residue pairs found in secondary structural elements. Because the protein folding process is dominated by backbone hydrogen bonding, an approach based on backbone hydrogen-bonded residue pairings would improve the predicting capabilities of these class algorithms. The reliability of the prediction algorithms depends on the quality of the statistics, therefore, of the data set. In this study, it was aimed to determine the propensities of the backbone hydrogen-bonded residue pairings for secondary structural elements of α-helix and β-sheet in globular proteins using a new and comprehensive data set created from the peptides deposited in Worldwide Protein Data Bank. A master data set including 4882 globular peptide chains with resolution better than 2.5 Å, sequence identity smaller than 25% and length of no shorter than 100 residues were created. Separate data sub sets also were created for helix and sheet structures from master set and each sub set includes 4594 and 4483 chains, respectively. Backbone hydrogen-bonded residue pairings in helices and sheets were detected and the propensities of them were represented as odds ratios (observed/[random or expected]) in matrices. Propensities assigned by this study to the residue pairings in secondary structural elements (as helix, overall strands, parallel strands and antiparallel strands) differ from the previous studies by 19 to 34%. These dissimilarities are important and they would cause further improvements in secondary structure prediction algorithms.
Collapse
Affiliation(s)
- Cevdet Nacar
- Department of Biophysics, School of Medicine, Marmara University, Istanbul, Turkey.
| |
Collapse
|
11
|
Issa NT, Badiavas EV, Schürer S. Research Techniques Made Simple: Molecular Docking in Dermatology - A Foray into In Silico Drug Discovery. J Invest Dermatol 2019; 139:2400-2408.e1. [PMID: 31753122 DOI: 10.1016/j.jid.2019.06.129] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 06/05/2019] [Accepted: 06/17/2019] [Indexed: 11/22/2022]
Abstract
Drug discovery is a complex process with many potential pitfalls. To go to market, a drug must undergo extensive preclinical optimization followed by clinical trials to establish its efficacy and minimize toxicity and adverse events. The process can take 10-15 years and command vast research and development resources costing over $1 billion. The success rates for new drug approvals in the United States are < 15%, and investment costs often cannot be recouped. With the increasing availability of large public datasets (big data) and computational capabilities, data science is quickly becoming a key component of the drug discovery pipeline. One such computational method, large-scale molecular modeling, is critical in the preclinical hit and lead identification process. Molecular modeling involves the study of the chemical structure of a drug and how it interacts with a potential disease-relevant target, as well as predicting its ADMET properties. The scope of molecular modeling is wide and complex. Here we specifically discuss docking, a tool commonly employed for studying drug-target interactions. Docking allows for the systematic exploration of how a drug interacts at a protein binding site and allows for the rank-ordering of drug libraries for prioritization in subsequent studies. This process can be efficiently used to virtually screen libraries containing over millions of compounds.
Collapse
Affiliation(s)
- Naiem T Issa
- Dr. Phillip Frost Department of Dermatology and Cutaneous Surgery, University of Miami School of Medicine, Miami, Florida, USA.
| | - Evangelos V Badiavas
- Dr. Phillip Frost Department of Dermatology and Cutaneous Surgery, University of Miami School of Medicine, Miami, Florida, USA
| | - Stephan Schürer
- Department of Molecular and Cellular Pharmacology, University of Miami School of Medicine, Miami, Florida, USA
| |
Collapse
|
12
|
Gandhi J, Antonelli AC, Afridi A, Vatsia S, Joshi G, Romanov V, Murray IVJ, Khan SA. Protein misfolding and aggregation in neurodegenerative diseases: a review of pathogeneses, novel detection strategies, and potential therapeutics. Rev Neurosci 2019; 30:339-358. [PMID: 30742586 DOI: 10.1515/revneuro-2016-0035] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Accepted: 08/03/2018] [Indexed: 12/13/2022]
Abstract
Protein folding is a complex, multisystem process characterized by heavy molecular and cellular footprints. Chaperone machinery enables proper protein folding and stable conformation. Other pathways concomitant with the protein folding process include transcription, translation, post-translational modifications, degradation through the ubiquitin-proteasome system, and autophagy. As such, the folding process can go awry in several different ways. The pathogenic basis behind most neurodegenerative diseases is that the disruption of protein homeostasis (i.e. proteostasis) at any level will eventually lead to protein misfolding. Misfolded proteins often aggregate and accumulate to trigger neurotoxicity through cellular stress pathways and consequently cause neurodegenerative diseases. The manifestation of a disease is usually dependent on the specific brain region that the neurotoxicity affects. Neurodegenerative diseases are age-associated, and their incidence is expected to rise as humans continue to live longer and pursue a greater life expectancy. We presently review the sequelae of protein misfolding and aggregation, as well as the role of these phenomena in several neurodegenerative diseases including Alzheimer's disease, Huntington's disease, amyotrophic lateral sclerosis, Parkinson's disease, transmissible spongiform encephalopathies, and spinocerebellar ataxia. Strategies for treatment and therapy are also conferred with respect to impairing, inhibiting, or reversing protein misfolding.
Collapse
Affiliation(s)
- Jason Gandhi
- Department of Physiology and Biophysics, Stony Brook University School of Medicine, 101 Nicolls Road, Health Sciences Center, Stony Brook, NY 11794-8434, USA.,Medical Student Research Institute, St. George's University School of Medicine, Grenada, West Indies
| | - Anthony C Antonelli
- Department of Pathology, Stony Brook University School of Medicine, 101 Nicolls Road, Health Sciences Center, Stony Brook, NY 11794-8434, USA
| | - Adil Afridi
- Department of Physiology and Biophysics, Stony Brook University School of Medicine, 101 Nicolls Road, Health Sciences Center, Stony Brook, NY 11794-8434, USA
| | - Sohrab Vatsia
- Department of Cardiothoracic Surgery, Lenox Hill Hospital, 130 East 77th Street, New York, NY 10075, USA
| | - Gunjan Joshi
- Department of Internal Medicine, Stony Brook Southampton Hospital, 240 Meeting House Lane, Southampton, NY 11968, USA
| | - Victor Romanov
- Department of Urology, Health Sciences Center T9-040, Stony Brook University School of Medicine, 101 Nicolls Road, Stony Brook, NY 11794-8093, USA
| | - Ian V J Murray
- Department of Physiology and Neuroscience, St. George's University School of Medicine, Grenada, West Indies
| | - Sardar Ali Khan
- Department of Physiology and Biophysics, Stony Brook University School of Medicine, 101 Nicolls Road, Health Sciences Center, Stony Brook, NY 11794-8434, USA.,Department of Urology, Health Sciences Center T9-040, Stony Brook University School of Medicine, 101 Nicolls Road, Stony Brook, NY 11794-8093, USA
| |
Collapse
|
13
|
Molecular simulation of peptides coming of age: Accurate prediction of folding, dynamics and structures. Arch Biochem Biophys 2019; 664:76-88. [DOI: 10.1016/j.abb.2019.01.033] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 01/23/2019] [Accepted: 01/28/2019] [Indexed: 12/24/2022]
|
14
|
Latham AP, Zhang B. Improving Coarse-Grained Protein Force Fields with Small-Angle X-ray Scattering Data. J Phys Chem B 2019; 123:1026-1034. [PMID: 30620594 DOI: 10.1021/acs.jpcb.8b10336] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Small-angle X-ray scattering (SAXS) experiments provide valuable structural data for biomolecules in solution. We develop a highly efficient maximum entropy approach to fit SAXS data by introducing minimal biases to a coarse-grained protein force field, the associative memory, water mediated, structure, and energy model (AWSEM). We demonstrate that the resulting force field, AWSEM-SAXS, succeeds in reproducing scattering profiles and models protein structures with shapes that are in much better agreement with experimental results. Quantitative metrics further reveal a modest, but consistent, improvement in the accuracy of modeled structures when SAXS data are incorporated into the force field. Additionally, when applied to a multiconformational protein, we find that AWSEM-SAXS is able to recover the population of different protein conformations from SAXS data alone. We, therefore, conclude that the maximum entropy approach is effective in fine-tuning the force field to better characterize both protein structure and conformational fluctuation.
Collapse
Affiliation(s)
- Andrew P Latham
- Department of Chemistry , Massachusetts Institute of Technology , Cambridge , Massachusetts 02139 , United States
| | - Bin Zhang
- Department of Chemistry , Massachusetts Institute of Technology , Cambridge , Massachusetts 02139 , United States
| |
Collapse
|
15
|
Mass Spectrometry- and Computational Structural Biology-Based Investigation of Proteins and Peptides. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2019; 1140:265-287. [PMID: 31347053 DOI: 10.1007/978-3-030-15950-4_15] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Recent developments of mass spectrometry (MS) allow us to identify, estimate, and characterize proteins and protein complexes. At the same time, structural biology helps to determine the protein structure and its structure-function relationship. Together, they aid to understand the protein structure, property, function, protein-complex assembly, protein-protein interaction, and dynamics. The present chapter is organized with illustrative results to demonstrate how experimental mass spectrometry can be combined with computational structural biology for detailed studies of protein's structures. We have used tumor differentiation factor protein/peptide as ligand and Hsp70/Hsp90 as receptor protein as examples to study ligand-protein interaction. To investigate possible protein conformation, we will describe two proteins-lysozyme and myoglobin. As an application of MS-based assignment of disulfide bridges, the case of the spider venom polypeptide Phα1β will also be discussed.
Collapse
|
16
|
Trevizani R, Custódio FL. Supersecondary Structures and Fragment Libraries. Methods Mol Biol 2019; 1958:283-295. [PMID: 30945224 DOI: 10.1007/978-1-4939-9161-7_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The use of smotifs and fragment libraries has proven useful to both simplify and increase the quality of protein models. Here, we present Profrager, a tool that automatically generates putative structural fragments to reproduce local motifs of proteins given a target sequence. Profrager is highly customizable, allowing the user to select the number of fragments per library, the ranking method is able to generate fragments of all sizes, and it was recently modified to include the possibility of output exclusively smotifs.
Collapse
|
17
|
Experimental accuracy in protein structure refinement via molecular dynamics simulations. Proc Natl Acad Sci U S A 2018; 115:13276-13281. [PMID: 30530696 DOI: 10.1073/pnas.1811364115] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Refinement is the last step in protein structure prediction pipelines to convert approximate homology models to experimental accuracy. Protocols based on molecular dynamics (MD) simulations have shown promise, but current methods are limited to moderate levels of consistent refinement. To explore the energy landscape between homology models and native structures and analyze the challenges of MD-based refinement, eight test cases were studied via extensive simulations followed by Markov state modeling. In all cases, native states were found very close to the experimental structures and at the lowest free energies, but refinement was hindered by a rough energy landscape. Transitions from the homology model to the native states require the crossing of significant kinetic barriers on at least microsecond time scales. A significant energetic driving force toward the native state was lacking until its immediate vicinity, and there was significant sampling of off-pathway states competing for productive refinement. The role of recent force field improvements is discussed and transition paths are analyzed in detail to inform which key transitions have to be overcome to achieve successful refinement.
Collapse
|
18
|
Faraggi E, Krupa P, Mozolewska MA, Liwo A, Kloczkowski A. Reoptimized UNRES Potential for Protein Model Quality Assessment. Genes (Basel) 2018; 9:genes9120601. [PMID: 30513992 PMCID: PMC6315818 DOI: 10.3390/genes9120601] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 11/25/2018] [Accepted: 11/27/2018] [Indexed: 11/16/2022] Open
Abstract
Ranking protein structure models is an elusive problem in bioinformatics. These models are evaluated on both the degree of similarity to the native structure and the folding pathway. Here, we simulated the use of the coarse-grained UNited RESidue (UNRES) force field as a tool to choose the best protein structure models for a given protein sequence among a pool of candidate models, using server data from the CASP11 experiment. Because the original UNRES was optimized for Molecular Dynamics simulations, we reoptimized UNRES using a deep feed-forward neural network, and we show that introducing additional descriptive features can produce better results. Overall, we found that the reoptimized UNRES performs better in selecting the best structures and tracking protein unwinding from its native state. We also found a relatively poor correlation between UNRES values and the model’s Template Modeling Score (TMS). This is remedied by reoptimization. We discuss some cases where our reoptimization procedure is useful.
Collapse
Affiliation(s)
- Eshel Faraggi
- Research and Information Systems, LLC, Indianapolis, IN 46240, USA.
- Department of Physics, Indiana University Purdue University Indianapolis, Indianapolis, IN 46202, USA.
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, OH 43215, USA.
| | - Pawel Krupa
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, OH 43215, USA.
- Institute of Physics, Polish Academy of Sciences, Al. Lotnikow 32/46, PL-02-668 Warsaw, Poland.
| | - Magdalena A Mozolewska
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, OH 43215, USA.
- Institute of Computer Science, Polish Academy of Sciences, ul. Jana Kazimierza 5, 01-248 Warszawa, Poland.
| | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-308 Gdańsk, Poland.
- Center for In Silico Protein Structure and School of Computational Sciences, Korea Institute for Advanced Study, 85 Hoegiro, Dongdaemun-gu, Seoul 130-722, Korea.
| | - Andrzej Kloczkowski
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, OH 43215, USA.
- Department of Pediatrics, The Ohio State University, Columbus, OH 43215, USA.
- Kavli Institute for Theoretical Physics China, Chinese Academy of Sciences, Beijing 100190, China.
| |
Collapse
|
19
|
Song S, Ji J, Chen X, Gao S, Tang Z, Todo Y. Adoption of an improved PSO to explore a compound multi-objective energy function in protein structure prediction. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2018.07.042] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
20
|
Delarue M, Koehl P. Combined approaches from physics, statistics, and computer science for ab initio protein structure prediction: ex unitate vires (unity is strength)? F1000Res 2018; 7. [PMID: 30079234 PMCID: PMC6058471 DOI: 10.12688/f1000research.14870.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/19/2018] [Indexed: 11/20/2022] Open
Abstract
Connecting the dots among the amino acid sequence of a protein, its structure, and its function remains a central theme in molecular biology, as it would have many applications in the treatment of illnesses related to misfolding or protein instability. As a result of high-throughput sequencing methods, biologists currently live in a protein sequence-rich world. However, our knowledge of protein structure based on experimental data remains comparatively limited. As a consequence, protein structure prediction has established itself as a very active field of research to fill in this gap. This field, once thought to be reserved for theoretical biophysicists, is constantly reinventing itself, borrowing ideas informed by an ever-increasing assembly of scientific domains, from biology, chemistry, (statistical) physics, mathematics, computer science, statistics, bioinformatics, and more recently data sciences. We review the recent progress arising from this integration of knowledge, from the development of specific computer architecture to allow for longer timescales in physics-based simulations of protein folding to the recent advances in predicting contacts in proteins based on detection of coevolution using very large data sets of aligned protein sequences.
Collapse
Affiliation(s)
- Marc Delarue
- Unité Dynamique Structurale des Macromolécules, Institut Pasteur, and UMR 3528 du CNRS, Paris, France
| | - Patrice Koehl
- Department of Computer Science, Genome Center, University of California, Davis, Davis, California, USA
| |
Collapse
|
21
|
Dutagaci B, Heo L, Feig M. Structure refinement of membrane proteins via molecular dynamics simulations. Proteins 2018; 86:738-750. [PMID: 29675899 PMCID: PMC6013386 DOI: 10.1002/prot.25508] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Revised: 04/09/2018] [Accepted: 04/14/2018] [Indexed: 12/12/2022]
Abstract
A refinement protocol based on physics-based techniques established for water soluble proteins is tested for membrane protein structures. Initial structures were generated by homology modeling and sampled via molecular dynamics simulations in explicit lipid bilayer and aqueous solvent systems. Snapshots from the simulations were selected based on scoring with either knowledge-based or implicit membrane-based scoring functions and averaged to obtain refined models. The protocol resulted in consistent and significant refinement of the membrane protein structures similar to the performance of refinement methods for soluble proteins. Refinement success was similar between sampling in the presence of lipid bilayers and aqueous solvent but the presence of lipid bilayers may benefit the improvement of lipid-facing residues. Scoring with knowledge-based functions (DFIRE and RWplus) was found to be as good as scoring using implicit membrane-based scoring functions suggesting that differences in internal packing is more important than orientations relative to the membrane during the refinement of membrane protein homology models.
Collapse
Affiliation(s)
- Bercem Dutagaci
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Lim Heo
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
22
|
Nerli S, McShan AC, Sgourakis NG. Chemical shift-based methods in NMR structure determination. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2018; 106-107:1-25. [PMID: 31047599 PMCID: PMC6788782 DOI: 10.1016/j.pnmrs.2018.03.002] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Revised: 03/09/2018] [Accepted: 03/09/2018] [Indexed: 05/08/2023]
Abstract
Chemical shifts are highly sensitive probes harnessed by NMR spectroscopists and structural biologists as conformational parameters to characterize a range of biological molecules. Traditionally, assignment of chemical shifts has been a labor-intensive process requiring numerous samples and a suite of multidimensional experiments. Over the past two decades, the development of complementary computational approaches has bolstered the analysis, interpretation and utilization of chemical shifts for elucidation of high resolution protein and nucleic acid structures. Here, we review the development and application of chemical shift-based methods for structure determination with a focus on ab initio fragment assembly, comparative modeling, oligomeric systems, and automated assignment methods. Throughout our discussion, we point out practical uses, as well as advantages and caveats, of using chemical shifts in structure modeling. We additionally highlight (i) hybrid methods that employ chemical shifts with other types of NMR restraints (residual dipolar couplings, paramagnetic relaxation enhancements and pseudocontact shifts) that allow for improved accuracy and resolution of generated 3D structures, (ii) the utilization of chemical shifts to model the structures of sparsely populated excited states, and (iii) modeling of sidechain conformations. Finally, we briefly discuss the advantages of contemporary methods that employ sparse NMR data recorded using site-specific isotope labeling schemes for chemical shift-driven structure determination of larger molecules. With this review, we aim to emphasize the accessibility and versatility of chemical shifts for structure determination of challenging biological systems, and to point out emerging areas of development that lead us towards the next generation of tools.
Collapse
Affiliation(s)
- Santrupti Nerli
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, United States; Department of Computer Science, University of California Santa Cruz, Santa Cruz, CA 95064, United States
| | - Andrew C McShan
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, United States
| | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, United States.
| |
Collapse
|
23
|
AIMOES: Archive information assisted multi-objective evolutionary strategy for ab initio protein structure prediction. Knowl Based Syst 2018. [DOI: 10.1016/j.knosys.2018.01.028] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
24
|
Sachdeva G, Myhrvold C, Yin P, Silver PA. Synthetic RNA Scaffolds for Spatial Engineering in Cells. Synth Biol (Oxf) 2018. [DOI: 10.1002/9783527688104.ch13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Affiliation(s)
- Gairik Sachdeva
- Harvard John A. Paulson School of Engineering and Applied Sciences, 29 Oxford Street; Cambridge MA 02138 USA
- Harvard University; Wyss Institute for Biologically Inspired Engineering, 3 Blackfan Circle; Boston MA 02115 USA
- Harvard Medical School; Department of Systems Biology, 200 Longwood Avenue; Boston MA 02115 USA
| | - Cameron Myhrvold
- Harvard University; Wyss Institute for Biologically Inspired Engineering, 3 Blackfan Circle; Boston MA 02115 USA
- Harvard Medical School; Department of Systems Biology, 200 Longwood Avenue; Boston MA 02115 USA
| | - Peng Yin
- Harvard University; Wyss Institute for Biologically Inspired Engineering, 3 Blackfan Circle; Boston MA 02115 USA
| | - Pamela A. Silver
- Harvard University; Wyss Institute for Biologically Inspired Engineering, 3 Blackfan Circle; Boston MA 02115 USA
- Harvard Medical School; Department of Systems Biology, 200 Longwood Avenue; Boston MA 02115 USA
| |
Collapse
|
25
|
Moult J, Fidelis K, Kryshtafovych A, Schwede T, Tramontano A. Critical assessment of methods of protein structure prediction (CASP)-Round XII. Proteins 2018; 86 Suppl 1:7-15. [PMID: 29082672 PMCID: PMC5897042 DOI: 10.1002/prot.25415] [Citation(s) in RCA: 245] [Impact Index Per Article: 40.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Revised: 10/25/2017] [Accepted: 10/27/2017] [Indexed: 12/24/2022]
Abstract
This article reports the outcome of the 12th round of Critical Assessment of Structure Prediction (CASP12), held in 2016. CASP is a community experiment to determine the state of the art in modeling protein structure from amino acid sequence. Participants are provided sequence information and in turn provide protein structure models and related information. Analysis of the submitted structures by independent assessors provides a comprehensive picture of the capabilities of current methods, and allows progress to be identified. This was again an exciting round of CASP, with significant advances in 4 areas: (i) The use of new methods for predicting three-dimensional contacts led to a two-fold improvement in contact accuracy. (ii) As a consequence, model accuracy for proteins where no template was available improved dramatically. (iii) Models based on a structural template showed overall improvement in accuracy. (iv) Methods for estimating the accuracy of a model continued to improve. CASP continued to develop new areas: (i) Assessing methods for building quaternary structure models, including an expansion of the collaboration between CASP and CAPRI. (ii) Modeling with the aid of experimental data was extended to include SAXS data, as well as again using chemical cross-linking information. (iii) A team of assessors evaluated the suitability of models for a range of applications, including mutation interpretation, analysis of ligand binding properties, and identification of interfaces. This article describes the experiment and summarizes the results. The rest of this special issue of PROTEINS contains papers describing CASP12 results and assessments in more detail.
Collapse
Affiliation(s)
- John Moult
- Institute for Bioscience and Biotechnology Research and Department of Cell Biology and Molecular Genetics, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
| | - Krzysztof Fidelis
- Genome Center, University of California, Davis, 451 Health Sciences Drive, Davis, CA 95616, USA
| | - Andriy Kryshtafovych
- Genome Center, University of California, Davis, 451 Health Sciences Drive, Davis, CA 95616, USA
| | - Torsten Schwede
- University of Basel, Biozentrum & SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Anna Tramontano
- Department of Physics and Istituto Pasteur - Fondazione Cenci Bolognetti, Sapienza University of Rome, P.le Aldo Moro, 5, 00185 Rome, Italy
| |
Collapse
|
26
|
Goethe M, Gleixner J, Fita I, Rubi JM. Prediction of Protein Configurational Entropy (Popcoen). J Chem Theory Comput 2018; 14:1811-1819. [PMID: 29351717 DOI: 10.1021/acs.jctc.7b01079] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
A knowledge-based method for configurational entropy prediction of proteins is presented; this methodology is extremely fast, compared to previous approaches, because it does not involve any type of configurational sampling. Instead, the configurational entropy of a query fold is estimated by evaluating an artificial neural network, which was trained on molecular-dynamics simulations of ∼1000 proteins. The predicted entropy can be incorporated into a large class of protein software based on cost-function minimization/evaluation, in which configurational entropy is currently neglected for performance reasons. Software of this type is used for all major protein tasks such as structure predictions, proteins design, NMR and X-ray refinement, docking, and mutation effect predictions. Integrating the predicted entropy can yield a significant accuracy increase as we show exemplarily for native-state identification with the prominent protein software FoldX. The method has been termed Popcoen for Prediction of Protein Configurational Entropy. An implementation is freely available at http://fmc.ub.edu/popcoen/ .
Collapse
Affiliation(s)
- Martin Goethe
- Department of Condensed Matter Physics , University of Barcelona , Carrer Martí i Franqués 1 , 08028 Barcelona , Spain
| | - Jan Gleixner
- Faculty of Biosciences , Heidelberg University , Im Neuenheimer Feld 234 , 69120 Heidelberg , Germany
| | - Ignacio Fita
- Molecular Biology Institute of Barcelona (IBMB-CSIC, Maria de Maeztu Unit of Excellence) , Carrer Baldiri Reixac 4-8 , 08028 Barcelona , Spain
| | | |
Collapse
|
27
|
Li B, Fooksa M, Heinze S, Meiler J. Finding the needle in the haystack: towards solving the protein-folding problem computationally. Crit Rev Biochem Mol Biol 2018; 53:1-28. [PMID: 28976219 PMCID: PMC6790072 DOI: 10.1080/10409238.2017.1380596] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Revised: 08/22/2017] [Accepted: 09/13/2017] [Indexed: 12/22/2022]
Abstract
Prediction of protein tertiary structures from amino acid sequence and understanding the mechanisms of how proteins fold, collectively known as "the protein folding problem," has been a grand challenge in molecular biology for over half a century. Theories have been developed that provide us with an unprecedented understanding of protein folding mechanisms. However, computational simulation of protein folding is still difficult, and prediction of protein tertiary structure from amino acid sequence is an unsolved problem. Progress toward a satisfying solution has been slow due to challenges in sampling the vast conformational space and deriving sufficiently accurate energy functions. Nevertheless, several techniques and algorithms have been adopted to overcome these challenges, and the last two decades have seen exciting advances in enhanced sampling algorithms, computational power and tertiary structure prediction methodologies. This review aims at summarizing these computational techniques, specifically conformational sampling algorithms and energy approximations that have been frequently used to study protein-folding mechanisms or to de novo predict protein tertiary structures. We hope that this review can serve as an overview on how the protein-folding problem can be studied computationally and, in cases where experimental approaches are prohibitive, help the researcher choose the most relevant computational approach for the problem at hand. We conclude with a summary of current challenges faced and an outlook on potential future directions.
Collapse
Affiliation(s)
- Bian Li
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Michaela Fooksa
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Chemical and Physical Biology Graduate Program, Vanderbilt University, Nashville, TN, USA
| | - Sten Heinze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
28
|
Wang T, Yang Y, Zhou Y, Gong H. LRFragLib: an effective algorithm to identify fragments for de novo protein structure prediction. Bioinformatics 2017; 33:677-684. [PMID: 27797773 DOI: 10.1093/bioinformatics/btw668] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Accepted: 10/18/2016] [Indexed: 11/13/2022] Open
Abstract
Motivation The quality of fragment library determines the efficiency of fragment assembly, an approach that is widely used in most de novo protein-structure prediction algorithms. Conventional fragment libraries are constructed mainly based on the identities of amino acids, sometimes facilitated by predicted information including dihedral angles and secondary structures. However, it remains challenging to identify near-native fragment structures with low sequence homology. Results We introduce a novel fragment-library-construction algorithm, LRFragLib, to improve the detection of near-native low-homology fragments of 7-10 residues, using a multi-stage, flexible selection protocol. Based on logistic regression scoring models, LRFragLib outperforms existing techniques by achieving a significantly higher precision and a comparable coverage on recent CASP protein sets in sampling near-native structures. The method also has a comparable computational efficiency to the fastest existing techniques with substantially reduced memory usage. Availability and Implementation The source code is available for download at http://166.111.152.91/Downloads.html. Contact hgong@tsinghua.edu.cn. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tong Wang
- MOE Key Laboratory of Bioinformatics, School of Life Sciences.,Beijing Innovation Center of Structural Biology, Tsinghua University, Beijing 100084, China
| | - Yuedong Yang
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Gold Coast, QLD 4222, Australia
| | - Yaoqi Zhou
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Gold Coast, QLD 4222, Australia
| | - Haipeng Gong
- MOE Key Laboratory of Bioinformatics, School of Life Sciences.,Beijing Innovation Center of Structural Biology, Tsinghua University, Beijing 100084, China
| |
Collapse
|
29
|
Feig M. Computational protein structure refinement: Almost there, yet still so far to go. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL MOLECULAR SCIENCE 2017; 7:e1307. [PMID: 30613211 PMCID: PMC6319934 DOI: 10.1002/wcms.1307] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Protein structures are essential in modern biology yet experimental methods are far from being able to catch up with the rapid increase in available genomic data. Computational protein structure prediction methods aim to fill the gap while the role of protein structure refinement is to take approximate initial template-based models and bring them closer to the true native structure. Current methods for computational structure refinement rely on molecular dynamics simulations, related sampling methods, or iterative structure optimization protocols. The best methods are able to achieve moderate degrees of refinement but consistent refinement that can reach near-experimental accuracy remains elusive. Key issues revolve around the accuracy of the energy function, the inability to reliably rank multiple models, and the use of restraints that keep sampling close to the native state but also limit the degree of possible refinement. A different aspect is the question of what exactly the target of high-resolution refinement should be as experimental structures are affected by experimental conditions and different biological questions require varying levels of accuracy. While improvement of the global protein structure is a difficult problem, high-resolution refinement methods that improves local structural quality such as favorable stereochemistry and the avoidance of atomic clashes are much more successful.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, 603 Wilson Rd., Room 218 BCH, East Lansing, MI, USA, ; 517-432-7439
| |
Collapse
|
30
|
Critical Features of Fragment Libraries for Protein Structure Prediction. PLoS One 2017; 12:e0170131. [PMID: 28085928 PMCID: PMC5235372 DOI: 10.1371/journal.pone.0170131] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Accepted: 12/29/2016] [Indexed: 11/19/2022] Open
Abstract
The use of fragment libraries is a popular approach among protein structure prediction methods and has proven to substantially improve the quality of predicted structures. However, some vital aspects of a fragment library that influence the accuracy of modeling a native structure remain to be determined. This study investigates some of these features. Particularly, we analyze the effect of using secondary structure prediction guiding fragments selection, different fragments sizes and the effect of structural clustering of fragments within libraries. To have a clearer view of how these factors affect protein structure prediction, we isolated the process of model building by fragment assembly from some common limitations associated with prediction methods, e.g., imprecise energy functions and optimization algorithms, by employing an exact structure-based objective function under a greedy algorithm. Our results indicate that shorter fragments reproduce the native structure more accurately than the longer. Libraries composed of multiple fragment lengths generate even better structures, where longer fragments show to be more useful at the beginning of the simulations. The use of many different fragment sizes shows little improvement when compared to predictions carried out with libraries that comprise only three different fragment sizes. Models obtained from libraries built using only sequence similarity are, on average, better than those built with a secondary structure prediction bias. However, we found that the use of secondary structure prediction allows greater reduction of the search space, which is invaluable for prediction methods. The results of this study can be critical guidelines for the use of fragment libraries in protein structure prediction.
Collapse
|
31
|
Isvoran A. Online Molecular Docking Resources. Oncology 2017. [DOI: 10.4018/978-1-5225-0549-5.ch037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This chapter aims to present the available online resources that are used for protein modeling with accent to online molecular docking resources. SwissDock, MTiAutoDock, and PatchDock online docking tools are described and a few illustrative examples concerning the molecular docking studies for the cytochrom P450 interactions with the fungicide difenoconazole. The results obtained using different servers based on distinct approaches are compared and the advantages and/or disadvantages of every server are illustrated.
Collapse
|
32
|
Identification and characterization of protein coding genes in monsonia (Monsonia burkeana Planch. ex harv) using a combination of approaches. Genes Genomics 2016. [DOI: 10.1007/s13258-016-0499-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
33
|
|
34
|
Gebala M, Bonilla S, Bisaria N, Herschlag D. Does Cation Size Affect Occupancy and Electrostatic Screening of the Nucleic Acid Ion Atmosphere? J Am Chem Soc 2016; 138:10925-34. [PMID: 27479701 PMCID: PMC5010015 DOI: 10.1021/jacs.6b04289] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Indexed: 01/14/2023]
Abstract
Electrostatics are central to all aspects of nucleic acid behavior, including their folding, condensation, and binding to other molecules, and the energetics of these processes are profoundly influenced by the ion atmosphere that surrounds nucleic acids. Given the highly complex and dynamic nature of the ion atmosphere, understanding its properties and effects will require synergy between computational modeling and experiment. Prior computational models and experiments suggest that cation occupancy in the ion atmosphere depends on the size of the cation. However, the computational models have not been independently tested, and the experimentally observed effects were small. Here, we evaluate a computational model of ion size effects by experimentally testing a blind prediction made from that model, and we present additional experimental results that extend our understanding of the ion atmosphere. Giambasu et al. developed and implemented a three-dimensional reference interaction site (3D-RISM) model for monovalent cations surrounding DNA and RNA helices, and this model predicts that Na(+) would outcompete Cs(+) by 1.8-2.1-fold; i.e., with Cs(+) in 2-fold excess of Na(+) the ion atmosphere would contain an equal number of each cation (Nucleic Acids Res. 2015, 43, 8405). However, our ion counting experiments indicate that there is no significant preference for Na(+) over Cs(+). There is an ∼25% preferential occupancy of Li(+) over larger cations in the ion atmosphere but, counter to general expectations from existing models, no size dependence for the other alkali metal ions. Further, we followed the folding of the P4-P6 RNA and showed that differences in folding with different alkali metal ions observed at high concentration arise from cation-anion interactions and not cation size effects. Overall, our results provide a critical test of a computational prediction, fundamental information about ion atmosphere properties, and parameters that will aid in the development of next-generation nucleic acid computational models.
Collapse
Affiliation(s)
- Magdalena Gebala
- Department
of Biochemistry, Stanford University, Stanford, California 94305, United States
| | - Steve Bonilla
- Department
of Chemical Engineering, Stanford University, Stanford, California 94305, United States
| | - Namita Bisaria
- Department
of Biochemistry, Stanford University, Stanford, California 94305, United States
| | - Daniel Herschlag
- Department
of Biochemistry, Stanford University, Stanford, California 94305, United States
- Department
of Chemistry, Stanford University, Stanford, California 94305, United States
- ChEM-H
Institute, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
35
|
Kolodny R, Guibas L, Levitt M, Koehl P. Inverse Kinematics in Biology: The Protein Loop Closure Problem. Int J Rob Res 2016. [DOI: 10.1177/0278364905050352] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Assembling fragments from known protein structures is a widely used approach to construct structural models for new proteins. We describe an application of this idea to an important inverse kinematics problem in structural biology: the loop closure problem. We have developed an algorithm for generating the conformations of candidate loops that fit in a gap of given length in a protein structure framework. Our method proceeds by concatenating small fragments of protein chosen from small libraries of representative fragments. Our approach has the advantages of ab initio methods since we are able to enumerate all candidate loops in the discrete approximation of the conformational space accessible to the loop, as well as the advantages of database search approach since the use of fragments of known protein structures guarantees that the backbone conformations are physically reasonable. We test our approach on a set of 427 loops, varying in length from four residues to 14 residues. The quality of the candidate loops is evaluated in terms of global coordinate root mean square (cRMS). The top predictions vary between 0.3 and 4.2 Å for four-residue loops and between 1.5 and 3.1 Å for 14-residue loops, respectively.
Collapse
Affiliation(s)
- Rachel Kolodny
- Department of Structural Biology and Computer Science Department, Stanford University, Stanford, CA 94305, USA,
| | - Leonidas Guibas
- Computer Science Department, Stanford University, Stanford, CA 94305, USA
| | - Michael Levitt
- Department of Structural Biology, Stanford University, Stanford, CA 94305, USA
| | - Patrice Koehl
- Department of Structural Biology, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
36
|
The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations. Int J Mol Sci 2016; 17:ijms17050612. [PMID: 27128902 PMCID: PMC4881441 DOI: 10.3390/ijms17050612] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Revised: 04/13/2016] [Accepted: 04/13/2016] [Indexed: 02/01/2023] Open
Abstract
As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD) simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD) simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5-10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design.
Collapse
|
37
|
Tavousi P, Behandish M, Ilieş HT, Kazerounian K. Protofold II: Enhanced Model and Implementation for Kinetostatic Protein Folding1. J Nanotechnol Eng Med 2016. [DOI: 10.1115/1.4032759] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
A reliable prediction of three-dimensional (3D) protein structures from sequence data remains a big challenge due to both theoretical and computational difficulties. We have previously shown that our kinetostatic compliance method (KCM) implemented into the Protofold package can overcome some of the key difficulties faced by other de novo structure prediction methods, such as the very small time steps required by the molecular dynamics (MD) approaches or the very large number of samples needed by the Monte Carlo (MC) sampling techniques. In this paper, we improve the free energy formulation used in Protofold by including the typically underrated entropic effects, imparted due to differences in hydrophobicity of the chemical groups, which dominate the folding of most water-soluble proteins. In addition to the model enhancement, we revisit the numerical implementation by redesigning the algorithms and introducing efficient data structures that reduce the expected complexity from quadratic to linear. Moreover, we develop and optimize parallel implementations of the algorithms on both central and graphics processing units (CPU/GPU) achieving speed-ups up to two orders of magnitude on the GPU. Our simulations are consistent with the general behavior observed in the folding process in aqueous solvent, confirming the effectiveness of model improvements. We report on the folding process at multiple levels, namely, the formation of secondary structural elements and tertiary interactions between secondary elements or across larger domains. We also observe significant enhancements in running times that make the folding simulation tractable for large molecules.
Collapse
Affiliation(s)
- Pouya Tavousi
- Kinematics Design Laboratory, Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269 e-mail:
| | - Morad Behandish
- Computational Design Laboratory, Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269 e-mail:
| | - Horea T. Ilieş
- Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269 e-mail:
| | - Kazem Kazerounian
- Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269 e-mail:
| |
Collapse
|
38
|
Design of Self-Assembling Protein-Polymer Conjugates. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 940:179-214. [PMID: 27677514 DOI: 10.1007/978-3-319-39196-0_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Protein-polymer conjugates are of particular interest for nanobiotechnology applications because of the various and complementary roles that each component may play in composite hybrid-materials. This chapter focuses on the design principles and applications of self-assembling protein-polymer conjugate materials. We address the general design methodology, from both synthetic and genetic perspective, conjugation strategies, protein vs. polymer driven self-assembly and finally, emerging applications for conjugate materials. By marrying proteins and polymers into conjugated bio-hybrid materials, materials scientists, chemists, and biologists alike, have at their fingertips a vast toolkit for material design. These inherently hierarchical structures give rise to useful patterning, mechanical and transport properties that may help realize new, more efficient materials for energy generation, catalysis, nanorobots, etc.
Collapse
|
39
|
Dalton JAR, Lans I, Rovira X, Malhaire F, Gómez-Santacana X, Pittolo S, Gorostiza P, Llebaria A, Goudet C, Pin JP, Giraldo J. Shining Light on an mGlu5 Photoswitchable NAM: A Theoretical Perspective. Curr Neuropharmacol 2016; 14:441-54. [PMID: 26391742 PMCID: PMC4983757 DOI: 10.2174/1570159x13666150407231417] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 03/06/2015] [Accepted: 04/04/2015] [Indexed: 02/07/2023] Open
Abstract
Metabotropic glutamate receptors (mGluRs) are important drug targets because of their involvement in several neurological diseases. Among mGluRs, mGlu5 is a particularly high-profile target because its positive or negative allosteric modulation can potentially treat schizophrenia or anxiety and chronic pain, respectively. Here, we computationally and experimentally probe the functional binding of a novel photoswitchable mGlu5 NAM, termed alloswitch-1, which loses its NAM functionality under violet light. We show alloswitch-1 binds deep in the allosteric pocket in a similar fashion to mavoglurant, the co-crystallized NAM in the mGlu5 transmembrane domain crystal structure. Alloswitch-1, like NAM 2-Methyl-6-(phenylethynyl)pyridine (MPEP), is significantly affected by P655M mutation deep in the allosteric pocket, eradicating its functionality. In MD simulations, we show alloswitch-1 and MPEP stabilize the co-crystallized water molecule located at the bottom of the allosteric site that is seemingly characteristic of the inactive receptor state. Furthermore, both NAMs form H-bonds with S809 on helix 7, which may constitute an important stabilizing interaction for NAM-induced mGlu5 inactivation. Alloswitch-1, through isomerization of its amide group from trans to cis is able to form an additional interaction with N747 on helix 5. This may be an important interaction for amide-containing mGlu5 NAMs, helping to stabilize their binding in a potentially unusual cis-amide state. Simulated conformational switching of alloswitch-1 in silico suggests photoisomerization of its azo group from trans to cis may be possible within the allosteric pocket. However, photoexcited alloswitch-1 binds in an unstable fashion, breaking H-bonds with the protein and destabilizing the co-crystallized water molecule. This suggests photoswitching may have destabilizing effects on mGlu5 binding and functionality.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Jesús Giraldo
- Laboratory of Molecular Neuropharmacology and Bioinformatics, Institut de Neurociències and Unitat de Bioestadística, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain.
| |
Collapse
|
40
|
Molecular Modeling and Its Applications in Protein Engineering. Synth Biol (Oxf) 2016. [DOI: 10.1007/978-3-319-22708-5_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
41
|
Hospital A, Goñi JR, Orozco M, Gelpí JL. Molecular dynamics simulations: advances and applications. Adv Appl Bioinform Chem 2015; 8:37-47. [PMID: 26604800 PMCID: PMC4655909 DOI: 10.2147/aabc.s70333] [Citation(s) in RCA: 240] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Molecular dynamics simulations have evolved into a mature technique that can be used effectively to understand macromolecular structure-to-function relationships. Present simulation times are close to biologically relevant ones. Information gathered about the dynamic properties of macromolecules is rich enough to shift the usual paradigm of structural bioinformatics from studying single structures to analyze conformational ensembles. Here, we describe the foundations of molecular dynamics and the improvements made in the direction of getting such ensemble. Specific application of the technique to three main issues (allosteric regulation, docking, and structure refinement) is discussed.
Collapse
Affiliation(s)
- Adam Hospital
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, University of Barcelona, Barcelona, Spain
| | - Josep Ramon Goñi
- Joint BSC-IRB Research Program in Computational Biology, University of Barcelona, Barcelona, Spain ; Barcelona Supercomputing Center, University of Barcelona, Barcelona, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, University of Barcelona, Barcelona, Spain ; Joint BSC-IRB Research Program in Computational Biology, University of Barcelona, Barcelona, Spain ; Barcelona Supercomputing Center, University of Barcelona, Barcelona, Spain ; Department of Biochemistry and Molecular Biology, University of Barcelona, Barcelona, Spain
| | - Josep L Gelpí
- Joint BSC-IRB Research Program in Computational Biology, University of Barcelona, Barcelona, Spain ; Barcelona Supercomputing Center, University of Barcelona, Barcelona, Spain ; Department of Biochemistry and Molecular Biology, University of Barcelona, Barcelona, Spain
| |
Collapse
|
42
|
Khor BY, Tye GJ, Lim TS, Choong YS. General overview on structure prediction of twilight-zone proteins. Theor Biol Med Model 2015; 12:15. [PMID: 26338054 PMCID: PMC4559291 DOI: 10.1186/s12976-015-0014-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 08/27/2015] [Indexed: 01/02/2023] Open
Abstract
Protein structure prediction from amino acid sequence has been one of the most challenging aspects in computational structural biology despite significant progress in recent years showed by critical assessment of protein structure prediction (CASP) experiments. When experimentally determined structures are unavailable, the predictive structures may serve as starting points to study a protein. If the target protein consists of homologous region, high-resolution (typically <1.5 Å) model can be built via comparative modelling. However, when confronted with low sequence similarity of the target protein (also known as twilight-zone protein, sequence identity with available templates is less than 30%), the protein structure prediction has to be initiated from scratch. Traditionally, twilight-zone proteins can be predicted via threading or ab initio method. Based on the current trend, combination of different methods brings an improved success in the prediction of twilight-zone proteins. In this mini review, the methods, progresses and challenges for the prediction of twilight-zone proteins were discussed.
Collapse
Affiliation(s)
- Bee Yin Khor
- Institute for Research in Molecular Medicine, Universiti Sains Malaysia, 11800, Minden, Penang, Malaysia.
| | - Gee Jun Tye
- Institute for Research in Molecular Medicine, Universiti Sains Malaysia, 11800, Minden, Penang, Malaysia.
| | - Theam Soon Lim
- Institute for Research in Molecular Medicine, Universiti Sains Malaysia, 11800, Minden, Penang, Malaysia.
| | - Yee Siew Choong
- Institute for Research in Molecular Medicine, Universiti Sains Malaysia, 11800, Minden, Penang, Malaysia.
| |
Collapse
|
43
|
DasGupta D, Kaushik R, Jayaram B. From Ramachandran Maps to Tertiary Structures of Proteins. J Phys Chem B 2015; 119:11136-45. [DOI: 10.1021/acs.jpcb.5b02999] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Debarati DasGupta
- Department of Chemistry, ‡Supercomputing Facility for Bioinformatics & Computational Biology, and §Kusuma School of Biological Sciences, Indian Institute of Technology, Hauz Khas, New Delhi-110016, India
| | - Rahul Kaushik
- Department of Chemistry, ‡Supercomputing Facility for Bioinformatics & Computational Biology, and §Kusuma School of Biological Sciences, Indian Institute of Technology, Hauz Khas, New Delhi-110016, India
| | - B. Jayaram
- Department of Chemistry, ‡Supercomputing Facility for Bioinformatics & Computational Biology, and §Kusuma School of Biological Sciences, Indian Institute of Technology, Hauz Khas, New Delhi-110016, India
| |
Collapse
|
44
|
Hierarchical Conformational Analysis of Native Lysozyme Based on Sub-Millisecond Molecular Dynamics Simulations. PLoS One 2015; 10:e0129846. [PMID: 26057625 PMCID: PMC4461368 DOI: 10.1371/journal.pone.0129846] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 05/12/2015] [Indexed: 11/19/2022] Open
Abstract
Hierarchical organization of free energy landscape (FEL) for native globular proteins has been widely accepted by the biophysics community. However, FEL of native proteins is usually projected onto one or a few dimensions. Here we generated collectively 0.2 milli-second molecular dynamics simulation trajectories in explicit solvent for hen egg white lysozyme (HEWL), and carried out detailed conformational analysis based on backbone torsional degrees of freedom (DOF). Our results demonstrated that at micro-second and coarser temporal resolutions, FEL of HEWL exhibits hub-like topology with crystal structures occupying the dominant structural ensemble that serves as the hub of conformational transitions. However, at 100ns and finer temporal resolutions, conformational substates of HEWL exhibit network-like topology, crystal structures are associated with kinetic traps that are important but not dominant ensembles. Backbone torsional state transitions on time scales ranging from nanoseconds to beyond microseconds were found to be associated with various types of molecular interactions. Even at nanoseconds temporal resolution, the number of conformational substates that are of statistical significance is quite limited. These observations suggest that detailed analysis of conformational substates at multiple temporal resolutions is both important and feasible. Transition state ensembles among various conformational substates at microsecond temporal resolution were observed to be considerably disordered. Life times of these transition state ensembles are found to be nearly independent of the time scales of the participating torsional DOFs.
Collapse
|
45
|
Jayaram B, Dhingra P, Mishra A, Kaushik R, Mukherjee G, Singh A, Shekhar S. Bhageerath-H: a homology/ab initio hybrid server for predicting tertiary structures of monomeric soluble proteins. BMC Bioinformatics 2014; 15 Suppl 16:S7. [PMID: 25521245 PMCID: PMC4290660 DOI: 10.1186/1471-2105-15-s16-s7] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The advent of human genome sequencing project has led to a spurt in the number of protein sequences in the databanks. Success of structure based drug discovery severely hinges on the availability of structures. Despite significant progresses in the area of experimental protein structure determination, the sequence-structure gap is continually widening. Data driven homology based computational methods have proved successful in predicting tertiary structures for sequences sharing medium to high sequence similarities. With dwindling similarities of query sequences, advanced homology/ ab initio hybrid approaches are being explored to solve structure prediction problem. Here we describe Bhageerath-H, a homology/ ab initio hybrid software/server for predicting protein tertiary structures with advancing drug design attempts as one of the goals. RESULTS Bhageerath-H web-server was validated on 75 CASP10 targets which showed TM-scores ≥ 0.5 in 91% of the cases and Cα RMSDs ≤ 5 Å from the native in 58% of the targets, which is well above the CASP10 water mark. Comparison with some leading servers demonstrated the uniqueness of the hybrid methodology in effectively sampling conformational space, scoring best decoys and refining low resolution models to high and medium resolution. CONCLUSION Bhageerath-H methodology is web enabled for the scientific community as a freely accessible web server. The methodology is fielded in the on-going CASP11 experiment.
Collapse
|
46
|
Three-dimensional protein structure prediction: Methods and computational strategies. Comput Biol Chem 2014; 53PB:251-276. [DOI: 10.1016/j.compbiolchem.2014.10.001] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 10/03/2014] [Accepted: 10/07/2014] [Indexed: 01/01/2023]
|
47
|
Minami S, Sawada K, Chikenji G. How a spatial arrangement of secondary structure elements is dispersed in the universe of protein folds. PLoS One 2014; 9:e107959. [PMID: 25243952 PMCID: PMC4171485 DOI: 10.1371/journal.pone.0107959] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2014] [Accepted: 08/18/2014] [Indexed: 11/18/2022] Open
Abstract
It has been known that topologically different proteins of the same class sometimes share the same spatial arrangement of secondary structure elements (SSEs). However, the frequency by which topologically different structures share the same spatial arrangement of SSEs is unclear. It is important to estimate this frequency because it provides both a deeper understanding of the geometry of protein folds and a valuable suggestion for predicting protein structures with novel folds. Here we clarified the frequency with which protein folds share the same SSE packing arrangement with other folds, the types of spatial arrangement of SSEs that are frequently observed across different folds, and the diversity of protein folds that share the same spatial arrangement of SSEs with a given fold, using a protein structure alignment program MICAN, which we have been developing. By performing comprehensive structural comparison of SCOP fold representatives, we found that approximately 80% of protein folds share the same spatial arrangement of SSEs with other folds. We also observed that many protein pairs that share the same spatial arrangement of SSEs belong to the different classes, often with an opposing N- to C-terminal direction of the polypeptide chain. The most frequently observed spatial arrangement of SSEs was the 2-layer α/β packing arrangement and it was dispersed among as many as 27% of SCOP fold representatives. These results suggest that the same spatial arrangements of SSEs are adopted by a wide variety of different folds and that the spatial arrangement of SSEs is highly robust against the N- to C-terminal direction of the polypeptide chain.
Collapse
Affiliation(s)
- Shintaro Minami
- Department of Complex Systems Science, Nagoya University, Nagoya, Aichi, Japan
| | - Kengo Sawada
- Department of Applied Physics, Nagoya University, Nagoya, Aichi, Japan
| | - George Chikenji
- Department of Computational Science and Engineering, Nagoya University, Nagoya, Aichi, Japan
- * E-mail:
| |
Collapse
|
48
|
Dalton JAR, Gómez-Santacana X, Llebaria A, Giraldo J. Computational analysis of negative and positive allosteric modulator binding and function in metabotropic glutamate receptor 5 (in)activation. J Chem Inf Model 2014; 54:1476-87. [PMID: 24793143 DOI: 10.1021/ci500127c] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Metabotropic glutamate receptors (mGluRs) are high-profile G-protein coupled receptors drug targets because of their involvement in several neurological disease states, and mGluR5 in particular is a subtype whose controlled allosteric modulation, both positive and negative, can potentially be useful for the treatment of schizophrenia and relief of chronic pain, respectively. Here we model mGluR5 with a collection of positive and negative allosteric modulators (PAMs and NAMs) in both active and inactive receptor states, in a manner that is consistent with experimental information, using a specialized protocol that includes homology to increase docking accuracy, and receptor relaxation to generate an individual induced fit with each allosteric modulator. Results implicate two residues in particular for NAM and PAM function: NAM interaction with W785 for receptor inactivation, and NAM/PAM H-bonding with S809 for receptor (in)activation. Models suggest the orientation of the H-bond between allosteric modulator and S809, controlled by PAM/NAM chemistry, influences the position of TM7, which in turn influences the shape of the allosteric site, and potentially the receptor state. NAM-bound and PAM-bound mGluR5 models also reveal that although PAMs and NAMs bind in the same pocket and share similar binding modes, they have distinct effects on the conformation of the receptor. Our models, together with the identification of a possible activation mechanism, may be useful in the rational design of new allosteric modulators for mGluR5.
Collapse
Affiliation(s)
- James A R Dalton
- Laboratory of Molecular Neuropharmacology and Bioinformatics, Institut de Neurociències and Unitat de Bioestadística, Universitat Autònoma de Barcelona , 08193 Bellaterra, Barcelona, Spain
| | | | | | | |
Collapse
|
49
|
A Parallel Framework for Multipoint Spiral Search in ab Initio Protein Structure Prediction. Adv Bioinformatics 2014; 2014:985968. [PMID: 24744779 PMCID: PMC3976798 DOI: 10.1155/2014/985968] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Revised: 02/04/2014] [Accepted: 02/06/2014] [Indexed: 11/17/2022] Open
Abstract
Protein structure prediction is computationally a very challenging problem. A large number of existing search algorithms attempt to solve the problem by exploring possible structures and finding the one with the minimum free energy. However, these algorithms perform poorly on large sized proteins due to an astronomically wide search space. In this paper, we present a multipoint spiral search framework that uses parallel processing techniques to expedite exploration by starting from different points. In our approach, a set of random initial solutions are generated and distributed to different threads. We allow each thread to run for a predefined period of time. The improved solutions are stored threadwise. When the threads finish, the solutions are merged together and the duplicates are removed. A selected distinct set of solutions are then split to different threads again. In our ab initio protein structure prediction method, we use the three-dimensional face-centred-cubic lattice for structure-backbone mapping. We use both the low resolution hydrophobic-polar energy model and the high-resolution 20 × 20 energy model for search guiding. The experimental results show that our new parallel framework significantly improves the results obtained by the state-of-the-art single-point search approaches for both energy models on three-dimensional face-centred-cubic lattice. We also experimentally show the effectiveness of mixing energy models within parallel threads.
Collapse
|
50
|
Siddiqi AR, Nioche P, Siddiqui AB, Rauf SA, Waseem A, Villoutreix BO. EFFICIENCY OF A HIERARCHICAL DOCKING PROTOCOL FOR COMPUTATIONAL LIGAND SCREENING AGAINST HOMOLOGY MODELS. BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS 2014. [DOI: 10.4015/s1016237214500240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We assessed the efficiency of a hierarchical docking protocol against homology models in virtual ligand screening (VLS) studies. A low resolution model of factor X (FX) was built on a template of Trypsin molecule (PDB ID:1EB2). Afterward VLS was performed involving a hierarchical protocol, rigid body followed by flexible docking, both against model as well as an X-ray structure of FX (PDB ID:1FJS) using a smart library of 50,000 chemical compounds seeded with 9 known inhibitors of FX. The percentage enrichments of screened chemical compounds obtained both from the crystal structure and homology model of FX were compared to analyze the efficiency of the protocol. In the first 5% of the finally ranked database of the screened compounds, both against model and the X-ray structure, 67% of the inhibitors were retrieved.
Collapse
Affiliation(s)
- Abdul Rauf Siddiqi
- INSERM UMR-S 747, Pharmacologie, Toxicologie et Signalisation Cellulaire, Université Paris Descartes 75006 Paris, France
| | - Pierre Nioche
- INSERM UMR-S 747, Pharmacologie, Toxicologie et Signalisation Cellulaire, Université Paris Descartes 75006 Paris, France
| | | | - Sadaf Abdul Rauf
- Department of Software Engineering, Fatima Jinnah Women University, Rawalpindi, Pakistan
| | - Amir Waseem
- Department of Chemistry, Quaid-e-Azam University, Islamabad, Pakistan
| | | |
Collapse
|