1
|
A Peptides Prediction Methodology for Tertiary Structure Based on Simulated Annealing. MATHEMATICAL AND COMPUTATIONAL APPLICATIONS 2021. [DOI: 10.3390/mca26020039] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The Protein Folding Problem (PFP) is a big challenge that has remained unsolved for more than fifty years. This problem consists of obtaining the tertiary structure or Native Structure (NS) of a protein knowing its amino acid sequence. The computational methodologies applied to this problem are classified into two groups, known as Template-Based Modeling (TBM) and ab initio models. In the latter methodology, only information from the primary structure of the target protein is used. In the literature, Hybrid Simulated Annealing (HSA) algorithms are among the best ab initio algorithms for PFP; Golden Ratio Simulated Annealing (GRSA) is a PFP family of these algorithms designed for peptides. Moreover, for the algorithms designed with TBM, they use information from a target protein’s primary structure and information from similar or analog proteins. This paper presents GRSA-SSP methodology that implements a secondary structure prediction to build an initial model and refine it with HSA algorithms. Additionally, we compare the performance of the GRSAX-SSP algorithms versus its corresponding GRSAX. Finally, our best algorithm GRSAX-SSP is compared with PEP-FOLD3, I-TASSER, QUARK, and Rosetta, showing that it competes in small peptides except when predicting the largest peptides.
Collapse
|
2
|
Coban MA, Blackburn PR, Whitelaw ML, van Haelst MM, Atwal PS, Caulfield TR. Structural Models for the Dynamic Effects of Loss-of-Function Variants in the Human SIM1 Protein Transcriptional Activation Domain. Biomolecules 2020; 10:biom10091314. [PMID: 32932609 PMCID: PMC7563489 DOI: 10.3390/biom10091314] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 08/21/2020] [Accepted: 09/08/2020] [Indexed: 02/02/2023] Open
Abstract
Single-minded homologue 1 (SIM1) is a transcription factor with numerous different physiological and developmental functions. SIM1 is a member of the class I basic helix-loop-helix-PER-ARNT-SIM (bHLH-PAS) transcription factor family, that includes several other conserved proteins, including the hypoxia-inducible factors, aryl hydrocarbon receptor, neuronal PAS proteins, and the CLOCK circadian regulator. Recent studies of HIF-a-ARNT and CLOCK-BMAL1 protein complexes have revealed the organization of their bHLH, PASA, and PASB domains and provided insight into how these heterodimeric protein complexes form; however, experimental structures for SIM1 have been lacking. Here, we describe the first full-length atomic structural model for human SIM1 with its binding partner ARNT in a heterodimeric complex and analyze several pathogenic variants utilizing state-of-the-art simulations and algorithms. Using local and global positional deviation metrics, deductions to the structural basis for the individual mutants are addressed in terms of the deleterious structural reorganizations that could alter protein function. We propose new experiments to probe these hypotheses and examine an interesting SIM1 dynamic behavior. The conformational dynamics demonstrates conformational changes on local and global regions that represent a mechanism for dysfunction in variants presented. In addition, we used our ab initio hybrid model for further prediction of variant hotspots that can be engineered to test for counter variant (restoration of wild-type function) or basic research probe.
Collapse
Affiliation(s)
- Mathew A. Coban
- Department of Cancer Biology, Mayo Clinic, Jacksonville, FL 32224, USA;
| | - Patrick R. Blackburn
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55905, USA;
| | - Murray L. Whitelaw
- Department of Molecular and Cellular Biology, University of Adelaide, Adelaide SA 5000, Australia;
| | - Mieke M. van Haelst
- Department of Clinical Genetics, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands;
- Department of Clinical Genetics, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Paldeep S. Atwal
- Center for Individualized Medicine, Mayo Clinic, Jacksonville, FL 32224, USA;
- Atwal Clinic, Jacksonville, FL 32224, USA
| | - Thomas R. Caulfield
- Department of Cancer Biology, Mayo Clinic, Jacksonville, FL 32224, USA;
- Center for Individualized Medicine, Mayo Clinic, Jacksonville, FL 32224, USA;
- Department of Clinical Genomics, Mayo Clinic, Rochester, MN 55905, USA, MN, USA
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
- Correspondence: ; Tel.: +1-904-953-6072; Fax: +1-904-953-7370
| |
Collapse
|
3
|
Runthala A, Chowdhury S. Refined template selection and combination algorithm significantly improves template-based modeling accuracy. J Bioinform Comput Biol 2020; 17:1950006. [PMID: 31057073 DOI: 10.1142/s0219720019500069] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
In contrast to ab-initio protein modeling methodologies, comparative modeling is considered as the most popular and reliable algorithm to model protein structure. However, the selection of the best set of templates is still a major challenge. An effective template-ranking algorithm is developed to efficiently select only the reliable hits for predicting the protein structures. The algorithm employs the pairwise as well as multiple sequence alignments of template hits to rank and select the best possible set of templates. It captures several key sequences and structural information of template hits and converts into scores to effectively rank them. This selected set of templates is used to model a target. Modeling accuracy of the algorithm is tested and evaluated on TBM-HA domain containing CASP8, CASP9 and CASP10 targets. On an average, this template ranking and selection algorithm improves GDT-TS, GDT-HA and TM_Score by 3.531, 4.814 and 0.022, respectively. Further, it has been shown that the inclusion of structurally similar templates with ample conformational diversity is crucial for the modeling algorithm to maximally as well as reliably span the target sequence and construct its near-native model. The optimal model sampling also holds the key to predict the best possible target structure.
Collapse
Affiliation(s)
- Ashish Runthala
- 1 Department of Biological Sciences, Birla Institute of Technology and Science, Pilani-333031, India
| | - Shibasish Chowdhury
- 1 Department of Biological Sciences, Birla Institute of Technology and Science, Pilani-333031, India
| |
Collapse
|
4
|
Choudhary P, Kumar S, Bachhawat AK, Pandit SB. CSmetaPred: a consensus method for prediction of catalytic residues. BMC Bioinformatics 2017; 18:583. [PMID: 29273005 PMCID: PMC5741869 DOI: 10.1186/s12859-017-1987-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 12/05/2017] [Indexed: 01/27/2023] Open
Abstract
Background Knowledge of catalytic residues can play an essential role in elucidating mechanistic details of an enzyme. However, experimental identification of catalytic residues is a tedious and time-consuming task, which can be expedited by computational predictions. Despite significant development in active-site prediction methods, one of the remaining issues is ranked positions of putative catalytic residues among all ranked residues. In order to improve ranking of catalytic residues and their prediction accuracy, we have developed a meta-approach based method CSmetaPred. In this approach, residues are ranked based on the mean of normalized residue scores derived from four well-known catalytic residue predictors. The mean residue score of CSmetaPred is combined with predicted pocket information to improve prediction performance in meta-predictor, CSmetaPred_poc. Results Both meta-predictors are evaluated on two comprehensive benchmark datasets and three legacy datasets using Receiver Operating Characteristic (ROC) and Precision Recall (PR) curves. The visual and quantitative analysis of ROC and PR curves shows that meta-predictors outperform their constituent methods and CSmetaPred_poc is the best of evaluated methods. For instance, on CSAMAC dataset CSmetaPred_poc (CSmetaPred) achieves highest Mean Average Specificity (MAS), a scalar measure for ROC curve, of 0.97 (0.96). Importantly, median predicted rank of catalytic residues is the lowest (best) for CSmetaPred_poc. Considering residues ranked ≤20 classified as true positive in binary classification, CSmetaPred_poc achieves prediction accuracy of 0.94 on CSAMAC dataset. Moreover, on the same dataset CSmetaPred_poc predicts all catalytic residues within top 20 ranks for ~73% of enzymes. Furthermore, benchmarking of prediction on comparative modelled structures showed that models result in better prediction than only sequence based predictions. These analyses suggest that CSmetaPred_poc is able to rank putative catalytic residues at lower (better) ranked positions, which can facilitate and expedite their experimental characterization. Conclusions The benchmarking studies showed that employing meta-approach in combining residue-level scores derived from well-known catalytic residue predictors can improve prediction accuracy as well as provide improved ranked positions of known catalytic residues. Hence, such predictions can assist experimentalist to prioritize residues for mutational studies in their efforts to characterize catalytic residues. Both meta-predictors are available as webserver at: http://14.139.227.206/csmetapred/. Electronic supplementary material The online version of this article (10.1186/s12859-017-1987-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Preeti Choudhary
- Department of Biological Sciences, Indian Institute of Science Education and Research, Mohali, Knowledge City, Sector 81, SAS Nagar, Manuali PO 140306, India
| | - Shailesh Kumar
- Department of Biological Sciences, Indian Institute of Science Education and Research, Mohali, Knowledge City, Sector 81, SAS Nagar, Manuali PO 140306, India.,Laboratory of Biochemistry and Genetics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Anand Kumar Bachhawat
- Department of Biological Sciences, Indian Institute of Science Education and Research, Mohali, Knowledge City, Sector 81, SAS Nagar, Manuali PO 140306, India
| | - Shashi Bhushan Pandit
- Department of Biological Sciences, Indian Institute of Science Education and Research, Mohali, Knowledge City, Sector 81, SAS Nagar, Manuali PO 140306, India.
| |
Collapse
|
5
|
Gupta P, Dash PK. Molecular details of secretory phospholipase A 2 from flax (Linum usitatissimum L.) provide insight into its structure and function. Sci Rep 2017; 7:11080. [PMID: 28894144 DOI: 10.1038/s41598-017-109699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 08/17/2017] [Indexed: 05/29/2023] Open
Abstract
Secretory phospholipase A2 (sPLA2) are low molecular weight proteins (12-18 kDa) involved in a suite of plant cellular processes imparting growth and development. With myriad roles in physiological and biochemical processes in plants, detailed analysis of sPLA2 in flax/linseed is meagre. The present work, first in flax, embodies cloning, expression, purification and molecular characterisation of two distinct sPLA2s (I and II) from flax. PLA2 activity of the cloned sPLA2s were biochemically assayed authenticating them as bona fide phospholipase A2. Physiochemical properties of both the sPLA2s revealed they are thermostable proteins requiring di-valent cations for optimum activity.While, structural analysis of both the proteins revealed deviations in the amino acid sequence at C- & N-terminal regions; hydropathic study revealed LusPLA2I as a hydrophobic protein and LusPLA2II as a hydrophilic protein. Structural analysis of flax sPLA2s revealed that secondary structure of both the proteins are dominated by α-helix followed by random coils. Modular superimposition of LusPLA2 isoforms with rice sPLA2 confirmed monomeric structural preservation among plant phospholipase A2 and provided insight into structure of folded flax sPLA2s.
Collapse
Affiliation(s)
- Payal Gupta
- ICAR-National Research Centre on Plant Biotechnology, Pusa Campus, New Delhi, 110012, India.
- Department of Biotechnology, Kurukshetra University, Thanesar, 136119, India.
| | - Prasanta K Dash
- ICAR-National Research Centre on Plant Biotechnology, Pusa Campus, New Delhi, 110012, India.
| |
Collapse
|
6
|
Gupta P, Dash PK. Molecular details of secretory phospholipase A 2 from flax (Linum usitatissimum L.) provide insight into its structure and function. Sci Rep 2017; 7:11080. [PMID: 28894144 PMCID: PMC5593939 DOI: 10.1038/s41598-017-10969-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 08/17/2017] [Indexed: 01/19/2023] Open
Abstract
Secretory phospholipase A2 (sPLA2) are low molecular weight proteins (12-18 kDa) involved in a suite of plant cellular processes imparting growth and development. With myriad roles in physiological and biochemical processes in plants, detailed analysis of sPLA2 in flax/linseed is meagre. The present work, first in flax, embodies cloning, expression, purification and molecular characterisation of two distinct sPLA2s (I and II) from flax. PLA2 activity of the cloned sPLA2s were biochemically assayed authenticating them as bona fide phospholipase A2. Physiochemical properties of both the sPLA2s revealed they are thermostable proteins requiring di-valent cations for optimum activity.While, structural analysis of both the proteins revealed deviations in the amino acid sequence at C- & N-terminal regions; hydropathic study revealed LusPLA2I as a hydrophobic protein and LusPLA2II as a hydrophilic protein. Structural analysis of flax sPLA2s revealed that secondary structure of both the proteins are dominated by α-helix followed by random coils. Modular superimposition of LusPLA2 isoforms with rice sPLA2 confirmed monomeric structural preservation among plant phospholipase A2 and provided insight into structure of folded flax sPLA2s.
Collapse
Affiliation(s)
- Payal Gupta
- ICAR-National Research Centre on Plant Biotechnology, Pusa Campus, New Delhi, 110012, India.
- Department of Biotechnology, Kurukshetra University, Thanesar, 136119, India.
| | - Prasanta K Dash
- ICAR-National Research Centre on Plant Biotechnology, Pusa Campus, New Delhi, 110012, India.
| |
Collapse
|
7
|
Ando M, Fiesel FC, Hudec R, Caulfield TR, Ogaki K, Górka-Skoczylas P, Koziorowski D, Friedman A, Chen L, Dawson VL, Dawson TM, Bu G, Ross OA, Wszolek ZK, Springer W. The PINK1 p.I368N mutation affects protein stability and ubiquitin kinase activity. Mol Neurodegener 2017; 12:32. [PMID: 28438176 PMCID: PMC5404317 DOI: 10.1186/s13024-017-0174-z] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Accepted: 04/14/2017] [Indexed: 01/24/2023] Open
Abstract
Background Mutations in PINK1 and PARKIN are the most common causes of recessive early-onset Parkinson’s disease (EOPD). Together, the mitochondrial ubiquitin (Ub) kinase PINK1 and the cytosolic E3 Ub ligase PARKIN direct a complex regulated, sequential mitochondrial quality control. Thereby, damaged mitochondria are identified and targeted to degradation in order to prevent their accumulation and eventually cell death. Homozygous or compound heterozygous loss of either gene function disrupts this protective pathway, though at different steps and by distinct mechanisms. While structure and function of PARKIN variants have been well studied, PINK1 mutations remain poorly characterized, in particular under endogenous conditions. A better understanding of the exact molecular pathogenic mechanisms underlying the pathogenicity is crucial for rational drug design in the future. Methods Here, we characterized the pathogenicity of the PINK1 p.I368N mutation on the clinical and genetic as well as on the structural and functional level in patients’ fibroblasts and in cell-based, biochemical assays. Results Under endogenous conditions, PINK1 p.I368N is expressed, imported, and N-terminally processed in healthy mitochondria similar to PINK1 wild type (WT). Upon mitochondrial damage, however, full-length PINK1 p.I368N is not sufficiently stabilized on the outer mitochondrial membrane (OMM) resulting in loss of mitochondrial quality control. We found that binding of PINK1 p.I368N to the co-chaperone complex HSP90/CDC37 is reduced and stress-induced interaction with TOM40 of the mitochondrial protein import machinery is abolished. Analysis of a structural PINK1 p.I368N model additionally suggested impairments of Ub kinase activity as the ATP-binding pocket was found deformed and the substrate Ub was slightly misaligned within the active site of the kinase. Functional assays confirmed the lack of Ub kinase activity. Conclusions Here we demonstrated that mutant PINK1 p.I368N can not be stabilized on the OMM upon mitochondrial stress and due to conformational changes in the active site does not exert kinase activity towards Ub. In patients’ fibroblasts, biochemical assays and by structural analyses, we unraveled two pathomechanisms that lead to loss of function upon mutation of p.I368N and highlight potential strategies for future drug development. Electronic supplementary material The online version of this article (doi:10.1186/s13024-017-0174-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maya Ando
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA
| | - Fabienne C Fiesel
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA.,Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, 32224, USA
| | - Roman Hudec
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA
| | - Thomas R Caulfield
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA.,Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, 32224, USA
| | - Kotaro Ogaki
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA
| | - Paulina Górka-Skoczylas
- Department of Medical Genetics, Institute of Mother and Child, Warsaw, Poland.,Institute of Genetics and Biotechnology, Faculty of Biology, Warsaw University, Warsaw, Poland
| | - Dariusz Koziorowski
- Department of Neurology, Faculty of Health Science, Medical University of Warsaw, Warsaw, Poland
| | - Andrzej Friedman
- Department of Neurology, Faculty of Health Science, Medical University of Warsaw, Warsaw, Poland
| | - Li Chen
- Neuroregeneration and Stem Cell Programs, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Solomon H. Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Adrienne Helis Malvin Medical Research Foundation, New Orleans, LA, 70130-2685, USA
| | - Valina L Dawson
- Neuroregeneration and Stem Cell Programs, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Solomon H. Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Adrienne Helis Malvin Medical Research Foundation, New Orleans, LA, 70130-2685, USA.,Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Ted M Dawson
- Neuroregeneration and Stem Cell Programs, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Solomon H. Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Adrienne Helis Malvin Medical Research Foundation, New Orleans, LA, 70130-2685, USA.,Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA.,Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Guojun Bu
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA.,Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, 32224, USA
| | - Owen A Ross
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA.,Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, 32224, USA
| | | | - Wolfdieter Springer
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA. .,Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, 32224, USA.
| |
Collapse
|
8
|
Kumar S, Plotnikov NV, Rouse JC, Singh SK. Biopharmaceutical Informatics: supporting biologic drug development via molecular modelling and informatics. J Pharm Pharmacol 2017; 70:595-608. [DOI: 10.1111/jphp.12700] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 12/29/2016] [Indexed: 12/23/2022]
Abstract
Abstract
Objectives
The purpose of this article is to introduce an emerging field called ‘Biopharmaceutical Informatics’. It describes how tools from Information technology and Molecular Biophysics can be adapted, developed and gainfully employed in discovery and development of biologic drugs.
Key Findings
The findings described here are based on literature surveys and the authors’ collective experiences in the field of biologic drug product development. A strategic framework to forecast early the hurdles faced during drug product development is weaved together and elucidated using chemical degradation as an example. Efficiency of translating biologic drug discoveries into drug products can be significantly improved by combining learnings from experimental biophysical and analytical data on the drug candidates with molecular properties computed from their sequences and structures via molecular modeling and simulations.
Summary
Biopharmaceutical Informatics seeks to promote applications of computational tools towards discovery and development of biologic drugs. When fully implemented, industry-wide, it will enable rapid materials-free developability assessments of biologic drug candidates at early stages as well as streamline drug product development activities such as commercial scale production, purification, formulation, analytical characterization, safety and in vivo performance.
Collapse
Affiliation(s)
- Sandeep Kumar
- Pharmaceutical Research and Development, Biotherapeutics Pharmaceutical Sciences, Pfizer Inc., Chesterfield, MO, USA
| | - Nikolay V Plotnikov
- Pharmaceutical Research and Development, Biotherapeutics Pharmaceutical Sciences, Pfizer Inc., Chesterfield, MO, USA
| | - Jason C Rouse
- Analytical Research and Development, Biotherapeutics Pharmaceutical Sciences, Pfizer Inc., Andover, MA, USA
| | - Satish K Singh
- Pharmaceutical Research and Development, Biotherapeutics Pharmaceutical Sciences, Pfizer Inc., Chesterfield, MO, USA
| |
Collapse
|
9
|
Puschmann A, Fiesel FC, Caulfield TR, Hudec R, Ando M, Truban D, Hou X, Ogaki K, Heckman MG, James ED, Swanberg M, Jimenez-Ferrer I, Hansson O, Opala G, Siuda J, Boczarska-Jedynak M, Friedman A, Koziorowski D, Rudzińska-Bar M, Aasly JO, Lynch T, Mellick GD, Mohan M, Silburn PA, Sanotsky Y, Vilariño-Güell C, Farrer MJ, Chen L, Dawson VL, Dawson TM, Wszolek ZK, Ross OA, Springer W. Heterozygous PINK1 p.G411S increases risk of Parkinson's disease via a dominant-negative mechanism. Brain 2016; 140:98-117. [PMID: 27807026 PMCID: PMC5379862 DOI: 10.1093/brain/aww261] [Citation(s) in RCA: 103] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2016] [Revised: 08/31/2016] [Accepted: 09/02/2016] [Indexed: 01/31/2023] Open
Abstract
See Gandhi and Plun-Favreau (doi:10.1093/aww320) for a scientific commentary on this article. Heterozygous mutations in recessive Parkinson’s disease genes have been postulated to increase disease risk. Puschmann et al. report a genetic association between heterozygous PINK1 p.G411S and Parkinson’s disease. They provide structural and functional explanations for a partial dominant-negative effect of the mutant protein, which impairs wild-type PINK1 activity through hetero-dimerization. See Gandhi and Plun-Favreau (doi:10.1093/aww320) for a scientific commentary on this article. It has been postulated that heterozygous mutations in recessive Parkinson’s genes may increase the risk of developing the disease. In particular, the PTEN-induced putative kinase 1 (PINK1) p.G411S (c.1231G>A, rs45478900) mutation has been reported in families with dominant inheritance patterns of Parkinson’s disease, suggesting that it might confer a sizeable disease risk when present on only one allele. We examined families with PINK1 p.G411S and conducted a genetic association study with 2560 patients with Parkinson’s disease and 2145 control subjects. Heterozygous PINK1 p.G411S mutations markedly increased Parkinson’s disease risk (odds ratio = 2.92, P = 0.032); significance remained when supplementing with results from previous studies on 4437 additional subjects (odds ratio = 2.89, P = 0.027). We analysed primary human skin fibroblasts and induced neurons from heterozygous PINK1 p.G411S carriers compared to PINK1 p.Q456X heterozygotes and PINK1 wild-type controls under endogenous conditions. While cells from PINK1 p.Q456X heterozygotes showed reduced levels of PINK1 protein and decreased initial kinase activity upon mitochondrial damage, stress-response was largely unaffected over time, as expected for a recessive loss-of-function mutation. By contrast, PINK1 p.G411S heterozygotes showed no decrease of PINK1 protein levels but a sustained, significant reduction in kinase activity. Molecular modelling and dynamics simulations as well as multiple functional assays revealed that the p.G411S mutation interferes with ubiquitin phosphorylation by wild-type PINK1 in a heterodimeric complex. This impairs the protective functions of the PINK1/parkin-mediated mitochondrial quality control. Based on genetic and clinical evaluation as well as functional and structural characterization, we established p.G411S as a rare genetic risk factor with a relatively large effect size conferred by a partial dominant-negative function phenotype.
Collapse
Affiliation(s)
- Andreas Puschmann
- 1 Lund University, Department of Clinical Sciences Lund, Neurology, Sweden .,2 Department of Neurology, Skåne University Hospital, Sweden.,3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Fabienne C Fiesel
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | | | - Roman Hudec
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Maya Ando
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Dominika Truban
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Xu Hou
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Kotaro Ogaki
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Michael G Heckman
- 4 Division of Biomedical Statistics and Informatics, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Elle D James
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Maria Swanberg
- 5 Lund University, Department of Experimental Medical Science, Lund, Sweden
| | | | - Oskar Hansson
- 6 Clinical Memory Research Unit, Department of Clinical Sciences Malmö, Lund University, Sweden.,7 Memory Clinic, Skåne University Hospital, Malmö, Sweden
| | - Grzegorz Opala
- 8 Department of Neurology, School of Medicine in Katowice, Medical University of Silesia, Katowice, Poland
| | - Joanna Siuda
- 8 Department of Neurology, School of Medicine in Katowice, Medical University of Silesia, Katowice, Poland
| | | | | | | | | | - Jan O Aasly
- 10 Department of Neurology, St. Olav's Hospital, and Department of Neuroscience, Norwegian University of Science and Technology, Trondheim, Norway
| | - Timothy Lynch
- 11 Dublin Neurological Institute at the Mater Misericordiae University Hospital, Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland
| | - George D Mellick
- 12 Eskitis Institute for Drug Discovery, Griffith University, Nathan, Queensland, Australia
| | - Megha Mohan
- 12 Eskitis Institute for Drug Discovery, Griffith University, Nathan, Queensland, Australia
| | - Peter A Silburn
- 12 Eskitis Institute for Drug Discovery, Griffith University, Nathan, Queensland, Australia.,13 University of Queensland, Asia-Pacific Centre for Neuromodulation, Centre for Clinical Research, Brisbane, Queensland, Australia
| | | | - Carles Vilariño-Güell
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA.,15 Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Matthew J Farrer
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA.,15 Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Li Chen
- 16 Neuroregeneration and Stem Cell Programs, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,17 Solomon H Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,18 Adrienne Helis Malvin Medical Research Foundation, New Orleans, LA 70130-2685, USA
| | - Valina L Dawson
- 16 Neuroregeneration and Stem Cell Programs, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,17 Solomon H Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,18 Adrienne Helis Malvin Medical Research Foundation, New Orleans, LA 70130-2685, USA.,19 Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,20 Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Ted M Dawson
- 16 Neuroregeneration and Stem Cell Programs, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,17 Solomon H Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,18 Adrienne Helis Malvin Medical Research Foundation, New Orleans, LA 70130-2685, USA.,19 Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.,21 Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | | | - Owen A Ross
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA.,23 School of Medicine and Medical Science, University College Dublin, Dublin, Ireland.,24 Mayo Graduate School, Neurobiology of Disease, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Wolfdieter Springer
- 3 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA .,24 Mayo Graduate School, Neurobiology of Disease, Mayo Clinic, Jacksonville, FL 32224, USA
| |
Collapse
|
10
|
Raval A, Piana S, Eastwood MP, Shaw DE. Assessment of the utility of contact-based restraints in accelerating the prediction of protein structure using molecular dynamics simulations. Protein Sci 2015; 25:19-29. [PMID: 26266489 PMCID: PMC4815320 DOI: 10.1002/pro.2770] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Revised: 08/07/2015] [Accepted: 08/11/2015] [Indexed: 12/15/2022]
Abstract
Molecular dynamics (MD) simulation is a well-established tool for the computational study of protein structure and dynamics, but its application to the important problem of protein structure prediction remains challenging, in part because extremely long timescales can be required to reach the native structure. Here, we examine the extent to which the use of low-resolution information in the form of residue-residue contacts, which can often be inferred from bioinformatics or experimental studies, can accelerate the determination of protein structure in simulation. We incorporated sets of 62, 31, or 15 contact-based restraints in MD simulations of ubiquitin, a benchmark system known to fold to the native state on the millisecond timescale in unrestrained simulations. One-third of the restrained simulations folded to the native state within a few tens of microseconds-a speedup of over an order of magnitude compared with unrestrained simulations and a demonstration of the potential for limited amounts of structural information to accelerate structure determination. Almost all of the remaining ubiquitin simulations reached near-native conformations within a few tens of microseconds, but remained trapped there, apparently due to the restraints. We discuss potential methodological improvements that would facilitate escape from these near-native traps and allow more simulations to quickly reach the native state. Finally, using a target from the Critical Assessment of protein Structure Prediction (CASP) experiment, we show that distance restraints can improve simulation accuracy: In our simulations, restraints stabilized the native state of the protein, enabling a reasonable structural model to be inferred.
Collapse
Affiliation(s)
- Alpan Raval
- D. E. Shaw Research, New York, New York, 10036
| | | | | | - David E Shaw
- D. E. Shaw Research, New York, New York, 10036.,Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, 10032
| |
Collapse
|
11
|
Sánchez-Guerrero E, Hernández-Campos ME, Correa-Basurto J, López-Sánchez P, Tolentino-López LE. Three-dimensional structure and molecular dynamics studies of prorrenin/renin receptor: description of the active site. MOLECULAR BIOSYSTEMS 2015; 11:2520-8. [PMID: 26177886 DOI: 10.1039/c5mb00342c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The recent finding of a specific receptor for prorrenin/renin (PRR) has brought new insights into the physiology of the renin-angiotensin-aldosterone system. No undoubtable role has been described for this receptor so far. Its role seems to be important in chronic illnesses such as hypertension, possibly participating in the cardiovascular remodeling process, and diabetes where participation in inflammation development has been described. It is not possible, however, to explore the PRR function using classical pharmacological approaches due to the lack of specific agonists or antagonists. Two synthetic peptides have been described to accomplish these roles, but no conclusive data have been provided. There are no X-ray crystallography studies available to describe the structure and potential sites for drug development. So, the aim of this work was to model and theoretically describe the PRR. We describe and characterize the whole receptor protein, its spatial conformation and the potential interactions of PRR with the synthetic peptides available, describing the amino acid residues responsible for these interactions. This information provides the basis for directed development of drugs, seeking to agonize or antagonize PRR activity and study its function in health and ill stages.
Collapse
Affiliation(s)
- E Sánchez-Guerrero
- Sección de Estudios de Posgrado e Investigación, Escuela Superior de Medicina del IPN, Plan de San Luis y Díaz Mirón, Casco de Santo Tomás, México D.F. 11340, Mexico
| | | | | | | | | |
Collapse
|
12
|
Three-dimensional protein structure prediction: Methods and computational strategies. Comput Biol Chem 2014; 53PB:251-276. [DOI: 10.1016/j.compbiolchem.2014.10.001] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 10/03/2014] [Accepted: 10/07/2014] [Indexed: 01/01/2023]
|
13
|
Caulfield TR, Fiesel FC, Moussaud-Lamodière EL, Dourado DFAR, Flores SC, Springer W. Phosphorylation by PINK1 releases the UBL domain and initializes the conformational opening of the E3 ubiquitin ligase Parkin. PLoS Comput Biol 2014; 10:e1003935. [PMID: 25375667 PMCID: PMC4222639 DOI: 10.1371/journal.pcbi.1003935] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 09/25/2014] [Indexed: 11/19/2022] Open
Abstract
Loss-of-function mutations in PINK1 or PARKIN are the most common causes of autosomal recessive Parkinson's disease. Both gene products, the Ser/Thr kinase PINK1 and the E3 Ubiquitin ligase Parkin, functionally cooperate in a mitochondrial quality control pathway. Upon stress, PINK1 activates Parkin and enables its translocation to and ubiquitination of damaged mitochondria to facilitate their clearance from the cell. Though PINK1-dependent phosphorylation of Ser65 is an important initial step, the molecular mechanisms underlying the activation of Parkin's enzymatic functions remain unclear. Using molecular modeling, we generated a complete structural model of human Parkin at all atom resolution. At steady state, the Ub ligase is maintained inactive in a closed, auto-inhibited conformation that results from intra-molecular interactions. Evidently, Parkin has to undergo major structural rearrangements in order to unleash its catalytic activity. As a spark, we have modeled PINK1-dependent Ser65 phosphorylation in silico and provide the first molecular dynamics simulation of Parkin conformations along a sequential unfolding pathway that could release its intertwined domains and enable its catalytic activity. We combined free (unbiased) molecular dynamics simulation, Monte Carlo algorithms, and minimal-biasing methods with cell-based high content imaging and biochemical assays. Phosphorylation of Ser65 results in widening of a newly defined cleft and dissociation of the regulatory N-terminal UBL domain. This motion propagates through further opening conformations that allow binding of an Ub-loaded E2 co-enzyme. Subsequent spatial reorientation of the catalytic centers of both enzymes might facilitate the transfer of the Ub moiety to charge Parkin. Our structure-function study provides the basis to elucidate regulatory mechanisms and activity of the neuroprotective Parkin. This may open up new avenues for the development of small molecule Parkin activators through targeted drug design. Parkinson's disease (PD) is a devastating neurological condition caused by the selective and progressive degeneration of dopaminergic neurons in the brain. Loss-of-function mutations in the PINK1 or PARKIN genes are the most common causes of recessively inherited PD. Together the encoded proteins coordinate a protective cellular quality control pathway that allows elimination of impaired mitochondria in order to prevent further cellular damage and ultimately death. Although it is known that the kinase PINK1 operates upstream and activates the E3 Ubiquitin ligase Parkin, the molecular mechanisms remain elusive. Here, we combined state-of-the art computational and functional biological methods to demonstrate that Parkin is sequentially activated through PINK1-dependent phosphorylation and subsequent structural rearrangement. The induced motions result in release of Parkin's closed, auto-inhibited conformation to liberate its enzymatic functions. We provide for the first time a complete protein structure of Parkin at an all atom resolution and a comprehensive molecular dynamics simulation of its activation and opening conformations. The generated models will allow uncovering the exact mechanisms of regulation and enzymatic activity of Parkin and potentially the development of novel therapeutics through a structure-function-based drug design.
Collapse
Affiliation(s)
- Thomas R. Caulfield
- Department of Neuroscience, Mayo Clinic Jacksonville, Florida, United States of America
- * E-mail: (TRC); (WS)
| | - Fabienne C. Fiesel
- Department of Neuroscience, Mayo Clinic Jacksonville, Florida, United States of America
| | | | - Daniel F. A. R. Dourado
- Department of Cell & Molecular Biology, Computational & Systems Biology, Uppsala University, Uppsala, Sweden
| | - Samuel C. Flores
- Department of Cell & Molecular Biology, Computational & Systems Biology, Uppsala University, Uppsala, Sweden
| | - Wolfdieter Springer
- Department of Neuroscience, Mayo Clinic Jacksonville, Florida, United States of America
- Mayo Graduate School, Neurobiology of Disease, Mayo Clinic, Jacksonville, Florida, United States of America
- * E-mail: (TRC); (WS)
| |
Collapse
|
14
|
Hoffmann F, Vancea I, Kamat SG, Strodel B. Protein structure prediction: assembly of secondary structure elements by basin-hopping. Chemphyschem 2014; 15:3378-90. [PMID: 25056272 DOI: 10.1002/cphc.201402247] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Indexed: 12/30/2022]
Abstract
The prediction of protein tertiary structure from primary structure remains a challenging task. One possible approach to this problem is the application of basin-hopping global optimization combined with an all-atom force field. In this work, the efficiency of basin-hopping is improved by introducing an approach that derives tertiary structures from the secondary structure assignments of individual residues. This approach is termed secondary-to-tertiary basin-hopping and benchmarked for three miniproteins: trpzip, trp-cage and ER-10. For each of the three miniproteins, the secondary-to-tertiary basin-hopping approach successfully and reliably predicts their three-dimensional structure. When it is applied to larger proteins, correctly folded structures are obtained. It can be concluded that the assembly of secondary structure elements using basin-hopping is a promising tool for de novo protein structure prediction.
Collapse
Affiliation(s)
- Falk Hoffmann
- Institute of Complex Systems: Structural Biochemistry, Forschungszentrum Jülich, 52425 Jülich (Germany)
| | | | | | | |
Collapse
|
15
|
Ohue M, Matsuzaki Y, Shimoda T, Ishida T, Akiyama Y. Highly precise protein-protein interaction prediction based on consensus between template-based and de novo docking methods. BMC Proc 2013; 7:S6. [PMID: 24564962 PMCID: PMC4044902 DOI: 10.1186/1753-6561-7-s7-s6] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Background Elucidation of protein-protein interaction (PPI) networks is important for understanding disease mechanisms and for drug discovery. Tertiary-structure-based in silico PPI prediction methods have been developed with two typical approaches: a method based on template matching with known protein structures and a method based on de novo protein docking. However, the template-based method has a narrow applicable range because of its use of template information, and the de novo docking based method does not have good prediction performance. In addition, both of these in silico prediction methods have insufficient precision, and require validation of the predicted PPIs by biological experiments, leading to considerable expenditure; therefore, PPI prediction methods with greater precision are needed. Results We have proposed a new structure-based PPI prediction method by combining template-based prediction and de novo docking prediction. When we applied the method to the human apoptosis signaling pathway, we obtained a precision value of 0.333, which is higher than that achieved using conventional methods (0.231 for PRISM, a template-based method, and 0.145 for MEGADOCK, a non-template-based method), while maintaining an F-measure value (0.285) comparable to that obtained using conventional methods (0.296 for PRISM, and 0.220 for MEGADOCK). Conclusions Our consensus method successfully predicted a PPI network with greater precision than conventional template/non-template methods, which may thus reduce the cost of validation by laboratory experiments for confirming novel PPIs from predicted PPIs. Therefore, our method may serve as an aid for promoting interactome analysis.
Collapse
|
16
|
Khoury GA, Thompson JP, Smadbeck J, Kieslich CA, Floudas CA. Forcefield_PTM: Ab Initio Charge and AMBER Forcefield Parameters for Frequently Occurring Post-Translational Modifications. J Chem Theory Comput 2013; 9:5653-5674. [PMID: 24489522 PMCID: PMC3904396 DOI: 10.1021/ct400556v] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
In this work, we introduce Forcefield_PTM, a set of AMBER forcefield parameters consistent with ff03 for 32 common post-translational modifications. Partial charges were calculated through ab initio calculations and a two-stage RESP-fitting procedure in an ether-like implicit solvent environment. The charges were found to be generally consistent with others previously reported for phosphorylated amino acids, and trimethyllysine, using different parameterization methods. Pairs of modified and their corresponding unmodified structures were curated from the PDB for both single and multiple modifications. Background structural similarity was assessed in the context of secondary and tertiary structures from the global dataset. Next, the charges derived for Forcefield_PTM were tested on a macroscopic scale using unrestrained all-atom Langevin molecular dynamics simulations in AMBER for 34 (17 pairs of modified/unmodified) systems in implicit solvent. Assessment was performed in the context of secondary structure preservation, stability in energies, and correlations between the modified and unmodified structure trajectories on the aggregate. As an illustration of their utility, the parameters were used to compare the structural stability of the phosphorylated and dephosphorylated forms of OdhI. Microscopic comparisons between quantum and AMBER single point energies along key χ torsions on several PTMs were performed and corrections to improve their agreement in terms of mean squared errors and squared correlation coefficients were parameterized. This forcefield for post-translational modifications in condensed-phase simulations can be applied to a number of biologically relevant and timely applications including protein structure prediction, protein and peptide design, docking, and to study the effect of PTMs on folding and dynamics. We make the derived parameters and an associated interactive webtool capable of performing post-translational modifications on proteins using Forcefield_PTM available at http://selene.princeton.edu/FFPTM.
Collapse
Affiliation(s)
- George A. Khoury
- Department of Chemical and Biological Engineering, Princeton, NJ, USA
| | - Jeff P. Thompson
- Department of Chemical and Biological Engineering, Princeton, NJ, USA
| | - James Smadbeck
- Department of Chemical and Biological Engineering, Princeton, NJ, USA
| | - Chris A. Kieslich
- Department of Chemical and Biological Engineering, Princeton, NJ, USA
| | | |
Collapse
|
17
|
Qi X, Vargas E, Larsen L, Knapp W, Hatfield GW, Lathrop R, Sandmeyer S. Directed DNA shuffling of retrovirus and retrotransposon integrase protein domains. PLoS One 2013; 8:e63957. [PMID: 23691126 PMCID: PMC3656877 DOI: 10.1371/journal.pone.0063957] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2012] [Accepted: 04/11/2013] [Indexed: 12/15/2022] Open
Abstract
Chimeric proteins are used to study protein domain functions and to recombine protein domains for novel or optimal functions. We used a library of chimeric integrase proteins to study DNA integration specificity. The library was constructed using a directed shuffling method that we adapted from fusion PCR. This method easily and accurately shuffles multiple DNA gene sequences simultaneously at specific base-pair positions, such as protein domain boundaries. It produced all 27 properly-ordered combinations of the amino-terminal, catalytic core, and carboxyl-terminal domains of the integrase gene from human immunodeficiency virus, prototype foamy virus, and Saccharomyces cerevisiae retrotransposon Ty3. Retrotransposons can display dramatic position-specific integration specificity compared to retroviruses. The yeast retrotransposon Ty3 integrase interacts with RNA polymerase III transcription factors to target integration at the transcription initiation site. In vitro assays of the native and chimeric proteins showed that human immunodeficiency virus integrase was active with heterologous substrates, whereas prototype foamy virus and Ty3 integrases were not. This observation was consistent with a lower substrate specificity for human immunodeficiency virus integrase than for other retrovirus integrases. All eight chimeras containing the Ty3 integrase carboxyl-terminal domain, a candidate targeting domain, failed to target strand transfer in the presence of the targeting protein, suggesting that multiple domains of the Ty3 integrase cooperate in this function.
Collapse
Affiliation(s)
- Xiaojie Qi
- Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
| | - Edwin Vargas
- Department of Computer Science, School of Information and Computer Sciences, University of California Irvine, Irvine, California, United States of America
- Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
| | - Liza Larsen
- Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
| | - Whitney Knapp
- Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
| | - G. Wesley Hatfield
- Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
- Department of Chemical Engineering and Materials Science, School of Engineering, University of California Irvine, Irvine, California, United States of America
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California Irvine, Irvine, California, United States of America
- Department of Biomedical Engineering, School of Engineering, University of California Irvine, Irvine, California, United States of America
- CODA Genomics, Inc., Laguna Hills, California, United States of America
| | - Richard Lathrop
- Department of Computer Science, School of Information and Computer Sciences, University of California Irvine, Irvine, California, United States of America
- Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
- Department of Biomedical Engineering, School of Engineering, University of California Irvine, Irvine, California, United States of America
- CODA Genomics, Inc., Laguna Hills, California, United States of America
| | - Suzanne Sandmeyer
- Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
- Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
- Department of Chemical Engineering and Materials Science, School of Engineering, University of California Irvine, Irvine, California, United States of America
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California Irvine, Irvine, California, United States of America
- * E-mail:
| |
Collapse
|
18
|
Capturing native/native like structures with a physico-chemical metric (pcSM) in protein folding. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1834:1520-31. [PMID: 23665455 DOI: 10.1016/j.bbapap.2013.04.023] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2013] [Revised: 04/12/2013] [Accepted: 04/15/2013] [Indexed: 12/15/2022]
Abstract
Specification of the three dimensional structure of a protein from its amino acid sequence, also called a "Grand Challenge" problem, has eluded a solution for over six decades. A modestly successful strategy has evolved over the last couple of decades based on development of scoring functions (e.g. mimicking free energy) that can capture native or native-like structures from an ensemble of decoys generated as plausible candidates for the native structure. A scoring function must be fast enough in discriminating the native from unfolded/misfolded structures, and requires validation on a large data set(s) to generate sufficient confidence in the score. Here we develop a scoring function called pcSM that detects true native structure in the top 5 with 93% accuracy from an ensemble of candidate structures. If we eliminate the native from ensemble of decoys then pcSM is able to capture near native structure (RMSD<=5Ǻ) in top 10 with 86% accuracy. The parameters considered in pcSM are a C-alpha Euclidean metric, secondary structural propensity, surface areas and an intramolecular energy function. pcSM has been tested on 415 systems consisting 142,698 decoys (public and CASP-largest reported hitherto in literature). The average rank for the native is 2.38, a significant improvement over that existing in literature. In-silico protein structure prediction requires robust scoring technique(s). Therefore, pcSM is easily amenable to integration into a successful protein structure prediction strategy. The tool is freely available at http://www.scfbio-iitd.res.in/software/pcsm.jsp.
Collapse
|
19
|
Karakaş M, Woetzel N, Staritzbichler R, Alexander N, Weiner BE, Meiler J. BCL::Fold--de novo prediction of complex and large protein topologies by assembly of secondary structure elements. PLoS One 2012; 7:e49240. [PMID: 23173050 PMCID: PMC3500284 DOI: 10.1371/journal.pone.0049240] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Accepted: 10/07/2012] [Indexed: 01/10/2023] Open
Abstract
Computational de novo protein structure prediction is limited to small proteins of simple topology. The present work explores an approach to extend beyond the current limitations through assembling protein topologies from idealized α-helices and β-strands. The algorithm performs a Monte Carlo Metropolis simulated annealing folding simulation. It optimizes a knowledge-based potential that analyzes radius of gyration, β-strand pairing, secondary structure element (SSE) packing, amino acid pair distance, amino acid environment, contact order, secondary structure prediction agreement and loop closure. Discontinuation of the protein chain favors sampling of non-local contacts and thereby creation of complex protein topologies. The folding simulation is accelerated through exclusion of flexible loop regions further reducing the size of the conformational search space. The algorithm is benchmarked on 66 proteins with lengths between 83 and 293 amino acids. For 61 out of these proteins, the best SSE-only models obtained have an RMSD100 below 8.0 Å and recover more than 20% of the native contacts. The algorithm assembles protein topologies with up to 215 residues and a relative contact order of 0.46. The method is tailored to be used in conjunction with low-resolution or sparse experimental data sets which often provide restraints for regions of defined secondary structure.
Collapse
Affiliation(s)
- Mert Karakaş
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Nils Woetzel
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Rene Staritzbichler
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Nathan Alexander
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Brian E. Weiner
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Jens Meiler
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
20
|
Zhao F, Xu J. A position-specific distance-dependent statistical potential for protein structure and functional study. Structure 2012; 20:1118-26. [PMID: 22608968 PMCID: PMC3372698 DOI: 10.1016/j.str.2012.04.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2012] [Revised: 04/09/2012] [Accepted: 04/10/2012] [Indexed: 10/28/2022]
Abstract
Although studied extensively, designing highly accurate protein energy potential is still challenging. A lot of knowledge-based statistical potentials are derived from the inverse of the Boltzmann law and consist of two major components: observed atomic interacting probability and reference state. These potentials mainly distinguish themselves in the reference state and use a similar simple counting method to estimate the observed probability, which is usually assumed to correlate with only atom types. This article takes a rather different view on the observed probability and parameterizes it by the protein sequence profile context of the atoms and the radius of the gyration, in addition to atom types. Experiments confirm that our position-specific statistical potential outperforms currently the popular ones in several decoy discrimination tests. Our results imply that, in addition to reference state, the observed probability also makes energy potentials different and evolutionary information greatly boost performance of energy potentials.
Collapse
Affiliation(s)
- Feng Zhao
- Toyota Technological Institute at Chicago, Chicago IL, USA 60637
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, Chicago IL, USA 60637
| |
Collapse
|
21
|
Zhou H, Skolnick J. Template-based protein structure modeling using TASSER(VMT.). Proteins 2011; 80:352-61. [PMID: 22105797 DOI: 10.1002/prot.23183] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2011] [Revised: 08/25/2011] [Accepted: 09/04/2011] [Indexed: 12/29/2022]
Abstract
Template-based protein structure modeling is commonly used for protein structure prediction. Based on the observation that multiple template-based methods often perform better than single template-based methods, we further explore the use of a variable number of multiple templates for a given target in the latest variant of TASSER, TASSER(VMT) . We first develop an algorithm that improves the target-template alignment for a given template. The improved alignment, called the SP(3) alternative alignment, is generated by a parametric alignment method coupled with short TASSER refinement on models selected using knowledge-based scores. The refined top model is then structurally aligned to the template to produce the SP(3) alternative alignment. Templates identified using SP(3) threading are combined with the SP(3) alternative and HHEARCH alignments to provide target alignments to each template. These template models are then grouped into sets containing a variable number of template/alignment combinations. For each set, we run short TASSER simulations to build full-length models. Then, the models from all sets of templates are pooled, and the top 20-50 models selected using FTCOM ranking method. These models are then subjected to a single longer TASSER refinement run for final prediction. We benchmarked our method by comparison with our previously developed approach, pro-sp(3) -TASSER, on a set with 874 easy and 318 hard targets. The average GDT-TS score improvements for the first model are 3.5 and 4.3% for easy and hard targets, respectively. When tested on the 112 CASP9 targets, our method improves the average GDT-TS scores as compared to pro-sp3-TASSER by 8.2 and 9.3% for the 80 easy and 32 hard targets, respectively. It also shows slightly better results than the top ranked CASP9 Zhang-Server, QUARK and HHpredA methods. The program is available for download at http://cssb.biology.gatech.edu/.
Collapse
Affiliation(s)
- Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30318
| | | |
Collapse
|
22
|
Holleboom AG, Kuivenhoven JA, Peelman F, Schimmel AW, Peter J, Defesche JC, Kastelein JJP, Hovingh GK, Stroes ES, Motazacker MM. High prevalence of mutations in LCAT in patients with low HDL cholesterol levels in The Netherlands: identification and characterization of eight novel mutations. Hum Mutat 2011; 32:1290-8. [PMID: 21901787 DOI: 10.1002/humu.21578] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Accepted: 07/04/2011] [Indexed: 12/13/2022]
Abstract
Lecithin:cholesterol acyltransferase (LCAT) is crucial to the maturation of high-density lipoprotein (HDL). Homozygosity for LCAT mutations underlies rare disorders characterized by HDL-cholesterol (HDL-c) deficiency while heterozygotes have half normal HDL-c levels. We studied the prevalence of LCAT mutations in referred patients with low HDL-c to better understand the molecular basis of low HDL-c in our patients. LCAT was sequenced in 98 patients referred for HDL-c <5th percentile and in four patients referred for low HDL-c and corneal opacities. LCAT mutations were highly prevalent: in 28 of the 98 participants (29%), heterozygosity for nonsynonymous mutations was identified while 18 patients carried the same mutation (p.T147I). The four patients with corneal opacity were compound heterozygotes. All previously identified mutations are documented to cause loss of catalytic activity. Nine novel mutations-c.402G>T (p.E134D), c.403T>A (p.Y135N), c.964C>T (p.R322C), c.296G>C (p.W99S), c.736G>T (p.V246F), c.802C>T (p.R268C), c.945G>A (p.W315X), c.1012C>T (p.L338F), and c.1039C>T (p.R347C)--were shown to be functional through in vitro characterization. The effect of several mutations on the core protein structure was studied by a three-dimensional (3D) model. Unlike previous reports, functional mutations in LCAT were found in 29% of patients with low HDL-c, thus constituting a common cause of low HDL-c in referred patients in The Netherlands.
Collapse
Affiliation(s)
- Adriaan G Holleboom
- Department of Vascular Medicine, Academic Medical Center, Amsterdam, The Netherlands
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Pandit SB, Skolnick J. TASSER_low-zsc: an approach to improve structure prediction using low z-score-ranked templates. Proteins 2011; 78:2769-80. [PMID: 20635423 DOI: 10.1002/prot.22791] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
In a variety of threading methods, often poorly ranked (low z-score) templates have good alignments. Here, a new method, TASSER_low-zsc that identifies these low z-score-ranked templates to improve protein structure prediction accuracy, is described. The approach consists of clustering of threading templates by affinity propagation on the basis of structural similarity (thread_cluster) followed by TASSER modeling, with final models selected by using a TASSER_QA variant. To establish the generality of the approach, templates provided by two threading methods, SP(3) and SPARKS(2), are examined. The SP(3) and SPARKS(2) benchmark datasets consist of 351 and 357 medium/hard proteins (those with moderate to poor quality templates and/or alignments) of length < or =250 residues, respectively. For SP(3) medium and hard targets, using thread_cluster, the TM-scores of the best template improve by approximately 4 and 9% over the original set (without low z-score templates) respectively; after TASSER modeling/refinement and ranking, the best model improves by approximately 7 and 9% over the best model generated with the original template set. Moreover, TASSER_low-zsc generates 22% (43%) more foldable medium (hard) targets. Similar improvements are observed with low-ranked templates from SPARKS(2). The template clustering approach could be applied to other modeling methods that utilize multiple templates to improve structure prediction.
Collapse
Affiliation(s)
- Shashi B Pandit
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30318, USA
| | | |
Collapse
|
24
|
Abstract
Homology modeling is based on the observation that related protein sequences adopt similar three-dimensional structures. Hence, a homology model of a protein can be derived using related protein structure(s) as modeling template(s). A key step in this approach is the establishment of correspondence between residues of the protein to be modeled and those of modeling template(s). This step, often referred to as sequence-structure alignment, is one of the major determinants of the accuracy of a homology model. This chapter gives an overview of methods for deriving sequence-structure alignments and discusses recent methodological developments leading to improved performance. However, no method is perfect. How to find alignment regions that may have errors and how to make improvements? This is another focus of this chapter. Finally, the chapter provides a practical guidance of how to get the most of the available tools in maximizing the accuracy of sequence-structure alignments.
Collapse
|
25
|
Abstract
Knowledge-based approaches frequently employ empirical relations to determine effective potentials for coarse-grained protein models directly from protein databank structures. Although these approaches have enjoyed considerable success and widespread popularity in computational protein science, their fundamental basis has been widely questioned. It is well established that conventional knowledge-based approaches do not correctly treat many-body correlations between amino acids. Moreover, the physical significance of potentials determined by using structural statistics from different proteins has remained obscure. In the present work, we address both of these concerns by introducing and demonstrating a theory for calculating transferable potentials directly from a databank of protein structures. This approach assumes that the databank structures correspond to representative configurations sampled from equilibrium solution ensembles for different proteins. Given this assumption, this physics-based theory exactly treats many-body structural correlations and directly determines the transferable potentials that provide a variationally optimized approximation to the free energy landscape for each protein. We illustrate this approach by first constructing a databank of protein structures using a model potential and then quantitatively recovering this potential from the structure databank. The proposed framework will clarify the assumptions and physical significance of knowledge-based potentials, allow for their systematic improvement, and provide new insight into many-body correlations and cooperativity in folded proteins.
Collapse
|
26
|
Zhou H, Skolnick J. Improving threading algorithms for remote homology modeling by combining fragment and template comparisons. Proteins 2010; 78:2041-8. [PMID: 20455261 DOI: 10.1002/prot.22717] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
In this work, we develop a method called fragment comparison and the template comparison (FTCOM) for assessing the global quality of protein structural models for targets of medium and hard difficulty (remote homology) produced by structure prediction approaches such as threading or ab initio structure prediction. FTCOM requires the C(alpha) coordinates of full length models and assesses model quality based on fragment comparison and a score derived from comparison of the model to top threading templates. On a set of 361 medium/hard targets, FTCOM was applied to and assessed for its ability to improve on the results from the SP(3), SPARKS, PROSPECTOR_3, and PRO-SP(3)-TASSER threading algorithms. The average TM-score improves by 5-10% for the first selected model by the new method over models obtained by the original selection procedure in the respective threading methods. Moreover, the number of foldable targets (TM-score >or= 0.4) increases from least 7.6% for SP(3) to 54% for SPARKS. Thus, FTCOM is a promising approach to template selection. Proteins 2010. (c) 2010 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia 30318, USA
| | | |
Collapse
|
27
|
Abstract
Motivation: The challenge of template-based modeling lies in the recognition of correct templates and generation of accurate sequence-template alignments. Homologous information has proved to be very powerful in detecting remote homologs, as demonstrated by the state-of-the-art profile-based method HHpred. However, HHpred does not fare well when proteins under consideration are low-homology. A protein is low-homology if we cannot obtain sufficient amount of homologous information for it from existing protein sequence databases. Results: We present a profile-entropy dependent scoring function for low-homology protein threading. This method will model correlation among various protein features and determine their relative importance according to the amount of homologous information available. When proteins under consideration are low-homology, our method will rely more on structure information; otherwise, homologous information. Experimental results indicate that our threading method greatly outperforms the best profile-based method HHpred and all the top CASP8 servers on low-homology proteins. Tested on the CASP8 hard targets, our threading method is also better than all the top CASP8 servers but slightly worse than Zhang-Server. This is significant considering that Zhang-Server and other top CASP8 servers use a combination of multiple structure-prediction techniques including consensus method, multiple-template modeling, template-free modeling and model refinement while our method is a classical single-template-based threading method without any post-threading refinement. Contact:jinboxu@gmail.com
Collapse
Affiliation(s)
- Jian Peng
- Toyota Technological Institute at Chicago, IL 60637, USA
| | | |
Collapse
|
28
|
Tress ML, Valencia A. Predicted residue-residue contacts can help the scoring of 3D models. Proteins 2010; 78:1980-91. [PMID: 20408174 DOI: 10.1002/prot.22714] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
During the 7th Critical Assessment of Protein Structure Prediction (CASP7) experiment, it was suggested that the real value of predicted residue-residue contacts might lie in the scoring of 3D model structures. Here, we have carried out a detailed reassessment of the contact predictions made during the recent CASP8 experiment to determine whether predicted contacts might aid in the selection of close-to-native structures or be a useful tool for scoring 3D structural models. We used the contacts predicted by the CASP8 residue-residue contact prediction groups to select models for each target domain submitted to the experiment. We found that the information contained in the predicted residue-residue contacts would probably have helped in the selection of 3D models in the free modeling regime and over the harder comparative modeling targets. Indeed, in many cases, the models selected using just the predicted contacts had better GDT-TS scores than all but the best 3D prediction groups. Despite the well-known low accuracy of residue-residue contact predictions, it is clear that the predictive power of contacts can be useful in 3D model prediction strategies.
Collapse
Affiliation(s)
- Michael L Tress
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain.
| | | |
Collapse
|
29
|
Wang Z, Eickholt J, Cheng J. MULTICOM: a multi-level combination approach to protein structure prediction and its assessments in CASP8. Bioinformatics 2010; 26:882-8. [PMID: 20150411 PMCID: PMC2844995 DOI: 10.1093/bioinformatics/btq058] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Revised: 02/02/2010] [Accepted: 02/08/2010] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Protein structure prediction is one of the most important problems in structural bioinformatics. Here we describe MULTICOM, a multi-level combination approach to improve the various steps in protein structure prediction. In contrast to those methods which look for the best templates, alignments and models, our approach tries to combine complementary and alternative templates, alignments and models to achieve on average better accuracy. RESULTS The multi-level combination approach was implemented via five automated protein structure prediction servers and one human predictor which participated in the eighth Critical Assessment of Techniques for Protein Structure Prediction (CASP8), 2008. The MULTICOM servers and human predictor were consistently ranked among the top predictors on the CASP8 benchmark. The methods can predict moderate- to high-resolution models for most template-based targets and low-resolution models for some template-free targets. The results show that the multi-level combination of complementary templates, alternative alignments and similar models aided by model quality assessment can systematically improve both template-based and template-free protein modeling. AVAILABILITY The MULTICOM server is freely available at http://casp.rnet.missouri.edu/multicom_3d.html .
Collapse
Affiliation(s)
- Zheng Wang
- Department of Computer Science, Informatics Institute and C. Bond Life Science Center, University of Missouri, Columbia, MO 65211, USA
| | | | | |
Collapse
|