151
|
Yan J, Kurgan L. DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues. Nucleic Acids Res 2017; 45:e84. [PMID: 28132027 PMCID: PMC5449545 DOI: 10.1093/nar/gkx059] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Accepted: 01/24/2017] [Indexed: 01/18/2023] Open
Abstract
Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many RNA-binding residues as DNA-binding, and vice versa. Most of these methods are also relatively slow, prohibiting applications on the whole-genome scale. We describe a novel sequence-based method, DRNApred, which accurately and in high-throughput predicts and discriminates between DNA- and RNA-binding residues. DRNApred was designed using a new dataset with both DNA- and RNA-binding proteins, regression that penalizes cross-predictions, and a novel two-layered architecture. DRNApred outperforms state-of-the-art predictors of DNA- or RNA-binding residues on a benchmark test dataset by substantially reducing the cross predictions and predicting arguably higher quality false positives that are located nearby the native binding residues. Moreover, it also more accurately predicts the DNA- and RNA-binding proteins. Application on the human proteome confirms that DRNApred reduces the cross predictions among the native nucleic acid binders. Also, novel putative DNA/RNA-binding proteins that it predicts share similar subcellular locations and residue charge profiles with the known native binding proteins. Webserver of DRNApred is freely available at http://biomine.cs.vcu.edu/servers/DRNApred/.
Collapse
Affiliation(s)
- Jing Yan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton T6G 2V4, Canada
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, 23284, USA
| |
Collapse
|
152
|
Meng F, Uversky VN, Kurgan L. Comprehensive review of methods for prediction of intrinsic disorder and its molecular functions. Cell Mol Life Sci 2017; 74:3069-3090. [PMID: 28589442 PMCID: PMC11107660 DOI: 10.1007/s00018-017-2555-4] [Citation(s) in RCA: 130] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 06/01/2017] [Indexed: 12/19/2022]
Abstract
Computational prediction of intrinsic disorder in protein sequences dates back to late 1970 and has flourished in the last two decades. We provide a brief historical overview, and we review over 30 recent predictors of disorder. We are the first to also cover predictors of molecular functions of disorder, including 13 methods that focus on disordered linkers and disordered protein-protein, protein-RNA, and protein-DNA binding regions. We overview their predictive models, usability, and predictive performance. We highlight newest methods and predictors that offer strong predictive performance measured based on recent comparative assessments. We conclude that the modern predictors are relatively accurate, enjoy widespread use, and many of them are fast. Their predictions are conveniently accessible to the end users, via web servers and databases that store pre-computed predictions for millions of proteins. However, research into methods that predict many not yet addressed functions of intrinsic disorder remains an outstanding challenge.
Collapse
Affiliation(s)
- Fanchi Meng
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, Pushchino, Moscow Region, Russian Federation
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, USA.
| |
Collapse
|
153
|
Tarnovskaya S, Kiselev A, Kostareva A, Frishman D. Structural consequences of mutations associated with idiopathic restrictive cardiomyopathy. Amino Acids 2017; 49:1815-1829. [DOI: 10.1007/s00726-017-2480-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Accepted: 08/12/2017] [Indexed: 02/02/2023]
|
154
|
Uversky VN, El-Baky NA, El-Fakharany EM, Sabry A, Mattar EH, Uversky AV, Redwan EM. Functionality of intrinsic disorder in tumor necrosis factor-α and its receptors. FEBS J 2017; 284:3589-3618. [PMID: 28746777 DOI: 10.1111/febs.14182] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Revised: 06/15/2017] [Accepted: 07/20/2017] [Indexed: 01/02/2023]
Affiliation(s)
- Vladimir N. Uversky
- Department of Biological Sciences; Faculty of Sciences; King Abdulaziz University; Jeddah Saudi Arabia
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute; Morsani College of Medicine; University of South Florida; Tampa FL USA
- Institute for Biological Instrumentation; Russian Academy of Sciences; Pushchino, Moscow Region Russia
| | - Nawal Abd El-Baky
- Protective Proteins Laboratory; Protein Research Department; Genetic Engineering and Biotechnology Research Institute; City for Scientific Research and Technology Applications; New Borg EL-Arab, Alexandria Egypt
| | - Esmail M. El-Fakharany
- Protective Proteins Laboratory; Protein Research Department; Genetic Engineering and Biotechnology Research Institute; City for Scientific Research and Technology Applications; New Borg EL-Arab, Alexandria Egypt
| | - Amira Sabry
- Protective Proteins Laboratory; Protein Research Department; Genetic Engineering and Biotechnology Research Institute; City for Scientific Research and Technology Applications; New Borg EL-Arab, Alexandria Egypt
| | - Ehab H. Mattar
- Department of Biological Sciences; Faculty of Sciences; King Abdulaziz University; Jeddah Saudi Arabia
| | - Alexey V. Uversky
- Center for Data Analytics and Biomedical Informatics; Department of Computer and Information Sciences; College of Science and Technology; Temple University; Philadelphia PA USA
| | - Elrashdy M. Redwan
- Department of Biological Sciences; Faculty of Sciences; King Abdulaziz University; Jeddah Saudi Arabia
- Protective Proteins Laboratory; Protein Research Department; Genetic Engineering and Biotechnology Research Institute; City for Scientific Research and Technology Applications; New Borg EL-Arab, Alexandria Egypt
| |
Collapse
|
155
|
Meng F, Kurgan L. DFLpred: High-throughput prediction of disordered flexible linker regions in protein sequences. Bioinformatics 2017; 32:i341-i350. [PMID: 27307636 PMCID: PMC4908364 DOI: 10.1093/bioinformatics/btw280] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Motivation: Disordered flexible linkers (DFLs) are disordered regions that serve as flexible linkers/spacers in multi-domain proteins or between structured constituents in domains. They are different from flexible linkers/residues because they are disordered and longer. Availability of experimentally annotated DFLs provides an opportunity to build high-throughput computational predictors of these regions from protein sequences. To date, there are no computational methods that directly predict DFLs and they can be found only indirectly by filtering predicted flexible residues with predictions of disorder. Results: We conceptualized, developed and empirically assessed a first-of-its-kind sequence-based predictor of DFLs, DFLpred. This method outputs propensity to form DFLs for each residue in the input sequence. DFLpred uses a small set of empirically selected features that quantify propensities to form certain secondary structures, disordered regions and structured regions, which are processed by a fast linear model. Our high-throughput predictor can be used on the whole-proteome scale; it needs <1 h to predict entire proteome on a single CPU. When assessed on an independent test dataset with low sequence-identity proteins, it secures area under the receiver operating characteristic curve equal 0.715 and outperforms existing alternatives that include methods for the prediction of flexible linkers, flexible residues, intrinsically disordered residues and various combinations of these methods. Prediction on the complete human proteome reveals that about 10% of proteins have a large content of over 30% DFL residues. We also estimate that about 6000 DFL regions are long with ≥30 consecutive residues. Availability and implementation:http://biomine.ece.ualberta.ca/DFLpred/. Contact:lkurgan@vcu.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Fanchi Meng
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton T6G 2V4, Canada
| | - Lukasz Kurgan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton T6G 2V4, Canada Department of Computer Science, Virginia Commonwealth University, Richmond, 23284, U.S.A
| |
Collapse
|
156
|
Uversky VN. Intrinsic Disorder, Protein-Protein Interactions, and Disease. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2017; 110:85-121. [PMID: 29413001 DOI: 10.1016/bs.apcsb.2017.06.005] [Citation(s) in RCA: 78] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
It is recognized now that biologically active proteins without stable tertiary structure (known as intrinsically disordered proteins, IDPs) and hybrid proteins containing ordered domains and intrinsically disordered protein regions (IDPRs) are important players found in any given proteome. These IDPs/IDPRs possess functions that complement functional repertoire of their ordered counterparts, being commonly related to recognition, as well as control and regulation of various signaling pathways. They are interaction masters, being able to utilize a wide spectrum of interaction mechanisms, ranging from induced folding to formation of fuzzy complexes where significant levels of disorder are preserved, to polyvalent stochastic interactions playing crucial roles in the liquid-liquid phase transitions leading to the formation of proteinaceous membrane-less organelles. IDPs/IDPRs are tightly controlled themselves via various means, including alternative splicing, precisely controlled expression and degradation, binding to specific partners, and posttranslational modifications. Distortions in the regulation and control of IDPs/IDPRs, as well as their aberrant interactivity are commonly associated with various human diseases. This review presents some aspects of the intrinsic disorder-based functionality and dysfunctionality, paying special attention to the normal and pathological protein-protein interactions.
Collapse
Affiliation(s)
- Vladimir N Uversky
- USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, United States; Institute for Biological Instrumentation, Russian Academy of Sciences, Pushchino, Moscow Region, Russia.
| |
Collapse
|
157
|
Whelan JN, Reddy KD, Uversky VN, Teng MN. Functional correlations of respiratory syncytial virus proteins to intrinsic disorder. MOLECULAR BIOSYSTEMS 2017; 12:1507-26. [PMID: 27062995 DOI: 10.1039/c6mb00122j] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Protein intrinsic disorder is an important characteristic demonstrated by the absence of higher order structure, and is commonly detected in multifunctional proteins encoded by RNA viruses. Intrinsically disordered regions (IDRs) of proteins exhibit high flexibility and solvent accessibility, which permit several distinct protein functions, including but not limited to binding of multiple partners and accessibility for post-translational modifications. IDR-containing viral proteins can therefore execute various functional roles to enable productive viral replication. Respiratory syncytial virus (RSV) is a globally circulating, non-segmented, negative sense (NNS) RNA virus that causes severe lower respiratory infections. In this study, we performed a comprehensive evaluation of predicted intrinsic disorder of the RSV proteome to better understand the functional role of RSV protein IDRs. We included 27 RSV strains to sample major RSV subtypes and genotypes, as well as geographic and temporal isolate differences. Several types of disorder predictions were applied to the RSV proteome, including per-residue (PONDR®-FIT and PONDR® VL-XT), binary (CH, CDF, CH-CDF), and disorder-based interactions (ANCHOR and MoRFpred). We classified RSV IDRs by size, frequency and function. Finally, we determined the functional implications of RSV IDRs by mapping predicted IDRs to known functional domains of each protein. Identification of RSV IDRs within functional domains improves our understanding of RSV pathogenesis in addition to providing potential therapeutic targets. Furthermore, this approach can be applied to other NNS viruses that encode essential multifunctional proteins for the elucidation of viral protein regions that can be manipulated for attenuation of viral replication.
Collapse
Affiliation(s)
- Jillian N Whelan
- Division of Allergy and Immunology, Department of Internal Medicine, and the Joy McCann Culverhouse Airway Diseases Research Center, University of South Florida Morsani College of Medicine, Tampa, FL, USA.
| | - Krishna D Reddy
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA and Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA and Institute for Biological Instrumentation, Russian Academy of Sciences, 142292 Pushchino, Moscow Region, Russia
| | - Michael N Teng
- Division of Allergy and Immunology, Department of Internal Medicine, and the Joy McCann Culverhouse Airway Diseases Research Center, University of South Florida Morsani College of Medicine, Tampa, FL, USA.
| |
Collapse
|
158
|
Liu Y, Wu J, Sun N, Tu C, Shi X, Cheng H, Liu S, Li S, Wang Y, Zheng Y, Uversky VN. Intrinsically Disordered Proteins as Important Players during Desiccation Stress of Soybean Radicles. J Proteome Res 2017; 16:2393-2409. [PMID: 28525284 DOI: 10.1021/acs.jproteome.6b01045] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Intrinsically disordered proteins (IDPs) play a variety of important physiological roles in all living organisms. However, there is no comprehensive analysis of the abundance of IDPs associated with environmental stress in plants. Here, we show that a set of heat-stable proteins (i.e., proteins that do not denature after boiling at 100 °C for 10 min) was present in R0mm and R15mm radicles (i.e., before radicle emergence and 15 mm long radicles) of soybean (Glycine max) seeds. This set of 795 iTRAQ-quantified heat-stable proteins contained a high proportion of wholly or highly disordered proteins (15%), which was significantly higher than that estimated for the whole soybean proteome containing 55,787 proteins (9%). The heat-stable proteome of soybean radicles that contain many IDPs could protect lactate dehydrogenase (LDH) during freeze-thaw cycles. Comparison of the 795 heat-stable proteins in the R0mm and R15mm soybean radicles revealed that many of these proteins changed abundance during seedling growth with 170 and 89 proteins being more abundant in R0mm and R15mm, respectively. KEGG analysis identified 18 proteins from the cysteine and methionine metabolism pathways and nine proteins from the phenylpropanoid biosynthesis pathway. As an important type of IDP related to stress, 30 late embryogenesis abundant proteins were also found. Ten selected proteins with high levels of predicted intrinsic disorder were able to efficiently protect LDH from the freeze-thaw-induced inactivation, but the protective ability was not correlated with the disorder content of these proteins. These observations suggest that protection of the enzymes and other proteins in a stressed cell can be one of the biological functions of plant IDPs.
Collapse
Affiliation(s)
- Yun Liu
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Jiahui Wu
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Nan Sun
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Chengjian Tu
- Department of Pharmaceutical Sciences, State University of New York at Buffalo , 285 Kapoor Hall, Buffalo, New York14260, United States
| | - Xiaoying Shi
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Hua Cheng
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Simu Liu
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Shuiming Li
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Yong Wang
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Yizhi Zheng
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University , Nanhai Ave 3688, Shenzhen, Guangdong 518060, China
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida , 12901 Bruce B. Downs Boulevard MDC07, Tampa, Florida 33612, United States
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences , Institutskaya str., 7, Pushchino, Moscow region 142290, Russia
| |
Collapse
|
159
|
Na I, Kong MJ, Straight S, Pinto JR, Uversky VN. Troponins, intrinsic disorder, and cardiomyopathy. Biol Chem 2017; 397:731-51. [PMID: 27074551 DOI: 10.1515/hsz-2015-0303] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 04/05/2016] [Indexed: 11/15/2022]
Abstract
Cardiac troponin is a dynamic complex of troponin C, troponin I, and troponin T (TnC, TnI, and TnT, respectively) found in the myocyte thin filament where it plays an essential role in cardiac muscle contraction. Mutations in troponin subunits are found in inherited cardiomyopathies, such as hypertrophic cardiomyopathy (HCM) and dilated cardiomyopathy (DCM). The highly dynamic nature of human cardiac troponin and presence of numerous flexible linkers in its subunits suggest that understanding of structural and functional properties of this important complex can benefit from the consideration of the protein intrinsic disorder phenomenon. We show here that mutations causing decrease in the disorder score in TnI and TnT are significantly more abundant in HCM and DCM than mutations leading to the increase in the disorder score. Identification and annotation of intrinsically disordered regions in each of the troponin subunits conducted in this study can help in better understanding of the roles of intrinsic disorder in regulation of interactomes and posttranslational modifications of these proteins. These observations suggest that disease-causing mutations leading to a decrease in the local flexibility of troponins can trigger a whole plethora of functional changes in the heart.
Collapse
|
160
|
Migliaccio AR, Uversky VN. Dissecting physical structure of calreticulin, an intrinsically disordered Ca 2+-buffering chaperone from endoplasmic reticulum. J Biomol Struct Dyn 2017; 36:1617-1636. [PMID: 28504081 DOI: 10.1080/07391102.2017.1330224] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Calreticulin (CALR) is a Ca2+ binding multifunctional protein that mostly resides in the endoplasmic reticulum (ER) and plays a number of important roles in various physiological and pathological processes. Although the major functions ascribed to CALR are controlling the Ca2+ homeostasis in ER and acting as a lectin-like ER chaperon for many glycoproteins, this moonlighting protein can be found in various cellular compartments where it has many non-ER functions. To shed more light on the mechanisms underlying polyfunctionality of this moonlighting protein that can be found in different cellular compartments and that possesses a wide spectrum of unrelated biological activities, being able to interact with Ca2+ (and potentially other metal ions), RNA, oligosaccharides, and numerous proteins, we used a set of experimental and computational tools to evaluate the intrinsic disorder status of CALR and the role of calcium binding on structural properties and conformational stability of the full-length CALR and its isolated P- and C-domains.
Collapse
Affiliation(s)
- Anna Rita Migliaccio
- a Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai (ISMMS) , New York , NY , USA.,b Department of Biomedical and Neuromotorial Sciences , Alma Mater University , Bologna , Italy
| | - Vladimir N Uversky
- c Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute , Morsani College of Medicine, University of South Florida , Tampa , FL , USA.,d Laboratory of New Methods in Biology , Institute for Biological Instrumentation, Russian Academy of Sciences , Pushchino , Moscow Region 142290 , Russia
| |
Collapse
|
161
|
Yruela I, Contreras-Moreira B, Magalhães C, Osório NS, Gonzalo-Asensio J. Mycobacterium tuberculosis Complex Exhibits Lineage-Specific Variations Affecting Protein Ductility and Epitope Recognition. Genome Biol Evol 2017; 8:3751-3764. [PMID: 28062754 PMCID: PMC5521731 DOI: 10.1093/gbe/evw279] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/18/2016] [Indexed: 12/19/2022] Open
Abstract
The advent of whole-genome sequencing has provided an unprecedented detail about the evolution and genetic significance of species-specific variations across the whole Mycobacterium tuberculosis Complex. However, little attention has been focused on understanding the functional roles of these variations in the protein coding sequences. In this work, we compare the coding sequences from 74 sequenced mycobacterial species including M. africanum, M. bovis, M. canettii, M. caprae, M. orygis, and M. tuberculosis. Results show that albeit protein variations affect all functional classes, those proteins involved in lipid and intermediary metabolism and respiration have accumulated mutations during evolution. To understand the impact of these mutations on protein functionality, we explored their implications on protein ductility/disorder, a yet unexplored feature of mycobacterial proteomes. In agreement with previous studies, we found that a Gly71Ile substitution in the PhoPR virulence system severely affects the ductility of its nearby region in M. africanum and animal-adapted species. In the same line of evidence, the SmtB transcriptional regulator shows amino acid variations specific to the Beijing lineage, which affects the flexibility of the N-terminal trans-activation domain. Furthermore, despite the fact that MTBC epitopes are evolutionary hyperconserved, we identify strain- and lineage-specific amino acid mutations affecting previously known T-cell epitopes such as EsxH and FbpA (Ag85A). Interestingly, in silico studies reveal that these variations result in differential interaction of epitopes with the main HLA haplogroups.
Collapse
Affiliation(s)
- Inmaculada Yruela
- Estación Experimental de Aula Dei-Consejo Superior de Investigaciones Científicas (EEAD-CSIC), Zaragoza, Spain.,Grupo de Bioquímica, Biofísica y Biología Computacional (BIFI, UNIZAR), Unidad Asociada I+D+i al CSIC, Zaragoza, Spain
| | - Bruno Contreras-Moreira
- Estación Experimental de Aula Dei-Consejo Superior de Investigaciones Científicas (EEAD-CSIC), Zaragoza, Spain.,Grupo de Bioquímica, Biofísica y Biología Computacional (BIFI, UNIZAR), Unidad Asociada I+D+i al CSIC, Zaragoza, Spain.,Fundación ARAID, Aragón, Spain
| | - Carlos Magalhães
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga, Portugal.,ICVS/3B's - PT Government Associate Laboratory, Braga/Guimarães, Portugal
| | - Nuno S Osório
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga, Portugal.,ICVS/3B's - PT Government Associate Laboratory, Braga/Guimarães, Portugal
| | - Jesús Gonzalo-Asensio
- Grupo de Genética de Micobacterias, Departamento de Microbiología y Medicina Preventiva, Facultad de Medicina, Universidad de Zaragoza, Zaragoza, Spain.,CIBER Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain.,Instituto de Biocomputación y Física de Sistemas Complejos (BIFI-UNIZAR), Zaragoza, Spain.,Servicio de Microbiología Hospital Universitario Miguel Servet, ISS Aragón, Zaragoza, Spain
| |
Collapse
|
162
|
Meng F, Uversky V, Kurgan L. Computational Prediction of Intrinsic Disorder in Proteins. ACTA ACUST UNITED AC 2017; 88:2.16.1-2.16.14. [DOI: 10.1002/cpps.28] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Fanchi Meng
- Department of Electrical and Computer Engineering, University of Alberta Edmonton Canada
| | - Vladimir Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida Tampa FL USA
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences St. Petersburg Russia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University Richmond USA
| |
Collapse
|
163
|
Basu S, Söderquist F, Wallner B. Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins. J Comput Aided Mol Des 2017; 31:453-466. [PMID: 28365882 PMCID: PMC5429364 DOI: 10.1007/s10822-017-0020-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 03/24/2017] [Indexed: 12/03/2022]
Abstract
The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents ‘Proteus’, a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible ‘disorder-to-order’ transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus.
Collapse
Affiliation(s)
- Sankar Basu
- Bioinformatics Division, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden.,Department of Biochemistry, University of Calcutta, Kolkata, 700019, India
| | - Fredrik Söderquist
- Bioinformatics Division, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden
| | - Björn Wallner
- Bioinformatics Division, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden. .,Swedish e-Science Research Center, Linköping University, Linköping, Sweden.
| |
Collapse
|
164
|
Nguyen HH, Varadi M, Tompa P, Pauwels K. Affinity purification of human m-calpain through an intrinsically disordered inhibitor, calpastatin. PLoS One 2017; 12:e0174125. [PMID: 28319173 PMCID: PMC5358782 DOI: 10.1371/journal.pone.0174125] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Accepted: 03/04/2017] [Indexed: 12/17/2022] Open
Abstract
Calpains are calcium-activated proteases that have biomedical and biotechnological potential. Their activity is tightly regulated by their endogenous inhibitor, calpastatin that binds to the enzyme only in the presence of calcium. Conventional approaches to purify calpain comprise multiple chromatographic steps, and are labor-intensive, leading to low yields. Here we report a new purification procedure for the human m-calpain based on its reversible calcium-mediated interaction with the intrinsically disordered calpastatin. We exploit the specific binding properties of human calpastatin domain 1 (hCSD1) to physically capture human m-calpain from a complex biological mixture. The dissociation of the complex is mediated by chelating calcium, upon which heterodimeric calpain elutes while hCSD1 remains immobilized onto the stationary phase. This novel affinity-based purification was compared to the conventional multistep purification strategy and we find that it is robust, it yields a homogeneous preparation, it can be scaled up easily and it rests on a non-disruptive step that maintains close to physiological conditions that allow further biophysical and functional studies.
Collapse
Affiliation(s)
- Hung Huy Nguyen
- VIB Center for Structural Biology (CSB), Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels (SBB), Vrije Universiteit Brussel, Brussels, Belgium
| | - Mihaly Varadi
- VIB Center for Structural Biology (CSB), Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels (SBB), Vrije Universiteit Brussel, Brussels, Belgium
| | - Peter Tompa
- VIB Center for Structural Biology (CSB), Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels (SBB), Vrije Universiteit Brussel, Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences of the Hungarian Academy of Sciences, Budapest, Hungary
| | - Kris Pauwels
- VIB Center for Structural Biology (CSB), Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels (SBB), Vrije Universiteit Brussel, Brussels, Belgium
| |
Collapse
|
165
|
Structural coalescence underlies the aggregation propensity of a β-barrel protein motif. PLoS One 2017; 12:e0170607. [PMID: 28187186 PMCID: PMC5302452 DOI: 10.1371/journal.pone.0170607] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 01/07/2017] [Indexed: 02/07/2023] Open
Abstract
A clear understanding of the structural foundations underlying protein aggregation is an elusive goal of central biomedical importance. A step toward this aim is exemplified by the β-barrel motif represented by the intestinal fatty acid binding protein (IFABP) and two abridged all-β sheet forms (Δ98Δ and Δ78Δ). At odds with the established notion that a perturbation of the native fold should necessarily favor a buildup of intermediate forms with an enhanced tendency to aggregate, the intrinsic stability (ΔG°H2O) of these proteins does not bear a straightforward correlation with their trifluoroethanol (TFE)-induced aggregation propensity. In view of this fact, we found it more insightful to delve into the connection between structure and stability under sub-aggregating conditions (10% TFE). In the absence of the co-solvent, the abridged variants display a common native-like region decorated with a disordered C-terminal stretch. Upon TFE addition, an increase in secondary structure content is observed, assimilating them to the parent protein. In this sense, TFE perturbs a common native like region while exerting a global compaction effect. Importantly, in all cases, fatty acid binding function is preserved. Interestingly, energetic as well as structural diversity in aqueous solution evolves into a common conformational ensemble more akin in stability. These facts reconcile apparent paradoxical findings related to stability and rates of aggregation. This scenario likely mimics the accrual of aggregation-prone species in the population, an early critical event for the development of fibrillation.
Collapse
|
166
|
Neira JL, Bintz J, Arruebo M, Rizzuti B, Bonacci T, Vega S, Lanas A, Velázquez-Campoy A, Iovanna JL, Abián O. Identification of a Drug Targeting an Intrinsically Disordered Protein Involved in Pancreatic Adenocarcinoma. Sci Rep 2017; 7:39732. [PMID: 28054562 PMCID: PMC5213423 DOI: 10.1038/srep39732] [Citation(s) in RCA: 90] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 11/28/2016] [Indexed: 11/13/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are prevalent in eukaryotes, performing signaling and regulatory functions. Often associated with human diseases, they constitute drug-development targets. NUPR1 is a multifunctional IDP, over-expressed and involved in pancreatic ductal adenocarcinoma (PDAC) development. By screening 1120 FDA-approved compounds, fifteen candidates were selected, and their interactions with NUPR1 were characterized by experimental and simulation techniques. The protein remained disordered upon binding to all fifteen candidates. These compounds were tested in PDAC-derived cell-based assays, and all induced cell-growth arrest and senescence, reduced cell migration, and decreased chemoresistance, mimicking NUPR1-deficiency. The most effective compound completely arrested tumor development in vivo on xenografted PDAC-derived cells in mice. Besides reporting the discovery of a compound targeting an intact IDP and specifically active against PDAC, our study proves the possibility to target the ‘fuzzy’ interface of a protein that remains disordered upon binding to its natural biological partners or to selected drugs.
Collapse
Affiliation(s)
- José L Neira
- Instituto de Biología Molecular y Celular, Universidad Miguel Hernández, Edificio Torregaitán, Avda. del Ferrocarril s/n, 03202 Elche, Alicante, Spain.,Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Unidad Asociada IQFR-CSIC-BIFI, Universidad de Zaragoza, Edificio I+D, Mariano Esquillor s/n, 50018 Zaragoza, Spain
| | - Jennifer Bintz
- Centre de Recherche en Cancérologie de Marseille (CRCM), INSERM U1068, CNRS UMR 7258, Aix-Marseille Université and Institut Paoli-Calmettes, Parc Scientifique et Technologique de Luminy, 163 Avenue de Luminy, 13288, Marseille, France
| | - María Arruebo
- Instituto Aragonés de Ciencias de la Salud (IACS), Av. San Juan Bosco 13, 50009 Zaragoza, Spain.,Instituto de Investigaciones Sanitarias (IIS) Aragón, Av. San Juan Bosco, 13, 50009 Zaragoza, Spain
| | - Bruno Rizzuti
- CNR-NANOTEC, Licryl-UOS Cosenza and CEMIF.Cal, Department of Physics, University of Calabria, Via P. Bucci, Cubo 31 C, 87036 Arcavacata di Rende, Cosenza, Italy
| | - Thomas Bonacci
- Centre de Recherche en Cancérologie de Marseille (CRCM), INSERM U1068, CNRS UMR 7258, Aix-Marseille Université and Institut Paoli-Calmettes, Parc Scientifique et Technologique de Luminy, 163 Avenue de Luminy, 13288, Marseille, France
| | - Sonia Vega
- Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Unidad Asociada IQFR-CSIC-BIFI, Universidad de Zaragoza, Edificio I+D, Mariano Esquillor s/n, 50018 Zaragoza, Spain
| | - Angel Lanas
- Instituto de Investigaciones Sanitarias (IIS) Aragón, Av. San Juan Bosco, 13, 50009 Zaragoza, Spain.,Centro de Investigación Biomédica en Red en el Área Temática de Enfermedades Hepáticas y Digestivas (CIBERehd), Spain.,Servicio de Aparato Digestivo, Hospital Clínico Universitario "Lozano Blesa", Av. San Juan Bosco, 15, 50009 Zaragoza, Spain.,Department of Medicine, University of Zaragoza, Perdro Cerbuna 12, 50009 Zaragoza, Spain
| | - Adrián Velázquez-Campoy
- Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Unidad Asociada IQFR-CSIC-BIFI, Universidad de Zaragoza, Edificio I+D, Mariano Esquillor s/n, 50018 Zaragoza, Spain.,Instituto de Investigaciones Sanitarias (IIS) Aragón, Av. San Juan Bosco, 13, 50009 Zaragoza, Spain.,Fundación ARAID, Diputación General de Aragón, C/María de Luna 11, Edificio CEEIARAGÓN, 50018 Zaragoza, Spain
| | - Juan L Iovanna
- Centre de Recherche en Cancérologie de Marseille (CRCM), INSERM U1068, CNRS UMR 7258, Aix-Marseille Université and Institut Paoli-Calmettes, Parc Scientifique et Technologique de Luminy, 163 Avenue de Luminy, 13288, Marseille, France
| | - Olga Abián
- Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Unidad Asociada IQFR-CSIC-BIFI, Universidad de Zaragoza, Edificio I+D, Mariano Esquillor s/n, 50018 Zaragoza, Spain.,Instituto Aragonés de Ciencias de la Salud (IACS), Av. San Juan Bosco 13, 50009 Zaragoza, Spain.,Instituto de Investigaciones Sanitarias (IIS) Aragón, Av. San Juan Bosco, 13, 50009 Zaragoza, Spain.,Centro de Investigación Biomédica en Red en el Área Temática de Enfermedades Hepáticas y Digestivas (CIBERehd), Spain
| |
Collapse
|
167
|
Yang Y, Yoo CG, Guo HB, Rottmann W, Winkeler KA, Collins CM, Gunter LE, Jawdy SS, Yang X, Guo H, Pu Y, Ragauskas AJ, Tuskan GA, Chen JG. Overexpression of a Domain of Unknown Function 266-containing protein results in high cellulose content, reduced recalcitrance, and enhanced plant growth in the bioenergy crop Populus. BIOTECHNOLOGY FOR BIOFUELS 2017; 10:74. [PMID: 28344649 PMCID: PMC5364563 DOI: 10.1186/s13068-017-0760-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2016] [Accepted: 03/18/2017] [Indexed: 05/17/2023]
Abstract
BACKGROUND Domain of Unknown Function 266 (DUF266) is a plant-specific domain. DUF266-containing proteins (DUF266 proteins) have been categorized as 'not classified glycosyltransferases (GTnc)' due to amino acid similarity with GTs. However, little is known about the function of DUF266 proteins. RESULTS Phylogenetic analysis revealed that DUF266 proteins are only present in the land plants including moss and lycophyte. We report the functional characterization of one member of DUF266 proteins in Populus, PdDUF266A. PdDUF266A was ubiquitously expressed with high abundance in the xylem. In Populus transgenic plants overexpressing PdDUF266A (OXPdDUF266A), the glucose and cellulose contents were significantly higher, while the lignin content was lower than that in the wild type. Degree of polymerization of cellulose in OXPdDUF266A transgenic plants was also higher, whereas cellulose crystallinity index remained unchanged. Gene expression analysis indicated that cellulose biosynthesis-related genes such as CESA and SUSY were upregulated in mature leaf and xylem of OXPdDUF266A transgenic plants. Moreover, PdDUF266A overexpression resulted in an increase of biomass production. Their glucose contents and biomass phenotypes were further validated via heterologous expression of PdDUF266A in Arabidopsis. Results from saccharification treatment demonstrated that the rate of sugar release was increased by approximately 38% in the OXPdDUF266A transgenic plants. CONCLUSIONS These results suggest that the overexpression of PdDUF266A can increase cellulose content, reduce recalcitrance, and enhance biomass production, and that PdDUF266A is a promising target for genetic manipulation for biofuel production.
Collapse
Affiliation(s)
- Yongil Yang
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Chang Geun Yoo
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
- UT-ORNL Joint Institute for Biological Science, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Hao-Bo Guo
- Department of Biochemistry & Cellular & Molecular Biology, University of Tennessee, Knoxville, TN 37996 USA
| | | | | | | | - Lee E. Gunter
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Sara S. Jawdy
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Xiaohan Yang
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Hong Guo
- Department of Biochemistry & Cellular & Molecular Biology, University of Tennessee, Knoxville, TN 37996 USA
| | - Yunqiao Pu
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
- UT-ORNL Joint Institute for Biological Science, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Arthur J. Ragauskas
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
- UT-ORNL Joint Institute for Biological Science, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
- Department of Chemical and Biomolecular Engineering & Department of Forestry, Wildlife, and Fisheries, University of Tennessee, Knoxville, TN 37996 USA
| | - Gerald A. Tuskan
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| | - Jin-Gui Chen
- BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831 USA
| |
Collapse
|
168
|
Viscardi LH, Tovo-Rodrigues L, Paré P, Fagundes NJR, Salzano FM, Paixão-Côrtes VR, Bau CHD, Bortolini MC. FOXP in Tetrapoda: Intrinsically Disordered Regions, Short Linear Motifs and their evolutionary significance. Genet Mol Biol 2017; 40:181-190. [PMID: 28257525 PMCID: PMC5409772 DOI: 10.1590/1678-4685-gmb-2016-0115] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 10/03/2016] [Indexed: 11/22/2022] Open
Abstract
The FOXP subfamily is probably the most extensively characterized subfamily of the forkhead superfamily, playing important roles in development and homeostasis in vertebrates. Intrinsically disorder protein regions (IDRs) are protein segments that exhibit multiple physical interactions and play critical roles in various biological processes, including regulation and signaling. IDRs in proteins may play an important role in the evolvability of genetic systems. In this study, we analyzed 77 orthologous FOXP genes/proteins from Tetrapoda, regarding protein disorder content and evolutionary rate. We also predicted the number and type of short linear motifs (SLIMs) in the IDRs. Similar levels of protein disorder (approximately 70%) were found for FOXP1, FOXP2, and FOXP4. However, for FOXP3, which is shorter in length and has a more specific function, the disordered content was lower (30%). Mammals showed higher protein disorders for FOXP1 and FOXP4 than non-mammals. Specific analyses related to linear motifs in the four genes showed also a clear differentiation between FOXPs in mammals and non-mammals. We predicted for the first time the role of IDRs and SLIMs in the FOXP gene family associated with possible adaptive novelties within Tetrapoda. For instance, we found gain and loss of important phosphorylation sites in the Homo sapiens FOXP2 IDR regions, with possible implication for the evolution of human speech.
Collapse
Affiliation(s)
- Lucas Henriques Viscardi
- Programa de Pós-Graduação em Genética e Biologia Molecular,
Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre,
RS, Brazil
| | - Luciana Tovo-Rodrigues
- Programa de Pós-Graduação em Epidemiologia, Universidade Federal de
Pelotas, Pelotas, RS, Brazil
| | - Pamela Paré
- Programa de Pós-Graduação em Genética e Biologia Molecular,
Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre,
RS, Brazil
| | - Nelson Jurandi Rosa Fagundes
- Programa de Pós-Graduação em Genética e Biologia Molecular,
Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre,
RS, Brazil
| | - Francisco Mauro Salzano
- Programa de Pós-Graduação em Genética e Biologia Molecular,
Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre,
RS, Brazil
| | - Vanessa Rodrigues Paixão-Côrtes
- Programa de Pós-Graduação em Genética e Biodiversidade, Instituto de
Biologia, Universidade Federal da Bahia, Salvador, BA, Brazil
| | - Claiton Henrique Dotto Bau
- Programa de Pós-Graduação em Genética e Biologia Molecular,
Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre,
RS, Brazil
| | - Maria Cátira Bortolini
- Programa de Pós-Graduação em Genética e Biologia Molecular,
Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre,
RS, Brazil
| |
Collapse
|
169
|
Wang Y, Guo Y, Pu X, Li M. A sequence-based computational method for prediction of MoRFs. RSC Adv 2017. [DOI: 10.1039/c6ra27161h] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Molecular recognition features (MoRFs) are relatively short segments (10–70 residues) within intrinsically disordered regions (IDRs) that can undergo disorder-to-order transitions during binding to partner proteins.
Collapse
Affiliation(s)
- Yu Wang
- College of Chemistry
- Sichuan University
- Chengdu
- People's Republic of China
| | - Yanzhi Guo
- College of Chemistry
- Sichuan University
- Chengdu
- People's Republic of China
| | - Xuemei Pu
- College of Chemistry
- Sichuan University
- Chengdu
- People's Republic of China
| | - Menglong Li
- College of Chemistry
- Sichuan University
- Chengdu
- People's Republic of China
| |
Collapse
|
170
|
Protein intrinsic disorder-based liquid-liquid phase transitions in biological systems: Complex coacervates and membrane-less organelles. Adv Colloid Interface Sci 2017; 239:97-114. [PMID: 27291647 DOI: 10.1016/j.cis.2016.05.012] [Citation(s) in RCA: 153] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2016] [Accepted: 05/24/2016] [Indexed: 12/18/2022]
Abstract
It is clear now that eukaryotic cells contain numerous membrane-less organelles, many of which are formed in response to changes in the cellular environment. Being typically liquid in nature, many of these organelles can be described as products of the reversible and highly controlled liquid-liquid phase transitions in biological systems. Many of these membrane-less organelles are complex coacervates containing (almost invariantly) intrinsically disordered proteins and often nucleic acids. It seems that the lack of stable structure in major proteinaceous constituents of these organelles is crucial for the formation of phase-separated droplets. This review considers several biologically relevant liquid-liquid phase transitions, introduces some general features attributed to intrinsically disordered proteins, represents several illustrative examples of intrinsic disorder-based phase separation, and provides some reasons for the abundance of intrinsically disordered proteins in organelles formed as a result of biological liquid-liquid phase transitions.
Collapse
|
171
|
Abstract
Intrinsically disordered proteins and regions (IDPs and IDRs) are involved in a wide range of cellular functions and they often facilitate interactions with RNAs, DNAs, and proteins. Although many computational methods can predict IDPs and IDRs in protein sequences, only a few methods predict their functions and these functions primarily concern protein binding. We describe how to use the first computational method DisoRDPbind for high-throughput prediction of multiple functions of disordered regions. Our method predicts the RNA-, DNA-, and protein-binding residues located in IDRs in the input protein sequences. DisoRDPbind provides accurate predictions and is sufficiently fast to make predictions for full genomes. Our method is implemented as a user-friendly webserver that is freely available at http://biomine.ece.ualberta.ca/DisoRDPbind/ . We overview our predictor, discuss how to run the webserver, and show how to interpret the corresponding results. We also demonstrate the utility of our method based on two case studies, human BRCA1 protein that binds various proteins and DNA, and yeast 60S ribosomal protein L4 that interacts with proteins and RNA.
Collapse
|
172
|
Abstract
Currently available computational tools, which are many, provide a researcher with the multitude of options for prediction of intrinsic disorder in a protein of interest and for finding at least some of its disorder-based functions. This chapter provides a highly subjective guideline on how not to be lost in the "dark forest" of available tools for the analysis of intrinsic disorder. By no means it gives a unique pathway through this forest, but simply presents some of the tools the author uses in his everyday research.
Collapse
Affiliation(s)
- Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA.
- Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russian Federation.
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation.
| |
Collapse
|
173
|
Sharma R, Kumar S, Tsunoda T, Patil A, Sharma A. Predicting MoRFs in protein sequences using HMM profiles. BMC Bioinformatics 2016; 17:504. [PMID: 28155710 PMCID: PMC5259822 DOI: 10.1186/s12859-016-1375-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein. Identifying MoRFs in IDPs using computational methods is a challenging task. METHODS In this study, we introduce hidden Markov model (HMM) profiles to accurately identify the location of MoRFs in disordered protein sequences. Using windowing technique, HMM profiles are utilised to extract features from protein sequences and support vector machines (SVM) are used to calculate a propensity score for each residue. Two different SVM kernels with high noise tolerance are evaluated with a varying window size and the scores of the SVM models are combined to generate the final propensity score to predict MoRF residues. The SVM models are designed to extract maximal information between MoRF residues, its neighboring regions (Flanks) and the remainder of the sequence (Others). RESULTS To evaluate the proposed method, its performance was compared to that of other MoRF predictors; MoRFpred and ANCHOR. The results show that the proposed method outperforms these two predictors. CONCLUSIONS Using HMM profile as a source of feature extraction, the proposed method indicates improvement in predicting MoRFs in disordered protein sequences.
Collapse
Affiliation(s)
- Ronesh Sharma
- School of Electrical and Electronics Engineering, Fiji National University, Suva, Fiji.,School of Engineering and Physics, The University of the South Pacific, Suva, Fiji
| | - Shiu Kumar
- School of Electrical and Electronics Engineering, Fiji National University, Suva, Fiji.,School of Engineering and Physics, The University of the South Pacific, Suva, Fiji
| | - Tatsuhiko Tsunoda
- CREST, JST, Yokohama, 230-0045, Japan.,RIKEN Center for Integrative Medical Science, Yokohama, 230-0045, Japan.,Medical Research Institute, Tokyo Medical and Dental University, Tokyo, 113-8510, Japan
| | - Ashwini Patil
- Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
| | - Alok Sharma
- School of Engineering and Physics, The University of the South Pacific, Suva, Fiji. .,CREST, JST, Yokohama, 230-0045, Japan. .,RIKEN Center for Integrative Medical Science, Yokohama, 230-0045, Japan. .,Medical Research Institute, Tokyo Medical and Dental University, Tokyo, 113-8510, Japan.
| |
Collapse
|
174
|
Lieutaud P, Ferron F, Uversky AV, Kurgan L, Uversky VN, Longhi S. How disordered is my protein and what is its disorder for? A guide through the "dark side" of the protein universe. INTRINSICALLY DISORDERED PROTEINS 2016; 4:e1259708. [PMID: 28232901 DOI: 10.1080/21690707.2016.1259708] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Revised: 11/03/2016] [Accepted: 11/04/2016] [Indexed: 12/18/2022]
Abstract
In the last 2 decades it has become increasingly evident that a large number of proteins are either fully or partially disordered. Intrinsically disordered proteins lack a stable 3D structure, are ubiquitous and fulfill essential biological functions. Their conformational heterogeneity is encoded in their amino acid sequences, thereby allowing intrinsically disordered proteins or regions to be recognized based on properties of these sequences. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to structural determination with X-ray crystallization. This article discusses a comprehensive selection of databases and methods currently employed to disseminate experimental and putative annotations of disorder, predict disorder and identify regions involved in induced folding. It also provides a set of detailed instructions that should be followed to perform computational analysis of disorder.
Collapse
Affiliation(s)
- Philippe Lieutaud
- Aix-Marseille Université, AFMB UMR, Marseille, France; CNRS, AFMB UMR, Marseille, France
| | - François Ferron
- Aix-Marseille Université, AFMB UMR, Marseille, France; CNRS, AFMB UMR, Marseille, France
| | - Alexey V Uversky
- Center for Data Analytics and Biomedical Informatics, Department of Computer and Information Sciences, College of Science and Technology, Temple University , Philadelphia, PA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University , Richmond, VA, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA; Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia
| | - Sonia Longhi
- Aix-Marseille Université, AFMB UMR, Marseille, France; CNRS, AFMB UMR, Marseille, France
| |
Collapse
|
175
|
O'Shea C, Staby L, Bendsen SK, Tidemand FG, Redsted A, Willemoës M, Kragelund BB, Skriver K. Structures and Short Linear Motif of Disordered Transcription Factor Regions Provide Clues to the Interactome of the Cellular Hub Protein Radical-induced Cell Death1. J Biol Chem 2016; 292:512-527. [PMID: 27881680 DOI: 10.1074/jbc.m116.753426] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Revised: 11/23/2016] [Indexed: 11/06/2022] Open
Abstract
Intrinsically disordered protein regions (IDRs) lack a well defined three-dimensional structure but often facilitate key protein functions. Some interactions between IDRs and folded protein domains rely on short linear motifs (SLiMs). These motifs are challenging to identify, but once found they can point to larger networks of interactions, such as with proteins that serve as hubs for essential cellular functions. The stress-associated plant protein radical-induced cell death1 (RCD1) is one such hub, interacting with many transcription factors via their flexible IDRs. To identify the SLiM bound by RCD1, we analyzed the IDRs in three protein partners, DREB2A (dehydration-responsive element-binding protein 2A), ANAC013, and ANAC046, considering parameters such as disorder, context, charges, and pI. Using a combined bioinformatics and experimental approach, we have identified the bipartite RCD1-binding SLiM as (DE)X(1,2)(YF)X(1,4)(DE)L, with essential contributions from conserved aromatic, acidic, and leucine residues. Detailed thermodynamic analysis revealed both favorable and unfavorable contributions from the IDRs surrounding the SLiM to the interactions with RCD1, and the SLiM affinities ranged from low nanomolar to 50 times higher Kd values. Specifically, although the SLiM was surrounded by IDRs, individual intrinsic α-helix propensities varied as shown by CD spectroscopy. NMR spectroscopy further demonstrated that DREB2A underwent coupled folding and binding with α-helix formation upon interaction with RCD1, whereas peptides from ANAC013 and ANAC046 formed different structures or were fuzzy in the complexes. These findings allow us to present a model of the stress-associated RCD1-transcription factor interactome and to contribute to the emerging understanding of the interactions between folded hubs and their intrinsically disordered partners.
Collapse
Affiliation(s)
- Charlotte O'Shea
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Lasse Staby
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Sidsel Krogh Bendsen
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Frederik Grønbæk Tidemand
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Andreas Redsted
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Martin Willemoës
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Birthe B Kragelund
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| | - Karen Skriver
- From the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 5 Ole Maaloes Vej, Copenhagen DK-2200, Denmark
| |
Collapse
|
176
|
Charon J, Theil S, Nicaise V, Michon T. Protein intrinsic disorder within the Potyvirus genus: from proteome-wide analysis to functional annotation. MOLECULAR BIOSYSTEMS 2016; 12:634-52. [PMID: 26699268 DOI: 10.1039/c5mb00677e] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Within proteins, intrinsically disordered regions (IDRs) are devoid of stable secondary and tertiary structures under physiological conditions and rather exist as dynamic ensembles of inter-converting conformers. Although ubiquitous in all domains of life, the intrinsic disorder content is highly variable in viral genomes. Over the years, functional annotations of disordered regions at the scale of the whole proteome have been conducted for several animal viruses. But to date, similar studies applied to plant viruses are still missing. Based on disorder prediction tools combined with annotation programs and evolutionary studies, we analyzed the intrinsic disorder content in Potyvirus, using a 10-species dataset representative of this genus diversity. In this paper, we revealed that: (i) the Potyvirus proteome displays high disorder content, (ii) disorder is conserved during Potyvirus evolution, suggesting a functional advantage of IDRs, (iii) IDRs evolve faster than ordered regions, and (iv) IDRs may be associated with major biological functions required for the Potyvirus cycle. Notably, the proteins P1, Coat protein (CP) and Viral genome-linked protein (VPg) display a high content of conserved disorder, enriched in specific motifs mimicking eukaryotic functional modules and suggesting strategies of host machinery hijacking. In these three proteins, IDRs are particularly conserved despite their high amino acid polymorphism, indicating a link to adaptive processes. Through this comprehensive study, we further investigate the biological relevance of intrinsic disorder in Potyvirus biology and we propose a functional annotation of potyviral proteome IDRs.
Collapse
Affiliation(s)
- Justine Charon
- UMR Biologie du Fruit et Pathologie, INRA, Villenave d'Ornon cedex, France. and UMR Biologie du Fruit et Pathologie, Université de Bordeaux, Villenave d'Ornon cedex, France
| | - Sébastien Theil
- UMR Biologie du Fruit et Pathologie, INRA, Villenave d'Ornon cedex, France. and UMR Biologie du Fruit et Pathologie, Université de Bordeaux, Villenave d'Ornon cedex, France
| | - Valérie Nicaise
- UMR Biologie du Fruit et Pathologie, INRA, Villenave d'Ornon cedex, France. and UMR Biologie du Fruit et Pathologie, Université de Bordeaux, Villenave d'Ornon cedex, France
| | - Thierry Michon
- UMR Biologie du Fruit et Pathologie, INRA, Villenave d'Ornon cedex, France. and UMR Biologie du Fruit et Pathologie, Université de Bordeaux, Villenave d'Ornon cedex, France
| |
Collapse
|
177
|
Ulrich AKC, Seeger M, Schütze T, Bartlick N, Wahl MC. Scaffolding in the Spliceosome via Single α Helices. Structure 2016; 24:1972-1983. [PMID: 27773687 DOI: 10.1016/j.str.2016.09.007] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Revised: 09/08/2016] [Accepted: 09/28/2016] [Indexed: 12/16/2022]
Abstract
The spliceosomal B complex-specific protein Prp38 forms a complex with the intrinsically unstructured proteins MFAP1 and Snu23. Our binding and crystal structure analyses show that MFAP1 and Snu23 contact Prp38 via ER/K motif-stabilized single α helices, which have previously been recognized only as rigid connectors or force springs between protein domains. A variant of the Prp38-binding single α helix of MFAP1, in which ER/K motifs not involved in Prp38 binding were mutated, was less α-helical in isolation and showed a reduced Prp38 affinity, with opposing tendencies in interaction enthalpy and entropy. Our results indicate that the strengths of single α helix-based interactions can be tuned by the degree of helix stabilization in the unbound state. MFAP1, Snu23, and several other spliceosomal proteins contain multiple regions that likely form single α helices via which they might tether several binding partners and act as intermittent scaffolds that facilitate remodeling steps during assembly of an active spliceosome.
Collapse
Affiliation(s)
- Alexander K C Ulrich
- Laboratory of Structural Biochemistry, Freie Universität Berlin, Takustraße 6, 14195 Berlin, Germany
| | - Martin Seeger
- Laboratory of Structural Biochemistry, Freie Universität Berlin, Takustraße 6, 14195 Berlin, Germany
| | - Tonio Schütze
- Laboratory of Structural Biochemistry, Freie Universität Berlin, Takustraße 6, 14195 Berlin, Germany
| | - Natascha Bartlick
- Laboratory of Structural Biochemistry, Freie Universität Berlin, Takustraße 6, 14195 Berlin, Germany
| | - Markus C Wahl
- Laboratory of Structural Biochemistry, Freie Universität Berlin, Takustraße 6, 14195 Berlin, Germany; Helmholtz-Zentrum Berlin für Materialien und Energie, Macromolecular Crystallography, Albert-Einstein-Straße 15, 12489 Berlin, Germany.
| |
Collapse
|
178
|
Kathiriya JJ, Pathak RR, Bezginov A, Xue B, Uversky VN, Tillier ERM, Davé V. Structural pliability adjacent to the kinase domain highlights contribution of FAK1 IDRs to cytoskeletal remodeling. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2016; 1865:43-54. [PMID: 27718363 DOI: 10.1016/j.bbapap.2016.10.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Revised: 09/27/2016] [Accepted: 10/03/2016] [Indexed: 12/24/2022]
Abstract
Therapeutic protein kinase inhibitors are designed on the basis of kinase structures. Here, we define intrinsically disordered regions (IDRs) in structurally hybrid kinases. We reveal that 65% of kinases have an IDR adjacent to their kinase domain (KD). These IDRs are evolutionarily more conserved than IDRs distant to KDs. Strikingly, 36 kinases have adjacent IDRs extending into their KDs, defining a unique structural and functional subset of the kinome. Functional network analysis of this subset of the kinome uncovered FAK1 as topologically the most connected hub kinase. We identify that KD-flanking IDR of FAK1 is more conserved and undergoes more post-translational modifications than other IDRs. It preferentially interacts with proteins regulating scaffolding and kinase activity, which contribute to cytoskeletal remodeling. In summary, spatially and evolutionarily conserved IDRs in kinases may influence their functions, which can be exploited for targeted therapies in diseases including those that involve aberrant cytoskeletal remodeling.
Collapse
Affiliation(s)
- Jaymin J Kathiriya
- Morsani College of Medicine, Department of Pathology and Cell Biology, University of South Florida, Tampa, FL 33612, United States
| | - Ravi Ramesh Pathak
- Morsani College of Medicine, Department of Pathology and Cell Biology, University of South Florida, Tampa, FL 33612, United States
| | - Alexandr Bezginov
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Bin Xue
- Department of Cell Biology, Microbiology and Molecular Biology, University of South Florida, Tampa, FL 33620, United States
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, United States; USF Health Byrd Alzheimer's Research Institute, University of South Florida, Tampa, FL 33612, United States; Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
| | | | - Vrushank Davé
- Morsani College of Medicine, Department of Pathology and Cell Biology, University of South Florida, Tampa, FL 33612, United States; Department of Cancer Biology and Evolution, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States.
| |
Collapse
|
179
|
Tovo-Rodrigues L, Recamonde-Mendoza M, Paixão-Côrtes VR, Bruxel EM, Schuch JB, Friedrich DC, Rohde LA, Hutz MH. The role of protein intrinsic disorder in major psychiatric disorders. Am J Med Genet B Neuropsychiatr Genet 2016; 171:848-60. [PMID: 27184105 DOI: 10.1002/ajmg.b.32455] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 04/22/2016] [Indexed: 01/26/2023]
Abstract
Although new candidate genes for Autism Spectrum Disorder (ASD), Schizophrenia (SCZ), Attention-Deficit/Hyperactivity Disorder (ADHD), and Bipolar Disorder (BD) emerged from genome-wide association studies (GWAS), their underlying molecular mechanisms remain poorly understood. Evidences of the involvement of intrinsically disordered proteins in diseases have grown in the last decade. These proteins lack tridimensional structure under physiological conditions and are involved in important cellular functions such as signaling, recognition and regulation. The aim of the present study was to identify the role and abundance of intrinsically disordered proteins in a set of psychiatric diseases and to test whether diseases are different regarding protein intrinsic disorder. Our hypothesis is that differences across psychiatric illnesses phenotypes and symptoms may arise from differences in intrinsic protein disorder content and properties of each group. A bioinformatics prediction of intrinsic disorder was performed in proteins retrieved based on top findings from GWAS, Copy Number Variation and candidate gene investigations for each disease. This approach revealed that about 80% of studied proteins presented long stretches of disorder. This amount was significantly higher than that observed in general eukaryotic proteins, and those involved in cardiovascular diseases. These results suggest that proteins with intrinsic disorder are a common feature of neurodevelopment and synaptic transmission processes which are potentially involved in the etiology of psychiatric diseases. Moreover, we identified differences between ADHD and ASD when the binary prediction of structure and putative binding sites were compared. These differences may be related to variation in symptom complexity between both diseases. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Luciana Tovo-Rodrigues
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil.,Postgraduate Program in Epidemiology, Federal University of Pelotas, Pelotas, RS, Brazil
| | - Mariana Recamonde-Mendoza
- Experimental and Molecular Cardiovascular Laboratory, Hospital de Clinicas de Porto Alegre, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | | | - Estela M Bruxel
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Jaqueline B Schuch
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Deise C Friedrich
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Luis A Rohde
- Child and Adolescent Psychiatric Division, Hospital de Clinicas de Porto Alegre, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil.,National Institute of Developmental Psychiatry for Children and Adolescents (INCT-CNPq), Brazil
| | - Mara H Hutz
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| |
Collapse
|
180
|
Zhao B, Xue B. Self-regulation of functional pathways by motifs inside the disordered tails of beta-catenin. BMC Genomics 2016; 17 Suppl 5:484. [PMID: 27585692 PMCID: PMC5009561 DOI: 10.1186/s12864-016-2825-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Beta-catenin has two major functions: coordinating cell-cell adhesion by interacting with cadherin in cadherin junction formation pathway; and regulating gene expression through Wnt signaling pathway. Accomplishing these two functions requires synergistic action of various sequential regions of the same beta-Catenin molecule, including the N-terminal tail, the middle armadillo domain, and the C-terminal tail. Although the middle armadillo domain is the major functional unit of beta-Catenin, the involvement of tails in the regulation of interaction between beta-Catenin and its partners has been well observed. Nonetheless, the regulatory processes of both tails are still elusive. In addition, it is interesting to note that the three sequential regions have different structural features: The middle armadillo domain is structured, but both N- and C-terminal tails are disordered. This observation leads to another important question on the functions and mechanisms of disordered tails, which is also largely unknown. RESULTS In this study, we focused on the characterization of sequential, structural, and functional features of the disordered tails of beta-Catenin. We identified multiple functional motifs and conserved sequence motifs in the disordered tails, discovered the correlation between cancer-associated mutations and functional motifs, explored the abundance of protein intrinsic disorder in the interactomes of beta-Catenin, and elaborated a working model on the regulatory roles of disordered tails in the functional pathways of beta-Catenin. CONCLUSION Disordered tails of beta-Catenin contain multiple functional motifs. These motifs interact with each other and the armadillo domain of beta-catenin to regulate the function of beta-Catenin in both cadherin junction formation pathway and Wnt signaling pathway.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Cell Biology, Microbiology and Molecular Biology, School of Natural Sciences and Mathematics, College of Arts and Sciences, University of South Florida, 4202 E. Fowler Ave, ISA 2015, Tampa, 33620 FL USA
| | - Bin Xue
- Department of Cell Biology, Microbiology and Molecular Biology, School of Natural Sciences and Mathematics, College of Arts and Sciences, University of South Florida, 4202 E. Fowler Ave, ISA 2015, Tampa, 33620 FL USA
| |
Collapse
|
181
|
Kowalski A, Pałyga J. Modulation of chromatin function through linker histone H1 variants. Biol Cell 2016; 108:339-356. [PMID: 27412812 DOI: 10.1111/boc.201600007] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2016] [Revised: 07/08/2016] [Accepted: 07/08/2016] [Indexed: 12/30/2022]
Abstract
In this review, the structural aspects of linker H1 histones are presented as a background for characterization of the factors influencing their function in animal and human chromatin. The action of H1 histone variants is largely determined by dynamic alterations of their intrinsically disordered tail domains, posttranslational modifications and allelic diversification. The interdependent effects of these factors can establish dynamic histone H1 states that may affect the organization and function of chromatin regions.
Collapse
Affiliation(s)
- Andrzej Kowalski
- Department of Biochemistry and Genetics, Institute of Biology, Jan Kochanowski University, 25-406 Kielce, Poland
| | - Jan Pałyga
- Department of Biochemistry and Genetics, Institute of Biology, Jan Kochanowski University, 25-406 Kielce, Poland
| |
Collapse
|
182
|
Kowalski A. Nuclear and nucleolar activity of linker histone variant H1.0. Cell Mol Biol Lett 2016; 21:15. [PMID: 28536618 PMCID: PMC5414669 DOI: 10.1186/s11658-016-0014-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 01/28/2016] [Indexed: 12/31/2022] Open
Abstract
Histone H1.0 belongs to the class of linker histones (H1), although it is substantially distinct from other histone H1 family members. The differences can be observed in the chromosomal location and organization of the histone H1.0 encoding gene, as well as in the length and composition of its amino acid chain. Whereas somatic (H1.1-H1.5) histone H1 variants are synthesized in the cell cycle S-phase, histone H1.0 is synthesized throughout the cell cycle. By replacing somatic H1 variants during cell maturation, histone H1.0 is gradually deposited in low dividing cells and achieves the highest level of expression in the terminally differentiated cells. Compared to other differentiation-specific H1 histone (H5) characteristic for unique tissue and organisms, the distribution of histone H1.0 remains non-specific. Classic investigations emphasize that histone H1.0 is engaged in the organization of nuclear chromatin accounting for formation and maintenance of its nucleosomal and higher-order structure, and thus influences gene expression. However, the recent data confirmed histone H1.0 peculiar localization in the nucleolus and unexpectedly revealed its potential for regulation of nucleolar, RNA-dependent, activity via interaction with other proteins. According to such findings, histone H1.0 participates in the formation of gene-coded information through its control at both transcriptional and translational levels. In order to reappraise the biological significance of histone H1.0, both aspects of its activity are presented in this review.
Collapse
Affiliation(s)
- Andrzej Kowalski
- Department of Biochemistry and Genetics, Institute of Biology, Jan Kochanowski University, Świętokrzyska 15, 25-406 Kielce, Poland
| |
Collapse
|
183
|
A simplified method for the purification of an intrinsically disordered coagulant protein from defatted Moringa oleifera seeds. Process Biochem 2016. [DOI: 10.1016/j.procbio.2016.04.021] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
184
|
Bürgi J, Xue B, Uversky VN, van der Goot FG. Intrinsic Disorder in Transmembrane Proteins: Roles in Signaling and Topology Prediction. PLoS One 2016; 11:e0158594. [PMID: 27391701 PMCID: PMC4938508 DOI: 10.1371/journal.pone.0158594] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Accepted: 06/18/2016] [Indexed: 12/31/2022] Open
Abstract
Intrinsically disordered regions (IDRs) are peculiar stretches of amino acids that lack stable conformations in solution. Intrinsic Disorder containing Proteins (IDP) are defined by the presence of at least one large IDR and have been linked to multiple cellular processes including cell signaling, DNA binding and cancer. Here we used computational analyses and publicly available databases to deepen insight into the prevalence and function of IDRs specifically in transmembrane proteins, which are somewhat neglected in most studies. We found that 50% of transmembrane proteins have at least one IDR of 30 amino acids or more. Interestingly, these domains preferentially localize to the cytoplasmic side especially of multi-pass transmembrane proteins, suggesting that disorder prediction could increase the confidence of topology prediction algorithms. This was supported by the successful prediction of the topology of the uncharacterized multi-pass transmembrane protein TMEM117, as confirmed experimentally. Pathway analysis indicated that IDPs are enriched in cell projection and axons and appear to play an important role in cell adhesion, signaling and ion binding. In addition, we found that IDP are enriched in phosphorylation sites, a crucial post translational modification in signal transduction, when compared to fully ordered proteins and to be implicated in more protein-protein interaction events. Accordingly, IDPs were highly enriched in short protein binding regions called Molecular Recognition Features (MoRFs). Altogether our analyses strongly support the notion that the transmembrane IDPs act as hubs in cellular signal events.
Collapse
Affiliation(s)
- Jérôme Bürgi
- Faculty of life science, Global Health Institute, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Bin Xue
- Department of Cell Biology, Microbiology, and Molecular Biology, School of Natural Sciences and Mathematics, College of Arts and Sciences, University of South Florida, Tampa, FL, 33620, United States of America
| | - Vladimir N. Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33647, United States of America
- Institute for Biological Instrumentation, Russian Academy of Sciences, Pushchino, Moscow Region, Russian Federation
- Department of Biology, Faculty of Science, King Abdulaziz University, P.O. Box 80203, Jeddah, 21589, Kingdom of Saudi Arabia
| | - F. Gisou van der Goot
- Faculty of life science, Global Health Institute, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
185
|
Micaroni M, Giacchetti G, Plebani R, Xiao GG, Federici L. ATP2C1 gene mutations in Hailey-Hailey disease and possible roles of SPCA1 isoforms in membrane trafficking. Cell Death Dis 2016; 7:e2259. [PMID: 27277681 PMCID: PMC5143377 DOI: 10.1038/cddis.2016.147] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Revised: 04/17/2016] [Accepted: 04/28/2016] [Indexed: 12/12/2022]
Abstract
ATP2C1 gene codes for the secretory pathway Ca(2+)/Mn(2+)-ATPase pump type 1 (SPCA1) localizing at the golgi apparatus. Mutations on the human ATP2C1 gene, causing decreased levels of the SPCA1 expression, have been identified as the cause of the Hailey-Hailey disease, a rare skin disorder. In the last few years, several mutations have been described, and here we summarize how they are distributed along the gene and how missense mutations affect protein expression. SPCA1 is expressed in four different isoforms through alternative splicing of the ATP2C1 gene and none of these isoforms is differentially affected by any of these mutations. However, a better understanding of the tissue specific expression of the isoforms, their localization along the secretory pathway, their specific binding partners and the role of the C-terminal tail making isoforms different from each other, will be future goals of the research in this field.
Collapse
Affiliation(s)
- M Micaroni
- School of Pharmaceutical Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - G Giacchetti
- Aging Research Center (Ce.S.I.), University 'G. D'Annunzio' of Chieti-Pescara, Chieti 66100, Italy.,Department of Neuroscience, Imaging and Clinical Sciences, University 'G. D'Annunzio' of Chieti-Pescara, Chieti 66100, Italy
| | - R Plebani
- Aging Research Center (Ce.S.I.), University 'G. D'Annunzio' of Chieti-Pescara, Chieti 66100, Italy.,Department of Medical Oral and Biotechnological Sciences, School of Medicine and Health Sciences, University 'G. D'Annunzio' of Chieti-Pescara, Chieti 66100, Italy
| | - G G Xiao
- School of Pharmaceutical Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - L Federici
- Aging Research Center (Ce.S.I.), University 'G. D'Annunzio' of Chieti-Pescara, Chieti 66100, Italy.,Department of Medical Oral and Biotechnological Sciences, School of Medicine and Health Sciences, University 'G. D'Annunzio' of Chieti-Pescara, Chieti 66100, Italy
| |
Collapse
|
186
|
Malhis N, Jacobson M, Gsponer J. MoRFchibi SYSTEM: software tools for the identification of MoRFs in protein sequences. Nucleic Acids Res 2016; 44:W488-93. [PMID: 27174932 PMCID: PMC4987941 DOI: 10.1093/nar/gkw409] [Citation(s) in RCA: 103] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 05/03/2016] [Indexed: 11/13/2022] Open
Abstract
Molecular recognition features, MoRFs, are short segments within longer disordered protein regions that bind to globular protein domains in a process known as disorder-to-order transition. MoRFs have been found to play a significant role in signaling and regulatory processes in cells. High-confidence computational identification of MoRFs remains an important challenge. In this work, we introduce MoRFchibi SYSTEM that contains three MoRF predictors: MoRFCHiBi, a basic predictor best suited as a component in other applications, MoRFCHiBi_ Light, ideal for high-throughput predictions and MoRFCHiBi_ Web, slower than the other two but best for high accuracy predictions. Results show that MoRFchibi SYSTEM provides more than double the precision of other predictors. MoRFchibi SYSTEM is available in three different forms: as HTML web server, RESTful web server and downloadable software at: http://www.chibi.ubc.ca/faculty/joerg-gsponer/gsponer-lab/software/morf_chibi/.
Collapse
Affiliation(s)
- Nawar Malhis
- Michael Smith Laboratories-Centre for High-Throughput Biology, The University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Matthew Jacobson
- Michael Smith Laboratories-Centre for High-Throughput Biology, The University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Jörg Gsponer
- Michael Smith Laboratories-Centre for High-Throughput Biology, The University of British Columbia, Vancouver, BC V6T 1Z4, Canada Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| |
Collapse
|
187
|
Urbanowicz A, Lewandowski D, Szpotkowski K, Figlerowicz M. Tick receptor for outer surface protein A from Ixodes ricinus - the first intrinsically disordered protein involved in vector-microbe recognition. Sci Rep 2016; 6:25205. [PMID: 27112540 PMCID: PMC4844993 DOI: 10.1038/srep25205] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Accepted: 04/12/2016] [Indexed: 01/02/2023] Open
Abstract
The tick receptor for outer surface protein A (TROSPA) is the only identified factor involved in tick gut colonization by various Borrelia species. TROSPA is localized in the gut epithelium and can recognize and bind the outer surface bacterial protein OspA via an unknown mechanism. Based on earlier reports and our latest observations, we considered that TROSPA would be the first identified intrinsically disordered protein (IDP) involved in the interaction between a vector and a pathogenic microbe. To verify this hypothesis, we performed structural studies of a TROSPA mutant from Ixodes ricinus using both computational and experimental approaches. Irrespective of the method used, we observed that the secondary structure content of the TROSPA polypeptide chain is low. In addition, the collected SAXS data indicated that this protein is highly extended and exists in solution as a set of numerous conformers. These features are all commonly considered hallmarks of IDPs. Taking advantage of our SAXS data, we created structural models of TROSPA and proposed a putative mechanism for the TROSPA-OspA interaction. The disordered nature of TROSPA may explain the ability of a wide spectrum of Borrelia species to colonize the tick gut.
Collapse
Affiliation(s)
- Anna Urbanowicz
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Dominik Lewandowski
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Kamil Szpotkowski
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Marek Figlerowicz
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland.,Institute of Computing Science, University of Technology, Poznan, 60-965, Poland
| |
Collapse
|
188
|
Yacoub HA, Al-Maghrabi OA, Ahmed ES, Uversky VN. Abundance and functional roles of intrinsic disorder in the antimicrobial peptides of the NK-lysin family. J Biomol Struct Dyn 2016; 35:836-856. [DOI: 10.1080/07391102.2016.1164077] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
- Haitham A. Yacoub
- Faculty of Science, Department of Biological Sciences, University of Jeddah, Jeddah, Saudi Arabia
- Department of Cell Biology, Genetic Engineering and Biotechnology Division, National Research Centre, P.O. Box 12622, Gizza, Egypt
| | - Omar A. Al-Maghrabi
- Faculty of Science, Department of Biological Sciences, University of Jeddah, Jeddah, Saudi Arabia
| | - Ekram S. Ahmed
- Department of Cell Biology, Genetic Engineering and Biotechnology Division, National Research Centre, P.O. Box 12622, Gizza, Egypt
| | - Vladimir N. Uversky
- Faculty of Sciences, Department of Biological Sciences, King Abdulaziz University, P.O. Box 80203, Jeddah, Saudi Arabia
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia
- Department of Molecular Medicine and USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| |
Collapse
|
189
|
Schuch JB, Paixão-Côrtes VR, Friedrich DC, Tovo-Rodrigues L. The contribution of protein intrinsic disorder to understand the role of genetic variants uncovered by autism spectrum disorders exome studies. Am J Med Genet B Neuropsychiatr Genet 2016; 171B:479-91. [PMID: 26892727 DOI: 10.1002/ajmg.b.32431] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Accepted: 02/04/2016] [Indexed: 01/09/2023]
Abstract
Several autism spectrum disorders (ASD) exome studies suggest that coding single nucleotide variants (SNVs) play an important role on ASD etiology. Usually, the pathogenic effect of missense mutations is estimated through predictors that lose accuracy for those SNVs placed in intrinsically disordered regions of protein. Here, we used bioinformatics tools to investigate the effect of mutations described in ASD published exome studies (549 mutations) in protein disorder, considering post-translational modification, PEST and Molecular Recognition Features (MoRFs) motifs. Schizophrenia and type 2 diabetes (T2D) datasets were created for comparison purposes. The frequency of mutations predicted as disordered was comparable among the three datasets (38.1% in ASD, 35.7% in schizophrenia, 46.4% in T2D). However, the frequency of SNVs predicted to lead a gain or loss of functional sites or change intrinsic disorder tendencies was higher in ASD and schizophrenia than T2D (46.9%, 36.4%, and 23.1%, respectively). The results obtained by SIFT and PolyPhen-2 indicated that 38.9% and 34.4% of the mutations predicted, respectively, as tolerated and benign showed functional alterations in disorder properties. Given the frequency of mutations placed in IDRs and their functional impact, this study suggests that alterations in intrinsic disorder properties might play a role in ASD and schizophrenia etiologies. They should be taken into consideration when researching the pathogenicity of mutations in neurodevelopmental and psychiatric diseases. Finally, mutations with functional alterations in disorder properties must be potential targets for in vitro and in vivo functional studies.
Collapse
Affiliation(s)
- Jaqueline Bohrer Schuch
- Department of Genetics, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | | | - Deise C Friedrich
- Department of Cellular and Molecular Biology, Pontifical Catholic University of Rio Grande do Sul (PUCRS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Luciana Tovo-Rodrigues
- Postgraduate Program in Epidemiology, Federal University of Pelotas (UFPel), Pelotas, Rio Grande do Sul, Brazil
| |
Collapse
|
190
|
Jandrlić DR, Lazić GM, Mitić NS, Pavlović MD. Software tools for simultaneous data visualization and T cell epitopes and disorder prediction in proteins. J Biomed Inform 2016; 60:120-31. [PMID: 26851400 DOI: 10.1016/j.jbi.2016.01.016] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2015] [Revised: 01/15/2016] [Accepted: 01/28/2016] [Indexed: 11/16/2022]
Abstract
We have developed EpDis and MassPred, extendable open source software tools that support bioinformatic research and enable parallel use of different methods for the prediction of T cell epitopes, disorder and disordered binding regions and hydropathy calculation. These tools offer a semi-automated installation of chosen sets of external predictors and an interface allowing for easy application of the prediction methods, which can be applied either to individual proteins or to datasets of a large number of proteins. In addition to access to prediction methods, the tools also provide visualization of the obtained results, calculation of consensus from results of different methods, as well as import of experimental data and their comparison with results obtained with different predictors. The tools also offer a graphical user interface and the possibility to store data and the results obtained using all of the integrated methods in the relational database or flat file for further analysis. The MassPred part enables a massive parallel application of all integrated predictors to the set of proteins. Both tools can be downloaded from http://bioinfo.matf.bg.ac.rs/home/downloads.wafl?cat=Software. Appendix A includes the technical description of the created tools and a list of supported predictors.
Collapse
Affiliation(s)
- Davorka R Jandrlić
- University of Belgrade, Faculty of Mechanical Engineering, Kraljice Marije 16, Belgrade, Serbia.
| | - Goran M Lazić
- University of Belgrade, Faculty of Mathematics, P.O.B. 550, Studentski trg 16/IV, Belgrade, Serbia.
| | - Nenad S Mitić
- University of Belgrade, Faculty of Mathematics, P.O.B. 550, Studentski trg 16/IV, Belgrade, Serbia.
| | - Mirjana D Pavlović
- University of Belgrade, Institute of General and Physical Chemistry, Studentski trg 12/V, Belgrade, Serbia.
| |
Collapse
|
191
|
Banerjee S, De RK. Structural disorder: a tool for housekeeping proteins performing tissue-specific interactions. J Biomol Struct Dyn 2016; 34:1930-45. [DOI: 10.1080/07391102.2015.1095115] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Sanghita Banerjee
- Machine Intelligence Unit, Indian Statistical Institute, 203 Barrackpore Trunk Road, Kolkata 700108, India
| | - Rajat K. De
- Machine Intelligence Unit, Indian Statistical Institute, 203 Barrackpore Trunk Road, Kolkata 700108, India
| |
Collapse
|
192
|
Deller MC, Kong L, Rupp B. Protein stability: a crystallographer's perspective. ACTA CRYSTALLOGRAPHICA SECTION F-STRUCTURAL BIOLOGY COMMUNICATIONS 2016; 72:72-95. [PMID: 26841758 PMCID: PMC4741188 DOI: 10.1107/s2053230x15024619] [Citation(s) in RCA: 140] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Accepted: 12/21/2015] [Indexed: 12/18/2022]
Abstract
Protein stability is a topic of major interest for the biotechnology, pharmaceutical and food industries, in addition to being a daily consideration for academic researchers studying proteins. An understanding of protein stability is essential for optimizing the expression, purification, formulation, storage and structural studies of proteins. In this review, discussion will focus on factors affecting protein stability, on a somewhat practical level, particularly from the view of a protein crystallographer. The differences between protein conformational stability and protein compositional stability will be discussed, along with a brief introduction to key methods useful for analyzing protein stability. Finally, tactics for addressing protein-stability issues during protein expression, purification and crystallization will be discussed.
Collapse
Affiliation(s)
- Marc C Deller
- Stanford ChEM-H, Macromolecular Structure Knowledge Center, Stanford University, Shriram Center, 443 Via Ortega, Room 097, MC5082, Stanford, CA 94305-4125, USA
| | - Leopold Kong
- Laboratory of Cell and Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), National Institutes of Health (NIH), Building 8, Room 1A03, 8 Center Drive, Bethesda, MD 20814, USA
| | - Bernhard Rupp
- Department of Forensic Crystallography, k.-k. Hofkristallamt, 91 Audrey Place, Vista, CA 92084, USA
| |
Collapse
|
193
|
Polo A, Colonna G, Guariniello S, Ciliberto G, Costantini S. Deducing the functional characteristics of the human selenoprotein SELK from the structural properties of its intrinsically disordered C-terminal domain. MOLECULAR BIOSYSTEMS 2016; 12:758-72. [PMID: 26735936 DOI: 10.1039/c5mb00679a] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The intrinsically disordered proteins (IDPs) cannot be described by a single structural representation but, due to their high structural fluctuation, through conformational ensembles. Certainly, molecular dynamics (MD) simulations represent a useful tool to study their different conformations capturing the conformational distribution. Our group is focusing on the structural characterization of proteins belonging to the seleno-proteome due to their involvement in cancer. They present disordered domains central for their biological function, and, in particular, SELK is a single-pass transmembrane protein that resides in the endoplasmic reticulum membrane (ER) with a C-terminal domain exposed to the cytoplasm that is known to interact with different components of the endoplasmic reticulum associated to the protein degradation (ERAD) pathway. This protein is found to be up-expressed in hepatocellular carcinoma and in other cancers. In this work we performed a detailed analysis of the C-terminal domain sequence of SELK and discovered that it is characterized by many prolines, and four negatively and eleven positively charged residues, which are crucial for its biological activity. This region can be considered as a weak polyelectrolyte and, specifically, a polycation, with high disordered propensity and different phosphorylation sites dislocated along the sequence. Then, we modeled its three-dimensional structure by performing MD simulations in water at neutral pH to analyze the structural stability as well as to identify the presence of HUB residues that play a key structural role as evidenced by the residue-residue interaction network analysis. Through this approach, we demonstrate that the C-terminal domain of SELK (i) presents a poor content of regular secondary structure elements, (ii) is dynamically stabilized by a network of intra-molecular H-bonds and H-bonds with water molecules, (iii) is highly fluctuating and, therefore, can be described only through a conformational ensemble, where we evidenced a distribution of equilibrium conformers which continuously inter-change their conformations. Finally to verify the specific role played by the negative charges, we also performed MD simulations at acidic pH. Overall, all the obtained results evidenced that SELK has the dynamic structural features to be defined as a HUB protein able to interact with multiple members. Therefore, considering the possible role that this protein can have in cancer development and progression, it can represent a target for drug design studies.
Collapse
Affiliation(s)
- Andrea Polo
- Servizio di Informatica Medica, Azienda Ospedaliera Universitaria, Seconda Università di Napoli, Napoli, Italy
| | | | | | | | | |
Collapse
|
194
|
Abstract
In the last two decades, it has become increasingly evident that a large number of proteins are either fully or partially disordered. Intrinsically disordered proteins are ubiquitous proteins that fulfill essential biological functions while lacking a stable 3D structure. Their conformational heterogeneity is encoded at the amino acid sequence level, thereby allowing intrinsically disordered proteins or regions to be recognized based on their sequence properties. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to crystallization. This chapter focuses on the methods currently employed for predicting disorder and identifying regions involved in induced folding.
Collapse
Affiliation(s)
- Philippe Lieutaud
- AFMB UMR 7257, Aix-Marseille Université, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
- AFMB UMR 7257, CNRS, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
| | - François Ferron
- AFMB UMR 7257, Aix-Marseille Université, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
- AFMB UMR 7257, CNRS, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
| | - Sonia Longhi
- AFMB UMR 7257, Aix-Marseille Université, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France.
- AFMB UMR 7257, CNRS, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France.
| |
Collapse
|
195
|
Yan J, Dunker AK, Uversky VN, Kurgan L. Molecular recognition features (MoRFs) in three domains of life. MOLECULAR BIOSYSTEMS 2016; 12:697-710. [DOI: 10.1039/c5mb00640f] [Citation(s) in RCA: 103] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
MoRFs are widespread intrinsically disordered protein-binding regions that have similar abundance and amino acid composition across the three domains of life.
Collapse
Affiliation(s)
- Jing Yan
- Department of Electrical and Computer Engineering
- University of Alberta
- Edmonton
- Canada
| | - A. Keith Dunker
- Center for Computational Biology and Bioinformatics
- Indiana University School of Medicine
- Indianapolis
- USA
- Indiana University School of Informatics
| | - Vladimir N. Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute
- Morsani College of Medicine
- University of South Florida
- Tampa
- USA
| | - Lukasz Kurgan
- Department of Electrical and Computer Engineering
- University of Alberta
- Edmonton
- Canada
- Department of Computer Science
| |
Collapse
|
196
|
Nagulapalli M, Maji S, Dwivedi N, Dahiya P, Thakur JK. Evolution of disorder in Mediator complex and its functional relevance. Nucleic Acids Res 2015; 44:1591-612. [PMID: 26590257 PMCID: PMC4770211 DOI: 10.1093/nar/gkv1135] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 10/18/2015] [Indexed: 12/27/2022] Open
Abstract
Mediator, an important component of eukaryotic transcriptional machinery, is a huge multisubunit complex. Though the complex is known to be conserved across all the eukaryotic kingdoms, the evolutionary topology of its subunits has never been studied. In this study, we profiled disorder in the Mediator subunits of 146 eukaryotes belonging to three kingdoms viz., metazoans, plants and fungi, and attempted to find correlation between the evolution of Mediator complex and its disorder. Our analysis suggests that disorder in Mediator complex have played a crucial role in the evolutionary diversification of complexity of eukaryotic organisms. Conserved intrinsic disordered regions (IDRs) were identified in only six subunits in the three kingdoms whereas unique patterns of IDRs were identified in other Mediator subunits. Acquisition of novel molecular recognition features (MoRFs) through evolution of new subunits or through elongation of the existing subunits was evident in metazoans and plants. A new concept of ‘junction-MoRF’ has been introduced. Evolutionary link between CBP and Med15 has been provided which explain the evolution of extended-IDR in CBP from Med15 KIX-IDR junction-MoRF suggesting role of junction-MoRF in evolution and modulation of protein–protein interaction repertoire. This study can be informative and helpful in understanding the conserved and flexible nature of Mediator complex across eukaryotic kingdoms.
Collapse
Affiliation(s)
- Malini Nagulapalli
- Plant Mediator Lab, National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Sourobh Maji
- Plant Mediator Lab, National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Nidhi Dwivedi
- Plant Mediator Lab, National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Pradeep Dahiya
- Plant Mediator Lab, National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Jitendra K Thakur
- Plant Mediator Lab, National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| |
Collapse
|
197
|
Malhis N, Wong ETC, Nassar R, Gsponer J. Computational Identification of MoRFs in Protein Sequences Using Hierarchical Application of Bayes Rule. PLoS One 2015; 10:e0141603. [PMID: 26517836 PMCID: PMC4627796 DOI: 10.1371/journal.pone.0141603] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 10/09/2015] [Indexed: 01/24/2023] Open
Abstract
Motivation Intrinsically disordered regions of proteins play an essential role in the regulation of various biological processes. Key to their regulatory function is often the binding to globular protein domains via sequence elements known as molecular recognition features (MoRFs). Development of computational tools for the identification of candidate MoRF locations in amino acid sequences is an important task and an area of growing interest. Given the relative sparseness of MoRFs in protein sequences, the accuracy of the available MoRF predictors is often inadequate for practical usage, which leaves a significant need and room for improvement. In this work, we introduce MoRFCHiBi_Web, which predicts MoRF locations in protein sequences with higher accuracy compared to current MoRF predictors. Methods Three distinct and largely independent property scores are computed with component predictors and then combined to generate the final MoRF propensity scores. The first score reflects the likelihood of sequence windows to harbour MoRFs and is based on amino acid composition and sequence similarity information. It is generated by MoRFCHiBi using small windows of up to 40 residues in size. The second score identifies long stretches of protein disorder and is generated by ESpritz with the DisProt option. Lastly, the third score reflects residue conservation and is assembled from PSSM files generated by PSI-BLAST. These propensity scores are processed and then hierarchically combined using Bayes rule to generate the final MoRFCHiBi_Web predictions. Results MoRFCHiBi_Web was tested on three datasets. Results show that MoRFCHiBi_Web outperforms previously developed predictors by generating less than half the false positive rate for the same true positive rate at practical threshold values. This level of accuracy paired with its relatively high processing speed makes MoRFCHiBi_Web a practical tool for MoRF prediction. Availability http://morf.chibi.ubc.ca:8080/morf/.
Collapse
Affiliation(s)
- Nawar Malhis
- Centre for High-Throughput Biology, University of British Columbia, Vancouver, BC, Canada
- * E-mail: (NM); (JG)
| | - Eric T. C. Wong
- Centre for High-Throughput Biology, University of British Columbia, Vancouver, BC, Canada
- Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, BC, Canada
| | - Roy Nassar
- Centre for High-Throughput Biology, University of British Columbia, Vancouver, BC, Canada
| | - Jörg Gsponer
- Centre for High-Throughput Biology, University of British Columbia, Vancouver, BC, Canada
- Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, BC, Canada
- * E-mail: (NM); (JG)
| |
Collapse
|
198
|
Peng Z, Kurgan L. High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder. Nucleic Acids Res 2015; 43:e121. [PMID: 26109352 PMCID: PMC4605291 DOI: 10.1093/nar/gkv585] [Citation(s) in RCA: 117] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Revised: 04/24/2015] [Accepted: 05/24/2015] [Indexed: 01/05/2023] Open
Abstract
Intrinsically disordered proteins and regions (IDPs and IDRs) lack stable 3D structure under physiological conditions in-vitro, are common in eukaryotes, and facilitate interactions with RNA, DNA and proteins. Current methods for prediction of IDPs and IDRs do not provide insights into their functions, except for a handful of methods that address predictions of protein-binding regions. We report first-of-its-kind computational method DisoRDPbind for high-throughput prediction of RNA, DNA and protein binding residues located in IDRs from protein sequences. DisoRDPbind is implemented using a runtime-efficient multi-layered design that utilizes information extracted from physiochemical properties of amino acids, sequence complexity, putative secondary structure and disorder and sequence alignment. Empirical tests demonstrate that it provides accurate predictions that are competitive with other predictors of disorder-mediated protein binding regions and complementary to the methods that predict RNA- and DNA-binding residues annotated based on crystal structures. Application in Homo sapiens, Mus musculus, Caenorhabditis elegans and Drosophila melanogaster proteomes reveals that RNA- and DNA-binding proteins predicted by DisoRDPbind complement and overlap with the corresponding known binding proteins collected from several sources. Also, the number of the putative protein-binding regions predicted with DisoRDPbind correlates with the promiscuity of proteins in the corresponding protein-protein interaction networks. Webserver: http://biomine.ece.ualberta.ca/DisoRDPbind/.
Collapse
Affiliation(s)
- Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, 300072, P.R. China Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Alberta, T6G 2V4, Canada
| | - Lukasz Kurgan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Alberta, T6G 2V4, Canada
| |
Collapse
|
199
|
Permyakov SE, Permyakov EA, Uversky VN. Intrinsically disordered caldesmon binds calmodulin via the "buttons on a string" mechanism. PeerJ 2015; 3:e1265. [PMID: 26417545 PMCID: PMC4582948 DOI: 10.7717/peerj.1265] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Accepted: 09/03/2015] [Indexed: 01/27/2023] Open
Abstract
We show here that chicken gizzard caldesmon (CaD) and its C-terminal domain (residues 636–771, CaD136) are intrinsically disordered proteins. The computational and experimental analyses of the wild type CaD136 and series of its single tryptophan mutants (W674A, W707A, and W737A) and a double tryptophan mutant (W674A/W707A) suggested that although the interaction of CaD136 with calmodulin (CaM) can be driven by the non-specific electrostatic attraction between these oppositely charged molecules, the specificity of CaD136-CaM binding is likely to be determined by the specific packing of important CaD136 tryptophan residues at the CaD136-CaM interface. It is suggested that this interaction can be described as the “buttons on a charged string” model, where the electrostatic attraction between the intrinsically disordered CaD136 and the CaM is solidified in a “snapping buttons” manner by specific packing of the CaD136 “pliable buttons” (which are the short segments of fluctuating local structure condensed around the tryptophan residues) at the CaD136-CaM interface. Our data also show that all three “buttons” are important for binding, since mutation of any of the tryptophans affects CaD136-CaM binding and since CaD136 remains CaM-buttoned even when two of the three tryptophans are mutated to alanines.
Collapse
Affiliation(s)
- Sergei E Permyakov
- Protein Research Group, Institute for Biological Instrumentation, Russian Academy of Sciences , Pushchino, Moscow Region , Russia
| | - Eugene A Permyakov
- Protein Research Group, Institute for Biological Instrumentation, Russian Academy of Sciences , Pushchino, Moscow Region , Russia
| | - Vladimir N Uversky
- Protein Research Group, Institute for Biological Instrumentation, Russian Academy of Sciences , Pushchino, Moscow Region , Russia ; Department of Molecular Medicine, University of South Florida , Tampa, FL , USA
| |
Collapse
|
200
|
Kowalski A. Abundance of intrinsic structural disorder in the histone H1 subtypes. Comput Biol Chem 2015; 59 Pt A:16-27. [PMID: 26366527 DOI: 10.1016/j.compbiolchem.2015.08.011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Revised: 08/03/2015] [Accepted: 08/30/2015] [Indexed: 01/06/2023]
Abstract
The intrinsically disordered proteins consist of partially structured regions linked to the unstructured stretches, which consequently form the transient and dynamic conformational ensembles. They undergo disorder to order transition upon binding their partners. Intrinsic disorder is attributed to histones H1, perceived as assemblers of chromatin structure and the regulators of DNA and proteins activity. In this work, the comparison of intrinsic disorder abundance in the histone H1 subtypes was performed both by the analysis of their amino acid composition and by the prediction of disordered stretches, as well as by identifying molecular recognition features (MoRFs) and ANCHOR protein binding regions (APBR) that are responsible for recognition and binding. Both human and model organisms-animals, plants, fungi and protists-have H1 histone subtypes with the properties typical of disordered state. They possess a significantly higher content of hydrophilic and charged amino acid residues, arranged in the long regions, covering over half of the whole amino acid residues in chain. Almost complete disorder corresponds to histone H1 terminal domains, including MoRFs and ANCHOR. Those motifs were also identified in a more ordered histone H1 globular domain. Compared to the control (globular and fibrous) proteins, H1 histones demonstrate the increased folding rate and a higher proportion of low-complexity segments. The results of this work indicate that intrinsic disorder is an inherent structural property of histone H1 subtypes and it is essential for establishing a protein conformation which defines functional outcomes affecting on DNA- and/or partner protein-dependent cell processes.
Collapse
Affiliation(s)
- Andrzej Kowalski
- Department of Biochemistry and Genetics, Institute of Biology, Jan Kochanowski University, ul. Świętokrzyska 15, 25-406 Kielce, Poland.
| |
Collapse
|