1
|
Gupta MN, Uversky VN. Protein structure-function continuum model: Emerging nexuses between specificity, evolution, and structure. Protein Sci 2024; 33:e4968. [PMID: 38532700 DOI: 10.1002/pro.4968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 02/18/2024] [Accepted: 03/05/2024] [Indexed: 03/28/2024]
Abstract
The rationale for replacing the old binary of structure-function with the trinity of structure, disorder, and function has gained considerable ground in recent years. A continuum model based on the expanded form of the existing paradigm can now subsume importance of both conformational flexibility and intrinsic disorder in protein function. The disorder is actually critical for understanding the protein-protein interactions in many regulatory processes, formation of membrane-less organelles, and our revised notions of specificity as amply illustrated by moonlighting proteins. While its importance in formation of amyloids and function of prions is often discussed, the roles of intrinsic disorder in infectious diseases and protein function under extreme conditions are also becoming clear. This review is an attempt to discuss how our current understanding of protein function, specificity, and evolution fit better with the continuum model. This integration of structure and disorder under a single model may bring greater clarity in our continuing quest for understanding proteins and molecular mechanisms of their functionality.
Collapse
Affiliation(s)
- Munishwar Nath Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology, New Delhi, India
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, Florida, USA
| |
Collapse
|
2
|
Lebedenko OO, Salikov VA, Izmailov SA, Podkorytov IS, Skrynnikov NR. Using NMR diffusion data to validate MD models of disordered proteins: Test case of N-terminal tail of histone H4. Biophys J 2024; 123:80-100. [PMID: 37990496 PMCID: PMC10808029 DOI: 10.1016/j.bpj.2023.11.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 10/28/2023] [Accepted: 11/17/2023] [Indexed: 11/23/2023] Open
Abstract
MD simulations can provide uniquely detailed models of intrinsically disordered proteins (IDPs). However, these models need careful experimental validation. The coefficient of translational diffusion Dtr, measurable by pulsed field gradient NMR, offers a potentially useful piece of experimental information related to the compactness of the IDP's conformational ensemble. Here, we investigate, both experimentally and via the MD modeling, the translational diffusion of a 25-residue N-terminal fragment from histone H4 (N-H4). We found that the predicted values of Dtr, as obtained from mean-square displacement of the peptide in the MD simulations, are largely determined by the viscosity of the MD water (which has been reinvestigated as a part of our study). Beyond that, our analysis of the diffusion data indicates that MD simulations of N-H4 in the TIP4P-Ew water give rise to an overly compact conformational ensemble for this peptide. In contrast, TIP4P-D and OPC simulations produce the ensembles that are consistent with the experimental Dtr result. These observations are supported by the analyses of the 15N spin relaxation rates. We also tested a number of empirical methods to predict Dtr based on IDP's coordinates extracted from the MD snapshots. In particular, we show that the popular approach involving the program HYDROPRO can produce misleading results. This happens because HYDROPRO is not intended to predict the diffusion properties of highly flexible biopolymers such as IDPs. Likewise, recent empirical schemes that exploit the relationship between the small-angle x-ray scattering-informed conformational ensembles of IDPs and the respective experimental Dtr values also prove to be problematic. In this sense, the first-principle calculations of Dtr from the MD simulations, such as demonstrated in this work, should provide a useful benchmark for future efforts in this area.
Collapse
Affiliation(s)
- Olga O Lebedenko
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg, Russia
| | - Vladislav A Salikov
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg, Russia
| | - Sergei A Izmailov
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg, Russia
| | - Ivan S Podkorytov
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg, Russia
| | - Nikolai R Skrynnikov
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg, Russia; Department of Chemistry, Purdue University, West Lafayette, Indiana.
| |
Collapse
|
3
|
Catalanotto C, Barbato C, Cogoni C, Benelli D. The RNA-Binding Function of Ribosomal Proteins and Ribosome Biogenesis Factors in Human Health and Disease. Biomedicines 2023; 11:2969. [PMID: 38001969 PMCID: PMC10669870 DOI: 10.3390/biomedicines11112969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 10/27/2023] [Accepted: 10/30/2023] [Indexed: 11/26/2023] Open
Abstract
The ribosome is a macromolecular complex composed of RNA and proteins that interact through an integrated and interconnected network to preserve its ancient core activities. In this review, we emphasize the pivotal role played by RNA-binding proteins as a driving force in the evolution of the current form of the ribosome, underscoring their importance in ensuring accurate protein synthesis. This category of proteins includes both ribosomal proteins and ribosome biogenesis factors. Impairment of their RNA-binding activity can also lead to ribosomopathies, which is a group of disorders characterized by defects in ribosome biogenesis that are detrimental to protein synthesis and cellular homeostasis. A comprehensive understanding of these intricate processes is essential for elucidating the mechanisms underlying the resulting diseases and advancing potential therapeutic interventions.
Collapse
Affiliation(s)
- Caterina Catalanotto
- Department of Molecular Medicine, Sapienza University of Rome, 00185 Rome, Italy; (C.C.); (C.C.)
| | - Christian Barbato
- National Research Council (CNR), Department of Sense Organs DOS, Institute of Biochemistry and Cell Biology (IBBC), Sapienza University of Rome, 00185 Rome, Italy;
| | - Carlo Cogoni
- Department of Molecular Medicine, Sapienza University of Rome, 00185 Rome, Italy; (C.C.); (C.C.)
| | - Dario Benelli
- Department of Molecular Medicine, Sapienza University of Rome, 00185 Rome, Italy; (C.C.); (C.C.)
| |
Collapse
|
4
|
Zhao B, Ghadermarzi S, Kurgan L. Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins. Comput Struct Biotechnol J 2023; 21:3248-3258. [PMID: 38213902 PMCID: PMC10782001 DOI: 10.1016/j.csbj.2023.06.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/01/2023] [Indexed: 01/13/2024] Open
Abstract
We expand studies of AlphaFold2 (AF2) in the context of intrinsic disorder prediction by comparing it against a broad selection of 20 accurate, popular and recently released disorder predictors. We use 25% larger benchmark dataset with 646 proteins and cover protein-level predictions of disorder content and fully disordered proteins. AF2-based disorder predictions secure a relatively high Area Under receiver operating characteristic Curve (AUC) of 0.77 and are statistically outperformed by several modern disorder predictors that secure AUCs around 0.8 with median runtime of about 20 s compared to 1200 s for AF2. Moreover, AF2 provides modestly accurate predictions of fully disordered proteins (F1 = 0.59 vs. 0.91 for the best disorder predictor) and disorder content (mean absolute error of 0.21 vs. 0.15). AF2 also generates statistically more accurate disorder predictions for about 20% of proteins that have relatively short sequences and a few disordered regions that tend to be located at the sequence termini, and which are absent of disordered protein-binding regions. Interestingly, AF2 and the most accurate disorder predictors rely on deep neural networks, suggesting that these models are useful for protein structure and disorder predictions.
Collapse
Affiliation(s)
- Bi Zhao
- Genomics program, College of Public Health, University of South Florida, Tampa, FL, United States
| | - Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
5
|
Basu S, Gsponer J, Kurgan L. DEPICTER2: a comprehensive webserver for intrinsic disorder and disorder function prediction. Nucleic Acids Res 2023:7151337. [PMID: 37140058 DOI: 10.1093/nar/gkad330] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 04/12/2023] [Accepted: 04/18/2023] [Indexed: 05/05/2023] Open
Abstract
Intrinsic disorder in proteins is relatively abundant in nature and essential for a broad spectrum of cellular functions. While disorder can be accurately predicted from protein sequences, as it was empirically demonstrated in recent community-organized assessments, it is rather challenging to collect and compile a comprehensive prediction that covers multiple disorder functions. To this end, we introduce the DEPICTER2 (DisorderEd PredictIon CenTER) webserver that offers convenient access to a curated collection of fast and accurate disorder and disorder function predictors. This server includes a state-of-the-art disorder predictor, flDPnn, and five modern methods that cover all currently predictable disorder functions: disordered linkers and protein, peptide, DNA, RNA and lipid binding. DEPICTER2 allows selection of any combination of the six methods, batch predictions of up to 25 proteins per request and provides interactive visualization of the resulting predictions. The webserver is freely available at http://biomine.cs.vcu.edu/servers/DEPICTER2/.
Collapse
Affiliation(s)
- Sushmita Basu
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Jörg Gsponer
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
6
|
Gupta MN, Uversky VN. Moonlighting enzymes: when cellular context defines specificity. Cell Mol Life Sci 2023; 80:130. [PMID: 37093283 PMCID: PMC11073002 DOI: 10.1007/s00018-023-04781-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/13/2023] [Accepted: 04/15/2023] [Indexed: 04/25/2023]
Abstract
It is not often realized that the absolute protein specificity is an exception rather than a rule. Two major kinds of protein multi-specificities are promiscuity and moonlighting. This review discusses the idea of enzyme specificity and then focusses on moonlighting. Some important examples of protein moonlighting, such as crystallins, ceruloplasmin, metallothioniens, macrophage migration inhibitory factor, and enzymes of carbohydrate metabolism are discussed. How protein plasticity and intrinsic disorder enable the removing the distinction between enzymes and other biologically active proteins are outlined. Finally, information on important roles of moonlighting in human diseases is updated.
Collapse
Affiliation(s)
- Munishwar Nath Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology, Hauz Khas, New Delhi, 110016, India
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd., MDC07, Tampa, FL, 33612-4799, USA.
| |
Collapse
|
7
|
Computational prediction of disordered binding regions. Comput Struct Biotechnol J 2023; 21:1487-1497. [PMID: 36851914 PMCID: PMC9957716 DOI: 10.1016/j.csbj.2023.02.018] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 02/08/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open
Abstract
One of the key features of intrinsically disordered regions (IDRs) is their ability to interact with a broad range of partner molecules. Multiple types of interacting IDRs were identified including molecular recognition fragments (MoRFs), short linear sequence motifs (SLiMs), and protein-, nucleic acids- and lipid-binding regions. Prediction of binding IDRs in protein sequences is gaining momentum in recent years. We survey 38 predictors of binding IDRs that target interactions with a diverse set of partners, such as peptides, proteins, RNA, DNA and lipids. We offer a historical perspective and highlight key events that fueled efforts to develop these methods. These tools rely on a diverse range of predictive architectures that include scoring functions, regular expressions, traditional and deep machine learning and meta-models. Recent efforts focus on the development of deep neural network-based architectures and extending coverage to RNA, DNA and lipid-binding IDRs. We analyze availability of these methods and show that providing implementations and webservers results in much higher rates of citations/use. We also make several recommendations to take advantage of modern deep network architectures, develop tools that bundle predictions of multiple and different types of binding IDRs, and work on algorithms that model structures of the resulting complexes.
Collapse
|
8
|
Intrinsically Disordered Proteins: An Overview. Int J Mol Sci 2022; 23:ijms232214050. [PMID: 36430530 PMCID: PMC9693201 DOI: 10.3390/ijms232214050] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Revised: 11/07/2022] [Accepted: 11/08/2022] [Indexed: 11/16/2022] Open
Abstract
Many proteins and protein segments cannot attain a single stable three-dimensional structure under physiological conditions; instead, they adopt multiple interconverting conformational states. Such intrinsically disordered proteins or protein segments are highly abundant across proteomes, and are involved in various effector functions. This review focuses on different aspects of disordered proteins and disordered protein regions, which form the basis of the so-called "Disorder-function paradigm" of proteins. Additionally, various experimental approaches and computational tools used for characterizing disordered regions in proteins are discussed. Finally, the role of disordered proteins in diseases and their utility as potential drug targets are explored.
Collapse
|
9
|
van Tartwijk FW, Kaminski CF. Protein Condensation, Cellular Organization, and Spatiotemporal Regulation of Cytoplasmic Properties. Adv Biol (Weinh) 2022; 6:e2101328. [PMID: 35796197 DOI: 10.1002/adbi.202101328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 05/15/2022] [Indexed: 01/28/2023]
Abstract
The cytoplasm is an aqueous, highly crowded solution of active macromolecules. Its properties influence the behavior of proteins, including their folding, motion, and interactions. In particular, proteins in the cytoplasm can interact to form phase-separated assemblies, so-called biomolecular condensates. The interplay between cytoplasmic properties and protein condensation is critical in a number of functional contexts and is the subject of this review. The authors first describe how cytoplasmic properties can affect protein behavior, in particular condensate formation, and then describe the functional implications of this interplay in three cellular contexts, which exemplify how protein self-organization can be adapted to support certain physiological phenotypes. The authors then describe the formation of RNA-protein condensates in highly polarized cells such as neurons, where condensates play a critical role in the regulation of local protein synthesis, and describe how different stressors trigger extensive reorganization of the cytoplasm, both through signaling pathways and through direct stress-induced changes in cytoplasmic properties. Finally, the authors describe changes in protein behavior and cytoplasmic properties that may occur in extremophiles, in particular organisms that have adapted to inhabit environments of extreme temperature, and discuss the implications and functional importance of these changes.
Collapse
Affiliation(s)
- Francesca W van Tartwijk
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
| | - Clemens F Kaminski
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
| |
Collapse
|
10
|
Handa T, Kundu D, Dubey VK. Perspectives on evolutionary and functional importance of intrinsically disordered proteins. Int J Biol Macromol 2022; 224:243-255. [DOI: 10.1016/j.ijbiomac.2022.10.120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 10/08/2022] [Accepted: 10/10/2022] [Indexed: 11/05/2022]
|
11
|
Piersimoni L, Abd El Malek M, Bhatia T, Bender J, Brankatschk C, Calvo Sánchez J, Dayhoff GW, Di Ianni A, Figueroa Parra JO, Garcia-Martinez D, Hesselbarth J, Köppen J, Lauth LM, Lippik L, Machner L, Sachan S, Schmidt L, Selle R, Skalidis I, Sorokin O, Ubbiali D, Voigt B, Wedler A, Wei AAJ, Zorn P, Dunker AK, Köhn M, Sinz A, Uversky VN. Lighting up Nobel Prize-winning studies with protein intrinsic disorder. Cell Mol Life Sci 2022; 79:449. [PMID: 35882686 PMCID: PMC11072364 DOI: 10.1007/s00018-022-04468-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/18/2022] [Accepted: 07/04/2022] [Indexed: 11/03/2022]
Abstract
Intrinsically disordered proteins and regions (IDPs and IDRs) and their importance in biology are becoming increasingly recognized in biology, biochemistry, molecular biology and chemistry textbooks, as well as in current protein science and structural biology curricula. We argue that the sequence → dynamic conformational ensemble → function principle is of equal importance as the classical sequence → structure → function paradigm. To highlight this point, we describe the IDPs and/or IDRs behind the discoveries associated with 17 Nobel Prizes, 11 in Physiology or Medicine and 6 in Chemistry. The Nobel Laureates themselves did not always mention that the proteins underlying the phenomena investigated in their award-winning studies are in fact IDPs or contain IDRs. In several cases, IDP- or IDR-based molecular functions have been elucidated, while in other instances, it is recognized that the respective protein(s) contain IDRs, but the specific IDR-based molecular functions have yet to be determined. To highlight the importance of IDPs and IDRs as general principle in biology, we present here illustrative examples of IDPs/IDRs in Nobel Prize-winning mechanisms and processes.
Collapse
Affiliation(s)
- Lolita Piersimoni
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Marina Abd El Malek
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Twinkle Bhatia
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Julian Bender
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Christin Brankatschk
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Jaime Calvo Sánchez
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Guy W Dayhoff
- Department of Chemistry, College of Art and Sciences, University of South Florida, Tampa, FL, 33620, USA
| | - Alessio Di Ianni
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | | | - Dailen Garcia-Martinez
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Julia Hesselbarth
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Janett Köppen
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Luca M Lauth
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Laurin Lippik
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Lisa Machner
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Shubhra Sachan
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Lisa Schmidt
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Robin Selle
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Ioannis Skalidis
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Oleksandr Sorokin
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Daniele Ubbiali
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Bruno Voigt
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Alice Wedler
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Alan An Jung Wei
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Peter Zorn
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Alan Keith Dunker
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Marcel Köhn
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany.
| | - Andrea Sinz
- Research Training Group RTG2467, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany.
| | - Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA.
| |
Collapse
|
12
|
Redwan EM, Aljadawi AA, Uversky VN. Hepatitis C Virus Infection and Intrinsic Disorder in the Signaling Pathways Induced by Toll-Like Receptors. BIOLOGY 2022; 11:1091. [PMID: 36101469 PMCID: PMC9312352 DOI: 10.3390/biology11071091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 07/07/2022] [Accepted: 07/19/2022] [Indexed: 11/23/2022]
Abstract
In this study, we examined the interplay between protein intrinsic disorder, hepatitis C virus (HCV) infection, and signaling pathways induced by Toll-like receptors (TLRs). To this end, 10 HCV proteins, 10 human TLRs, and 41 proteins from the TLR-induced downstream pathways were considered from the prevalence of intrinsic disorder. Mapping of the intrinsic disorder to the HCV-TLR interactome and to the TLR-based pathways of human innate immune response to the HCV infection demonstrates that substantial levels of intrinsic disorder are characteristic for proteins involved in the regulation and execution of these innate immunity pathways and in HCV-TLR interaction. Disordered regions, being commonly enriched in sites of various posttranslational modifications, may play important functional roles by promoting protein-protein interactions and support the binding of the analyzed proteins to other partners such as nucleic acids. It seems that this system represents an important illustration of the role of intrinsic disorder in virus-host warfare.
Collapse
Affiliation(s)
- Elrashdy M. Redwan
- Biological Science Department, Faculty of Science, King Abdulaziz University, P.O. Box 80203, Jeddah 21589, Saudi Arabia; (E.M.R.); (A.A.A.)
- Therapeutic and Protective Proteins Laboratory, Protein Research Department, Genetic Engineering and Biotechnology Research Institute, City for Scientific Research and Technology Applications, New Borg EL-Arab, Alexandria 21934, Egypt
| | - Abdullah A. Aljadawi
- Biological Science Department, Faculty of Science, King Abdulaziz University, P.O. Box 80203, Jeddah 21589, Saudi Arabia; (E.M.R.); (A.A.A.)
| | - Vladimir N. Uversky
- Biological Science Department, Faculty of Science, King Abdulaziz University, P.O. Box 80203, Jeddah 21589, Saudi Arabia; (E.M.R.); (A.A.A.)
- Department of Molecular Medicine and USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| |
Collapse
|
13
|
Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions. Biomolecules 2022; 12:biom12070888. [PMID: 35883444 PMCID: PMC9313023 DOI: 10.3390/biom12070888] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 06/10/2022] [Accepted: 06/10/2022] [Indexed: 11/17/2022] Open
Abstract
Intrinsically disordered regions (IDRs) carry out many cellular functions and vary in length and placement in protein sequences. This diversity leads to variations in the underlying compositional biases, which were demonstrated for the short vs. long IDRs. We analyze compositional biases across four classes of disorder: fully disordered proteins; short IDRs; long IDRs; and binding IDRs. We identify three distinct biases: for the fully disordered proteins, the short IDRs and the long and binding IDRs combined. We also investigate compositional bias for putative disorder produced by leading disorder predictors and find that it is similar to the bias of the native disorder. Interestingly, the accuracy of disorder predictions across different methods is correlated with the correctness of the compositional bias of their predictions highlighting the importance of the compositional bias. The predictive quality is relatively low for the disorder classes with compositional bias that is the most different from the “generic” disorder bias, while being much higher for the classes with the most similar bias. We discover that different predictors perform best across different classes of disorder. This suggests that no single predictor is universally best and motivates the development of new architectures that combine models that target specific disorder classes.
Collapse
|
14
|
Biró B, Zhao B, Kurgan L. Complementarity of the residue-level protein function and structure predictions in human proteins. Comput Struct Biotechnol J 2022; 20:2223-2234. [PMID: 35615015 PMCID: PMC9118482 DOI: 10.1016/j.csbj.2022.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 05/02/2022] [Accepted: 05/02/2022] [Indexed: 11/24/2022] Open
Abstract
Sequence-based predictors of the residue-level protein function and structure cover a broad spectrum of characteristics including intrinsic disorder, secondary structure, solvent accessibility and binding to nucleic acids. They were catalogued and evaluated in numerous surveys and assessments. However, methods focusing on a given characteristic are studied separately from predictors of other characteristics, while they are typically used on the same proteins. We fill this void by studying complementarity of a representative collection of methods that target different predictions using a large, taxonomically consistent, and low similarity dataset of human proteins. First, we bridge the gap between the communities that develop structure-trained vs. disorder-trained predictors of binding residues. Motivated by a recent study of the protein-binding residue predictions, we empirically find that combining the structure-trained and disorder-trained predictors of the DNA-binding and RNA-binding residues leads to substantial improvements in predictive quality. Second, we investigate whether diverse predictors generate results that accurately reproduce relations between secondary structure, solvent accessibility, interaction sites, and intrinsic disorder that are present in the experimental data. Our empirical analysis concludes that predictions accurately reflect all combinations of these relations. Altogether, this study provides unique insights that support combining results produced by diverse residue-level predictors of protein function and structure.
Collapse
Affiliation(s)
- Bálint Biró
- Institute of Genetics and Biotechnology, Hungarian University of Agriculture and Life Sciences, Gödöllő, Hungary
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
15
|
Antifeeva IA, Fonin AV, Fefilova AS, Stepanenko OV, Povarova OI, Silonov SA, Kuznetsova IM, Uversky VN, Turoverov KK. Liquid-liquid phase separation as an organizing principle of intracellular space: overview of the evolution of the cell compartmentalization concept. Cell Mol Life Sci 2022; 79:251. [PMID: 35445278 PMCID: PMC11073196 DOI: 10.1007/s00018-022-04276-4] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 03/24/2022] [Accepted: 03/27/2022] [Indexed: 12/14/2022]
Abstract
At the turn of the twenty-first century, fundamental changes took place in the understanding of the structure and function of proteins and then in the appreciation of the intracellular space organization. A rather mechanistic model of the organization of living matter, where the function of proteins is determined by their rigid globular structure, and the intracellular processes occur in rigidly determined compartments, was replaced by an idea that highly dynamic and multifunctional "soft matter" lies at the heart of all living things. According this "new view", the most important role in the spatio-temporal organization of the intracellular space is played by liquid-liquid phase transitions of biopolymers. These self-organizing cellular compartments are open dynamic systems existing at the edge of chaos. They are characterized by the exceptional structural and compositional dynamics, and their multicomponent nature and polyfunctionality provide means for the finely tuned regulation of various intracellular processes. Changes in the external conditions can cause a disruption of the biogenesis of these cellular bodies leading to the irreversible aggregation of their constituent proteins, followed by the transition to a gel-like state and the emergence of amyloid fibrils. This work represents a historical overview of changes in our understanding of the intracellular space compartmentalization. It also reflects methodological breakthroughs that led to a change in paradigms in this area of science and discusses modern ideas about the organization of the intracellular space. It is emphasized here that the membrane-less organelles have to combine a certain resistance to the changes in their environment and, at the same time, show high sensitivity to the external signals, which ensures the normal functioning of the cell.
Collapse
Affiliation(s)
- Iuliia A Antifeeva
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Alexander V Fonin
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Anna S Fefilova
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Olesya V Stepanenko
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Olga I Povarova
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Sergey A Silonov
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Irina M Kuznetsova
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL, 33612, USA.
| | - Konstantin K Turoverov
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Av., 4, St. Petersburg, 194064, Russia.
| |
Collapse
|
16
|
Zhao B, Kurgan L. Deep Learning in Prediction of Intrinsic Disorder in Proteins. Comput Struct Biotechnol J 2022; 20:1286-1294. [PMID: 35356546 PMCID: PMC8927795 DOI: 10.1016/j.csbj.2022.03.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 03/04/2022] [Accepted: 03/04/2022] [Indexed: 12/12/2022] Open
|
17
|
Kurgan L. Resources for computational prediction of intrinsic disorder in proteins. Methods 2022; 204:132-141. [DOI: 10.1016/j.ymeth.2022.03.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 03/25/2022] [Accepted: 03/29/2022] [Indexed: 12/26/2022] Open
|
18
|
Abstract
This mini-review represents a brief, disorder-centric consideration of the interplay between order and disorder in proteins. The goal here is to show that inside the cell, folding, non-folding, and misfolding of proteins are interlinked on multiple levels. This is evidenced by the highly heterogeneous spatio-temporal structural organization of a protein molecule, where one can find differently (dis)ordered components that can undergo local or global order-to-disorder and disorder-to-order transitions needed for functionality. This is further illustrated by the fact that at particular moments of their life, most notably during their synthesis and degradation, all proteins are at least partially disordered. In addition to these intrinsic forms of disorder, proteins are constantly facing extrinsic disorder, which is intrinsic disorder in their functional partners. All this comprises the multileveled protein disorder cycle.
Collapse
Affiliation(s)
- Vladimir N Uversky
- Department of Molecular Medicine and Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612 USA
| |
Collapse
|
19
|
Kaur H, van der Feltz C, Sun Y, Hoskins AA. Network theory reveals principles of spliceosome structure and dynamics. Structure 2022; 30:190-200.e2. [PMID: 34592160 PMCID: PMC8741635 DOI: 10.1016/j.str.2021.09.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Revised: 06/30/2021] [Accepted: 09/08/2021] [Indexed: 02/07/2023]
Abstract
Cryoelectron microscopy has revolutionized spliceosome structural biology, and structures representing much of the splicing process have been determined. Comparison of these structures is challenging due to extreme dynamics of the splicing machinery and the thousands of changing interactions during splicing. We have used network theory to analyze splicing factor interactions by constructing structure-based networks from protein-protein, protein-RNA, and RNA-RNA interactions found in eight different spliceosome structures. Our analyses reveal that connectivity dynamics result in step-specific impacts of factors on network topology. The spliceosome's connectivity is focused on the active site, in part due to contributions from nonglobular proteins. Many essential factors exhibit large shifts in centralities during splicing. Others show transiently high betweenness centralities at certain stages, thereby suggesting mechanisms for regulating splicing by briefly bridging otherwise poorly connected network nodes. These observations provide insights into organizing principles of the spliceosome and provide frameworks for comparative analysis of other macromolecular machines.
Collapse
Affiliation(s)
- Harpreet Kaur
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, 53706 USA,These authors contributed equally
| | - Clarisse van der Feltz
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, 53706 USA,College of Arts and Sciences, Northwest University, Kirkland, Washington, 98033 USA,These authors contributed equally
| | - Yichen Sun
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, 53706 USA
| | - Aaron A. Hoskins
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, 53706 USA,Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, 53706 USA,Correspondence:
| |
Collapse
|
20
|
Abstract
INTRODUCTION Intrinsic disorder prediction field develops, assesses, and deploys computational predictors of disorder in protein sequences and constructs and disseminates databases of these predictions. Over 40 years of research resulted in the release of numerous resources. AREAS COVERED We identify and briefly summarize the most comprehensive to date collection of over 100 disorder predictors. We focus on their predictive models, availability and predictive performance. We categorize and study them from a historical point of view to highlight informative trends. EXPERT OPINION We find a consistent trend of improvements in predictive quality as newer and more advanced predictors are developed. The original focus on machine learning methods has shifted to meta-predictors in early 2010s, followed by a recent transition to deep learning. The use of deep learners will continue in foreseeable future given recent and convincing success of these methods. Moreover, a broad range of resources that facilitate convenient collection of accurate disorder predictions is available to users. They include web servers and standalone programs for disorder prediction, servers that combine prediction of disorder and disorder functions, and large databases of pre-computed predictions. We also point to the need to address the shortage of accurate methods that predict disordered binding regions.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia, USA
| |
Collapse
|
21
|
Zhang F, Zhao B, Shi W, Li M, Kurgan L. DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning. Brief Bioinform 2021; 23:6461158. [PMID: 34905768 DOI: 10.1093/bib/bbab521] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 10/30/2021] [Accepted: 11/14/2021] [Indexed: 12/14/2022] Open
Abstract
Proteins with intrinsically disordered regions (IDRs) are common among eukaryotes. Many IDRs interact with nucleic acids and proteins. Annotation of these interactions is supported by computational predictors, but to date, only one tool that predicts interactions with nucleic acids was released, and recent assessments demonstrate that current predictors offer modest levels of accuracy. We have developed DeepDISOBind, an innovative deep multi-task architecture that accurately predicts deoxyribonucleic acid (DNA)-, ribonucleic acid (RNA)- and protein-binding IDRs from protein sequences. DeepDISOBind relies on an information-rich sequence profile that is processed by an innovative multi-task deep neural network, where subsequent layers are gradually specialized to predict interactions with specific partner types. The common input layer links to a layer that differentiates protein- and nucleic acid-binding, which further links to layers that discriminate between DNA and RNA interactions. Empirical tests show that this multi-task design provides statistically significant gains in predictive quality across the three partner types when compared to a single-task design and a representative selection of the existing methods that cover both disorder- and structure-trained tools. Analysis of the predictions on the human proteome reveals that DeepDISOBind predictions can be encoded into protein-level propensities that accurately predict DNA- and RNA-binding proteins and protein hubs. DeepDISOBind is available at https://www.csuligroup.com/DeepDISOBind/.
Collapse
Affiliation(s)
- Fuhao Zhang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | - Wenbo Shi
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA
| |
Collapse
|
22
|
Zhao B, Katuwawala A, Oldfield CJ, Hu G, Wu Z, Uversky VN, Kurgan L. Intrinsic Disorder in Human RNA-Binding Proteins. J Mol Biol 2021; 433:167229. [PMID: 34487791 DOI: 10.1016/j.jmb.2021.167229] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/24/2022]
Abstract
Although RNA-binding proteins (RBPs) are known to be enriched in intrinsic disorder, no previous analysis focused on RBPs interacting with specific RNA types. We fill this gap with a comprehensive analysis of the putative disorder in RBPs binding to six common RNA types: messenger RNA (mRNA), transfer RNA (tRNA), small nuclear RNA (snRNA), non-coding RNA (ncRNA), ribosomal RNA (rRNA), and internal ribosome RNA (irRNA). We also analyze the amount of putative intrinsic disorder in the RNA-binding domains (RBDs) and non-RNA-binding-domain regions (non-RBD regions). Consistent with previous studies, we show that in comparison with human proteome, RBPs are significantly enriched in disorder. However, closer examination finds significant enrichment in predicted disorder for the mRNA-, rRNA- and snRNA-binding proteins, while the proteins that interact with ncRNA and irRNA are not enriched in disorder, and the tRNA-binding proteins are significantly depleted in disorder. We show a consistent pattern of significant disorder enrichment in the non-RBD regions coupled with low levels of disorder in RBDs, which suggests that disorder is relatively rarely utilized in the RNA-binding regions. Our analysis of the non-RBD regions suggests that disorder harbors posttranslational modification sites and is involved in the putative interactions with DNA. Importantly, we utilize experimental data from DisProt and independent data from Pfam to validate the above observations that rely on the disorder predictions. This study provides new insights into the distribution of disorder across proteins that bind different RNA types and the functional role of disorder in the regions where it is enriched.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Christopher J Oldfield
- Department of Microbiology and Immunology, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Gang Hu
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin 300071, China
| | - Zhonghua Wu
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA.
| |
Collapse
|
23
|
Transcriptional control of ribosome biogenesis in yeast: links to growth and stress signals. Biochem Soc Trans 2021; 49:1589-1599. [PMID: 34240738 PMCID: PMC8421047 DOI: 10.1042/bst20201136] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 06/14/2021] [Accepted: 06/18/2021] [Indexed: 12/15/2022]
Abstract
Ribosome biogenesis requires prodigious transcriptional output in rapidly growing yeast cells and is highly regulated in response to both growth and stress signals. This minireview focuses on recent developments in our understanding of this regulatory process, with an emphasis on the 138 ribosomal protein genes (RPGs) themselves and a group of >200 ribosome biogenesis (RiBi) genes whose products contribute to assembly but are not part of the ribosome. Expression of most RPGs depends upon Rap1, a pioneer transcription factor (TF) required for the binding of a pair of RPG-specific TFs called Fhl1 and Ifh1. RPG expression is correlated with Ifh1 promoter binding, whereas Rap1 and Fhl1 remain promoter-associated upon stress-induced down regulation. A TF called Sfp1 has also been implicated in RPG regulation, though recent work reveals that its primary function is in activation of RiBi and other growth-related genes. Sfp1 plays an important regulatory role at a small number of RPGs where Rap1–Fhl1–Ifh1 action is subsidiary or non-existent. In addition, nearly half of all RPGs are bound by Hmo1, which either stabilizes or re-configures Fhl1–Ifh1 binding. Recent studies identified the proline rotamase Fpr1, known primarily for its role in rapamycin-mediated inhibition of the TORC1 kinase, as an additional TF at RPG promoters. Fpr1 also affects Fhl1–Ifh1 binding, either independently or in cooperation with Hmo1. Finally, a major recent development was the discovery of a protein homeostasis mechanism driven by unassembled ribosomal proteins, referred to as the Ribosome Assembly Stress Response (RASTR), that controls RPG transcription through the reversible condensation of Ifh1.
Collapse
|
24
|
Oldfield CJ, Peng Z, Kurgan L. Disordered RNA-Binding Region Prediction with DisoRDPbind. Methods Mol Biol 2021; 2106:225-239. [PMID: 31889261 DOI: 10.1007/978-1-0716-0231-7_14] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
RNA chaperone activity is one of the many functions of intrinsically disordered regions (IDRs). IDRs function without the prerequisite of a stable structure. Instead, their functions arise from structural ensembles. A common theme in IDR function is molecular recognition; IDRs mediate interactions with other proteins, RNA, and DNA. Many computational methods are available to predict IDRs from protein sequence, but relatively few are available for predicting IDR functions. Available methods primarily focus on protein-protein interactions. DisoRDPbind was developed to predict several protein functions including interactions with RNA. This method is available as a user-friendly web interface, located at http://biomine.cs.vcu.edu/servers/DisoRDPbind/ . The development and architecture of DisoRDPbind is briefly presented, and its accuracy relative to other RNA-binding residue predictors is discussed. We explain usage of the web interface in detail and provide an example of prediction results and interpretation. While DisoRDPbind does not identify RNA chaperones directly, we provide a case study of an RNA chaperone, HCV core protein, as an example of the method's utility in the study of RNA chaperones.
Collapse
Affiliation(s)
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, People's Republic of China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
25
|
Peng Z, Xing Q, Kurgan L. APOD: accurate sequence-based predictor of disordered flexible linkers. Bioinformatics 2021; 36:i754-i761. [PMID: 33381830 PMCID: PMC7773485 DOI: 10.1093/bioinformatics/btaa808] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/07/2020] [Indexed: 12/21/2022] Open
Abstract
Motivation Disordered flexible linkers (DFLs) are abundant and functionally important intrinsically disordered regions that connect protein domains and structural elements within domains and which facilitate disorder-based allosteric regulation. Although computational estimates suggest that thousands of proteins have DFLs, they were annotated experimentally in <200 proteins. This substantial annotation gap can be reduced with the help of accurate computational predictors. The sole predictor of DFLs, DFLpred, trade-off accuracy for shorter runtime by excluding relevant but computationally costly predictive inputs. Moreover, it relies on the local/window-based information while lacking to consider useful protein-level characteristics. Results We conceptualize, design and test APOD (Accurate Predictor Of DFLs), the first highly accurate predictor that utilizes both local- and protein-level inputs that quantify propensity for disorder, sequence composition, sequence conservation and selected putative structural properties. Consequently, APOD offers significantly more accurate predictions when compared with its faster predecessor, DFLpred, and several other alternative ways to predict DFLs. These improvements stem from the use of a more comprehensive set of inputs that cover the protein-level information and the application of a more sophisticated predictive model, a well-parametrized support vector machine. APOD achieves area under the curve = 0.82 (28% improvement over DFLpred) and Matthews correlation coefficient = 0.42 (180% increase over DFLpred) when tested on an independent/low-similarity test dataset. Consequently, APOD is a suitable choice for accurate and small-scale prediction of DFLs. Availability and implementation https://yanglab.nankai.edu.cn/APOD/.
Collapse
Affiliation(s)
- Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin 300072, China.,School of Statistics and Data Science, Nankai University, Tianjin 300074, China
| | - Qian Xing
- Center for Applied Mathematics, Tianjin University, Tianjin 300072, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
26
|
Zhao B, Katuwawala A, Uversky VN, Kurgan L. IDPology of the living cell: intrinsic disorder in the subcellular compartments of the human cell. Cell Mol Life Sci 2021; 78:2371-2385. [PMID: 32997198 PMCID: PMC11071772 DOI: 10.1007/s00018-020-03654-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 09/09/2020] [Accepted: 09/22/2020] [Indexed: 12/11/2022]
Abstract
Intrinsic disorder can be found in all proteomes of all kingdoms of life and in viruses, being particularly prevalent in the eukaryotes. We conduct a comprehensive analysis of the intrinsic disorder in the human proteins while mapping them into 24 compartments of the human cell. In agreement with previous studies, we show that human proteins are significantly enriched in disorder relative to a generic protein set that represents the protein universe. In fact, the fraction of proteins with long disordered regions and the average protein-level disorder content in the human proteome are about 3 times higher than in the protein universe. Furthermore, levels of intrinsic disorder in the majority of human subcellular compartments significantly exceed the average disorder content in the protein universe. Relative to the overall amount of disorder in the human proteome, proteins localized in the nucleus and cytoskeleton have significantly increased amounts of disorder, measured by both high disorder content and presence of multiple long intrinsically disordered regions. We empirically demonstrate that, on average, human proteins are assigned to 2.3 subcellular compartments, with proteins localized to few subcellular compartments being more disordered than the proteins that are localized to many compartments. Functionally, the disordered proteins localized in the most disorder-enriched subcellular compartments are primarily responsible for interactions with nucleic acids and protein partners. This is the first-time disorder is comprehensively mapped into the human cell. Our observations add a missing piece to the puzzle of functional disorder and its organization inside the cell.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL, 33612, USA.
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Russia.
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| |
Collapse
|
27
|
Abstract
To perform an accurate protein synthesis, ribosomes accomplish complex tasks involving the long-range communication between its functional centres such as the peptidyl transfer centre, the tRNA bindings sites and the peptide exit tunnel. How information is transmitted between these sites remains one of the major challenges in current ribosome research. Many experimental studies have revealed that some r-proteins play essential roles in remote communication and the possible involvement of r-protein networks in these processes have been recently proposed. Our phylogenetic, structural and mathematical study reveals that of the three kingdom's r-protein networks converged towards non-random graphs where r-proteins collectively coevolved to optimize interconnection between functional centres. The massive acquisition of conserved aromatic residues at the interfaces and along the extensions of the newly connected eukaryotic r-proteins also highlights that a strong selective pressure acts on their sequences probably for the formation of new allosteric pathways in the network.
Collapse
|
28
|
Kurgan L, Li M, Li Y. The Methods and Tools for Intrinsic Disorder Prediction and their Application to Systems Medicine. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11320-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
|
29
|
Peng Z, Xing Q, Kurgan L. APOD: accurate sequence-based predictor of disordered flexible linkers. BIOINFORMATICS (OXFORD, ENGLAND) 2020; 36:i754-i761. [PMID: 33381830 DOI: 10.1101/2020.12.03.409755] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 09/07/2020] [Indexed: 05/28/2023]
Abstract
MOTIVATION Disordered flexible linkers (DFLs) are abundant and functionally important intrinsically disordered regions that connect protein domains and structural elements within domains and which facilitate disorder-based allosteric regulation. Although computational estimates suggest that thousands of proteins have DFLs, they were annotated experimentally in <200 proteins. This substantial annotation gap can be reduced with the help of accurate computational predictors. The sole predictor of DFLs, DFLpred, trade-off accuracy for shorter runtime by excluding relevant but computationally costly predictive inputs. Moreover, it relies on the local/window-based information while lacking to consider useful protein-level characteristics. RESULTS We conceptualize, design and test APOD (Accurate Predictor Of DFLs), the first highly accurate predictor that utilizes both local- and protein-level inputs that quantify propensity for disorder, sequence composition, sequence conservation and selected putative structural properties. Consequently, APOD offers significantly more accurate predictions when compared with its faster predecessor, DFLpred, and several other alternative ways to predict DFLs. These improvements stem from the use of a more comprehensive set of inputs that cover the protein-level information and the application of a more sophisticated predictive model, a well-parametrized support vector machine. APOD achieves area under the curve = 0.82 (28% improvement over DFLpred) and Matthews correlation coefficient = 0.42 (180% increase over DFLpred) when tested on an independent/low-similarity test dataset. Consequently, APOD is a suitable choice for accurate and small-scale prediction of DFLs. AVAILABILITY AND IMPLEMENTATION https://yanglab.nankai.edu.cn/APOD/.
Collapse
Affiliation(s)
- Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin 300072, China
- School of Statistics and Data Science, Nankai University, Tianjin 300074, China
| | - Qian Xing
- Center for Applied Mathematics, Tianjin University, Tianjin 300072, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
30
|
Abstract
Most RNA-binding modules are small and bind few nucleotides. RNA-binding proteins typically attain the physiological specificity and affinity for their RNA targets by combining several RNA-binding modules. Here, we review how disordered linkers connecting RNA-binding modules govern the specificity and affinity of RNA-protein interactions by regulating the effective concentration of these modules and their relative orientation. RNA-binding proteins also often contain extended intrinsically disordered regions that mediate protein-protein and RNA-protein interactions with multiple partners. We discuss how these regions can connect proteins and RNA resulting in heterogeneous higher-order assemblies such as membrane-less compartments and amyloid-like structures that have the characteristics of multi-modular entities. The assembled state generates additional RNA-binding specificity and affinity properties that contribute to further the function of RNA-binding proteins within the cellular environment.
Collapse
Affiliation(s)
- Diana S M Ottoz
- Department of Genetics and Development, Columbia University Irving Medical Center New York, NY 10032, USA
| | - Luke E Berchowitz
- Department of Genetics and Development, Columbia University Irving Medical Center New York, NY 10032, USA.,Taub Institute for Research on Alzheimer's and the Aging Brain, Columbia University Irving Medical Center New York, NY 10032, USA
| |
Collapse
|
31
|
Armaos A, Zacco E, Sanchez de Groot N, Tartaglia GG. RNA-protein interactions: Central players in coordination of regulatory networks. Bioessays 2020; 43:e2000118. [PMID: 33284474 DOI: 10.1002/bies.202000118] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 09/30/2020] [Accepted: 10/01/2020] [Indexed: 12/12/2022]
Abstract
Changes in the abundance of protein and RNA molecules can impair the formation of complexes in the cell leading to toxicity and death. Here we exploit the information contained in protein, RNA and DNA interaction networks to provide a comprehensive view of the regulation layers controlling the concentration-dependent formation of assemblies in the cell. We present the emerging concept that RNAs can act as scaffolds to promote the formation ribonucleoprotein complexes and coordinate the post-transcriptional layer of gene regulation. We describe the structural and interaction network properties that characterize the ability of protein and RNA molecules to interact and phase separate in liquid-like compartments. Finally, we show that presence of structurally disordered regions in proteins correlate with the propensity to undergo liquid-to-solid phase transitions and cause human diseases. Also see the video abstract here https://youtu.be/kfpqibsNfS0.
Collapse
Affiliation(s)
- Alexandros Armaos
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Center for Human Technologies, Istituto Italiano di Tecnologia, Genova, Italy
| | - Elsa Zacco
- Center for Human Technologies, Istituto Italiano di Tecnologia, Genova, Italy
| | - Natalia Sanchez de Groot
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Gian Gaetano Tartaglia
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Center for Human Technologies, Istituto Italiano di Tecnologia, Genova, Italy.,Department of Biology 'Charles Darwin', Sapienza University of Rome, Rome, Italy.,Institucio Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| |
Collapse
|
32
|
Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins. Biomolecules 2020; 10:biom10121636. [PMID: 33291838 PMCID: PMC7762010 DOI: 10.3390/biom10121636] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Revised: 11/26/2020] [Accepted: 12/03/2020] [Indexed: 01/18/2023] Open
Abstract
With over 60 disorder predictors, users need help navigating the predictor selection task. We review 28 surveys of disorder predictors, showing that only 11 include assessment of predictive performance. We identify and address a few drawbacks of these past surveys. To this end, we release a novel benchmark dataset with reduced similarity to the training sets of the considered predictors. We use this dataset to perform a first-of-its-kind comparative analysis that targets two large functional families of disordered proteins that interact with proteins and with nucleic acids. We show that limiting sequence similarity between the benchmark and the training datasets has a substantial impact on predictive performance. We also demonstrate that predictive quality is sensitive to the use of the well-annotated order and inclusion of the fully structured proteins in the benchmark datasets, both of which should be considered in future assessments. We identify three predictors that provide favorable results using the new benchmark set. While we find that VSL2B offers the most accurate and robust results overall, ESpritz-DisProt and SPOT-Disorder perform particularly well for disordered proteins. Moreover, we find that predictions for the disordered protein-binding proteins suffer low predictive quality compared to generic disordered proteins and the disordered nucleic acids-binding proteins. This can be explained by the high disorder content of the disordered protein-binding proteins, which makes it difficult for the current methods to accurately identify ordered regions in these proteins. This finding motivates the development of a new generation of methods that would target these difficult-to-predict disordered proteins. We also discuss resources that support users in collecting and identifying high-quality disorder predictions.
Collapse
|
33
|
An intrinsically disordered motif regulates the interaction between the p47 adaptor and the p97 AAA+ ATPase. Proc Natl Acad Sci U S A 2020; 117:26226-26236. [PMID: 33028677 DOI: 10.1073/pnas.2013920117] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
VCP/p97, an enzyme critical to proteostasis, is regulated through interactions with protein adaptors targeting it to specific cellular tasks. One such adaptor, p47, forms a complex with p97 to direct lipid membrane remodeling. Here, we use NMR and other biophysical methods to study the structural dynamics of p47 and p47-p97 complexes. Disordered regions in p47 are shown to be critical in directing intra-p47 and p47-p97 interactions via a pair of previously unidentified linear motifs. One of these, an SHP domain, regulates p47 binding to p97 in a manner that depends on the nucleotide state of p97. NMR and electron cryomicroscopy data have been used as restraints in molecular dynamics trajectories to develop structural ensembles for p47-p97 complexes in adenosine diphosphate (ADP)- and adenosine triphosphate (ATP)-bound conformations, highlighting differences in interactions in the two states. Our study establishes the importance of intrinsically disordered regions in p47 for the formation of functional p47-p97 complexes.
Collapse
|
34
|
Zhou J, Oldfield CJ, Yan W, Shen B, Dunker A. Identification of Intrinsic Disorder in Complexes from the Protein Data Bank. ACS OMEGA 2020; 5:17883-17891. [PMID: 32743159 PMCID: PMC7391252 DOI: 10.1021/acsomega.9b03927] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2019] [Accepted: 03/18/2020] [Indexed: 02/08/2023]
Abstract
![]()
Background:
Intrinsically disordered proteins or regions (IDPs
or IDRs) lack stable structures in solution, yet often fold upon binding
with partners. IDPs or IDRs are highly abundant in all proteomes and
represent a significant modification of sequence → structure
→ function paradigm. The Protein Data Bank (PDB) includes complexes
containing disordered segments bound to globular proteins, but the
molecular mechanisms of such binding interactions remain largely unknown.
Results: In this study, we present the results of various disorder
predictions on a nonredundant set of PDB complexes. In contrast to
their structural appearances, many PDB proteins were predicted to
be disordered when separated from their binding partners. These predicted-to-be-disordered
proteins were observed to form structures depending upon various factors,
including heterogroup binding, protein/DNA/RNA binding, disulfide
bonds, and ion binding. Conclusions: This study collects many examples
of disorder-to-order transition in IDP complex formation, thus revealing
the unusual structure–function relationships of IDPs and providing
an additional support for the newly proposed paradigm of the sequence
→ IDP/IDR ensemble → function.
Collapse
Affiliation(s)
- Jianhong Zhou
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| | - Christopher J. Oldfield
- Computer Science Department, Virginia Commonwealth University, Richmond, Virginia 23284, United States
| | - Wenying Yan
- School of Biology & Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Bairong Shen
- Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - A.Keith Dunker
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| |
Collapse
|
35
|
Zhang J, Kurgan L. SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences. Bioinformatics 2020; 35:i343-i353. [PMID: 31510679 PMCID: PMC6612887 DOI: 10.1093/bioinformatics/btz324] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Motivation Accurate predictions of protein-binding residues (PBRs) enhances understanding of molecular-level rules governing protein–protein interactions, helps protein–protein docking and facilitates annotation of protein functions. Recent studies show that current sequence-based predictors of PBRs severely cross-predict residues that interact with other types of protein partners (e.g. RNA and DNA) as PBRs. Moreover, these methods are relatively slow, prohibiting genome-scale use. Results We propose a novel, accurate and fast sequence-based predictor of PBRs that minimizes the cross-predictions. Our SCRIBER (SeleCtive pRoteIn-Binding rEsidue pRedictor) method takes advantage of three innovations: comprehensive dataset that covers multiple types of binding residues, novel types of inputs that are relevant to the prediction of PBRs, and an architecture that is tailored to reduce the cross-predictions. The dataset includes complete protein chains and offers improved coverage of binding annotations that are transferred from multiple protein–protein complexes. We utilize innovative two-layer architecture where the first layer generates a prediction of protein-binding, RNA-binding, DNA-binding and small ligand-binding residues. The second layer re-predicts PBRs by reducing overlap between PBRs and the other types of binding residues produced in the first layer. Empirical tests on an independent test dataset reveal that SCRIBER significantly outperforms current predictors and that all three innovations contribute to its high predictive performance. SCRIBER reduces cross-predictions by between 41% and 69% and our conservative estimates show that it is at least 3 times faster. We provide putative PBRs produced by SCRIBER for the entire human proteome and use these results to hypothesize that about 14% of currently known human protein domains bind proteins. Availability and implementation SCRIBER webserver is available at http://biomine.cs.vcu.edu/servers/SCRIBER/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jian Zhang
- School of Computer and Information Technology, Xinyang Normal University, Xinyang, China.,Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
36
|
Yan J, Cheng J, Kurgan L, Uversky VN. Structural and functional analysis of "non-smelly" proteins. Cell Mol Life Sci 2020; 77:2423-2440. [PMID: 31486849 PMCID: PMC11105052 DOI: 10.1007/s00018-019-03292-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 08/21/2019] [Accepted: 08/28/2019] [Indexed: 01/09/2023]
Abstract
Cysteine and aromatic residues are major structure-promoting residues. We assessed the abundance, structural coverage, and functional characteristics of the "non-smelly" proteins, i.e., proteins that do not contain cysteine residues (C-depleted) or cysteine and aromatic residues (CFYWH-depleted), across 817 proteomes from all domains of life. The analysis revealed that although these proteomes contained significant levels of the C-depleted proteins, with prokaryotes being significantly more enriched in such proteins than eukaryotes, the CFYWH-depleted proteins were relatively rare, accounting for about 0.05% of proteomes. Furthermore, CFYWH-depleted proteins were virtually never found in PDB. Depletion in cysteine and in aromatic residues was associated with the substantially increased intrinsic disorder levels across all domains of life. Archaeal and eukaryotic organisms with higher levels of the C-depleted proteins were shown to have higher levels of the intrinsic disorder and lower levels of structural coverage. We also showed that the "non-smelly" proteins typically did not independently fold into monomeric structures, and instead, they fold by interacting with nucleic acids as constituents of the ribosome and nucleosome complexes. They were shown to be involved in translation, transcription, nucleosome assembly, transmembrane transport, and protein folding functions, all of which are known to be associated with the intrinsic disorder. Our data suggested that, in general, structure of monomeric proteins is crucially dependent on the presence of cysteine and aromatic residues.
Collapse
Affiliation(s)
- Jing Yan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd., MDC07, Tampa, FL, 33612, USA.
- Protein Research Group, Institute for Biological Instrumentation of the Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
| |
Collapse
|
37
|
Hu G, Wu Z, Oldfield CJ, Wang C, Kurgan L. Quality assessment for the putative intrinsic disorder in proteins. Bioinformatics 2020; 35:1692-1700. [PMID: 30329008 DOI: 10.1093/bioinformatics/bty881] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 09/19/2018] [Accepted: 10/15/2018] [Indexed: 11/15/2022] Open
Abstract
MOTIVATION While putative intrinsic disorder is widely used, none of the predictors provides quality assessment (QA) scores. QA scores estimate the likelihood that predictions are correct at a residue level and have been applied in other bioinformatics areas. We recently reported that QA scores derived from putative disorder propensities perform relatively poorly for native disordered residues. Here we design and validate a general approach to construct QA predictors for disorder predictions. RESULTS The QUARTER (QUality Assessment for pRotein inTrinsic disordEr pRedictions) toolbox of methods accommodates a diverse set of ten disorder predictors. It builds upon several innovative design elements including use and scaling of selected physicochemical properties of the input sequence, post-processing of disorder propensity scores, and a feature selection that optimizes the predictive models to a specific disorder predictor. We empirically establish that each one of these elements contributes to the overall predictive performance of our tool and that QUARTER's outputs significantly outperform QA scores derived from the outputs generated the disorder predictors. The best performing QA scores for a single disorder predictor identify 13% of residues that are predicted with 98% precision. QA scores computed by combining results of the ten disorder predictors cover 40% of residues with 95% precision. Case studies are used to show how to interpret the QA scores. QA scores based on the high precision combined predictions are applied to analyze disorder in the human proteome. AVAILABILITY AND IMPLEMENTATION http://biomine.cs.vcu.edu/servers/QUARTER/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gang Hu
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, People's Republic of China
| | - Zhonghua Wu
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, People's Republic of China
| | | | - Chen Wang
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
38
|
Banerjee S, Ferdosh S, Ghosh AN, Barat C. Tau protein- induced sequestration of the eukaryotic ribosome: Implications in neurodegenerative disease. Sci Rep 2020; 10:5225. [PMID: 32251304 PMCID: PMC7090008 DOI: 10.1038/s41598-020-61777-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 02/28/2020] [Indexed: 12/15/2022] Open
Abstract
The human tau is a microtubule-associated intrinsically unstructured protein that forms intraneuronal cytotoxic deposits in neurodegenerative diseases, like tauopathies. Recent studies indicate that in Alzheimer's disease, ribosomal dysfunction might be a crucial event in the disease pathology. Our earlier studies had demonstrated that amorphous protein aggregation in the presence of ribosome can lead to sequestration of the ribosomal components. The present study aims at determining the effect of incubation of the full-length tau protein (Ht40) and its microtubule binding 4-repeat domain (K18) on the eukaryotic ribosome. Our in vitro studies show that incubation of Ht40 and the K18 tau variants with isolated non-translating yeast ribosome can induce a loss of ribosome physical integrity resulting in formation of tau-rRNA-ribosomal protein aggregates. Incubation with the tau protein variants also led to a disappearance of the peak indicating the ribosome profile of the HeLa cell lysate and suppression of translation in the human in vitro translation system. The incubation of tau protein with the ribosomal RNA leads to the formation of tau-rRNA aggregates. The effect of K18 on the yeast ribosome can be mitigated in the presence of cellular polyanions like heparin and tRNA, thereby indicating the electrostatic nature of the aggregation process.
Collapse
Affiliation(s)
- Senjuti Banerjee
- Department of Biotechnology, St. Xavier's College, Park Street, Kolkata, 700016, West Bengal, India
| | - Sehnaz Ferdosh
- Department of Biotechnology, St. Xavier's College, Park Street, Kolkata, 700016, West Bengal, India
| | - Amar Nath Ghosh
- National Institute of Cholera and Enteric Diseases P-33, C.I.T. Road, Scheme XM, Beleghata, India
| | - Chandana Barat
- Department of Biotechnology, St. Xavier's College, Park Street, Kolkata, 700016, West Bengal, India.
| |
Collapse
|
39
|
Minde DP, Ramakrishna M, Lilley KS. Biotin proximity tagging favours unfolded proteins and enables the study of intrinsically disordered regions. Commun Biol 2020; 3:38. [PMID: 31969649 PMCID: PMC6976632 DOI: 10.1038/s42003-020-0758-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 12/16/2019] [Indexed: 02/07/2023] Open
Abstract
Intrinsically Disordered Regions (IDRs) are enriched in disease-linked proteins known to have multiple post-translational modifications, but there is limited in vivo information about how locally unfolded protein regions contribute to biological functions. We reasoned that IDRs should be more accessible to targeted in vivo biotinylation than ordered protein regions, if they retain their flexibility in human cells. Indeed, we observed increased biotinylation density in predicted IDRs in several cellular compartments >20,000 biotin sites from four proximity proteomics studies. We show that in a biotin ‘painting’ time course experiment, biotinylation events in Escherichia coli ribosomes progress from unfolded and exposed regions at 10 s, to structured and less accessible regions after five minutes. We conclude that biotin proximity tagging favours sites of local disorder in proteins and suggest the possibility of using biotin painting as a method to gain unique insights into in vivo condition-dependent subcellular plasticity of proteins. David-Paul Minde, Manasa Ramakrishna et al. look at intrinsically disordered regions of disease-linked proteins in vivo by biotinylation. They show that biotin “painting” could be used as a method to examine the condition-dependent plasticity of proteins in vivo.
Collapse
Affiliation(s)
- David-Paul Minde
- Department of Biochemistry, Cambridge Centre for Proteomics, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK.
| | - Manasa Ramakrishna
- Medical Research Council Toxicology Unit, University of Cambridge, Lancaster Road, Leicester, LE1 9HN, UK
| | - Kathryn S Lilley
- Department of Biochemistry, Cambridge Centre for Proteomics, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK.
| |
Collapse
|
40
|
Oldfield CJ, Fan X, Wang C, Dunker AK, Kurgan L. Computational Prediction of Intrinsic Disorder in Protein Sequences with the disCoP Meta-predictor. Methods Mol Biol 2020; 2141:21-35. [PMID: 32696351 DOI: 10.1007/978-1-0716-0524-0_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Intrinsically disordered proteins are either entirely disordered or contain disordered regions in their native state. These proteins and regions function without the prerequisite of a stable structure and were found to be abundant across all kingdoms of life. Experimental annotation of disorder lags behind the rapidly growing number of sequenced proteins, motivating the development of computational methods that predict disorder in protein sequences. DisCoP is a user-friendly webserver that provides accurate sequence-based prediction of protein disorder. It relies on meta-architecture in which the outputs generated by multiple disorder predictors are combined together to improve predictive performance. The architecture of disCoP is presented, and its accuracy relative to several other disorder predictors is briefly discussed. We describe usage of the web interface and explain how to access and read results generated by this computational tool. We also provide an example of prediction results and interpretation. The disCoP's webserver is publicly available at http://biomine.cs.vcu.edu/servers/disCoP/ .
Collapse
Affiliation(s)
| | - Xiao Fan
- Department of Pediatrics, Columbia University, New York, NY, USA
| | - Chen Wang
- Department of Medicine, Columbia University, New York, NY, USA
| | - A Keith Dunker
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
41
|
Oldfield CJ, Peng Z, Uversky VN, Kurgan L. Codon selection reduces GC content bias in nucleic acids encoding for intrinsically disordered proteins. Cell Mol Life Sci 2020; 77:149-160. [PMID: 31175370 PMCID: PMC11104855 DOI: 10.1007/s00018-019-03166-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Revised: 05/14/2019] [Accepted: 05/28/2019] [Indexed: 02/06/2023]
Abstract
Protein-coding nucleic acids exhibit composition and codon biases between sequences coding for intrinsically disordered regions (IDRs) and those coding for structured regions. IDRs are regions of proteins that are folding self-insufficient and which function without the prerequisite of folded structure. Several authors have investigated composition bias or codon selection in regions encoding for IDRs, primarily in Eukaryota, and concluded that elevated GC content is the result of the biased amino acid composition of IDRs. We substantively extend previous work by examining GC content in regions encoding IDRs, from 44 species in Eukaryota, Archaea, and Bacteria, spanning a wide range of GC content. We confirm that regions coding for IDRs show a significantly elevated GC content, even across all domains of life. Although this is largely attributable to the amino acid composition bias of IDRs, we show that this bias is independent of the overall GC content and, most importantly, we are the first to observe that GC content bias in IDRs is significantly different than expected from IDR amino acid composition alone. We empirically find compensatory codon selection that reduces the observed GC content bias in IDRs. This selection is dependent on the overall GC content of the organism. The codon selection bias manifests as use of infrequent, AT-rich codons in encoding IDRs. Further, we find these relationships to be independent of the intrinsic disorder prediction method used, and independent of estimated translation efficiency. These observations are consistent with the previous work, and we speculate on whether the observed biases are causal or symptomatic of other driving forces.
Collapse
Affiliation(s)
- Christopher J Oldfield
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, 300072, China
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| |
Collapse
|
42
|
Abstract
Intrinsically disordered regions (IDRs) are estimated to be highly abundant in nature. While only several thousand proteins are annotated with experimentally derived IDRs, computational methods can be used to predict IDRs for the millions of currently uncharacterized protein chains. Several dozen disorder predictors were developed over the last few decades. While some of these methods provide accurate predictions, unavoidably they also make some mistakes. Consequently, one of the challenges facing users of these methods is how to decide which predictions can be trusted and which are likely incorrect. This practical problem can be solved using quality assessment (QA) scores that predict correctness of the underlying (disorder) predictions at a residue level. We motivate and describe a first-of-its-kind toolbox of QA methods, QUARTER (QUality Assessment for pRotein inTrinsic disordEr pRedictions), which provides the scores for a diverse set of ten disorder predictors. QUARTER is available to the end users as a free and convenient webserver at http://biomine.cs.vcu.edu/servers/QUARTER/ . We briefly describe the predictive architecture of QUARTER and provide detailed instructions on how to use the webserver. We also explain how to interpret results produced by QUARTER with the help of a case study.
Collapse
|
43
|
Bandyopadhyay A, Dhar AK, Basu S. Graph coloring: a novel heuristic based on trailing path—properties, perspective and applications in structured networks. Soft comput 2020. [DOI: 10.1007/s00500-019-04278-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
44
|
Katuwawala A, Oldfield CJ, Kurgan L. DISOselect: Disorder predictor selection at the protein level. Protein Sci 2020; 29:184-200. [PMID: 31642118 PMCID: PMC6933862 DOI: 10.1002/pro.3756] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 10/16/2019] [Accepted: 10/17/2019] [Indexed: 12/27/2022]
Abstract
The intense interest in the intrinsically disordered proteins in the life science community, together with the remarkable advancements in predictive technologies, have given rise to the development of a large number of computational predictors of intrinsic disorder from protein sequence. While the growing number of predictors is a positive trend, we have observed a considerable difference in predictive quality among predictors for individual proteins. Furthermore, variable predictor performance is often inconsistent between predictors for different proteins, and the predictor that shows the best predictive performance depends on the unique properties of each protein sequence. We propose a computational approach, DISOselect, to estimate the predictive performance of 12 selected predictors for individual proteins based on their unique sequence-derived properties. This estimation informs the users about the expected predictive quality for a selected disorder predictor and can be used to recommend methods that are likely to provide the best quality predictions. Our solution does not depend on the results of any disorder predictor; the estimations are made based solely on the protein sequence. Our solution significantly improves predictive performance, as judged with a test set of 1,000 proteins, when compared to other alternatives. We have empirically shown that by using the recommended methods the overall predictive performance for a given set of proteins can be improved by a statistically significant margin. DISOselect is freely available for non-commercial users through the webserver at http://biomine.cs.vcu.edu/servers/DISOselect/.
Collapse
Affiliation(s)
- Akila Katuwawala
- Department of Computer ScienceVirginia Commonwealth UniversityRichmondVirginia
| | | | - Lukasz Kurgan
- Department of Computer ScienceVirginia Commonwealth UniversityRichmondVirginia
| |
Collapse
|
45
|
Barik A, Katuwawala A, Hanson J, Paliwal K, Zhou Y, Kurgan L. DEPICTER: Intrinsic Disorder and Disorder Function Prediction Server. J Mol Biol 2019; 432:3379-3387. [PMID: 31870849 DOI: 10.1016/j.jmb.2019.12.030] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Revised: 12/07/2019] [Accepted: 12/15/2019] [Indexed: 01/06/2023]
Abstract
Computational predictions of the intrinsic disorder and its functions are instrumental to facilitate annotation for the millions of unannotated proteins. However, access to these predictors is fragmented and requires substantial effort to find them and to collect and combine their results. The DEPICTER (DisorderEd PredictIon CenTER) server provides first-of-its-kind centralized access to 10 popular disorder and disorder function predictions that cover protein and nucleic acids binding, linkers, and moonlighting regions. It automates the prediction process, runs user-selected methods on the server side, visualizes the results, and outputs all predictions in a consistent and easy-to-parse format. DEPICTER also includes two accurate consensus predictors of disorder and disordered protein binding. Empirical tests on an independent (low similarity) benchmark dataset reveal that the computational tools included in DEPICTER generate accurate predictions that are significantly better than the results secured using sequence alignment. The DEPICTER server is freely available at http://biomine.cs.vcu.edu/servers/DEPICTER/.
Collapse
Affiliation(s)
- Amita Barik
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA; Department of Biotechnology, National Institute of Technology, Durgapur, India
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | - Jack Hanson
- Signal Processing Laboratory, Griffith University, Brisbane, QLD, 4122, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, Griffith University, Brisbane, QLD, 4122, Australia
| | - Yaoqi Zhou
- School of Information and Communication Technology, Griffith University, Gold Coast, QLD, 4222, Australia; Institute for Glycomics, Griffith University, Gold Coast, QLD, 4222, Australia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| |
Collapse
|
46
|
Ghadermarzi S, Li X, Li M, Kurgan L. Sequence-Derived Markers of Drug Targets and Potentially Druggable Human Proteins. Front Genet 2019; 10:1075. [PMID: 31803227 PMCID: PMC6872670 DOI: 10.3389/fgene.2019.01075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Accepted: 10/09/2019] [Indexed: 12/16/2022] Open
Abstract
Recent research shows that majority of the druggable human proteome is yet to be annotated and explored. Accurate identification of these unexplored druggable proteins would facilitate development, screening, repurposing, and repositioning of drugs, as well as prediction of new drug–protein interactions. We contrast the current drug targets against the datasets of non-druggable and possibly druggable proteins to formulate markers that could be used to identify druggable proteins. We focus on the markers that can be extracted from protein sequences or names/identifiers to ensure that they can be applied across the entire human proteome. These markers quantify key features covered in the past works (topological features of PPIs, cellular functions, and subcellular locations) and several novel factors (intrinsic disorder, residue-level conservation, alternative splicing isoforms, domains, and sequence-derived solvent accessibility). We find that the possibly druggable proteins have significantly higher abundance of alternative splicing isoforms, relatively large number of domains, higher degree of centrality in the protein-protein interaction networks, and lower numbers of conserved and surface residues, when compared with the non-druggable proteins. We show that the current drug targets and possibly druggable proteins share involvement in the catalytic and signaling functions. However, unlike the drug targets, the possibly druggable proteins participate in the metabolic and biosynthesis processes, are enriched in the intrinsic disorder, interact with proteins and nucleic acids, and are localized across the cell. To sum up, we formulate several markers that can help with finding novel druggable human proteins and provide interesting insights into the cellular functions and subcellular locations of the current drug targets and potentially druggable proteins.
Collapse
Affiliation(s)
- Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Xingyi Li
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
47
|
Katuwawala A, Oldfield CJ, Kurgan L. Accuracy of protein-level disorder predictions. Brief Bioinform 2019; 21:1509-1522. [DOI: 10.1093/bib/bbz100] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 06/22/2019] [Accepted: 07/15/2019] [Indexed: 01/15/2023] Open
Abstract
Abstract
Experimental annotations of intrinsic disorder are available for 0.1% of 147 000 000 of currently sequenced proteins. Over 60 sequence-based disorder predictors were developed to help bridge this gap. Current benchmarks of these methods assess predictive performance on datasets of proteins; however, predictions are often interpreted for individual proteins. We demonstrate that the protein-level predictive performance varies substantially from the dataset-level benchmarks. Thus, we perform first-of-its-kind protein-level assessment for 13 popular disorder predictors using 6200 disorder-annotated proteins. We show that the protein-level distributions are substantially skewed toward high predictive quality while having long tails of poor predictions. Consequently, between 57% and 75% proteins secure higher predictive performance than the currently used dataset-level assessment suggests, but as many as 30% of proteins that are located in the long tails suffer low predictive performance. These proteins typically have relatively high amounts of disorder, in contrast to the mostly structured proteins that are predicted accurately by all 13 methods. Interestingly, each predictor provides the most accurate results for some number of proteins, while the best-performing at the dataset-level method is in fact the best for only about 30% of proteins. Moreover, the majority of proteins are predicted more accurately than the dataset-level performance of the most accurate tool by at least four disorder predictors. While these results suggests that disorder predictors outperform their current benchmark performance for the majority of proteins and that they complement each other, novel tools that accurately identify the hard-to-predict proteins and that make accurate predictions for these proteins are needed.
Collapse
Affiliation(s)
- Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, USA
- Department of Computer Science, Virginia Commonwealth University, USA
| | - Christopher J Oldfield
- Department of Computer Science, Virginia Commonwealth University, USA
- Department of Computer Science, Virginia Commonwealth University, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, USA
- Department of Computer Science, Virginia Commonwealth University, USA
| |
Collapse
|
48
|
Loos MS, Ramakrishnan R, Vranken W, Tsirigotaki A, Tsare EP, Zorzini V, Geyter JD, Yuan B, Tsamardinos I, Klappa M, Schymkowitz J, Rousseau F, Karamanou S, Economou A. Structural Basis of the Subcellular Topology Landscape of Escherichia coli. Front Microbiol 2019; 10:1670. [PMID: 31404336 PMCID: PMC6677119 DOI: 10.3389/fmicb.2019.01670] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 07/08/2019] [Indexed: 11/21/2022] Open
Abstract
Cellular proteomes are distributed in multiple compartments: on DNA, ribosomes, on and inside membranes, or they become secreted. Structural properties that allow polypeptides to occupy subcellular niches, particularly to after crossing membranes, remain unclear. We compared intrinsic and extrinsic features in cytoplasmic and secreted polypeptides of the Escherichia coli K-12 proteome. Structural features between the cytoplasmome and secretome are sharply distinct, such that a signal peptide-agnostic machine learning tool distinguishes cytoplasmic from secreted proteins with 95.5% success. Cytoplasmic polypeptides are enriched in aliphatic, aromatic, charged and hydrophobic residues, unique folds and higher early folding propensities. Secretory polypeptides are enriched in polar/small amino acids, β folds, have higher backbone dynamics, higher disorder and contact order and are more often intrinsically disordered. These non-random distributions and experimental evidence imply that evolutionary pressure selected enhanced secretome flexibility, slow folding and looser structures, placing the secretome in a distinct protein class. These adaptations protect the secretome from premature folding during its cytoplasmic transit, optimize its lipid bilayer crossing and allowed it to acquire cell envelope specific chemistries. The latter may favor promiscuous multi-ligand binding, sensing of stress and cell envelope structure changes. In conclusion, enhanced flexibility, slow folding, looser structures and unique folds differentiate the secretome from the cytoplasmome. These findings have wide implications on the structural diversity and evolution of modern proteomes and the protein folding problem.
Collapse
Affiliation(s)
- Maria S Loos
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium
| | - Reshmi Ramakrishnan
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium.,VIB Switch Laboratory, Department for Cellular and Molecular Medicine, VIB-KU Leuven Center for Brain & Disease Research, KU Leuven, Leuven, Belgium
| | - Wim Vranken
- Interuniversity Institute of Bioinformatics in Brussels, Free University of Brussels, Brussels, Belgium.,Structural Biology Brussels, Vrije Universiteit Brussel and Center for Structural Biology, Brussels, Belgium
| | - Alexandra Tsirigotaki
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium
| | - Evrydiki-Pandora Tsare
- Metabolic Engineering & Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research and Technology-Hellas, Patras, Greece
| | - Valentina Zorzini
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium
| | - Jozefien De Geyter
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium
| | - Biao Yuan
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium
| | - Ioannis Tsamardinos
- Gnosis Data Analysis PC, Heraklion, Greece.,Department of Computer Science, University of Crete, Heraklion, Greece
| | - Maria Klappa
- Metabolic Engineering & Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research and Technology-Hellas, Patras, Greece
| | - Joost Schymkowitz
- VIB Switch Laboratory, Department for Cellular and Molecular Medicine, VIB-KU Leuven Center for Brain & Disease Research, KU Leuven, Leuven, Belgium
| | - Frederic Rousseau
- VIB Switch Laboratory, Department for Cellular and Molecular Medicine, VIB-KU Leuven Center for Brain & Disease Research, KU Leuven, Leuven, Belgium
| | - Spyridoula Karamanou
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium
| | - Anastassios Economou
- Department of Microbiology and Immunology, Laboratory of Molecular Bacteriology, Rega Institute, KU Leuven, Leuven, Belgium.,Gnosis Data Analysis PC, Heraklion, Greece
| |
Collapse
|
49
|
Balcerak A, Trebinska-Stryjewska A, Konopinski R, Wakula M, Grzybowska EA. RNA-protein interactions: disorder, moonlighting and junk contribute to eukaryotic complexity. Open Biol 2019; 9:190096. [PMID: 31213136 PMCID: PMC6597761 DOI: 10.1098/rsob.190096] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
RNA-protein interactions are crucial for most biological processes in all organisms. However, it appears that the complexity of RNA-based regulation increases with the complexity of the organism, creating additional regulatory circuits, the scope of which is only now being revealed. It is becoming apparent that previously unappreciated features, such as disordered structural regions in proteins or non-coding regions in DNA leading to higher plasticity and pliability in RNA-protein complexes, are in fact essential for complex, precise and fine-tuned regulation. This review addresses the issue of the role of RNA-protein interactions in generating eukaryotic complexity, focusing on the newly characterized disordered RNA-binding motifs, moonlighting of metabolic enzymes, RNA-binding proteins interactions with different RNA species and their participation in regulatory networks of higher order.
Collapse
Affiliation(s)
- Anna Balcerak
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| | - Alicja Trebinska-Stryjewska
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland.,2 Biomedical Engineering Centre, Institute of Optoelectronics, Military University of Technology , Sylwestra Kaliskiego 2, 00-908 Warsaw , Poland
| | - Ryszard Konopinski
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| | - Maciej Wakula
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| | - Ewa Anna Grzybowska
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| |
Collapse
|
50
|
Lee HM, Kwon SB, Son A, Kim DH, Kim KH, Lim J, Kwon YG, Kang JS, Lee BK, Byun YH, Seong BL. Stabilization of Intrinsically Disordered DKK2 Protein by Fusion to RNA-Binding Domain. Int J Mol Sci 2019; 20:ijms20112847. [PMID: 31212691 PMCID: PMC6600415 DOI: 10.3390/ijms20112847] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/11/2019] [Accepted: 06/10/2019] [Indexed: 12/26/2022] Open
Abstract
Intrinsic disorders are a common feature of hub proteins in eukaryotic interactomes controlling the signaling pathways. The intrinsically disordered proteins (IDPs) are prone to misfolding, and maintaining their functional stability remains a major challenge in validating their therapeutic potentials. Considering that IDPs are highly enriched in RNA-binding proteins (RBPs), here we reasoned and confirmed that IDPs could be stabilized by fusion to RBPs. Dickkopf2 (DKK2), Wnt antagonist and a prototype IDP, was fused with lysyl-tRNA synthetase (LysRS), with or without the fragment crystallizable (Fc) domain of an immunoglobulin and expressed predominantly as a soluble form from a bacterial host. The functional competence was confirmed by in vitro Wnt signaling reporter and tube formation in human umbilical vein endothelial cells (HUVECs) and in vivo Matrigel plug assay. The removal of LysRS by site-specific protease cleavage prompted the insoluble aggregation, confirming that the linkage to RBP chaperones the functional competence of IDPs. While addressing to DKK2 as a key modulator for cancer and ischemic vascular diseases, our results suggest the use of RBPs as stabilizers of disordered proteinaceous materials for acquiring and maintaining the structural stability and functional competence, which would impact the druggability of a variety of IDPs from human proteome.
Collapse
Affiliation(s)
- Hye Min Lee
- Department of Biotechnology, College of Life Sciences and Biotechnology, Yonsei University, Seoul 03722, Korea.
- Vaccine Translational Research Center, Yonsei University, Seoul 03722, Korea.
| | - Soon Bin Kwon
- Department of Biotechnology, College of Life Sciences and Biotechnology, Yonsei University, Seoul 03722, Korea.
- Vaccine Translational Research Center, Yonsei University, Seoul 03722, Korea.
| | - Ahyun Son
- Department of Biotechnology, College of Life Sciences and Biotechnology, Yonsei University, Seoul 03722, Korea.
- Vaccine Translational Research Center, Yonsei University, Seoul 03722, Korea.
| | - Doo Hyun Kim
- Department of Pharmacology, and Center for Cancer Research and Diagnostic Medicine, IBST, School of Medicine, Konkuk University, Seoul 05030, Korea.
| | - Kyun-Hwan Kim
- Department of Pharmacology, and Center for Cancer Research and Diagnostic Medicine, IBST, School of Medicine, Konkuk University, Seoul 05030, Korea.
| | - Jonghyo Lim
- Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea.
| | - Young-Guen Kwon
- Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea.
| | - Jin Sun Kang
- ProCell R&D Institute, ProCell Therapeutics, Inc., Ace-Twin Tower II, Guro3-dong, Guro-gu, Seoul 08381, Korea.
| | - Byung Kyu Lee
- ProCell R&D Institute, ProCell Therapeutics, Inc., Ace-Twin Tower II, Guro3-dong, Guro-gu, Seoul 08381, Korea.
| | - Young Ho Byun
- Department of Biotechnology, College of Life Sciences and Biotechnology, Yonsei University, Seoul 03722, Korea.
- Vaccine Translational Research Center, Yonsei University, Seoul 03722, Korea.
| | - Baik L Seong
- Department of Biotechnology, College of Life Sciences and Biotechnology, Yonsei University, Seoul 03722, Korea.
- Vaccine Translational Research Center, Yonsei University, Seoul 03722, Korea.
| |
Collapse
|