1
|
Idrees S, Paudel KR, Sadaf T, Hansbro PM. Uncovering domain motif interactions using high-throughput protein-protein interaction detection methods. FEBS Lett 2024; 598:725-742. [PMID: 38439692 DOI: 10.1002/1873-3468.14841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 01/09/2024] [Accepted: 02/18/2024] [Indexed: 03/06/2024]
Abstract
Protein-protein interactions (PPIs) are often mediated by short linear motifs (SLiMs) in one protein and domain in another, known as domain-motif interactions (DMIs). During the past decade, SLiMs have been studied to find their role in cellular functions such as post-translational modifications, regulatory processes, protein scaffolding, cell cycle progression, cell adhesion, cell signalling and substrate selection for proteasomal degradation. This review provides a comprehensive overview of the current PPI detection techniques and resources, focusing on their relevance to capturing interactions mediated by SLiMs. We also address the challenges associated with capturing DMIs. Moreover, a case study analysing the BioGrid database as a source of DMI prediction revealed significant known DMI enrichment in different PPI detection methods. Overall, it can be said that current high-throughput PPI detection methods can be a reliable source for predicting DMIs.
Collapse
Affiliation(s)
- Sobia Idrees
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
- Centre for Inflammation, Centenary Institute and Faculty of Science, School of Life Sciences, University of Technology Sydney, Australia
| | - Keshav Raj Paudel
- Centre for Inflammation, Centenary Institute and Faculty of Science, School of Life Sciences, University of Technology Sydney, Australia
| | - Tayyaba Sadaf
- Centre for Inflammation, Centenary Institute and Faculty of Science, School of Life Sciences, University of Technology Sydney, Australia
| | - Philip M Hansbro
- Centre for Inflammation, Centenary Institute and Faculty of Science, School of Life Sciences, University of Technology Sydney, Australia
| |
Collapse
|
2
|
Idrees S, Paudel KR. Proteome-wide assessment of human interactome as a source of capturing domain-motif and domain-domain interactions. J Cell Commun Signal 2024; 18:e12014. [PMID: 38545252 PMCID: PMC10964934 DOI: 10.1002/ccs3.12014] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Accepted: 12/11/2023] [Indexed: 06/29/2024] Open
Abstract
Protein-protein interactions (PPIs) play a crucial role in various biological processes by establishing domain-motif (DMI) and domain-domain interactions (DDIs). While the existence of real DMIs/DDIs is generally assumed, it is rarely tested; therefore, this study extensively compared high-throughput methods and public PPI repositories as sources for DMI and DDI prediction based on the assumption that the human interactome provides sufficient data for the reliable identification of DMIs and DDIs. Different datasets from leading high-throughput methods (Yeast two-hybrid [Y2H], Affinity Purification coupled Mass Spectrometry [AP-MS], and Co-fractionation-coupled Mass Spectrometry) were assessed for their ability to capture DMIs and DDIs using known DMI/DDI information. High-throughput methods were not notably worse than PPI databases and, in some cases, appeared better. In conclusion, all PPI datasets demonstrated significant enrichment in DMIs and DDIs (p-value <0.001), establishing Y2H and AP-MS as reliable methods for predicting these interactions. This study provides valuable insights for biologists in selecting appropriate methods for predicting DMIs, ultimately aiding in SLiM discovery.
Collapse
Affiliation(s)
- Sobia Idrees
- School of Biotechnology and Biomolecular SciencesUniversity of New South WalesSydneyNew South WalesAustralia
- Centre for InflammationCentenary Institute and the University of Technology SydneySchool of Life SciencesFaculty of ScienceSydneyNew South WalesAustralia
| | - Keshav Raj Paudel
- Centre for InflammationCentenary Institute and the University of Technology SydneySchool of Life SciencesFaculty of ScienceSydneyNew South WalesAustralia
| |
Collapse
|
3
|
Kurgan L, Hu G, Wang K, Ghadermarzi S, Zhao B, Malhis N, Erdős G, Gsponer J, Uversky VN, Dosztányi Z. Tutorial: a guide for the selection of fast and accurate computational tools for the prediction of intrinsic disorder in proteins. Nat Protoc 2023; 18:3157-3172. [PMID: 37740110 DOI: 10.1038/s41596-023-00876-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 06/21/2023] [Indexed: 09/24/2023]
Abstract
Intrinsic disorder is instrumental for a wide range of protein functions, and its analysis, using computational predictions from primary structures, complements secondary and tertiary structure-based approaches. In this Tutorial, we provide an overview and comparison of 23 publicly available computational tools with complementary parameters useful for intrinsic disorder prediction, partly relying on results from the Critical Assessment of protein Intrinsic Disorder prediction experiment. We consider factors such as accuracy, runtime, availability and the need for functional insights. The selected tools are available as web servers and downloadable programs, offer state-of-the-art predictions and can be used in a high-throughput manner. We provide examples and instructions for the selected tools to illustrate practical aspects related to the submission, collection and interpretation of predictions, as well as the timing and their limitations. We highlight two predictors for intrinsically disordered proteins, flDPnn as accurate and fast and IUPred as very fast and moderately accurate, while suggesting ANCHOR2 and MoRFchibi as two of the best-performing predictors for intrinsically disordered region binding. We link these tools to additional resources, including databases of predictions and web servers that integrate multiple predictive methods. Altogether, this Tutorial provides a hands-on guide to comparatively evaluating multiple predictors, submitting and collecting their own predictions, and reading and interpreting results. It is suitable for experimentalists and computational biologists interested in accurately and conveniently identifying intrinsic disorder, facilitating the functional characterization of the rapidly growing collections of protein sequences.
Collapse
Affiliation(s)
- Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| | - Gang Hu
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| | - Kui Wang
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| | - Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Nawar Malhis
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Gábor Erdős
- MTA-ELTE Momentum Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary
| | - Jörg Gsponer
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada.
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Byrd Alzheimer's Center and Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
| | - Zsuzsanna Dosztányi
- MTA-ELTE Momentum Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary.
| |
Collapse
|
4
|
Idrees S, Paudel KR, Sadaf T, Hansbro PM. How different viruses perturb host cellular machinery via short linear motifs. EXCLI JOURNAL 2023; 22:1113-1128. [PMID: 38054205 PMCID: PMC10694346 DOI: 10.17179/excli2023-6328] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 10/18/2023] [Indexed: 12/07/2023]
Abstract
The virus interacts with its hosts by developing protein-protein interactions. Most viruses employ protein interactions to imitate the host protein: A viral protein with the same amino acid sequence or structure as the host protein attaches to the host protein's binding partner and interferes with the host protein's pathways. Being opportunistic, viruses have evolved to manipulate host cellular mechanisms by mimicking short linear motifs. In this review, we shed light on the current understanding of mimicry via short linear motifs and focus on viral mimicry by genetically different viral subtypes by providing recent examples of mimicry evidence and how high-throughput methods can be a reliable source to study SLiM-mediated viral mimicry.
Collapse
Affiliation(s)
- Sobia Idrees
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Centre for Inflammation, Centenary Institute and the University of Technology Sydney, School of Life Sciences, Faculty of Science, Sydney, New South Wales, Australia
| | - Keshav Raj Paudel
- Centre for Inflammation, Centenary Institute and the University of Technology Sydney, School of Life Sciences, Faculty of Science, Sydney, New South Wales, Australia
| | - Tayyaba Sadaf
- Centre for Inflammation, Centenary Institute and the University of Technology Sydney, School of Life Sciences, Faculty of Science, Sydney, New South Wales, Australia
| | - Philip M. Hansbro
- Centre for Inflammation, Centenary Institute and the University of Technology Sydney, School of Life Sciences, Faculty of Science, Sydney, New South Wales, Australia
| |
Collapse
|
5
|
Peng Z, Li Z, Meng Q, Zhao B, Kurgan L. CLIP: accurate prediction of disordered linear interacting peptides from protein sequences using co-evolutionary information. Brief Bioinform 2023; 24:6858950. [PMID: 36458437 DOI: 10.1093/bib/bbac502] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 09/30/2022] [Accepted: 10/24/2022] [Indexed: 12/04/2022] Open
Abstract
One of key features of intrinsically disordered regions (IDRs) is facilitation of protein-protein and protein-nucleic acids interactions. These disordered binding regions include molecular recognition features (MoRFs), short linear motifs (SLiMs) and longer binding domains. Vast majority of current predictors of disordered binding regions target MoRFs, with a handful of methods that predict SLiMs and disordered protein-binding domains. A new and broader class of disordered binding regions, linear interacting peptides (LIPs), was introduced recently and applied in the MobiDB resource. LIPs are segments in protein sequences that undergo disorder-to-order transition upon binding to a protein or a nucleic acid, and they cover MoRFs, SLiMs and disordered protein-binding domains. Although current predictors of MoRFs and disordered protein-binding regions could be used to identify some LIPs, there are no dedicated sequence-based predictors of LIPs. To this end, we introduce CLIP, a new predictor of LIPs that utilizes robust logistic regression model to combine three complementary types of inputs: co-evolutionary information derived from multiple sequence alignments, physicochemical profiles and disorder predictions. Ablation analysis suggests that the co-evolutionary information is particularly useful for this prediction and that combining the three inputs provides substantial improvements when compared to using these inputs individually. Comparative empirical assessments using low-similarity test datasets reveal that CLIP secures area under receiver operating characteristic curve (AUC) of 0.8 and substantially improves over the results produced by the closest current tools that predict MoRFs and disordered protein-binding regions. The webserver of CLIP is freely available at http://biomine.cs.vcu.edu/servers/CLIP/ and the standalone code can be downloaded from http://yanglab.qd.sdu.edu.cn/download/CLIP/.
Collapse
Affiliation(s)
- Zhenling Peng
- Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China.,Frontier Science Center for Nonlinear Expectations, Ministry of Education, Qingdao, 266237, China
| | - Zixia Li
- Center for Applied Mathematics, Tianjin University, Tianjin, 300072, China
| | - Qiaozhen Meng
- College of Intelligence and Computing, Tianjin University, Tianjin, 300072, China
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
6
|
Zheng W, Du Z, Ko SB, Wickramasinghe N, Yang S. Incorporation of D 2O-Induced Fluorine Chemical Shift Perturbations into Ensemble-Structure Characterization of the ERalpha Disordered Region. J Phys Chem B 2022; 126:9176-9186. [PMID: 36331868 PMCID: PMC10066504 DOI: 10.1021/acs.jpcb.2c05456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Structural characterization of intrinsically disordered proteins (IDPs) requires a concerted effort between experiments and computations by accounting for their conformational heterogeneity. Given the diversity of experimental tools providing local and global structural information, constructing an experimental restraint-satisfying structural ensemble remains challenging. Here, we use the disordered N-terminal domain (NTD) of the estrogen receptor alpha (ERalpha) as a model system to combine existing small-angle X-ray scattering (SAXS) and hydroxyl radical protein footprinting (HRPF) data and newly acquired solvent accessibility data via D2O-induced fluorine chemical shifting (DFCS) measurements. A new set of DFCS data for the solvent exposure of a set of 12 amino acid positions were added to complement previously acquired HRPF measurements for the solvent exposure of the other 16 nonoverlapping amino acids, thereby improving the NTD ensemble characterization considerably. We also found that while choosing an initial ensemble of structures generated from a different atomic-level force field or sampling/modeling method can lead to distinct contact maps even when the same sets of experimental measurements were used for ensemble-fitting, comparative analyses from these initial ensembles reveal commonly recurring structural features in their ensemble-averaged contact map. Specifically, nonlocal or long-range transient interactions were found consistently between the N-terminal segments and the central region, sufficient to mediate the conformational ensemble and regulate how the NTD interacts with its coactivator proteins.
Collapse
Affiliation(s)
- Wenwei Zheng
- College of Integrative Sciences and Arts, Arizona State University, Mesa, Arizona 85212, United States
| | - Zhanwen Du
- Center for Proteomics and Department of Nutrition, School of Medicine, Case Western Reserve University, Cleveland, Ohio, 44106, United States
| | - Soo Bin Ko
- Center for Proteomics and Department of Nutrition, School of Medicine, Case Western Reserve University, Cleveland, Ohio, 44106, United States
| | - Nalinda Wickramasinghe
- Chemistry-NMR Facility, Case Western Reserve University, Cleveland, Ohio 44106, United States
| | - Sichun Yang
- Center for Proteomics and Department of Nutrition, School of Medicine, Case Western Reserve University, Cleveland, Ohio, 44106, United States
| |
Collapse
|
7
|
Kulkarni P, Leite VBP, Roy S, Bhattacharyya S, Mohanty A, Achuthan S, Singh D, Appadurai R, Rangarajan G, Weninger K, Orban J, Srivastava A, Jolly MK, Onuchic JN, Uversky VN, Salgia R. Intrinsically disordered proteins: Ensembles at the limits of Anfinsen's dogma. BIOPHYSICS REVIEWS 2022; 3:011306. [PMID: 38505224 PMCID: PMC10903413 DOI: 10.1063/5.0080512] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 02/17/2022] [Indexed: 03/21/2024]
Abstract
Intrinsically disordered proteins (IDPs) are proteins that lack rigid 3D structure. Hence, they are often misconceived to present a challenge to Anfinsen's dogma. However, IDPs exist as ensembles that sample a quasi-continuum of rapidly interconverting conformations and, as such, may represent proteins at the extreme limit of the Anfinsen postulate. IDPs play important biological roles and are key components of the cellular protein interaction network (PIN). Many IDPs can interconvert between disordered and ordered states as they bind to appropriate partners. Conformational dynamics of IDPs contribute to conformational noise in the cell. Thus, the dysregulation of IDPs contributes to increased noise and "promiscuous" interactions. This leads to PIN rewiring to output an appropriate response underscoring the critical role of IDPs in cellular decision making. Nonetheless, IDPs are not easily tractable experimentally. Furthermore, in the absence of a reference conformation, discerning the energy landscape representation of the weakly funneled IDPs in terms of reaction coordinates is challenging. To understand conformational dynamics in real time and decipher how IDPs recognize multiple binding partners with high specificity, several sophisticated knowledge-based and physics-based in silico sampling techniques have been developed. Here, using specific examples, we highlight recent advances in energy landscape visualization and molecular dynamics simulations to discern conformational dynamics and discuss how the conformational preferences of IDPs modulate their function, especially in phenotypic switching. Finally, we discuss recent progress in identifying small molecules targeting IDPs underscoring the potential therapeutic value of IDPs. Understanding structure and function of IDPs can not only provide new insight on cellular decision making but may also help to refine and extend Anfinsen's structure/function paradigm.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Vitor B. P. Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista (UNESP), São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Susmita Roy
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, West Bengal 741246, India
| | - Supriyo Bhattacharyya
- Translational Bioinformatics, Center for Informatics, Department of Computational and Quantitative Medicine, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Srisairam Achuthan
- Center for Informatics, Division of Research Informatics, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Divyoj Singh
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Rajeswari Appadurai
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, North Carolina 27695, USA
| | | | - Anand Srivastava
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Jose N. Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005-1892, USA
| | | | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| |
Collapse
|
8
|
Ismail H, Liu X, Yang F, Li J, Zahid A, Dou Z, Liu X, Yao X. Mechanisms and regulation underlying membraneless organelle plasticity control. J Mol Cell Biol 2021; 13:239-258. [PMID: 33914074 PMCID: PMC8339361 DOI: 10.1093/jmcb/mjab028] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 03/31/2021] [Accepted: 04/06/2021] [Indexed: 12/11/2022] Open
Abstract
Evolution has enabled living cells to adopt their structural and functional complexity by organizing intricate cellular compartments, such as membrane-bound and membraneless organelles (MLOs), for spatiotemporal catalysis of physiochemical reactions essential for cell plasticity control. Emerging evidence and view support the notion that MLOs are built by multivalent interactions of biomolecules via phase separation and transition mechanisms. In healthy cells, dynamic chemical modifications regulate MLO plasticity, and reversible phase separation is essential for cell homeostasis. Emerging evidence revealed that aberrant phase separation results in numerous neurodegenerative disorders, cancer, and other diseases. In this review, we provide molecular underpinnings on (i) mechanistic understanding of phase separation, (ii) unifying structural and mechanistic principles that underlie this phenomenon, (iii) various mechanisms that are used by cells for the regulation of phase separation, and (iv) emerging therapeutic and other applications.
Collapse
Affiliation(s)
- Hazrat Ismail
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
| | - Xu Liu
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
- Keck Center for Organoids Plasticity Control, Atlanta, GA 30310, USA
| | - Fengrui Yang
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
- Keck Center for Organoids Plasticity Control, Atlanta, GA 30310, USA
| | - Junying Li
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
- Anhui Key Laboratory for Cellular Dynamics & Chemical Biology, Hefei 230027, China
| | - Ayesha Zahid
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
| | - Zhen Dou
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
| | - Xing Liu
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
- Anhui Key Laboratory for Cellular Dynamics & Chemical Biology, Hefei 230027, China
| | - Xuebiao Yao
- MOE Key Laboratory for Membraneless Organelles & Cellular Dynamics and CAS Center for Excellence in Molecular Cell Science, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230027, China
- Anhui Key Laboratory for Cellular Dynamics & Chemical Biology, Hefei 230027, China
| |
Collapse
|
9
|
Katuwawala A, Kurgan L. Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins. Biomolecules 2020; 10:E1636. [PMID: 33291838 PMCID: PMC7762010 DOI: 10.3390/biom10121636] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Revised: 11/26/2020] [Accepted: 12/03/2020] [Indexed: 01/18/2023] Open
Abstract
With over 60 disorder predictors, users need help navigating the predictor selection task. We review 28 surveys of disorder predictors, showing that only 11 include assessment of predictive performance. We identify and address a few drawbacks of these past surveys. To this end, we release a novel benchmark dataset with reduced similarity to the training sets of the considered predictors. We use this dataset to perform a first-of-its-kind comparative analysis that targets two large functional families of disordered proteins that interact with proteins and with nucleic acids. We show that limiting sequence similarity between the benchmark and the training datasets has a substantial impact on predictive performance. We also demonstrate that predictive quality is sensitive to the use of the well-annotated order and inclusion of the fully structured proteins in the benchmark datasets, both of which should be considered in future assessments. We identify three predictors that provide favorable results using the new benchmark set. While we find that VSL2B offers the most accurate and robust results overall, ESpritz-DisProt and SPOT-Disorder perform particularly well for disordered proteins. Moreover, we find that predictions for the disordered protein-binding proteins suffer low predictive quality compared to generic disordered proteins and the disordered nucleic acids-binding proteins. This can be explained by the high disorder content of the disordered protein-binding proteins, which makes it difficult for the current methods to accurately identify ordered regions in these proteins. This finding motivates the development of a new generation of methods that would target these difficult-to-predict disordered proteins. We also discuss resources that support users in collecting and identifying high-quality disorder predictions.
Collapse
Affiliation(s)
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA;
| |
Collapse
|
10
|
Agarwal PR, Lahiri A. Comparative study of the SBP-box gene family in rice siblings. J Biosci 2020. [DOI: 10.1007/s12038-020-00048-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
11
|
Abstract
Short linear motifs (SLiMs) are important mediators of interactions between intrinsically disordered regions of proteins and their interaction partners. Here, we detail instructions for the computational prediction of SLiMs in disordered protein regions, using the main tools of the SLiMSuite package: (1) SLiMProb identifies and calculates enrichment of predefined motifs in a set of proteins; (2) SLiMFinder predicts SLiMs de novo in a set of proteins, accounting for evolutionary relationships; (3) QSLiMFinder increases SLiMFinder sensitivity by focusing SLiM prediction on a specific query protein/region; (4) CompariMotif compares predicted SLiMs to known SLiMs or other SLiM predictions to identify common patterns. For each tool, command-line and online server examples are provided. Detailed notes provide additional advice on different applications of SLiMSuite, including batch running of multiple datasets and conservation masking using alignments of predicted orthologues.
Collapse
|
12
|
Prytuliak R, Volkmer M, Meier M, Habermann BH. HH-MOTiF: de novo detection of short linear motifs in proteins by Hidden Markov Model comparisons. Nucleic Acids Res 2019; 45:W470-W477. [PMID: 28460141 PMCID: PMC5570144 DOI: 10.1093/nar/gkx341] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 04/18/2017] [Indexed: 12/18/2022] Open
Abstract
Short linear motifs (SLiMs) in proteins are self-sufficient functional sequences that specify interaction sites for other molecules and thus mediate a multitude of functions. Computational, as well as experimental biological research would significantly benefit, if SLiMs in proteins could be correctly predicted de novo with high sensitivity. However, de novo SLiM prediction is a difficult computational task. When considering recall and precision, the performances of published methods indicate remaining challenges in SLiM discovery. We have developed HH-MOTiF, a web-based method for SLiM discovery in sets of mainly unrelated proteins. HH-MOTiF makes use of evolutionary information by creating Hidden Markov Models (HMMs) for each input sequence and its closely related orthologs. HMMs are compared against each other to retrieve short stretches of homology that represent potential SLiMs. These are transformed to hierarchical structures, which we refer to as motif trees, for further processing and evaluation. Our approach allows us to identify degenerate SLiMs, while still maintaining a reasonably high precision. When considering a balanced measure for recall and precision, HH-MOTiF performs better on test data compared to other SLiM discovery methods. HH-MOTiF is freely available as a web-server at http://hh-motif.biochem.mpg.de.
Collapse
Affiliation(s)
- Roman Prytuliak
- Computational Biology Group, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Michael Volkmer
- Computational Biology Group, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Markus Meier
- Research Group Quantitative Biology and Bioinformatics, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Bianca H Habermann
- Computational Biology Group, Max Planck Institute of Biochemistry, Martinsried, Germany.,Computational Biology Group, Developmental Biology Institute of Marseille (IBDM) UMR 7288, CNRS, Aix Marseille Université, Marseille 13288 Cedex 9, France
| |
Collapse
|
13
|
Krystkowiak I, Davey NE. SLiMSearch: a framework for proteome-wide discovery and annotation of functional modules in intrinsically disordered regions. Nucleic Acids Res 2019; 45:W464-W469. [PMID: 28387819 PMCID: PMC5570202 DOI: 10.1093/nar/gkx238] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2017] [Accepted: 04/05/2017] [Indexed: 12/12/2022] Open
Abstract
The extensive intrinsically disordered regions of higher eukaryotic proteomes contain vast numbers of functional interaction modules known as short linear motifs (SLiMs). Here, we present SLiMSearch, a motif discovery tool that scans a motif consensus, representing the specificity determinants of a motif-binding domain, against a proteome to discover putative novel motif instances. SLiMSearch applies several distinct and complementary approaches exploiting the common properties of SLiMs to predict novel motifs. Consensus matches are annotated with overlapping sequence annotation, including feature information describing protein modular architecture, post-translational modification, structure, sequence variation and experimental characterisation of functional regions. Discriminatory motif attributes such as conservation and accessibility are also calculated. In addition, SLiMSearch provides functional enrichment and evolutionary analysis tools. The enrichment tool analyses GO terms, keywords and interacting partner enrichment to indicate possible motif function. The evolutionary tool evaluates motif taxonomic range and the conservation of motif sequence context. Consensus matches can be filtered based on motif attributes such as accessibility and taxonomic range; or by the localisation, interacting partners or ontology annotation of the peptide-containing protein. SLiMSearch supports a range of species of experimental and therapeutic relevance and is available online at http://slim.ucd.ie/slimsearch/.
Collapse
Affiliation(s)
- Izabella Krystkowiak
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland.,UCD School of Medicine & Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
| | - Norman E Davey
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland.,UCD School of Medicine & Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
| |
Collapse
|
14
|
Abstract
Eukaryotic cells organize their intracellular components into organelles that can be membrane-bound or membraneless. A large number of membraneless organelles, including nucleoli, Cajal bodies, P-bodies, and stress granules, exist as liquid droplets within the cell and arise from the condensation of cellular material in a process termed liquid-liquid phase separation (LLPS). Beyond a mere organizational tool, concentrating cellular components into membraneless organelles tunes biochemical reactions and improves cellular fitness during stress. In this review, we provide an overview of the molecular underpinnings of the formation and regulation of these membraneless organelles. This molecular understanding explains emergent properties of these membraneless organelles and shines new light on neurodegenerative diseases, which may originate from disturbances in LLPS and membraneless organelles.
Collapse
Affiliation(s)
- Edward Gomes
- From the Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| | - James Shorter
- From the Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| |
Collapse
|
15
|
IDPs in macromolecular complexes: the roles of multivalent interactions in diverse assemblies. Curr Opin Struct Biol 2018; 49:36-43. [PMID: 29306779 DOI: 10.1016/j.sbi.2017.12.007] [Citation(s) in RCA: 85] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 12/18/2017] [Accepted: 12/19/2017] [Indexed: 01/21/2023]
Abstract
Intrinsically disordered proteins (IDPs) have critical roles in a diverse array of cellular functions. Of relevance here is that they are components of macromolecular complexes, where their conformational flexibility helps mediate interactions with binding partners. IDPs often interact with their binding partners through short sequence motifs, commonly repeated within the disordered regions. As such, multivalent interactions are common for IDPs and their binding partners within macromolecular complexes. Here we discuss the importance of IDP multivalency in three very different macromolecular assemblies: biomolecular condensates, the nuclear pore, and the cytoskeleton.
Collapse
|
16
|
Abstract
Intrinsically disordered proteins and regions (IDPs and IDRs) are involved in a wide range of cellular functions and they often facilitate interactions with RNAs, DNAs, and proteins. Although many computational methods can predict IDPs and IDRs in protein sequences, only a few methods predict their functions and these functions primarily concern protein binding. We describe how to use the first computational method DisoRDPbind for high-throughput prediction of multiple functions of disordered regions. Our method predicts the RNA-, DNA-, and protein-binding residues located in IDRs in the input protein sequences. DisoRDPbind provides accurate predictions and is sufficiently fast to make predictions for full genomes. Our method is implemented as a user-friendly webserver that is freely available at http://biomine.ece.ualberta.ca/DisoRDPbind/ . We overview our predictor, discuss how to run the webserver, and show how to interpret the corresponding results. We also demonstrate the utility of our method based on two case studies, human BRCA1 protein that binds various proteins and DNA, and yeast 60S ribosomal protein L4 that interacts with proteins and RNA.
Collapse
|
17
|
Conformational analysis of short polar side-chain amino-acids through umbrella sampling and DFT calculations. J Mol Model 2016; 22:273. [PMID: 27783230 DOI: 10.1007/s00894-016-3139-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 10/09/2016] [Indexed: 10/20/2022]
Abstract
Molecular and quantum mechanics calculations were carried out in a series of tripeptides (GXG, where X = D, N and C) as models of the unfolded states of proteins. The selected central amino acids, especially aspartic acid (D) and asparagine (N) are known to present significant average conformations in partially allowed areas of the Ramachandran plot, which have been suggested to be important in unfolded protein regions. In this report, we present the calculation of the propensity values through an umbrella sampling procedure in combination with the calculation of the NMR J-coupling constants obtained by a DFT model. The experimental NMR observations can be reasonably explained in terms of a conformational distribution where PPII and β basins sum up propensities above 0.9. The conformational analysis of the side chain dihedral angle (χ1), along with the computation of 3J(HαHβ), revealed a preference for the g - and g + rotamers. These may be connected with the presence of intermolecular H-bonding and carbonyl-carbonyl interactions sampled in the PPII and β basins. Taking into account all those results, it can be established that these residues show a similar behavior to other amino acids in short peptides regarding backbone φ,ψ dihedral angle distribution, in agreement with some experimental analysis of capped dipeptides.
Collapse
|
18
|
Caulobacter PopZ forms an intrinsically disordered hub in organizing bacterial cell poles. Proc Natl Acad Sci U S A 2016; 113:12490-12495. [PMID: 27791060 DOI: 10.1073/pnas.1602380113] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Despite their relative simplicity, bacteria have complex anatomy at the subcellular level. At the cell poles of Caulobacter crescentus, a 177-amino acid (aa) protein called PopZ self-assembles into 3D polymeric superstructures. Remarkably, we find that this assemblage interacts directly with at least eight different proteins, which are involved in cell cycle regulation and chromosome segregation. The binding determinants within PopZ include 24 aa at the N terminus, a 32-aa region near the C-terminal homo-oligomeric assembly domain, and portions of an intervening linker region. Together, the N-terminal 133 aa of PopZ are sufficient for interacting with all binding partners, even in the absence of homo-oligomeric assembly. Structural analysis of this region revealed that it is intrinsically disordered, similar to p53 and other hub proteins that organize complex signaling networks in eukaryotic cells. Through live-cell photobleaching, we find rapid binding kinetics between PopZ and its partners, suggesting many pole-localized proteins become concentrated at cell poles through rapid cycles of binding and unbinding within the PopZ scaffold. We conclude that some bacteria, similar to their eukaryotic counterparts, use intrinsically disordered hub proteins for network assembly and subcellular organization.
Collapse
|
19
|
Structural Basis for Recombinatorial Permissiveness in the Generation of Anaplasma marginale Msp2 Antigenic Variants. Infect Immun 2016; 84:2740-7. [PMID: 27400719 DOI: 10.1128/iai.00391-16] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 07/05/2016] [Indexed: 12/20/2022] Open
Abstract
Sequential expression of outer membrane protein antigenic variants is an evolutionarily convergent mechanism used by bacterial pathogens to escape host immune clearance and establish persistent infection. Variants must be sufficiently structurally distinct to escape existing immune effectors yet retain the core structural elements required for localization and function within the outer membrane. We examined this balance using Anaplasma marginale, which generates antigenic variants in the outer membrane protein Msp2 using gene conversion. The overwhelming majority of Msp2 variants expressed during long-term persistent infection are mosaics, derived by recombination of oligonucleotide segments from multiple alleles to form unique hypervariable regions (HVR). As a result, the mosaics are not under long-term selective pressure to encode a functional protein; consequently, we hypothesized that the Msp2 HVR is structurally permissive for mosaic expression. Using an integrated approach of predictive modeling with determination of the native Msp2 protein structure and function, we demonstrate that structured elements, most notably, β-sheets, are significantly concentrated in the highly conserved N- and C-terminal domains. In contrast, the HVR is overwhelmingly a random coil, with the structured α-helices and β-sheets being confined to the genomically defined structural tethers that separate the antigenically variable microdomains. This structure is supported by the surface exposure of the HVR microdomains and the slow diffusion-type porin function in native Msp2. Importantly, the predominance of the random coil provides plasticity for the formation of functional HVR mosaics and realization of the full potential of segmental gene conversion to dramatically expand the variant repertoire.
Collapse
|
20
|
DeForte S, Uversky VN. Order, Disorder, and Everything in Between. Molecules 2016; 21:molecules21081090. [PMID: 27548131 PMCID: PMC6274243 DOI: 10.3390/molecules21081090] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Revised: 08/10/2016] [Accepted: 08/11/2016] [Indexed: 02/04/2023] Open
Abstract
In addition to the “traditional” proteins characterized by the unique crystal-like structures needed for unique functions, it is increasingly recognized that many proteins or protein regions (collectively known as intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs)), being biologically active, do not have a specific 3D-structure in their unbound states under physiological conditions. There are also subtler categories of disorder, such as conditional (or dormant) disorder and partial disorder. Both the ability of a protein/region to fold into a well-ordered functional unit or to stay intrinsically disordered but functional are encoded in the amino acid sequence. Structurally, IDPs/IDPRs are characterized by high spatiotemporal heterogeneity and exist as dynamic structural ensembles. It is important to remember, however, that although structure and disorder are often treated as binary states, they actually sit on a structural continuum.
Collapse
Affiliation(s)
- Shelly DeForte
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
- USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg 194064, Russia.
| |
Collapse
|