1
|
Hou Z, Liu H. Mapping the Protein Kinome: Current Strategy and Future Direction. Cells 2023; 12:cells12060925. [PMID: 36980266 PMCID: PMC10047437 DOI: 10.3390/cells12060925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 02/23/2023] [Accepted: 03/13/2023] [Indexed: 03/30/2023] Open
Abstract
The kinome includes over 500 different protein kinases, which form an integrated kinase network that regulates cellular phosphorylation signals. The kinome plays a central role in almost every cellular process and has strong linkages with many diseases. Thus, the evaluation of the cellular kinome in the physiological environment is essential to understand biological processes, disease development, and to target therapy. Currently, a number of strategies for kinome analysis have been developed, which are based on monitoring the phosphorylation of kinases or substrates. They have enabled researchers to tackle increasingly complex biological problems and pathological processes, and have promoted the development of kinase inhibitors. Additionally, with the increasing interest in how kinases participate in biological processes at spatial scales, it has become urgent to develop tools to estimate spatial kinome activity. With multidisciplinary efforts, a growing number of novel approaches have the potential to be applied to spatial kinome analysis. In this paper, we review the widely used methods used for kinome analysis and the challenges encountered in their applications. Meanwhile, potential approaches that may be of benefit to spatial kinome study are explored.
Collapse
Affiliation(s)
- Zhanwu Hou
- Center for Mitochondrial Biology and Medicine, Douglas C. Wallace Institute for Mitochondrial and Epigenetic Information Sciences, The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
| | - Huadong Liu
- School of Health and Life Science, University of Health and Rehabilitation Sciences, Qingdao 266071, China
| |
Collapse
|
2
|
Invergo BM. Accurate, high-coverage assignment of in vivo protein kinases to phosphosites from in vitro phosphoproteomic specificity data. PLoS Comput Biol 2022; 18:e1010110. [PMID: 35560139 PMCID: PMC9132282 DOI: 10.1371/journal.pcbi.1010110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 05/25/2022] [Accepted: 04/15/2022] [Indexed: 12/03/2022] Open
Abstract
Phosphoproteomic experiments routinely observe thousands of phosphorylation sites. To understand the intracellular signaling processes that generated this data, one or more causal protein kinases must be assigned to each phosphosite. However, limited knowledge of kinase specificity typically restricts assignments to a small subset of a kinome. Starting from a statistical model of a high-throughput, in vitro kinase-substrate assay, I have developed an approach to high-coverage, multi-label kinase-substrate assignment called IV-KAPhE (“In vivo-Kinase Assignment for Phosphorylation Evidence”). Tested on human data, IV-KAPhE outperforms other methods of similar scope. Such computational methods generally predict a densely connected kinase-substrate network, with most sites targeted by multiple kinases, pointing either to unaccounted-for biochemical constraints or significant cross-talk and signaling redundancy. I show that such predictions can potentially identify biased kinase-site misannotations within families of closely related kinase isozymes and they provide a robust basis for kinase activity analysis. Proteins can pass around information inside cells about changes in the environment. This process, called intracellular signaling, helps to trigger appropriate cellular responses to environmental changes. One of the main ways information is passed to proteins is through chemical “tagging,” called phosphorylation, by enzymes called protein kinases. We can measure the phosphorylation state of practically all proteins in a cell at any moment. Starting from known cases of phosphorylation by a kinase, many computational methods have been developed to predict if the kinase might tag a certain spot on another protein or if an observed tag was attached by the kinase, with different models for each kinase. I have developed a new method that instead uses a single model to assign one or more kinases to each observed tag, built from the latest large-scale experimental data. This change in focus and unbiased training data allows my method to be significantly more accurate than past methods. I also explored useful applications for my method. For example, I used it to show that much of our knowledge about which kinase is responsible for each tag is probably inaccurately biased towards the commonly studied ones.
Collapse
Affiliation(s)
- Brandon M. Invergo
- Translational Research Exchange @ Exeter, University of Exeter, Exeter, United Kingdom
- * E-mail:
| |
Collapse
|
3
|
Guo X, He H, Yu J, Shi S. PKSPS: a novel method for predicting kinase of specific phosphorylation sites based on maximum weighted bipartite matching algorithm and phosphorylation sequence enrichment analysis. Brief Bioinform 2021; 23:6398688. [PMID: 34661630 DOI: 10.1093/bib/bbab436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 09/10/2021] [Accepted: 09/21/2021] [Indexed: 11/14/2022] Open
Abstract
With the development of biotechnology, a large number of phosphorylation sites have been experimentally confirmed and collected, but only a few of them have kinase annotations. Since experimental methods to detect kinases at specific phosphorylation sites are expensive and accidental, some computational methods have been proposed to predict the kinase of these sites, but most methods only consider single sequence information or single functional network information. In this study, a new method Predicting Kinase of Specific Phosphorylation Sites (PKSPS) is developed to predict kinases of specific phosphorylation sites in human proteins by combining PKSPS-Net with PKSPS-Seq, which considers protein-protein interaction (PPI) network information and sequence information. For PKSPS-Net, kinase-kinase and substrate-substrate similarity are quantified based on the topological similarity of proteins in the PPI network, and maximum weighted bipartite matching algorithm is proposed to predict kinase-substrate relationship. In PKSPS-Seq, phosphorylation sequence enrichment analysis is used to analyze the similarity of local sequences around phosphorylation sites and predict the kinase of specific phosphorylation sites (KSP). PKSPS has been proved to be more effective than the PKSPS-Net or PKSPS-Seq on different sets of kinases. Further comparison results show that the PKSPS method performs better than existing methods. Finally, the case study demonstrates the effectiveness of the PKSPS in predicting kinases of specific phosphorylation sites. The open source code and data of the PKSPS can be obtained from https://github.com/guoxinyunncu/PKSPS.
Collapse
Affiliation(s)
- Xinyun Guo
- Department of Mathematics and Numerical Simulation and High-Performance Computing Laboratory, School of Sciences, Nanchang University, Nanchang 330031, China
| | - Huan He
- Department of Mathematics and Numerical Simulation and High-Performance Computing Laboratory, School of Sciences, Nanchang University, Nanchang 330031, China
| | - Jialin Yu
- Department of Mathematics and Numerical Simulation and High-Performance Computing Laboratory, School of Sciences, Nanchang University, Nanchang 330031, China
| | - Shaoping Shi
- Department of Mathematics and Numerical Simulation and High-Performance Computing Laboratory, School of Sciences, Nanchang University, Nanchang 330031, China
| |
Collapse
|
4
|
Computational Phosphorylation Network Reconstruction: An Update on Methods and Resources. Methods Mol Biol 2021. [PMID: 34270057 DOI: 10.1007/978-1-0716-1625-3_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2023]
Abstract
Most proteins undergo some form of modification after translation, and phosphorylation is one of the most relevant and ubiquitous post-translational modifications. The succession of protein phosphorylation and dephosphorylation catalyzed by protein kinase and phosphatase, respectively, constitutes a key mechanism of molecular information flow in cellular systems. The protein interactions of kinases, phosphatases, and their regulatory subunits and substrates are the main part of phosphorylation networks. To elucidate the landscape of phosphorylation events has been a central goal pursued by both experimental and computational approaches. Substrate specificity (e.g., sequence, structure) or the phosphoproteome has been utilized in an array of different statistical learning methods to infer phosphorylation networks. In this chapter, different computational phosphorylation network inference-related methods and resources are summarized and discussed.
Collapse
|
5
|
Xue B, Jordan B, Rizvi S, Naegle KM. KinPred: A unified and sustainable approach for harnessing proteome-level human kinase-substrate predictions. PLoS Comput Biol 2021; 17:e1008681. [PMID: 33556051 PMCID: PMC7895412 DOI: 10.1371/journal.pcbi.1008681] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 02/19/2021] [Accepted: 01/07/2021] [Indexed: 12/22/2022] Open
Abstract
Tyrosine and serine/threonine kinases are essential regulators of cell processes and are important targets for human therapies. Unfortunately, very little is known about specific kinase-substrate relationships, making it difficult to infer meaning from dysregulated phosphoproteomic datasets or for researchers to identify possible kinases that regulate specific or novel phosphorylation sites. The last two decades have seen an explosion in algorithms to extrapolate from what little is known into the larger unknown-predicting kinase relationships with site-specific substrates using a variety of approaches that include the sequence-specificity of kinase catalytic domains and various other factors, such as evolutionary relationships, co-expression, and protein-protein interaction networks. Unfortunately, a number of limitations prevent researchers from easily harnessing these resources, such as loss of resource accessibility, limited information in publishing that results in a poor mapping to a human reference, and not being updated to match the growth of the human phosphoproteome. Here, we propose a methodological framework for publishing predictions in a unified way, which entails ensuring predictions have been run on a current reference proteome, mapping the same substrates and kinases across resources to a common reference, filtering for the human phosphoproteome, and providing methods for updating the resource easily in the future. We applied this framework on three currently available resources, published in the last decade, which provide kinase-specific predictions in the human proteome. Using the unified datasets, we then explore the role of study bias, the emergent network properties of these predictive algorithms, and comparisons within and between predictive algorithms. The combination of the code for unification and analysis, as well as the unified predictions are available under the resource we named KinPred. We believe this resource will be useful for a wide range of applications and establishes best practices for long-term usability and sustainability for new and existing predictive algorithms.
Collapse
Affiliation(s)
- Bingjie Xue
- Department of Biomedical Engineering and the Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
| | - Benjamin Jordan
- Department of Biomedical Engineering and the Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
| | - Saqib Rizvi
- Department of Biomedical Engineering and the Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
| | - Kristen M. Naegle
- Department of Biomedical Engineering and the Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| |
Collapse
|
6
|
Shi XX, Wu FX, Mei LC, Wang YL, Hao GF, Yang GF. Bioinformatics toolbox for exploring protein phosphorylation network. Brief Bioinform 2020; 22:5871447. [PMID: 32666116 DOI: 10.1093/bib/bbaa134] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 05/15/2020] [Accepted: 06/02/2020] [Indexed: 01/23/2023] Open
Abstract
A clear systematic delineation of the interactions between phosphorylation sites on substrates and their effector kinases plays a fundamental role in revealing cellular activities, understanding signaling modulation mechanisms and proposing novel hypotheses. The emergence of bioinformatics tools contributes to studying phosphorylation network. Some of them feature the visualization of network, enabling more effective trace of the underlying biological problems in a clear and succinct way. In this review, we aimed to provide a toolbox for exploring phosphorylation network. We first systematically surveyed 19 tools that are available for exploring phosphorylation networks, and subsequently comparatively analyzed and summarized these tools to guide tool selection in terms of functionality, data sources, performance, network visualization and implementation, and finally briefly discussed the application cases of these tools. In different scenarios, the conclusion on the suitability of a tool for a specific user may vary. Nevertheless, easily accessible bioinformatics tools are proved to facilitate biological findings. Hopefully, this work might also assist non-specialists, students, as well as computational scientists who aim at developing novel tools in the field of phosphorylation modification.
Collapse
Affiliation(s)
- Xing-Xing Shi
- College of Chemistry, Central China Normal University (CCNU)
| | | | | | - Yu-Liang Wang
- College of Chemistry, Central China Normal University (CCNU)
| | - Ge-Fei Hao
- Bioinformatics in State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering of GZU and College of Chemistry of CCNU
| | | |
Collapse
|
7
|
Savage SR, Zhang B. Using phosphoproteomics data to understand cellular signaling: a comprehensive guide to bioinformatics resources. Clin Proteomics 2020; 17:27. [PMID: 32676006 PMCID: PMC7353784 DOI: 10.1186/s12014-020-09290-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 07/04/2020] [Indexed: 12/19/2022] Open
Abstract
Mass spectrometry-based phosphoproteomics is becoming an essential methodology for the study of global cellular signaling. Numerous bioinformatics resources are available to facilitate the translation of phosphopeptide identification and quantification results into novel biological and clinical insights, a critical step in phosphoproteomics data analysis. These resources include knowledge bases of kinases and phosphatases, phosphorylation sites, kinase inhibitors, and sequence variants affecting kinase function, and bioinformatics tools that can predict phosphorylation sites in addition to the kinase that phosphorylates them, infer kinase activity, and predict the effect of mutations on kinase signaling. However, these resources exist in silos and it is challenging to select among multiple resources with similar functions. Therefore, we put together a comprehensive collection of resources related to phosphoproteomics data interpretation, compared the use of tools with similar functions, and assessed the usability from the standpoint of typical biologists or clinicians. Overall, tools could be improved by standardization of enzyme names, flexibility of data input and output format, consistent maintenance, and detailed manuals.
Collapse
Affiliation(s)
- Sara R. Savage
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN USA
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX USA
| |
Collapse
|
8
|
Deznabi I, Arabaci B, Koyutürk M, Tastan O. DeepKinZero: zero-shot learning for predicting kinase-phosphosite associations involving understudied kinases. Bioinformatics 2020; 36:3652-3661. [PMID: 32044914 PMCID: PMC7320620 DOI: 10.1093/bioinformatics/btaa013] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 12/17/2019] [Accepted: 01/06/2020] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Protein phosphorylation is a key regulator of protein function in signal transduction pathways. Kinases are the enzymes that catalyze the phosphorylation of other proteins in a target-specific manner. The dysregulation of phosphorylation is associated with many diseases including cancer. Although the advances in phosphoproteomics enable the identification of phosphosites at the proteome level, most of the phosphoproteome is still in the dark: more than 95% of the reported human phosphosites have no known kinases. Determining which kinase is responsible for phosphorylating a site remains an experimental challenge. Existing computational methods require several examples of known targets of a kinase to make accurate kinase-specific predictions, yet for a large body of kinases, only a few or no target sites are reported. RESULTS We present DeepKinZero, the first zero-shot learning approach to predict the kinase acting on a phosphosite for kinases with no known phosphosite information. DeepKinZero transfers knowledge from kinases with many known target phosphosites to those kinases with no known sites through a zero-shot learning model. The kinase-specific positional amino acid preferences are learned using a bidirectional recurrent neural network. We show that DeepKinZero achieves significant improvement in accuracy for kinases with no known phosphosites in comparison to the baseline model and other methods available. By expanding our knowledge on understudied kinases, DeepKinZero can help to chart the phosphoproteome atlas. AVAILABILITY AND IMPLEMENTATION The source codes are available at https://github.com/Tastanlab/DeepKinZero. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Iman Deznabi
- Computer Engineering Department, Bilkent University, Ankara 06800, Turkey
- College of Information and Computer Sciences, University of Massachusetts, Amherst, MA 01003, USA
| | - Busra Arabaci
- Computer Engineering Department, Bilkent University, Ankara 06800, Turkey
| | - Mehmet Koyutürk
- Department of Computer and Data Sciences
- Center for Proteomics & Bioinformatics, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Oznur Tastan
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| |
Collapse
|
9
|
Rashid MM, Shatabda S, Hasan MM, Kurata H. Recent Development of Machine Learning Methods in Microbial Phosphorylation Sites. Curr Genomics 2020; 21:194-203. [PMID: 33071613 PMCID: PMC7521030 DOI: 10.2174/1389202921666200427210833] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 04/12/2020] [Accepted: 04/13/2020] [Indexed: 01/10/2023] Open
Abstract
A variety of protein post-translational modifications has been identified that control many cellular functions. Phosphorylation studies in mycobacterial organisms have shown critical importance in diverse biological processes, such as intercellular communication and cell division. Recent technical advances in high-precision mass spectrometry have determined a large number of microbial phosphorylated proteins and phosphorylation sites throughout the proteome analysis. Identification of phosphorylated proteins with specific modified residues through experimentation is often labor-intensive, costly and time-consuming. All these limitations could be overcome through the application of machine learning (ML) approaches. However, only a limited number of computational phosphorylation site prediction tools have been developed so far. This work aims to present a complete survey of the existing ML-predictors for microbial phosphorylation. We cover a variety of important aspects for developing a successful predictor, including operating ML algorithms, feature selection methods, window size, and software utility. Initially, we review the currently available phosphorylation site databases of the microbiome, the state-of-the-art ML approaches, working principles, and their performances. Lastly, we discuss the limitations and future directions of the computational ML methods for the prediction of phosphorylation.
Collapse
Affiliation(s)
| | | | - Md. Mehedi Hasan
- Address correspondence to these authors at the Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828;, E-mail: and Biomedical Informatics R&D Center, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828; E-mail:
| | - Hiroyuki Kurata
- Address correspondence to these authors at the Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828;, E-mail: and Biomedical Informatics R&D Center, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828; E-mail:
| |
Collapse
|
10
|
Lin S, Wang C, Zhou J, Shi Y, Ruan C, Tu Y, Yao L, Peng D, Xue Y. EPSD: a well-annotated data resource of protein phosphorylation sites in eukaryotes. Brief Bioinform 2020; 22:298-307. [PMID: 32008039 DOI: 10.1093/bib/bbz169] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 11/25/2019] [Accepted: 12/10/2019] [Indexed: 12/16/2022] Open
Abstract
As an important post-translational modification (PTM), protein phosphorylation is involved in the regulation of almost all of biological processes in eukaryotes. Due to the rapid progress in mass spectrometry-based phosphoproteomics, a large number of phosphorylation sites (p-sites) have been characterized but remain to be curated. Here, we briefly summarized the current progresses in the development of data resources for the collection, curation, integration and annotation of p-sites in eukaryotic proteins. Also, we designed the eukaryotic phosphorylation site database (EPSD), which contained 1 616 804 experimentally identified p-sites in 209 326 phosphoproteins from 68 eukaryotic species. In EPSD, we not only collected 1 451 629 newly identified p-sites from high-throughput (HTP) phosphoproteomic studies, but also integrated known p-sites from 13 additional databases. Moreover, we carefully annotated the phosphoproteins and p-sites of eight model organisms by integrating the knowledge from 100 additional resources that covered 15 aspects, including phosphorylation regulator, genetic variation and mutation, functional annotation, structural annotation, physicochemical property, functional domain, disease-associated information, protein-protein interaction, drug-target relation, orthologous information, biological pathway, transcriptional regulator, mRNA expression, protein expression/proteomics and subcellular localization. We anticipate that the EPSD can serve as a useful resource for further analysis of eukaryotic phosphorylation. With a data volume of 14.1 GB, EPSD is free for all users at http://epsd.biocuckoo.cn/.
Collapse
Affiliation(s)
| | | | - Jiaqi Zhou
- Huazhong University of Science and Technology
| | - Ying Shi
- Huazhong University of Science and Technology
| | - Chen Ruan
- Huazhong University of Science and Technology
| | - Yiran Tu
- Huazhong University of Science and Technology
| | - Lan Yao
- Huazhong University of Science and Technology
| | - Di Peng
- Huazhong University of Science and Technology
| | - Yu Xue
- Huazhong University of Science and Technology
| |
Collapse
|
11
|
Intelligent Estimation of Vitrinite Reflectance of Coal from Photomicrographs Based on Machine Learning. ENERGIES 2019. [DOI: 10.3390/en12203855] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The accurate measurement of vitrinite reflectance (especially for mean maximum vitrinite reflectance, MMVR) is an important issue in the fields of coal mining and processing. However, the application of MMVR has been somewhat hampered by the subjective and the time-consuming characteristic of manual measurements. Semi-automated methods that are oversimplified might affect the accuracy in measuring MMVR values. To address these concerns, we propose a novel MMVR measurement strategy based on machine learning (MMVRML). Considering the complex nature of coal, adaptive K-means clustering is firstly employed to automatically detect the number of clusters (i.e., maceral groups) in photomicrographs. Furthermore, comprehensive features along with a support vector machine are utilized to intelligently identify the regions with vitrinite. The largest region with vitrinite in each photomicrograph is gridded for further regression analysis. Evaluations on 78 photomicrographs show that the model based on random forest and 15 simplified grayscale features achieves the state-of-the-art root mean square error of 0.0424. In addition, to facilitate the usage of petrologists without strong expertise in the machine learning domain, we released the first non-commercial standalone software for estimating MMVR.
Collapse
|
12
|
Intelligent Identification of Maceral Components of Coal Based on Image Segmentation and Classification. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9163245] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
An intelligent analytical technique which is able to accurately identify maceral components is highly desired in the fields of mining and geology. However, currently available methods based on fixed-size window neglect the shape information, and thus do not work in identifying maceral composition from one entire photomicrograph. To address these concerns, we propose a novel Maceral Identification strategy based on image Segmentation and Classification (MISC). Considering the complex and heterogeneous nature of coal, a two-level coarse-to-fine clustering method based on K-means is employed to divide microscopic images into a sequence of regions with similar attributes (i.e., binder, vitrinite, liptinite and inertinite). Furthermore, comprehensive features along with random forest are utilized to automatically classify binder and seven types of maceral components, including vitrinite, fusinite, semifusinite, cutinite, sporinite, inertodetrinite and micrinite. Evaluations on 39 microscopic images show that the proposed method achieves the state-of-the-art accuracy of 90.44% and serves as the baseline for future research on maceral analysis. In addition, to support the decisions of petrologists during maceral analysis, we developed a standalone software, which is freely available at https:/github.com/GuyooGu/MISC-Master.
Collapse
|
13
|
Chen Q, Deng C, Lan W, Liu Z, Zheng R, Liu J, Wang J. Identifying Interactions Between Kinases and Substrates Based on Protein-Protein Interaction Network. J Comput Biol 2019; 26:836-845. [PMID: 30990327 DOI: 10.1089/cmb.2019.0048] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Protein phosphorylation is a kind of important post-translational modification of protein, which plays a critical role in many biological processes of eukaryote. Identifying kinase-substrate interactions is helpful to understand the mechanism of many diseases. Many computational algorithms for kinase-substrate interactions identification have been proposed. However, most of those methods are mainly focused on utilizing protein local sequence information. In this article, we propose a new computational method to predict kinase-substrate interactions based on protein-protein interaction (PPI) network. Different from existing methods, the PPI network is utilized to measure the similarities of kinase-kinase and substrate-substrate, respectively. Then, the pairwise similarities of kinase-kinase and substrate-substrate are adjusted based on the assumption that the similarities of kinase-kinase and substrate-substrate are more reliable if they are in the same cluster. Finally, the bi-random walk is used to predict potential kinase-substrate interactions. The experimental results show that our method outperforms other state-of-the-art algorithms in performance. Furthermore, the case study demonstrates that it is effective in predicting potential kinase-substrate interactions.
Collapse
Affiliation(s)
- Qingfeng Chen
- 1School of Computer, Electronics and Information, Guangxi University, Nanning, China
- 2State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, China
| | - Canshang Deng
- 1School of Computer, Electronics and Information, Guangxi University, Nanning, China
| | - Wei Lan
- 1School of Computer, Electronics and Information, Guangxi University, Nanning, China
| | - Zhixian Liu
- 2State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, China
| | - Ruiqing Zheng
- 3School of Computer Science and Engineering, Central South University, Changsha, China
| | - Jin Liu
- 3School of Computer Science and Engineering, Central South University, Changsha, China
| | - Jianxin Wang
- 3School of Computer Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
14
|
KSIMC: Predicting Kinase⁻Substrate Interactions Based on Matrix Completion. Int J Mol Sci 2019; 20:ijms20020302. [PMID: 30646505 PMCID: PMC6358935 DOI: 10.3390/ijms20020302] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Revised: 12/31/2018] [Accepted: 01/07/2019] [Indexed: 12/17/2022] Open
Abstract
Protein phosphorylation is an important chemical modification catalyzed by kinases. It plays important roles in many cellular processes. Predicting kinase–substrate interactions is vital to understanding the mechanism of many diseases. Many computational methods have been proposed to identify kinase–substrate interactions. However, the prediction accuracy still needs to be improved. Therefore, it is necessary to develop an efficient computational method to predict kinase–substrate interactions. In this paper, we propose a novel computational approach, KSIMC, to identify kinase–substrate interactions based on matrix completion. Firstly, the kinase similarity and substrate similarity are calculated by aligning sequence of kinase–kinase and substrate–substrate, respectively. Then, the original association network is adjusted based on the similarities. Finally, the matrix completion is used to predict potential kinase–substrate interactions. The experiment results show that our method outperforms other state-of-the-art algorithms in performance. Furthermore, the relevant databases and scientific literature verify the effectiveness of our algorithm for new kinase–substrate interaction identification.
Collapse
|
15
|
Zhang QB, Yu K, Liu Z, Wang D, Zhao Y, Yin S, Liu Z. Prediction of prkC-mediated protein serine/threonine phosphorylation sites for bacteria. PLoS One 2018; 13:e0203840. [PMID: 30278050 PMCID: PMC6168130 DOI: 10.1371/journal.pone.0203840] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Accepted: 08/28/2018] [Indexed: 11/18/2022] Open
Abstract
As an abundant post-translational modification, reversible phosphorylation is critical for the dynamic regulation of various biological processes. prkC, a critical serine/threonine-protein kinase in bacteria, plays important roles in regulation of signaling transduction. Identification of prkC-specific phosphorylation sites is fundamental for understanding the molecular mechanism of phosphorylation-mediated signaling. However, experimental identification of substrates for prkC is time-consuming and labor-intensive, and computational methods for kinase-specific phosphorylation prediction in bacteria have yet to be developed. In this study, we manually curated the experimentally identified substrates and phosphorylation sites of prkC from the published literature. The analyses of the sequence preferences showed that the substrate recognition pattern for prkC might be miscellaneous, and a complex strategy should be employed to predict potential prkC-specific phosphorylation sites. To develop the predictor, the amino acid location feature extraction method and the support vector machine algorithm were employed, and the methods achieved promising performance. Through 10-fold cross validation, the predictor reached a sensitivity of 91.67% at the specificity of 95.12%. Then, we developed freely accessible software, which is provided at http://free.cancerbio.info/prkc/. Based on the predictor, hundreds of potential prkC-specific phosphorylation sites were annotated based on the known bacterial phosphorylation sites. prkC-PSP was the first predictor for prkC-specific phosphorylation sites, and its prediction performance was promising. We anticipated that these analyses and the predictor could be helpful for further studies of prkC-mediated phosphorylation.
Collapse
Affiliation(s)
- Qing-bin Zhang
- Key Laboratory of Oral Medicine, Guangzhou Institute of Oral Disease, Stomatology Hospital of Guangzhou Medical University, Guangzhou, Guangdong, China
- * E-mail: (QbZ); (ZL)
| | - Kai Yu
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Zekun Liu
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
- Department of Hepatobiliary Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Dawei Wang
- Department of Thoracic Surgery, China Meitan General Hospital, Beijing, China
| | - Yuanyuan Zhao
- School of Arts and Media, Hefei Normal University, Hefei, Anhui, China
| | - Sanjun Yin
- Healthtimegene Institute, Shenzhen, China
| | - Zexian Liu
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
- * E-mail: (QbZ); (ZL)
| |
Collapse
|
16
|
Cao M, Chen G, Wang L, Wen P, Shi S. Computational Prediction and Analysis for Tyrosine Post-Translational Modifications via Elastic Net. J Chem Inf Model 2018; 58:1272-1281. [DOI: 10.1021/acs.jcim.7b00688] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
17
|
Wang M, Wang T, Li A. ksrMKL: a novel method for identification of kinase-substrate relationships using multiple kernel learning. PeerJ 2017; 5:e4182. [PMID: 29340231 PMCID: PMC5741978 DOI: 10.7717/peerj.4182] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 12/01/2017] [Indexed: 01/24/2023] Open
Abstract
Phosphorylation exerts a crucial role in multiple biological cellular processes which is catalyzed by protein kinases and closely related to many diseases. Identification of kinase-substrate relationships is important for understanding phosphorylation and provides a fundamental basis for further disease-related research and drug design. In this study, we develop a novel computational method to identify kinase-substrate relationships based on multiple kernel learning. The comparative analysis is based on a 10-fold cross-validation process and the dataset collected from the Phospho.ELM database. The results show that ksrMKL is greatly improved in various measures when compared with the single kernel support vector machine. Furthermore, with an independent test dataset extracted from the PhosphoSitePlus database, we compare ksrMKL with two existing kinase-substrate relationship prediction tools, namely iGPS and PKIS. The experimental results show that ksrMKL has better prediction performance than these existing tools.
Collapse
Affiliation(s)
- Minghui Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei, China.,Centers for Biomedical Engineering, University of Science and Technology of China, Hefei, China
| | - Tao Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei, China
| | - Ao Li
- School of Information Science and Technology, University of Science and Technology of China, Hefei, China.,Centers for Biomedical Engineering, University of Science and Technology of China, Hefei, China
| |
Collapse
|
18
|
Wilson LJ, Linley A, Hammond DE, Hood FE, Coulson JM, MacEwan DJ, Ross SJ, Slupsky JR, Smith PD, Eyers PA, Prior IA. New Perspectives, Opportunities, and Challenges in Exploring the Human Protein Kinome. Cancer Res 2017; 78:15-29. [DOI: 10.1158/0008-5472.can-17-2291] [Citation(s) in RCA: 97] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Revised: 09/22/2017] [Accepted: 10/31/2017] [Indexed: 11/16/2022]
|
19
|
Chen Q, Wang Y, Chen B, Zhang C, Wang L, Li J. Using propensity scores to predict the kinases of unannotated phosphopeptides. Knowl Based Syst 2017. [DOI: 10.1016/j.knosys.2017.08.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
20
|
A Novel Phosphorylation Site-Kinase Network-Based Method for the Accurate Prediction of Kinase-Substrate Relationships. BIOMED RESEARCH INTERNATIONAL 2017; 2017:1826496. [PMID: 29312990 PMCID: PMC5660750 DOI: 10.1155/2017/1826496] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Revised: 08/14/2017] [Accepted: 09/05/2017] [Indexed: 01/06/2023]
Abstract
Protein phosphorylation is catalyzed by kinases which regulate many aspects that control death, movement, and cell growth. Identification of the phosphorylation site-specific kinase-substrate relationships (ssKSRs) is important for understanding cellular dynamics and provides a fundamental basis for further disease-related research and drug design. Although several computational methods have been developed, most of these methods mainly use local sequence of phosphorylation sites and protein-protein interactions (PPIs) to construct the prediction model. While phosphorylation presents very complicated processes and is usually involved in various biological mechanisms, the aforementioned information is not sufficient for accurate prediction. In this study, we propose a new and powerful computational approach named KSRPred for ssKSRs prediction, by introducing a novel phosphorylation site-kinase network (pSKN) profiles that can efficiently incorporate the relationships between various protein kinases and phosphorylation sites. The experimental results show that the pSKN profiles can efficiently improve the prediction performance in collaboration with local sequence and PPI information. Furthermore, we compare our method with the existing ssKSRs prediction tools and the results demonstrate that KSRPred can significantly improve the prediction performance compared with existing tools.
Collapse
|
21
|
Various Mechanisms Involve the Nuclear Factor (Erythroid-Derived 2)-Like (NRF2) to Achieve Cytoprotection in Long-Term Cisplatin-Treated Urothelial Carcinoma Cell Lines. Int J Mol Sci 2017; 18:ijms18081680. [PMID: 28767070 PMCID: PMC5578070 DOI: 10.3390/ijms18081680] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 07/21/2017] [Accepted: 07/27/2017] [Indexed: 02/08/2023] Open
Abstract
Therapeutic efficacy of cisplatin-based chemotherapy for advanced-stage urothelial carcinoma (UC) is limited by drug resistance. The nuclear factor (erythroid-derived 2)-like 2 (NRF2) pathway is a major regulator of cytoprotective responses. We investigated its involvement in cisplatin resistance in long-term cisplatin treated UC cell lines (LTTs). Expression of NRF2 pathway components and targets was evaluated by qRT-PCR and western blotting in LTT sublines from four different parental cells. NRF2 transcriptional activity was determined by reporter assays and total glutathione (GSH) was quantified enzymatically. Effects of siRNA-mediated NRF2 knockdown on chemosensitivity were analysed by viability assays, γH2AX immunofluorescence, and flow cytometry. Increased expression of NRF2, its positive regulator p62/SQSTM1, and elevated NRF2 activity was observed in 3/4 LTTs, which correlated with KEAP1 expression. Expression of cytoprotective enzymes and GSH concentration were upregulated in some LTTs. NRF2 knockdown resulted in downregulation of cytoprotective enzymes and resensitised 3/4 LTTs towards cisplatin as demonstrated by reduced IC50 values, increased γH2AX foci formation, and elevated number of apoptotic cells. In conclusion, while LTT lines displayed diversity in NRF2 activation, NRF2 signalling contributed to cisplatin resistance in LTT lines, albeit in diverse ways. Accordingly, inhibition of NRF2 can be used to resensitise UC cells to cisplatin, but responses in patients may likewise be variable.
Collapse
|
22
|
Technological advances for interrogating the human kinome. Biochem Soc Trans 2017; 45:65-77. [PMID: 28202660 DOI: 10.1042/bst20160163] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 10/20/2016] [Accepted: 10/25/2016] [Indexed: 12/12/2022]
Abstract
There is increasing appreciation among researchers and clinicians of the value of investigating biology and pathobiology at the level of cellular kinase (kinome) activity. Kinome analysis provides valuable opportunity to gain insights into complex biology (including disease pathology), identify biomarkers of critical phenotypes (including disease prognosis and evaluation of therapeutic efficacy), and identify targets for therapeutic intervention through kinase inhibitors. The growing interest in kinome analysis has fueled efforts to develop and optimize technologies that enable characterization of phosphorylation-mediated signaling events in a cost-effective, high-throughput manner. In this review, we highlight recent advances to the central technologies currently available for kinome profiling and offer our perspectives on the key challenges remaining to be addressed.
Collapse
|
23
|
Qin GM, Li RY, Zhao XM. PhosD: inferring kinase-substrate interactions based on protein domains. Bioinformatics 2017; 33:1197-1204. [PMID: 28031187 DOI: 10.1093/bioinformatics/btw792] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2016] [Accepted: 12/09/2016] [Indexed: 12/26/2022] Open
Abstract
Motivation Identifying the kinase-substrate relationships is vital to understanding the phosphorylation events and various biological processes, especially signal transductions. Although large amount of phosphorylation sites have been detected, unfortunately, it is rarely known which kinases activate those sites. Despite distinct computational approaches have been proposed to predict the kinase-substrate interactions, the prediction accuracy still needs to be improved. Results In this paper, we propose a novel probabilistic model named as PhosD to predict kinase-substrate relationships based on protein domains with the assumption that kinase-substrate interactions are accomplished with kinase-domain interactions. By further taking into account protein-protein interactions, our PhosD outperforms other popular approaches on several benchmark datasets with higher precision. In addition, some of our predicted kinase-substrate relationships are validated by signaling pathways, indicating the predictive power of our approach. Furthermore, we notice that given a kinase, the more substrates are known for the kinase the more accurate its predicted substrates will be, and the domains involved in kinase-substrate interactions are found to be more conserved across proteins phosphorylated by multiple kinases. These findings can help develop more efficient computational approaches in the future. Availability and Implementation The data and results are available at http://comp-sysbio.org/phosd. Contact xm_zhao@tongji.edu.cn. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gui-Min Qin
- School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China.,School of Software
| | - Rui-Yi Li
- School of Computer Science and Technology, Xidian University, Xi'an 710071, China
| | - Xing-Ming Zhao
- School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China
| |
Collapse
|
24
|
Xu X, Wang M. Inferring Disease Associated Phosphorylation Sites via Random Walk on Multi-Layer Heterogeneous Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:836-844. [PMID: 26584500 DOI: 10.1109/tcbb.2015.2498548] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
As protein phosphorylation plays an important role in numerous cellular processes, many studies have been undertaken to analyze phosphorylation-related activities for drug design and disease treatment. However, although progresses have been made in illustrating the relationship between phosphorylation and diseases, no existing method focuses on disease-associated phosphorylation sites prediction. In this work, we proposed a multi-layer heterogeneous network model that makes use of the kinase information to infer disease-phosphorylation site relationship and implemented random walk on the heterogeneous network. Experimental results reveal that multi-layer heterogeneous network model with kinase layer is superior in inferring disease-phosphorylation site relationship when comparing with existing random walk model and common used classification methods.
Collapse
|
25
|
Wang M, Jiang Y, Xu X. A novel method for predicting post-translational modifications on serine and threonine sites by using site-modification network profiles. MOLECULAR BIOSYSTEMS 2016; 11:3092-100. [PMID: 26344496 DOI: 10.1039/c5mb00384a] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Post-translational modifications (PTMs) regulate many aspects of biological behaviours including protein-protein interactions and cellular processes. Identification of PTM sites is helpful for understanding the PTM regulatory mechanisms. The PTMs on serine and threonine sites include phosphorylation, O-linked glycosylation and acetylation. Although a lot of computational approaches have been developed for PTM site prediction, currently most of them generate the predictive models by employing only local sequence information and few of them consider the relationship between different PTMs. In this paper, by adopting the site-modification network (SMNet) profiles that efficiently incorporate in situ PTM information, we develop a novel method to predict PTM sites on serine and threonine. PTM data are collected from various PTM databases and the SMNet is built to reflect the relationship between multiple PTMs, from which SMNet profiles are extracted to train predictive models based on SVM. Performance analysis of the SVM models shows that the SMNet profiles play an important role in accurately predicting PTM sites on serine and threonine. Furthermore, the proposed method is compared with existing PTM prediction approaches. The results from 10-fold cross-validation demonstrate that the proposed method with SMNet profiles performs remarkably better than existing methods, suggesting the power of SMNet profiles in identifying PTM sites.
Collapse
Affiliation(s)
- Minghui Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei AH230027, People's Republic of China
| | | | | |
Collapse
|
26
|
Eyre NS, Hampton-Smith RJ, Aloia AL, Eddes JS, Simpson KJ, Hoffmann P, Beard MR. Phosphorylation of NS5A Serine-235 is essential to hepatitis C virus RNA replication and normal replication compartment formation. Virology 2016; 491:27-44. [PMID: 26874015 DOI: 10.1016/j.virol.2016.01.018] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Revised: 01/21/2016] [Accepted: 01/23/2016] [Indexed: 01/09/2023]
Abstract
Hepatitis C virus (HCV) NS5A protein is essential for HCV RNA replication and virus assembly. Here we report the identification of NS5A phosphorylation sites Ser-222, Ser-235 and Thr-348 during an infectious HCV replication cycle and demonstrate that Ser-235 phosphorylation is essential for HCV RNA replication. Confocal microscopy revealed that both phosphoablatant (S235A) and phosphomimetic (S235D) mutants redistribute NS5A to large juxta-nuclear foci that display altered colocalization with known replication complex components. Using electron microscopy (EM) we found that S235D alters virus-induced membrane rearrangements while EM using 'APEX2'-tagged viruses demonstrated S235D-mediated enrichment of NS5A in irregular membranous foci. Finally, using a customized siRNA screen of candidate NS5A kinases and subsequent analysis using a phospho-specific antibody, we show that phosphatidylinositol-4 kinase III alpha (PI4KIIIα) is important for Ser-235 phosphorylation. We conclude that Ser-235 phosphorylation of NS5A is essential for HCV RNA replication and normal replication complex formation and is regulated by PI4KIIIα.
Collapse
Affiliation(s)
- Nicholas S Eyre
- School of Biological Sciences and Research Centre for Infectious Diseases, University of Adelaide, Adelaide, Australia; Centre for Cancer Biology, SA Pathology, Adelaide, Australia.
| | - Rachel J Hampton-Smith
- School of Biological Sciences and Research Centre for Infectious Diseases, University of Adelaide, Adelaide, Australia; Centre for Cancer Biology, SA Pathology, Adelaide, Australia
| | - Amanda L Aloia
- School of Biological Sciences and Research Centre for Infectious Diseases, University of Adelaide, Adelaide, Australia; Centre for Cancer Biology, SA Pathology, Adelaide, Australia
| | - James S Eddes
- Adelaide Proteomics Centre, School of Biological Sciences, University of Adelaide, Adelaide, Australia
| | - Kaylene J Simpson
- Victorian Centre for Functional Genomics, Peter MacCallum Cancer Centre, East Melbourne, Australia; The Sir Peter MacCallum Department of Oncology, University of Melbourne, Parkville, Australia
| | - Peter Hoffmann
- Adelaide Proteomics Centre, School of Biological Sciences, University of Adelaide, Adelaide, Australia; Institute for Photonics and Advanced Sensing (IPAS), University of Adelaide, Adelaide, Australia
| | - Michael R Beard
- School of Biological Sciences and Research Centre for Infectious Diseases, University of Adelaide, Adelaide, Australia; Centre for Cancer Biology, SA Pathology, Adelaide, Australia
| |
Collapse
|
27
|
Wang B, Wang M, Jiang Y, Sun D, Xu X. A novel network-based computational method to predict protein phosphorylation on tyrosine sites. J Bioinform Comput Biol 2016; 13:1542005. [DOI: 10.1142/s0219720015420056] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Phosphorylation plays a great role in regulating a variety of cellular processes and the identification of tyrosine phosphorylation sites is fundamental for understanding the post-translational modification (PTM) regulation processes. Although a lot of computational methods have been developed, most of them only concern local sequence information and few studies focus on the tyrosine sites with in situ PTM information, which refers to different types of PTM occurring on the same modification site. In this study, by constructing the site-modification network that efficiently incorporates in situ PTM information, we introduce a novel network-based computational method, site-modification network-based inference (SMNBI) to predict tyrosine phosphorylation. In order to verify the effectiveness of the proposed method, we compare it with other network-based computational methods. The results clearly show the superior performance of SMNBI. Besides, we extensively compare SMNBI with other sequence-based methods including SVM and Bayesian decision theory. The evaluation demonstrates the power of site-modification network in predicting tyrosine phosphorylation. The proposed method is freely available at http://bioinformatics.ustc.edu.cn/smnbi/ .
Collapse
Affiliation(s)
- Binghua Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China
| | - Minghui Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China
- Centers for Biomedical Engineering, University of Science and Technology of China, Hefei 230027, China
| | - Yujie Jiang
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China
| | - Dongdong Sun
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China
| | - Xiaoyi Xu
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China
| |
Collapse
|
28
|
Li H, Wang M, Xu X. Prediction of kinase–substrate relations based on heterogeneous networks. J Bioinform Comput Biol 2016; 13:1542003. [DOI: 10.1142/s0219720015420032] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Protein phosphorylation catalyzed by kinases plays essential roles in various intracellular processes. With an increasing number of phosphorylation sites verified experimentally by high-throughput technologies and assigned as substrates of specific kinases, prediction of potential kinase–substrate relations (KSRs) attracts increasing attention. Although a large number of computational methods have been designed, most of them only focus on local protein sequence information. A few KSR prediction approaches integrate protein–protein interaction and protein sequence information into existing machine learning algorithms at the cost of high feature dimensions or reduced sensitivity. In this work, we introduce two novel heterogeneous networks, HetNet-PPI and HetNet-SEQ, by incorporating PPI and similarity of protein sequences into the kinase–substrate heterogeneous networks, respectively. Based on these two heterogeneous networks, we further propose two new KSR prediction methods, HeteSim-PPI and HeteSim-SEQ, by adopting the HeteSim algorithm, which is recently proposed for relevance measure in heterogeneous information networks. Comprehensive evaluation results of the two methods show that similarity of protein sequences is more effective in improving KSR prediction performance as HeteSim-SEQ outperforms HeteSim-PPI in most cases. Further comparison results demonstrate that HeteSim-SEQ is superior to existing methods including BDT, SVM and iGPS, suggesting the effectiveness of the proposed network-based method in predicting potential KSRs.
Collapse
Affiliation(s)
- Haichun Li
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, P. R. China
| | - Minghui Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, P. R. China
- Centers for Biomedical Engineering, University of Science and Technology of China, Hefei 230027, P. R. China
| | - Xiaoyi Xu
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, P. R. China
| |
Collapse
|
29
|
Reeks C, Screaton RA. Identification of Kinase-substrate Pairs Using High Throughput Screening. J Vis Exp 2015:e53152. [PMID: 26383144 DOI: 10.3791/53152] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
We have developed a screening platform to identify dedicated human protein kinases for phosphorylated substrates which can be used to elucidate novel signal transduction pathways. Our approach features the use of a library of purified GST-tagged human protein kinases and a recombinant protein substrate of interest. We have used this technology to identify MAP/microtubule affinity-regulating kinase 2 (MARK2) as the kinase for a glucose-regulated site on CREB-Regulated Transcriptional Coactivator 2 (CRTC2), a protein required for beta cell proliferation, as well as the Axl family of tyrosine kinases as regulators of cell metastasis by phosphorylation of the adaptor protein ELMO. We describe this technology and discuss how it can help to establish a comprehensive map of how cells respond to environmental stimuli.
Collapse
Affiliation(s)
- Courtney Reeks
- Children's Hospital of Eastern Ontario Research Institute
| | - Robert A Screaton
- Sunnybrook Research Institute, University of Toronto; Department of Biochemistry, University of Toronto;
| |
Collapse
|
30
|
Knowledge-Based Analysis for Detecting Key Signaling Events from Time-Series Phosphoproteomics Data. PLoS Comput Biol 2015; 11:e1004403. [PMID: 26252020 PMCID: PMC4529189 DOI: 10.1371/journal.pcbi.1004403] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 06/11/2015] [Indexed: 12/24/2022] Open
Abstract
Cell signaling underlies transcription/epigenetic control of a vast majority of cell-fate decisions. A key goal in cell signaling studies is to identify the set of kinases that underlie key signaling events. In a typical phosphoproteomics study, phosphorylation sites (substrates) of active kinases are quantified proteome-wide. By analyzing the activities of phosphorylation sites over a time-course, the temporal dynamics of signaling cascades can be elucidated. Since many substrates of a given kinase have similar temporal kinetics, clustering phosphorylation sites into distinctive clusters can facilitate identification of their respective kinases. Here we present a knowledge-based CLUster Evaluation (CLUE) approach for identifying the most informative partitioning of a given temporal phosphoproteomics data. Our approach utilizes prior knowledge, annotated kinase-substrate relationships mined from literature and curated databases, to first generate biologically meaningful partitioning of the phosphorylation sites and then determine key kinases associated with each cluster. We demonstrate the utility of the proposed approach on two time-series phosphoproteomics datasets and identify key kinases associated with human embryonic stem cell differentiation and insulin signaling pathway. The proposed approach will be a valuable resource in the identification and characterizing of signaling networks from phosphoproteomics data. A key goal in cell signaling studies is to identify the set of kinases that underlie key signaling events. Mass spectrometry-based technologies have emerged as a powerful tool to profile proteome-wide phosphorylation events in vivo at a single amino acid resolution with high precision. However, development of algorithms to analyze and identify signaling events from high-throughput phosphoproteomics data is still in its infancy. Here we propose a knowledge-based CLUster Evaluation (CLUE) approach for identifying key signaling cascades from time-series phosphoproteomics data. Our approach utilizes known kinase-substrate annotations from curated phosphoproteomics databases to first determine the optimal clustering of the phosphorylation sites and then identify enriched kinase(s). We apply CLUE on time-series phosphoproteomics datasets and identify key kinases associated with human embryonic stem cell differentiation and insulin signaling pathway.
Collapse
|
31
|
Computational and statistical methods for high-throughput analysis of post-translational modifications of proteins. J Proteomics 2015. [PMID: 26216596 DOI: 10.1016/j.jprot.2015.07.016] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The investigation of post-translational modifications (PTMs) represents one of the main research focuses for the study of protein function and cell signaling. Mass spectrometry instrumentation with increasing sensitivity improved protocols for PTM enrichment and recently established pipelines for high-throughput experiments allow large-scale identification and quantification of several PTM types. This review addresses the concurrently emerging challenges for the computational analysis of the resulting data and presents PTM-centered approaches for spectra identification, statistical analysis, multivariate analysis and data interpretation. We furthermore discuss the potential of future developments that will help to gain deep insight into the PTM-ome and its biological role in cells. This article is part of a Special Issue entitled: Computational Proteomics.
Collapse
|
32
|
Abstract
The succession of protein activation and deactivation mediated by phosphorylation and dephosphorylation events constitutes a key mechanism of molecular information transfer in cellular systems. To deduce the details of those molecular information cascades and networks has been a central goal pursued by both experimental and computational approaches. Many computational network reconstruction methods employing an array of different statistical learning methods have been developed to infer phosphorylation networks based on different types of molecular data sets such as protein sequence, protein structure, or phosphoproteomics data. In this chapter, different computational network inference methods and resources for biological network reconstruction with a particular focus on phosphorylation networks are surveyed.
Collapse
|
33
|
Palmeri A, Ferrè F, Helmer-Citterich M. Exploiting holistic approaches to model specificity in protein phosphorylation. Front Genet 2014; 5:315. [PMID: 25324856 PMCID: PMC4179730 DOI: 10.3389/fgene.2014.00315] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2014] [Accepted: 08/21/2014] [Indexed: 12/27/2022] Open
Abstract
Phosphate plays a chemically unique role in shaping cellular signaling of all current living systems, especially eukaryotes. Protein phosphorylation has been studied at several levels, from the near-site context, both in sequence and structure, to the crowded cellular environment, and ultimately to the systems-level perspective. Despite the tremendous advances in mass spectrometry and efforts dedicated to the development of ad hoc highly sophisticated methods, phosphorylation site inference and associated kinase identification are still unresolved problems in kinome biology. The sequence and structure of the substrate near-site context are not sufficient alone to model the in vivo phosphorylation rules, and they should be integrated with orthogonal information in all possible applications. Here we provide an overview of the different contexts that contribute to protein phosphorylation, discussing their potential impact in phosphorylation site annotation and in predicting kinase-substrate specificity.
Collapse
Affiliation(s)
- Antonio Palmeri
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| | - Fabrizio Ferrè
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| | - Manuela Helmer-Citterich
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| |
Collapse
|
34
|
Chen YA, Eschrich SA. Computational methods and opportunities for phosphorylation network medicine. Transl Cancer Res 2014; 3:266-278. [PMID: 25530950 PMCID: PMC4271781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Protein phosphorylation, one of the most ubiquitous post-translational modifications (PTM) of proteins, is known to play an essential role in cell signaling and regulation. With the increasing understanding of the complexity and redundancy of cell signaling, there is a growing recognition that targeting the entire network or system could be a necessary and advantageous strategy for treating cancer. Protein kinases, the proteins that add a phosphate group to the substrate proteins during phosphorylation events, have become one of the largest groups of 'druggable' targets in cancer therapeutics in recent years. Kinase inhibitors are being regularly used in clinics for cancer treatment. This therapeutic paradigm shift in cancer research is partly due to the generation and availability of high-dimensional proteomics data. Generation of this data, in turn, is enabled by increased use of mass-spectrometry (MS)-based or other high-throughput proteomics platforms as well as companion public databases and computational tools. This review briefly summarizes the current state and progress on phosphoproteomics identification, quantification, and platform related characteristics. We review existing database resources, computational tools, methods for phosphorylation network inference, and ultimately demonstrate the connection to therapeutics. Finally, many research opportunities exist for bioinformaticians or biostatisticians based on developments and limitations of the current and emerging technologies.
Collapse
Affiliation(s)
- Yian Ann Chen
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, 12902 Magnolia Drive Tampa, FL 33612, USA
| | - Steven A Eschrich
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, 12902 Magnolia Drive Tampa, FL 33612, USA
| |
Collapse
|
35
|
Suo SB, Qiu JD, Shi SP, Chen X, Liang RP. PSEA: Kinase-specific prediction and analysis of human phosphorylation substrates. Sci Rep 2014; 4:4524. [PMID: 24681538 PMCID: PMC3970127 DOI: 10.1038/srep04524] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Accepted: 03/11/2014] [Indexed: 11/09/2022] Open
Abstract
Protein phosphorylation catalysed by kinases plays crucial regulatory roles in intracellular signal transduction. With the increasing number of kinase-specific phosphorylation sites and disease-related phosphorylation substrates that have been identified, the desire to explore the regulatory relationship between protein kinases and disease-related phosphorylation substrates is motivated. In this work, we analysed the kinases' characteristic of all disease-related phosphorylation substrates by using our developed Phosphorylation Set Enrichment Analysis (PSEA) method. We evaluated the efficiency of our method with independent test and concluded that our approach is reliable for identifying kinases responsible for phosphorylated substrates. In addition, we found that Mitogen-activated protein kinase (MAPK) and Glycogen synthase kinase (GSK) families are more associated with abnormal phosphorylation. It can be anticipated that our method might be helpful to identify the mechanism of phosphorylation and the relationship between kinase and phosphorylation related diseases. A user-friendly web interface is now freely available at http://bioinfo.ncu.edu.cn/PKPred_Home.aspx.
Collapse
Affiliation(s)
- Sheng-Bao Suo
- Department of Chemistry, Nanchang University, Nanchang, 330031, China
| | - Jian-Ding Qiu
- 1] Department of Chemistry, Nanchang University, Nanchang, 330031, China [2] Department of Chemical Engineering, Pingxiang College, Pingxiang, 337055, China
| | - Shao-Ping Shi
- 1] Department of Chemistry, Nanchang University, Nanchang, 330031, China [2] Department of Mathematics, Nanchang University, Nanchang, 330031, China
| | - Xiang Chen
- Department of Chemistry, Nanchang University, Nanchang, 330031, China
| | - Ru-Ping Liang
- Department of Chemistry, Nanchang University, Nanchang, 330031, China
| |
Collapse
|