1
|
Zhu L, Wang L, Yang Z, Xu P, Yang S. PPSNO: A Feature-Rich SNO Sites Predictor by Stacking Ensemble Strategy from Protein Sequence-Derived Information. Interdiscip Sci 2024; 16:192-217. [PMID: 38206557 DOI: 10.1007/s12539-023-00595-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 11/20/2023] [Accepted: 11/21/2023] [Indexed: 01/12/2024]
Abstract
The protein S-nitrosylation (SNO) is a significant post-translational modification that affects the stability, activity, cellular localization, and function of proteins. Therefore, highly accurate prediction of SNO sites aids in grasping biological function mechanisms. In this document, we have constructed a predictor, named PPSNO, forecasting protein SNO sites using stacked integrated learning. PPSNO integrates multiple machine learning techniques into an ensemble model, enhancing its predictive accuracy. First, we established benchmark datasets by collecting SNO sites from various sources, including literature, databases, and other predictors. Second, various techniques for feature extraction are applied to derive characteristics from protein sequences, which are subsequently amalgamated into the PPSNO predictor for training. Five-fold cross-validation experiments show that PPSNO outperformed existing predictors, such as PSNO, PreSNO, pCysMod, DeepNitro, RecSNO, and Mul-SNO. The PPSNO predictor achieved an impressive accuracy of 92.8%, an area under the curve (AUC) of 96.1%, a Matthews correlation coefficient (MCC) of 81.3%, an F1-score of 85.6%, an SN of 79.3%, an SP of 97.7%, and an average precision (AP) of 92.2%. We also employed ROC curves, PR curves, and radar plots to show the superior performance of PPSNO. Our study shows that fused protein sequence features and two-layer stacked ensemble models can improve the accuracy of predicting SNO sites, which can aid in comprehending cellular processes and disease mechanisms. The codes and data are available at https://github.com/serendipity-wly/PPSNO .
Collapse
Affiliation(s)
- Lun Zhu
- School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China
| | - Liuyang Wang
- School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China
| | - Zexi Yang
- School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China
| | - Piao Xu
- College of Economics and Management, Nanjing Forestry University, Nanjing, 210037, China
| | - Sen Yang
- School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China.
- The Affiliated Changzhou No. 2 People's Hospital of Nanjing Medical University, Changzhou, 213164, China.
| |
Collapse
|
2
|
Jia J, Lv P, Wei X, Qiu W. SNO-DCA: A model for predicting S-nitrosylation sites based on densely connected convolutional networks and attention mechanism. Heliyon 2024; 10:e23187. [PMID: 38148797 PMCID: PMC10750070 DOI: 10.1016/j.heliyon.2023.e23187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 11/22/2023] [Accepted: 11/29/2023] [Indexed: 12/28/2023] Open
Abstract
Protein S-nitrosylation is a reversible oxidative reduction post-translational modification that is widely present in the biological community. S-nitrosylation can regulate protein function and is closely associated with a variety of diseases, thus identifying S-nitrosylation sites are crucial for revealing the function of proteins and related drug discovery. Traditional experimental methods are time-consuming and expensive; therefore, it is necessary to explore more efficient computational methods. Deep learning algorithms perform well in the field of bioinformatics sites prediction, and many studies show that they outperform existing machine learning algorithms. In this work, we proposed a deep learning algorithm-based predictor SNO-DCA for distinguishing between S-nitrosylated and non-S-nitrosylated sequences. First, one-hot encoding of protein sequences was performed. Second, the dense convolutional blocks were used to capture feature information, and an attention module was added to weigh different features to improve the prediction ability of the model. The 10-fold cross-validation and independent testing experimental results show that our SNO-DCA model outperforms existing S-nitrosylation sites prediction models under imbalanced data. In this paper, a web server prediction website: https://sno.cangmang.xyz/SNO-DCA/was established to provide an online prediction service for users. SNO-DCA can be available at https://github.com/peanono/SNO-DCA.
Collapse
Affiliation(s)
- Jianhua Jia
- Computer Department, Jingdezhen Ceramic University, Jingdezhen, 330403, China
| | - Peinuo Lv
- Computer Department, Jingdezhen Ceramic University, Jingdezhen, 330403, China
| | - Xin Wei
- Business School, Jiangxi Institute of Fashion Technology, Nanchang, 330201, China
| | - Wangren Qiu
- Computer Department, Jingdezhen Ceramic University, Jingdezhen, 330403, China
| |
Collapse
|
3
|
Pratyush P, Pokharel S, Saigo H, KC DB. pLMSNOSite: an ensemble-based approach for predicting protein S-nitrosylation sites by integrating supervised word embedding and embedding from pre-trained protein language model. BMC Bioinformatics 2023; 24:41. [PMID: 36755242 PMCID: PMC9909867 DOI: 10.1186/s12859-023-05164-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 01/30/2023] [Indexed: 02/10/2023] Open
Abstract
BACKGROUND Protein S-nitrosylation (SNO) plays a key role in transferring nitric oxide-mediated signals in both animals and plants and has emerged as an important mechanism for regulating protein functions and cell signaling of all main classes of protein. It is involved in several biological processes including immune response, protein stability, transcription regulation, post translational regulation, DNA damage repair, redox regulation, and is an emerging paradigm of redox signaling for protection against oxidative stress. The development of robust computational tools to predict protein SNO sites would contribute to further interpretation of the pathological and physiological mechanisms of SNO. RESULTS Using an intermediate fusion-based stacked generalization approach, we integrated embeddings from supervised embedding layer and contextualized protein language model (ProtT5) and developed a tool called pLMSNOSite (protein language model-based SNO site predictor). On an independent test set of experimentally identified SNO sites, pLMSNOSite achieved values of 0.340, 0.735 and 0.773 for MCC, sensitivity and specificity respectively. These results show that pLMSNOSite performs better than the compared approaches for the prediction of S-nitrosylation sites. CONCLUSION Together, the experimental results suggest that pLMSNOSite achieves significant improvement in the prediction performance of S-nitrosylation sites and represents a robust computational approach for predicting protein S-nitrosylation sites. pLMSNOSite could be a useful resource for further elucidation of SNO and is publicly available at https://github.com/KCLabMTU/pLMSNOSite .
Collapse
Affiliation(s)
- Pawel Pratyush
- grid.259979.90000 0001 0663 5937Department of Computer Science, Michigan Technological University, Houghton, MI USA
| | - Suresh Pokharel
- grid.259979.90000 0001 0663 5937Department of Computer Science, Michigan Technological University, Houghton, MI USA
| | - Hiroto Saigo
- grid.177174.30000 0001 2242 4849Department of Electrical Engineering and Computer Science, Kyushu University, 744, Motooka, Nishi-Ku, 819-0395 Japan
| | - Dukka B. KC
- grid.259979.90000 0001 0663 5937Department of Computer Science, Michigan Technological University, Houghton, MI USA
| |
Collapse
|
4
|
Zhao Q, Ma J, Wang Y, Xie F, Lv Z, Xu Y, Shi H, Han K. Mul-SNO: A novel prediction tool for S-nitrosylation sites based on deep learning methods. IEEE J Biomed Health Inform 2021; 26:2379-2387. [PMID: 34762593 DOI: 10.1109/jbhi.2021.3123503] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Protein s-nitrosylation (SNO is one of the most important post-translational modifications and is formed by the covalent modification of nitric oxide and cysteine residues. Extensive studies have shown that SNO plays a pivotal role in the plant immune response and treating various major human diseases. In recent years, SNO sites have become a hot research topic. Traditional biochemical methods for SNO site identification are time-consuming and costly. In this study, we developed an economical and efficient SNO site prediction tool named Mul-SNO. Mul-SNO ensembled current popular and powerful deep learning model bidirectional long short-term memory (BiLSTM and bidirectional encoder representations from Transformers (BERT . Compared with existing state-of-the-art methods, Mul-SNO obtained better ACC of 0.911 and 0.796 based on 10-fold cross-validation and independent data sets, respectively. The prediction server can be obtained for free at http://lab.malab.cn/~mjq/Mul-SNO/.
Collapse
|
5
|
A global map of associations between types of protein posttranslational modifications and human genetic diseases. iScience 2021; 24:102917. [PMID: 34430807 PMCID: PMC8365368 DOI: 10.1016/j.isci.2021.102917] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 06/27/2021] [Accepted: 07/27/2021] [Indexed: 12/14/2022] Open
Abstract
There are >200 types of protein posttranslational modifications (PTMs) described in eukaryotes, each with unique proteome coverage and functions. We hypothesized that some genetic diseases may be caused by the removal of a specific type of PTMs by genomic variants and the consequent deregulation of particular functions. We collected >320,000 human PTMs representing 59 types and crossed them with >4M nonsynonymous DNA variants annotated with predicted pathogenicity and disease associations. We report >1.74M PTM-variant co-occurrences that an enrichment analysis distributed into 215 pairwise associations between 18 PTM types and 148 genetic diseases. Of them, 42% were not previously described. Removal of lysine acetylation exerts the most pronounced effect, and less studied PTM types such as S-glutathionylation or S-nitrosylation show relevance. Using pathogenicity predictions, we identified PTM sites that may produce particular diseases if prevented. Our results provide evidence of a substantial impact of PTM-specific removal on the pathogenesis of genetic diseases and phenotypes. There is an enrichment of disease-associated nsSNVs preventing certain types of PTMs We report 215 pairwise associations between 18 PTM types and 148 genetic diseases The removal of lysine acetylation exerts the most pronounced effect We report a set of PTM sites that may produce particular diseases if prevented
Collapse
|
6
|
Massa CM, Liu Z, Taylor S, Pettit AP, Stakheyeva MN, Korotkova E, Popova V, Atochina-Vasserman EN, Gow AJ. Biological Mechanisms of S-Nitrosothiol Formation and Degradation: How Is Specificity of S-Nitrosylation Achieved? Antioxidants (Basel) 2021; 10:antiox10071111. [PMID: 34356344 PMCID: PMC8301044 DOI: 10.3390/antiox10071111] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 06/03/2021] [Accepted: 06/08/2021] [Indexed: 01/21/2023] Open
Abstract
The modification of protein cysteine residues underlies some of the diverse biological functions of nitric oxide (NO) in physiology and disease. The formation of stable nitrosothiols occurs under biologically relevant conditions and time scales. However, the factors that determine the selective nature of this modification remain poorly understood, making it difficult to predict thiol targets and thus construct informatics networks. In this review, the biological chemistry of NO will be considered within the context of nitrosothiol formation and degradation whilst considering how specificity is achieved in this important post-translational modification. Since nitrosothiol formation requires a formal one-electron oxidation, a classification of reaction mechanisms is proposed regarding which species undergoes electron abstraction: NO, thiol or S-NO radical intermediate. Relevant kinetic, thermodynamic and mechanistic considerations will be examined and the impact of sources of NO and the chemical nature of potential reaction targets is also discussed.
Collapse
Affiliation(s)
- Christopher M. Massa
- Department of Pharmacology & Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08848, USA; (C.M.M.); (Z.L.); (S.T.); (A.P.P.)
| | - Ziping Liu
- Department of Pharmacology & Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08848, USA; (C.M.M.); (Z.L.); (S.T.); (A.P.P.)
| | - Sheryse Taylor
- Department of Pharmacology & Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08848, USA; (C.M.M.); (Z.L.); (S.T.); (A.P.P.)
| | - Ashley P. Pettit
- Department of Pharmacology & Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08848, USA; (C.M.M.); (Z.L.); (S.T.); (A.P.P.)
| | - Marena N. Stakheyeva
- RASA Center in Tomsk, Tomsk Polytechnic University, 634050 Tomsk, Russia; (M.N.S.); (E.N.A.-V.)
- Institute of Natural Resources, Tomsk Polytechnic University, Lenin Av. 30, 634050 Tomsk, Russia; (E.K.); (V.P.)
| | - Elena Korotkova
- Institute of Natural Resources, Tomsk Polytechnic University, Lenin Av. 30, 634050 Tomsk, Russia; (E.K.); (V.P.)
| | - Valentina Popova
- Institute of Natural Resources, Tomsk Polytechnic University, Lenin Av. 30, 634050 Tomsk, Russia; (E.K.); (V.P.)
| | - Elena N. Atochina-Vasserman
- RASA Center in Tomsk, Tomsk Polytechnic University, 634050 Tomsk, Russia; (M.N.S.); (E.N.A.-V.)
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew J. Gow
- Department of Pharmacology & Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08848, USA; (C.M.M.); (Z.L.); (S.T.); (A.P.P.)
- RASA Center in Tomsk, Tomsk Polytechnic University, 634050 Tomsk, Russia; (M.N.S.); (E.N.A.-V.)
- Correspondence: ; Tel.: +1-848-445-4612
| |
Collapse
|
7
|
Rajpoot S, Wary KK, Ibbott R, Liu D, Saqib U, Thurston TLM, Baig MS. TIRAP in the Mechanism of Inflammation. Front Immunol 2021; 12:697588. [PMID: 34305934 PMCID: PMC8297548 DOI: 10.3389/fimmu.2021.697588] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 06/23/2021] [Indexed: 12/15/2022] Open
Abstract
The Toll-interleukin-1 Receptor (TIR) domain-containing adaptor protein (TIRAP) represents a key intracellular signalling molecule regulating diverse immune responses. Its capacity to function as an adaptor molecule has been widely investigated in relation to Toll-like Receptor (TLR)-mediated innate immune signalling. Since the discovery of TIRAP in 2001, initial studies were mainly focused on its role as an adaptor protein that couples Myeloid differentiation factor 88 (MyD88) with TLRs, to activate MyD88-dependent TLRs signalling. Subsequent studies delineated TIRAP’s role as a transducer of signalling events through its interaction with non-TLR signalling mediators. Indeed, the ability of TIRAP to interact with an array of intracellular signalling mediators suggests its central role in various immune responses. Therefore, continued studies that elucidate the molecular basis of various TIRAP-protein interactions and how they affect the signalling magnitude, should provide key information on the inflammatory disease mechanisms. This review summarizes the TIRAP recruitment to activated receptors and discusses the mechanism of interactions in relation to the signalling that precede acute and chronic inflammatory diseases. Furthermore, we highlighted the significance of TIRAP-TIR domain containing binding sites for several intracellular inflammatory signalling molecules. Collectively, we discuss the importance of the TIR domain in TIRAP as a key interface involved in protein interactions which could hence serve as a therapeutic target to dampen the extent of acute and chronic inflammatory conditions.
Collapse
Affiliation(s)
- Sajjan Rajpoot
- Department of Biosciences and Biomedical Engineering (BSBE), Indian Institute of Technology Indore (IITI), Indore, India
| | - Kishore K Wary
- Department of Pharmacology and Regenerative Medicine, The University of Illinois at Chicago, Chicago, IL, United States
| | - Rachel Ibbott
- MRC Centre for Molecular Bacteriology and Infection, Imperial College London, London, United Kingdom
| | - Dongfang Liu
- Department of Pathology, Immunology and Laboratory Medicine, Rutgers University-New Jersey Medical School, Newark, NJ, United States.,School of Graduate Studies, Rutgers Biomedical and Health Sciences, Newark, NJ, United States.,Center for Immunity and Inflammation, New Jersey Medical School, Rutgers-The State University of New Jersey, Newark, NJ, United States
| | - Uzma Saqib
- Discipline of Chemistry, Indian Institute of Technology Indore (IITI), Indore, India
| | - Teresa L M Thurston
- MRC Centre for Molecular Bacteriology and Infection, Imperial College London, London, United Kingdom
| | - Mirza S Baig
- Department of Biosciences and Biomedical Engineering (BSBE), Indian Institute of Technology Indore (IITI), Indore, India
| |
Collapse
|
8
|
Sacchi S, Rabattoni V, Miceli M, Pollegioni L. Yin and Yang in Post-Translational Modifications of Human D-Amino Acid Oxidase. Front Mol Biosci 2021; 8:684934. [PMID: 34041270 PMCID: PMC8141710 DOI: 10.3389/fmolb.2021.684934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 04/23/2021] [Indexed: 11/30/2022] Open
Abstract
In the central nervous system, the flavoprotein D-amino acid oxidase is responsible for catabolizing D-serine, the main endogenous coagonist of N-methyl-D-aspartate receptor. Dysregulation of D-serine brain levels in humans has been associated with neurodegenerative and psychiatric disorders. This D-amino acid is synthesized by the enzyme serine racemase, starting from the corresponding L-enantiomer, and degraded by both serine racemase (via an elimination reaction) and the flavoenzyme D-amino acid oxidase. To shed light on the role of human D-amino acid oxidase (hDAAO) in D-serine metabolism, the structural/functional relationships of this enzyme have been investigated in depth and several strategies aimed at controlling the enzymatic activity have been identified. Here, we focused on the effect of post-translational modifications: by using a combination of structural analyses, biochemical methods, and cellular studies, we investigated whether hDAAO is subjected to nitrosylation, sulfhydration, and phosphorylation. hDAAO is S-nitrosylated and this negatively affects its activity. In contrast, the hydrogen sulfide donor NaHS seems to alter the enzyme conformation, stabilizing a species with higher affinity for the flavin adenine dinucleotide cofactor and thus positively affecting enzymatic activity. Moreover, hDAAO is phosphorylated in cerebellum; however, the protein kinase involved is still unknown. Taken together, these findings indicate that D-serine levels can be also modulated by post-translational modifications of hDAAO as also known for the D-serine synthetic enzyme serine racemase.
Collapse
Affiliation(s)
- Silvia Sacchi
- "The Protein Factory 2.0", Dipartimento di Biotecnologie e Scienze della Vita, Università degli Studi Dell'Insubria, Varese, Italy
| | - Valentina Rabattoni
- "The Protein Factory 2.0", Dipartimento di Biotecnologie e Scienze della Vita, Università degli Studi Dell'Insubria, Varese, Italy
| | - Matteo Miceli
- "The Protein Factory 2.0", Dipartimento di Biotecnologie e Scienze della Vita, Università degli Studi Dell'Insubria, Varese, Italy
| | - Loredano Pollegioni
- "The Protein Factory 2.0", Dipartimento di Biotecnologie e Scienze della Vita, Università degli Studi Dell'Insubria, Varese, Italy
| |
Collapse
|
9
|
Li S, Yu K, Wu G, Zhang Q, Wang P, Zheng J, Liu ZX, Wang J, Gao X, Cheng H. pCysMod: Prediction of Multiple Cysteine Modifications Based on Deep Learning Framework. Front Cell Dev Biol 2021; 9:617366. [PMID: 33732693 PMCID: PMC7959776 DOI: 10.3389/fcell.2021.617366] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Accepted: 01/12/2021] [Indexed: 12/18/2022] Open
Abstract
Thiol groups on cysteines can undergo multiple post-translational modifications (PTMs), acting as a molecular switch to maintain redox homeostasis and regulating a series of cell signaling transductions. Identification of sophistical protein cysteine modifications is crucial for dissecting its underlying regulatory mechanism. Instead of a time-consuming and labor-intensive experimental method, various computational methods have attracted intense research interest due to their convenience and low cost. Here, we developed the first comprehensive deep learning based tool pCysMod for multiple protein cysteine modification prediction, including S-nitrosylation, S-palmitoylation, S-sulfenylation, S-sulfhydration, and S-sulfinylation. Experimentally verified cysteine sites curated from literature and sites collected by other databases and predicting tools were integrated as benchmark dataset. Several protein sequence features were extracted and united into a deep learning model, and the hyperparameters were optimized by particle swarm optimization algorithms. Cross-validations indicated our model showed excellent robustness and outperformed existing tools, which was able to achieve an average AUC of 0.793, 0.807, 0.796, 0.793, and 0.876 for S-nitrosylation, S-palmitoylation, S-sulfenylation, S-sulfhydration, and S-sulfinylation, demonstrating pCysMod was stable and suitable for protein cysteine modification prediction. Besides, we constructed a comprehensive protein cysteine modification prediction web server based on this model to benefit the researches finding the potential modification sites of their interested proteins, which could be accessed at http://pcysmod.omicsbio.info. This work will undoubtedly greatly promote the study of protein cysteine modification and contribute to clarifying the biological regulation mechanisms of cysteine modification within and among the cells.
Collapse
Affiliation(s)
- Shihua Li
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China.,School of Life Sciences, Zhengzhou University, Zhengzhou, China
| | - Kai Yu
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Guandi Wu
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Qingfeng Zhang
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Panqin Wang
- School of Life Sciences, Zhengzhou University, Zhengzhou, China
| | - Jian Zheng
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Ze-Xian Liu
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Jichao Wang
- CAS Key Lab of Biobased Materials, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, China
| | - Xinjiao Gao
- MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, China
| | - Han Cheng
- School of Life Sciences, Zhengzhou University, Zhengzhou, China
| |
Collapse
|
10
|
Recent Advances in Predicting Protein S-Nitrosylation Sites. BIOMED RESEARCH INTERNATIONAL 2021; 2021:5542224. [PMID: 33628788 PMCID: PMC7892234 DOI: 10.1155/2021/5542224] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Revised: 01/24/2021] [Accepted: 01/25/2021] [Indexed: 01/09/2023]
Abstract
Protein S-nitrosylation (SNO) is a process of covalent modification of nitric oxide (NO) and its derivatives and cysteine residues. SNO plays an essential role in reversible posttranslational modifications of proteins. The accurate prediction of SNO sites is crucial in revealing a certain biological mechanism of NO regulation and related drug development. Identification of the sites of SNO in proteins is currently a very hot topic. In this review, we briefly summarize recent advances in computationally identifying SNO sites. The challenges and future perspectives for identifying SNO sites are also discussed. We anticipate that this review will provide insights into research on SNO site prediction.
Collapse
|
11
|
Kumar M, Papaleo E. A pan-cancer assessment of alterations of the kinase domain of ULK1, an upstream regulator of autophagy. Sci Rep 2020; 10:14874. [PMID: 32913252 PMCID: PMC7483646 DOI: 10.1038/s41598-020-71527-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 06/22/2020] [Indexed: 02/06/2023] Open
Abstract
Autophagy is a key clearance process to recycle damaged cellular components. One important upstream regulator of autophagy is ULK1 kinase. Several three-dimensional structures of the ULK1 catalytic domain are available, but a comprehensive study, including molecular dynamics, is missing. Also, an exhaustive description of ULK1 alterations found in cancer samples is presently lacking. We here applied a framework which links -omics data to structural protein ensembles to study ULK1 alterations from genomics data available for more than 30 cancer types. We predicted the effects of mutations on ULK1 function and structural stability, accounting for protein dynamics, and the different layers of changes that a mutation can induce in a protein at the functional and structural level. ULK1 is down-regulated in gynecological tumors. In other cancer types, ULK2 could compensate for ULK1 downregulation and, in the majority of the cases, no marked changes in expression have been found. 36 missense mutations of ULK1, not limited to the catalytic domain, are co-occurring with mutations in a large number of ULK1 interactors or substrates, suggesting a pronounced effect of the upstream steps of autophagy in many cancer types. Moreover, our results pinpoint that more than 50% of the mutations in the kinase domain of ULK1, here investigated, are predicted to affect protein stability. Three mutations (S184F, D102N, and A28V) are predicted with only impact on kinase activity, either modifying the functional dynamics or the capability to exert effects from distal sites to the functional and catalytic regions. The framework here applied could be extended to other protein targets to aid the classification of missense mutations from cancer genomics studies, as well as to prioritize variants for experimental validation, or to select the appropriate biological readouts for experiments.
Collapse
Affiliation(s)
- Mukesh Kumar
- Computational Biology Laboratory, Center for Autophagy, Recycling and Disease (CARD), Danish Cancer Society Research Center, Strandboulevarden 49, 2100, Copenhagen, Denmark
| | - Elena Papaleo
- Computational Biology Laboratory, Center for Autophagy, Recycling and Disease (CARD), Danish Cancer Society Research Center, Strandboulevarden 49, 2100, Copenhagen, Denmark.
- Translational Disease System Biology, Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
12
|
Navarro MV, Chaves AFA, Castilho DG, Casula I, Calado JCP, Conceição PM, Iwai LK, de Castro BF, Batista WL. Effect of Nitrosative Stress on the S-Nitroso-Proteome of Paracoccidioides brasiliensis. Front Microbiol 2020; 11:1184. [PMID: 32582109 PMCID: PMC7287035 DOI: 10.3389/fmicb.2020.01184] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2020] [Accepted: 05/08/2020] [Indexed: 12/12/2022] Open
Abstract
The fungi Paracoccidioides brasiliensis and Paracoccidioides lutzii are the causative agents of paracoccidioidomycosis (PCM), a systemic mycosis endemic to Latin America. This fungus is considered a facultative intracellular pathogen that is able to survive and replicate inside macrophages. The survival of the fungus during infection depends on its adaptability to various conditions, such as nitrosative/oxidative stress produced by the host immune cells, particularly alveolar macrophages. Currently, there is little knowledge about the Paracoccidioides spp. signaling pathways involved in the fungus evasion mechanism of the host defense response. However, it is known that some of these pathways are triggered by reactive oxygen species and reactive nitrogen species (ROS/RNS) produced by host cells. Considering that the effects of NO (nitric oxide) on pathogens are concentration dependent, such effects could alter the redox state of cysteine residues by influencing (activating or inhibiting) a variety of protein functions, notably S-nitrosylation, a highly important NO-dependent posttranslational modification that regulates cellular functions and signaling pathways. It has been demonstrated by our group that P. brasiliensis yeast cells proliferate when exposed to low NO concentrations. Thus, this work investigated the modulation profile of S-nitrosylated proteins of P. brasiliensis, as well as identifying S-nitrosylation sites after treatment with RNS. Through mass spectrometry analysis (LC-MS/MS) and label-free quantification, it was possible to identify 474 proteins in the S-nitrosylated proteome study. With this approach, we observed that proteins treated with NO at low concentrations presented a proliferative response pattern, with several proteins involved in cellular cycle regulation and growth being activated. These proteins appear to play important roles in fungal virulence. On the other hand, fungus stimulated by high NO concentrations exhibited a survival response pattern. Among these S-nitrosylated proteins we identified several potential molecular targets for fungal disease therapy, including cell wall integrity (CWI) pathway, amino acid and folic acid metabolisms. In addition, we detected that the transnitrosylation/denitrosylation redox signaling are preserved in this fungus. Finally, this work may help to uncover the beneficial and antifungal properties of NO in the P. brasiliensis and point to useful targets for the development of antifungal drugs.
Collapse
Affiliation(s)
- Marina V Navarro
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Alison F A Chaves
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Daniele G Castilho
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Isis Casula
- Department of Pharmaceutical Sciences, Instituto de Ciências Ambientais, Químicas e Farmacêuticas, Universidade Federal de São Paulo, Diadema, Brazil
| | - Juliana C P Calado
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Palloma M Conceição
- Department of Pharmaceutical Sciences, Instituto de Ciências Ambientais, Químicas e Farmacêuticas, Universidade Federal de São Paulo, Diadema, Brazil
| | - Leo K Iwai
- Laboratory of Applied Toxinology, Center of Toxins, Immune-response and Cell Signaling, Instituto Butantan, São Paulo, Brazil
| | - Beatriz F de Castro
- Department of Pharmaceutical Sciences, Instituto de Ciências Ambientais, Químicas e Farmacêuticas, Universidade Federal de São Paulo, Diadema, Brazil
| | - Wagner L Batista
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, Universidade Federal de São Paulo, São Paulo, Brazil.,Department of Pharmaceutical Sciences, Instituto de Ciências Ambientais, Químicas e Farmacêuticas, Universidade Federal de São Paulo, Diadema, Brazil
| |
Collapse
|
13
|
Huang KY, Lee TY, Kao HJ, Ma CT, Lee CC, Lin TH, Chang WC, Huang HD. dbPTM in 2019: exploring disease association and cross-talk of post-translational modifications. Nucleic Acids Res 2020; 47:D298-D308. [PMID: 30418626 PMCID: PMC6323979 DOI: 10.1093/nar/gky1074] [Citation(s) in RCA: 138] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Accepted: 10/19/2018] [Indexed: 12/25/2022] Open
Abstract
The dbPTM (http://dbPTM.mbc.nctu.edu.tw/) has been maintained for over 10 years with the aim to provide functional and structural analyses for post-translational modifications (PTMs). In this update, dbPTM not only integrates more experimentally validated PTMs from available databases and through manual curation of literature but also provides PTM-disease associations based on non-synonymous single nucleotide polymorphisms (nsSNPs). The high-throughput deep sequencing technology has led to a surge in the data generated through analysis of association between SNPs and diseases, both in terms of growth amount and scope. This update thus integrated disease-associated nsSNPs from dbSNP based on genome-wide association studies. The PTM substrate sites located at a specified distance in terms of the amino acids encoded from nsSNPs were deemed to have an association with the involved diseases. In recent years, increasing evidence for crosstalk between PTMs has been reported. Although mass spectrometry-based proteomics has substantially improved our knowledge about substrate site specificity of single PTMs, the fact that the crosstalk of combinatorial PTMs may act in concert with the regulation of protein function and activity is neglected. Because of the relatively limited information about concurrent frequency and functional relevance of PTM crosstalk, in this update, the PTM sites neighboring other PTM sites in a specified window length were subjected to motif discovery and functional enrichment analysis. This update highlights the current challenges in PTM crosstalk investigation and breaks the bottleneck of how proteomics may contribute to understanding PTM codes, revealing the next level of data complexity and proteomic limitation in prospective PTM research.
Collapse
Affiliation(s)
- Kai-Yao Huang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Science, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Science, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Hui-Ju Kao
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 32003, Taiwan
| | - Chen-Tse Ma
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 32003, Taiwan
| | - Chao-Chun Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 32003, Taiwan
| | - Tsai-Hsuan Lin
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 32003, Taiwan
| | - Wen-Chi Chang
- Institute of Tropical Plant Sciences, College of Biosciences and Biotechnology, National Cheng Kung University, Tainan 70101, Taiwan
| | - Hsien-Da Huang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Science, The Chinese University of Hong Kong, Shenzhen 518172, China
| |
Collapse
|
14
|
Wong A, Donaldson L, Portes MT, Eppinger J, Feijó JA, Gehring C. Arabidopsis DIACYLGLYCEROL KINASE4 is involved in nitric oxide-dependent pollen tube guidance and fertilization. Development 2020; 147:dev.183715. [PMID: 32220864 DOI: 10.1242/dev.183715] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 02/26/2020] [Indexed: 12/16/2022]
Abstract
Nitric oxide (NO) is a key signaling molecule that regulates diverse biological processes in both animals and plants, including important roles in male gamete physiology. In plants, NO is generated in pollen tubes (PTs) and affects intracellular responses through the modulation of Ca2+ signaling, actin organization, vesicle trafficking and cell wall deposition, bearing consequences in pollen-stigma interactions and PT guidance. In contrast, the NO-responsive proteins that mediate these responses remain elusive. Here, we show that PTs of Arabidopsis thaliana mutants impaired in the pollen-specific DIACYLGLYCEROL KINASE4 (DGK4) grow slower and become partially insensitive to NO-dependent growth inhibition and re-orientation responses. Recombinant DGK4 protein yields NO-responsive spectral and catalytic changes in vitro that are compatible with a role in NO perception and signaling in PTs. In addition to the expected phosphatidic acid-producing kinase activity, DGK4 recombinant protein also revealed guanylyl cyclase activity, as inferred by sequence analysis. Our results are compatible with a role for the fast-diffusible NO gas in signaling and cell-cell communication via the modulation of DGK4 activity during the progamic phase of angiosperm reproduction.
Collapse
Affiliation(s)
- Aloysius Wong
- Department of Biology, College of Science and Technology, Wenzhou-Kean University, 88 Daxue Road, Ouhai, Wenzhou, Zhejiang Province 325060, China.,Division of Biological and Environmental Sciences and Engineering, 4700 King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - Lara Donaldson
- Division of Biological and Environmental Sciences and Engineering, 4700 King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia.,Department of Molecular and Cell Biology, University of Cape Town, Rondebosch 7701, South Africa
| | - Maria Teresa Portes
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742-5815, USA
| | - Jörg Eppinger
- Division of Physical Sciences and Engineering, Biological and Organometallic Catalysis Laboratory, KAUST Catalysis Center, 4700 King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - José A Feijó
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742-5815, USA
| | - Christoph Gehring
- Division of Biological and Environmental Sciences and Engineering, 4700 King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| |
Collapse
|
15
|
Huang KY, Hsu JBK, Lee TY. Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method. Sci Rep 2019; 9:16175. [PMID: 31700141 PMCID: PMC6838336 DOI: 10.1038/s41598-019-52552-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Accepted: 10/18/2019] [Indexed: 12/14/2022] Open
Abstract
Succinylation is a type of protein post-translational modification (PTM), which can play important roles in a variety of cellular processes. Due to an increasing number of site-specific succinylated peptides obtained from high-throughput mass spectrometry (MS), various tools have been developed for computationally identifying succinylated sites on proteins. However, most of these tools predict succinylation sites based on traditional machine learning methods. Hence, this work aimed to carry out the succinylation site prediction based on a deep learning model. The abundance of MS-verified succinylated peptides enabled the investigation of substrate site specificity of succinylation sites through sequence-based attributes, such as position-specific amino acid composition, the composition of k-spaced amino acid pairs (CKSAAP), and position-specific scoring matrix (PSSM). Additionally, the maximal dependence decomposition (MDD) was adopted to detect the substrate signatures of lysine succinylation sites by dividing all succinylated sequences into several groups with conserved substrate motifs. According to the results of ten-fold cross-validation, the deep learning model trained using PSSM and informative CKSAAP attributes can reach the best predictive performance and also perform better than traditional machine-learning methods. Moreover, an independent testing dataset that truly did not exist in the training dataset was used to compare the proposed method with six existing prediction tools. The testing dataset comprised of 218 positive and 2621 negative instances, and the proposed model could yield a promising performance with 84.40% sensitivity, 86.99% specificity, 86.79% accuracy, and an MCC value of 0.489. Finally, the proposed method has been implemented as a web-based prediction tool (CNN-SuccSite), which is now freely accessible at http://csb.cse.yzu.edu.tw/CNN-SuccSite/.
Collapse
Affiliation(s)
- Kai-Yao Huang
- Department of Medical Research, Hsinchu Mackay Memorial Hospital, Hsinchu city, 300, Taiwan
| | - Justin Bo-Kai Hsu
- Department of Medical Research, Taipei Medical University Hospital, Taipei city, 110, Taiwan
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, 518172, China. .,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen, 518172, China.
| |
Collapse
|
16
|
Discovery of a Nitric Oxide-Responsive Protein in Arabidopsis thaliana. Molecules 2019; 24:molecules24152691. [PMID: 31344907 PMCID: PMC6696476 DOI: 10.3390/molecules24152691] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Revised: 07/20/2019] [Accepted: 07/22/2019] [Indexed: 11/17/2022] Open
Abstract
In plants, much like in animals, nitric oxide (NO) has been established as an important gaseous signaling molecule. However, contrary to animal systems, NO-sensitive or NO-responsive proteins that bind NO in the form of a sensor or participating in redox reactions have remained elusive. Here, we applied a search term constructed based on conserved and functionally annotated amino acids at the centers of Heme Nitric Oxide/Oxygen (H-NOX) domains in annotated and experimentally-tested gas-binding proteins from lower and higher eukaryotes, in order to identify candidate NO-binding proteins in Arabidopsis thaliana. The selection of candidate NO-binding proteins identified from the motif search was supported by structural modeling. This approach identified AtLRB3 (At4g01160), a member of the Light Response Bric-a-Brac/Tramtrack/Broad Complex (BTB) family, as a candidate NO-binding protein. AtLRB3 was heterologously expressed and purified, and then tested for NO-response. Spectroscopic data confirmed that AtLRB3 contains a histidine-ligated heme cofactor and importantly, the addition of NO to AtLRB3 yielded absorption characteristics reminiscent of canonical H-NOX proteins. Furthermore, substitution of the heme iron-coordinating histidine at the H-NOX center with a leucine strongly impaired the NO-response. Our finding therefore established AtLRB3 as a NO-interacting protein and future characterizations will focus on resolving the nature of this response.
Collapse
|
17
|
Khan YD, Batool A, Rasool N, Khan SA, Chou KC. Prediction of Nitrosocysteine Sites Using Position and Composition Variant Features. LETT ORG CHEM 2019. [DOI: 10.2174/1570178615666180802122953] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
S-nitrosylation is one of the most prominent posttranslational modification among proteins. It involves the addition of nitrogen oxide group to cysteine thiols forming S-nitrosocysteine. Evidence suggests that S-nitrosylation plays a foremost role in numerous human diseases and disorders. The incorporation of techniques for robust identification of S-nitrosylated proteins is highly anticipated in biological research and drug discovery. The proposed system endeavors a novel strategy based on a statistical and computational intelligent methods for the identification of S-nitrosocystiene sites within a given primary protein sequence. For this purpose, 5-step rule was approached comprising of benchmark dataset creation, mathematical modelling, prediction, evaluation and web-server development. For position relative feature extraction, statistical moments were used and a multilayer neural network was trained adapting Gradient Descent and Adaptive Learning algorithms. The results were comparatively analyzed with existing techniques using benchmark datasets. It is inferred through conclusive experimentation that the proposed scheme is very propitious, accurate and exceptionally effective for the prediction of S-nitrosocystiene in protein sequences.
Collapse
Affiliation(s)
- Yaser Daanial Khan
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Aroosa Batool
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Nouman Rasool
- Department of Life Sciences, School of Science, University of Management and Technology, Lahore, Pakistan
| | - Sher Afzal Khan
- Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Jeddah, 21577, Saudi Arabia
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, United States
| |
Collapse
|
18
|
Li T, Song R, Yin Q, Gao M, Chen Y. Identification of S-nitrosylation sites based on multiple features combination. Sci Rep 2019; 9:3098. [PMID: 30816267 PMCID: PMC6395632 DOI: 10.1038/s41598-019-39743-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Accepted: 02/01/2019] [Indexed: 01/24/2023] Open
Abstract
Protein S-nitrosylation (SNO) is a typical reversible, redox-dependent and post-translational modification that involves covalent modification of cysteine residues with nitric oxide (NO) for the thiol group. Numerous experiments have shown that SNO plays a major role in cell function and pathophysiology. In order to rapidly analysis the big sets of data, the computing methods for identifying the SNO sites are being considered as necessary auxiliary tools. In this study, multiple features including Parallel correlation pseudo amino acid composition (PC-PseAAC), Basic kmer1 (kmer1), Basic kmer2 (kmer2), General parallel correlation pseudo amino acid composition (PC-PseAAC_G), Adapted Normal distribution Bi-Profile Bayes (ANBPB), Double Bi-Profile Bayes (DBPB), Bi-Profile Bayes (BPB), Incorporating Amino Acid Pairwise (IAAPair) and Position-specific Tri-Amino Acid Propensity(PSTAAP) were employed to extract the sequence information. To remove information redundancy, information gain (IG) was applied to evaluate the importance of amino acids, which is the information entropy of class after subtracting the conditional entropy for the given amino acid. The prediction performance of the SNO sites was found to be best by using the cross-validation and independent tests. In addition, we also calculated four commonly used performance measurements, i.e. Sensitivity (Sn), Specificity (Sp), Accuracy (Acc), and the Matthew's Correlation Coefficient (MCC). For the training dataset, the overall Acc was 83.11%, the MCC was 0.6617. For an independent test dataset, Acc was 73.17%, and MCC was 0.3788. The results indicate that our method is likely to complement the existing prediction methods and is a useful tool for effective identification of the SNO sites.
Collapse
Affiliation(s)
- Taoying Li
- Department of Maritime Economics and Management, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China.
| | - Runyu Song
- Department of Maritime Economics and Management, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Qian Yin
- Department of Maritime Economics and Management, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Mingyue Gao
- Department of Maritime Economics and Management, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Yan Chen
- Department of Maritime Economics and Management, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| |
Collapse
|
19
|
Huang KY, Kao HJ, Hsu JBK, Weng SL, Lee TY. Characterization and identification of lysine glutarylation based on intrinsic interdependence between positions in the substrate sites. BMC Bioinformatics 2019; 19:384. [PMID: 30717647 PMCID: PMC7394328 DOI: 10.1186/s12859-018-2394-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 09/25/2018] [Indexed: 01/06/2023] Open
Abstract
Background Glutarylation, the addition of a glutaryl group (five carbons) to a lysine residue of a protein molecule, is an important post-translational modification and plays a regulatory role in a variety of physiological and biological processes. As the number of experimentally identified glutarylated peptides increases, it becomes imperative to investigate substrate motifs to enhance the study of protein glutarylation. We carried out a bioinformatics investigation of glutarylation sites based on amino acid composition using a public database containing information on 430 non-homologous glutarylation sites. Results The TwoSampleLogo analysis indicates that positively charged and polar amino acids surrounding glutarylated sites may be associated with the specificity in substrate site of protein glutarylation. Additionally, the chi-squared test was utilized to explore the intrinsic interdependence between two positions around glutarylation sites. Further, maximal dependence decomposition (MDD), which consists of partitioning a large-scale dataset into subgroups with statistically significant amino acid conservation, was used to capture motif signatures of glutarylation sites. We considered single features, such as amino acid composition (AAC), amino acid pair composition (AAPC), and composition of k-spaced amino acid pairs (CKSAAP), as well as the effectiveness of incorporating MDD-identified substrate motifs into an integrated prediction model. Evaluation by five-fold cross-validation showed that AAC was most effective in discriminating between glutarylation and non-glutarylation sites, according to support vector machine (SVM). Conclusions The SVM model integrating MDD-identified substrate motifs performed well, with a sensitivity of 0.677, a specificity of 0.619, an accuracy of 0.638, and a Matthews Correlation Coefficient (MCC) value of 0.28. Using an independent testing dataset (46 glutarylated and 92 non-glutarylated sites) obtained from the literature, we demonstrated that the integrated SVM model could improve the predictive performance effectively, yielding a balanced sensitivity and specificity of 0.652 and 0.739, respectively. This integrated SVM model has been implemented as a web-based system (MDDGlutar), which is now freely available at http://csb.cse.yzu.edu.tw/MDDGlutar/. Electronic supplementary material The online version of this article (10.1186/s12859-018-2394-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kai-Yao Huang
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, 518172, China
| | - Hui-Ju Kao
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, 518172, China.,Department of Computer Science and Engineering, Yuan Ze University, Taoyuan city, 320, Taiwan
| | - Justin Bo-Kai Hsu
- Department of Medical Research, Taipei Medical University Hospital, Taipei city, 110, Taiwan
| | - Shun-Long Weng
- Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan.,Mackay Medicine, Nursing and Management College, Taipei, 112, Taiwan.,Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsin-Chu, 300, Taiwan
| | - Tzong-Yi Lee
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, 518172, China. .,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, 518172, China.
| |
Collapse
|
20
|
Abstract
S-nitrosylation is an essential and reversible posttranslational modification of proteins involved in numerous biological processes. The experimental determination of S-nitrosylation sites is laborious and time-consuming. Therefore, the use of computational prediction tools of this modification represents a convenient first approach to generate useful information for subsequent experimental verification. Here we describe an in silico analysis pipeline to integrate the use of several bionformatic tools while dealing with big query protein sets.
Collapse
|
21
|
Hasan MM, Manavalan B, Khatun MS, Kurata H. Prediction of S-nitrosylation sites by integrating support vector machines and random forest. Mol Omics 2019; 15:451-458. [DOI: 10.1039/c9mo00098d] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Cysteine S-nitrosylation is a type of reversible post-translational modification of proteins, which controls diverse biological processes.
Collapse
Affiliation(s)
- Md. Mehedi Hasan
- Department of Bioscience and Bioinformatics
- Kyushu Institute of Technology
- Iizuka
- Japan
- Japan Society for the Promotion of Science
| | | | - Mst. Shamima Khatun
- Department of Bioscience and Bioinformatics
- Kyushu Institute of Technology
- Iizuka
- Japan
| | - Hiroyuki Kurata
- Department of Bioscience and Bioinformatics
- Kyushu Institute of Technology
- Iizuka
- Japan
- Biomedical Informatics R&D Center
| |
Collapse
|
22
|
López-Grueso MJ, González-Ojeda R, Requejo-Aguilar R, McDonagh B, Fuentes-Almagro CA, Muntané J, Bárcena JA, Padilla CA. Thioredoxin and glutaredoxin regulate metabolism through different multiplex thiol switches. Redox Biol 2018; 21:101049. [PMID: 30639960 PMCID: PMC6327914 DOI: 10.1016/j.redox.2018.11.007] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Revised: 11/08/2018] [Accepted: 11/11/2018] [Indexed: 12/19/2022] Open
Abstract
The aim of the present study was to define the role of Trx and Grx on metabolic thiol redox regulation and identify their protein and metabolite targets. The hepatocarcinoma-derived HepG2 cell line under both normal and oxidative/nitrosative conditions by overexpression of NO synthase (NOS3) was used as experimental model. Grx1 or Trx1 silencing caused conspicuous changes in the redox proteome reflected by significant changes in the reduced/oxidized ratios of specific Cys's including several glycolytic enzymes. Cys91 of peroxiredoxin-6 (PRDX6) and Cys153 of phosphoglycerate mutase-1 (PGAM1), that are known to be involved in progression of tumor growth, are reported here for the first time as specific targets of Grx1. A group of proteins increased their CysRED/CysOX ratio upon Trx1 and/or Grx1 silencing, including caspase-3 Cys163, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) Cys247 and triose-phosphate isomerase (TPI) Cys255 likely by enhancement of NOS3 auto-oxidation. The activities of several glycolytic enzymes were also significantly affected. Glycolysis metabolic flux increased upon Trx1 silencing, whereas silencing of Grx1 had the opposite effect. Diversion of metabolic fluxes toward synthesis of fatty acids and phospholipids was observed in siRNA-Grx1 treated cells, while siRNA-Trx1 treated cells showed elevated levels of various sphingomyelins and ceramides and signs of increased protein degradation. Glutathione synthesis was stimulated by both treatments. These data indicate that Trx and Grx have both, common and specific protein Cys redox targets and that down regulation of either redoxin has markedly different metabolic outcomes. They reflect the delicate sensitivity of redox equilibrium to changes in any of the elements involved and the difficulty of forecasting metabolic responses to redox environmental changes. Trx1 and Grx1 Cys redox targets are abundant among Glycolytic enzymes. PRDX6-Cys91 and PGAM-Cys153 are specific targets of Grx1. Down regulation of thioredoxin and glutaredoxin have different metabolic outcomes. Glutathione synthesis and membrane lipid composition are sensitive to Trx1 and Grx1 down regulation. Redoxins down regulation also induce target Cys reductive changes under NOS3 overexpression.
Collapse
Affiliation(s)
- M J López-Grueso
- Dept. Biochemistry and Molecular Biology, University of Córdoba, Córdoba, Spain; Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain
| | - R González-Ojeda
- Institute of Biomedicine of Seville (IBIS), IBiS/"Virgen del Rocío" University Hospital/CSIC/University of Seville, Seville, Spain
| | - R Requejo-Aguilar
- Dept. Biochemistry and Molecular Biology, University of Córdoba, Córdoba, Spain; Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain
| | - B McDonagh
- Dept. of Physiology, School of Medicine, NUI Galway, Ireland
| | | | - J Muntané
- Dept. of Physiology, School of Medicine, NUI Galway, Ireland
| | - J A Bárcena
- Dept. Biochemistry and Molecular Biology, University of Córdoba, Córdoba, Spain; Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain.
| | - C A Padilla
- Dept. Biochemistry and Molecular Biology, University of Córdoba, Córdoba, Spain; Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain
| |
Collapse
|
23
|
DeepNitro: Prediction of Protein Nitration and Nitrosylation Sites by Deep Learning. GENOMICS PROTEOMICS & BIOINFORMATICS 2018; 16:294-306. [PMID: 30268931 PMCID: PMC6205083 DOI: 10.1016/j.gpb.2018.04.007] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2018] [Revised: 04/12/2018] [Accepted: 04/27/2018] [Indexed: 11/24/2022]
Abstract
Protein nitration and nitrosylation are essential post-translational modifications (PTMs) involved in many fundamental cellular processes. Recent studies have revealed that excessive levels of nitration and nitrosylation in some critical proteins are linked to numerous chronic diseases. Therefore, the identification of substrates that undergo such modifications in a site-specific manner is an important research topic in the community and will provide candidates for targeted therapy. In this study, we aimed to develop a computational tool for predicting nitration and nitrosylation sites in proteins. We first constructed four types of encoding features, including positional amino acid distributions, sequence contextual dependencies, physicochemical properties, and position-specific scoring features, to represent the modified residues. Based on these encoding features, we established a predictor called DeepNitro using deep learning methods for predicting protein nitration and nitrosylation. Using n-fold cross-validation, our evaluation shows great AUC values for DeepNitro, 0.65 for tyrosine nitration, 0.80 for tryptophan nitration, and 0.70 for cysteine nitrosylation, respectively, demonstrating the robustness and reliability of our tool. Also, when tested in the independent dataset, DeepNitro is substantially superior to other similar tools with a 7%−42% improvement in the prediction performance. Taken together, the application of deep learning method and novel encoding schemes, especially the position-specific scoring feature, greatly improves the accuracy of nitration and nitrosylation site prediction and may facilitate the prediction of other PTM sites. DeepNitro is implemented in JAVA and PHP and is freely available for academic research at http://deepnitro.renlab.org.
Collapse
|
24
|
Bignon E, Allega MF, Lucchetta M, Tiberti M, Papaleo E. Computational Structural Biology of S-nitrosylation of Cancer Targets. Front Oncol 2018; 8:272. [PMID: 30155439 PMCID: PMC6102371 DOI: 10.3389/fonc.2018.00272] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Accepted: 07/02/2018] [Indexed: 12/15/2022] Open
Abstract
Nitric oxide (NO) plays an essential role in redox signaling in normal and pathological cellular conditions. In particular, it is well known to react in vivo with cysteines by the so-called S-nitrosylation reaction. S-nitrosylation is a selective and reversible post-translational modification that exerts a myriad of different effects, such as the modulation of protein conformation, activity, stability, and biological interaction networks. We have appreciated, over the last years, the role of S-nitrosylation in normal and disease conditions. In this context, structural and computational studies can help to dissect the complex and multifaceted role of this redox post-translational modification. In this review article, we summarized the current state-of-the-art on the mechanism of S-nitrosylation, along with the structural and computational studies that have helped to unveil its effects and biological roles. We also discussed the need to move new steps forward especially in the direction of employing computational structural biology to address the molecular and atomistic details of S-nitrosylation. Indeed, this redox modification has been so far an underappreciated redox post-translational modification by the computational biochemistry community. In our review, we primarily focus on S-nitrosylated proteins that are attractive cancer targets due to the emerging relevance of this redox modification in a cancer setting.
Collapse
Affiliation(s)
- Emmanuelle Bignon
- Computational Biology Laboratory Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Maria Francesca Allega
- Computational Biology Laboratory Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Marta Lucchetta
- Computational Biology Laboratory Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Matteo Tiberti
- Computational Biology Laboratory Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Elena Papaleo
- Computational Biology Laboratory Danish Cancer Society Research Center, Copenhagen, Denmark.,Translational Disease Systems Biology, Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein Research University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
25
|
Study of adenylyl cyclase-GαS interactions and identification of novel AC ligands. Mol Cell Biochem 2018; 446:63-72. [DOI: 10.1007/s11010-018-3273-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Accepted: 01/04/2018] [Indexed: 10/18/2022]
|
26
|
Kao HJ, Weng SL, Huang KY, Kaunang FJ, Hsu JBK, Huang CH, Lee TY. MDD-carb: a combinatorial model for the identification of protein carbonylation sites with substrate motifs. BMC SYSTEMS BIOLOGY 2017; 11:137. [PMID: 29322938 PMCID: PMC5763492 DOI: 10.1186/s12918-017-0511-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Background Carbonylation, which takes place through oxidation of reactive oxygen species (ROS) on specific residues, is an irreversibly oxidative modification of proteins. It has been reported that the carbonylation is related to a number of metabolic or aging diseases including diabetes, chronic lung disease, Parkinson’s disease, and Alzheimer’s disease. Due to the lack of computational methods dedicated to exploring motif signatures of protein carbonylation sites, we were motivated to exploit an iterative statistical method to characterize and identify carbonylated sites with motif signatures. Results By manually curating experimental data from research articles, we obtained 332, 144, 135, and 140 verified substrate sites for K (lysine), R (arginine), T (threonine), and P (proline) residues, respectively, from 241 carbonylated proteins. In order to examine the informative attributes for classifying between carbonylated and non-carbonylated sites, multifarious features including composition of twenty amino acids (AAC), composition of amino acid pairs (AAPC), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM) were investigated in this study. Additionally, in an attempt to explore the motif signatures of carbonylation sites, an iterative statistical method was adopted to detect statistically significant dependencies of amino acid compositions between specific positions around substrate sites. Profile hidden Markov model (HMM) was then utilized to train a predictive model from each motif signature. Moreover, based on the method of support vector machine (SVM), we adopted it to construct an integrative model by combining the values of bit scores obtained from profile HMMs. The combinatorial model could provide an enhanced performance with evenly predictive sensitivity and specificity in the evaluation of cross-validation and independent testing. Conclusion This study provides a new scheme for exploring potential motif signatures at substrate sites of protein carbonylation. The usefulness of the revealed motifs in the identification of carbonylated sites is demonstrated by their effective performance in cross-validation and independent testing. Finally, these substrate motifs were adopted to build an available online resource (MDD-Carb, http://csb.cse.yzu.edu.tw/MDDCarb/) and are also anticipated to facilitate the study of large-scale carbonylated proteomes. Electronic supplementary material The online version of this article (10.1186/s12918-017-0511-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hui-Ju Kao
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, city, 320, Taiwan
| | - Shun-Long Weng
- Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan.,Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsinchu, city, 300, Taiwan.,Mackay Junior College of Medicine, Nursing and Management, Taipei, city, 112, Taiwan
| | - Kai-Yao Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, city, 320, Taiwan.,Department of Medical Research, Hsinchu Mackay Memorial Hospital, Hsinchu, city, 300, Taiwan
| | - Fergie Joanda Kaunang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, city, 320, Taiwan
| | - Justin Bo-Kai Hsu
- Department of Medical Research, Taipei Medical University Hospital, Taipei, city, 110, Taiwan
| | - Chien-Hsun Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, city, 320, Taiwan. .,Tao-Yuan Hospital, Ministry of Health & Welfare, Taoyuan, 320, Taiwan.
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, city, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| |
Collapse
|
27
|
Su MG, Weng JTY, Hsu JBK, Huang KY, Chi YH, Lee TY. Investigation and identification of functional post-translational modification sites associated with drug binding and protein-protein interactions. BMC SYSTEMS BIOLOGY 2017; 11:132. [PMID: 29322920 PMCID: PMC5763307 DOI: 10.1186/s12918-017-0506-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Background Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM’s function is critical to our ability to manipulate the biological mechanisms of protein. Results In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking tools for exploring the structural characteristics of PTMs, is presented. In addition, all tertiary structures of PTM sites on proteins can be visualized using the JSmol program. Conclusion Resolving the function of PTM sites is important for understanding the role that proteins play in biological mechanisms. Our work attempted to delineate the structural correlation between PTM sites and PPI or drug-target binding. CurxPTM could help scientists narrow the scope of their PTM research and enhance the efficiency of PTM identification in the face of big proteome data. CruxPTM is now available at http://csb.cse.yzu.edu.tw/CruxPTM/. Electronic supplementary material The online version of this article (10.1186/s12918-017-0506-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Min-Gang Su
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Julia Tzu-Ya Weng
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Justin Bo-Kai Hsu
- Department of Medical Research, Taipei Medical University Hospital, Taipei, 110, Taiwan
| | - Kai-Yao Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.,Department of Medical Research, Hsinchu Mackay Memorial Hospital, Hsinchu City, 300, Taiwan
| | - Yu-Hsiang Chi
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| |
Collapse
|
28
|
Carmona R, Jimenez-Quesada MJ, Lima-Cabello E, Traverso JÁ, Castro AJ, Claros MG, de Dios Alché J. S-nitroso- and nitro- proteomes in the olive ( Olea europaea L.) pollen. Predictive versus experimental data by nano-LC-MS. Data Brief 2017; 15:474-477. [PMID: 29062872 PMCID: PMC5647517 DOI: 10.1016/j.dib.2017.09.058] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 07/13/2017] [Accepted: 09/26/2017] [Indexed: 11/27/2022] Open
Abstract
The data presented here are related to the research article entitled "Generation of nitric oxide by olive (Olea europaea L.) pollen during in vitro germination and assessment of the S-nitroso- and nitro-proteomes by computational predictive methods" doi:10.1016/j.niox.2017.06.005 (Jimenez-Quesada et al., 2017) [1]. Predicted cysteine S-nitrosylation and Tyr-nitration sites in proteins derived from a de novo assembled and annotated pollen transcriptome from olive tree (Olea europaea L.) were obtained after using well-established predictive tools in silico. Predictions were performed using both default and highly restrictive thresholds. Numerous gene products identified with these characteristics are listed here. An experimental validation of the data, consisting in nano-LC-MS (Liquid Chromatography-Mass Spectrometry) determination of olive pollen proteins after immunoprecipitation with antibodies to anti-S-nitrosoCys and anti-3-NT (NitroTyrosine) allowed identification of numerous proteins subjected to these two post-translational modifications, which are listed here together with information regarding their cross-presence among the predictions.
Collapse
Affiliation(s)
- Rosario Carmona
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cellular and Molecular Biology of Plants, Estación Experimental del Zaidín (CSIC), Profesor Albareda 1, 18008 Granada, Spain
| | - María José Jimenez-Quesada
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cellular and Molecular Biology of Plants, Estación Experimental del Zaidín (CSIC), Profesor Albareda 1, 18008 Granada, Spain
| | - Elena Lima-Cabello
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cellular and Molecular Biology of Plants, Estación Experimental del Zaidín (CSIC), Profesor Albareda 1, 18008 Granada, Spain
| | - José Ángel Traverso
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cellular and Molecular Biology of Plants, Estación Experimental del Zaidín (CSIC), Profesor Albareda 1, 18008 Granada, Spain
| | - Antonio Jesús Castro
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cellular and Molecular Biology of Plants, Estación Experimental del Zaidín (CSIC), Profesor Albareda 1, 18008 Granada, Spain
| | - M. Gonzalo Claros
- Departamento de Biología Molecular y Bioquímica, Universidad de Málaga, Málaga, Spain
| | - Juan de Dios Alché
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cellular and Molecular Biology of Plants, Estación Experimental del Zaidín (CSIC), Profesor Albareda 1, 18008 Granada, Spain
| |
Collapse
|
29
|
Wang H, Chen X, Li C, Liu Y, Yang F, Wang C. Sequence-Based Prediction of Cysteine Reactivity Using Machine Learning. Biochemistry 2017; 57:451-460. [DOI: 10.1021/acs.biochem.7b00897] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Haobo Wang
- Synthetic
and Functional Biomolecules Center, Beijing National Laboratory for
Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular
Engineering of Ministry of Education, Peking University, Beijing 100871, China
- Department
of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Xuemin Chen
- Synthetic
and Functional Biomolecules Center, Beijing National Laboratory for
Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular
Engineering of Ministry of Education, Peking University, Beijing 100871, China
- Department
of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Can Li
- Department
of Chemical Engineering, Tsinghua University, Beijing 100084, China
- Peking-Tsinghua
Center for Life Sciences, Peking University, Beijing 100871, China
| | - Yuan Liu
- Synthetic
and Functional Biomolecules Center, Beijing National Laboratory for
Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular
Engineering of Ministry of Education, Peking University, Beijing 100871, China
- Department
of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Peking-Tsinghua
Center for Life Sciences, Peking University, Beijing 100871, China
| | - Fan Yang
- Synthetic
and Functional Biomolecules Center, Beijing National Laboratory for
Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular
Engineering of Ministry of Education, Peking University, Beijing 100871, China
- Department
of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Chu Wang
- Synthetic
and Functional Biomolecules Center, Beijing National Laboratory for
Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular
Engineering of Ministry of Education, Peking University, Beijing 100871, China
- Department
of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Peking-Tsinghua
Center for Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
30
|
Weng SL, Kao HJ, Huang CH, Lee TY. MDD-Palm: Identification of protein S-palmitoylation sites with substrate motifs based on maximal dependence decomposition. PLoS One 2017; 12:e0179529. [PMID: 28662047 PMCID: PMC5491019 DOI: 10.1371/journal.pone.0179529] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 05/31/2017] [Indexed: 12/14/2022] Open
Abstract
S-palmitoylation, the covalent attachment of 16-carbon palmitic acids to a cysteine residue via a thioester linkage, is an important reversible lipid modification that plays a regulatory role in a variety of physiological and biological processes. As the number of experimentally identified S-palmitoylated peptides increases, it is imperative to investigate substrate motifs to facilitate the study of protein S-palmitoylation. Based on 710 non-homologous S-palmitoylation sites obtained from published databases and the literature, we carried out a bioinformatics investigation of S-palmitoylation sites based on amino acid composition. Two Sample Logo indicates that positively charged and polar amino acids surrounding S-palmitoylated sites may be associated with the substrate site specificity of protein S-palmitoylation. Additionally, maximal dependence decomposition (MDD) was applied to explore the motif signatures of S-palmitoylation sites by categorizing a large-scale dataset into subgroups with statistically significant conservation of amino acids. Single features such as amino acid composition (AAC), amino acid pair composition (AAPC), position specific scoring matrix (PSSM), position weight matrix (PWM), amino acid substitution matrix (BLOSUM62), and accessible surface area (ASA) were considered, along with the effectiveness of incorporating MDD-identified substrate motifs into a two-layered prediction model. Evaluation by five-fold cross-validation showed that a hybrid of AAC and PSSM performs best at discriminating between S-palmitoylation and non-S-palmitoylation sites, according to the support vector machine (SVM). The two-layered SVM model integrating MDD-identified substrate motifs performed well, with a sensitivity of 0.79, specificity of 0.80, accuracy of 0.80, and Matthews Correlation Coefficient (MCC) value of 0.45. Using an independent testing dataset (613 S-palmitoylated and 5412 non-S-palmitoylated sites) obtained from the literature, we demonstrated that the two-layered SVM model could outperform other prediction tools, yielding a balanced sensitivity and specificity of 0.690 and 0.694, respectively. This two-layered SVM model has been implemented as a web-based system (MDD-Palm), which is now freely available at http://csb.cse.yzu.edu.tw/MDDPalm/.
Collapse
Affiliation(s)
- Shun-Long Weng
- Department of Medicine, Mackay Medical College, New Taipei City, Taiwan
- Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsinchu city, Taiwan
- Mackay Junior College of Medicine, Nursing and Management, Taipei, Taiwan
| | - Hui-Ju Kao
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
| | - Chien-Hsun Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
- Tao-Yuan Hospital, Ministry of Health & Welfare, Taoyuan, Taiwan
- * E-mail: (TYL); (CHH)
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
- Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, Taiwan
- * E-mail: (TYL); (CHH)
| |
Collapse
|
31
|
Zhou C, Liang J, Cheng S, Shi T, Houk KN, Wei DQ, Zhao YL. Ab initio molecular metadynamics simulation for S-nitrosylation by nitric oxide: S-nitroxide as the key intermediate. MOLECULAR SIMULATION 2017. [DOI: 10.1080/08927022.2017.1319059] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Chao Zhou
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, P.R. China
| | - Juan Liang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, P.R. China
| | - Shangli Cheng
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, P.R. China
| | - Ting Shi
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, P.R. China
| | - K. N. Houk
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, P.R. China
| | - Yi-Lei Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, P.R. China
| |
Collapse
|
32
|
Weng SL, Huang KY, Kaunang FJ, Huang CH, Kao HJ, Chang TH, Wang HY, Lu JJ, Lee TY. Investigation and identification of protein carbonylation sites based on position-specific amino acid composition and physicochemical features. BMC Bioinformatics 2017; 18:66. [PMID: 28361707 PMCID: PMC5374553 DOI: 10.1186/s12859-017-1472-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND Protein carbonylation, an irreversible and non-enzymatic post-translational modification (PTM), is often used as a marker of oxidative stress. When reactive oxygen species (ROS) oxidized the amino acid side chains, carbonyl (CO) groups are produced especially on Lysine (K), Arginine (R), Threonine (T), and Proline (P). Nevertheless, due to the lack of information about the carbonylated substrate specificity, we were encouraged to develop a systematic method for a comprehensive investigation of protein carbonylation sites. RESULTS After the removal of redundant data from multipe carbonylation-related articles, totally 226 carbonylated proteins in human are regarded as training dataset, which consisted of 307, 126, 128, and 129 carbonylation sites for K, R, T and P residues, respectively. To identify the useful features in predicting carbonylation sites, the linear amino acid sequence was adopted not only to build up the predictive model from training dataset, but also to compare the effectiveness of prediction with other types of features including amino acid composition (AAC), amino acid pair composition (AAPC), position-specific scoring matrix (PSSM), positional weighted matrix (PWM), solvent-accessible surface area (ASA), and physicochemical properties. The investigation of position-specific amino acid composition revealed that the positively charged amino acids (K and R) are remarkably enriched surrounding the carbonylated sites, which may play a functional role in discriminating between carbonylation and non-carbonylation sites. A variety of predictive models were built using various features and three different machine learning methods. Based on the evaluation by five-fold cross-validation, the models trained with PWM feature could provide better sensitivity in the positive training dataset, while the models trained with AAindex feature achieved higher specificity in the negative training dataset. Additionally, the model trained using hybrid features, including PWM, AAC and AAindex, obtained best MCC values of 0.432, 0.472, 0.443 and 0.467 on K, R, T and P residues, respectively. CONCLUSION When comparing to an existing prediction tool, the selected models trained with hybrid features provided a promising accuracy on an independent testing dataset. In short, this work not only characterized the carbonylated substrate preference, but also demonstrated that the proposed method could provide a feasible means for accelerating preliminary discovery of protein carbonylation.
Collapse
Affiliation(s)
- Shun-Long Weng
- Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsin-Chu, 300, Taiwan.,Mackay Medicine, Nursing and Management College, Taipei, 112, Taiwan.,Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan
| | - Kai-Yao Huang
- Department of Medical Research, Hsinchu Mackay Memorial Hospital, Hsin-Chu, 300, Taiwan.,Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Fergie Joanda Kaunang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Chien-Hsun Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.,Tao-Yuan Hospital, Ministry of Health & Welfare, Taoyuan, 320, Taiwan
| | - Hui-Ju Kao
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Tzu-Hao Chang
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei, 110, Taiwan
| | - Hsin-Yao Wang
- Department of Laboratory Medicine, Chang Gung Memorial Hospital at Linkou, Taoyuan, 333, Taiwan
| | - Jang-Jih Lu
- Department of Laboratory Medicine, Chang Gung Memorial Hospital at Linkou, Taoyuan, 333, Taiwan. .,Department of Medical Biotechnology and Laboratory Science, Chang Gung University, Taoyuan, 333, Taiwan.
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| |
Collapse
|
33
|
Nguyen VN, Huang KY, Huang CH, Lai KR, Lee TY. A New Scheme to Characterize and Identify Protein Ubiquitination Sites. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:393-403. [PMID: 26887002 DOI: 10.1109/tcbb.2016.2520939] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Protein ubiquitination, involving the conjugation of ubiquitin on lysine residue, serves as an important modulator of many cellular functions in eukaryotes. Recent advancements in proteomic technology have stimulated increasing interest in identifying ubiquitination sites. However, most computational tools for predicting ubiquitination sites are focused on small-scale data. With an increasing number of experimentally verified ubiquitination sites, we were motivated to design a predictive model for identifying lysine ubiquitination sites for large-scale proteome dataset. This work assessed not only single features, such as amino acid composition (AAC), amino acid pair composition (AAPC) and evolutionary information, but also the effectiveness of incorporating two or more features into a hybrid approach to model construction. The support vector machine (SVM) was applied to generate the prediction models for ubiquitination site identification. Evaluation by five-fold cross-validation showed that the SVM models learned from the combination of hybrid features delivered a better prediction performance. Additionally, a motif discovery tool, MDDLogo, was adopted to characterize the potential substrate motifs of ubiquitination sites. The SVM models integrating the MDDLogo-identified substrate motifs could yield an average accuracy of 68.70 percent. Furthermore, the independent testing result showed that the MDDLogo-clustered SVM models could provide a promising accuracy (78.50 percent) and perform better than other prediction tools. Two cases have demonstrated the effective prediction of ubiquitination sites with corresponding substrate motifs.
Collapse
|
34
|
Tichá T, Luhová L, Petřivalský M. Functions and Metabolism of S-Nitrosothiols and S-Nitrosylation of Proteins in Plants: The Role of GSNOR. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/978-3-319-40713-5_9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
35
|
Nguyen VN, Huang KY, Weng JTY, Lai KR, Lee TY. UbiNet: an online resource for exploring the functional associations and regulatory networks of protein ubiquitylation. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw054. [PMID: 27114492 PMCID: PMC4843525 DOI: 10.1093/database/baw054] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 03/20/2016] [Indexed: 12/19/2022]
Abstract
Protein ubiquitylation catalyzed by E3 ubiquitin ligases are crucial in the regulation of many cellular processes. Owing to the high throughput of mass spectrometry-based proteomics, a number of methods have been developed for the experimental determination of ubiquitylation sites, leading to a large collection of ubiquitylation data. However, there exist no resources for the exploration of E3-ligase-associated regulatory networks of for ubiquitylated proteins in humans. Therefore, the UbiNet database was developed to provide a full investigation of protein ubiquitylation networks by incorporating experimentally verified E3 ligases, ubiquitylated substrates and protein-protein interactions (PPIs). To date, UbiNet has accumulated 43 948 experimentally verified ubiquitylation sites from 14 692 ubiquitylated proteins of humans. Additionally, we have manually curated 499 E3 ligases as well as two E1 activating and 46 E2 conjugating enzymes. To delineate the regulatory networks among E3 ligases and ubiquitylated proteins, a total of 430 530 PPIs were integrated into UbiNet for the exploration of ubiquitylation networks with an interactive network viewer. A case study demonstrated that UbiNet was able to decipher a scheme for the ubiquitylation of tumor proteins p63 and p73 that is consistent with their functions. Although the essential role of Mdm2 in p53 regulation is well studied, UbiNet revealed that Mdm2 and additional E3 ligases might be implicated in the regulation of other tumor proteins by protein ubiquitylation. Moreover, UbiNet could identify potential substrates for a specific E3 ligase based on PPIs and substrate motifs. With limited knowledge about the mechanisms through which ubiquitylated proteins are regulated by E3 ligases, UbiNet offers users an effective means for conducting preliminary analyses of protein ubiquitylation. The UbiNet database is now freely accessible via http://csb.cse.yzu.edu.tw/UbiNet/ The content is regularly updated with the literature and newly released data.Database URL: http://csb.cse.yzu.edu.tw/UbiNet/.
Collapse
Affiliation(s)
- Van-Nui Nguyen
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan University of Information and Communication Technology, Thai Nguyen University, Vietnam and
| | - Kai-Yao Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan
| | - Julia Tzu-Ya Weng
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan Innovation Center for Big Data and Digital Convergence, Yuan Ze University, 320, Taiwan
| | - K Robert Lai
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan Innovation Center for Big Data and Digital Convergence, Yuan Ze University, 320, Taiwan
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan Innovation Center for Big Data and Digital Convergence, Yuan Ze University, 320, Taiwan
| |
Collapse
|
36
|
Bui VM, Weng SL, Lu CT, Chang TH, Weng JTY, Lee TY. SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites. BMC Genomics 2016; 17 Suppl 1:9. [PMID: 26819243 PMCID: PMC4895302 DOI: 10.1186/s12864-015-2299-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Background Protein S-sulfenylation is a type of post-translational modification (PTM) involving the covalent binding of a hydroxyl group to the thiol of a cysteine amino acid. Recent evidence has shown the importance of S-sulfenylation in various biological processes, including transcriptional regulation, apoptosis and cytokine signaling. Determining the specific sites of S-sulfenylation is fundamental to understanding the structures and functions of S-sulfenylated proteins. However, the current lack of reliable tools often limits researchers to use expensive and time-consuming laboratory techniques for the identification of S-sulfenylation sites. Thus, we were motivated to develop a bioinformatics method for investigating S-sulfenylation sites based on amino acid compositions and physicochemical properties. Results In this work, physicochemical properties were utilized not only to identify S-sulfenylation sites from 1,096 experimentally verified S-sulfenylated proteins, but also to compare the effectiveness of prediction with other characteristics such as amino acid composition (AAC), amino acid pair composition (AAPC), solvent-accessible surface area (ASA), amino acid substitution matrix (BLOSUM62), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM). Various prediction models were built using support vector machine (SVM) and evaluated by five-fold cross-validation. The model constructed from hybrid features, including PSSM and physicochemical properties, yielded the best performance with sensitivity, specificity, accuracy and MCC measurements of 0.746, 0.737, 0.738 and 0.337, respectively. The selected model also provided a promising accuracy (0.693) on an independent testing dataset. Additionally, we employed TwoSampleLogo to help discover the difference of amino acid composition among S-sulfenylation, S-glutathionylation and S-nitrosylation sites. Conclusion This work proposed a computational method to explore informative features and functions for protein S-sulfenylation. Evaluation by five-fold cross validation indicated that the selected features were effective in the identification of S-sulfenylation sites. Moreover, the independent testing results demonstrated that the proposed method could provide a feasible means for conducting preliminary analyses of protein S-sulfenylation. We also anticipate that the uncovered differences in amino acid composition may facilitate future studies of the extensive crosstalk among S-sulfenylation, S-glutathionylation and S-nitrosylation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2299-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Van-Minh Bui
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Shun-Long Weng
- Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsin-Chu, 300, Taiwan. .,Mackay Junior College of Medicine, Nursing and Management, Taipei, 112, Taiwan. .,Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan.
| | - Cheng-Tsung Lu
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Tzu-Hao Chang
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei, 110, Taiwan.
| | - Julia Tzu-Ya Weng
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| |
Collapse
|
37
|
Huang CH, Su MG, Kao HJ, Jhong JH, Weng SL, Lee TY. UbiSite: incorporating two-layered machine learning method with substrate motifs to predict ubiquitin-conjugation site on lysines. BMC SYSTEMS BIOLOGY 2016; 10 Suppl 1:6. [PMID: 26818456 PMCID: PMC4895383 DOI: 10.1186/s12918-015-0246-z] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Background The conjugation of ubiquitin to a substrate protein (protein ubiquitylation), which involves a sequential process – E1 activation, E2 conjugation and E3 ligation, is crucial to the regulation of protein function and activity in eukaryotes. This ubiquitin-conjugation process typically binds the last amino acid of ubiquitin (glycine 76) to a lysine residue of a target protein. The high-throughput of mass spectrometry-based proteomics has stimulated a large-scale identification of ubiquitin-conjugated peptides. Hence, a new web resource, UbiSite, was developed to identify ubiquitin-conjugation site on lysines based on large-scale proteome dataset. Results Given a total of 37,647 ubiquitin-conjugated proteins, including 128026 ubiquitylated peptides, obtained from various resources, this study carries out a large-scale investigation on ubiquitin-conjugation sites based on sequenced and structural characteristics. A TwoSampleLogo reveals that a significant depletion of histidine (H), arginine (R) and cysteine (C) residues around ubiquitylation sites may impact the conjugation of ubiquitins in closed three-dimensional environments. Based on the large-scale ubiquitylation dataset, a motif discovery tool, MDDLogo, has been adopted to characterize the potential substrate motifs for ubiquitin conjugation. Not only are single features such as amino acid composition (AAC), positional weighted matrix (PWM), position-specific scoring matrix (PSSM) and solvent-accessible surface area (SASA) considered, but also the effectiveness of incorporating MDDLogo-identified substrate motifs into a two-layered prediction model is taken into account. Evaluation by five-fold cross-validation showed that PSSM is the best feature in discriminating between ubiquitylation and non-ubiquitylation sites, based on support vector machine (SVM). Additionally, the two-layered SVM model integrating MDDLogo-identified substrate motifs could obtain a promising accuracy and the Matthews Correlation Coefficient (MCC) at 81.06 % and 0.586, respectively. Furthermore, the independent testing showed that the two-layered SVM model could outperform other prediction tools, reaching at 85.10 % sensitivity, 69.69 % specificity, 73.69 % accuracy and the 0.483 of MCC value. Conclusion The independent testing result indicated the effectiveness of incorporating MDDLogo-identified motifs into the prediction of ubiquitylation sites. In order to provide meaningful assistance to researchers interested in large-scale ubiquitinome data, the two-layered SVM model has been implemented onto a web-based system (UbiSite), which is freely available at http://csb.cse.yzu.edu.tw/UbiSite/. Two cases given in the UbiSite provide a demonstration of effective identification of ubiquitylation sites with reference to substrate motifs. Electronic supplementary material The online version of this article (doi:10.1186/s12918-015-0246-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Chien-Hsun Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Ministry of Health & Welfare, Tao-Yuan Hospital, Taoyuan, 320, Taiwan.
| | - Min-Gang Su
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Hui-Ju Kao
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Jhih-Hua Jhong
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Shun-Long Weng
- Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsin-Chu, 300, Taiwan. .,Mackay Junior College of Medicine, Nursing and Management , Taipei, 112, Taiwan. .,Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan.
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| |
Collapse
|
38
|
Huang KY, Weng JTY, Lee TY, Weng SL. A new scheme to discover functional associations and regulatory networks of E3 ubiquitin ligases. BMC SYSTEMS BIOLOGY 2016; 10 Suppl 1:3. [PMID: 26818115 PMCID: PMC4895279 DOI: 10.1186/s12918-015-0244-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Background Protein ubiquitination catalyzed by E3 ubiquitin ligases play important modulatory roles in various biological processes. With the emergence of high-throughput mass spectrometry technology, the proteomics research community embraced the development of numerous experimental methods for the determination of ubiquitination sites. The result is an accumulation of ubiquitinome data, coupled with a lack of available resources for investigating the regulatory networks among E3 ligases and ubiquitinated proteins. In this study, by integrating existing ubiquitinome data, experimentally validated E3 ligases and established protein-protein interactions, we have devised a strategy to construct a comprehensive map of protein ubiquitination networks. Results In total, 41,392 experimentally verified ubiquitination sites from 12,786 ubiquitinated proteins of humans have been obtained for this study. Additional 494 E3 ligases along with 1220 functional annotations and 28588 protein domains were manually curated. To characterize the regulatory networks among E3 ligases and ubiquitinated proteins, a well-established network viewer was utilized for the exploration of ubiquitination networks from 40892 protein-protein interactions. The effectiveness of the proposed approach was demonstrated in a case study examining E3 ligases involved in the ubiquitination of tumor suppressor p53. In addition to Mdm2, a known regulator of p53, the investigation also revealed other potential E3 ligases that may participate in the ubiquitination of p53. Conclusion Aside from the ability to facilitate comprehensive investigations of protein ubiquitination networks, by integrating information regarding protein-protein interactions and substrate specificities, the proposed method could discover potential E3 ligases for ubiquitinated proteins. Our strategy presents an efficient means for the preliminary screen of ubiquitination networks and overcomes the challenge as a result of limited knowledge about E3 ligase-regulated ubiquitination. Electronic supplementary material The online version of this article (doi:10.1186/s12918-015-0244-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kai-Yao Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Julia Tzu-Ya Weng
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 320, Taiwan. .,Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan, 320, Taiwan.
| | - Shun-Long Weng
- Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsin-Chu, 300, Taiwan. .,Mackay Junior College of Medicine, Nursing and Management, Taipei, 112, Taiwan. .,Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan.
| |
Collapse
|
39
|
Huang KY, Su MG, Kao HJ, Hsieh YC, Jhong JH, Cheng KH, Huang HD, Lee TY. dbPTM 2016: 10-year anniversary of a resource for post-translational modification of proteins. Nucleic Acids Res 2015; 44:D435-46. [PMID: 26578568 PMCID: PMC4702878 DOI: 10.1093/nar/gkv1240] [Citation(s) in RCA: 131] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 11/02/2015] [Indexed: 01/23/2023] Open
Abstract
Owing to the importance of the post-translational modifications (PTMs) of proteins in regulating biological processes, the dbPTM (http://dbPTM.mbc.nctu.edu.tw/) was developed as a comprehensive database of experimentally verified PTMs from several databases with annotations of potential PTMs for all UniProtKB protein entries. For this 10th anniversary of dbPTM, the updated resource provides not only a comprehensive dataset of experimentally verified PTMs, supported by the literature, but also an integrative interface for accessing all available databases and tools that are associated with PTM analysis. As well as collecting experimental PTM data from 14 public databases, this update manually curates over 12 000 modified peptides, including the emerging S-nitrosylation, S-glutathionylation and succinylation, from approximately 500 research articles, which were retrieved by text mining. As the number of available PTM prediction methods increases, this work compiles a non-homologous benchmark dataset to evaluate the predictive power of online PTM prediction tools. An increasing interest in the structural investigation of PTM substrate sites motivated the mapping of all experimental PTM peptides to protein entries of Protein Data Bank (PDB) based on database identifier and sequence identity, which enables users to examine spatially neighboring amino acids, solvent-accessible surface area and side-chain orientations for PTM substrate sites on tertiary structures. Since drug binding in PDB is annotated, this update identified over 1100 PTM sites that are associated with drug binding. The update also integrates metabolic pathways and protein-protein interactions to support the PTM network analysis for a group of proteins. Finally, the web interface is redesigned and enhanced to facilitate access to this resource.
Collapse
Affiliation(s)
- Kai-Yao Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Min-Gang Su
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Hui-Ju Kao
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Yun-Chung Hsieh
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Jhih-Hua Jhong
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Kuang-Hao Cheng
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Hsien-Da Huang
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan 320, Taiwan
| |
Collapse
|
40
|
Bui VM, Lu CT, Ho TT, Lee TY. MDD-SOH: exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs. Bioinformatics 2015; 32:165-72. [PMID: 26411868 DOI: 10.1093/bioinformatics/btv558] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Accepted: 09/18/2015] [Indexed: 01/12/2023] Open
Abstract
UNLABELLED S-sulfenylation (S-sulphenylation, or sulfenic acid), the covalent attachment of S-hydroxyl (-SOH) to cysteine thiol, plays a significant role in redox regulation of protein functions. Although sulfenic acid is transient and labile, most of its physiological activities occur under control of S-hydroxylation. Therefore, discriminating the substrate site of S-sulfenylated proteins is an essential task in computational biology for the furtherance of protein structures and functions. Research into S-sulfenylated protein is currently very limited, and no dedicated tools are available for the computational identification of SOH sites. Given a total of 1096 experimentally verified S-sulfenylated proteins from humans, this study carries out a bioinformatics investigation on SOH sites based on amino acid composition and solvent-accessible surface area. A TwoSampleLogo indicates that the positively and negatively charged amino acids flanking the SOH sites may impact the formulation of S-sulfenylation in closed three-dimensional environments. In addition, the substrate motifs of SOH sites are studied using the maximal dependence decomposition (MDD). Based on the concept of binary classification between SOH and non-SOH sites, Support vector machine (SVM) is applied to learn the predictive model from MDD-identified substrate motifs. According to the evaluation results of 5-fold cross-validation, the integrated SVM model learned from substrate motifs yields an average accuracy of 0.87, significantly improving the prediction of SOH sites. Furthermore, the integrated SVM model also effectively improves the predictive performance in an independent testing set. Finally, the integrated SVM model is applied to implement an effective web resource, named MDD-SOH, to identify SOH sites with their corresponding substrate motifs. AVAILABILITY AND IMPLEMENTATION The MDD-SOH is now freely available to all interested users at http://csb.cse.yzu.edu.tw/MDDSOH/. All of the data set used in this work is also available for download in the website. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. CONTACT francis@saturn.yzu.edu.tw.
Collapse
Affiliation(s)
- Van-Minh Bui
- Department of Computer Science and Engineering and
| | | | - Thi-Trang Ho
- Department of Computer Science and Engineering and
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering and Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan 320, Taiwan
| |
Collapse
|
41
|
Chaki M, Shekariesfahlan A, Ageeva A, Mengel A, von Toerne C, Durner J, Lindermayr C. Identification of nuclear target proteins for S-nitrosylation in pathogen-treated Arabidopsis thaliana cell cultures. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2015; 238:115-26. [PMID: 26259180 DOI: 10.1016/j.plantsci.2015.06.011] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 06/05/2015] [Accepted: 06/08/2015] [Indexed: 05/18/2023]
Abstract
Nitric oxide (NO) is a significant signalling molecule involved in the regulation of many different physiological processes in plants. One of the most imperative regulatory modes of action of NO is protein S-nitrosylation--the covalent attachment of an NO group to the sulfur atom of cysteine residues. In this study, we focus on S-nitrosylation of Arabidopsis nuclear proteins after pathogen infection. After treatment of Arabidopsis suspension cell cultures with pathogens, nuclear proteins were extracted and treated with the S-nitrosylating agent S-nitrosoglutathione (GSNO). A biotin switch assay was performed and biotin-labelled proteins were purified by neutravidin affinity chromatography and identified by mass spectrometry. A total of 135 proteins were identified, whereas nuclear localization has been described for 122 proteins of them. 117 of these proteins contain at least one cysteine residue. Most of the S-nitrosylated candidates were involved in protein and RNA metabolism, stress response, and cell organization and division. Interestingly, two plant-specific histone deacetylases were identified suggesting that nitric oxide regulated epigenetic processes in plants. In sum, this work provides a new collection of targets for protein S-nitrosylation in Arabidopsis and gives insight into the regulatory function of NO in the nucleus during plant defense response. Moreover, our data extend the knowledge on the regulatory function of NO in events located in the nucleus.
Collapse
Affiliation(s)
- Mounira Chaki
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Azam Shekariesfahlan
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Alexandra Ageeva
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Alexander Mengel
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Christine von Toerne
- Research Unit Protein Science, Helmholtz Zentrum München-German Research Center for Environmental Health, 85764 Neuherberg, Germany
| | - Jörg Durner
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany; Chair of Biochemical Plant Pathology, Technische Universität München, 85354 Freising, Germany
| | - Christian Lindermayr
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany.
| |
Collapse
|
42
|
Cheng S, Shi T, Wang XL, Liang J, Wu H, Xie L, Li Y, Zhao YL. Features of S-nitrosylation based on statistical analysis and molecular dynamics simulation: cysteine acidity, surrounding basicity, steric hindrance and local flexibility. MOLECULAR BIOSYSTEMS 2015; 10:2597-606. [PMID: 25030274 DOI: 10.1039/c4mb00322e] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
S-Nitrosylation is involved in protein functional regulation and cellular signal transduction. Although intensive efforts have been made, the molecular mechanisms of S-nitrosylation have not yet been fully understood. In this work, we carried out a survey on 213 protein structures with S-nitrosylated cysteine sites and molecular dynamic simulations of hemoglobin as a case study. It was observed that the S-nitrosylated cysteines showed a lower pKa, a higher population of basic residues, a lower population of big-volume residues in the neighborhood, and relatively higher flexibility. The case study of hemoglobin showed that, compared to that in the T-state, Cysβ93 in the R-state hemoglobin possessed the above structural features, in agreement with the previous report that the R-state was more reactive in S-nitrosylation. Moreover, basic residues moved closer to the Cysβ93 in the dep-R-state hemoglobin, while big-volume residues approached the Cysβ93 in the dep-T-state. Using the four characteristics, i.e. cysteine acidity, surrounding basicity, steric hindrance, and local flexibility, a 3-dimensional model of S-nitrosylation was constructed to explain 61.9% of the S-nitrosylated and 58.1% of the non-S-nitrosylated cysteines. Our study suggests that cysteine deprotonation is a prerequisite for protein S-nitrosylation, and these characteristics might be useful in identifying specificity of protein S-nitrosylation.
Collapse
Affiliation(s)
- Shangli Cheng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Akter S, Huang J, Waszczak C, Jacques S, Gevaert K, Van Breusegem F, Messens J. Cysteines under ROS attack in plants: a proteomics view. JOURNAL OF EXPERIMENTAL BOTANY 2015; 66:2935-44. [PMID: 25750420 DOI: 10.1093/jxb/erv044] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Plants generate reactive oxygen species (ROS) as part of their metabolism and in response to various external stress factors, potentially causing significant damage to biomolecules and cell structures. During the course of evolution, plants have adapted to ROS toxicity, and use ROS as signalling messengers that activate defence responses. Cysteine (Cys) residues in proteins are one of the most sensitive targets for ROS-mediated post-translational modifications, and they have become key residues for ROS signalling studies. The reactivity of Cys residues towards ROS, and their ability to react to different oxidation states, allow them to appear at the crossroads of highly dynamic oxidative events. As such, a redox-active cysteine can be present as S-glutathionylated (-SSG), disulfide bonded (S-S), sulfenylated (-SOH), sulfinylated (-SO2H), and sulfonylated (-SO3H). The sulfenic acid (-SOH) form has been considered as part of ROS-sensing pathways, as it leads to further modifications which affect protein structure and function. Redox proteomic studies are required to understand how and why cysteines undergo oxidative post-translational modifications and to identify the ROS-sensor proteins. Here, we update current knowledge of cysteine reactivity with ROS. Further, we give an overview of proteomic techniques that have been applied to identify different redox-modified cysteines in plants. There is a particular focus on the identification of sulfenylated proteins, which have the potential to be involved in plant signal transduction.
Collapse
Affiliation(s)
- Salma Akter
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium Structural Biology Research Centre, VIB, 1050 Brussels, Belgium Brussels Centre for Redox Biology, 1050 Brussels, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, 1050 Brussels, Belgium Faculty of Biological Sciences, University of Dhaka, 1000 Dhaka, Bangladesh
| | - Jingjing Huang
- Structural Biology Research Centre, VIB, 1050 Brussels, Belgium Brussels Centre for Redox Biology, 1050 Brussels, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| | - Cezary Waszczak
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium Structural Biology Research Centre, VIB, 1050 Brussels, Belgium Brussels Centre for Redox Biology, 1050 Brussels, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| | - Silke Jacques
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium Department of Medical Protein Research, VIB, 9000 Gent, Belgium Department of Biochemistry, Ghent University, 9000 Gent, Belgium
| | - Kris Gevaert
- Department of Medical Protein Research, VIB, 9000 Gent, Belgium Department of Biochemistry, Ghent University, 9000 Gent, Belgium
| | - Frank Van Breusegem
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Joris Messens
- Structural Biology Research Centre, VIB, 1050 Brussels, Belgium Brussels Centre for Redox Biology, 1050 Brussels, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| |
Collapse
|
44
|
Nguyen VN, Huang KY, Huang CH, Chang TH, Bretaña N, Lai K, Weng J, Lee TY. Characterization and identification of ubiquitin conjugation sites with E3 ligase recognition specificities. BMC Bioinformatics 2015; 16 Suppl 1:S1. [PMID: 25707307 PMCID: PMC4331700 DOI: 10.1186/1471-2105-16-s1-s1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Background In eukaryotes, ubiquitin-conjugation is an important mechanism underlying proteasome-mediated degradation of proteins, and as such, plays an essential role in the regulation of many cellular processes. In the ubiquitin-proteasome pathway, E3 ligases play important roles by recognizing a specific protein substrate and catalyzing the attachment of ubiquitin to a lysine (K) residue. As more and more experimental data on ubiquitin conjugation sites become available, it becomes possible to develop prediction models that can be scaled to big data. However, no development that focuses on the investigation of ubiquitinated substrate specificities has existed. Herein, we present an approach that exploits an iteratively statistical method to identify ubiquitin conjugation sites with substrate site specificities. Results In this investigation, totally 6259 experimentally validated ubiquitinated proteins were obtained from dbPTM. After having filtered out homologous fragments with 40% sequence identity, the training data set contained 2658 ubiquitination sites (positive data) and 5532 non-ubiquitinated sites (negative data). Due to the difficulty in characterizing the substrate site specificities of E3 ligases by conventional sequence logo analysis, a recursively statistical method has been applied to obtain significant conserved motifs. The profile hidden Markov model (profile HMM) was adopted to construct the predictive models learned from the identified substrate motifs. A five-fold cross validation was then used to evaluate the predictive model, achieving sensitivity, specificity, and accuracy of 73.07%, 65.46%, and 67.93%, respectively. Additionally, an independent testing set, completely blind to the training data of the predictive model, was used to demonstrate that the proposed method could provide a promising accuracy (76.13%) and outperform other ubiquitination site prediction tool. Conclusion A case study demonstrated the effectiveness of the characterized substrate motifs for identifying ubiquitination sites. The proposed method presents a practical means of preliminary analysis and greatly diminishes the total number of potential targets required for further experimental confirmation. This method may help unravel their mechanisms and roles in E3 recognition and ubiquitin-mediated protein degradation.
Collapse
|
45
|
Lamotte O, Bertoldo JB, Besson-Bard A, Rosnoblet C, Aimé S, Hichami S, Terenzi H, Wendehenne D. Protein S-nitrosylation: specificity and identification strategies in plants. Front Chem 2015; 2:114. [PMID: 25750911 PMCID: PMC4285867 DOI: 10.3389/fchem.2014.00114] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2014] [Accepted: 12/08/2014] [Indexed: 12/23/2022] Open
Abstract
The role of nitric oxide (NO) as a major regulator of plant physiological functions has become increasingly evident. To further improve our understanding of its role, within the last few years plant biologists have begun to embrace the exciting opportunity of investigating protein S-nitrosylation, a major reversible NO-dependent post-translational modification (PTM) targeting specific Cys residues and widely studied in animals. Thanks to the development of dedicated proteomic approaches, in particular the use of the biotin switch technique (BST) combined with mass spectrometry, hundreds of plant protein candidates for S-nitrosylation have been identified. Functional studies focused on specific proteins provided preliminary comprehensive views of how this PTM impacts the structure and function of proteins and, more generally, of how NO might regulate biological plant processes. The aim of this review is to detail the basic principle of protein S-nitrosylation, to provide information on the biochemical and structural features of the S-nitrosylation sites and to describe the proteomic strategies adopted to investigate this PTM in plants. Limits of the current approaches and tomorrow's challenges are also discussed.
Collapse
Affiliation(s)
- Olivier Lamotte
- CNRS, UMR 1347 Agroécologie Dijon, France ; ERL CNRS 6300 Dijon, France
| | - Jean B Bertoldo
- Departamento de Bioquímica Centro de Ciências Biológicas, Centro de Biologia Molecular Estrutural, Universidade Federal de Santa Catarina Florianópolis, Brasil
| | - Angélique Besson-Bard
- ERL CNRS 6300 Dijon, France ; Université de Bourgogne, UMR 1347 Agroécologie Dijon, France
| | - Claire Rosnoblet
- ERL CNRS 6300 Dijon, France ; Université de Bourgogne, UMR 1347 Agroécologie Dijon, France
| | - Sébastien Aimé
- ERL CNRS 6300 Dijon, France ; Institut National de la Recherche Agronomique, UMR 1347 Agroécologie Dijon, France
| | - Siham Hichami
- ERL CNRS 6300 Dijon, France ; Université de Bourgogne, UMR 1347 Agroécologie Dijon, France
| | - Hernán Terenzi
- Departamento de Bioquímica Centro de Ciências Biológicas, Centro de Biologia Molecular Estrutural, Universidade Federal de Santa Catarina Florianópolis, Brasil
| | - David Wendehenne
- ERL CNRS 6300 Dijon, France ; Université de Bourgogne, UMR 1347 Agroécologie Dijon, France
| |
Collapse
|
46
|
Wu HY, Lu CT, Kao HJ, Chen YJ, Chen YJ, Lee TY. Characterization and identification of protein O-GlcNAcylation sites with substrate specificity. BMC Bioinformatics 2014; 15 Suppl 16:S1. [PMID: 25521204 PMCID: PMC4290634 DOI: 10.1186/1471-2105-15-s16-s1] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Background Protein O-GlcNAcylation, involving the attachment of single N-acetylglucosamine (GlcNAc) to the hydroxyl group of serine or threonine residues. Elucidation of O-GlcNAcylation sites on proteins is required in order to decipher its crucial roles in regulating cellular processes and aid in drug design. With an increasing number of O-GlcNAcylation sites identified by mass spectrometry (MS)-based proteomics, several methods have been proposed for the computational identification of O-GlcNAcylation sites. However, no development that focuses on the investigation of O-GlcNAcylated substrate motifs has existed. Thus, we were motivated to design a new method for the identification of protein O-GlcNAcylation sites with the consideration of substrate site specificity. Results In this study, 375 experimentally verified O-GlcNAcylation sites were collected from dbOGAP, which is an integrated resource for protein O-GlcNAcylation. Due to the difficulty in characterizing the substrate motifs by conventional sequence logo analysis, a recursively statistical method has been applied to obtain significant conserved motifs. To construct the predictive models learned from the identified substrate motifs, we adopted Support Vector Machines (SVMs). A five-fold cross validation was used to evaluate the predictive model, achieving sensitivity, specificity, and accuracy of 0.76, 0.80, and 0.78, respectively. Additionally, an independent testing set, which was really blind to the training data of predictive model, was used to demonstrate that the proposed method could provide a promising accuracy (0.94) and outperform three other O-GlcNAcylation site prediction tools. Conclusion This work proposed a computational method to identify informative substrate motifs for O-GlcNAcylation sites. The evaluation of cross validation and independent testing indicated that the identified motifs were effective in the identification of O-GlcNAcylation sites. A case study demonstrated that the proposed method could be a feasible means of conducting preliminary analyses of protein O-GlcNAcylation. We also anticipated that the revealed substrate motif may facilitate the study of extensive crosstalk between O-GlcNAcylation and phosphorylation. This method may help unravel their mechanisms and roles in signaling, transcription, chronic disease, and cancer.
Collapse
|
47
|
Chen YJ, Lu CT, Su MG, Huang KY, Ching WC, Yang HH, Liao YC, Chen YJ, Lee TY. dbSNO 2.0: a resource for exploring structural environment, functional and disease association and regulatory network of protein S-nitrosylation. Nucleic Acids Res 2014; 43:D503-11. [PMID: 25399423 PMCID: PMC4383970 DOI: 10.1093/nar/gku1176] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Given the increasing number of proteins reported to be regulated by S-nitrosylation (SNO), it is considered to act, in a manner analogous to phosphorylation, as a pleiotropic regulator that elicits dual effects to regulate diverse pathophysiological processes by altering protein function, stability, and conformation change in various cancers and human disorders. Due to its importance in regulating protein functions and cell signaling, dbSNO (http://dbSNO.mbc.nctu.edu.tw) is extended as a resource for exploring structural environment of SNO substrate sites and regulatory networks of S-nitrosylated proteins. An increasing interest in the structural environment of PTM substrate sites motivated us to map all manually curated SNO peptides (4165 SNO sites within 2277 proteins) to PDB protein entries by sequence identity, which provides the information of spatial amino acid composition, solvent-accessible surface area, spatially neighboring amino acids, and side chain orientation for 298 substrate cysteine residues. Additionally, the annotations of protein molecular functions, biological processes, functional domains and human diseases are integrated to explore the functional and disease associations for S-nitrosoproteome. In this update, users are allowed to search a group of interested proteins/genes and the system reconstructs the SNO regulatory network based on the information of metabolic pathways and protein-protein interactions. Most importantly, an endogenous yet pathophysiological S-nitrosoproteomic dataset from colorectal cancer patients was adopted to demonstrate that dbSNO could discover potential SNO proteins involving in the regulation of NO signaling for cancer pathways.
Collapse
Affiliation(s)
- Yi-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan
| | - Cheng-Tsung Lu
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Min-Gang Su
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Kai-Yao Huang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Wei-Chieh Ching
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei 114, Taiwan
| | - Hsiao-Hsiang Yang
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Yen-Chen Liao
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan Department of Chemistry, National Taiwan University, Taipei 114, Taiwan
| | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan Department of Chemistry, National Taiwan University, Taipei 114, Taiwan
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan 320, Taiwan
| |
Collapse
|
48
|
Chaki M, Kovacs I, Spannagl M, Lindermayr C. Computational prediction of candidate proteins for S-nitrosylation in Arabidopsis thaliana. PLoS One 2014; 9:e110232. [PMID: 25333472 PMCID: PMC4204854 DOI: 10.1371/journal.pone.0110232] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 09/17/2014] [Indexed: 02/04/2023] Open
Abstract
Nitric oxide (NO) is an important signaling molecule that regulates many physiological processes in plants. One of the most important regulatory mechanisms of NO is S-nitrosylation-the covalent attachment of NO to cysteine residues. Although the involvement of cysteine S-nitrosylation in the regulation of protein functions is well established, its substrate specificity remains unknown. Identification of candidates for S-nitrosylation and their target cysteine residues is fundamental for studying the molecular mechanisms and regulatory roles of S-nitrosylation in plants. Several experimental methods that are based on the biotin switch have been developed to identify target proteins for S-nitrosylation. However, these methods have their limits. Thus, computational methods are attracting considerable attention for the identification of modification sites in proteins. Using GPS-SNO version 1.0, a recently developed S-nitrosylation site-prediction program, a set of 16,610 candidate proteins for S-nitrosylation containing 31,900 S-nitrosylation sites was isolated from the entire Arabidopsis proteome using the medium threshold. In the compartments "chloroplast," "CUL4-RING ubiquitin ligase complex," and "membrane" more than 70% of the proteins were identified as candidates for S-nitrosylation. The high number of identified candidates in the proteome reflects the importance of redox signaling in these compartments. An analysis of the functional distribution of the predicted candidates showed that proteins involved in signaling processes exhibited the highest prediction rate. In a set of 46 proteins, where 53 putative S-nitrosylation sites were already experimentally determined, the GPS-SNO program predicted 60 S-nitrosylation sites, but only 11 overlap with the results of the experimental approach. In general, a computer-assisted method for the prediction of targets for S-nitrosylation is a very good tool; however, further development, such as including the three dimensional structure of proteins in such analyses, would improve the identification of S-nitrosylation sites.
Collapse
Affiliation(s)
- Mounira Chaki
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Izabella Kovacs
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Manuel Spannagl
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Christian Lindermayr
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| |
Collapse
|
49
|
Prediction of S-nitrosylation modification sites based on kernel sparse representation classification and mRMR algorithm. BIOMED RESEARCH INTERNATIONAL 2014; 2014:438341. [PMID: 25184139 PMCID: PMC4145740 DOI: 10.1155/2014/438341] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Accepted: 07/23/2014] [Indexed: 12/02/2022]
Abstract
Protein S-nitrosylation plays a very important role in a wide variety of cellular biological activities. Hitherto, accurate prediction of S-nitrosylation sites is still of great challenge. In this paper, we presented a framework to computationally predict S-nitrosylation sites based on kernel sparse representation classification and minimum Redundancy Maximum Relevance algorithm. As much as 666 features derived from five categories of amino acid properties and one protein structure feature are used for numerical representation of proteins. A total of 529 protein sequences collected from the open-access databases and published literatures are used to train and test our predictor. Computational results show that our predictor achieves Matthews' correlation coefficients of 0.1634 and 0.2919 for the training set and the testing set, respectively, which are better than those of k-nearest neighbor algorithm, random forest algorithm, and sparse representation classification algorithm. The experimental results also indicate that 134 optimal features can better represent the peptides of protein S-nitrosylation than the original 666 redundant features. Furthermore, we constructed an independent testing set of 113 protein sequences to evaluate the robustness of our predictor. Experimental result showed that our predictor also yielded good performance on the independent testing set with Matthews' correlation coefficients of 0.2239.
Collapse
|
50
|
Chen YJ, Ching WC, Chen JS, Lee TY, Lu CT, Chou HC, Lin PY, Khoo KH, Chen JH, Chen YJ. Decoding the S-Nitrosoproteomic Atlas in Individualized Human Colorectal Cancer Tissues Using a Label-Free Quantitation Strategy. J Proteome Res 2014; 13:4942-58. [DOI: 10.1021/pr5002675] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Yi-Ju Chen
- Institute
of Chemistry, Academia Sinica, Taipei, Taiwan
- Institute
of Biochemical Sciences, National Taiwan University, Taipei, Taiwan
| | - Wei-Chieh Ching
- Institute
of Chemistry, Academia Sinica, Taipei, Taiwan
- Graduate
Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan
| | - Jinn-Shiun Chen
- Colorectal
Section, Department of Surgery, Chang Gung Memorial Hospital, Taoyuan, Taiwan
- School
of Medicine, Chang Gung University, Taoyuan, Taiwan
| | - Tzong-Yi Lee
- Department
of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
| | - Cheng-Tsung Lu
- Department
of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
| | | | - Pei-Yi Lin
- Institute
of Chemistry, Academia Sinica, Taipei, Taiwan
| | - Kay-Hooi Khoo
- Institute
of Biochemical Sciences, National Taiwan University, Taipei, Taiwan
- Institute
of Biological Chemistry, Academia Sinica, Taipei, Taiwan
| | - Jenn-Han Chen
- Translation
Medicine Lab, Cancer Center, Wan-Fang Hospital, Taipei, Taiwan
| | - Yu-Ju Chen
- Institute
of Chemistry, Academia Sinica, Taipei, Taiwan
| |
Collapse
|