1
|
Pan J, You ZH, Li LP, Huang WZ, Guo JX, Yu CQ, Wang LP, Zhao ZY. DWPPI: A Deep Learning Approach for Predicting Protein–Protein Interactions in Plants Based on Multi-Source Information With a Large-Scale Biological Network. Front Bioeng Biotechnol 2022; 10:807522. [PMID: 35387292 PMCID: PMC8978800 DOI: 10.3389/fbioe.2022.807522] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 02/25/2022] [Indexed: 12/30/2022] Open
Abstract
The prediction of protein–protein interactions (PPIs) in plants is vital for probing the cell function. Although multiple high-throughput approaches in the biological domain have been developed to identify PPIs, with the increasing complexity of PPI network, these methods fall into laborious and time-consuming situations. Thus, it is essential to develop an effective and feasible computational method for the prediction of PPIs in plants. In this study, we present a network embedding-based method, called DWPPI, for predicting the interactions between different plant proteins based on multi-source information and combined with deep neural networks (DNN). The DWPPI model fuses the protein natural language sequence information (attribute information) and protein behavior information to represent plant proteins as feature vectors and finally sends these features to a deep learning–based classifier for prediction. To validate the prediction performance of DWPPI, we performed it on three model plant datasets: Arabidopsis thaliana (A. thaliana), mazie (Zea mays), and rice (Oryza sativa). The experimental results with the fivefold cross-validation technique demonstrated that DWPPI obtains great performance with the AUC (area under ROC curves) values of 0.9548, 0.9867, and 0.9213, respectively. To further verify the predictive capacity of DWPPI, we compared it with some different state-of-the-art machine learning classifiers. Moreover, case studies were performed with the AC149810.2_FGP003 protein. As a result, 14 of the top 20 PPI pairs identified by DWPPI with the highest scores were confirmed by the literature. These excellent results suggest that the DWPPI model can act as a promising tool for related plant molecular biology.
Collapse
Affiliation(s)
- Jie Pan
- School of Information Engineering, Xijing University, Xi’an, China
| | - Zhu-Hong You
- School of Information Engineering, Xijing University, Xi’an, China
| | - Li-Ping Li
- School of Information Engineering, Xijing University, Xi’an, China
- College of Grassland and Environment Science, Xinjiang Agricultural University, Urumqi, China
- *Correspondence: Li-Ping Li, ; Chang-Qing Yu,
| | - Wen-Zhun Huang
- School of Information Engineering, Xijing University, Xi’an, China
| | - Jian-Xin Guo
- School of Information Engineering, Xijing University, Xi’an, China
| | - Chang-Qing Yu
- School of Information Engineering, Xijing University, Xi’an, China
- *Correspondence: Li-Ping Li, ; Chang-Qing Yu,
| | - Li-Ping Wang
- School of Information Engineering, Xijing University, Xi’an, China
| | - Zheng-Yang Zhao
- School of Information Engineering, Xijing University, Xi’an, China
| |
Collapse
|
2
|
Mass Spectrometry- and Computational Structural Biology-Based Investigation of Proteins and Peptides. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2019; 1140:265-287. [PMID: 31347053 DOI: 10.1007/978-3-030-15950-4_15] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Recent developments of mass spectrometry (MS) allow us to identify, estimate, and characterize proteins and protein complexes. At the same time, structural biology helps to determine the protein structure and its structure-function relationship. Together, they aid to understand the protein structure, property, function, protein-complex assembly, protein-protein interaction, and dynamics. The present chapter is organized with illustrative results to demonstrate how experimental mass spectrometry can be combined with computational structural biology for detailed studies of protein's structures. We have used tumor differentiation factor protein/peptide as ligand and Hsp70/Hsp90 as receptor protein as examples to study ligand-protein interaction. To investigate possible protein conformation, we will describe two proteins-lysozyme and myoglobin. As an application of MS-based assignment of disulfide bridges, the case of the spider venom polypeptide Phα1β will also be discussed.
Collapse
|
3
|
Using breast milk to assess breast cancer risk: the role of mass spectrometry-based proteomics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 806:399-408. [PMID: 24952194 DOI: 10.1007/978-3-319-06068-2_19] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Although mammography and treatment advances have led to declines in breast cancer mortality in the United States, breast cancer remains a major cause of morbidity and mortality. Breast cancer in young women is associated with increased mortality and current methods of detecting breast cancers in this group of women have known limitations. Tools for accurately assessing personal breast cancer risk in young women are needed to identify those women who would benefit the most from earlier intervention. Proteomic analysis of breast milk could identify biomarkers of breast cancer risk and provide a tool for identifying women at increased risk. A preliminary analysis of milk from four women provides a proof of concept for using breast milk to assess breast cancer risk.
Collapse
|
4
|
Ngounou Wetie AG, Sokolowska I, Woods AG, Roy U, Deinhardt K, Darie CC. Protein-protein interactions: switch from classical methods to proteomics and bioinformatics-based approaches. Cell Mol Life Sci 2014; 71:205-28. [PMID: 23579629 PMCID: PMC11113707 DOI: 10.1007/s00018-013-1333-1] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2012] [Revised: 03/25/2013] [Accepted: 03/26/2013] [Indexed: 11/28/2022]
Abstract
Following the sequencing of the human genome and many other organisms, research on protein-coding genes and their functions (functional genomics) has intensified. Subsequently, with the observation that proteins are indeed the molecular effectors of most cellular processes, the discipline of proteomics was born. Clearly, proteins do not function as single entities but rather as a dynamic network of team players that have to communicate. Though genetic (yeast two-hybrid Y2H) and biochemical methods (co-immunoprecipitation Co-IP, affinity purification AP) were the methods of choice at the beginning of the study of protein-protein interactions (PPI), in more recent years there has been a shift towards proteomics-based methods and bioinformatics-based approaches. In this review, we first describe in depth PPIs and we make a strong case as to why unraveling the interactome is the next challenge in the field of proteomics. Furthermore, classical methods of investigation of PPIs and structure-based bioinformatics approaches are presented. The greatest emphasis is placed on proteomic methods, especially native techniques that were recently developed and that have been shown to be reliable. Finally, we point out the limitations of these methods and the need to set up a standard for the validation of PPI experiments.
Collapse
Affiliation(s)
- Armand G. Ngounou Wetie
- Department of Chemistry and Biomolecular Science, Biochemistry and Proteomics Group, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Izabela Sokolowska
- Department of Chemistry and Biomolecular Science, Biochemistry and Proteomics Group, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Alisa G. Woods
- Department of Chemistry and Biomolecular Science, Biochemistry and Proteomics Group, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Urmi Roy
- Department of Chemistry and Biomolecular Science, Biochemistry and Proteomics Group, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Katrin Deinhardt
- Centre for Biological Sciences, University of Southampton, Life Sciences Building 85, Southampton, SO17 1BJ UK
- Institute for Life Sciences, University of Southampton, Life Sciences Building 85, Southampton, SO17 1BJ UK
| | - Costel C. Darie
- Department of Chemistry and Biomolecular Science, Biochemistry and Proteomics Group, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| |
Collapse
|
5
|
Sokolowska I, Woods AG, Gawinowicz MA, Roy U, Darie CC. Characterization of tumor differentiation factor (TDF) and its receptor (TDF-R). Cell Mol Life Sci 2013; 70:2835-48. [PMID: 23076253 PMCID: PMC11113447 DOI: 10.1007/s00018-012-1185-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2012] [Revised: 09/27/2012] [Accepted: 10/01/2012] [Indexed: 10/27/2022]
Abstract
Tumor differentiation factor (TDF) is an under-investigated protein produced by the pituitary with no definitive function. TDF is secreted into the bloodstream and targets the breast and prostate, suggesting that it has an endocrine function. Initially, TDF was indirectly discovered based on the differentiation effect of alkaline pituitary extracts of the mammosomatotropic tumor MtTWlO on MTW9/PI rat mammary tumor cells. Years later, the cDNA clone responsible for this differentiation activity was isolated from a human pituitary cDNA library using expression cloning. The cDNA encoded a 108-amino-acid polypeptide that had differentiation activity on MCF7 breast cancer cells and on DU145 prostate cancer cells in vitro and in vivo. Recently, our group focused on identification of the TDF receptor (TDF-R). As potential TDF-R candidates, we identified the members of the Heat Shock 70-kDa family of proteins (HSP70) in both MCF7 and BT-549 human breast cancer cells (HBCC) and PC3, DU145, and LNCaP human prostate cancer cells (HPCC), but not in HeLa cells, NG108 neuroblastoma, or HDF-a and BLK CL.4 cells fibroblasts or fibroblast-like cells. Here we review the current advances on TDF, with particular focus on the structural investigation of its receptor and on its functional effects on breast and prostate cells.
Collapse
Affiliation(s)
- Izabela Sokolowska
- Biochemistry and Proteomics Group, Department of Chemistry and Biomolecular Science, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Alisa G. Woods
- Biochemistry and Proteomics Group, Department of Chemistry and Biomolecular Science, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Mary Ann Gawinowicz
- Protein Core Facility, College of Physicians and Surgeons, Columbia University, 160 Fort Washington Avenue, Room 415, New York, NY 10032 USA
| | - Urmi Roy
- Biochemistry and Proteomics Group, Department of Chemistry and Biomolecular Science, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| | - Costel C. Darie
- Biochemistry and Proteomics Group, Department of Chemistry and Biomolecular Science, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5810 USA
| |
Collapse
|
6
|
Ngounou Wetie AG, Sokolowska I, Woods AG, Wormwood KL, Dao S, Patel S, Clarkson BD, Darie CC. Automated Mass Spectrometry–Based Functional Assay for the Routine Analysis of the Secretome. ACTA ACUST UNITED AC 2013; 18:19-29. [DOI: 10.1177/2211068212454738] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|