1
|
Al-gafari M, Jagadeesan SK, Kazmirchuk TDD, Takallou S, Wang J, Hajikarimlou M, Ramessur NB, Darwish W, Bradbury-Jost C, Moteshareie H, Said KB, Samanfar B, Golshani A. Investigating the Activities of CAF20 and ECM32 in the Regulation of PGM2 mRNA Translation. BIOLOGY 2024; 13:884. [PMID: 39596839 PMCID: PMC11592143 DOI: 10.3390/biology13110884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 10/17/2024] [Accepted: 10/22/2024] [Indexed: 11/29/2024]
Abstract
Translation is a fundamental process in biology, and understanding its mechanisms is crucial to comprehending cellular functions and diseases. The regulation of this process is closely linked to the structure of mRNA, as these regions prove vital to modulating translation efficiency and control. Thus, identifying and investigating these fundamental factors that influence the processing and unwinding of structured mRNAs would be of interest due to the widespread impact in various fields of biology. To this end, we employed a computational approach and identified genes that may be involved in the translation of structured mRNAs. The approach is based on the enrichment of interactions and co-expression of genes with those that are known to influence translation and helicase activity. The in silico prediction found CAF20 and ECM32 to be highly ranked candidates that may play a role in unwinding mRNA. The activities of neither CAF20 nor ECM32 have previously been linked to the translation of PGM2 mRNA or other structured mRNAs. Our follow-up investigations with these two genes provided evidence of their participation in the translation of PGM2 mRNA and several other synthetic structured mRNAs.
Collapse
Affiliation(s)
- Mustafa Al-gafari
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Sasi Kumar Jagadeesan
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Thomas David Daniel Kazmirchuk
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Sarah Takallou
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Jiashu Wang
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Maryam Hajikarimlou
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Nishka Beersing Ramessur
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
| | - Waleed Darwish
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
| | - Calvin Bradbury-Jost
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Houman Moteshareie
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
- Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, ON K1A 0K9, Canada
| | - Kamaledin B. Said
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Department of Pathology and Microbiology, College of Medicine, University of Hail, Hail P.O. Box 2240, Saudi Arabia
| | - Bahram Samanfar
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre (ORDC), Ottawa, ON K1A 0C6, Canada
| | - Ashkan Golshani
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| |
Collapse
|
2
|
Fu A, Kazmirchuk TDD, Bradbury-Jost C, Golshani A, Othman M. Platelet-Type von Willebrand Disease: Complex Pathophysiology and Insights on Novel Therapeutic and Diagnostic Strategies. Semin Thromb Hemost 2024. [PMID: 39191406 DOI: 10.1055/s-0044-1789183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2024]
Abstract
von Willebrand disease (VWD) is the most common well-studied genetic bleeding disorder worldwide. Much less is known about platelet-type VWD (PT-VWD), a rare platelet function defect, and a "nonidentical" twin bleeding phenotype to type 2B VWD (2B-VWD). Rather than a defect in the von Willebrand factor (VWF) gene, PT-VWD is caused by a platelet GP1BA mutation leading to a hyperaffinity of the glycoprotein Ibα (GPIbα) platelet surface receptor for VWF, and thus increased platelet clearing and high-molecular-weight VWF multimer elimination. Nine GP1BA gene mutations are known. It is historically believed that this enhanced binding was enabled by the β-switch region of GPIbα adopting an extended β-hairpin form. Recent evidence suggests the pathological conformation that destabilizes the compact triangular form of the R-loop-the GPIbα protein's region for VWF binding. PT-VWD is often misdiagnosed as 2B-VWD, even the though distinction between the two is crucial for proper treatment, as the former requires platelet transfusions, while the latter requires VWF/FVIII concentrate administration. Nevertheless, these PT-VWD treatments remain unsatisfactory, owing to their high cost, low availability, risk of alloimmunity, and the need to carefully balance platelet administration. Antibodies such as 6B4 remain undependable as an alternative therapy due to their questionable efficacy and high costs for this purpose. On the other hand, synthetic peptide therapeutics developed with In-Silico Protein Synthesizer to disrupt the association between GPIbα and VWF show preliminary promise as a therapy based on in vitro experiments. Such peptides could serve as an effective diagnostic technology for discriminating between 2B-VWD and PT-VWD, or potentially all forms of VWD, based on their high specificity. This field is rapidly growing and the current review sheds light on the complex pathology and some novel potential therapeutic and diagnostic strategies.
Collapse
Affiliation(s)
- Anne Fu
- Department of Biomedical and Molecular Sciences, School of Medicine, Queen's University, Kingston, Ontario, Canada
| | - Thomas D D Kazmirchuk
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, Ontario, Canada
| | - Calvin Bradbury-Jost
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, Ontario, Canada
| | - Ashkan Golshani
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, Ontario, Canada
| | - Maha Othman
- Department of Biomedical and Molecular Sciences, School of Medicine, Queen's University, Kingston, Ontario, Canada
- School of Baccalaureate Nursing, St. Lawrence College, Kingston, Ontario, Canada
- Department of Clinical Pathology, Faculty of Medicine, Mansoura University, Mansoura City, Egypt
| |
Collapse
|
3
|
Kazmirchuk TDD, Bradbury-Jost C, Withey TA, Gessese T, Azad T, Samanfar B, Dehne F, Golshani A. Peptides of a Feather: How Computation Is Taking Peptide Therapeutics under Its Wing. Genes (Basel) 2023; 14:1194. [PMID: 37372372 PMCID: PMC10298604 DOI: 10.3390/genes14061194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 05/24/2023] [Accepted: 05/26/2023] [Indexed: 06/29/2023] Open
Abstract
Leveraging computation in the development of peptide therapeutics has garnered increasing recognition as a valuable tool to generate novel therapeutics for disease-related targets. To this end, computation has transformed the field of peptide design through identifying novel therapeutics that exhibit enhanced pharmacokinetic properties and reduced toxicity. The process of in-silico peptide design involves the application of molecular docking, molecular dynamics simulations, and machine learning algorithms. Three primary approaches for peptide therapeutic design including structural-based, protein mimicry, and short motif design have been predominantly adopted. Despite the ongoing progress made in this field, there are still significant challenges pertaining to peptide design including: enhancing the accuracy of computational methods; improving the success rate of preclinical and clinical trials; and developing better strategies to predict pharmacokinetics and toxicity. In this review, we discuss past and present research pertaining to the design and development of in-silico peptide therapeutics in addition to highlighting the potential of computation and artificial intelligence in the future of disease therapeutics.
Collapse
Affiliation(s)
- Thomas David Daniel Kazmirchuk
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Calvin Bradbury-Jost
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Taylor Ann Withey
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Tadesse Gessese
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Taha Azad
- Department of Microbiology and Infectious Diseases, Université de Sherbrooke, Sherbrooke, QC J1E 4K8, Canada
- Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CHUS), Sherbrooke, QC J1H 5N4, Canada
| | - Bahram Samanfar
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, ON K1S 5B6, Canada
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre (ORDC), Ottawa, ON K1A 0C6, Canada
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Ashkan Golshani
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, ON K1S 5B6, Canada
| |
Collapse
|
4
|
Huang Y, Wuchty S, Zhou Y, Zhang Z. SGPPI: structure-aware prediction of protein-protein interactions in rigorous conditions with graph convolutional network. Brief Bioinform 2023; 24:6995378. [PMID: 36682013 DOI: 10.1093/bib/bbad020] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 11/17/2022] [Accepted: 01/05/2023] [Indexed: 01/23/2023] Open
Abstract
While deep learning (DL)-based models have emerged as powerful approaches to predict protein-protein interactions (PPIs), the reliance on explicit similarity measures (e.g. sequence similarity and network neighborhood) to known interacting proteins makes these methods ineffective in dealing with novel proteins. The advent of AlphaFold2 presents a significant opportunity and also a challenge to predict PPIs in a straightforward way based on monomer structures while controlling bias from protein sequences. In this work, we established Structure and Graph-based Predictions of Protein Interactions (SGPPI), a structure-based DL framework for predicting PPIs, using the graph convolutional network. In particular, SGPPI focused on protein patches on the protein-protein binding interfaces and extracted the structural, geometric and evolutionary features from the residue contact map to predict PPIs. We demonstrated that our model outperforms traditional machine learning methods and state-of-the-art DL-based methods using non-representation-bias benchmark datasets. Moreover, our model trained on human dataset can be reliably transferred to predict yeast PPIs, indicating that SGPPI can capture converging structural features of protein interactions across various species. The implementation of SGPPI is available at https://github.com/emerson106/SGPPI.
Collapse
Affiliation(s)
- Yan Huang
- State Key Laboratory of Livestock and Poultry Biotechnology Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
- Department of Biomedical Informatics, Ministry of Education Key Laboratory of Molecular Cardiovascular Sciences, Center for Non-Coding RNA Medicine, School of Basic Medical Sciences, Peking University, Beijing 100191, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33146, USA
- Department of Biology, University of Miami, Coral Gables, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
- Institute of Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA
| | - Yuan Zhou
- Department of Biomedical Informatics, Ministry of Education Key Laboratory of Molecular Cardiovascular Sciences, Center for Non-Coding RNA Medicine, School of Basic Medical Sciences, Peking University, Beijing 100191, China
| | - Ziding Zhang
- State Key Laboratory of Livestock and Poultry Biotechnology Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
5
|
Bandyopadhyay SS, Halder AK, Saha S, Chatterjee P, Nasipuri M, Basu S. Assessment of GO-Based Protein Interaction Affinities in the Large-Scale Human-Coronavirus Family Interactome. Vaccines (Basel) 2023; 11:549. [PMID: 36992133 PMCID: PMC10059867 DOI: 10.3390/vaccines11030549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 02/19/2023] [Accepted: 02/23/2023] [Indexed: 03/03/2023] Open
Abstract
SARS-CoV-2 is a novel coronavirus that replicates itself via interacting with the host proteins. As a result, identifying virus and host protein-protein interactions could help researchers better understand the virus disease transmission behavior and identify possible COVID-19 drugs. The International Committee on Virus Taxonomy has determined that nCoV is genetically 89% compared to the SARS-CoV epidemic in 2003. This paper focuses on assessing the host-pathogen protein interaction affinity of the coronavirus family, having 44 different variants. In light of these considerations, a GO-semantic scoring function is provided based on Gene Ontology (GO) graphs for determining the binding affinity of any two proteins at the organism level. Based on the availability of the GO annotation of the proteins, 11 viral variants, viz., SARS-CoV-2, SARS, MERS, Bat coronavirus HKU3, Bat coronavirus Rp3/2004, Bat coronavirus HKU5, Murine coronavirus, Bovine coronavirus, Rat coronavirus, Bat coronavirus HKU4, Bat coronavirus 133/2005, are considered from 44 viral variants. The fuzzy scoring function of the entire host-pathogen network has been processed with ~180 million potential interactions generated from 19,281 host proteins and around 242 viral proteins. ~4.5 million potential level one host-pathogen interactions are computed based on the estimated interaction affinity threshold. The resulting host-pathogen interactome is also validated with state-of-the-art experimental networks. The study has also been extended further toward the drug-repurposing study by analyzing the FDA-listed COVID drugs.
Collapse
Affiliation(s)
- Soumyendu Sekhar Bandyopadhyay
- Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, India
- Department of Computer Science and Engineering, School of Engineering and Technology, Adamas University, Kolkata 700126, India
| | - Anup Kumar Halder
- Faculty of Mathematics and Information Sciences, Warsaw University of Technology, 00-662 Warsaw, Poland
| | - Sovan Saha
- Department of Computer Science and Engineering (Artificial Intelligence and Machine Learning), Techno Main Salt Lake, Sector V, Kolkata 700091, India
| | - Piyali Chatterjee
- Department of Computer Science and Engineering, Netaji Subhash Engineering College, Kolkata 700152, India
| | - Mita Nasipuri
- Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, India
| | - Subhadip Basu
- Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, India
| |
Collapse
|
6
|
Murakami Y, Mizuguchi K. Recent developments of sequence-based prediction of protein-protein interactions. Biophys Rev 2022; 14:1393-1411. [PMID: 36589735 PMCID: PMC9789376 DOI: 10.1007/s12551-022-01038-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/08/2022] [Indexed: 12/25/2022] Open
Abstract
The identification of protein-protein interactions (PPIs) can lead to a better understanding of cellular functions and biological processes of proteins and contribute to the design of drugs to target disease-causing PPIs. In addition, targeting host-pathogen PPIs is useful for elucidating infection mechanisms. Although several experimental methods have been used to identify PPIs, these methods can yet to draw complete PPI networks. Hence, computational techniques are increasingly required for the prediction of potential PPIs, which have never been seen experimentally. Recent high-performance sequence-based methods have contributed to the construction of PPI networks and the elucidation of pathogenetic mechanisms in specific diseases. However, the usefulness of these methods depends on the quality and quantity of training data of PPIs. In this brief review, we introduce currently available PPI databases and recent sequence-based methods for predicting PPIs. Also, we discuss key issues in this field and present future perspectives of the sequence-based PPI predictions.
Collapse
Affiliation(s)
- Yoichi Murakami
- grid.440890.10000 0004 0640 9413Tokyo University of Information Sciences, 4-1 Onaridai, Wakaba-Ku, Chiba, 265-8501 Japan
| | - Kenji Mizuguchi
- grid.136593.b0000 0004 0373 3971Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita-Shi, Osaka, 565-0871 Japan ,grid.482562.fNational Institutes of Biomedical Innovation, Health and Nutrition, 7-6-8 Saito Asagi, Ibaraki, Osaka 567-0085 Japan
| |
Collapse
|
7
|
Hajikarimlou M, Hooshyar M, Moutaoufik M, Aly K, Azad T, Takallou S, Jagadeesan S, Phanse S, Said K, Samanfar B, Bell J, Dehne F, Babu M, Golshani A. A computational approach to rapidly design peptides that detect SARS-CoV-2 surface protein S. NAR Genom Bioinform 2022; 4:lqac058. [PMID: 36004308 PMCID: PMC9394169 DOI: 10.1093/nargab/lqac058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 06/10/2022] [Accepted: 08/01/2022] [Indexed: 11/12/2022] Open
Abstract
Abstract
The coronavirus disease 19 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) prompted the development of diagnostic and therapeutic frameworks for timely containment of this pandemic. Here, we utilized our non-conventional computational algorithm, InSiPS, to rapidly design and experimentally validate peptides that bind to SARS-CoV-2 spike (S) surface protein. We previously showed that this method can be used to develop peptides against yeast proteins, however, the applicability of this method to design peptides against other proteins has not been investigated. In the current study, we demonstrate that two sets of peptides developed using InSiPS method can detect purified SARS-CoV-2 S protein via ELISA and Surface Plasmon Resonance (SPR) approaches, suggesting the utility of our strategy in real time COVID-19 diagnostics. Mass spectrometry-based salivary peptidomics shortlist top SARS-CoV-2 peptides detected in COVID-19 patients’ saliva, rendering them attractive SARS-CoV-2 diagnostic targets that, when subjected to our computational platform, can streamline the development of potent peptide diagnostics of SARS-CoV-2 variants of concern. Our approach can be rapidly implicated in diagnosing other communicable diseases of immediate threat.
Collapse
Affiliation(s)
- Maryam Hajikarimlou
- Ottawa Institute of Systems Biology, University of Ottawa , Health Science Campus, Ottawa , Ontario , Canada
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
| | - Mohsen Hooshyar
- Ottawa Institute of Systems Biology, University of Ottawa , Health Science Campus, Ottawa , Ontario , Canada
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
| | - Mohamed Taha Moutaoufik
- Department of Biochemistry, Research and Innovation Centre, University of Regina , Regina , Canada
| | - Khaled A Aly
- Department of Biochemistry, Research and Innovation Centre, University of Regina , Regina , Canada
| | - Taha Azad
- The Ottawa Hospital Research Institute 501 Smyth Road , Ottawa , Ontario , Canada
| | - Sarah Takallou
- Ottawa Institute of Systems Biology, University of Ottawa , Health Science Campus, Ottawa , Ontario , Canada
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
| | - Sasi Jagadeesan
- Ottawa Institute of Systems Biology, University of Ottawa , Health Science Campus, Ottawa , Ontario , Canada
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
| | - Sadhna Phanse
- Department of Biochemistry, Research and Innovation Centre, University of Regina , Regina , Canada
| | - Kamaledin B Said
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
- Department of Pathology and Microbiology, College of Medicine, University of Hail , Saudi Arabia
| | - Bahram Samanfar
- Ottawa Institute of Systems Biology, University of Ottawa , Health Science Campus, Ottawa , Ontario , Canada
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre (ORDC) , Ottawa , Ontario , Canada
| | - John C Bell
- The Ottawa Hospital Research Institute 501 Smyth Road , Ottawa , Ontario , Canada
| | - Frank Dehne
- School of Computer Science, Carleton University , Ottawa , Ontario , Canada
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina , Regina , Canada
| | - Ashkan Golshani
- Ottawa Institute of Systems Biology, University of Ottawa , Health Science Campus, Ottawa , Ontario , Canada
- Department of Biology, Carleton University , Ottawa , Ontario , Canada
| |
Collapse
|
8
|
Dick K, Pattang A, Hooker J, Nissan N, Sadowski M, Barnes B, Tan LH, Burnside D, Phanse S, Aoki H, Babu M, Dehne F, Golshani A, Cober ER, Green JR, Samanfar B. Human-Soybean Allergies: Elucidation of the Seed Proteome and Comprehensive Protein-Protein Interaction Prediction. J Proteome Res 2021; 20:4925-4947. [PMID: 34582199 DOI: 10.1021/acs.jproteome.1c00138] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The soybean crop, Glycine max (L.) Merr., is consumed by humans, Homo sapiens, worldwide. While the respective bodies of literature and -omics data for each of these organisms are extensive, comparatively few studies investigate the molecular biological processes occurring between the two. We are interested in elucidating the network of protein-protein interactions (PPIs) involved in human-soybean allergies. To this end, we leverage state-of-the-art sequence-based PPI predictors amenable to predicting the enormous comprehensive interactome between human and soybean. A network-based analytical approach is proposed, leveraging similar interaction profiles to identify candidate allergens and proteins involved in the allergy response. Interestingly, the predicted interactome can be explored from two complementary perspectives: which soybean proteins are predicted to interact with specific human proteins and which human proteins are predicted to interact with specific soybean proteins. A total of eight proteins (six specific to the human proteome and two to the soy proteome) have been identified and supported by the literature to be involved in human health, specifically related to immunological and neurological pathways. This study, beyond generating the most comprehensive human-soybean interactome to date, elucidated a soybean seed interactome and identified several proteins putatively consequential to human health.
Collapse
Affiliation(s)
- Kevin Dick
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Arezo Pattang
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Julia Hooker
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Nour Nissan
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Michael Sadowski
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Bradley Barnes
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Le Hoa Tan
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Daniel Burnside
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Sadhna Phanse
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada S4S 0A2
| | - Hiroyuki Aoki
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada S4S 0A2
| | - Mohan Babu
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada S4S 0A2
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Ashkan Golshani
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Elroy R Cober
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| | - Bahram Samanfar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, Ontario, Canada K1A 0C6
- Department of Biology and Institute of Biochemistry, and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada K1S 5B6
| |
Collapse
|
9
|
Dick K, Chopra A, Biggar KK, Green JR. Multi-schema computational prediction of the comprehensive SARS-CoV-2 vs. human interactome. PeerJ 2021; 9:e11117. [PMID: 33868814 PMCID: PMC8029698 DOI: 10.7717/peerj.11117] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 02/24/2021] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Understanding the disease pathogenesis of the novel coronavirus, denoted SARS-CoV-2, is critical to the development of anti-SARS-CoV-2 therapeutics. The global propagation of the viral disease, denoted COVID-19 ("coronavirus disease 2019"), has unified the scientific community in searching for possible inhibitory small molecules or polypeptides. A holistic understanding of the SARS-CoV-2 vs. human inter-species interactome promises to identify putative protein-protein interactions (PPI) that may be considered targets for the development of inhibitory therapeutics. METHODS We leverage two state-of-the-art, sequence-based PPI predictors (PIPE4 & SPRINT) capable of generating the comprehensive SARS-CoV-2 vs. human interactome, comprising approximately 285,000 pairwise predictions. Three prediction schemas (all, proximal, RP-PPI) are leveraged to obtain our highest-confidence subset of PPIs and human proteins predicted to interact with each of the 14 SARS-CoV-2 proteins considered in this study. Notably, the use of the Reciprocal Perspective (RP) framework demonstrates improved predictive performance in multiple cross-validation experiments. RESULTS The all schema identified 279 high-confidence putative interactions involving 225 human proteins, the proximal schema identified 129 high-confidence putative interactions involving 126 human proteins, and the RP-PPI schema identified 539 high-confidence putative interactions involving 494 human proteins. The intersection of the three sets of predictions comprise the seven highest-confidence PPIs. Notably, the Spike-ACE2 interaction was the highest ranked for both the PIPE4 and SPRINT predictors with the all and proximal schemas, corroborating existing evidence for this PPI. Several other predicted PPIs are biologically relevant within the context of the original SARS-CoV virus. Furthermore, the PIPE-Sites algorithm was used to identify the putative subsequence that might mediate each interaction and thereby inform the design of inhibitory polypeptides intended to disrupt the corresponding host-pathogen interactions. CONCLUSION We publicly released the comprehensive sets of PPI predictions and their corresponding PIPE-Sites landscapes in the following DataVerse repository: https://www.doi.org/10.5683/SP2/JZ77XA. The information provided represents theoretical modeling only and caution should be exercised in its use. It is intended as a resource for the scientific community at large in furthering our understanding of SARS-CoV-2.
Collapse
Affiliation(s)
- Kevin Dick
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Ontario, Canada
- Institute for Data Science, Carleton University, Ottawa, Ontario, Canada
| | - Anand Chopra
- Institute of Biochemistry, Carleton University, Ottawa, Ontario, Canada
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
| | - Kyle K. Biggar
- Institute of Biochemistry, Carleton University, Ottawa, Ontario, Canada
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
| | - James R. Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Ontario, Canada
- Institute for Data Science, Carleton University, Ottawa, Ontario, Canada
| |
Collapse
|
10
|
Annie Lee ES, Zhou P, Wong AKC. WeMine Aligned Pattern Clustering System for Biosequence Pattern Analysis. Bioinformatics 2021. [DOI: 10.36255/exonpublications.bioinformatics.2021.ch8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
11
|
Li Z, Nie R, You Z, Cao C, Li J. Using discriminative vector machine model with 2DPCA to predict interactions among proteins. BMC Bioinformatics 2019; 20:694. [PMID: 31874626 PMCID: PMC6929273 DOI: 10.1186/s12859-019-3268-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Background The interactions among proteins act as crucial roles in most cellular processes. Despite enormous effort put for identifying protein-protein interactions (PPIs) from a large number of organisms, existing firsthand biological experimental methods are high cost, low efficiency, and high false-positive rate. The application of in silico methods opens new doors for predicting interactions among proteins, and has been attracted a great deal of attention in the last decades. Results Here we present a novelty computational model with the adoption of our proposed Discriminative Vector Machine (DVM) model and a 2-Dimensional Principal Component Analysis (2DPCA) descriptor to identify candidate PPIs only based on protein sequences. To be more specific, a 2DPCA descriptor is employed to capture discriminative feature information from Position-Specific Scoring Matrix (PSSM) of amino acid sequences by the tool of PSI-BLAST. Then, a robust and powerful DVM classifier is employed to infer PPIs. When applied on both gold benchmark datasets of Yeast and H. pylori, our model obtained mean prediction accuracies as high as of 97.06 and 92.89%, respectively, which demonstrates a noticeable improvement than some state-of-the-art methods. Moreover, we constructed Support Vector Machines (SVM) based predictive model and made comparison it with our model on Human benchmark dataset. In addition, to further demonstrate the predictive reliability of our proposed method, we also carried out extensive experiments for identifying cross-species PPIs on five other species datasets. Conclusions All the experimental results indicate that our method is very effective for identifying potential PPIs and could serve as a practical approach to aid bioexperiment in proteomics research.
Collapse
Affiliation(s)
- Zhengwei Li
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China.,Mine Digitization Engineering Research Center of Ministry of Education, China University of Mining and Technology, Xuzhou, 221116, China.,Institute of Machine Learning and Systems Biology, College of Electronics and Information Engineering, Tongji University, Shanghai, 201804, China.,KUNPAND Communications (Kunshan) Co., Ltd., Suzhou, 215300, China
| | - Ru Nie
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China. .,Mine Digitization Engineering Research Center of Ministry of Education, China University of Mining and Technology, Xuzhou, 221116, China.
| | - Zhuhong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi, 830011, China
| | - Chen Cao
- Departments of Biochemistry & Molecular Biology and Medical Genetics, Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, T2N 4N1, Canada
| | - Jiashu Li
- Mine Digitization Engineering Research Center of Ministry of Education, China University of Mining and Technology, Xuzhou, 221116, China.
| |
Collapse
|
12
|
Hashemifar S, Neyshabur B, Khan AA, Xu J. Predicting protein-protein interactions through sequence-based deep learning. Bioinformatics 2019; 34:i802-i810. [PMID: 30423091 DOI: 10.1093/bioinformatics/bty573] [Citation(s) in RCA: 177] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Motivation High-throughput experimental techniques have produced a large amount of protein-protein interaction (PPI) data, but their coverage is still low and the PPI data is also very noisy. Computational prediction of PPIs can be used to discover new PPIs and identify errors in the experimental PPI data. Results We present a novel deep learning framework, DPPI, to model and predict PPIs from sequence information alone. Our model efficiently applies a deep, Siamese-like convolutional neural network combined with random projection and data augmentation to predict PPIs, leveraging existing high-quality experimental PPI data and evolutionary information of a protein pair under prediction. Our experimental results show that DPPI outperforms the state-of-the-art methods on several benchmarks in terms of area under precision-recall curve (auPR), and computationally is more efficient. We also show that DPPI is able to predict homodimeric interactions where other methods fail to work accurately, and the effectiveness of DPPI in specific applications such as predicting cytokine-receptor binding affinities. Availability and implementation Predicting protein-protein interactions through sequence-based deep learning): https://github.com/hashemifar/DPPI/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Aly A Khan
- Toyota Technological Institute at Chicago, Chicago, IL, USA
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, Chicago, IL, USA
| |
Collapse
|
13
|
Burnside D, Schoenrock A, Moteshareie H, Hooshyar M, Basra P, Hajikarimlou M, Dick K, Barnes B, Kazmirchuk T, Jessulat M, Pitre S, Samanfar B, Babu M, Green JR, Wong A, Dehne F, Biggar KK, Golshani A. In Silico Engineering of Synthetic Binding Proteins from Random Amino Acid Sequences. iScience 2018; 11:375-387. [PMID: 30660105 PMCID: PMC6348295 DOI: 10.1016/j.isci.2018.11.038] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Revised: 10/19/2018] [Accepted: 11/28/2018] [Indexed: 12/29/2022] Open
Abstract
Synthetic proteins with high affinity and selectivity for a protein target can be used as research tools, biomarkers, and pharmacological agents, but few methods exist to design such proteins de novo. To this end, the In-Silico Protein Synthesizer (InSiPS) was developed to design synthetic binding proteins (SBPs) that bind pre-determined targets while minimizing off-target interactions. InSiPS is a genetic algorithm that refines a pool of random sequences over hundreds of generations of mutation and selection to produce SBPs with pre-specified binding characteristics. As a proof of concept, we design SBPs against three yeast proteins and demonstrate binding and functional inhibition of two of three targets in vivo. Peptide SPOT arrays confirm binding sites, and a permutation array demonstrates target specificity. Our foundational approach will support the field of de novo design of small binding polypeptide motifs and has robust applicability while offering potential advantages over the limited number of techniques currently available. InSiPS engineers synthetic binding proteins (SBPs) using primary protein sequence SBPs are designed to a bind a target protein and avoid “off-target” interactions Binding and functional inhibition of two of three target proteins in yeast is demonstrated Our new approach offers advantages over alternative tools that rely on 3D models
Collapse
Affiliation(s)
- Daniel Burnside
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Andrew Schoenrock
- School of Computer Science, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Houman Moteshareie
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Mohsen Hooshyar
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Prabh Basra
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Maryam Hajikarimlou
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Kevin Dick
- Department of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Brad Barnes
- School of Computer Science, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Tom Kazmirchuk
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Matthew Jessulat
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, SK S4S 0A2, Canada
| | - Sylvain Pitre
- School of Computer Science, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Bahram Samanfar
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Ottawa Research and Development Centre (ORDC), Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C5, Canada
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, SK S4S 0A2, Canada
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Alex Wong
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Kyle K Biggar
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Institute of Biochemistry, Carleton University, Ottawa, ON K1S5B6, Canada
| | - Ashkan Golshani
- Department of Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S5B6, Canada; Institute of Biochemistry, Carleton University, Ottawa, ON K1S5B6, Canada.
| |
Collapse
|
14
|
Reciprocal Perspective for Improved Protein-Protein Interaction Prediction. Sci Rep 2018; 8:11694. [PMID: 30076341 PMCID: PMC6076239 DOI: 10.1038/s41598-018-30044-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 07/20/2018] [Indexed: 02/06/2023] Open
Abstract
All protein-protein interaction (PPI) predictors require the determination of an operational decision threshold when differentiating positive PPIs from negatives. Historically, a single global threshold, typically optimized via cross-validation testing, is applied to all protein pairs. However, we here use data visualization techniques to show that no single decision threshold is suitable for all protein pairs, given the inherent diversity of protein interaction profiles. The recent development of high throughput PPI predictors has enabled the comprehensive scoring of all possible protein-protein pairs. This, in turn, has given rise to context, enabling us now to evaluate a PPI within the context of all possible predictions. Leveraging this context, we introduce a novel modeling framework called Reciprocal Perspective (RP), which estimates a localized threshold on a per-protein basis using several rank order metrics. By considering a putative PPI from the perspective of each of the proteins within the pair, RP rescores the predicted PPI and applies a cascaded Random Forest classifier leading to improvements in recall and precision. We here validate RP using two state-of-the-art PPI predictors, the Protein-protein Interaction Prediction Engine and the Scoring PRotein INTeractions methods, over five organisms: Homo sapiens, Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, and Mus musculus. Results demonstrate the application of a post hoc RP rescoring layer significantly improves classification (p < 0.001) in all cases over all organisms and this new rescoring approach can apply to any PPI prediction method.
Collapse
|
15
|
Göktepe YE, Kodaz H. Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2018.03.062] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
16
|
Tran L, Hamp T, Rost B. ProfPPIdb: Pairs of physical protein-protein interactions predicted for entire proteomes. PLoS One 2018; 13:e0199988. [PMID: 30020956 PMCID: PMC6051629 DOI: 10.1371/journal.pone.0199988] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Accepted: 06/17/2018] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION Protein-protein interactions (PPIs) play a key role in many cellular processes. Most annotations of PPIs mix experimental and computational data. The mix optimizes coverage, but obfuscates the annotation origin. Some resources excel at focusing on reliable experimental data. Here, we focused on new pairs of interacting proteins for several model organisms based solely on sequence-based prediction methods. RESULTS We extracted reliable experimental data about which proteins interact (binary) for eight diverse model organisms from public databases, namely from Escherichia coli, Schizosaccharomyces pombe, Plasmodium falciparum, Drosophila melanogaster, Caenorhabditis elegans, Mus musculus, Rattus norvegicus, Arabidopsis thaliana, and for the previously used Homo sapiens and Saccharomyces cerevisiae. Those data were the base to develop a PPI prediction method for each model organism. The method used evolutionary information through a profile-kernel Support Vector Machine (SVM). With the resulting eight models, we predicted all possible protein pairs in each organism and made the top predictions available through a web application. Almost all of the PPIs made available were predicted between proteins that have not been observed in any interaction, in particular for less well-studied organisms. Thus, our work complements existing resources and is particularly helpful for designing experiments because of its uniqueness. Experimental annotations and computational predictions are strongly influenced by the fact that some proteins have many partners and others few. To optimize machine learning, recent methods explicitly ignored such a network-structure and rely either on domain knowledge or sequence-only methods. Our approach is independent of domain-knowledge and leverages evolutionary information. The database interface representing our results is accessible from https://rostlab.org/services/ppipair/. The data can also be downloaded from https://figshare.com/collections/ProfPPI-DB/4141784.
Collapse
Affiliation(s)
- Linh Tran
- Imperial College London (ICL), Department of Computing, United Kingdom
- Technical University of Munich (TUM), Department of Informatics, Bioinformatics & Computational Biology - i12, Boltzmannstr, Germany
- * E-mail:
| | - Tobias Hamp
- Technical University of Munich (TUM), Department of Informatics, Bioinformatics & Computational Biology - i12, Boltzmannstr, Germany
| | - Burkhard Rost
- Technical University of Munich (TUM), Department of Informatics, Bioinformatics & Computational Biology - i12, Boltzmannstr, Germany
- Technical University of Munich (TUM), Institute for Advanced Study (TUM-IAS), Lichtenbergstr, Germany
| |
Collapse
|
17
|
Omidi K, Jessulat M, Hooshyar M, Burnside D, Schoenrock A, Kazmirchuk T, Hajikarimlou M, Daniel M, Moteshareie H, Bhojoo U, Sanders M, Ramotar D, Dehne F, Samanfar B, Babu M, Golshani A. Uncharacterized ORF HUR1 influences the efficiency of non-homologous end-joining repair in Saccharomyces cerevisiae. Gene 2018; 639:128-136. [DOI: 10.1016/j.gene.2017.10.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Revised: 06/25/2017] [Accepted: 10/02/2017] [Indexed: 01/05/2023]
|
18
|
Kotlyar M, Rossos AEM, Jurisica I. Prediction of Protein-Protein Interactions. ACTA ACUST UNITED AC 2017; 60:8.2.1-8.2.14. [PMID: 29220074 DOI: 10.1002/cpbi.38] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The authors provide an overview of physical protein-protein interaction prediction, covering the main strategies for predicting interactions, approaches for assessing predictions, and online resources for accessing predictions. This unit focuses on the main advancements in each of these areas over the last decade. The methods and resources that are presented here are not an exhaustive set, but characterize the current state of the field-highlighting key challenges and achievements. © 2017 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Max Kotlyar
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Andrea E M Rossos
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Igor Jurisica
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Departments of Medical Biophysics and Computer Science, University of Toronto, Ontario, Canada.,Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
| |
Collapse
|
19
|
Kazmirchuk T, Dick K, Burnside DJ, Barnes B, Moteshareie H, Hajikarimlou M, Omidi K, Ahmed D, Low A, Lettl C, Hooshyar M, Schoenrock A, Pitre S, Babu M, Cassol E, Samanfar B, Wong A, Dehne F, Green JR, Golshani A. Designing anti-Zika virus peptides derived from predicted human-Zika virus protein-protein interactions. Comput Biol Chem 2017; 71:180-187. [DOI: 10.1016/j.compbiolchem.2017.10.011] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Revised: 10/03/2017] [Accepted: 10/27/2017] [Indexed: 01/22/2023]
|
20
|
Shatnawi M. Protein-Protein Interaction Prediction: Recent Advances. 2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA) 2017:69-73. [DOI: 10.1109/dexa.2017.30] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
21
|
Gupta S, Verheggen K, Tavernier J, Martens L. Unbiased Protein Association Study on the Public Human Proteome Reveals Biological Connections between Co-Occurring Protein Pairs. J Proteome Res 2017; 16:2204-2212. [PMID: 28480704 PMCID: PMC5491052 DOI: 10.1021/acs.jproteome.6b01066] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
![]()
Mass-spectrometry-based, high-throughput
proteomics experiments
produce large amounts of data. While typically acquired to answer
specific biological questions, these data can also be reused in orthogonal
ways to reveal new biological knowledge. We here present a novel method
for such orthogonal data reuse of public proteomics data. Our method
elucidates biological relationships between proteins based on the
co-occurrence of these proteins across human experiments in the PRIDE
database. The majority of the significantly co-occurring protein pairs
that were detected by our method have been successfully mapped to
existing biological knowledge. The validity of our novel method is
substantiated by the extremely few pairs that can be mapped to existing
knowledge based on random associations between the same set of proteins.
Moreover, using literature searches and the STRING database, we were
able to derive meaningful biological associations for unannotated
protein pairs that were detected using our method, further illustrating
that as-yet unknown associations present highly interesting targets
for follow-up analysis.
Collapse
Affiliation(s)
- Surya Gupta
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , B-9000 Ghent, Belgium
| | - Kenneth Verheggen
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , B-9000 Ghent, Belgium
| | - Jan Tavernier
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , B-9000 Ghent, Belgium
| |
Collapse
|
22
|
You ZH, Zhou M, Luo X, Li S. Highly Efficient Framework for Predicting Interactions Between Proteins. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:731-743. [PMID: 28113829 DOI: 10.1109/tcyb.2016.2524994] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Protein-protein interactions (PPIs) play a central role in many biological processes. Although a large amount of human PPI data has been generated by high-throughput experimental techniques, they are very limited compared to the estimated 130 000 protein interactions in humans. Hence, automatic methods for human PPI-detection are highly desired. This work proposes a novel framework, i.e., Low-rank approximation-kernel Extreme Learning Machine (LELM), for detecting human PPI from a protein's primary sequences automatically. It has three main steps: 1) mapping each protein sequence into a matrix built on all kinds of adjacent amino acids; 2) applying the low-rank approximation model to the obtained matrix to solve its lowest rank representation, which reflects its true subspace structures; and 3) utilizing a powerful kernel extreme learning machine to predict the probability for PPI based on this lowest rank representation. Experimental results on a large-scale human PPI dataset demonstrate that the proposed LELM has significant advantages in accuracy and efficiency over the state-of-art approaches. Hence, this work establishes a new and effective way for the automatic detection of PPI.
Collapse
|
23
|
Schoenrock A, Burnside D, Moteshareie H, Pitre S, Hooshyar M, Green JR, Golshani A, Dehne F, Wong A. Evolution of protein-protein interaction networks in yeast. PLoS One 2017; 12:e0171920. [PMID: 28248977 PMCID: PMC5382968 DOI: 10.1371/journal.pone.0171920] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 01/28/2017] [Indexed: 01/04/2023] Open
Abstract
Interest in the evolution of protein-protein and genetic interaction networks has been rising in recent years, but the lack of large-scale high quality comparative datasets has acted as a barrier. Here, we carried out a comparative analysis of computationally predicted protein-protein interaction (PPI) networks from five closely related yeast species. We used the Protein-protein Interaction Prediction Engine (PIPE), which uses a database of known interactions to make sequence-based PPI predictions, to generate high quality predicted interactomes. Simulated proteomes and corresponding PPI networks were used to provide null expectations for the extent and nature of PPI network evolution. We found strong evidence for conservation of PPIs, with lower than expected levels of change in PPIs for about a quarter of the proteome. Furthermore, we found that changes in predicted PPI networks are poorly predicted by sequence divergence. Our analyses identified a number of functional classes experiencing fewer PPI changes than expected, suggestive of purifying selection on PPIs. Our results demonstrate the added benefit of considering predicted PPI networks when studying the evolution of closely related organisms.
Collapse
Affiliation(s)
| | | | | | - Sylvain Pitre
- School of Computer Science, Carleton University, Ottawa, Canada
| | | | - James R. Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada
| | | | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, Canada
| | - Alex Wong
- Department of Biology, Carleton University, Ottawa, Canada
| |
Collapse
|
24
|
Samanfar B, Molnar SJ, Charette M, Schoenrock A, Dehne F, Golshani A, Belzile F, Cober ER. Mapping and identification of a potential candidate gene for a novel maturity locus, E10, in soybean. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:377-390. [PMID: 27832313 DOI: 10.1007/s00122-016-2819-7] [Citation(s) in RCA: 85] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 10/27/2016] [Indexed: 05/04/2023]
Abstract
KEY MESSAGE E10 is a new maturity locus in soybean and FT4 is the predicted/potential functional gene underlying the locus. Flowering and maturity time traits play crucial roles in economic soybean production. Early maturity is critical for north and west expansion of soybean in Canada. To date, 11 genes/loci have been identified which control time to flowering and maturity; however, the molecular bases of almost half of them are not yet clear. We have identified a new maturity locus called "E10" located at the end of chromosome Gm08. The gene symbol E10e10 has been approved by the Soybean Genetics Committee. The e10e10 genotype results in 5-10 days earlier maturity than E10E10. A set of presumed E10E10 and e10e10 genotypes was used to identify contrasting SSR and SNP haplotypes. These haplotypes, and their association with maturity, were maintained through five backcross generations. A functional genomics approach using a predicted protein-protein interaction (PPI) approach (Protein-protein Interaction Prediction Engine, PIPE) was used to investigate approximately 75 genes located in the genomic region that SSR and SNP analyses identified as the location of the E10 locus. The PPI analysis identified FT4 as the most likely candidate gene underlying the E10 locus. Sequence analysis of the two FT4 alleles identified three SNPs, in the 5'UTR, 3'UTR and fourth exon in the coding region, which result in differential mRNA structures. Allele-specific markers were developed for this locus and are available for soybean breeders to efficiently develop earlier maturing cultivars using molecular marker assisted breeding.
Collapse
Affiliation(s)
- Bahram Samanfar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada
| | - Stephen J Molnar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada
| | - Martin Charette
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada
| | - Andrew Schoenrock
- School of Computer Science, Carleton University, Ottawa, ON, K1S 5B6, Canada
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, ON, K1S 5B6, Canada
| | - Ashkan Golshani
- Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON, K1S 5B6, Canada
| | - François Belzile
- Département de Phytologie and Institut de Biologie Intégrative et des Systèmes, Université Laval, Quebec City, QC, G1V 0A6, Canada
| | - Elroy R Cober
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada.
| |
Collapse
|
25
|
Sze-To A, Fung S, Lee ESA, Wong AK. Prediction of Protein–Protein Interaction via co-occurring Aligned Pattern Clusters. Methods 2016; 110:26-34. [DOI: 10.1016/j.ymeth.2016.07.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2016] [Revised: 06/25/2016] [Accepted: 07/26/2016] [Indexed: 10/21/2022] Open
|
26
|
Peace RJ, Biggar KK, Storey KB, Green JR. A framework for improving microRNA prediction in non-human genomes. Nucleic Acids Res 2015; 43:e138. [PMID: 26163062 PMCID: PMC4787757 DOI: 10.1093/nar/gkv698] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 06/28/2015] [Indexed: 11/12/2022] Open
Abstract
The prediction of novel pre-microRNA (miRNA) from genomic sequence has received considerable attention recently. However, the majority of studies have focused on the human genome. Previous studies have demonstrated that sensitivity (correctly detecting true miRNA) is sustained when human-trained methods are applied to other species, however they have failed to report the dramatic drop in specificity (the ability to correctly reject non-miRNA sequences) in non-human genomes. Considering the ratio of true miRNA sequences to pseudo-miRNA sequences is on the order of 1:1000, such low specificity prevents the application of most existing tools to non-human genomes, as the number of false positives overwhelms the true predictions. We here introduce a framework (SMIRP) for creating species-specific miRNA prediction systems, leveraging sequence conservation and phylogenetic distance information. Substantial improvements in specificity and precision are obtained for four non-human test species when our framework is applied to three different prediction systems representing two types of classifiers (support vector machine and Random Forest), based on three different feature sets, with both human-specific and taxon-wide training data. The SMIRP framework is potentially applicable to all miRNA prediction systems and we expect substantial improvement in precision and specificity, while sustaining sensitivity, independent of the machine learning technique chosen.
Collapse
Affiliation(s)
- Robert J Peace
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada
| | - Kyle K Biggar
- Institute of Biochemistry and Department of Biology, Carleton University, Ottawa, Canada Department of Biochemistry, University of Western Ontario, London, Canada
| | - Kenneth B Storey
- Institute of Biochemistry and Department of Biology, Carleton University, Ottawa, Canada
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada
| |
Collapse
|
27
|
You ZH, Chan KCC, Hu P. Predicting protein-protein interactions from primary protein sequences using a novel multi-scale local feature representation scheme and the random forest. PLoS One 2015; 10:e0125811. [PMID: 25946106 PMCID: PMC4422660 DOI: 10.1371/journal.pone.0125811] [Citation(s) in RCA: 98] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2014] [Accepted: 03/04/2015] [Indexed: 11/18/2022] Open
Abstract
The study of protein-protein interactions (PPIs) can be very important for the understanding of biological cellular functions. However, detecting PPIs in the laboratories are both time-consuming and expensive. For this reason, there has been much recent effort to develop techniques for computational prediction of PPIs as this can complement laboratory procedures and provide an inexpensive way of predicting the most likely set of interactions at the entire proteome scale. Although much progress has already been achieved in this direction, the problem is still far from being solved. More effective approaches are still required to overcome the limitations of the current ones. In this study, a novel Multi-scale Local Descriptor (MLD) feature representation scheme is proposed to extract features from a protein sequence. This scheme can capture multi-scale local information by varying the length of protein-sequence segments. Based on the MLD, an ensemble learning method, the Random Forest (RF) method, is used as classifier. The MLD feature representation scheme facilitates the mining of interaction information from multi-scale continuous amino acid segments, making it easier to capture multiple overlapping continuous binding patterns within a protein sequence. When the proposed method is tested with the PPI data of Saccharomyces cerevisiae, it achieves a prediction accuracy of 94.72% with 94.34% sensitivity at the precision of 98.91%. Extensive experiments are performed to compare our method with existing sequence-based method. Experimental results show that the performance of our predictor is better than several other state-of-the-art predictors also with the H. pylori dataset. The reason why such good results are achieved can largely be credited to the learning capabilities of the RF model and the novel MLD feature representation scheme. The experiment results show that the proposed approach can be very promising for predicting PPIs and can be a useful tool for future proteomic studies.
Collapse
Affiliation(s)
- Zhu-Hong You
- Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China; School of Electronics and Information Engineering, Tongji University, Shanghai, China
| | - Keith C C Chan
- Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China
| | - Pengwei Hu
- Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China
| |
Collapse
|
28
|
Hamp T, Rost B. Evolutionary profiles improve protein–protein interaction prediction from sequence. Bioinformatics 2015; 31:1945-50. [DOI: 10.1093/bioinformatics/btv077] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 02/02/2015] [Indexed: 11/14/2022] Open
|
29
|
Luo X, You Z, Zhou M, Li S, Leung H, Xia Y, Zhu Q. A highly efficient approach to protein interactome mapping based on collaborative filtering framework. Sci Rep 2015; 5:7702. [PMID: 25572661 PMCID: PMC4287731 DOI: 10.1038/srep07702] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 12/08/2014] [Indexed: 12/17/2022] Open
Abstract
The comprehensive mapping of protein-protein interactions (PPIs) is highly desired for one to gain deep insights into both fundamental cell biology processes and the pathology of diseases. Finely-set small-scale experiments are not only very expensive but also inefficient to identify numerous interactomes despite their high accuracy. High-throughput screening techniques enable efficient identification of PPIs; yet the desire to further extract useful knowledge from these data leads to the problem of binary interactome mapping. Network topology-based approaches prove to be highly efficient in addressing this problem; however, their performance deteriorates significantly on sparse putative PPI networks. Motivated by the success of collaborative filtering (CF)-based approaches to the problem of personalized-recommendation on large, sparse rating matrices, this work aims at implementing a highly efficient CF-based approach to binary interactome mapping. To achieve this, we first propose a CF framework for it. Under this framework, we model the given data into an interactome weight matrix, where the feature-vectors of involved proteins are extracted. With them, we design the rescaled cosine coefficient to model the inter-neighborhood similarity among involved proteins, for taking the mapping process. Experimental results on three large, sparse datasets demonstrate that the proposed approach outperforms several sophisticated topology-based approaches significantly.
Collapse
Affiliation(s)
- Xin Luo
- X. Luo, Y. Xia and Q. Zhu are with the College of Computer Science, Chongqing University, Chongqing, 400044 China
- X. Luo, Z. You, S. Li and H. Leung are with the Department of Computing, Hong Kong Polytechnic University, Hong Kong, HK 999077, China
| | - Zhuhong You
- X. Luo, Z. You, S. Li and H. Leung are with the Department of Computing, Hong Kong Polytechnic University, Hong Kong, HK 999077, China
| | - Mengchu Zhou
- M. Zhou is with the Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, NJ 07102, USA
| | - Shuai Li
- X. Luo, Z. You, S. Li and H. Leung are with the Department of Computing, Hong Kong Polytechnic University, Hong Kong, HK 999077, China
| | - Hareton Leung
- X. Luo, Z. You, S. Li and H. Leung are with the Department of Computing, Hong Kong Polytechnic University, Hong Kong, HK 999077, China
| | - Yunni Xia
- X. Luo, Y. Xia and Q. Zhu are with the College of Computer Science, Chongqing University, Chongqing, 400044 China
| | - Qingsheng Zhu
- X. Luo, Y. Xia and Q. Zhu are with the College of Computer Science, Chongqing University, Chongqing, 400044 China
| |
Collapse
|
30
|
Schoenrock A, Samanfar B, Pitre S, Hooshyar M, Jin K, Phillips CA, Wang H, Phanse S, Omidi K, Gui Y, Alamgir M, Wong A, Barrenäs F, Babu M, Benson M, Langston MA, Green JR, Dehne F, Golshani A. Efficient prediction of human protein-protein interactions at a global scale. BMC Bioinformatics 2014; 15:383. [PMID: 25492630 PMCID: PMC4272565 DOI: 10.1186/s12859-014-0383-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Accepted: 11/12/2014] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Our knowledge of global protein-protein interaction (PPI) networks in complex organisms such as humans is hindered by technical limitations of current methods. RESULTS On the basis of short co-occurring polypeptide regions, we developed a tool called MP-PIPE capable of predicting a global human PPI network within 3 months. With a recall of 23% at a precision of 82.1%, we predicted 172,132 putative PPIs. We demonstrate the usefulness of these predictions through a range of experiments. CONCLUSIONS The speed and accuracy associated with MP-PIPE can make this a potential tool to study individual human PPI networks (from genomic sequences alone) for personalized medicine.
Collapse
Affiliation(s)
| | | | - Sylvain Pitre
- School of Computer Science, Carleton University, Ottawa, Canada.
| | | | - Ke Jin
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada.
| | - Charles A Phillips
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA.
| | - Hui Wang
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Sadhna Phanse
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada.
| | - Katayoun Omidi
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Yuan Gui
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Md Alamgir
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Alex Wong
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Fredrik Barrenäs
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada.
| | - Mikael Benson
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA.
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada.
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, Canada.
| | | |
Collapse
|
31
|
Large-scale protein-protein interactions detection by integrating big biosensing data with computational model. BIOMED RESEARCH INTERNATIONAL 2014; 2014:598129. [PMID: 25215285 PMCID: PMC4151593 DOI: 10.1155/2014/598129] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Accepted: 07/24/2014] [Indexed: 01/12/2023]
Abstract
Protein-protein interactions are the basis of biological functions, and studying these interactions on a molecular level is of crucial importance for understanding the functionality of a living cell. During the past decade, biosensors have emerged as an important tool for the high-throughput identification of proteins and their interactions. However, the high-throughput experimental methods for identifying PPIs are both time-consuming and expensive. On the other hand, high-throughput PPI data are often associated with high false-positive and high false-negative rates. Targeting at these problems, we propose a method for PPI detection by integrating biosensor-based PPI data with a novel computational model. This method was developed based on the algorithm of extreme learning machine combined with a novel representation of protein sequence descriptor. When performed on the large-scale human protein interaction dataset, the proposed method achieved 84.8% prediction accuracy with 84.08% sensitivity at the specificity of 85.53%. We conducted more extensive experiments to compare the proposed method with the state-of-the-art techniques, support vector machine. The achieved results demonstrate that our approach is very promising for detecting new PPIs, and it can be a helpful supplement for biosensor-based PPI data detection.
Collapse
|
32
|
Murakami Y, Mizuguchi K. Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators. BMC Bioinformatics 2014; 15:213. [PMID: 24953126 PMCID: PMC4229973 DOI: 10.1186/1471-2105-15-213] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2014] [Accepted: 06/17/2014] [Indexed: 02/02/2023] Open
Abstract
Background Identification of protein-protein interactions (PPIs) is essential for a better understanding of biological processes, pathways and functions. However, experimental identification of the complete set of PPIs in a cell/organism (“an interactome”) is still a difficult task. To circumvent limitations of current high-throughput experimental techniques, it is necessary to develop high-performance computational methods for predicting PPIs. Results In this article, we propose a new computational method to predict interaction between a given pair of protein sequences using features derived from known homologous PPIs. The proposed method is capable of predicting interaction between two proteins (of unknown structure) using Averaged One-Dependence Estimators (AODE) and three features calculated for the protein pair: (a) sequence similarities to a known interacting protein pair (FSeq), (b) statistical propensities of domain pairs observed in interacting proteins (FDom) and (c) a sum of edge weights along the shortest path between homologous proteins in a PPI network (FNet). Feature vectors were defined to lie in a half-space of the symmetrical high-dimensional feature space to make them independent of the protein order. The predictability of the method was assessed by a 10-fold cross validation on a recently created human PPI dataset with randomly sampled negative data, and the best model achieved an Area Under the Curve of 0.79 (pAUC0.5% = 0.16). In addition, the AODE trained on all three features (named PSOPIA) showed better prediction performance on a separate independent data set than a recently reported homology-based method. Conclusions Our results suggest that FNet, a feature representing proximity in a known PPI network between two proteins that are homologous to a target protein pair, contributes to the prediction of whether the target proteins interact or not. PSOPIA will help identify novel PPIs and estimate complete PPI networks. The method proposed in this article is freely available on the web at http://mizuguchilab.org/PSOPIA.
Collapse
Affiliation(s)
- Yoichi Murakami
- Bioinformatics Project, National Institute of Biomedical Innovation, 7-6-8 Saito-Asagi, Ibaraki, Osaka 567-0085, Japan.
| | | |
Collapse
|
33
|
Omidi K, Hooshyar M, Jessulat M, Samanfar B, Sanders M, Burnside D, Pitre S, Schoenrock A, Xu J, Babu M, Golshani A. Phosphatase complex Pph3/Psy2 is involved in regulation of efficient non-homologous end-joining pathway in the yeast Saccharomyces cerevisiae. PLoS One 2014; 9:e87248. [PMID: 24498054 PMCID: PMC3909046 DOI: 10.1371/journal.pone.0087248] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 12/20/2013] [Indexed: 11/19/2022] Open
Abstract
One of the main mechanisms for double stranded DNA break (DSB) repair is through the non-homologous end-joining (NHEJ) pathway. Using plasmid and chromosomal repair assays, we showed that deletion mutant strains for interacting proteins Pph3p and Psy2p had reduced efficiencies in NHEJ. We further observed that this activity of Pph3p and Psy2p appeared linked to cell cycle Rad53p and Chk1p checkpoint proteins. Pph3/Psy2 is a phosphatase complex, which regulates recovery from the Rad53p DNA damage checkpoint. Overexpression of Chk1p checkpoint protein in a parallel pathway to Rad53p compensated for the deletion of PPH3 or PSY2 in a chromosomal repair assay. Double mutant strains Δpph3/Δchk1 and Δpsy2/Δchk1 showed additional reductions in the efficiency of plasmid repair, compared to both single deletions which is in agreement with the activity of Pph3p and Psy2p in a parallel pathway to Chk1p. Genetic interaction analyses also supported a role for Pph3p and Psy2p in DNA damage repair, the NHEJ pathway, as well as cell cycle progression. Collectively, we report that the activity of Pph3p and Psy2p further connects NHEJ repair to cell cycle progression.
Collapse
Affiliation(s)
- Katayoun Omidi
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Mohsen Hooshyar
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Matthew Jessulat
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada
| | - Bahram Samanfar
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Megan Sanders
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Daniel Burnside
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Sylvain Pitre
- Department of Computer Science, Carleton University, Ottawa, Ontario, Canada
| | - Andrew Schoenrock
- Department of Computer Science, Carleton University, Ottawa, Ontario, Canada
| | - Jianhua Xu
- College of Pharmaceutical Sciences, Zhejian University, Hangzhou, Zhejiang, China
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada
| | - Ashkan Golshani
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| |
Collapse
|