1
|
Fu A, Kazmirchuk TDD, Bradbury-Jost C, Golshani A, Othman M. Platelet-Type von Willebrand Disease: Complex Pathophysiology and Insights on Novel Therapeutic and Diagnostic Strategies. Semin Thromb Hemost 2024. [PMID: 39191406 DOI: 10.1055/s-0044-1789183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2024]
Abstract
von Willebrand disease (VWD) is the most common well-studied genetic bleeding disorder worldwide. Much less is known about platelet-type VWD (PT-VWD), a rare platelet function defect, and a "nonidentical" twin bleeding phenotype to type 2B VWD (2B-VWD). Rather than a defect in the von Willebrand factor (VWF) gene, PT-VWD is caused by a platelet GP1BA mutation leading to a hyperaffinity of the glycoprotein Ibα (GPIbα) platelet surface receptor for VWF, and thus increased platelet clearing and high-molecular-weight VWF multimer elimination. Nine GP1BA gene mutations are known. It is historically believed that this enhanced binding was enabled by the β-switch region of GPIbα adopting an extended β-hairpin form. Recent evidence suggests the pathological conformation that destabilizes the compact triangular form of the R-loop-the GPIbα protein's region for VWF binding. PT-VWD is often misdiagnosed as 2B-VWD, even the though distinction between the two is crucial for proper treatment, as the former requires platelet transfusions, while the latter requires VWF/FVIII concentrate administration. Nevertheless, these PT-VWD treatments remain unsatisfactory, owing to their high cost, low availability, risk of alloimmunity, and the need to carefully balance platelet administration. Antibodies such as 6B4 remain undependable as an alternative therapy due to their questionable efficacy and high costs for this purpose. On the other hand, synthetic peptide therapeutics developed with In-Silico Protein Synthesizer to disrupt the association between GPIbα and VWF show preliminary promise as a therapy based on in vitro experiments. Such peptides could serve as an effective diagnostic technology for discriminating between 2B-VWD and PT-VWD, or potentially all forms of VWD, based on their high specificity. This field is rapidly growing and the current review sheds light on the complex pathology and some novel potential therapeutic and diagnostic strategies.
Collapse
Affiliation(s)
- Anne Fu
- Department of Biomedical and Molecular Sciences, School of Medicine, Queen's University, Kingston, Ontario, Canada
| | - Thomas D D Kazmirchuk
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, Ontario, Canada
| | - Calvin Bradbury-Jost
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, Ontario, Canada
| | - Ashkan Golshani
- Department of Biology, and the Ottawa Institute of Systems Biology (OISB), Carleton University, Ottawa, Ontario, Canada
| | - Maha Othman
- Department of Biomedical and Molecular Sciences, School of Medicine, Queen's University, Kingston, Ontario, Canada
- School of Baccalaureate Nursing, St. Lawrence College, Kingston, Ontario, Canada
- Department of Clinical Pathology, Faculty of Medicine, Mansoura University, Mansoura City, Egypt
| |
Collapse
|
2
|
Jagadeesan SK, Potter T, Al-Gafari M, Hooshyar M, Hewapathirana CM, Takallou S, Hajikarimlou M, Burnside D, Samanfar B, Moteshareie H, Smith M, Golshani A. Discovery and identification of genes involved in DNA damage repair in yeast. Gene 2022; 831:146549. [PMID: 35569766 DOI: 10.1016/j.gene.2022.146549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 02/16/2022] [Accepted: 05/06/2022] [Indexed: 11/04/2022]
Abstract
DNA repair defects are common in tumour cells and can lead to misrepair of double-strand breaks (DSBs), posing a significant challenge to cellular integrity. The overall mechanisms of DSB have been known for decades. However, the list of the genes that affect the efficiency of DSB repair continues to grow. Additional factors that play a role in DSB repair pathways have yet to be identified. In this study, we present a computational approach to identify novel gene functions that are involved in DNA damage repair in Saccharomyces cerevisiae. Among the primary candidates, GAL7, YMR130W, and YHI9 were selected for further analysis since they had not previously been identified as being active in DNA repair pathways. Originally, GAL7 was linked to galactose metabolism. YHI9 and YMR130W encode proteins of unknown functions. Laboratory testing of deletion strains gal7Δ, ymr130wΔ, and yhi9Δ implicated all 3 genes in Homologous Recombination (HR) and/or Non-Homologous End Joining (NHEJ) repair pathways, and enhanced sensitivity to DNA damage-inducing drugs suggested involvement in the broader DNA damage repair machinery. A subsequent genetic interaction analysis revealed interconnections of these three genes, most strikingly through SIR2, SIR3 and SIR4 that are involved in chromatin regulation and DNA damage repair network.
Collapse
Affiliation(s)
- Sasi Kumar Jagadeesan
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Taylor Potter
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Mustafa Al-Gafari
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Mohsen Hooshyar
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | | | - Sarah Takallou
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Maryam Hajikarimlou
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Daniel Burnside
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Bahram Samanfar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre (ORDC), Ottawa, Ontario, Canada.
| | - Houman Moteshareie
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| | - Myron Smith
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada.
| | - Ashkan Golshani
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada; Department of Biology, Carleton University, Ottawa, Ontario, Canada.
| |
Collapse
|
3
|
McCafferty CL, Marcotte EM, Taylor DW. Simplified geometric representations of protein structures identify complementary interaction interfaces. Proteins 2021; 89:348-360. [PMID: 33140424 PMCID: PMC7855953 DOI: 10.1002/prot.26020] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 09/22/2020] [Accepted: 10/25/2020] [Indexed: 12/12/2022]
Abstract
Protein-protein interactions are critical to protein function, but three-dimensional (3D) arrangements of interacting proteins have proven hard to predict, even given the identities and 3D structures of the interacting partners. Specifically, identifying the relevant pairwise interaction surfaces remains difficult, often relying on shape complementarity with molecular docking while accounting for molecular motions to optimize rigid 3D translations and rotations. However, such approaches can be computationally expensive, and faster, less accurate approximations may prove useful for large-scale prediction and assembly of 3D structures of multi-protein complexes. We asked if a reduced representation of protein geometry retains enough information about molecular properties to predict pairwise protein interaction interfaces that are tolerant of limited structural rearrangements. Here, we describe a reduced representation of 3D protein accessible surfaces on which molecular properties such as charge, hydrophobicity, and evolutionary rate can be easily mapped, implemented in the MorphProt package. Pairs of surfaces are compared to rapidly assess partner-specific potential surface complementarity. On two available benchmarks of 185 overall known protein complexes, we observe predictions comparable to other structure-based tools at correctly identifying protein interaction surfaces. Furthermore, we examined the effect of molecular motion through normal mode simulation on a benchmark receptor-ligand pair and observed no marked loss of predictive accuracy for distortions of up to 6 Å Cα-RMSD. Thus, a shape reduction of protein surfaces retains considerable information about surface complementarity, offers enhanced speed of comparison relative to more complex geometric representations, and exhibits tolerance to conformational changes.
Collapse
Affiliation(s)
- Caitlyn L. McCafferty
- Department of Molecular BiosciencesUniversity of Texas at AustinAustinTexasUSA
- Center for Systems and Synthetic BiologyUniversity of Texas at AustinAustinTexasUSA
- Institute for Cellular and Molecular BiologyUniversity of Texas at AustinAustinTexasUSA
| | - Edward M. Marcotte
- Department of Molecular BiosciencesUniversity of Texas at AustinAustinTexasUSA
- Center for Systems and Synthetic BiologyUniversity of Texas at AustinAustinTexasUSA
- Institute for Cellular and Molecular BiologyUniversity of Texas at AustinAustinTexasUSA
| | - David W. Taylor
- Department of Molecular BiosciencesUniversity of Texas at AustinAustinTexasUSA
- Center for Systems and Synthetic BiologyUniversity of Texas at AustinAustinTexasUSA
- Institute for Cellular and Molecular BiologyUniversity of Texas at AustinAustinTexasUSA
- LIVESTRONG Cancer InstitutesDell Medical SchoolAustinTexasUSA
| |
Collapse
|
4
|
Hooshyar M, Jessulat M, Burnside D, Kluew A, Babu M, Golshani A. Deletion of yeast TPK1 reduces the efficiency of non-homologous end joining DNA repair. Biochem Biophys Res Commun 2020; 533:899-904. [DOI: 10.1016/j.bbrc.2020.09.083] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 09/20/2020] [Indexed: 12/12/2022]
|
5
|
DeCaprio J, Kohl TO. Tandem Immunoaffinity Purification Using Anti-FLAG and Anti-HA Antibodies. Cold Spring Harb Protoc 2019; 2019:2019/2/pdb.prot098657. [PMID: 30710027 DOI: 10.1101/pdb.prot098657] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The immunoaffinity purification of target proteins followed by the identification and characterization of associated proteins by mass spectrometry is a widely used technique. An immunoaffinity purification bears resemblance to a standard immunoprecipitation; however, the end product for mass spectrometric analysis in the femtomole (10-15) to attomole (10-18) range is required to be of exceptional purity. This high degree of sensitivity in detection renders it of extreme importance to eliminate most if not all of the nonspecific background proteins and can be achieved by performing a tandem affinity purification (TAP). In TAP, the cDNA of the target protein is engineered to contain at least two different epitope tags, and the target protein is extracted under nondenaturing conditions upon expression using an appropriate protein expression platform (CHO cells, HEK 293 cells, or yeast). The expressed protein is initially immunoprecipitated using an antibody against one epitope tag and is eluted in the presence of excess peptide by competition for antibody-binding sites, before being reimmunoprecipitated using an antibody that specifically recognizes the second epitope. These sequential immunoprecipitations significantly reduce the presence of associated nonspecific proteins. Numerous combinations of epitope tags have been applied for tandem affinity purification, and this protocol illustrates the use of tandem hemagglutinin (HA) and FLAG epitope tags. The first immunoprecipitation uses an anti-FLAG antibody followed by the elution in the presence of a competing FLAG peptide before the reimmunoprecipitation of the protein using an anti-HA antibody. Numerous high-quality antiepitope tag antibodies are commercially available from different antibody manufacturers.
Collapse
|
6
|
You ZH, Huang W, Zhang S, Huang YA, Yu CQ, Li LP. An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 16:809-817. [PMID: 30475726 DOI: 10.1109/tcbb.2018.2882423] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Protein-protein interactions (PPIs) perform a very important function in many cellular processes, including signal transduction, post-translational modifications, apoptosis, and cell growth. Deregulation of PPIs results in many diseases, including cancer and pernicious anemia. Although many high-throughput methods have been applied to generate a large amount of PPIs data, they are generally expensive, inefficient and labor-intensive. Hence, there is an urgent need for developing a computational method to accurately and rapidly detect PPIs. In this article, we proposed a highly efficient approach to predict PPIs by integrating a new protein sequence substitution matrix feature representation and ensemble weighted sparse representation model classifier. The proposed method is demonstrated on Saccharomyces cerevisiae dataset and achieved 99.26% prediction accuracy with 98.53% sensitivity at precision of 100%, which is shown to have much higher predictive accuracy than current state-of-the-art algorithms. Extensive experiments are performed with the benchmark data set from Human and Helicobacter pylori that the proposed method achieves outstanding better success rates than other existing approaches in this problem. Experiment results illustrate that our proposed method presents an economical approach for computational building of PPI networks, which can be a helpful supplementary method for future proteomics researches.
Collapse
|
7
|
Omidi K, Jessulat M, Hooshyar M, Burnside D, Schoenrock A, Kazmirchuk T, Hajikarimlou M, Daniel M, Moteshareie H, Bhojoo U, Sanders M, Ramotar D, Dehne F, Samanfar B, Babu M, Golshani A. Uncharacterized ORF HUR1 influences the efficiency of non-homologous end-joining repair in Saccharomyces cerevisiae. Gene 2018; 639:128-136. [DOI: 10.1016/j.gene.2017.10.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Revised: 06/25/2017] [Accepted: 10/02/2017] [Indexed: 01/05/2023]
|
8
|
Tian Z, Guo M, Wang C, Xing L, Wang L, Zhang Y. Constructing an integrated gene similarity network for the identification of disease genes. J Biomed Semantics 2017; 8:32. [PMID: 29297379 PMCID: PMC5763299 DOI: 10.1186/s13326-017-0141-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. RESULTS We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. CONCLUSIONS RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .
Collapse
Affiliation(s)
- Zhen Tian
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Maozu Guo
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Chunyu Wang
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - LinLin Xing
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Lei Wang
- Institute of Health Service and Medical Information Academy of Military Medical Sciences Beijing, Beijing, 100850 China
| | - Yin Zhang
- Institute of Health Service and Medical Information Academy of Military Medical Sciences Beijing, Beijing, 100850 China
| |
Collapse
|
9
|
Samanfar B, Molnar SJ, Charette M, Schoenrock A, Dehne F, Golshani A, Belzile F, Cober ER. Mapping and identification of a potential candidate gene for a novel maturity locus, E10, in soybean. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:377-390. [PMID: 27832313 DOI: 10.1007/s00122-016-2819-7] [Citation(s) in RCA: 85] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 10/27/2016] [Indexed: 05/04/2023]
Abstract
KEY MESSAGE E10 is a new maturity locus in soybean and FT4 is the predicted/potential functional gene underlying the locus. Flowering and maturity time traits play crucial roles in economic soybean production. Early maturity is critical for north and west expansion of soybean in Canada. To date, 11 genes/loci have been identified which control time to flowering and maturity; however, the molecular bases of almost half of them are not yet clear. We have identified a new maturity locus called "E10" located at the end of chromosome Gm08. The gene symbol E10e10 has been approved by the Soybean Genetics Committee. The e10e10 genotype results in 5-10 days earlier maturity than E10E10. A set of presumed E10E10 and e10e10 genotypes was used to identify contrasting SSR and SNP haplotypes. These haplotypes, and their association with maturity, were maintained through five backcross generations. A functional genomics approach using a predicted protein-protein interaction (PPI) approach (Protein-protein Interaction Prediction Engine, PIPE) was used to investigate approximately 75 genes located in the genomic region that SSR and SNP analyses identified as the location of the E10 locus. The PPI analysis identified FT4 as the most likely candidate gene underlying the E10 locus. Sequence analysis of the two FT4 alleles identified three SNPs, in the 5'UTR, 3'UTR and fourth exon in the coding region, which result in differential mRNA structures. Allele-specific markers were developed for this locus and are available for soybean breeders to efficiently develop earlier maturing cultivars using molecular marker assisted breeding.
Collapse
Affiliation(s)
- Bahram Samanfar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada
| | - Stephen J Molnar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada
| | - Martin Charette
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada
| | - Andrew Schoenrock
- School of Computer Science, Carleton University, Ottawa, ON, K1S 5B6, Canada
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, ON, K1S 5B6, Canada
| | - Ashkan Golshani
- Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON, K1S 5B6, Canada
| | - François Belzile
- Département de Phytologie and Institut de Biologie Intégrative et des Systèmes, Université Laval, Quebec City, QC, G1V 0A6, Canada
| | - Elroy R Cober
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, K1A 0C6, Canada.
| |
Collapse
|
10
|
Skrlj B, Kunej T. Computational identification of non-synonymous polymorphisms within regions corresponding to protein interaction sites. Comput Biol Med 2016; 79:30-35. [PMID: 27744178 DOI: 10.1016/j.compbiomed.2016.10.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Revised: 10/02/2016] [Accepted: 10/03/2016] [Indexed: 11/24/2022]
Abstract
BACKGROUND Protein-protein interactions (PPI) play an important role in function of all organisms and enable understanding of underlying metabolic processes. Computational predictions of PPIs are an important aspect in proteomics, as experimental methods may result in high degree of false positive results and are more expensive. Although there are many databases collecting predicted PPIs, exploration of genetics information underlying PPI interactions has not been investigated thoroughly. The aim of the present study was to identify genomic locations corresponding to regions involved in predicted PPIs and to collect non-synonymous polymorphisms (nsSNPs) located within those regions; which we termed PPI-SNPs. METHODS Predicted PPIs were obtained from PiSITE database (http://pisite.hgc.jp). Non-synonymous SNPs mapped on protein structural data (PDBs) were obtained from the UCSC server. Polymorphism locations on protein structures were mapped to predicted PPI regions. DAVID tool was used for pathway enrichment and gene cluster analysis (https://david.ncifcrf.gov/). RESULTS We collected 544 polymorphisms located within predicted PPI sites that map to 197 genes. We identified 9 SNPs, previously associated with diseases, but not yet associated with PPI sites. We also found examples in which polymorphisms located within predicted PPI regions are also occurring within previously experimentally validated PPIs and within experimentally determined functional domains. CONCLUSIONS Our study provides the first catalog of nsSNPs located within predicted PPIs. These prioritized SNPs present the basis for planning experimental validation of SNPs that cause gain or loss of PPIs. Our implementation is expandable, as datasets used are constantly updated.
Collapse
Affiliation(s)
- Blaz Skrlj
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Groblje 3, Slovenia.
| | - Tanja Kunej
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Groblje 3, Slovenia.
| |
Collapse
|
11
|
Garzón JI, Deng L, Murray D, Shapira S, Petrey D, Honig B. A computational interactome and functional annotation for the human proteome. eLife 2016; 5. [PMID: 27770567 PMCID: PMC5115866 DOI: 10.7554/elife.18715] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2016] [Accepted: 10/19/2016] [Indexed: 12/14/2022] Open
Abstract
We present a database, PrePPI (Predicting Protein-Protein Interactions), of more than 1.35 million predicted protein-protein interactions (PPIs). Of these at least 127,000 are expected to constitute direct physical interactions although the actual number may be much larger (~500,000). The current PrePPI, which contains predicted interactions for about 85% of the human proteome, is related to an earlier version but is based on additional sources of interaction evidence and is far larger in scope. The use of structural relationships allows PrePPI to infer numerous previously unreported interactions. PrePPI has been subjected to a series of validation tests including reproducing known interactions, recapitulating multi-protein complexes, analysis of disease associated SNPs, and identifying functional relationships between interacting proteins. We show, using Gene Set Enrichment Analysis (GSEA), that predicted interaction partners can be used to annotate a protein's function. We provide annotations for most human proteins, including many annotated as having unknown function.
Collapse
Affiliation(s)
- José Ignacio Garzón
- Center for Computational Biology and Bioinformatics, Department of Systems Biology, Columbia University, New York, United States
| | - Lei Deng
- Center for Computational Biology and Bioinformatics, Department of Systems Biology, Columbia University, New York, United States.,School of Software, Central South University, Changsha, China
| | - Diana Murray
- Center for Computational Biology and Bioinformatics, Department of Systems Biology, Columbia University, New York, United States
| | - Sagi Shapira
- Center for Computational Biology and Bioinformatics, Department of Systems Biology, Columbia University, New York, United States.,Department of Microbiology and Immunology, Columbia University, New York, United States
| | - Donald Petrey
- Center for Computational Biology and Bioinformatics, Department of Systems Biology, Columbia University, New York, United States.,Howard Hughes Medical Institute, Columbia University, New York, United States
| | - Barry Honig
- Center for Computational Biology and Bioinformatics, Department of Systems Biology, Columbia University, New York, United States.,Howard Hughes Medical Institute, Columbia University, New York, United States.,Department of Biochemistry and Molecular Biophysics, Columbia University, New York, United States.,Department of Medicine, Columbia University, New York, United States.,Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
| |
Collapse
|
12
|
Schoenrock A, Samanfar B, Pitre S, Hooshyar M, Jin K, Phillips CA, Wang H, Phanse S, Omidi K, Gui Y, Alamgir M, Wong A, Barrenäs F, Babu M, Benson M, Langston MA, Green JR, Dehne F, Golshani A. Efficient prediction of human protein-protein interactions at a global scale. BMC Bioinformatics 2014; 15:383. [PMID: 25492630 PMCID: PMC4272565 DOI: 10.1186/s12859-014-0383-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Accepted: 11/12/2014] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Our knowledge of global protein-protein interaction (PPI) networks in complex organisms such as humans is hindered by technical limitations of current methods. RESULTS On the basis of short co-occurring polypeptide regions, we developed a tool called MP-PIPE capable of predicting a global human PPI network within 3 months. With a recall of 23% at a precision of 82.1%, we predicted 172,132 putative PPIs. We demonstrate the usefulness of these predictions through a range of experiments. CONCLUSIONS The speed and accuracy associated with MP-PIPE can make this a potential tool to study individual human PPI networks (from genomic sequences alone) for personalized medicine.
Collapse
Affiliation(s)
| | | | - Sylvain Pitre
- School of Computer Science, Carleton University, Ottawa, Canada.
| | | | - Ke Jin
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada.
| | - Charles A Phillips
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA.
| | - Hui Wang
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Sadhna Phanse
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada.
| | - Katayoun Omidi
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Yuan Gui
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Md Alamgir
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Alex Wong
- Department of Biology, Carleton University, Ottawa, Canada.
| | - Fredrik Barrenäs
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada.
| | - Mikael Benson
- Department of Pediatrics, Gothenburg University, Gothenburg, Sweden. .,The Centre for Individualized Medication, Linköping University, Linköping, Sweden.
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee, USA.
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada.
| | - Frank Dehne
- School of Computer Science, Carleton University, Ottawa, Canada.
| | | |
Collapse
|
13
|
Grauffel C, Stote RH, Dejaegere A. Molecular dynamics for computational proteomics of methylated histone H3. Biochim Biophys Acta Gen Subj 2014; 1850:1026-1040. [PMID: 25240462 DOI: 10.1016/j.bbagen.2014.09.015] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Revised: 09/09/2014] [Accepted: 09/10/2014] [Indexed: 11/30/2022]
Abstract
BACKGROUND Post-translational modifications of histones, and in particular of their disordered N-terminal tails, play a major role in epigenetic regulation. The identification of proteins and proteic domains that specifically bind modified histones is therefore of paramount importance to understand the molecular mechanisms of epigenetics. METHODS We performed an energetic analysis using the MM/PBSA method in order to study known complexes between methylated histone H3 and effector domains of the PHD family. We then developed a simple molecular dynamics based predictive model based on our analysis. RESULTS We present a thorough validation of our procedure, followed by the computational predictions of new PHD domains specific for binding histone H3 methylated on lysine 4 (K4). CONCLUSIONS PHD domains recognize methylated K4 on histone H3 in the context of a linear interaction motif (LIM) formed by the first four amino acids of histone H3 as opposed to recognition of a single methylated site. PHD domains with different sequences find chemically equivalent solutions for stabilizing the histone LIM and these can be identified from energetic analysis. This analysis, in turn, allows for the identification of new PHD domains that bind methylated H3K4 using information that cannot be retrieved from sequence comparison alone. GENERAL SIGNIFICANCE Molecular dynamics simulations can be used to devise computational proteomics protocols that are both easy to implement and interpret, and that yield reliable predictions that compare favorably to and complement experimental proteomics methods. This article is part of a Special Issue entitled Recent developments of molecular dynamics.
Collapse
Affiliation(s)
- Cédric Grauffel
- Department of Integrative Structural Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Institut National de la Santé et de la Recherche Médicale (INSERM) U964, Centre National de la Recherche Scientifique (CNRS) UMR7104, Université de Strasbourg, 67404 Illkirch, France
| | - Roland H Stote
- Department of Integrative Structural Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Institut National de la Santé et de la Recherche Médicale (INSERM) U964, Centre National de la Recherche Scientifique (CNRS) UMR7104, Université de Strasbourg, 67404 Illkirch, France
| | - Annick Dejaegere
- Department of Integrative Structural Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Institut National de la Santé et de la Recherche Médicale (INSERM) U964, Centre National de la Recherche Scientifique (CNRS) UMR7104, Université de Strasbourg, 67404 Illkirch, France
| |
Collapse
|
14
|
Abstract
The past decade has seen a dramatic expansion in the number and range of techniques available to obtain genome-wide information and to analyze this information so as to infer both the functions of individual molecules and how they interact to modulate the behavior of biological systems. Here, we review these techniques, focusing on the construction of physical protein-protein interaction networks, and highlighting approaches that incorporate protein structure, which is becoming an increasingly important component of systems-level computational techniques. We also discuss how network analyses are being applied to enhance our basic understanding of biological systems and their disregulation, as well as how these networks are being used in drug development.
Collapse
Affiliation(s)
- Donald Petrey
- Center for Computational Biology and Bioinformatics, Department of Systems Biology
| | | |
Collapse
|
15
|
DeMille D, Bikman BT, Mathis AD, Prince JT, Mackay JT, Sowa SW, Hall TD, Grose JH. A comprehensive protein-protein interactome for yeast PAS kinase 1 reveals direct inhibition of respiration through the phosphorylation of Cbf1. Mol Biol Cell 2014; 25:2199-215. [PMID: 24850888 PMCID: PMC4091833 DOI: 10.1091/mbc.e13-10-0631] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
PAS kinase is a conserved sensory protein kinase required for glucose homeostasis. The interactome for yeast PAS kinase 1 (Psk1) is identified, revealing 93 binding partners. Evidence is provided for in vivo phosphorylation of Cbf1 and subsequent inhibition of respiration, supporting a role for Psk1 in partitioning glucose for cell growth. Per-Arnt-Sim (PAS) kinase is a sensory protein kinase required for glucose homeostasis in yeast, mice, and humans, yet little is known about the molecular mechanisms of its function. Using both yeast two-hybrid and copurification approaches, we identified the protein–protein interactome for yeast PAS kinase 1 (Psk1), revealing 93 novel putative protein binding partners. Several of the Psk1 binding partners expand the role of PAS kinase in glucose homeostasis, including new pathways involved in mitochondrial metabolism. In addition, the interactome suggests novel roles for PAS kinase in cell growth (gene/protein expression, replication/cell division, and protein modification and degradation), vacuole function, and stress tolerance. In vitro kinase studies using a subset of 25 of these binding partners identified Mot3, Zds1, Utr1, and Cbf1 as substrates. Further evidence is provided for the in vivo phosphorylation of Cbf1 at T211/T212 and for the subsequent inhibition of respiration. This respiratory role of PAS kinase is consistent with the reported hypermetabolism of PAS kinase–deficient mice, identifying a possible molecular mechanism and solidifying the evolutionary importance of PAS kinase in the regulation of glucose homeostasis.
Collapse
Affiliation(s)
- Desiree DeMille
- Department of Microbiology and Molecular Biology, Brigham Young University, Provo, UT 84602
| | - Benjamin T Bikman
- Department of Physiology and Developmental Biology, Brigham Young University, Provo, UT 84602
| | - Andrew D Mathis
- Department of Chemistry, Brigham Young University, Provo, UT 84602
| | - John T Prince
- Department of Chemistry, Brigham Young University, Provo, UT 84602
| | - Jordan T Mackay
- Department of Microbiology and Molecular Biology, Brigham Young University, Provo, UT 84602
| | - Steven W Sowa
- Department of Microbiology and Molecular Biology, Brigham Young University, Provo, UT 84602
| | - Tacie D Hall
- Department of Microbiology and Molecular Biology, Brigham Young University, Provo, UT 84602
| | - Julianne H Grose
- Department of Microbiology and Molecular Biology, Brigham Young University, Provo, UT 84602
| |
Collapse
|
16
|
Omidi K, Hooshyar M, Jessulat M, Samanfar B, Sanders M, Burnside D, Pitre S, Schoenrock A, Xu J, Babu M, Golshani A. Phosphatase complex Pph3/Psy2 is involved in regulation of efficient non-homologous end-joining pathway in the yeast Saccharomyces cerevisiae. PLoS One 2014; 9:e87248. [PMID: 24498054 PMCID: PMC3909046 DOI: 10.1371/journal.pone.0087248] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 12/20/2013] [Indexed: 11/19/2022] Open
Abstract
One of the main mechanisms for double stranded DNA break (DSB) repair is through the non-homologous end-joining (NHEJ) pathway. Using plasmid and chromosomal repair assays, we showed that deletion mutant strains for interacting proteins Pph3p and Psy2p had reduced efficiencies in NHEJ. We further observed that this activity of Pph3p and Psy2p appeared linked to cell cycle Rad53p and Chk1p checkpoint proteins. Pph3/Psy2 is a phosphatase complex, which regulates recovery from the Rad53p DNA damage checkpoint. Overexpression of Chk1p checkpoint protein in a parallel pathway to Rad53p compensated for the deletion of PPH3 or PSY2 in a chromosomal repair assay. Double mutant strains Δpph3/Δchk1 and Δpsy2/Δchk1 showed additional reductions in the efficiency of plasmid repair, compared to both single deletions which is in agreement with the activity of Pph3p and Psy2p in a parallel pathway to Chk1p. Genetic interaction analyses also supported a role for Pph3p and Psy2p in DNA damage repair, the NHEJ pathway, as well as cell cycle progression. Collectively, we report that the activity of Pph3p and Psy2p further connects NHEJ repair to cell cycle progression.
Collapse
Affiliation(s)
- Katayoun Omidi
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Mohsen Hooshyar
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Matthew Jessulat
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada
| | - Bahram Samanfar
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Megan Sanders
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Daniel Burnside
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| | - Sylvain Pitre
- Department of Computer Science, Carleton University, Ottawa, Ontario, Canada
| | - Andrew Schoenrock
- Department of Computer Science, Carleton University, Ottawa, Ontario, Canada
| | - Jianhua Xu
- College of Pharmaceutical Sciences, Zhejian University, Hangzhou, Zhejiang, China
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan, Canada
| | - Ashkan Golshani
- Department of Biology, Carleton University, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, Carleton University, Ottawa, Ontario, Canada
| |
Collapse
|
17
|
Casado-Vela J, Fuentes M, Franco-Zorrilla JM. Screening of Protein–Protein and Protein–DNA Interactions Using Microarrays. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2014; 95:231-81. [DOI: 10.1016/b978-0-12-800453-1.00008-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
18
|
Structure homology and interaction redundancy for discovering virus-host protein interactions. EMBO Rep 2013; 14:938-44. [PMID: 24008843 DOI: 10.1038/embor.2013.130] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2013] [Revised: 07/29/2013] [Accepted: 07/31/2013] [Indexed: 11/08/2022] Open
Abstract
Virus-host interactomes are instrumental to understand global perturbations of cellular functions induced by infection and discover new therapies. The construction of such interactomes is, however, technically challenging and time consuming. Here we describe an original method for the prediction of high-confidence interactions between viral and human proteins through a combination of structure and high-quality interactome data. Validation was performed for the NS1 protein of the influenza virus, which led to the identification of new host factors that control viral replication.
Collapse
|
19
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 512] [Impact Index Per Article: 42.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
20
|
Protein-Protein Interactions: Gene Acronym Redundancies and Current Limitations Precluding Automated Data Integration. Proteomes 2013; 1:3-24. [PMID: 28250396 PMCID: PMC5314489 DOI: 10.3390/proteomes1010003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Revised: 05/16/2013] [Accepted: 05/21/2013] [Indexed: 12/31/2022] Open
Abstract
Understanding protein interaction networks and their dynamic changes is a major challenge in modern biology. Currently, several experimental and in silico approaches allow the screening of protein interactors in a large-scale manner. Therefore, the bulk of information on protein interactions deposited in databases and peer-reviewed published literature is constantly growing. Multiple databases interfaced from user-friendly web tools recently emerged to facilitate the task of protein interaction data retrieval and data integration. Nevertheless, as we evidence in this report, despite the current efforts towards data integration, the quality of the information on protein interactions retrieved by in silico approaches is frequently incomplete and may even list false interactions. Here we point to some obstacles precluding confident data integration, with special emphasis on protein interactions, which include gene acronym redundancies and protein synonyms. Three human proteins (choline kinase, PPIase and uromodulin) and three different web-based data search engines focused on protein interaction data retrieval (PSICQUIC, DASMI and BIPS) were used to explain the potential occurrence of undesired errors that should be considered by researchers in the field. We demonstrate that, despite the recent initiatives towards data standardization, manual curation of protein interaction networks based on literature searches are still required to remove potential false positives. A three-step workflow consisting of: (i) data retrieval from multiple databases, (ii) peer-reviewed literature searches, and (iii) data curation and integration, is proposed as the best strategy to gather updated information on protein interactions. Finally, this strategy was applied to compile bona fide information on human DREAM protein interactome, which constitutes liable training datasets that can be used to improve computational predictions.
Collapse
|
21
|
Krokhotin A, Liwo A, Niemi AJ, Scheraga HA. Coexistence of phases in a protein heterodimer. J Chem Phys 2012; 137:035101. [PMID: 22830730 DOI: 10.1063/1.4734019] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
A heterodimer consisting of two or more different kinds of proteins can display an enormous number of distinct molecular architectures. The conformational entropy is an essential ingredient in the Helmholtz free energy and, consequently, these heterodimers can have a very complex phase structure. Here, it is proposed that there is a state of proteins, in which the different components of a heterodimer exist in different phases. For this purpose, the structures in the protein data bank (PDB) have been analyzed, with radius of gyration as the order parameter. Two major classes of heterodimers with their protein components coexisting in different phases have been identified. An example is the PDB structure 3DXC. This is a transcriptionally active dimer. One of the components is an isoform of the intra-cellular domain of the Alzheimer-disease related amyloid precursor protein (AICD), and the other is a nuclear multidomain adaptor protein in the Fe65 family. It is concluded from the radius of gyration that neither of the two components in this dimer is in its own collapsed phase, corresponding to a biologically active protein. The UNRES energy function has been utilized to confirm that, if the two components are separated from each other, each of them collapses. The results presented in this work show that heterodimers whose protein components coexist in different phases, can have intriguing physical properties with potentially important biological consequences.
Collapse
Affiliation(s)
- Andrey Krokhotin
- Department of Physics and Astronomy and Science for Life Laboratory, Uppsala University, P.O. Box 803, S-75108 Uppsala, Sweden.
| | | | | | | |
Collapse
|
22
|
Edberg A, Soeria-Atmadja D, Bergman Laurila J, Johansson F, Gustafsson MG, Hammerling U. Assessing Relative Bioactivity of Chemical Substances Using Quantitative Molecular Network Topology Analysis. J Chem Inf Model 2012; 52:1238-49. [DOI: 10.1021/ci200429f] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Anna Edberg
- Division of Food
Data, National Food Agency, SE-75126 Uppsala, Sweden
| | - Daniel Soeria-Atmadja
- Division of R&D Information, AstraZeneca Research and Development, SE-15185, Södertälje, Sweden
| | | | - Fredrik Johansson
- Division of Information
Technology,
National Food Agency, SE-75126 Uppsala, Sweden
| | - Mats G. Gustafsson
- Division of Cancer Pharmacology and Computational Medicine, Department of Medical Sciences, Uppsala University and Uppsala Academic Hospital, SE-75185 Uppsala, Sweden
| | - Ulf Hammerling
- Department of Risk Benefit Assessment,
National Food Agency, SE-75126 Uppsala, Sweden
| |
Collapse
|
23
|
Pitre S, Hooshyar M, Schoenrock A, Samanfar B, Jessulat M, Green JR, Dehne F, Golshani A. Short Co-occurring Polypeptide Regions Can Predict Global Protein Interaction Maps. Sci Rep 2012; 2:239. [PMID: 22355752 PMCID: PMC3269044 DOI: 10.1038/srep00239] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2011] [Accepted: 12/14/2011] [Indexed: 11/16/2022] Open
Abstract
A goal of the post-genomics era has been to elucidate a detailed global map of protein-protein interactions (PPIs) within a cell. Here, we show that the presence of co-occurring short polypeptide sequences between interacting protein partners appears to be conserved across different organisms. We present an algorithm to automatically generate PPI prediction method parameters for various organisms and illustrate that global PPIs can be predicted from previously reported PPIs within the same or a different organism using protein primary sequences. The PPI prediction code is further accelerated through the use of parallel multi-core programming, which improves its usability for large scale or proteome-wide PPI prediction. We predict and analyze hundreds of novel human PPIs, experimentally confirm protein functions and importantly predict the first genome-wide PPI maps for S. pombe (∼9,000 PPIs) and C. elegans (∼37,500 PPIs).
Collapse
|