201
|
Schaefer MH, Yang JS, Serrano L, Kiel C. Protein conservation and variation suggest mechanisms of cell type-specific modulation of signaling pathways. PLoS Comput Biol 2014; 10:e1003659. [PMID: 24922536 PMCID: PMC4055412 DOI: 10.1371/journal.pcbi.1003659] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Accepted: 04/21/2014] [Indexed: 02/04/2023] Open
Abstract
Many proteins and signaling pathways are present in most cell types and tissues and yet perform specialized functions. To elucidate mechanisms by which these ubiquitous pathways are modulated, we overlaid information about cross-cell line protein abundance and variability, and evolutionary conservation onto functional pathway components and topological layers in the pathway hierarchy. We found that the input (receptors) and the output (transcription factors) layers evolve more rapidly than proteins in the intermediary transmission layer. In contrast, protein expression variability decreases from the input to the output layer. We observed that the differences in protein variability between the input and transmission layer can be attributed to both the network position and the tendency of variable proteins to physically interact with constitutively expressed proteins. Differences in protein expression variability and conservation are also accompanied by the tendency of conserved and constitutively expressed proteins to acquire somatic mutations, while germline mutations tend to occur in cell type-specific proteins. Thus, conserved core proteins in the transmission layer could perform a fundamental role in most cell types and are therefore less tolerant to germline mutations. In summary, we propose that the core signal transmission machinery is largely modulated by a variable input layer through physical protein interactions. We hypothesize that the bow-tie organization of cellular signaling on the level of protein abundance variability contributes to the specificity of the signal response in different cell types. Cell function is determined by highly organized networks of biological molecules. An important class of protein pathways maintains the transmission of signals from the cell membrane to the nucleus. These signaling pathways are reused for different purposes at an evolutionary scale and in different cell types of the same organism. However, it is largely unknown how this flexibility is achieved and how this flexibility is balanced with the high degree of evolutionary conservation of some signaling proteins and the need for robustness against intra- and extra-cellular perturbations.We show how functional roles of signaling proteins determine patterns of evolutionary conservation, protein abundance (the average over different human cell lines and its variability) and disease mutations. Projecting pathway annotations on protein-protein interaction (PPI) networks, a picture emerges in which PPIs between variable and less conserved receptors and stable and conserved proteins of the core signal transmission machinery largely modulate signaling activity in a tissue-specific manner. This has important implications for the distribution of disease mutations in signaling pathways, which need to be considered for the understanding of their effect.
Collapse
Affiliation(s)
- Martin H. Schaefer
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- * E-mail: (MHS); (LS); (CK)
| | - Jae-Seong Yang
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Luis Serrano
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
- * E-mail: (MHS); (LS); (CK)
| | - Christina Kiel
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- * E-mail: (MHS); (LS); (CK)
| |
Collapse
|
202
|
Huang Q, Chang J, Cheung MK, Nong W, Li L, Lee MT, Kwan HS. Human proteins with target sites of multiple post-translational modification types are more prone to be involved in disease. J Proteome Res 2014; 13:2735-48. [PMID: 24754740 DOI: 10.1021/pr401019d] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Many proteins can be modified by multiple types of post-translational modifications (Mtp-proteins). Although some post-translational modifications (PTMs) have recently been found to be associated with life-threatening diseases like cancers and neurodegenerative disorders, the underlying mechanisms remain enigmatic to date. In this study, we examined the relationship of human Mtp-proteins and disease and systematically characterized features of these proteins. Our results indicated that Mtp-proteins are significantly more inclined to participate in disease than proteins carrying no known PTM sites. Mtp-proteins were found significantly enriched in protein complexes, having more protein partners and preferred to act as hubs/superhubs in protein-protein interaction (PPI) networks. They possess a distinct functional focus, such as chromatin assembly or disassembly, and reside in biased, multiple subcellular localizations. Moreover, most Mtp-proteins harbor more intrinsically disordered regions than the others. Mtp-proteins carrying PTM types biased toward locating in the ordered regions were mainly related to protein-DNA complex assembly. Examination of the energetic effects of PTMs on the stability of PPI revealed that only a small fraction of single PTM events influence the binding energy of >2 kcal/mol, whereas the binding energy can change dramatically by combinations of multiple PTM types. Our work not only expands the understanding of Mtp-proteins but also discloses the potential ability of Mtp-proteins to act as key elements in disease development.
Collapse
Affiliation(s)
- Qianli Huang
- School of Life Sciences, The Chinese University of Hong Kong , Shatin, Hong Kong SAR 852000, China
| | | | | | | | | | | | | |
Collapse
|
203
|
Mina M, Guzzi PH. Improving the Robustness of Local Network Alignment: Design and Extensive Assessment of a Markov Clustering-Based Approach. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:561-572. [PMID: 26356023 DOI: 10.1109/tcbb.2014.2318707] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The analysis of protein behavior at the network level had been applied to elucidate the mechanisms of protein interaction that are similar in different species. Published network alignment algorithms proved to be able to recapitulate known conserved modules and protein complexes, and infer new conserved interactions confirmed by wet lab experiments. In the meantime, however, a plethora of continuously evolving protein-protein interaction (PPI) data sets have been developed, each featuring different levels of completeness and reliability. For instance, algorithms performance may vary significantly when changing the data set used in their assessment. Moreover, existing papers did not deeply investigate the robustness of alignment algorithms. For instance, some algorithms performances vary significantly when changing the data set used in their assessment. In this work, we design an extensive assessment of current algorithms discussing the robustness of the results on the basis of input networks. We also present AlignMCL, a local network alignment algorithm based on an improved model of alignment graph and Markov Clustering. AlignMCL performs better than other state-of-the-art local alignment algorithms over different updated data sets. In addition, AlignMCL features high levels of robustness, producing similar results regardless the selected data set.
Collapse
|
204
|
Butland SL, Sanders SS, Schmidt ME, Riechers SP, Lin DTS, Martin DDO, Vaid K, Graham RK, Singaraja RR, Wanker EE, Conibear E, Hayden MR. The palmitoyl acyltransferase HIP14 shares a high proportion of interactors with huntingtin: implications for a role in the pathogenesis of Huntington's disease. Hum Mol Genet 2014; 23:4142-60. [PMID: 24705354 DOI: 10.1093/hmg/ddu137] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
HIP14 is the most highly conserved of 23 human palmitoyl acyltransferases (PATs) that catalyze the post-translational addition of palmitate to proteins, including huntingtin (HTT). HIP14 is dysfunctional in the presence of mutant HTT (mHTT), the causative gene for Huntington disease (HD), and we hypothesize that reduced palmitoylation of HTT and other HIP14 substrates contributes to the pathogenesis of the disease. Here we describe the yeast two-hybrid (Y2H) interactors of HIP14 in the first comprehensive study of interactors of a mammalian PAT. Unexpectedly, we discovered a highly significant overlap between HIP14 interactors and 370 published interactors of HTT, 4-fold greater than for control proteins (P = 8 × 10(-5)). Nearly half of the 36 shared interactors are already implicated in HD, supporting a direct link between HIP14 and the disease. The HIP14 Y2H interaction set is significantly enriched for palmitoylated proteins that are candidate substrates. We confirmed that three of them, GPM6A, and the Sprouty domain-containing proteins SPRED1 and SPRED3, are indeed palmitoylated by HIP14; the first enzyme known to palmitoylate these proteins. These novel substrates functions might be affected by reduced palmitoylation in HD. We also show that the vesicular cargo adapter optineurin, an established HTT-binding protein, co-immunoprecipitates with HIP14 but is not palmitoylated. mHTT leads to mislocalization of optineurin and aberrant cargo trafficking. Therefore, it is possible that optineurin regulates trafficking of HIP14 to its substrates. Taken together, our data raise the possibility that defective palmitoylation by HIP14 might be an important mechanism that contributes to the pathogenesis of HD.
Collapse
Affiliation(s)
- Stefanie L Butland
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Shaun S Sanders
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Mandi E Schmidt
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Sean-Patrick Riechers
- Neuroproteomics, Max Delbrueck Center for Molecular Medicine, Berlin-Buch 13125, Germany
| | - David T S Lin
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Dale D O Martin
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Kuljeet Vaid
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Rona K Graham
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Roshni R Singaraja
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Erich E Wanker
- Neuroproteomics, Max Delbrueck Center for Molecular Medicine, Berlin-Buch 13125, Germany
| | - Elizabeth Conibear
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| | - Michael R Hayden
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, Child & Family Research Institute, University of British Columbia, Vancouver, BC, Canada V5Z 4H4
| |
Collapse
|
205
|
Hindumathi V, Kranthi T, Rao SB, Manimaran P. The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach. MOLECULAR BIOSYSTEMS 2014; 10:1450-60. [PMID: 24647578 DOI: 10.1039/c4mb00004h] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.
Collapse
Affiliation(s)
- V Hindumathi
- C R Rao Advanced Institute of Mathematics, Statistics and Computer Science, University of Hyderabad Campus, Prof. C R Rao Road, Gachibowli, Hyderabad - 500046, India.
| | | | | | | |
Collapse
|
206
|
Zhou X, Liu J. Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer. PLoS One 2014; 9:e92023. [PMID: 24637666 PMCID: PMC3956890 DOI: 10.1371/journal.pone.0092023] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2013] [Accepted: 02/19/2014] [Indexed: 12/19/2022] Open
Abstract
Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for phenotypic change.
Collapse
Affiliation(s)
| | - Juan Liu
- School of computer, Wuhan University, Wuhan, China
- * E-mail:
| |
Collapse
|
207
|
Kalantzaki K, Bei ES, Exarchos KP, Zervakis M, Garofalakis M, Fotiadis DI. Nonparametric network design and analysis of disease genes in oral cancer progression. IEEE J Biomed Health Inform 2014; 18:562-73. [PMID: 24608056 DOI: 10.1109/jbhi.2013.2274643] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Biological networks in living organisms can be seen as the ultimate means of understanding the underlying mechanisms in complex diseases, such as oral cancer. During the last decade, many algorithms based on high-throughput genomic data have been developed to unravel the complexity of gene network construction and their progression in time. However, the small size of samples compared to the number of observed genes makes the inference of the network structure quite challenging. In this study, we propose a framework for constructing and analyzing gene networks from sparse experimental temporal data and investigate its potential in oral cancer. We use two network models based on partial correlations and kernel density estimation, in order to capture the genetic interactions. Using this network construction framework on real clinical data of the tissue and blood at different time stages, we identified common disease-related structures that may decipher the association between disease state and biological processes in oral cancer. Our study emphasizes an altered MET (hepatocyte growth factor receptor) network during oral cancer progression. In addition, we demonstrate that the functional changes of gene interactions during oral cancer progression might be particularly useful for patient categorization at the time of diagnosis and/or at follow-up periods.
Collapse
|
208
|
Sobolev BN, Veselovsky AV, Poroikov VV. Prediction of protein post-translational modifications: main trends and methods. RUSSIAN CHEMICAL REVIEWS 2014. [DOI: 10.1070/rc2014v083n02abeh004377] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
209
|
Integrating systems biology sources illuminates drug action. Clin Pharmacol Ther 2014; 95:663-9. [PMID: 24577151 PMCID: PMC4029855 DOI: 10.1038/clpt.2014.51] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 02/18/2014] [Indexed: 12/26/2022]
Abstract
There are significant gaps in our understanding of the pathways by which drugs act. This incomplete knowledge limits our ability to use mechanistic molecular information rationally to repurpose drugs, understand their side effects, and predict their interactions with other drugs. Here we present DrugRouter: a novel method for generating drug-specific pathways of action by linking target genes, disease genes and pharmacogenes using gene interaction networks. We construct pathways for over a hundred drugs, and show that the genes included in our pathways (1) co-occur with the query drug in the literature, (2) significantly overlap or are adjacent to known drug-response pathways, and (3) are adjacent to genes that are hits in genome wide association studies assessing drug response. Finally, these computed pathways suggest novel drug repositioning opportunities (e.g., statins for follicular thyroid cancer), gene-side effect associations, and gene-drug interactions. Thus, DrugRouter generates hypotheses about drug actions using systems biology data.
Collapse
|
210
|
Gillis J, Ballouz S, Pavlidis P. Bias tradeoffs in the creation and analysis of protein-protein interaction networks. J Proteomics 2014; 100:44-54. [PMID: 24480284 DOI: 10.1016/j.jprot.2014.01.020] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Revised: 01/13/2014] [Accepted: 01/17/2014] [Indexed: 02/04/2023]
Abstract
UNLABELLED Networks constructed from aggregated protein-protein interaction data are commonplace in biology. But the studies these data are derived from were conducted with their own hypotheses and foci. Focusing on data from budding yeast present in BioGRID, we determine that many of the downstream signals present in network data are significantly impacted by biases in the original data. We determine the degree to which selection bias in favor of biologically interesting bait proteins goes down with study size, while we also find that promiscuity in prey contributes more substantially in larger studies. We analyze interaction studies over time with respect to data in the Gene Ontology and find that reproducibly observed interactions are less likely to favor multifunctional proteins. We find that strong alignment between co-expression and protein-protein interaction data occurs only for extreme co-expression values, and use this data to suggest candidates for targets likely to reveal novel biology in follow-up studies. BIOLOGICAL SIGNIFICANCE Protein-protein interaction data finds particularly heavy use in the interpretation of disease-causal variants. In principle, network data allows researchers to find novel commonalities among candidate genes. In this study, we detail several of the most salient biases contributing to aggregated protein-protein interaction databases. We find strong evidence for the role of selection and laboratory biases. Many of these effects contribute to the commonalities researchers find for disease genes. In order for characterization of disease genes and their interactions to not simply be an artifact of researcher preference, it is imperative to identify data biases explicitly. Based on this, we also suggest ways to move forward in producing candidates less influenced by prior knowledge. This article is part of a Special Issue entitled: Can Proteomics Fill the Gap Between Genomics and Phenotypes?
Collapse
Affiliation(s)
- Jesse Gillis
- Cold Spring Harbor Laboratory, Stanley Institute for Cognitive Genomics, 500 Sunnyside Boulevard, Woodbury, NY 11797, United States.
| | - Sara Ballouz
- Cold Spring Harbor Laboratory, Stanley Institute for Cognitive Genomics, 500 Sunnyside Boulevard, Woodbury, NY 11797, United States.
| | - Paul Pavlidis
- Department of Psychiatry and Centre for High-Throughput Biology, University of British Columbia, 2185 East Mall., Vancouver, BC V6T 1Z4, Canada.
| |
Collapse
|
211
|
Dapat C, Saito R, Suzuki H, Horigome T. Quantitative phosphoproteomic analysis of host responses in human lung epithelial (A549) cells during influenza virus infection. Virus Res 2014; 179:53-63. [DOI: 10.1016/j.virusres.2013.11.012] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2013] [Revised: 11/05/2013] [Accepted: 11/11/2013] [Indexed: 10/26/2022]
|
212
|
Abstract
Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/.
Collapse
|
213
|
Kalathur RKR, Pinto JP, Hernández-Prieto MA, Machado RSR, Almeida D, Chaurasia G, Futschik ME. UniHI 7: an enhanced database for retrieval and interactive analysis of human molecular interaction networks. Nucleic Acids Res 2013; 42:D408-14. [PMID: 24214987 PMCID: PMC3965034 DOI: 10.1093/nar/gkt1100] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Unified Human Interactome (UniHI) (http://www.unihi.org) is a database for retrieval, analysis and visualization of human molecular interaction networks. Its primary aim is to provide a comprehensive and easy-to-use platform for network-based investigations to a wide community of researchers in biology and medicine. Here, we describe a major update (version 7) of the database previously featured in NAR Database Issue. UniHI 7 currently includes almost 350 000 molecular interactions between genes, proteins and drugs, as well as numerous other types of data such as gene expression and functional annotation. Multiple options for interactive filtering and highlighting of proteins can be employed to obtain more reliable and specific network structures. Expression and other genomic data can be uploaded by the user to examine local network structures. Additional built-in tools enable ready identification of known drug targets, as well as of biological processes, phenotypes and pathways enriched with network proteins. A distinctive feature of UniHI 7 is its user-friendly interface designed to be utilized in an intuitive manner, enabling researchers less acquainted with network analysis to perform state-of-the-art network-based investigations.
Collapse
Affiliation(s)
- Ravi Kiran Reddy Kalathur
- Centre for Molecular and Structural Biomedicine, University of Algarve, Faro, Portugal and Institute for Theoretical Biology, Charité, Humboldt-University, Berlin, Germany
| | | | | | | | | | | | | |
Collapse
|
214
|
Acencio ML, Bovolenta LA, Camilo E, Lemke N. Prediction of oncogenic interactions and cancer-related signaling networks based on network topology. PLoS One 2013; 8:e77521. [PMID: 24204854 PMCID: PMC3808429 DOI: 10.1371/journal.pone.0077521] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 09/03/2013] [Indexed: 12/01/2022] Open
Abstract
Cancer has been increasingly recognized as a systems biology disease since many investigators have demonstrated that this malignant phenotype emerges from abnormal protein-protein, regulatory and metabolic interactions induced by simultaneous structural and regulatory changes in multiple genes and pathways. Therefore, the identification of oncogenic interactions and cancer-related signaling networks is crucial for better understanding cancer. As experimental techniques for determining such interactions and signaling networks are labor-intensive and time-consuming, the development of a computational approach capable to accomplish this task would be of great value. For this purpose, we present here a novel computational approach based on network topology and machine learning capable to predict oncogenic interactions and extract relevant cancer-related signaling subnetworks from an integrated network of human genes interactions (INHGI). This approach, called graph2sig, is twofold: first, it assigns oncogenic scores to all interactions in the INHGI and then these oncogenic scores are used as edge weights to extract oncogenic signaling subnetworks from INHGI. Regarding the prediction of oncogenic interactions, we showed that graph2sig is able to recover 89% of known oncogenic interactions with a precision of 77%. Moreover, the interactions that received high oncogenic scores are enriched in genes for which mutations have been causally implicated in cancer. We also demonstrated that graph2sig is potentially useful in extracting oncogenic signaling subnetworks: more than 80% of constructed subnetworks contain more than 50% of original interactions in their corresponding oncogenic linear pathways present in the KEGG PATHWAY database. In addition, the potential oncogenic signaling subnetworks discovered by graph2sig are supported by experimental evidence. Taken together, these results suggest that graph2sig can be a useful tool for investigators involved in cancer research interested in detecting signaling networks most prone to contribute with the emergence of malignant phenotype.
Collapse
Affiliation(s)
- Marcio Luis Acencio
- Department of Physics and Biophysics, Botucatu Biosciences Institute, UNESP – Univ Estadual Paulista, Botucatu, São Paulo, Brazil
- * E-mail:
| | - Luiz Augusto Bovolenta
- Department of Physics and Biophysics, Botucatu Biosciences Institute, UNESP – Univ Estadual Paulista, Botucatu, São Paulo, Brazil
| | - Esther Camilo
- Department of Physics and Biophysics, Botucatu Biosciences Institute, UNESP – Univ Estadual Paulista, Botucatu, São Paulo, Brazil
| | - Ney Lemke
- Department of Physics and Biophysics, Botucatu Biosciences Institute, UNESP – Univ Estadual Paulista, Botucatu, São Paulo, Brazil
| |
Collapse
|
215
|
Klapa MI, Tsafou K, Theodoridis E, Tsakalidis A, Moschonas NK. Reconstruction of the experimentally supported human protein interactome: what can we learn? BMC SYSTEMS BIOLOGY 2013; 7:96. [PMID: 24088582 PMCID: PMC4015887 DOI: 10.1186/1752-0509-7-96] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Accepted: 09/25/2013] [Indexed: 02/02/2023]
Abstract
BACKGROUND Understanding the topology and dynamics of the human protein-protein interaction (PPI) network will significantly contribute to biomedical research, therefore its systematic reconstruction is required. Several meta-databases integrate source PPI datasets, but the protein node sets of their networks vary depending on the PPI data combined. Due to this inherent heterogeneity, the way in which the human PPI network expands via multiple dataset integration has not been comprehensively analyzed. We aim at assembling the human interactome in a global structured way and exploring it to gain insights of biological relevance. RESULTS First, we defined the UniProtKB manually reviewed human "complete" proteome as the reference protein-node set and then we mined five major source PPI datasets for direct PPIs exclusively between the reference proteins. We updated the protein and publication identifiers and normalized all PPIs to the UniProt identifier level. The reconstructed interactome covers approximately 60% of the human proteome and has a scale-free structure. No apparent differentiating gene functional classification characteristics were identified for the unrepresented proteins. The source dataset integration augments the network mainly in PPIs. Polyubiquitin emerged as the highest-degree node, but the inclusion of most of its identified PPIs may be reconsidered. The high number (>300) of connections of the subsequent fifteen proteins correlates well with their essential biological role. According to the power-law network structure, the unrepresented proteins should mainly have up to four connections with equally poorly-connected interactors. CONCLUSIONS Reconstructing the human interactome based on the a priori definition of the protein nodes enabled us to identify the currently included part of the human "complete" proteome, and discuss the role of the proteins within the network topology with respect to their function. As the network expansion has to comply with the scale-free theory, we suggest that the core of the human interactome has essentially emerged. Thus, it could be employed in systems biology and biomedical research, despite the considerable number of currently unrepresented proteins. The latter are probably involved in specialized physiological conditions, justifying the scarcity of related PPI information, and their identification can assist in designing relevant functional experiments and targeted text mining algorithms.
Collapse
Affiliation(s)
- Maria I Klapa
- Department of General Biology, School of Medicine, University of Patras, Rio, Patras, Greece.
| | | | | | | | | |
Collapse
|
216
|
Ruiz Esparza-Garrido R, Velázquez-Flores MÁ, Diegopérez-Ramírez J, López-Aguilar E, Siordia-Reyes G, Hernández-Ortiz M, Martínez-Batallar AG, Encarnación-Guevara S, Salamanca-Gómez F, Arenas-Aranda DJ. A proteomic approach of pediatric astrocytomas: MiRNAs and network insight. J Proteomics 2013; 94:162-75. [PMID: 24060999 DOI: 10.1016/j.jprot.2013.09.009] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Revised: 07/23/2013] [Accepted: 09/12/2013] [Indexed: 12/13/2022]
Abstract
UNLABELLED Pediatric astrocytomas, a leading cause of death associated with cancer, are the most common primary central nervous system tumors found in children. Most studies of these tumors focus on adults, not on children. We examined the global protein and microRNA expression pattern by 2D SDS-PAGE, mass spectrometry (MALDI-TOF), and RT(2) miRNA PCR Array System. Proteomic studies revealed 49 proteins with changes on the expression. Interactome showed that vimentin, calreticulin, and 14-3-3 epsilon protein are hub proteins in these neoplasms. MicroRNA analyses demonstrated for the first time novel microRNAs involved in the astrocytoma biology. In conclusion, our results show that novel proteins and microRNAs with expression changes on pediatric astrocytoma could serve as biomarkers of tumor progression. BIOLOGICAL SIGNIFICANCE Astrocytomas are tumors that progress rapidly and that invade surrounding tissues. Although some drugs have been developed to treat these neoplasms, the mortality of patients is still very high. In this study, we describe for the first time, to our knowledge, some proteins and miRNAs associated with the biology of astrocytic tumors that could be postulated as possible diagnostic or prognostic biomarkers. Altogether, our results indicate that large-scale analyses allow making a fairly accurate prediction of different cellular processes altered in astrocytic tumors.
Collapse
Affiliation(s)
- Ruth Ruiz Esparza-Garrido
- Unidad de Investigación Médica en Genética Humana, Hospital de Pediatría, Centro Médico Nacional Siglo XXI, IMSS, 06720 México, D.F., Mexico; Posgrado en Ciencias Biológicas, Universidad Nacional Autónoma de México, Av. Universidad 3000, C.P. 04510 Coyoacán, D. F., Mexico
| | | | | | | | | | | | | | | | | | | |
Collapse
|
217
|
Wang X, Thijssen B, Yu H. Target essentiality and centrality characterize drug side effects. PLoS Comput Biol 2013; 9:e1003119. [PMID: 23874169 PMCID: PMC3708859 DOI: 10.1371/journal.pcbi.1003119] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2012] [Accepted: 05/15/2013] [Indexed: 01/19/2023] Open
Abstract
To investigate factors contributing to drug side effects, we systematically examine relationships between 4,199 side effects associated with 996 drugs and their 647 human protein targets. We find that it is the number of essential targets, not the number of total targets, that determines the side effects of corresponding drugs. Furthermore, within the context of a three-dimensional interaction network with atomic-resolution interaction interfaces, we find that drugs causing more side effects are also characterized by high degree and betweenness of their targets and highly shared interaction interfaces on these targets. Our findings suggest that both essentiality and centrality of a drug target are key factors contributing to side effects and should be taken into consideration in rational drug design. The ultimate goal of medical research is to develop effective treatments for disease with minimal side effects. Currently, about 20% of drug candidates failed at clinical trial phases II and III due to safety issues. Therefore, understanding the determining factors of drug side effects is of paramount importance to human health and the pharmaceutical industry. Here, we present the first systematic study to uncover key factors leading to drug side effects within the framework of the human protein interactome network. Our results show that it is the number of essential targets, not the number of total targets, of a drug that determines the occurrence of its side effects. Furthermore, we find that the centrality, both degree and betweenness, of the drug targets is also an important determining factor of drug side effects. Our findings will shed light on new factors to be incorporated into the drug development pipeline.
Collapse
Affiliation(s)
- Xiujuan Wang
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, United States of America
| | - Bram Thijssen
- Department of Bioinformatics, Maastricht University, Maastricht, The Netherlands
| | - Haiyuan Yu
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, United States of America
- * E-mail:
| |
Collapse
|
218
|
Munier S, Rolland T, Diot C, Jacob Y, Naffakh N. Exploration of binary virus-host interactions using an infectious protein complementation assay. Mol Cell Proteomics 2013; 12:2845-55. [PMID: 23816991 DOI: 10.1074/mcp.m113.028688] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
A precise mapping of pathogen-host interactions is essential for comprehensive understanding of the processes of infection and pathogenesis. The most frequently used techniques for interactomics are the yeast two-hybrid binary methodologies, which do not recapitulate the pathogen life cycle, and the tandem affinity purification mass spectrometry co-complex methodologies, which cannot distinguish direct from indirect interactions. New technologies are thus needed to improve the mapping of pathogen-host interactions. In the current study, we detected binary interactions between influenza A virus polymerase and host proteins during the course of an actual viral infection, using a new strategy based on trans-complementation of the Gluc1 and Gluc2 fragments of Gaussia princeps luciferase. Infectious recombinant influenza viruses that encode a Gluc1-tagged polymerase subunit were engineered to infect cultured cells transiently expressing a selected set of Gluc2-tagged cellular proteins involved in nucleocytoplasmic trafficking pathways. A random set and a literature-curated set of Gluc2-tagged cellular proteins were tested in parallel. Our assay allowed the sensitive and accurate recovery of previously described interactions, and it revealed 30% of positive, novel viral-host protein-protein interactions within the exploratory set. In addition to cellular proteins involved in the nuclear import pathway, components of the nuclear pore complex such as NUP62 and mRNA export factors such as NXF1, RMB15B, and DDX19B were identified for the first time as interactors of the viral polymerase. Gene silencing experiments further showed that NUP62 is required for efficient viral replication. Our findings give new insights regarding the subversion of host nucleocytoplasmic trafficking pathways by influenza A viruses. They also demonstrate the potential of our infectious protein complementation assay for high-throughput exploration of influenza virus interactomics in infected cells. With more infectious reverse genetics systems becoming available, this strategy should be widely applicable to numerous pathogens.
Collapse
Affiliation(s)
- Sandie Munier
- Institut Pasteur, Unité de Génétique Moléculaire des Virus à ARN, Département de Virologie, F-75015 Paris, France
| | | | | | | | | |
Collapse
|
219
|
Cohen LD, Zuchman R, Sorokina O, Müller A, Dieterich DC, Armstrong JD, Ziv T, Ziv NE. Metabolic turnover of synaptic proteins: kinetics, interdependencies and implications for synaptic maintenance. PLoS One 2013; 8:e63191. [PMID: 23658807 PMCID: PMC3642143 DOI: 10.1371/journal.pone.0063191] [Citation(s) in RCA: 157] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2013] [Accepted: 03/29/2013] [Indexed: 01/11/2023] Open
Abstract
Chemical synapses contain multitudes of proteins, which in common with all proteins, have finite lifetimes and therefore need to be continuously replaced. Given the huge numbers of synaptic connections typical neurons form, the demand to maintain the protein contents of these connections might be expected to place considerable metabolic demands on each neuron. Moreover, synaptic proteostasis might differ according to distance from global protein synthesis sites, the availability of distributed protein synthesis facilities, trafficking rates and synaptic protein dynamics. To date, the turnover kinetics of synaptic proteins have not been studied or analyzed systematically, and thus metabolic demands or the aforementioned relationships remain largely unknown. In the current study we used dynamic Stable Isotope Labeling with Amino acids in Cell culture (SILAC), mass spectrometry (MS), Fluorescent Non-Canonical Amino acid Tagging (FUNCAT), quantitative immunohistochemistry and bioinformatics to systematically measure the metabolic half-lives of hundreds of synaptic proteins, examine how these depend on their pre/postsynaptic affiliation or their association with particular molecular complexes, and assess the metabolic load of synaptic proteostasis. We found that nearly all synaptic proteins identified here exhibited half-lifetimes in the range of 2-5 days. Unexpectedly, metabolic turnover rates were not significantly different for presynaptic and postsynaptic proteins, or for proteins for which mRNAs are consistently found in dendrites. Some functionally or structurally related proteins exhibited very similar turnover rates, indicating that their biogenesis and degradation might be coupled, a possibility further supported by bioinformatics-based analyses. The relatively low turnover rates measured here (∼0.7% of synaptic protein content per hour) are in good agreement with imaging-based studies of synaptic protein trafficking, yet indicate that the metabolic load synaptic protein turnover places on individual neurons is very substantial.
Collapse
Affiliation(s)
- Laurie D. Cohen
- Technion Faculty of Medicine, Lorry Lokey Center for Life Sciences and Engineering, Technion, Haifa, Israel
- Network Biology Research Laboratories, Lorry Lokey Center for Life Sciences and Engineering, Technion, Haifa, Israel
| | - Rina Zuchman
- Smoler Proteomics Center, Faculty of Biology, Technion, Haifa, Israel
| | - Oksana Sorokina
- Institute for Adaptive and Neural Computation, University of Edinburgh, Edinburgh, United Kingdom
| | - Anke Müller
- Leibniz-Institute for Neurobiology, Magdeburg, Germany
- Institute for Pharmacology and Toxicology, Otto-von-Guericke University, Magdeburg, Germany
| | - Daniela C. Dieterich
- Leibniz-Institute for Neurobiology, Magdeburg, Germany
- Institute for Pharmacology and Toxicology, Otto-von-Guericke University, Magdeburg, Germany
| | - J. Douglas Armstrong
- Institute for Adaptive and Neural Computation, University of Edinburgh, Edinburgh, United Kingdom
| | - Tamar Ziv
- Smoler Proteomics Center, Faculty of Biology, Technion, Haifa, Israel
| | - Noam E. Ziv
- Technion Faculty of Medicine, Lorry Lokey Center for Life Sciences and Engineering, Technion, Haifa, Israel
- Network Biology Research Laboratories, Lorry Lokey Center for Life Sciences and Engineering, Technion, Haifa, Israel
- * E-mail:
| |
Collapse
|
220
|
Scifo E, Szwajda A, Dębski J, Uusi-Rauva K, Kesti T, Dadlez M, Gingras AC, Tyynelä J, Baumann MH, Jalanko A, Lalowski M. Drafting the CLN3 protein interactome in SH-SY5Y human neuroblastoma cells: a label-free quantitative proteomics approach. J Proteome Res 2013; 12:2101-15. [PMID: 23464991 DOI: 10.1021/pr301125k] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Neuronal ceroid lipofuscinoses (NCL) are the most common inherited progressive encephalopathies of childhood. One of the most prevalent forms of NCL, Juvenile neuronal ceroid lipofuscinosis (JNCL) or CLN3 disease (OMIM: 204200), is caused by mutations in the CLN3 gene on chromosome 16p12.1. Despite progress in the NCL field, the primary function of ceroid-lipofuscinosis neuronal protein 3 (CLN3) remains elusive. In this study, we aimed to clarify the role of human CLN3 in the brain by identifying CLN3-associated proteins using a Tandem Affinity Purification coupled to Mass Spectrometry (TAP-MS) strategy combined with Significance Analysis of Interactome (SAINT). Human SH-SY5Y-NTAP-CLN3 stable cells were used to isolate native protein complexes for subsequent TAP-MS. Bioinformatic analyses of isolated complexes yielded 58 CLN3 interacting partners (IP) including 42 novel CLN3 IP, as well as 16 CLN3 high confidence interacting partners (HCIP) previously identified in another high-throughput study by Behrends et al., 2010. Moreover, 31 IP of ceroid-lipofuscinosis neuronal protein 5 (CLN5) were identified (18 of which were in common with the CLN3 bait). Our findings support previously suggested involvement of CLN3 in transmembrane transport, lipid homeostasis and neuronal excitability, as well as link it to G-protein signaling and protein folding/sorting in the ER.
Collapse
Affiliation(s)
- Enzo Scifo
- Meilahti Clinical Proteomics Core Facility, Institute of Biomedicine/Anatomy, and Finnish Graduate School of Neuroscience, University of Helsinki, Helsinki, Finland.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
221
|
Bertolazzi P, Bock ME, Guerra C. On the functional and structural characterization of hubs in protein–protein interaction networks. Biotechnol Adv 2013; 31:274-86. [DOI: 10.1016/j.biotechadv.2012.12.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2012] [Revised: 11/13/2012] [Accepted: 12/01/2012] [Indexed: 01/07/2023]
|
222
|
Furlong LI. Human diseases through the lens of network biology. Trends Genet 2013; 29:150-9. [DOI: 10.1016/j.tig.2012.11.004] [Citation(s) in RCA: 150] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Revised: 10/24/2012] [Accepted: 11/09/2012] [Indexed: 12/13/2022]
|
223
|
Clancy T, Rødland EA, Nygard S, Hovig E. Predicting physical interactions between protein complexes. Mol Cell Proteomics 2013; 12:1723-34. [PMID: 23438732 DOI: 10.1074/mcp.o112.019828] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Protein complexes enact most biochemical functions in the cell. Dynamic interactions between protein complexes are frequent in many cellular processes. As they are often of a transient nature, they may be difficult to detect using current genome-wide screens. Here, we describe a method to computationally predict physical interactions between protein complexes, applied to both humans and yeast. We integrated manually curated protein complexes and physical protein interaction networks, and we designed a statistical method to identify pairs of protein complexes where the number of protein interactions between a complex pair is due to an actual physical interaction between the complexes. An evaluation against manually curated physical complex-complex interactions in yeast revealed that 50% of these interactions could be predicted in this manner. A community network analysis of the highest scoring pairs revealed a biologically sensible organization of physical complex-complex interactions in the cell. Such analyses of proteomes may serve as a guide to the discovery of novel functional cellular relationships.
Collapse
Affiliation(s)
- Trevor Clancy
- Department of Tumor Biology, Institute for Cancer Research, The Norwegian Radium Hospital and Oslo University Hospital, Oslo, Norway.
| | | | | | | |
Collapse
|
224
|
|
225
|
Gillis J, Pavlidis P. Assessing identity, redundancy and confounds in Gene Ontology annotations over time. ACTA ACUST UNITED AC 2013; 29:476-82. [PMID: 23297035 DOI: 10.1093/bioinformatics/bts727] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION The Gene Ontology (GO) is heavily used in systems biology, but the potential for redundancy, confounds with other data sources and problems with stability over time have been little explored. RESULTS We report that GO annotations are stable over short periods, with 3% of genes not being most semantically similar to themselves between monthly GO editions. However, we find that genes can alter their 'functional identity' over time, with 20% of genes not matching to themselves (by semantic similarity) after 2 years. We further find that annotation bias in GO, in which some genes are more characterized than others, has declined in yeast, but generally increased in humans. Finally, we discovered that many entries in protein interaction databases are owing to the same published reports that are used for GO annotations, with 66% of assessed GO groups exhibiting this confound. We provide a case study to illustrate how this information can be used in analyses of gene sets and networks. AVAILABILITY Data available at http://chibi.ubc.ca/assessGO.
Collapse
Affiliation(s)
- Jesse Gillis
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, 192B Genome Research Center, 500 Sunnyside Boulevard, Woodbury, NY 11797, USA
| | | |
Collapse
|
226
|
Adding protein context to the human protein-protein interaction network to reveal meaningful interactions. PLoS Comput Biol 2013; 9:e1002860. [PMID: 23300433 PMCID: PMC3536619 DOI: 10.1371/journal.pcbi.1002860] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Accepted: 11/09/2012] [Indexed: 01/31/2023] Open
Abstract
Interactions of proteins regulate signaling, catalysis, gene expression and many other cellular functions. Therefore, characterizing the entire human interactome is a key effort in current proteomics research. This challenge is complicated by the dynamic nature of protein-protein interactions (PPIs), which are conditional on the cellular context: both interacting proteins must be expressed in the same cell and localized in the same organelle to meet. Additionally, interactions underlie a delicate control of signaling pathways, e.g. by post-translational modifications of the protein partners - hence, many diseases are caused by the perturbation of these mechanisms. Despite the high degree of cell-state specificity of PPIs, many interactions are measured under artificial conditions (e.g. yeast cells are transfected with human genes in yeast two-hybrid assays) or even if detected in a physiological context, this information is missing from the common PPI databases. To overcome these problems, we developed a method that assigns context information to PPIs inferred from various attributes of the interacting proteins: gene expression, functional and disease annotations, and inferred pathways. We demonstrate that context consistency correlates with the experimental reliability of PPIs, which allows us to generate high-confidence tissue- and function-specific subnetworks. We illustrate how these context-filtered networks are enriched in bona fide pathways and disease proteins to prove the ability of context-filters to highlight meaningful interactions with respect to various biological questions. We use this approach to study the lung-specific pathways used by the influenza virus, pointing to IRAK1, BHLHE40 and TOLLIP as potential regulators of influenza virus pathogenicity, and to study the signalling pathways that play a role in Alzheimer's disease, identifying a pathway involving the altered phosphorylation of the Tau protein. Finally, we provide the annotated human PPI network via a web frontend that allows the construction of context-specific networks in several ways. Protein-protein-interactions (PPIs) participate in virtually all biological processes. However, the PPI map is not static but the pairs of proteins that interact depends on the type of cell, the subcellular localization and modifications of the participating proteins, among many other factors. Therefore, it is important to understand the specific conditions under which a PPI happens. Unfortunately, experimental methods often do not provide this information or, even worse, measure PPIs under artificial conditions not found in biological systems. We developed a method to infer this missing information from properties of the interacting proteins, such as in which cell types the proteins are found, which functions they fulfill and whether they are known to play a role in disease. We show that PPIs for which we can infer conditions under which they happen have a higher experimental reliability. Also, our inference agrees well with known pathways and disease proteins. Since diseases usually affect specific cell types, we study PPI networks of influenza proteins in lung tissues and of Alzheimer's disease proteins in neural tissues. In both cases, we can highlight interesting interactions potentially playing a role in disease progression.
Collapse
|
227
|
Suter B, Fontaine JF, Yildirimman R, Raskó T, Schaefer MH, Rasche A, Porras P, Vázquez-Álvarez BM, Russ J, Rau K, Foulle R, Zenkner M, Saar K, Herwig R, Andrade-Navarro MA, Wanker EE. Development and application of a DNA microarray-based yeast two-hybrid system. Nucleic Acids Res 2012; 41:1496-507. [PMID: 23275563 PMCID: PMC3561971 DOI: 10.1093/nar/gks1329] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
The yeast two-hybrid (Y2H) system is the most widely applied methodology for systematic protein–protein interaction (PPI) screening and the generation of comprehensive interaction networks. We developed a novel Y2H interaction screening procedure using DNA microarrays for high-throughput quantitative PPI detection. Applying a global pooling and selection scheme to a large collection of human open reading frames, proof-of-principle Y2H interaction screens were performed for the human neurodegenerative disease proteins huntingtin and ataxin-1. Using systematic controls for unspecific Y2H results and quantitative benchmarking, we identified and scored a large number of known and novel partner proteins for both huntingtin and ataxin-1. Moreover, we show that this parallelized screening procedure and the global inspection of Y2H interaction data are uniquely suited to define specific PPI patterns and their alteration by disease-causing mutations in huntingtin and ataxin-1. This approach takes advantage of the specificity and flexibility of DNA microarrays and of the existence of solid-related statistical methods for the analysis of DNA microarray data, and allows a quantitative approach toward interaction screens in human and in model organisms.
Collapse
Affiliation(s)
- Bernhard Suter
- Max Delbrueck Center for Molecular Medicine, Berlin 13125, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
228
|
Wright PC, Jaffe S, Noirel J, Zou X. Opportunities for protein interaction network-guided cellular engineering. IUBMB Life 2012; 65:17-27. [DOI: 10.1002/iub.1114] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2012] [Revised: 10/14/2012] [Accepted: 10/15/2012] [Indexed: 01/23/2023]
|
229
|
von Eichborn J, Dunkel M, Gohlke BO, Preissner SC, Hoffmann MF, Bauer JMJ, Armstrong JD, Schaefer MH, Andrade-Navarro MA, Le Novere N, Croning MDR, Grant SGN, van Nierop P, Smit AB, Preissner R. SynSysNet: integration of experimental data on synaptic protein-protein interactions with drug-target relations. Nucleic Acids Res 2012; 41:D834-40. [PMID: 23143269 PMCID: PMC3531074 DOI: 10.1093/nar/gks1040] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
We created SynSysNet, available online at http://bioinformatics.charite.de/synsysnet, to provide a platform that creates a comprehensive 4D network of synaptic interactions. Neuronal synapses are fundamental structures linking nerve cells in the brain and they are responsible for neuronal communication and information processing. These processes are dynamically regulated by a network of proteins. New developments in interaction proteomics and yeast two-hybrid methods allow unbiased detection of interactors. The consolidation of data from different resources and methods is important to understand the relation to human behaviour and disease and to identify new therapeutic approaches. To this end, we established SynSysNet from a set of ∼1000 synapse specific proteins, their structures and small-molecule interactions. For two-thirds of these, 3D structures are provided (from Protein Data Bank and homology modelling). Drug-target interactions for 750 approved drugs and 50 000 compounds, as well as 5000 experimentally validated protein–protein interactions, are included. The resulting interaction network and user-selected parts can be viewed interactively and exported in XGMML. Approximately 200 involved pathways can be explored regarding drug-target interactions. Homology-modelled structures are downloadable in Protein Data Bank format, and drugs are available as MOL-files. Protein–protein interactions and drug-target interactions can be viewed as networks; corresponding PubMed IDs or sources are given.
Collapse
Affiliation(s)
- Joachim von Eichborn
- Structural Bioinformatics Group, Institute of Physiology, Charité-University Medicine Berlin, Lindenberger Weg 80, 13125 Berlin, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
230
|
Kamburov A, Grossmann A, Herwig R, Stelzl U. Cluster-based assessment of protein-protein interaction confidence. BMC Bioinformatics 2012; 13:262. [PMID: 23050565 PMCID: PMC3532186 DOI: 10.1186/1471-2105-13-262] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2012] [Accepted: 08/16/2012] [Indexed: 11/10/2022] Open
Abstract
Background Protein-protein interaction networks are key to a systems-level understanding of cellular biology. However, interaction data can contain a considerable fraction of false positives. Several methods have been proposed to assess the confidence of individual interactions. Most of them require the integration of additional data like protein expression and interaction homology information. While being certainly useful, such additional data are not always available and may introduce additional bias and ambiguity. Results We propose a novel, network topology based interaction confidence assessment method called CAPPIC (cluster-based assessment of protein-protein interaction confidence). It exploits the network’s inherent modular architecture for assessing the confidence of individual interactions. Our method determines algorithmic parameters intrinsically and does not require any parameter input or reference sets for confidence scoring. Conclusions On the basis of five yeast and two human physical interactome maps inferred using different techniques, we show that CAPPIC reliably assesses interaction confidence and its performance compares well to other approaches that are also based on network topology. The confidence score correlates with the agreement in localization and biological process annotations of interacting proteins. Moreover, it corroborates experimental evidence of physical interactions. Our method is not limited to physical interactome maps as we exemplify with a large yeast genetic interaction network. An implementation of CAPPIC is available at
http://intscore.molgen.mpg.de.
Collapse
Affiliation(s)
- Atanas Kamburov
- Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, 14195 Berlin, Ihnestr, 63-73, Germany.
| | | | | | | |
Collapse
|
231
|
Kamburov A, Stelzl U, Herwig R. IntScore: a web tool for confidence scoring of biological interactions. Nucleic Acids Res 2012; 40:W140-6. [PMID: 22649056 PMCID: PMC3394291 DOI: 10.1093/nar/gks492] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Knowledge of all molecular interactions that potentially take place in the cell is a key for a detailed understanding of cellular processes. Currently available interaction data, such as protein–protein interaction maps, are known to contain false positives that inevitably diminish the accuracy of network-based inferences. Interaction confidence scoring is thus a crucial intermediate step after obtaining interaction data and before using it in an interaction network-based inference approach. It enables to weight individual interactions according to the likelihood that they actually take place in the cell, and can be used to filter out false positives. We describe a web tool called IntScore which calculates confidence scores for user-specified sets of interactions. IntScore provides six network topology- and annotation-based confidence scoring methods. It also enables the integration of scores calculated by the different methods into an aggregate score using machine learning approaches. IntScore is user-friendly and extensively documented. It is freely available at http://intscore.molgen.mpg.de.
Collapse
Affiliation(s)
- Atanas Kamburov
- Vertebrate Genomics Department and Otto-Warburg Laboratory, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany.
| | | | | |
Collapse
|
232
|
Schaefer MH, Wanker EE, Andrade-Navarro MA. Evolution and function of CAG/polyglutamine repeats in protein-protein interaction networks. Nucleic Acids Res 2012; 40:4273-87. [PMID: 22287626 PMCID: PMC3378862 DOI: 10.1093/nar/gks011] [Citation(s) in RCA: 151] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Expanded runs of consecutive trinucleotide CAG repeats encoding polyglutamine (polyQ) stretches are observed in the genes of a large number of patients with different genetic diseases such as Huntington's and several Ataxias. Protein aggregation, which is a key feature of most of these diseases, is thought to be triggered by these expanded polyQ sequences in disease-related proteins. However, polyQ tracts are a normal feature of many human proteins, suggesting that they have an important cellular function. To clarify the potential function of polyQ repeats in biological systems, we systematically analyzed available information stored in sequence and protein interaction databases. By integrating genomic, phylogenetic, protein interaction network and functional information, we obtained evidence that polyQ tracts in proteins stabilize protein interactions. This happens most likely through structural changes whereby the polyQ sequence extends a neighboring coiled-coil region to facilitate its interaction with a coiled-coil region in another protein. Alteration of this important biological function due to polyQ expansion results in gain of abnormal interactions, leading to pathological effects like protein aggregation. Our analyses suggest that research on polyQ proteins should shift focus from expanded polyQ proteins into the characterization of the influence of the wild-type polyQ on protein interactions.
Collapse
Affiliation(s)
- Martin H. Schaefer
- Computational Biology and Data Mining and Neuroproteomics, Max Delbrück Center for Molecular Medicine, Robert-Rössle-Strasse 10, 13125 Berlin, Germany
| | - Erich E. Wanker
- Computational Biology and Data Mining and Neuroproteomics, Max Delbrück Center for Molecular Medicine, Robert-Rössle-Strasse 10, 13125 Berlin, Germany
| | - Miguel A. Andrade-Navarro
- Computational Biology and Data Mining and Neuroproteomics, Max Delbrück Center for Molecular Medicine, Robert-Rössle-Strasse 10, 13125 Berlin, Germany
- *To whom correspondence should be addressed. Tel: +49 30 9406 4250; Fax: +49 30 9406 4240;
| |
Collapse
|