Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Turner B, Razick S, Turinsky AL, Vlasblom J, Crowdy EK, Cho E, Morrison K, Donaldson IM, Wodak SJ. iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence. Database (Oxford) 2010;2010:baq023. [PMID: 20940177 PMCID: PMC2963317 DOI: 10.1093/database/baq023] [Citation(s) in RCA: 146] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

For:	Turner B, Razick S, Turinsky AL, Vlasblom J, Crowdy EK, Cho E, Morrison K, Donaldson IM, Wodak SJ. iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence. Database (Oxford) 2010;2010:baq023. [PMID: 20940177 PMCID: PMC2963317 DOI: 10.1093/database/baq023] [Citation(s) in RCA: 146] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Number

Cited by Other Article(s)

101

Quantifying protein interaction dynamics by SWATH mass spectrometry: application to the 14-3-3 system. Nat Methods 2013;10:1246-53. [PMID: 24162925 DOI: 10.1038/nmeth.2703] [Citation(s) in RCA: 244] [Impact Index Per Article: 22.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Accepted: 09/25/2013] [Indexed: 12/22/2022]

102

Jayaswal V, Schramm SJ, Mann GJ, Wilkins MR, Yang YH. VAN: an R package for identifying biologically perturbed networks via differential variability analysis. BMC Res Notes 2013;6:430. [PMID: 24156242 PMCID: PMC4015612 DOI: 10.1186/1756-0500-6-430] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 10/18/2013] [Indexed: 12/26/2022] Open

Abstract

BACKGROUND

Large-scale molecular interaction networks are dynamic in nature and are of special interest in the analysis of complex diseases, which are characterized by network-level perturbations rather than changes in individual genes/proteins. The methods developed for the identification of differentially expressed genes or gene sets are not suitable for network-level analyses. Consequently, bioinformatics approaches that enable a joint analysis of high-throughput transcriptomics datasets and large-scale molecular interaction networks for identifying perturbed networks are gaining popularity. Typically, these approaches require the sequential application of multiple bioinformatics techniques - ID mapping, network analysis, and network visualization. Here, we present the Variability Analysis in Networks (VAN) software package: a collection of R functions to streamline this bioinformatics analysis.

FINDINGS

VAN determines whether there are network-level perturbations across biological states of interest. It first identifies hubs (densely connected proteins/microRNAs) in a network and then uses them to extract network modules (comprising of a hub and all its interaction partners). The function identifySignificantHubs identifies dysregulated modules (i.e. modules with changes in expression correlation between a hub and its interaction partners) using a single expression and network dataset. The function summarizeHubData identifies dysregulated modules based on a meta-analysis of multiple expression and/or network datasets. VAN also converts protein identifiers present in a MITAB-formatted interaction network to gene identifiers (UniProt identifier to Entrez identifier or gene symbol using the function generatePpiMap) and generates microRNA-gene interaction networks using TargetScan and Microcosm databases (generateMicroRnaMap). The function obtainCancerInfo is used to identify hubs (corresponding to significantly perturbed modules) that are already causally associated with cancer(s) in the Cancer Gene Census database. Additionally, VAN supports the visualization of changes to network modules in R and Cytoscape (visualizeNetwork and obtainPairSubset, respectively). We demonstrate the utility of VAN using a gene expression data from metastatic melanoma and a protein-protein interaction network from the Human Protein Reference Database.

CONCLUSIONS

Our package provides a comprehensive and user-friendly platform for the integrative analysis of -omics data to identify disease-associated network modules. This bioinformatics approach, which is essentially focused on the question of explaining phenotype with a 'network type' and in particular, how regulation is changing among different states of interest, is relevant to many questions including those related to network perturbations across developmental timelines.

Collapse

103

The functional interactome landscape of the human histone deacetylase family. Mol Syst Biol 2013;9:672. [PMID: 23752268 PMCID: PMC3964310 DOI: 10.1038/msb.2013.26] [Citation(s) in RCA: 218] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Accepted: 04/29/2013] [Indexed: 12/22/2022] Open

Abstract

This study presents the first global protein interaction network for all 11 human HDACs in T cells and an integrative mass spectrometry approach for profiling relative interaction stability within isolated protein complexes.

T-cell lines stably expressing each of the human HDACs (1 - 11), C-terminally tagged with both EGFP and FLAG, were generated using retroviral transduction.

Affinity purification coupled to mass spectrometry-based proteomics (AP-MS) was used to build the first global protein interaction network for all eleven human HDACs in T cells.

An optimized label free AP-MS and computational workflow was developed for profiling relative interaction stability among isolated protein complexes.

HDAC11 is a member of the “survival of motor neuron” protein complex with a functional role in mRNA splicing.

Histone deacetylases (HDACs) are a diverse family of essential transcriptional regulatory enzymes, that function through the spatial and temporal recruitment of protein complexes. As the composition and regulation of HDAC complexes are only partially characterized, we built the first global protein interaction network for all 11 human HDACs in T cells. Integrating fluorescence microscopy, immunoaffinity purifications, quantitative mass spectrometry, and bioinformatics, we identified over 200 unreported interactions for both well-characterized and lesser-studied HDACs, a subset of which were validated by orthogonal approaches. We establish HDAC11 as a member of the survival of motor neuron complex and pinpoint a functional role in mRNA splicing. We designed a complementary label-free and metabolic-labeling mass spectrometry-based proteomics strategy for profiling interaction stability among different HDAC classes, revealing that HDAC1 interactions within chromatin-remodeling complexes are largely stable, while transcription factors preferentially exist in rapid equilibrium. Overall, this study represents a valuable resource for investigating HDAC functions in health and disease, encompassing emerging themes of HDAC regulation in cell cycle and RNA processing and a deeper functional understanding of HDAC complex stability.

Collapse

104

Wang X, Thijssen B, Yu H. Target essentiality and centrality characterize drug side effects. PLoS Comput Biol 2013;9:e1003119. [PMID: 23874169 PMCID: PMC3708859 DOI: 10.1371/journal.pcbi.1003119] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2012] [Accepted: 05/15/2013] [Indexed: 01/19/2023] Open

105

Schramm SJ, Li SS, Jayaswal V, Fung DCY, Campain AE, Pang CNI, Scolyer RA, Yang YH, Mann GJ, Wilkins MR. Disturbed protein-protein interaction networks in metastatic melanoma are associated with worse prognosis and increased functional mutation burden. Pigment Cell Melanoma Res 2013;26:708-22. [PMID: 23738911 DOI: 10.1111/pcmr.12126] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2013] [Accepted: 05/30/2013] [Indexed: 12/15/2022]

106

Schäffer AA. Digenic inheritance in medical genetics. J Med Genet 2013;50:641-52. [PMID: 23785127 PMCID: PMC3778050 DOI: 10.1136/jmedgenet-2013-101713] [Citation(s) in RCA: 139] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

107

Das J, Vo TV, Wei X, Mellor JC, Tong V, Degatano AG, Wang X, Wang L, Cordero NA, Kruer-Zerhusen N, Matsuyama A, Pleiss JA, Lipkin SM, Yoshida M, Roth FP, Yu H. Cross-species protein interactome mapping reveals species-specific wiring of stress response pathways. Sci Signal 2013;6:ra38. [PMID: 23695164 DOI: 10.1126/scisignal.2003350] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Affiliation(s)

Jishnu Das Department of Biological Statistics and Computational Biology Cornell University, Ithaca, NY 14853, USA.,Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA
Tommy V Vo Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA.,Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
Xiaomu Wei Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA.,Department of Medicine, Weill Cornell College of Medicine, New York, NY 10021, USA
Joseph C Mellor Donnelly Centre, University of Toronto, Toronto, ON M5S-3E1, Canada
Virginia Tong Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA
Andrew G Degatano Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA
Xiujuan Wang Department of Biological Statistics and Computational Biology Cornell University, Ithaca, NY 14853, USA.,Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA
Lihua Wang Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA
Nicolas A Cordero Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA
Nathan Kruer-Zerhusen Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA.,Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
Akihisa Matsuyama Chemical Genetics Laboratory, RIKEN Advanced Science Institute, Wako, Saitama 351-0198, Japan.,CREST Research Project, JST, Kawaguchi, Saitama 332-0012, Japan
Jeffrey A Pleiss Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
Steven M Lipkin Department of Medicine, Weill Cornell College of Medicine, New York, NY 10021, USA
Minoru Yoshida Chemical Genetics Laboratory, RIKEN Advanced Science Institute, Wako, Saitama 351-0198, Japan.,CREST Research Project, JST, Kawaguchi, Saitama 332-0012, Japan.,Department of Biotechnology, Graduate School of Agriculture and Life Sciences, University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan
Frederick P Roth Donnelly Centre, University of Toronto, Toronto, ON M5S-3E1, Canada.,Departments of Molecular Genetics and Computer Science, University of Toronto, Toronto, ON M5S-3E1, Canada.,Center for Cancer Systems Biology, Dana-Farber Cancer Institute, Boston, MA 02115.,Harvard Medical School, Boston, MA 02115.,Samuel Lunenfeld Research Institute, Mt. Sinai Hospital, Toronto, ON M5G-1X5, Canada.,Genetic Networks Program, Canadian Institute for Advanced Research, Toronto, ON M5G-1Z8, Canada
Haiyuan Yu Department of Biological Statistics and Computational Biology Cornell University, Ithaca, NY 14853, USA.,Weill Institute for Cell and Molecular Biology Cornell University, Ithaca, NY 14853, USA

Collapse

108

Tripathi LP, Kambara H, Chen YA, Nishimura Y, Moriishi K, Okamoto T, Morita E, Abe T, Mori Y, Matsuura Y, Mizuguchi K. Understanding the Biological Context of NS5A–Host Interactions in HCV Infection: A Network-Based Approach. J Proteome Res 2013;12:2537-51. [DOI: 10.1021/pr3011217] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

109

Varjosalo M, Keskitalo S, Van Drogen A, Nurkkala H, Vichalkovski A, Aebersold R, Gstaiger M. The protein interaction landscape of the human CMGC kinase group. Cell Rep 2013;3:1306-20. [PMID: 23602568 DOI: 10.1016/j.celrep.2013.03.027] [Citation(s) in RCA: 151] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2012] [Revised: 03/01/2013] [Accepted: 03/18/2013] [Indexed: 12/24/2022] Open

110

Meyer MJ, Das J, Wang X, Yu H. INstruct: a database of high-quality 3D structurally resolved protein interactome networks. ACTA ACUST UNITED AC 2013;29:1577-9. [PMID: 23599502 DOI: 10.1093/bioinformatics/btt181] [Citation(s) in RCA: 102] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

111

Ferreira RM, Rybarczyk-Filho JL, Dalmolin RJS, Castro MAA, Moreira JCF, Brunnet LG, de Almeida RMC. Preferential duplication of intermodular hub genes: an evolutionary signature in eukaryotes genome networks. PLoS One 2013;8:e56579. [PMID: 23468868 PMCID: PMC3582557 DOI: 10.1371/journal.pone.0056579] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2012] [Accepted: 01/14/2013] [Indexed: 12/31/2022] Open

112

Clancy T, Rødland EA, Nygard S, Hovig E. Predicting physical interactions between protein complexes. Mol Cell Proteomics 2013;12:1723-34. [PMID: 23438732 DOI: 10.1074/mcp.o112.019828] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open

113

Li C, Liakata M, Rebholz-Schuhmann D. Biological network extraction from scientific literature: state of the art and challenges. Brief Bioinform 2013;15:856-77. [PMID: 23434632 DOI: 10.1093/bib/bbt006] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

114

Blandin G, Marchand S, Charton K, Danièle N, Gicquel E, Boucheteil JB, Bentaib A, Barrault L, Stockholm D, Bartoli M, Richard I. A human skeletal muscle interactome centered on proteins involved in muscular dystrophies: LGMD interactome. Skelet Muscle 2013;3:3. [PMID: 23414517 PMCID: PMC3610214 DOI: 10.1186/2044-5040-3-3] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Accepted: 02/07/2013] [Indexed: 02/01/2023] Open

115

A survey of protein interaction data and multigenic inherited disorders. BMC Bioinformatics 2013;14:47. [PMID: 23398688 PMCID: PMC3598893 DOI: 10.1186/1471-2105-14-47] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2012] [Accepted: 02/05/2013] [Indexed: 11/15/2022] Open

116

Predicting PDZ domain mediated protein interactions from structure. BMC Bioinformatics 2013;14:27. [PMID: 23336252 PMCID: PMC3602153 DOI: 10.1186/1471-2105-14-27] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2012] [Accepted: 12/19/2012] [Indexed: 12/03/2022] Open

Abstract

Background

PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors.

Results

We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling.

Conclusions

We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at http://webservice.baderlab.org/domains/POW.

Collapse

117

Naegle KM, White FM, Lauffenburger DA, Yaffe MB. Robust co-regulation of tyrosine phosphorylation sites on proteins reveals novel protein interactions. MOLECULAR BIOSYSTEMS 2013;8:2771-82. [PMID: 22851037 DOI: 10.1039/c2mb25200g] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Abstract

Cell signaling networks propagate information from extracellular cues via dynamic modulation of protein-protein interactions in a context-dependent manner. Networks based on receptor tyrosine kinases (RTKs), for example, phosphorylate intracellular proteins in response to extracellular ligands, resulting in dynamic protein-protein interactions that drive phenotypic changes. Most commonly used methods for discovering these protein-protein interactions, however, are optimized for detecting stable, longer-lived complexes, rather than the type of transient interactions that are essential components of dynamic signaling networks such as those mediated by RTKs. Substrate phosphorylation downstream of RTK activation modifies substrate activity and induces phospho-specific binding interactions, resulting in the formation of large transient macromolecular signaling complexes. Since protein complex formation should follow the trajectory of events that drive it, we reasoned that mining phosphoproteomic datasets for highly similar dynamic behavior of measured phosphorylation sites on different proteins could be used to predict novel, transient protein-protein interactions that had not been previously identified. We applied this method to explore signaling events downstream of EGFR stimulation. Our computational analysis of robustly co-regulated phosphorylation sites, based on multiple clustering analysis of quantitative time-resolved mass-spectrometry phosphoproteomic data, not only identified known sitewise-specific recruitment of proteins to EGFR, but also predicted novel, a priori interactions. A particularly intriguing prediction of EGFR interaction with the cytoskeleton-associated protein PDLIM1 was verified within cells using co-immunoprecipitation and in situ proximity ligation assays. Our approach thus offers a new way to discover protein-protein interactions in a dynamic context- and phosphorylation site-specific manner.

Collapse

118

Xin X, Gfeller D, Cheng J, Tonikian R, Sun L, Guo A, Lopez L, Pavlenco A, Akintobi A, Zhang Y, Rual JF, Currell B, Seshagiri S, Hao T, Yang X, Shen YA, Salehi-Ashtiani K, Li J, Cheng AT, Bouamalay D, Lugari A, Hill DE, Grimes ML, Drubin DG, Grant BD, Vidal M, Boone C, Sidhu SS, Bader GD. SH3 interactome conserves general function over specific form. Mol Syst Biol 2013;9:652. [PMID: 23549480 PMCID: PMC3658277 DOI: 10.1038/msb.2013.9] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Accepted: 02/20/2013] [Indexed: 12/20/2022] Open

Affiliation(s)

Xiaofeng Xin The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
David Gfeller The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
Jackie Cheng Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
Raffi Tonikian The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
Lin Sun Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA
Ailan Guo Cell Signaling Technology, Danvers, MA, USA
Lianet Lopez The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
Alevtina Pavlenco The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
Adenrele Akintobi Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA
Yingnan Zhang Department of Early Discovery Biochemistry, Genentech, South San Francisco, CA, USA
Jean-François Rual Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Bridget Currell Department of Molecular Biology, Genentech, South San Francisco, CA, USA
Somasekar Seshagiri Department of Molecular Biology, Genentech, South San Francisco, CA, USA
Tong Hao Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Xinping Yang Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Yun A Shen Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Kourosh Salehi-Ashtiani Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Jingjing Li The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
Aaron T Cheng Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
Dryden Bouamalay Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
Adrien Lugari IMR Laboratory, UPR 3243, Institut de Microbiologie de la Méditérannée, CNRS and Aix-Marseille Université, Marseille Cedex 20, France
David E Hill Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Mark L Grimes Division of Biological Sciences, Center for Structural and Functional Neuroscience, The University of Montana, Missoula, MT, USA
David G Drubin Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
Barth D Grant Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA
Marc Vidal Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA Department of Genetics, Harvard Medical School, Boston, MA, USA
Charles Boone The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
Sachdev S Sidhu The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
Gary D Bader The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada Department of Computer Science, University of Toronto, Toronto, Ontario, Canada

Collapse

119

Choi H, Liu G, Mellacheruvu D, Tyers M, Gingras AC, Nesvizhskii AI. Analyzing protein-protein interactions from affinity purification-mass spectrometry data with SAINT. ACTA ACUST UNITED AC 2012;Chapter 8:8.15.1-8.15.23. [PMID: 22948729 DOI: 10.1002/0471250953.bi0815s39] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

120

Choi H. Computational detection of protein complexes in AP-MS experiments. Proteomics 2012;12:1663-8. [PMID: 22711593 DOI: 10.1002/pmic.201100508] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

121

Armean IM, Lilley KS, Trotter MWB. Popular computational methods to assess multiprotein complexes derived from label-free affinity purification and mass spectrometry (AP-MS) experiments. Mol Cell Proteomics 2012;12:1-13. [PMID: 23071097 DOI: 10.1074/mcp.r112.019554] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Abstract

Advances in sensitivity, resolution, mass accuracy, and throughput have considerably increased the number of protein identifications made via mass spectrometry. Despite these advances, state-of-the-art experimental methods for the study of protein-protein interactions yield more candidate interactions than may be expected biologically owing to biases and limitations in the experimental methodology. In silico methods, which distinguish between true and false interactions, have been developed and applied successfully to reduce the number of false positive results yielded by physical interaction assays. Such methods may be grouped according to: (1) the type of data used: methods based on experiment-specific measurements (e.g., spectral counts or identification scores) versus methods that extract knowledge encoded in external annotations (e.g., public interaction and functional categorisation databases); (2) the type of algorithm applied: the statistical description and estimation of physical protein properties versus predictive supervised machine learning or text-mining algorithms; (3) the type of protein relation evaluated: direct (binary) interaction of two proteins in a cocomplex versus probability of any functional relationship between two proteins (e.g., co-occurrence in a pathway, sub cellular compartment); and (4) initial motivation: elucidation of experimental data by evaluation versus prediction of novel protein-protein interaction, to be experimentally validated a posteriori. This work reviews several popular computational scoring methods and software platforms for protein-protein interactions evaluation according to their methodology, comparative strengths and weaknesses, data representation, accessibility, and availability. The scoring methods and platforms described include: CompPASS, SAINT, Decontaminator, MINT, IntAct, STRING, and FunCoup. References to related work are provided throughout in order to provide a concise but thorough introduction to a rapidly growing interdisciplinary field of investigation.

Collapse

122

Isokpehi RD, Udensi UK, Anyanwu MN, Mbah AN, Johnson MO, Edusei K, Bauer MA, Hall RA, Awofolu OR. Knowledge building insights on biomarkers of arsenic toxicity to keratinocytes and melanocytes. Biomark Insights 2012;7:127-41. [PMID: 23115478 PMCID: PMC3480875 DOI: 10.4137/bmi.s7799] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Abstract

Exposure to inorganic arsenic induces skin cancer and abnormal pigmentation in susceptible humans. High-throughput gene transcription assays such as DNA microarrays allow for the identification of biological pathways affected by arsenic that lead to initiation and progression of skin cancer and abnormal pigmentation. The overall purpose of the reported research was to determine knowledge building insights on biomarker genes for arsenic toxicity to human epidermal cells by integrating a collection of gene lists annotated with biological information. The information sets included toxicogenomics gene-chemical interaction; enzymes encoded in the human genome; enriched biological information associated with genes; environmentally relevant gene sequence variation; and effects of non-synonymous single nucleotide polymorphisms (SNPs) on protein function. Molecular network construction for arsenic upregulated genes TNFSF18 (tumor necrosis factor [ligand] superfamily member 18) and IL1R2 (interleukin 1 Receptor, type 2) revealed subnetwork interconnections to E2F4, an oncogenic transcription factor, predominantly expressed at the onset of keratinocyte differentiation. Visual analytics integration of gene information sources helped identify RAC1, a GTP binding protein, and TFRC, an iron uptake protein as prioritized arsenic-perturbed protein targets for biological processes leading to skin hyperpigmentation. RAC1 regulates the formation of dendrites that transfer melanin from melanocytes to neighboring keratinocytes. Increased melanocyte dendricity is correlated with hyperpigmentation. TFRC is a key determinant of the amount and location of iron in the epidermis. Aberrant TFRC expression could impair cutaneous iron metabolism leading to abnormal pigmentation seen in some humans exposed to arsenicals. The reported findings contribute to insights on how arsenic could impair the function of genes and biological pathways in epidermal cells. Finally, we developed visual analytics resources to facilitate further exploration of the information and knowledge building insights on arsenic toxicity to human epidermal keratinocytes and melanocytes.

Collapse

123

New horizons for antiviral drug discovery from virus–host protein interaction networks. Curr Opin Virol 2012;2:606-13. [DOI: 10.1016/j.coviro.2012.09.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Revised: 09/05/2012] [Accepted: 09/05/2012] [Indexed: 12/21/2022]

124

Babu M, Vlasblom J, Pu S, Guo X, Graham C, Bean BDM, Burston HE, Vizeacoumar FJ, Snider J, Phanse S, Fong V, Tam YYC, Davey M, Hnatshak O, Bajaj N, Chandran S, Punna T, Christopolous C, Wong V, Yu A, Zhong G, Li J, Stagljar I, Conibear E, Wodak SJ, Emili A, Greenblatt JF. Interaction landscape of membrane-protein complexes in Saccharomyces cerevisiae. Nature 2012;489:585-9. [PMID: 22940862 DOI: 10.1038/nature11354] [Citation(s) in RCA: 176] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Accepted: 06/27/2012] [Indexed: 01/03/2023]

125

De Las Rivas J, Fontanillo C. Protein-protein interaction networks: unraveling the wiring of molecular machines within the cell. Brief Funct Genomics 2012;11:489-96. [PMID: 22908212 DOI: 10.1093/bfgp/els036] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open

126

Tejera E, Bernardes J, Rebelo I. Preeclampsia: a bioinformatics approach through protein-protein interaction networks analysis. BMC SYSTEMS BIOLOGY 2012;6:97. [PMID: 22873350 PMCID: PMC3483240 DOI: 10.1186/1752-0509-6-97] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/05/2012] [Accepted: 07/23/2012] [Indexed: 01/29/2023]

127

Arnold R, Boonen K, Sun MG, Kim PM. Computational analysis of interactomes: current and future perspectives for bioinformatics approaches to model the host-pathogen interaction space. Methods 2012;57:508-18. [PMID: 22750305 PMCID: PMC7128575 DOI: 10.1016/j.ymeth.2012.06.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2012] [Revised: 06/20/2012] [Accepted: 06/21/2012] [Indexed: 11/05/2022] Open

128

Das J, Yu H. HINT: High-quality protein interactomes and their applications in understanding human disease. BMC SYSTEMS BIOLOGY 2012;6:92. [PMID: 22846459 PMCID: PMC3483187 DOI: 10.1186/1752-0509-6-92] [Citation(s) in RCA: 287] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2011] [Accepted: 06/30/2012] [Indexed: 12/22/2022]

129

Orchard S, Kerrien S, Abbani S, Aranda B, Bhate J, Bidwell S, Bridge A, Briganti L, Brinkman FSL, Brinkman F, Cesareni G, Chatr-aryamontri A, Chautard E, Chen C, Dumousseau M, Goll J, Hancock REW, Hancock R, Hannick LI, Jurisica I, Khadake J, Lynn DJ, Mahadevan U, Perfetto L, Raghunath A, Ricard-Blum S, Roechert B, Salwinski L, Stümpflen V, Tyers M, Uetz P, Xenarios I, Hermjakob H. Protein interaction data curation: the International Molecular Exchange (IMEx) consortium. Nat Methods 2012;9:345-50. [PMID: 22453911 DOI: 10.1038/nmeth.1931] [Citation(s) in RCA: 385] [Impact Index Per Article: 32.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

130

Taylor IW, Wrana JL. Protein interaction networks in medicine and disease. Proteomics 2012;12:1706-16. [DOI: 10.1002/pmic.201100594] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

131

Tripathi LP, Kambara H, Moriishi K, Morita E, Abe T, Mori Y, Chen YA, Matsuura Y, Mizuguchi K. Proteomic analysis of hepatitis C virus (HCV) core protein transfection and host regulator PA28γ knockout in HCV pathogenesis: a network-based study. J Proteome Res 2012;11:3664-79. [PMID: 22646850 DOI: 10.1021/pr300121a] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

132

Ihara S, Kida H, Arase H, Tripathi LP, Chen YA, Kimura T, Yoshida M, Kashiwa Y, Hirata H, Fukamizu R, Inoue R, Hasegawa K, Goya S, Takahashi R, Minami T, Tsujino K, Suzuki M, Kohmo S, Inoue K, Nagatomo I, Takeda Y, Kijima T, Mizuguchi K, Tachibana I, Kumanogoh A. Inhibitory Roles of Signal Transducer and Activator of Transcription 3 in Antitumor Immunity during Carcinogen-Induced Lung Tumorigenesis. Cancer Res 2012;72:2990-9. [DOI: 10.1158/0008-5472.can-11-4062] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

133

Fiume M, Smith EJM, Brook A, Strbenac D, Turner B, Mezlini AM, Robinson MD, Wodak SJ, Brudno M. Savant Genome Browser 2: visualization and analysis for population-scale genomics. Nucleic Acids Res 2012;40:W615-21. [PMID: 22638571 PMCID: PMC3394255 DOI: 10.1093/nar/gks427] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open

134

Kholodenko B, Yaffe MB, Kolch W. Computational approaches for analyzing information flow in biological networks. Sci Signal 2012;5:re1. [PMID: 22510471 DOI: 10.1126/scisignal.2002961] [Citation(s) in RCA: 126] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

135

Schaefer MH, Fontaine JF, Vinayagam A, Porras P, Wanker EE, Andrade-Navarro MA. HIPPIE: Integrating protein interaction networks with experiment based quality scores. PLoS One 2012;7:e31826. [PMID: 22348130 PMCID: PMC3279424 DOI: 10.1371/journal.pone.0031826] [Citation(s) in RCA: 231] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2011] [Accepted: 01/12/2012] [Indexed: 01/03/2023] Open

136

Wang X, Wei X, Thijssen B, Das J, Lipkin SM, Yu H. Three-dimensional reconstruction of protein networks provides insight into human genetic disease. Nat Biotechnol 2012;30:159-64. [PMID: 22252508 DOI: 10.1038/nbt.2106] [Citation(s) in RCA: 280] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2011] [Accepted: 12/19/2011] [Indexed: 01/13/2023]

137

De Las Rivas J, Prieto C. Protein interactions: mapping interactome networks to support drug target discovery and selection. Methods Mol Biol 2012;910:279-96. [PMID: 22821600 DOI: 10.1007/978-1-61779-965-5_12] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

138

Sun MGF, Kim PM. Evolution of biological interaction networks: from models to real data. Genome Biol 2011;12:235. [PMID: 22204388 PMCID: PMC3334609 DOI: 10.1186/gb-2011-12-12-235] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Accepted: 12/12/2011] [Indexed: 01/19/2023] Open

139

Mora A, Donaldson IM. iRefR: an R package to manipulate the iRefIndex consolidated protein interaction database. BMC Bioinformatics 2011;12:455. [PMID: 22115179 PMCID: PMC3282787 DOI: 10.1186/1471-2105-12-455] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2011] [Accepted: 11/24/2011] [Indexed: 11/19/2022] Open

140

Razick S, Mora A, Michalickova K, Boddie P, Donaldson IM. iRefScape. A Cytoscape plug-in for visualization and data mining of protein interaction data from iRefIndex. BMC Bioinformatics 2011;12:388. [PMID: 21975162 PMCID: PMC3228863 DOI: 10.1186/1471-2105-12-388] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2011] [Accepted: 10/05/2011] [Indexed: 11/10/2022] Open

141

Lu Z, Kao HY, Wei CH, Huang M, Liu J, Kuo CJ, Hsu CN, Tsai RTH, Dai HJ, Okazaki N, Cho HC, Gerner M, Solt I, Agarwal S, Liu F, Vishnyakova D, Ruch P, Romacker M, Rinaldi F, Bhattacharya S, Srinivasan P, Liu H, Torii M, Matos S, Campos D, Verspoor K, Livingston KM, Wilbur WJ. The gene normalization task in BioCreative III. BMC Bioinformatics 2011;12 Suppl 8:S2. [PMID: 22151901 PMCID: PMC3269937 DOI: 10.1186/1471-2105-12-s8-s2] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

We report the Gene Normalization (GN) challenge in BioCreative III where participating teams were asked to return a ranked list of identifiers of the genes detected in full-text articles. For training, 32 fully and 500 partially annotated articles were prepared. A total of 507 articles were selected as the test set. Due to the high annotation cost, it was not feasible to obtain gold-standard human annotations for all test articles. Instead, we developed an Expectation Maximization (EM) algorithm approach for choosing a small number of test articles for manual annotation that were most capable of differentiating team performance. Moreover, the same algorithm was subsequently used for inferring ground truth based solely on team submissions. We report team performance on both gold standard and inferred ground truth using a newly proposed metric called Threshold Average Precision (TAP-k).

RESULTS

We received a total of 37 runs from 14 different teams for the task. When evaluated using the gold-standard annotations of the 50 articles, the highest TAP-k scores were 0.3297 (k=5), 0.3538 (k=10), and 0.3535 (k=20), respectively. Higher TAP-k scores of 0.4916 (k=5, 10, 20) were observed when evaluated using the inferred ground truth over the full test set. When combining team results using machine learning, the best composite system achieved TAP-k scores of 0.3707 (k=5), 0.4311 (k=10), and 0.4477 (k=20) on the gold standard, representing improvements of 12.4%, 21.8%, and 26.6% over the best team results, respectively.

CONCLUSIONS

By using full text and being species non-specific, the GN task in BioCreative III has moved closer to a real literature curation task than similar tasks in the past and presents additional challenges for the text mining community, as revealed in the overall team results. By evaluating teams using the gold standard, we show that the EM algorithm allows team submissions to be differentiated while keeping the manual annotation effort feasible. Using the inferred ground truth we show measures of comparative performance between teams. Finally, by comparing team rankings on gold standard vs. inferred ground truth, we further demonstrate that the inferred ground truth is as effective as the gold standard for detecting good team performance.

Collapse

Affiliation(s)

Zhiyong Lu National Center for Biotechnology Information (NCBI), 8600 Rockville Pike, Bethesda, Maryland 20894, USA
Hung-Yu Kao Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, R.O.C
Chih-Hsuan Wei Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, R.O.C
Minlie Huang Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Jingchen Liu Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Cheng-Ju Kuo Institute of Information Science, Academia Sinica, Taipei 115, Taiwan
Chun-Nan Hsu Institute of Information Science, Academia Sinica, Taipei 115, Taiwan Information Science Institute, University of Southern California, Marina del Rey, California, USA
Richard Tzong-Han Tsai Department of Computer Science and Engineering, Yuan Ze University, Chung-Li, Taiwan, R.O.C
Hong-Jie Dai Department of Computer Science, National Tsing-Hua University, Hsinchu, Taiwan, R.O.C Institute of Information Science, Academic Sinica, Taipei, Taiwan, R.O.C
Naoaki Okazaki Interfaculty Initiative in Information Studies, University of Tokyo, Japan
Han-Cheol Cho Graduate School of Information Science and Technology, University of Tokyo, Japan
Martin Gerner Faculty of Life Sciences, University of Manchester, Manchester, M13 9PT, UK
Illes Solt Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, 1117 Budapest, Hungary
Shashank Agarwal Medical Informatics, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
Feifan Liu Medical Informatics, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
Dina Vishnyakova BiTem Group, Division of Medical Information Sciences, University of Geneva, Switzerland
Patrick Ruch BiTeM Group, Information Science Department, University of Applied Science, Geneva, Switzerland
Martin Romacker NITAS/TMS, Text Mining Services, Novartis AG, Switzerland
Fabio Rinaldi Institute of Computational Linguistics, University of Zurich, Zurich, Switzerland
Sanmitra Bhattacharya Department of Computer Science, The University of Iowa, Iowa City, Iowa 52242, USA
Padmini Srinivasan Department of Computer Science, The University of Iowa, Iowa City, Iowa 52242, USA
Hongfang Liu Department of Health Sciences Research, Mayo Clinic College of Medicine, Rochester, MN 55905 USA
Manabu Torii Lab of Text Intelligence in Biomedicine, Georgetown University Medical Center, 4000 Reservoir Rd., NW, Washington, DC 20057 USA
Sergio Matos DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
David Campos DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
Karin Verspoor Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado, USA
Kevin M Livingston Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado, USA
W John Wilbur National Center for Biotechnology Information (NCBI), 8600 Rockville Pike, Bethesda, Maryland 20894, USA

Collapse

142

Interaction databases on the same page. Nat Biotechnol 2011;29:391-3. [PMID: 21552234 DOI: 10.1038/nbt.1867] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

143

Stojmirović A, Yu YK. ppiTrim: constructing non-redundant and up-to-date interactomes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2011;2011:bar036. [PMID: 21873645 PMCID: PMC3162744 DOI: 10.1093/database/bar036] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

Abstract

Robust advances in interactome analysis demand comprehensive, non-redundant and consistently annotated data sets. By non-redundant, we mean that the accounting of evidence for every interaction should be faithful: each independent experimental support is counted exactly once, no more, no less. While many interactions are shared among public repositories, none of them contains the complete known interactome for any model organism. In addition, the annotations of the same experimental result by different repositories often disagree. This brings up the issue of which annotation to keep while consolidating evidences that are the same. The iRefIndex database, including interactions from most popular repositories with a standardized protein nomenclature, represents a significant advance in all aspects, especially in comprehensiveness. However, iRefIndex aims to maintain all information/annotation from original sources and requires users to perform additional processing to fully achieve the aforementioned goals. Another issue has to do with protein complexes. Some databases represent experimentally observed complexes as interactions with more than two participants, while others expand them into binary interactions using spoke or matrix model. To avoid untested interaction information buildup, it is preferable to replace the expanded protein complexes, either from spoke or matrix models, with a flat list of complex members.

To address these issues and to achieve our goals, we have developed ppiTrim, a script that processes iRefIndex to produce non-redundant, consistently annotated data sets of physical interactions. Our script proceeds in three stages: mapping all interactants to gene identifiers and removing all undesired raw interactions, deflating potentially expanded complexes, and reconciling for each interaction the annotation labels among different source databases. As an illustration, we have processed the three largest organismal data sets: yeast, human and fruitfly. While ppiTrim can resolve most apparent conflicts between different labelings, we also discovered some unresolvable disagreements mostly resulting from different annotation policies among repositories.

Database URL:http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/ppiTrim.html

Collapse

144

Fernández‐Recio J. Prediction of protein binding sites and hot spots. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2011. [DOI: 10.1002/wcms.45] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

145

Azuaje FJ, Wang H, Zheng H, Léonard F, Rolland-Turner M, Zhang L, Devaux Y, Wagner DR. Predictive integration of gene functional similarity and co-expression defines treatment response of endothelial progenitor cells. BMC SYSTEMS BIOLOGY 2011;5:46. [PMID: 21447198 PMCID: PMC3080295 DOI: 10.1186/1752-0509-5-46] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Accepted: 03/30/2011] [Indexed: 01/04/2023]

Abstract

Background

Endothelial progenitor cells (EPCs) have been implicated in different processes crucial to vasculature repair, which may offer the basis for new therapeutic strategies in cardiovascular disease. Despite advances facilitated by functional genomics, there is a lack of systems-level understanding of treatment response mechanisms of EPCs. In this research we aimed to characterize the EPCs response to adenosine (Ado), a cardioprotective factor, based on the systems-level integration of gene expression data and prior functional knowledge. Specifically, we set out to identify novel biosignatures of Ado-treatment response in EPCs.

Results

The predictive integration of gene expression data and standardized functional similarity information enabled us to identify new treatment response biosignatures. Gene expression data originated from Ado-treated and -untreated EPCs samples, and functional similarity was estimated with Gene Ontology (GO)-based similarity information. These information sources enabled us to implement and evaluate an integrated prediction approach based on the concept of k-nearest neighbours learning (kNN). The method can be executed by expert- and data-driven input queries to guide the search for biologically meaningful biosignatures. The resulting integrated kNN system identified new candidate EPC biosignatures that can offer high classification performance (areas under the operating characteristic curve > 0.8). We also showed that the proposed models can outperform those discovered by standard gene expression analysis. Furthermore, we report an initial independent in vitro experimental follow-up, which provides additional evidence of the potential validity of the top biosignature.

Conclusion

Response to Ado treatment in EPCs can be accurately characterized with a new method based on the combination of gene co-expression data and GO-based similarity information. It also exploits the incorporation of human expert-driven queries as a strategy to guide the automated search for candidate biosignatures. The proposed biosignature improves the systems-level characterization of EPCs. The new integrative predictive modeling approach can also be applied to other phenotype characterization or biomarker discovery problems.

Collapse

146

Hao Y, Merkoulovitch A, Vlasblom J, Pu S, Turinsky AL, Roudeva D, Turner B, Greenblatt J, Wodak SJ. OrthoNets: simultaneous visual analysis of orthologs and their interaction neighborhoods across different organisms. Bioinformatics 2011;27:883-4. [PMID: 21257609 PMCID: PMC3051336 DOI: 10.1093/bioinformatics/btr035] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

147

Choi H, Larsen B, Lin ZY, Breitkreutz A, Mellacheruvu D, Fermin D, Qin ZS, Tyers M, Gingras AC, Nesvizhskii AI. SAINT: probabilistic scoring of affinity purification-mass spectrometry data. Nat Methods 2011;8:70-3. [PMID: 21131968 PMCID: PMC3064265 DOI: 10.1038/nmeth.1541] [Citation(s) in RCA: 531] [Impact Index Per Article: 40.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2010] [Accepted: 11/09/2010] [Indexed: 01/12/2023]

148

Turinsky AL, Razick S, Turner B, Donaldson IM, Wodak SJ. Literature curation of protein interactions: measuring agreement across major public databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010;2010:baq026. [PMID: 21183497 PMCID: PMC3011985 DOI: 10.1093/database/baq026] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]