1
|
Abstract
iRefWeb is a resource that provides web interface to a large collection of protein-protein interactions aggregated from major primary databases. The underlying data-consolidation process, called iRefIndex, implements a rigorous methodology of identifying redundant protein sequences and integrating disparate data records that reference the same peptide sequences, despite many potential differences in data identifiers across various source databases. iRefWeb offers a unified user interface to all interaction records and associated information collected by iRefIndex, in addition to a number of data filters and visual features that present the supporting evidence. Users of iRefWeb can explore the consolidated landscape of protein-protein interactions, establish the provenance and reliability of each data record, and compare annotations performed by different data curator teams. The iRefWeb portal is freely available at http://wodaklab.org/iRefWeb .
Collapse
|
2
|
Safari-Alighiarloo N, Taghizadeh M, Tabatabaei SM, Shahsavari S, Namaki S, Khodakarim S, Rezaei-Tavirani M. Identification of new key genes for type 1 diabetes through construction and analysis of protein-protein interaction networks based on blood and pancreatic islet transcriptomes. J Diabetes 2017; 9:764-777. [PMID: 27625010 DOI: 10.1111/1753-0407.12483] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/05/2016] [Revised: 08/17/2016] [Accepted: 09/08/2016] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Type 1 diabetes (T1D) is an autoimmune disease in which pancreatic β-cells are destroyed by infiltrating immune cells. Bilateral cooperation of pancreatic β-cells and immune cells has been proposed in the progression of T1D, but as yet no systems study has investigated this possibility. The aims of the study were to elucidate the underlying molecular mechanisms and identify key genes associated with T1D risk using a network biology approach. METHODS Interactome (protein-protein interaction [PPI]) and transcriptome data were integrated to construct networks of differentially expressed genes in peripheral blood mononuclear cells (PBMCs) and pancreatic β-cells. Centrality, modularity, and clique analyses of networks were used to get more meaningful biological information. RESULTS Analysis of genes expression profiles revealed several cytokines and chemokines in β-cells and their receptors in PBMCs, which is supports the dialogue between these two tissues in terms of PPIs. Functional modules and complexes analysis unraveled most significant biological pathways such as immune response, apoptosis, spliceosome, proteasome, and pathways of protein synthesis in the tissues. Finally, Y-box binding protein 1 (YBX1), SRSF protein kinase 1 (SRPK1), proteasome subunit alpha1/ 3, (PSMA1/3), X-ray repair cross complementing 6 (XRCC6), Cbl proto-oncogene (CBL), SRC proto-oncogene, non-receptor tyrosine kinase (SRC), phosphoinositide-3-kinase regulatory subunit 1 (PIK3R1), phospholipase C gamma 1 (PLCG1), SHC adaptor protein1 (SHC1) and ubiquitin conjugating enzyme E2 N (UBE2N) were identified as key markers that were hub-bottleneck genes involved in functional modules and complexes. CONCLUSIONS This study provide new insights into network biomarkers that may be considered potential therapeutic targets.
Collapse
Affiliation(s)
- Nahid Safari-Alighiarloo
- Proteomics Research Center, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammad Taghizadeh
- Bioinformatics Department, Institute of Biochemistry and Biophysics, Tehran University, Tehran, Iran
| | - Seyyed Mohammad Tabatabaei
- Medical Informatics Department, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Soodeh Shahsavari
- Biostatistics Department, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Saeed Namaki
- Department of Immunology, Faculty of Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Soheila Khodakarim
- Department of Epidemiology, School of Public Health, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mostafa Rezaei-Tavirani
- Proteomics Research Center, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
3
|
Safari-Alighiarloo N, Rezaei-Tavirani M, Taghizadeh M, Tabatabaei SM, Namaki S. Network-based analysis of differentially expressed genes in cerebrospinal fluid (CSF) and blood reveals new candidate genes for multiple sclerosis. PeerJ 2016; 4:e2775. [PMID: 28028462 PMCID: PMC5183126 DOI: 10.7717/peerj.2775] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 11/08/2016] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND The involvement of multiple genes and missing heritability, which are dominant in complex diseases such as multiple sclerosis (MS), entail using network biology to better elucidate their molecular basis and genetic factors. We therefore aimed to integrate interactome (protein-protein interaction (PPI)) and transcriptomes data to construct and analyze PPI networks for MS disease. METHODS Gene expression profiles in paired cerebrospinal fluid (CSF) and peripheral blood mononuclear cells (PBMCs) samples from MS patients, sampled in relapse or remission and controls, were analyzed. Differentially expressed genes which determined only in CSF (MS vs. control) and PBMCs (relapse vs. remission) separately integrated with PPI data to construct the Query-Query PPI (QQPPI) networks. The networks were further analyzed to investigate more central genes, functional modules and complexes involved in MS progression. RESULTS The networks were analyzed and high centrality genes were identified. Exploration of functional modules and complexes showed that the majority of high centrality genes incorporated in biological pathways driving MS pathogenesis. Proteasome and spliceosome were also noticeable in enriched pathways in PBMCs (relapse vs. remission) which were identified by both modularity and clique analyses. Finally, STK4, RB1, CDKN1A, CDK1, RAC1, EZH2, SDCBP genes in CSF (MS vs. control) and CDC37, MAP3K3, MYC genes in PBMCs (relapse vs. remission) were identified as potential candidate genes for MS, which were the more central genes involved in biological pathways. DISCUSSION This study showed that network-based analysis could explicate the complex interplay between biological processes underlying MS. Furthermore, an experimental validation of candidate genes can lead to identification of potential therapeutic targets.
Collapse
Affiliation(s)
- Nahid Safari-Alighiarloo
- Proteomics Research Center, Department of Basic Science, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences , Tehran , Iran
| | - Mostafa Rezaei-Tavirani
- Proteomics Research Center, Department of Basic Science, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences , Tehran , Iran
| | - Mohammad Taghizadeh
- Bioinformatics Department, Institute of Biochemistry and Biophysics, Tehran University , Tehran , Iran
| | - Seyyed Mohammad Tabatabaei
- Medical Informatics Department, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences , Tehran , Iran
| | - Saeed Namaki
- Immunology Department, Faculty of Medical Sciences, Shahid Beheshti University of Medical Sciences , Tehran , Iran
| |
Collapse
|
4
|
Abstract
In this chapter, we discuss in detail two essential methods used to evaluate the interaction of Myc with another protein of interest: co-immunoprecipitation (Co-IP) and in vitro pull-down assays. Co-IP is a method that, by immunoaffinity, allows the identification of protein-protein interactions within cells. We provide methods to conduct Co-IPs from whole-cell extracts as well as cytoplasmic and nuclear-enriched fractions. By contrast, the pull-down assay evaluates whether a bait protein that is bound to a solid support can specifically interact with a prey protein that is in solution. We provide methods to conduct in vitro pull-downs and further detail how to use this assay to distinguish whether a protein-protein interaction is direct or indirect. We also discuss methods used to screen for Myc interactors and provide an in silico strategy to help prioritize hits for further validation using the described Co-IP and in vitro pull-down assays.
Collapse
|
5
|
Pastrello C, Pasini E, Kotlyar M, Otasek D, Wong S, Sangrar W, Rahmati S, Jurisica I. Integration, visualization and analysis of human interactome. Biochem Biophys Res Commun 2014; 445:757-73. [DOI: 10.1016/j.bbrc.2014.01.151] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2013] [Accepted: 01/24/2014] [Indexed: 02/06/2023]
|
6
|
Rid R, Strasser W, Siegl D, Frech C, Kommenda M, Kern T, Hintner H, Bauer JW, Önder K. PRIMOS: an integrated database of reassessed protein-protein interactions providing web-based access to in silico validation of experimentally derived data. Assay Drug Dev Technol 2014; 11:333-46. [PMID: 23772554 DOI: 10.1089/adt.2013.506] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Steady improvements in proteomics present a bioinformatic challenge to retrieve, store, and process the accumulating and often redundant amount of information. In particular, a large-scale comparison and analysis of protein-protein interaction (PPI) data requires tools for data interpretation as well as validation. At this juncture, the Protein Interaction and Molecule Search (PRIMOS) platform represents a novel web portal that unifies six primary PPI databases (BIND, Biomolecular Interaction Network Database; DIP, Database of Interacting Proteins; HPRD, Human Protein Reference Database; IntAct; MINT, Molecular Interaction Database; and MIPS, Munich Information Center for Protein Sequences) into a single consistent repository, which currently includes more than 196,700 redundancy-removed PPIs. PRIMOS supports three advanced search strategies centering on disease-relevant PPIs, on inter- and intra-organismal crosstalk relations (e.g., pathogen-host interactions), and on highly connected protein nodes analysis ("hub" identification). The main novelties distinguishing PRIMOS from other secondary PPI databases are the reassessment of known PPIs, and the capacity to validate personal experimental data by our peer-reviewed, homology-based validation. This article focuses on definite PRIMOS use cases (presentation of embedded biological concepts, example applications) to demonstrate its broad functionality and practical value. PRIMOS is publicly available at http://primos.fh-hagenberg.at.
Collapse
Affiliation(s)
- Raphaela Rid
- Division of Molecular Dermatology, Department of Dermatology, Paracelsus Medical University Salzburg, Salzburg, Austria
| | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Turinsky AL, Razick S, Turner B, Donaldson IM, Wodak SJ. Navigating the global protein-protein interaction landscape using iRefWeb. Methods Mol Biol 2014; 1091:315-31. [PMID: 24203342 DOI: 10.1007/978-1-62703-691-7_22] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
iRefWeb is a bioinformatics resource that offers access to a large collection of data on protein-protein interactions in over a thousand organisms. This collection is consolidated from 14 major public databases that curate the scientific literature. The collection is enhanced with a range of versatile data filters and search options that categorize various types of protein-protein interactions and protein complexes. Users of iRefWeb are able to retrieve all curated interactions for a given organism or those involving a given protein (or a list of proteins), narrow down their search results based on different supporting evidence, and assess the reliability of these interactions using various criteria. They may also examine all data and annotations related to any publication that described the interaction-detection experiments. iRefWeb is freely available to the research community worldwide at http://wodaklab.org/iRefWeb .
Collapse
Affiliation(s)
- Andrei L Turinsky
- Molecular Structure and Function program, Hospital for Sick Children, Toronto, ON, Canada
| | | | | | | | | |
Collapse
|
8
|
Mosca R, Pons T, Céol A, Valencia A, Aloy P. Towards a detailed atlas of protein–protein interactions. Curr Opin Struct Biol 2013; 23:929-40. [DOI: 10.1016/j.sbi.2013.07.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Revised: 07/04/2013] [Accepted: 07/08/2013] [Indexed: 12/30/2022]
|
9
|
del-Toro N, Dumousseau M, Orchard S, Jimenez RC, Galeota E, Launay G, Goll J, Breuer K, Ono K, Salwinski L, Hermjakob H. A new reference implementation of the PSICQUIC web service. Nucleic Acids Res 2013; 41:W601-6. [PMID: 23671334 PMCID: PMC3977660 DOI: 10.1093/nar/gkt392] [Citation(s) in RCA: 77] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The Proteomics Standard Initiative Common QUery InterfaCe (PSICQUIC) specification was
created by the Human Proteome Organization Proteomics Standards Initiative (HUPO-PSI) to
enable computational access to molecular-interaction data resources by means of a standard
Web Service and query language. Currently providing >150 million binary interaction
evidences from 28 servers globally, the PSICQUIC interface allows the concurrent search of
multiple molecular-interaction information resources using a single query. Here, we
present an extension of the PSICQUIC specification (version 1.3), which has been released
to be compliant with the enhanced standards in molecular interactions. The new release
also includes a new reference implementation of the PSICQUIC server available to the data
providers. It offers augmented web service capabilities and improves the user experience.
PSICQUIC has been running for almost 5 years, with a user base growing from only 4 data
providers to 28 (April 2013) allowing access to 151 310 109 binary interactions. The power
of this web service is shown in PSICQUIC View web application, an example of how to
simultaneously query, browse and download results from the different PSICQUIC servers.
This application is free and open to all users with no login requirement (http://www.ebi.ac.uk/Tools/webservices/psicquic/view/main.xhtml).
Collapse
Affiliation(s)
- Noemi del-Toro
- EMBL-European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Breuer K, Foroushani AK, Laird MR, Chen C, Sribnaia A, Lo R, Winsor GL, Hancock REW, Brinkman FSL, Lynn DJ. InnateDB: systems biology of innate immunity and beyond--recent updates and continuing curation. Nucleic Acids Res 2012. [PMID: 23180781 PMCID: PMC3531080 DOI: 10.1093/nar/gks1147] [Citation(s) in RCA: 838] [Impact Index Per Article: 69.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
InnateDB (http://www.innatedb.com) is an integrated analysis platform that has been specifically designed to facilitate systems-level analyses of mammalian innate immunity networks, pathways and genes. In this article, we provide details of recent updates and improvements to the database. InnateDB now contains >196 000 human, mouse and bovine experimentally validated molecular interactions and 3000 pathway annotations of relevance to all mammalian cellular systems (i.e. not just immune relevant pathways and interactions). In addition, the InnateDB team has, to date, manually curated in excess of 18 000 molecular interactions of relevance to innate immunity, providing unprecedented insight into innate immunity networks, pathways and their component molecules. More recently, InnateDB has also initiated the curation of allergy- and asthma-related interactions. Furthermore, we report a range of improvements to our integrated bioinformatics solutions including web service access to InnateDB interaction data using Proteomics Standards Initiative Common Query Interface, enhanced Gene Ontology analysis for innate immunity, and the availability of new network visualizations tools. Finally, the recent integration of bovine data makes InnateDB the first integrated network analysis platform for this agriculturally important model organism.
Collapse
Affiliation(s)
- Karin Breuer
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, V5A1S6, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Biophysical and computational fragment-based approaches to targeting protein-protein interactions: applications in structure-guided drug discovery. Q Rev Biophys 2012; 45:383-426. [PMID: 22971516 DOI: 10.1017/s0033583512000108] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Drug discovery has classically targeted the active sites of enzymes or ligand-binding sites of receptors and ion channels. In an attempt to improve selectivity of drug candidates, modulation of protein-protein interfaces (PPIs) of multiprotein complexes that mediate conformation or colocation of components of cell-regulatory pathways has become a focus of interest. However, PPIs in multiprotein systems continue to pose significant challenges, as they are generally large, flat and poor in distinguishing features, making the design of small molecule antagonists a difficult task. Nevertheless, encouragement has come from the recognition that a few amino acids - so-called hotspots - may contribute the majority of interaction-free energy. The challenges posed by protein-protein interactions have led to a wellspring of creative approaches, including proteomimetics, stapled α-helical peptides and a plethora of antibody inspired molecular designs. Here, we review a more generic approach: fragment-based drug discovery. Fragments allow novel areas of chemical space to be explored more efficiently, but the initial hits have low affinity. This means that they will not normally disrupt PPIs, unless they are tethered, an approach that has been pioneered by Wells and co-workers. An alternative fragment-based approach is to stabilise the uncomplexed components of the multiprotein system in solution and employ conventional fragment-based screening. Here, we describe the current knowledge of the structures and properties of protein-protein interactions and the small molecules that can modulate them. We then describe the use of sensitive biophysical methods - nuclear magnetic resonance, X-ray crystallography, surface plasmon resonance, differential scanning fluorimetry or isothermal calorimetry - to screen and validate fragment binding. Fragment hits can subsequently be evolved into larger molecules with higher affinity and potency. These may provide new leads for drug candidates that target protein-protein interactions and have therapeutic value.
Collapse
|
12
|
Becnel LB, McKenna NJ. Minireview: progress and challenges in proteomics data management, sharing, and integration. Mol Endocrinol 2012; 26:1660-74. [PMID: 22902541 DOI: 10.1210/me.2012-1180] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
The proteome represents the identity, expression levels, interacting partners, and posttranslational modifications of proteins expressed within any given cell. Proteomic studies aim to census the quantitative and qualitative factors regulating the biological relationships of proteins acting in concert as functional cellular networks. In the field of endocrinology, proteomics has been of considerable value in determining the function and mechanism of action of endocrine signaling molecules in the cell membrane, cytoplasm, and nucleus and for the discovery of proteins as candidates for clinical biomarkers. The volume of data that can be generated by proteomics methodologies, up to gigabytes of data within a few hours, brings with it its own logistical hurdles and presents significant challenges to realizing the full potential of these datasets. In this minireview, we describe selected current proteomics methodologies and their application in basic and translational endocrinology before focusing on mass spectrometry as a model for current progress and challenges in data analysis, management, sharing, and integration.
Collapse
Affiliation(s)
- Lauren B Becnel
- Department of Medicine, Hematology and Oncology, Baylor College of Medicine, 1 Baylor Plaza MS-BCM305, Houston, Texas 77030, USA.
| | | |
Collapse
|
13
|
Koh GCKW, Porras P, Aranda B, Hermjakob H, Orchard SE. Analyzing protein-protein interaction networks. J Proteome Res 2012; 11:2014-31. [PMID: 22385417 DOI: 10.1021/pr201211w] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The advent of the "omics" era in biology research has brought new challenges and requires the development of novel strategies to answer previously intractable questions. Molecular interaction networks provide a framework to visualize cellular processes, but their complexity often makes their interpretation an overwhelming task. The inherently artificial nature of interaction detection methods and the incompleteness of currently available interaction maps call for a careful and well-informed utilization of this valuable data. In this tutorial, we aim to give an overview of the key aspects that any researcher needs to consider when working with molecular interaction data sets and we outline an example for interactome analysis. Using the molecular interaction database IntAct, the software platform Cytoscape, and its plugins BiNGO and clusterMaker, and taking as a starting point a list of proteins identified in a mass spectrometry-based proteomics experiment, we show how to build, visualize, and analyze a protein-protein interaction network.
Collapse
Affiliation(s)
- Gavin C K W Koh
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | | | | | | |
Collapse
|
14
|
Integration of Biomolecular Interaction Data in a Genomic and Proteomic Data Warehouse to Support Biomedical Knowledge Discovery. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS 2012. [DOI: 10.1007/978-3-642-35686-5_10] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
15
|
Licata L, Briganti L, Peluso D, Perfetto L, Iannuccelli M, Galeota E, Sacco F, Palma A, Nardozza AP, Santonico E, Castagnoli L, Cesareni G. MINT, the molecular interaction database: 2012 update. Nucleic Acids Res 2011; 40:D857-61. [PMID: 22096227 PMCID: PMC3244991 DOI: 10.1093/nar/gkr930] [Citation(s) in RCA: 711] [Impact Index Per Article: 54.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
The Molecular INTeraction Database (MINT, http://mint.bio.uniroma2.it/mint/) is a public repository for protein-protein interactions (PPI) reported in peer-reviewed journals. The database grows steadily over the years and at September 2011 contains approximately 235,000 binary interactions captured from over 4750 publications. The web interface allows the users to search, visualize and download interactions data. MINT is one of the members of the International Molecular Exchange consortium (IMEx) and adopts the Molecular Interaction Ontology of the Proteomics Standard Initiative (PSI-MI) standards for curation and data exchange. MINT data are freely accessible and downloadable at http://mint.bio.uniroma2.it/mint/download.do. We report here the growth of the database, the major changes in curation policy and a new algorithm to assign a confidence to each interaction.
Collapse
Affiliation(s)
- Luana Licata
- Department of Biology, University of Rome Tor Vergata, Via della Ricerca Scientifica, 00133 Rome, Italy
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Lee S, Salwinski L, Zhang C, Chu D, Sampankanpanich C, Reyes NA, Vangeloff A, Xing F, Li X, Wu TT, Sahasrabudhe S, Deng H, LaCount DJ, Sun R. An integrated approach to elucidate the intra-viral and viral-cellular protein interaction networks of a gamma-herpesvirus. PLoS Pathog 2011; 7:e1002297. [PMID: 22028648 PMCID: PMC3197595 DOI: 10.1371/journal.ppat.1002297] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Accepted: 08/17/2011] [Indexed: 12/22/2022] Open
Abstract
Genome-wide yeast two-hybrid (Y2H) screens were conducted to elucidate the molecular functions of open reading frames (ORFs) encoded by murine γ-herpesvirus 68 (MHV-68). A library of 84 MHV-68 genes and gene fragments was generated in a Gateway entry plasmid and transferred to Y2H vectors. All possible pair-wise interactions between viral proteins were tested in the Y2H assay, resulting in the identification of 23 intra-viral protein-protein interactions (PPIs). Seventy percent of the interactions between viral proteins were confirmed by co-immunoprecipitation experiments. To systematically investigate virus-cellular protein interactions, the MHV-68 Y2H constructs were screened against a cellular cDNA library, yielding 243 viral-cellular PPIs involving 197 distinct cellar proteins. Network analyses indicated that cellular proteins targeted by MHV-68 had more partners in the cellular PPI network and were located closer to each other than expected by chance. Taking advantage of this observation, we scored the cellular proteins based on their network distances from other MHV-68-interacting proteins and segregated them into high (Y2H-HP) and low priority/not-scored (Y2H-LP/NS) groups. Significantly more genes from Y2H-HP altered MHV-68 replication when their expression was inhibited with siRNAs (53% of genes from Y2H-HP, 21% of genes from Y2H-LP/NS, and 16% of genes randomly chosen from the human PPI network; p<0.05). Enriched Gene Ontology (GO) terms in the Y2H-HP group included regulation of apoptosis, protein kinase cascade, post-translational protein modification, transcription from RNA polymerase II promoter, and IκB kinase/NFκB cascade. Functional validation assays indicated that PCBP1, which interacted with MHV-68 ORF34, may be involved in regulating late virus gene expression in a manner consistent with the effects of its viral interacting partner. Our study integrated Y2H screening with multiple functional validation approaches to create γ-herpes viral-viral and viral-cellular protein interaction networks. Persistent infections by the herpesviruses Epstein Barr virus (EBV) and Kaposi's sarcoma herpesvirus (KSHV) are associated with tumor formation. To better understand how these and other related viruses interact with their host cells to promote virus replication and cause disease, we studied murine gamma-herpesvirus 68 (MHV-68). MHV-68 belongs to the same group of herpesviruses as EBV and KSHV, but has the advantage of being able to replicate efficiently in cell culture. Our study used genome-wide screens to identify 23 protein-protein interactions between the 80 MHV-68 proteins. Several of these interactions are likely to be important for assembling new viruses. We also discovered 243 interactions between MHV-68 and cellular proteins. To help prioritize cellular proteins for follow up studies, we developed a new computational tool to analyze our data. Proteins with high priority scores were more likely to affect viral replication than low priority proteins. Among the cellular proteins that had the greatest effect on MHV-68 replication was PCBP1, which negatively regulated MHV-68 late gene expression. This study identified many novel cellular proteins involved in MHV-68 replication and established a method to identify important proteins from high-throughput virus-cellular protein-protein interaction data sets.
Collapse
Affiliation(s)
- Shaoying Lee
- School of Dentistry, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Molecular and Medical Pharmacology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Lukasz Salwinski
- UCLA DOE-Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, California, United States of America
| | - Chaoying Zhang
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West LaFayette, Indiana, United States of America
| | - Derrick Chu
- Department of Molecular Cell and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Claire Sampankanpanich
- Department of Molecular Cell and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Nichole A. Reyes
- Department of Molecular and Medical Pharmacology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Abbey Vangeloff
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West LaFayette, Indiana, United States of America
| | - Fangfang Xing
- Department of Molecular Cell and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Xudong Li
- Department of Molecular and Medical Pharmacology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Ting-Ting Wu
- Department of Molecular and Medical Pharmacology, University of California Los Angeles, Los Angeles, California, United States of America
| | | | - Hongyu Deng
- School of Dentistry, University of California Los Angeles, Los Angeles, California, United States of America
- Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Douglas J. LaCount
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West LaFayette, Indiana, United States of America
- * E-mail: (DJL); (RS)
| | - Ren Sun
- Department of Molecular and Medical Pharmacology, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail: (DJL); (RS)
| |
Collapse
|
17
|
|
18
|
Stojmirović A, Yu YK. ppiTrim: constructing non-redundant and up-to-date interactomes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2011; 2011:bar036. [PMID: 21873645 PMCID: PMC3162744 DOI: 10.1093/database/bar036] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Robust advances in interactome analysis demand comprehensive, non-redundant and consistently annotated data sets. By non-redundant, we mean that the accounting of evidence for every interaction should be faithful: each independent experimental support is counted exactly once, no more, no less. While many interactions are shared among public repositories, none of them contains the complete known interactome for any model organism. In addition, the annotations of the same experimental result by different repositories often disagree. This brings up the issue of which annotation to keep while consolidating evidences that are the same. The iRefIndex database, including interactions from most popular repositories with a standardized protein nomenclature, represents a significant advance in all aspects, especially in comprehensiveness. However, iRefIndex aims to maintain all information/annotation from original sources and requires users to perform additional processing to fully achieve the aforementioned goals. Another issue has to do with protein complexes. Some databases represent experimentally observed complexes as interactions with more than two participants, while others expand them into binary interactions using spoke or matrix model. To avoid untested interaction information buildup, it is preferable to replace the expanded protein complexes, either from spoke or matrix models, with a flat list of complex members. To address these issues and to achieve our goals, we have developed ppiTrim, a script that processes iRefIndex to produce non-redundant, consistently annotated data sets of physical interactions. Our script proceeds in three stages: mapping all interactants to gene identifiers and removing all undesired raw interactions, deflating potentially expanded complexes, and reconciling for each interaction the annotation labels among different source databases. As an illustration, we have processed the three largest organismal data sets: yeast, human and fruitfly. While ppiTrim can resolve most apparent conflicts between different labelings, we also discovered some unresolvable disagreements mostly resulting from different annotation policies among repositories. Database URL:http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/ppiTrim.html
Collapse
Affiliation(s)
- Aleksandar Stojmirović
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|
19
|
Lopes TJS, Schaefer M, Shoemaker J, Matsuoka Y, Fontaine JF, Neumann G, Andrade-Navarro MA, Kawaoka Y, Kitano H. Tissue-specific subnetworks and characteristics of publicly available human protein interaction databases. ACTA ACUST UNITED AC 2011; 27:2414-21. [PMID: 21798963 DOI: 10.1093/bioinformatics/btr414] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Protein-protein interaction (PPI) databases are widely used tools to study cellular pathways and networks; however, there are several databases available that still do not account for cell type-specific differences. Here, we evaluated the characteristics of six interaction databases, incorporated tissue-specific gene expression information and finally, investigated if the most popular proteins of scientific literature are involved in good quality interactions. RESULTS We found that the evaluated databases are comparable in terms of node connectivity (i.e. proteins with few interaction partners also have few interaction partners in other databases), but may differ in the identity of interaction partners. We also observed that the incorporation of tissue-specific expression information significantly altered the interaction landscape and finally, we demonstrated that many of the most intensively studied proteins are engaged in interactions associated with low confidence scores. In summary, interaction databases are valuable research tools but may lead to different predictions on interactions or pathways. The accuracy of predictions can be improved by incorporating datasets on organ- and cell type-specific gene expression, and by obtaining additional interaction evidence for the most 'popular' proteins. CONTACT kitano@sbi.jp SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tiago J S Lopes
- JST ERATO KAWAOKA Infection-induced Host Responses Project, Tokyo, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Yakhini Z, Jurisica I. Cancer computational biology. BMC Bioinformatics 2011; 12:120. [PMID: 21521513 PMCID: PMC3111371 DOI: 10.1186/1471-2105-12-120] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2011] [Accepted: 04/26/2011] [Indexed: 01/18/2023] Open
|
21
|
Fortney K, Jurisica I. Integrative computational biology for cancer research. Hum Genet 2011; 130:465-81. [PMID: 21691773 PMCID: PMC3179275 DOI: 10.1007/s00439-011-0983-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2011] [Accepted: 04/02/2011] [Indexed: 12/21/2022]
Abstract
Over the past two decades, high-throughput (HTP) technologies such as microarrays and mass spectrometry have fundamentally changed clinical cancer research. They have revealed novel molecular markers of cancer subtypes, metastasis, and drug sensitivity and resistance. Some have been translated into the clinic as tools for early disease diagnosis, prognosis, and individualized treatment and response monitoring. Despite these successes, many challenges remain: HTP platforms are often noisy and suffer from false positives and false negatives; optimal analysis and successful validation require complex workflows; and great volumes of data are accumulating at a rapid pace. Here we discuss these challenges, and show how integrative computational biology can help diminish them by creating new software tools, analytical methods, and data standards.
Collapse
Affiliation(s)
- Kristen Fortney
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | | |
Collapse
|
22
|
|
23
|
Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P, Jensen LJ, von Mering C. The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 2011; 39:D561-8. [PMID: 21045058 PMCID: PMC3013807 DOI: 10.1093/nar/gkq973] [Citation(s) in RCA: 2554] [Impact Index Per Article: 196.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 10/03/2010] [Indexed: 12/12/2022] Open
Abstract
An essential prerequisite for any systems-level understanding of cellular functions is to correctly uncover and annotate all functional interactions among proteins in the cell. Toward this goal, remarkable progress has been made in recent years, both in terms of experimental measurements and computational prediction techniques. However, public efforts to collect and present protein interaction information have struggled to keep up with the pace of interaction discovery, partly because protein-protein interaction information can be error-prone and require considerable effort to annotate. Here, we present an update on the online database resource Search Tool for the Retrieval of Interacting Genes (STRING); it provides uniquely comprehensive coverage and ease of access to both experimental as well as predicted interaction information. Interactions in STRING are provided with a confidence score, and accessory information such as protein domains and 3D structures is made available, all within a stable and consistent identifier space. New features in STRING include an interactive network viewer that can cluster networks on demand, updated on-screen previews of structural information including homology models, extensive data updates and strongly improved connectivity and integration with third-party resources. Version 9.0 of STRING covers more than 1100 completely sequenced organisms; the resource can be reached at http://string-db.org.
Collapse
Affiliation(s)
- Damian Szklarczyk
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Andrea Franceschini
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Michael Kuhn
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Milan Simonovic
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Alexander Roth
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Pablo Minguez
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Tobias Doerks
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Manuel Stark
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Jean Muller
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Peer Bork
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Lars J. Jensen
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Christian von Mering
- Faculty of Health Sciences, Novo Nordisk Foundation Centre for Protein Research, University of Copenhagen, Denmark, Faculty of Science, Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland, Biotechnology Center, Technical University Dresden, Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany, Institute of Genetics and Molecular and Cellular Biology, CNRS, INSERM, University of Strasbourg, Genetic Diagnostics Laboratory, CHU Strasbourg Nouvel Hôpital Civil, Strasbourg, France and Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| |
Collapse
|
24
|
Abstract
The groundwork allowing the systematic capture of proteomics data has now largely been completed, with the design and publication of exchange formats and interchange standards by the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI). Our focus can now shift to gathering the ever-increasing amounts of generated data, and finding novel ways to catalog and present it so that a deeper understanding of basic science, health, and disease can be gained by scientists mining these increasingly rich resources.
Collapse
|
25
|
Abstract
In the past decades, a variety of publicly available data repositories and resources have been developed to support protein related information management, data-driven hypothesis generation and biological knowledge discovery. However, there is also an increasing confusion for the researchers who are trying to quickly find the appropriate resources to help them solve their problems. In this chapter, we present a comprehensive review (with categorization and description) of major protein bioinformatics databases and resources that are relevant to comparative proteomics research. We conclude the chapter by discussing the challenges and opportunities for developing new protein bioinformatics databases.
Collapse
|
26
|
Turinsky AL, Razick S, Turner B, Donaldson IM, Wodak SJ. Literature curation of protein interactions: measuring agreement across major public databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:baq026. [PMID: 21183497 PMCID: PMC3011985 DOI: 10.1093/database/baq026] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Literature curation of protein interaction data faces a number of challenges. Although curators increasingly adhere to standard data representations, the data that various databases actually record from the same published information may differ significantly. Some of the reasons underlying these differences are well known, but their global impact on the interactions collectively curated by major public databases has not been evaluated. Here we quantify the agreement between curated interactions from 15 471 publications shared across nine major public databases. Results show that on average, two databases fully agree on 42% of the interactions and 62% of the proteins curated from the same publication. Furthermore, a sizable fraction of the measured differences can be attributed to divergent assignments of organism or splice isoforms, different organism focus and alternative representations of multi-protein complexes. Our findings highlight the impact of divergent curation policies across databases, and should be relevant to both curators and data consumers interested in analyzing protein-interaction data generated by the scientific community. Database URL:http://wodaklab.org/iRefWeb
Collapse
Affiliation(s)
- Andrei L Turinsky
- Molecular Structure and Function Program, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario, Canada
| | | | | | | | | |
Collapse
|
27
|
Abstract
Binary subcomplexes in proteins database (BISC) is a new protein-protein interaction (PPI) database linking up the two communities most active in their characterization: structural biology and functional genomics researchers. The BISC resource offers users (i) a structural perspective and related information about binary subcomplexes (i.e. physical direct interactions between proteins) that are either structurally characterized or modellable entries in the main functional genomics PPI databases BioGRID, IntAct and HPRD; (ii) selected web services to further investigate the validity of postulated PPI by inspection of their hypothetical modelled interfaces. Among other uses we envision that this resource can help identify possible false positive PPI in current database records. BISC is freely available at http://bisc.cse.ucsc.edu.
Collapse
|
28
|
Turner B, Razick S, Turinsky AL, Vlasblom J, Crowdy EK, Cho E, Morrison K, Donaldson IM, Wodak SJ. iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:baq023. [PMID: 20940177 PMCID: PMC2963317 DOI: 10.1093/database/baq023] [Citation(s) in RCA: 146] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We present iRefWeb, a web interface to protein interaction data consolidated from 10 public databases: BIND, BioGRID, CORUM, DIP, IntAct, HPRD, MINT, MPact, MPPI and OPHID. iRefWeb enables users to examine aggregated interactions for a protein of interest, and presents various statistical summaries of the data across databases, such as the number of organism-specific interactions, proteins and cited publications. Through links to source databases and supporting evidence, researchers may gauge the reliability of an interaction using simple criteria, such as the detection methods, the scale of the study (high- or low-throughput) or the number of cited publications. Furthermore, iRefWeb compares the information extracted from the same publication by different databases, and offers means to follow-up possible inconsistencies. We provide an overview of the consolidated protein–protein interaction landscape and show how it can be automatically cropped to aid the generation of meaningful organism-specific interactomes. iRefWeb can be accessed at: http://wodaklab.org/iRefWeb. Database URL: http://wodaklab.org/iRefWeb/
Collapse
Affiliation(s)
- Brian Turner
- Molecular Structure and Function Program, Hospital for Sick Children, Toronto, ON, Canada
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Vizcaíno JA, Foster JM, Martens L. Proteomics data repositories: providing a safe haven for your data and acting as a springboard for further research. J Proteomics 2010; 73:2136-46. [PMID: 20615486 PMCID: PMC2958306 DOI: 10.1016/j.jprot.2010.06.008] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2010] [Revised: 06/15/2010] [Accepted: 06/18/2010] [Indexed: 01/19/2023]
Abstract
Despite the fact that data deposition is not a generalised fact yet in the field of proteomics, several mass spectrometry (MS) based proteomics repositories are publicly available for the scientific community. The main existing resources are: the Global Proteome Machine Database (GPMDB), PeptideAtlas, the PRoteomics IDEntifications database (PRIDE), Tranche, and NCBI Peptidome. In this review the capabilities of each of these will be described, paying special attention to four key properties: data types stored, applicable data submission strategies, supported formats, and available data mining and visualization tools. Additionally, the data contents from model organisms will be enumerated for each resource. There are other valuable smaller and/or more specialized repositories but they will not be covered in this review. Finally, the concept behind the ProteomeXchange consortium, a collaborative effort among the main resources in the field, will be introduced.
Collapse
Key Words
- cv, controlled vocabulary
- hgnc, hugo gene nomenclature committee
- mcp, molecular and cellular proteomics
- mrm, multiple reaction monitoring
- nih, national institutes of health
- ols, ontology lookup service
- picr, protein identifier cross-referencing
- psi, proteomics standards initiative
- qc, quality control
- srm, selected reaction monitoring
- sbeams, systems biology experiment analysis management system
- tpp, trans proteomics pipeline.
- proteomics
- databases
- bioinformatics
- data standards
- repositories
Collapse
Affiliation(s)
- Juan Antonio Vizcaíno
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Joseph M. Foster
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Lennart Martens
- Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| |
Collapse
|
30
|
Chautard E, Fatoux-Ardore M, Ballut L, Thierry-Mieg N, Ricard-Blum S. MatrixDB, the extracellular matrix interaction database. Nucleic Acids Res 2010; 39:D235-40. [PMID: 20852260 PMCID: PMC3013758 DOI: 10.1093/nar/gkq830] [Citation(s) in RCA: 102] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
MatrixDB (http://matrixdb.ibcp.fr) is a freely available database focused on interactions established by extracellular proteins and polysaccharides. Only few databases report protein-polysaccharide interactions and, to the best of our knowledge, there is no other database of extracellular interactions. MatrixDB takes into account the multimeric nature of several extracellular protein families for the curation of interactions, and reports interactions with individual polypeptide chains or with multimers, considered as permanent complexes, when appropriate. MatrixDB is a member of the International Molecular Exchange consortium (IMEx) and has adopted the PSI-MI standards for the curation and the exchange of interaction data. MatrixDB stores experimental data from our laboratory, data from literature curation, data imported from IMEx databases, and data from the Human Protein Reference Database. MatrixDB is focused on mammalian interactions, but aims to integrate interaction datasets of model organisms when available. MatrixDB provides direct links to databases recapitulating mutations in genes encoding extracellular proteins, to UniGene and to the Human Protein Atlas that shows expression and localization of proteins in a large variety of normal human tissues and cells. MatrixDB allows researchers to perform customized queries and to build tissue- and disease-specific interaction networks that can be visualized and analyzed with Cytoscape or Medusa.
Collapse
Affiliation(s)
- Emilie Chautard
- Institut de Biologie et Chimie des Protéines, UMR 5086 CNRS-Université Lyon 1, IFR 128 Biosciences Gerland-Lyon Sud, 7 passage du Vercors 69367, Lyon Cedex 07, France
| | | | | | | | | |
Collapse
|
31
|
Hakenberg J, Leaman R, Vo NH, Jonnalagadda S, Sullivan R, Miller C, Tari L, Baral C, Gonzalez G. Efficient extraction of protein-protein interactions from full-text articles. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2010; 7:481-494. [PMID: 20498514 DOI: 10.1109/tcbb.2010.51] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Proteins and their interactions govern virtually all cellular processes, such as regulation, signaling, metabolism, and structure. Most experimental findings pertaining to such interactions are discussed in research papers, which, in turn, get curated by protein interaction databases. Authors, editors, and publishers benefit from efforts to alleviate the tasks of searching for relevant papers, evidence for physical interactions, and proper identifiers for each protein involved. The BioCreative II.5 community challenge addressed these tasks in a competition-style assessment to evaluate and compare different methodologies, to make aware of the increasing accuracy of automated methods, and to guide future implementations. In this paper, we present our approaches for protein-named entity recognition, including normalization, and for extraction of protein-protein interactions from full text. Our overall goal is to identify efficient individual components, and we compare various compositions to handle a single full-text article in between 10 seconds and 2 minutes. We propose strategies to transfer document-level annotations to the sentence-level, which allows for the creation of a more fine-grained training corpus; we use this corpus to automatically derive around 5,000 patterns. We rank sentences by relevance to the task of finding novel interactions with physical evidence, using a sentence classifier built from this training corpus. Heuristics for paraphrasing sentences help to further remove unnecessary information that might interfere with patterns, such as additional adjectives, clauses, or bracketed expressions. In BioCreative II.5, we achieved an f-score of 22 percent for finding protein interactions, and 43 percent for mapping proteins to UniProt IDs; disregarding species, f-scores are 30 percent and 55 percent, respectively. On average, our best-performing setup required around 2 minutes per full text. All data and pattern sets as well as Java classes that extend- - third-party software are available as supplementary information (see Appendix).
Collapse
Affiliation(s)
- Jörg Hakenberg
- Department of Computer Science, Arizona State University, Tempe, AZ 85281-8809, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Orchard S, Aranda B, Hermjakob H. The publication and database deposition of molecular interaction data. ACTA ACUST UNITED AC 2010; Chapter 25:25.3.1-25.3.13. [PMID: 20393973 DOI: 10.1002/0471140864.ps2503s60] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Depositing data to a public domain interaction database not only improves the quality and quantity of interactions available to the user community, but also increases the visibility of the data, with members of the International Molecular Exchange (IMEx) databases making the information available in all participating resources. Datasets submitted prior to publication will be issued an accession number that may be included in a publication and which increases user accessibility to the data. No dataset is too small for submission, and the database curators will provide assistance in ensuring the information is correctly represented. This unit provides several alternative protocols to assist the author in preparing and submitting data as an integral part of the manuscript-preparation process. Which method the author selects is largely dictated by the amount of data to be deposited. In addition, two support protocols describe assignment of unambiguous accession numbers and use of controlled vocabulary terms.
Collapse
Affiliation(s)
- Sandra Orchard
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | | | | |
Collapse
|
33
|
Lee K, Thorneycroft D, Achuthan P, Hermjakob H, Ideker T. Mapping plant interactomes using literature curated and predicted protein-protein interaction data sets. THE PLANT CELL 2010; 22:997-1005. [PMID: 20371643 PMCID: PMC2879763 DOI: 10.1105/tpc.109.072736] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Most cellular processes are enabled by cohorts of interacting proteins that form dynamic networks within the plant proteome. The study of these networks can provide insight into protein function and provide new avenues for research. This article informs the plant science community of the currently available sources of protein interaction data and discusses how they can be useful to researchers. Using our recently curated IntAct Arabidopsis thaliana protein-protein interaction data set as an example, we discuss potentials and limitations of the plant interactomes generated to date. In addition, we present our efforts to add value to the interaction data by using them to seed a proteome-wide map of predicted protein subcellular locations.
Collapse
Affiliation(s)
- KiYoung Lee
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon 443-749, Korea.
| | | | | | | | | |
Collapse
|
34
|
Montecchi-Palazzi L, Kerrien S, Reisinger F, Aranda B, Jones AR, Martens L, Hermjakob H. The PSI semantic validator: a framework to check MIAPE compliance of proteomics data. Proteomics 2010; 9:5112-9. [PMID: 19834897 DOI: 10.1002/pmic.200900189] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The Human Proteome Organization's Proteomics Standards Initiative (PSI) promotes the development of exchange standards to improve data integration and interoperability. PSI specifies the suitable level of detail required when reporting a proteomics experiment (via the Minimum Information About a Proteomics Experiment), and provides extensible markup language (XML) exchange formats and dedicated controlled vocabularies (CVs) that must be combined to generate a standard compliant document. The framework presented here tackles the issue of checking that experimental data reported using a specific format, CVs and public bio-ontologies (e.g. Gene Ontology, NCBI taxonomy) are compliant with the Minimum Information About a Proteomics Experiment recommendations. The semantic validator not only checks the XML syntax but it also enforces rules regarding the use of an ontology class or CV terms by checking that the terms exist in the resource and that they are used in the correct location of a document. Moreover, this framework is extremely fast, even on sizable data files, and flexible, as it can be adapted to any standard by customizing the parameters it requires: an XML Schema Definition, one or more CVs or ontologies, and a mapping file describing in a formal way how the semantic resources and the format are interrelated. As such, the validator provides a general solution to the common problem in data exchange: how to validate the correct usage of a data standard beyond simple XML Schema Definition validation. The framework source code and its various applications can be found at http://psidev.info/validator.
Collapse
Affiliation(s)
- Luisa Montecchi-Palazzi
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK.
| | | | | | | | | | | | | |
Collapse
|
35
|
Brooksbank C, Cameron G, Thornton J. The European Bioinformatics Institute's data resources. Nucleic Acids Res 2010; 38:D17-25. [PMID: 19934258 PMCID: PMC2808956 DOI: 10.1093/nar/gkp986] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2009] [Accepted: 10/15/2009] [Indexed: 11/13/2022] Open
Abstract
The wide uptake of next-generation sequencing and other ultra-high throughput technologies by life scientists with a diverse range of interests, spanning fundamental biological research, medicine, agriculture and environmental science, has led to unprecedented growth in the amount of data generated. It has also put the need for unrestricted access to biological data at the centre of biology. The European Bioinformatics Institute (EMBL-EBI) is unique in Europe and is one of only two organisations worldwide providing access to a comprehensive, integrated set of these collections. Here, we describe how the EMBL-EBI's biomolecular databases are evolving to cope with increasing levels of submission, a growing and diversifying user base, and the demand for new types of data. All of the resources described here can be accessed from the EMBL-EBI website: http://www.ebi.ac.uk.
Collapse
Affiliation(s)
- Catherine Brooksbank
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | | | | |
Collapse
|
36
|
Salwinski L, Licata L, Winter A, Thorneycroft D, Khadake J, Ceol A, Aryamontri AC, Oughtred R, Livstone M, Boucher L, Botstein D, Dolinski K, Berardini T, Huala E, Tyers M, Eisenberg D, Cesareni G, Hermjakob H. Recurated protein interaction datasets. Nat Methods 2009; 6:860-1. [DOI: 10.1038/nmeth1209-860] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
37
|
Ceol A, Chatr Aryamontri A, Licata L, Peluso D, Briganti L, Perfetto L, Castagnoli L, Cesareni G. MINT, the molecular interaction database: 2009 update. Nucleic Acids Res 2009; 38:D532-9. [PMID: 19897547 PMCID: PMC2808973 DOI: 10.1093/nar/gkp983] [Citation(s) in RCA: 372] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
MINT (http://mint.bio.uniroma2.it/mint) is a public repository for molecular interactions reported in peer-reviewed journals. Since its last report, MINT has grown considerably in size and evolved in scope to meet the requirements of its users. The main changes include a more precise definition of the curation policy and the development of an enhanced and user-friendly interface to facilitate the analysis of the ever-growing interaction dataset. MINT has adopted the PSI-MI standards for the annotation and for the representation of molecular interactions and is a member of the IMEx consortium.
Collapse
Affiliation(s)
- Arnaud Ceol
- Department of Biology, University of Rome Tor Vergata, Via della Ricerca Scientifica, 00133 Rome.
| | | | | | | | | | | | | | | |
Collapse
|
38
|
Aranda B, Achuthan P, Alam-Faruque Y, Armean I, Bridge A, Derow C, Feuermann M, Ghanbarian AT, Kerrien S, Khadake J, Kerssemakers J, Leroy C, Menden M, Michaut M, Montecchi-Palazzi L, Neuhauser SN, Orchard S, Perreau V, Roechert B, van Eijk K, Hermjakob H. The IntAct molecular interaction database in 2010. Nucleic Acids Res 2009; 38:D525-31. [PMID: 19850723 PMCID: PMC2808934 DOI: 10.1093/nar/gkp878] [Citation(s) in RCA: 524] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
IntAct is an open-source, open data molecular interaction database and toolkit. Data is abstracted from the literature or from direct data depositions by expert curators following a deep annotation model providing a high level of detail. As of September 2009, IntAct contains over 200.000 curated binary interaction evidences. In response to the growing data volume and user requests, IntAct now provides a two-tiered view of the interaction data. The search interface allows the user to iteratively develop complex queries, exploiting the detailed annotation with hierarchical controlled vocabularies. Results are provided at any stage in a simplified, tabular view. Specialized views then allows 'zooming in' on the full annotation of interactions, interactors and their properties. IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.
Collapse
Affiliation(s)
- B Aranda
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Vizcaíno JA, Côté R, Reisinger F, M. Foster J, Mueller M, Rameseder J, Hermjakob H, Martens L. A guide to the Proteomics Identifications Database proteomics data repository. Proteomics 2009; 9:4276-83. [PMID: 19662629 PMCID: PMC2970915 DOI: 10.1002/pmic.200900402] [Citation(s) in RCA: 207] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2009] [Revised: 06/24/2009] [Accepted: 06/25/2009] [Indexed: 01/02/2023]
Abstract
The Proteomics Identifications Database (PRIDE, www.ebi.ac.uk/pride) is one of the main repositories of MS derived proteomics data. Here, we point out the main functionalities of PRIDE both as a submission repository and as a source for proteomics data. We describe the main features for data retrieval and visualization available through the PRIDE web and BioMart interfaces. We also highlight the mechanism by which tailored queries in the BioMart can join PRIDE to other resources such as Reactome, Ensembl or UniProt to execute extremely powerful across-domain queries. We then present the latest improvements in the PRIDE submission process, using the new easy-to-use, platform-independent graphical user interface submission tool PRIDE Converter. Finally, we speak about future plans and the role of PRIDE in the ProteomExchange consortium.
Collapse
Affiliation(s)
- Juan Antonio Vizcaíno
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Richard Côté
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Florian Reisinger
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Joseph M. Foster
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Michael Mueller
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Jonathan Rameseder
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
- Computational and Systems Biology Initiative, Massachusetts Institute of TechnologyCambridge, MA, USA
| | - Henning Hermjakob
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Lennart Martens
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| |
Collapse
|
40
|
Blankenburg H, Finn RD, Prlić A, Jenkinson AM, Ramírez F, Emig D, Schelhorn SE, Büch J, Lengauer T, Albrecht M. DASMI: exchanging, annotating and assessing molecular interaction data. ACTA ACUST UNITED AC 2009; 25:1321-8. [PMID: 19420069 PMCID: PMC2677739 DOI: 10.1093/bioinformatics/btp142] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Ever increasing amounts of biological interaction data are being accumulated worldwide, but they are currently not readily accessible to the biologist at a single site. New techniques are required for retrieving, sharing and presenting data spread over the Internet. RESULTS We introduce the DASMI system for the dynamic exchange, annotation and assessment of molecular interaction data. DASMI is based on the widely used Distributed Annotation System (DAS) and consists of a data exchange specification, web servers for providing the interaction data and clients for data integration and visualization. The decentralized architecture of DASMI affords the online retrieval of the most recent data from distributed sources and databases. DASMI can also be extended easily by adding new data sources and clients. We describe all DASMI components and demonstrate their use for protein and domain interactions. AVAILABILITY The DASMI tools are available at http://www.dasmi.de/ and http://ipfam.sanger.ac.uk/graph. The DAS registry and the DAS 1.53E specification is found at http://www.dasregistry.org/.
Collapse
Affiliation(s)
- Hagen Blankenburg
- Max Planck Institute for Informatics, Campus E 1.4, 66123 Saarbrücken, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Bacha J, Brodie JS, Loose MW. myGRN: a database and visualisation system for the storage and analysis of developmental genetic regulatory networks. BMC DEVELOPMENTAL BIOLOGY 2009; 9:33. [PMID: 19500400 PMCID: PMC2702357 DOI: 10.1186/1471-213x-9-33] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2009] [Accepted: 06/06/2009] [Indexed: 11/23/2022]
Abstract
Background Biological processes are regulated by complex interactions between transcription factors and signalling molecules, collectively described as Genetic Regulatory Networks (GRNs). The characterisation of these networks to reveal regulatory mechanisms is a long-term goal of many laboratories. However compiling, visualising and interacting with such networks is non-trivial. Current tools and databases typically focus on GRNs within simple, single celled organisms. However, data is available within the literature describing regulatory interactions in multi-cellular organisms, although not in any systematic form. This is particularly true within the field of developmental biology, where regulatory interactions should also be tagged with information about the time and anatomical location of development in which they occur. Description We have developed myGRN (), a web application for storing and interrogating interaction data, with an emphasis on developmental processes. Users can submit interaction and gene expression data, either curated from published sources or derived from their own unpublished data. All interactions associated with publications are publicly visible, and unpublished interactions can only be shared between collaborating labs prior to publication. Users can group interactions into discrete networks based on specific biological processes. Various filters allow dynamic production of network diagrams based on a range of information including tissue location, developmental stage or basic topology. Individual networks can be viewed using myGRV, a tool focused on displaying developmental networks, or exported in a range of formats compatible with third party tools. Networks can also be analysed for the presence of common network motifs. We demonstrate the capabilities of myGRN using a network of zebrafish interactions integrated with expression data from the zebrafish database, ZFIN. Conclusion Here we are launching myGRN as a community-based repository for interaction networks, with a specific focus on developmental networks. We plan to extend its functionality, as well as use it to study networks involved in embryonic development in the future.
Collapse
Affiliation(s)
- Jamil Bacha
- Institute of Genetics, University of Nottingham, Nottingham, UK.
| | | | | |
Collapse
|
42
|
Blankenburg H, Ramírez F, Büch J, Albrecht M. DASMIweb: online integration, analysis and assessment of distributed protein interaction data. Nucleic Acids Res 2009; 37:W122-8. [PMID: 19502495 PMCID: PMC2703953 DOI: 10.1093/nar/gkp438] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
In recent years, we have witnessed a substantial increase of the amount of available protein interaction data. However, most data are currently not readily accessible to the biologist at a single site, but scattered over multiple online repositories. Therefore, we have developed the DASMIweb server that affords the integration, analysis and qualitative assessment of distributed sources of interaction data in a dynamic fashion. Since DASMIweb allows for querying many different resources of protein and domain interactions simultaneously, it serves as an important starting point for interactome studies and assists the user in finding publicly accessible interaction data with minimal effort. The pool of queried resources is fully configurable and supports the inclusion of own interaction data or confidence scores. In particular, DASMIweb integrates confidence measures like functional similarity scores to assess individual interactions. The retrieved results can be exported in different file formats like MITAB or SIF. DASMIweb is freely available at http://www.dasmiweb.de.
Collapse
Affiliation(s)
- Hagen Blankenburg
- Max Planck Institute for Informatics, Campus E1.4, 66123 Saarbrücken, Germany.
| | | | | | | |
Collapse
|
43
|
Cusick ME, Yu H, Smolyar A, Venkatesan K, Carvunis AR, Simonis N, Rual JF, Borick H, Braun P, Dreze M, Vandenhaute J, Galli M, Yazaki J, Hill DE, Ecker JR, Roth FP, Vidal M. Literature-curated protein interaction datasets. Nat Methods 2009; 6:39-46. [PMID: 19116613 PMCID: PMC2683745 DOI: 10.1038/nmeth.1284] [Citation(s) in RCA: 234] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
High quality datasets are needed to understand how global and local properties of protein-protein interaction, or “interactome”, networks relate to biological mechanisms, and to guide research on individual proteins. Evaluations of existing curation of protein interaction experiments reported in the literature find that curation can be error prone and possibly of lower quality than commonly assumed.
Collapse
Affiliation(s)
- Michael E Cusick
- Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, Massachusetts 02115, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Orchard S, Albar JP, Deutsch EW, Binz PA, Jones AR, Creasy D, Hermjakob H. Annual spring meeting of the Proteomics Standards Initiative 23-25 April 2008, Toledo, Spain. Proteomics 2009; 8:4168-72. [PMID: 18814335 DOI: 10.1002/pmic.200800555] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The role of the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) is to produce and release community-accepted reporting requirements, interchange formats and controlled vocabularies for mass spectrometry proteomics and related technologies such as gel electrophoresis, column chromatography and molecular interactions. A number of significant advances were made at this workshop, with the new MS standard, mzML, being finalised prior to release on 1(st) June 2008 and analysisXML, which will allow protein and peptide identifications and post-translational modifications to be captured, being prepared to enter the review process this summer. The accompanying controlled vocabularies are continuing to evolve and a number of standards papers are now being finalised prior to publication.
Collapse
Affiliation(s)
- Sandra Orchard
- EMBL Outstation - European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK.
| | | | | | | | | | | | | |
Collapse
|
45
|
Morsy M, Gouthu S, Orchard S, Thorneycroft D, Harper JF, Mittler R, Cushman JC. Charting plant interactomes: possibilities and challenges. TRENDS IN PLANT SCIENCE 2008; 13:183-91. [PMID: 18329319 DOI: 10.1016/j.tplants.2008.01.006] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2007] [Revised: 01/17/2008] [Accepted: 01/25/2008] [Indexed: 05/22/2023]
Abstract
Protein-protein interactions are essential for nearly all cellular processes. Therefore, an important goal of post-genomic research for defining gene function and understanding the function of macromolecular complexes involves creating 'interactome' maps from empirical or inferred datasets. Systematic efforts to conduct high-throughput surveys of protein-protein interactions in plants are needed to chart the complex and dynamic interaction networks that occur throughout plant development. However, no single approach can build a complete map of the interactome. Here, we review the utility and potential of various experimental approaches for creating large-scale protein-protein interaction maps in plants. Bioinformatics approaches for curating and assessing the confidence of these datasets through inter-species comparisons will be crucial in achieving a complete understanding of protein interaction networks in plants.
Collapse
Affiliation(s)
- Mustafa Morsy
- Department of Biochemistry and Molecular Biology, MS200, University of Nevada, Reno, NV 89557, USA
| | | | | | | | | | | | | |
Collapse
|
46
|
Orchard S, Martens L, Tasman J, Binz PA, Albar JP, Hermjakob H. 6th HUPO Annual World Congress – Proteomics Standards Initiative Workshop 6–10 October 2007, Seoul, Korea. Proteomics 2008; 8:1331-3. [DOI: 10.1002/pmic.200701086] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
47
|
Protein-protein interactions: analysis and prediction. MODERN GENOME ANNOTATION 2008. [PMCID: PMC7120725 DOI: 10.1007/978-3-211-75123-7_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Proteins represent the tools and appliances of the cell — they assemble into larger structural elements, catalyze the biochemical reactions of metabolism, transmit signals, move cargo across membrane boundaries and carry out many other tasks. For most of these functions proteins cannot act in isolation but require close cooperation with other proteins to accomplish their task. Often, this collaborative action implies physical interaction of the proteins involved. Accordingly, experimental detection, in silico prediction and computational analysis of protein-protein interactions (PPI) have attracted great attention in the quest for discovering functional links among proteins and deciphering the complex networks of the cell.
Collapse
|