Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Alfarano C, Andrade CE, Anthony K, Bahroos N, Bajec M, Bantoft K, Betel D, Bobechko B, Boutilier K, Burgess E, Buzadzija K, Cavero R, D'Abreo C, Donaldson I, Dorairajoo D, Dumontier MJ, Dumontier MR, Earles V, Farrall R, Feldman H, Garderman E, Gong Y, Gonzaga R, Grytsan V, Gryz E, Gu V, Haldorsen E, Halupa A, Haw R, Hrvojic A, Hurrell L, Isserlin R, Jack F, Juma F, Khan A, Kon T, Konopinsky S, Le V, Lee E, Ling S, Magidin M, Moniakis J, Montojo J, Moore S, Muskat B, Ng I, Paraiso JP, Parker B, Pintilie G, Pirone R, Salama JJ, Sgro S, Shan T, Shu Y, Siew J, Skinner D, Snyder K, Stasiuk R, Strumpf D, Tuekam B, Tao S, Wang Z, White M, Willis R, Wolting C, Wong S, Wrong A, Xin C, Yao R, Yates B, Zhang S, Zheng K, Pawson T, Ouellette BFF, Hogue CWV. The Biomolecular Interaction Network Database and related tools 2005 update. Nucleic Acids Res 2005;33:D418-24. [PMID: 15608229 PMCID: PMC540005 DOI: 10.1093/nar/gki051] [Citation(s) in RCA: 447] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Number

Cited by Other Article(s)

251

Winsor GL, Van Rossum T, Lo R, Khaira B, Whiteside MD, Hancock REW, Brinkman FSL. Pseudomonas Genome Database: facilitating user-friendly, comprehensive comparisons of microbial genomes. Nucleic Acids Res 2008;37:D483-8. [PMID: 18978025 PMCID: PMC2686508 DOI: 10.1093/nar/gkn861] [Citation(s) in RCA: 197] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

252

Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, Bork P, von Mering C. STRING 8--a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res 2008;37:D412-6. [PMID: 18940858 PMCID: PMC2686466 DOI: 10.1093/nar/gkn760] [Citation(s) in RCA: 1909] [Impact Index Per Article: 112.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

253

Aychek T, Miller K, Sagi-Assif O, Levy-Nissenbaum O, Israeli-Amit M, Pasmanik-Chor M, Jacob-Hirsch J, Amariglio N, Rechavi G, Witz IP. E-selectin regulates gene expression in metastatic colorectal carcinoma cells and enhances HMGB1 release. Int J Cancer 2008;123:1741-50. [DOI: 10.1002/ijc.23375] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

254

Gao X, Jin C, Ren J, Yao X, Xue Y. Proteome-wide prediction of PKA phosphorylation sites in eukaryotic kingdom. Genomics 2008;92:457-63. [PMID: 18817865 DOI: 10.1016/j.ygeno.2008.08.013] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2008] [Revised: 08/25/2008] [Accepted: 08/27/2008] [Indexed: 01/27/2023]

255

The interactome: predicting the protein-protein interactions in cells. Cell Mol Biol Lett 2008;14:1-22. [PMID: 18839074 PMCID: PMC6275871 DOI: 10.2478/s11658-008-0024-7] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2008] [Accepted: 04/03/2008] [Indexed: 12/03/2022] Open

256

Pan W. Network-based model weighting to detect multiple loci influencing complex diseases. Hum Genet 2008;124:225-34. [PMID: 18719944 PMCID: PMC3341661 DOI: 10.1007/s00439-008-0545-1] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2008] [Accepted: 08/12/2008] [Indexed: 01/20/2023]

257

Ullah H, Scappini EL, Moon AF, Williams LV, Armstrong DL, Pedersen LC. Structure of a signal transduction regulator, RACK1, from Arabidopsis thaliana. Protein Sci 2008;17:1771-80. [PMID: 18715992 PMCID: PMC2548356 DOI: 10.1110/ps.035121.108] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2008] [Revised: 06/20/2008] [Accepted: 06/25/2008] [Indexed: 01/09/2023]

258

Razick S, Magklaras G, Donaldson IM. iRefIndex: a consolidated protein interaction database with provenance. BMC Bioinformatics 2008;9:405. [PMID: 18823568 PMCID: PMC2573892 DOI: 10.1186/1471-2105-9-405] [Citation(s) in RCA: 420] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2008] [Accepted: 09/30/2008] [Indexed: 01/05/2023] Open

259

Guan Y, Myers CL, Lu R, Lemischka IR, Bult CJ, Troyanskaya OG. A genomewide functional network for the laboratory mouse. PLoS Comput Biol 2008;4:e1000165. [PMID: 18818725 PMCID: PMC2527685 DOI: 10.1371/journal.pcbi.1000165] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2008] [Accepted: 07/21/2008] [Indexed: 11/19/2022] Open

Abstract

Establishing a functional network is invaluable to our understanding of gene function, pathways, and systems-level properties of an organism and can be a powerful resource in directing targeted experiments. In this study, we present a functional network for the laboratory mouse based on a Bayesian integration of diverse genetic and functional genomic data. The resulting network includes probabilistic functional linkages among 20,581 protein-coding genes. We show that this network can accurately predict novel functional assignments and network components and present experimental evidence for predictions related to Nanog homeobox (Nanog), a critical gene in mouse embryonic stem cell pluripotency. An analysis of the global topology of the mouse functional network reveals multiple biologically relevant systems-level features of the mouse proteome. Specifically, we identify the clustering coefficient as a critical characteristic of central modulators that affect diverse pathways as well as genes associated with different phenotype traits and diseases. In addition, a cross-species comparison of functional interactomes on a genomic scale revealed distinct functional characteristics of conserved neighborhoods as compared to subnetworks specific to higher organisms. Thus, our global functional network for the laboratory mouse provides the community with a key resource for discovering protein functions and novel pathway components as well as a tool for exploring systems-level topological and evolutionary features of cellular interactomes. To facilitate exploration of this network by the biomedical research community, we illustrate its application in function and disease gene discovery through an interactive, Web-based, publicly available interface at http://mouseNET.princeton.edu.

Functionally related proteins interact in diverse ways to carry out biological processes, and each protein often participates in multiple pathways. Proteins are therefore organized into a complex network through which different functions of the cell are carried out. An accurate description of such a network is invaluable to our understanding of both the system-level features of a cell and those of an individual biological process. In this study, we used a probabilistic model to combine information from diverse genome-scale studies as well as individual investigations to generate a global functional network for mouse. Our analysis of the global topology of this network reveals biologically relevant systems-level characteristics of the mouse proteome, including conservation of functional neighborhoods and network features characteristic of known disease genes and key transcriptional regulators. We have made this network publicly available for search and dynamic exploration by researchers in the community. Our Web interface enables users to easily generate hypotheses regarding potential functional roles of uncharacterized proteins, investigate possible links between their proteins of interest and disease, and identify new players in specific biological processes.

260

Bodén M, Teasdale RD. Determining nucleolar association from sequence by leveraging protein-protein interactions. J Comput Biol 2008;15:291-304. [PMID: 18333760 DOI: 10.1089/cmb.2007.0163] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

261

Hsing M, Byler KG, Cherkasov A. The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks. BMC SYSTEMS BIOLOGY 2008;2:80. [PMID: 18796161 PMCID: PMC2553323 DOI: 10.1186/1752-0509-2-80] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2008] [Accepted: 09/16/2008] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Protein-protein interactions mediate a wide range of cellular functions and responses and have been studied rigorously through recent large-scale proteomics experiments and bioinformatics analyses. One of the most important findings of those endeavours was the observation that 'hub' proteins participate in significant numbers of protein interactions and play critical roles in the organization and function of cellular protein interaction networks (PINs). It has also been demonstrated that such hub proteins may constitute an important pool of attractive drug targets. Thus, it is crucial to be able to identify hub proteins based not only on experimental data but also by means of bioinformatics predictions.

RESULTS

A hub protein classifier has been developed based on the available interaction data and Gene Ontology (GO) annotations for proteins in the Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens genomes. In particular, by utilizing the machine learning method of boosting trees we were able to create a predictive bioinformatics tool for the identification of proteins that are likely to play the role of a hub in protein interaction networks. Testing the developed hub classifier on external sets of experimental protein interaction data in Methicillin-resistant Staphylococcus aureus (MRSA) 252 and Caenorhabditis elegans demonstrated that our approach can predict hub proteins with a high degree of accuracy. A practical application of the developed bioinformatics method has been illustrated by the effective protein bait selection for large-scale pull-down experiments that aim to map complete protein-protein interaction networks for several species.

CONCLUSION

The successful development of an accurate hub classifier demonstrated that highly-connected proteins tend to share certain relevant functional properties reflected in their Gene Ontology annotations. It is anticipated that the developed bioinformatics hub classifier will represent a useful tool for the theoretical prediction of highly-interacting proteins, the study of cellular network organizations, and the identification of prospective drug targets - even in those organisms that currently lack large-scale protein interaction data.

262

Hofmann-Apitius M, Fluck J, Furlong L, Fornes O, Kolárik C, Hanser S, Boeker M, Schulz S, Sanz F, Klinger R, Mevissen T, Gattermayer T, Oliva B, Friedrich CM. Knowledge environments representing molecular entities for the virtual physiological human. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2008;366:3091-3110. [PMID: 18559317 DOI: 10.1098/rsta.2008.0099] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

263

Rajagopala SV, Goll J, Gowda NDD, Sunil KC, Titz B, Mukherjee A, Mary SS, Raviswaran N, Poojari CS, Ramachandra S, Shtivelband S, Blazie SM, Hofmann J, Uetz P. MPI-LIT: a literature-curated dataset of microbial binary protein--protein interactions. Bioinformatics 2008;24:2622-7. [PMID: 18786976 DOI: 10.1093/bioinformatics/btn481] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

264

Lynn DJ, Winsor GL, Chan C, Richard N, Laird MR, Barsky A, Gardy JL, Roche FM, Chan THW, Shah N, Lo R, Naseer M, Que J, Yau M, Acab M, Tulpan D, Whiteside MD, Chikatamarla A, Mah B, Munzner T, Hokamp K, Hancock REW, Brinkman FSL. InnateDB: facilitating systems-level analyses of the mammalian innate immune response. Mol Syst Biol 2008;4:218. [PMID: 18766178 PMCID: PMC2564732 DOI: 10.1038/msb.2008.55] [Citation(s) in RCA: 287] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2008] [Accepted: 07/17/2008] [Indexed: 01/31/2023] Open

265

Identifying components of complexes. Methods Mol Biol 2008. [PMID: 18712308 DOI: 10.1007/978-1-60327-429-6_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

266

Bergholdt R, Størling ZM, Lage K, Karlberg EO, Olason PI, Aalund M, Nerup J, Brunak S, Workman CT, Pociot F. Integrative analysis for finding genes and networks involved in diabetes and other complex diseases. Genome Biol 2008;8:R253. [PMID: 18045462 PMCID: PMC2258178 DOI: 10.1186/gb-2007-8-11-r253] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2007] [Revised: 10/31/2007] [Accepted: 11/28/2007] [Indexed: 01/17/2023] Open

267

Burgoon LD, Zacharewski TR. Bioinformatics: databasing and gene annotation. Methods Mol Biol 2008;460:145-57. [PMID: 18449486 DOI: 10.1007/978-1-60327-048-9_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]

268

Pattin KA, Moore JH. Exploiting the proteome to improve the genome-wide genetic analysis of epistasis in common human diseases. Hum Genet 2008;124:19-29. [PMID: 18551320 PMCID: PMC2780579 DOI: 10.1007/s00439-008-0522-8] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2008] [Accepted: 05/26/2008] [Indexed: 11/24/2022]

269

Hormozdiari F, Berenbrink P, Pržulj N, Sahinalp SC. Not all scale-free networks are born equal: the role of the seed graph in PPI network evolution. PLoS Comput Biol 2008;3:e118. [PMID: 17616981 PMCID: PMC1913096 DOI: 10.1371/journal.pcbi.0030118] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2006] [Accepted: 05/10/2007] [Indexed: 11/18/2022] Open

270

Evlampiev K, Isambert H. Conservation and topology of protein interaction networks under duplication-divergence evolution. Proc Natl Acad Sci U S A 2008;105:9863-8. [PMID: 18632555 PMCID: PMC2481380 DOI: 10.1073/pnas.0804119105] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2007] [Indexed: 11/18/2022] Open

Abstract

Genomic duplication-divergence processes are the primary source of new protein functions and thereby contribute to the evolutionary expansion of functional molecular networks. Yet, it is still unclear to what extent such duplication-divergence processes also restrict by construction the emerging properties of molecular networks, regardless of any specific cellular functions. We address this question, here, focusing on the evolution of protein-protein interaction (PPI) networks. We solve a general duplication-divergence model, based on the statistically necessary deletions of protein-protein interactions arising from stochastic duplications at various genomic scales, from single-gene to whole-genome duplications. Major evolutionary scenarios are shown to depend on two global parameters only: (i) a protein conservation index (M), which controls the evolutionary history of PPI networks, and (ii) a distinct topology index (M') controlling their resulting structure. We then demonstrate that conserved, nondense networks, which are of prime biological relevance, are also necessarily scale-free by construction, irrespective of any evolutionary variations or fluctuations of the model parameters. It is shown to result from a fundamental linkage between individual protein conservation and network topology under general duplication-divergence evolution. By contrast, we find that conservation of network motifs with two or more proteins cannot be indefinitely preserved under general duplication-divergence evolution (independently from any network rewiring dynamics), in broad agreement with empirical evidence between phylogenetically distant species. All in all, these evolutionary constraints, inherent to duplication-divergence processes, appear to have largely controlled the overall topology and scale-dependent conservation of PPI networks, regardless of any specific biological function.

271

Mahdavi MA, Lin YH. Prediction of protein-protein interactions using protein signature profiling. GENOMICS PROTEOMICS & BIOINFORMATICS 2008;5:177-86. [PMID: 18267299 PMCID: PMC5963007 DOI: 10.1016/s1672-0229(08)60005-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]

272

Huttenhower C, Troyanskaya OG. Assessing the functional structure of genomic data. Bioinformatics 2008;24:i330-8. [PMID: 18586732 PMCID: PMC2718638 DOI: 10.1093/bioinformatics/btn160] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

273

Goll J, Rajagopala SV, Shiau SC, Wu H, Lamb BT, Uetz P. MPIDB: the microbial protein interaction database. Bioinformatics 2008;24:1743-4. [PMID: 18556668 PMCID: PMC2638870 DOI: 10.1093/bioinformatics/btn285] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

274

Bethard S, Lu Z, Martin JH, Hunter L. Semantic role labeling for protein transport predicates. BMC Bioinformatics 2008;9:277. [PMID: 18547432 PMCID: PMC2474622 DOI: 10.1186/1471-2105-9-277] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2008] [Accepted: 06/11/2008] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Automatic semantic role labeling (SRL) is a natural language processing (NLP) technique that maps sentences to semantic representations. This technique has been widely studied in recent years, but mostly with data in newswire domains. Here, we report on an SRL model for identifying the semantic roles of biomedical predicates describing protein transport in GeneRIFs - manually curated sentences focusing on gene functions. To avoid the computational cost of syntactic parsing, and because the boundaries of our protein transport roles often did not match up with syntactic phrase boundaries, we approached this problem with a word-chunking paradigm and trained support vector machine classifiers to classify words as being at the beginning, inside or outside of a protein transport role.
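The word-chunking setup described above can be pictured as a per-token labeling task. The sketch below is an illustration only, not the authors' code: the example sentence, the protein names, the span indices and the helper name `bio_labels` are all hypothetical. It shows how role spans would be turned into beginning/inside/outside labels, which are the targets a per-word classifier would then learn to predict.

```python
def bio_labels(tokens, spans):
    """Convert role spans into per-token B/I/O labels.

    spans: list of (start, end, role) with `end` exclusive (an assumed convention).
    """
    labels = ["O"] * len(tokens)  # every token starts outside any role
    for start, end, role in spans:
        labels[start] = f"B-{role}"  # first token of the role
        for i in range(start + 1, end):
            labels[i] = f"I-{role}"  # remaining tokens inside the role
    return labels


# Hypothetical sentence; the role names come from the abstract.
tokens = ["ProtA", "exports", "ProtB", "from", "the", "nucleus",
          "to", "the", "cytoplasm"]
spans = [(0, 1, "AGENT"), (2, 3, "PATIENT"),
         (4, 6, "ORIGIN"), (7, 9, "DESTINATION")]

labels = bio_labels(tokens, spans)
for token, label in zip(tokens, labels):
    print(f"{token}\t{label}")
```

Because each word receives its own label, role boundaries do not need to coincide with syntactic phrase boundaries, which is the motivation the abstract gives for choosing this paradigm.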

RESULTS

We collected a set of 837 GeneRIFs describing movements of proteins between cellular components, whose predicates were annotated for the semantic roles AGENT, PATIENT, ORIGIN and DESTINATION. We trained these models with the features of previous word-chunking models, features adapted from phrase-chunking models, and features derived from an analysis of our data. Our models were able to label protein transport semantic roles with 87.6% precision and 79.0% recall when using manually annotated protein boundaries, and 87.0% precision and 74.5% recall when using automatically identified ones.

CONCLUSION

We successfully adapted the word-chunking classification paradigm to semantic role labeling, applying it to a new domain with predicates completely absent from any previous studies. By combining the traditional word and phrasal role labeling features with biomedical features like protein boundaries and MEDPOST part of speech tags, we were able to address the challenges posed by the new domain data and subsequently build robust models that achieved F-measures as high as 83.1. This system for extracting protein transport information from GeneRIFs performs well even with proteins identified automatically, and is therefore more robust than the rule-based methods previously used to extract protein transport roles.
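As a quick arithmetic check, the reported precision and recall figures can be combined into an F-measure. The sketch below assumes the balanced F1 score (the harmonic mean of precision and recall); the abstract does not say which weighting or which configuration produced 83.1, so the function name and the `beta` parameter are illustrative choices rather than details from the paper.

```python
def f_measure(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall (F1 when beta == 1)."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Figures taken from the RESULTS paragraph, in percent.
manual = f_measure(87.6, 79.0)  # manually annotated protein boundaries
auto = f_measure(87.0, 74.5)    # automatically identified protein boundaries

print(round(manual, 1))  # 83.1
print(round(auto, 1))    # 80.3
```

Under that assumption, the figures for manually annotated boundaries reproduce the quoted 83.1, and the automatically identified boundaries give roughly 80.3.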

275

Franke L, de Kovel CG, Aulchenko YS, Trynka G, Zhernakova A, Hunt KA, Blauw HM, van den Berg LH, Ophoff R, Deloukas P, van Heel DA, Wijmenga C. Detection, imputation, and association analysis of small deletions and null alleles on oligonucleotide arrays. Am J Hum Genet 2008;82:1316-33. [PMID: 18519066 PMCID: PMC2427186 DOI: 10.1016/j.ajhg.2008.05.008] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2008] [Revised: 03/21/2008] [Accepted: 05/13/2008] [Indexed: 12/14/2022] Open

Affiliation(s)

Lude Franke: Complex Genetics Section, DBG-Department of Medical Genetics, University Medical Centre Utrecht, 3584 CG Utrecht, The Netherlands; Genetics Department, University Medical Centre Groningen and University of Groningen, 9700 RB Groningen, The Netherlands
Carolien G.F. de Kovel: Complex Genetics Section, DBG-Department of Medical Genetics, University Medical Centre Utrecht, 3584 CG Utrecht, The Netherlands
Yurii S. Aulchenko: Department of Epidemiology & Biostatistics, Erasmus MC Rotterdam, 3000 CA Rotterdam, The Netherlands
Gosia Trynka: Genetics Department, University Medical Centre Groningen and University of Groningen, 9700 RB Groningen, The Netherlands
Alexandra Zhernakova: Complex Genetics Section, DBG-Department of Medical Genetics, University Medical Centre Utrecht, 3584 CG Utrecht, The Netherlands
Karen A. Hunt: Institute of Cell and Molecular Science, Barts and The London School of Medicine and Dentistry, London, E1 2AT, UK
Hylke M. Blauw: Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
Leonard H. van den Berg: Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
Roel Ophoff: Complex Genetics Section, DBG-Department of Medical Genetics, University Medical Centre Utrecht, 3584 CG Utrecht, The Netherlands; Center for Neurobehavioral Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
Panagiotis Deloukas: Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
David A. van Heel: Institute of Cell and Molecular Science, Barts and The London School of Medicine and Dentistry, London, E1 2AT, UK
Cisca Wijmenga: Complex Genetics Section, DBG-Department of Medical Genetics, University Medical Centre Utrecht, 3584 CG Utrecht, The Netherlands; Genetics Department, University Medical Centre Groningen and University of Groningen, 9700 RB Groningen, The Netherlands

276

Aguilar D, Skrabanek L, Gross SS, Oliva B, Campagne F. Beyond tissueInfo: functional prediction using tissue expression profile similarity searches. Nucleic Acids Res 2008;36:3728-37. [PMID: 18483083 PMCID: PMC2441795 DOI: 10.1093/nar/gkn233] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

277

Willis RC, Hogue CWV. Searching, viewing, and visualizing data in the Biomolecular Interaction Network Database (BIND). CURRENT PROTOCOLS IN BIOINFORMATICS 2008;Chapter 8:8.9.1-8.9.30. [PMID: 18428770 DOI: 10.1002/0471250953.bi0809s12] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

278

Xue Y, Ren J, Gao X, Jin C, Wen L, Yao X. GPS 2.0, a tool to predict kinase-specific phosphorylation sites in hierarchy. Mol Cell Proteomics 2008;7:1598-608. [PMID: 18463090 DOI: 10.1074/mcp.m700574-mcp200] [Citation(s) in RCA: 536] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open

279

Thum KE, Shin MJ, Gutiérrez RA, Mukherjee I, Katari MS, Nero D, Shasha D, Coruzzi GM. An integrated genetic, genomic and systems approach defines gene networks regulated by the interaction of light and carbon signaling pathways in Arabidopsis. BMC SYSTEMS BIOLOGY 2008;2:31. [PMID: 18387196 PMCID: PMC2335094 DOI: 10.1186/1752-0509-2-31] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2007] [Accepted: 04/04/2008] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Light and carbon are two important interacting signals affecting plant growth and development. The mechanism(s) and/or genes involved in sensing and/or mediating the signaling pathways involving these interactions are unknown. This study integrates genetic, genomic and systems approaches to identify a genetically perturbed gene network that is regulated by the interaction of carbon and light signaling in Arabidopsis.

RESULTS

Carbon and light insensitive (cli) mutants were isolated. Microarray data from cli186 is analyzed to identify the genes, biological processes and gene networks affected by the integration of light and carbon pathways. Analysis of this data reveals 966 genes regulated by light and/or carbon signaling in wild-type. In cli186, 216 of these light/carbon regulated genes are misregulated in response to light and/or carbon treatments, where 78% are misregulated in response to light and carbon interactions. Analysis of the gene lists shows that genes in the biological processes "energy" and "metabolism" are over-represented among the 966 genes regulated by carbon and/or light in wild-type, and the 216 misregulated genes in cli186. To understand connections among carbon and/or light regulated genes in wild-type and the misregulated genes in cli186, the microarray data is interpreted in the context of metabolic and regulatory networks. The network created from the 966 light/carbon regulated genes in wild-type reveals that cli186 is affected in the light and/or carbon regulation of a network of 60 connected genes, including six transcription factors. One transcription factor, HAT22, appears to be a regulatory "hub" in the cli186 network as it shows regulatory connections linking a metabolic network of genes involved in "amino acid metabolism", "C-compound/carbohydrate metabolism" and "glycolysis/gluconeogenesis".

CONCLUSION

The global misregulation of gene networks controlled by light and carbon signaling in cli186 indicates that it represents one of the first Arabidopsis mutants isolated that is specifically disrupted in the integration of both carbon and light signals to control the regulation of metabolic, developmental and regulatory genes. The network analysis of misregulated genes suggests that CLI186 acts to integrate light and carbon signaling interactions and is a master regulator connecting the regulation of a host of downstream metabolic and regulatory processes.

280

Walking the interactome for prioritization of candidate disease genes. Am J Hum Genet 2008;82:949-58. [PMID: 18371930 DOI: 10.1016/j.ajhg.2008.02.013] [Citation(s) in RCA: 837] [Impact Index Per Article: 49.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2007] [Revised: 01/18/2008] [Accepted: 02/19/2008] [Indexed: 11/21/2022] Open

281

Ratushny V, Golemis EA. Resolving the network of cell signaling pathways using the evolving yeast two-hybrid system. Biotechniques 2008;44:655-62. [PMID: 18474041 PMCID: PMC2526548 DOI: 10.2144/000112797] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

282

Lara MF, Santos M, Ruiz S, Segrelles C, Moral M, Martínez-Cruz AB, Hernández P, Martínez-Palacio J, Lorz C, García-Escudero R, Paramio JM. p107 acts as a tumor suppressor in pRb-deficient epidermis. Mol Carcinog 2008;47:105-13. [PMID: 17932945 DOI: 10.1002/mc.20367] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

283

Aragues R, Sander C, Oliva B. Predicting cancer involvement of genes from heterogeneous data. BMC Bioinformatics 2008;9:172. [PMID: 18371197 PMCID: PMC2330045 DOI: 10.1186/1471-2105-9-172] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2007] [Accepted: 03/27/2008] [Indexed: 11/10/2022] Open

284

Marcatili P, Bussotti G, Tramontano A. The MoVIN server for the analysis of protein interaction networks. BMC Bioinformatics 2008;9 Suppl 2:S11. [PMID: 18387199 PMCID: PMC2323660 DOI: 10.1186/1471-2105-9-s2-s11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Abstract

BACKGROUND

Protein-protein interactions underlie most cellular processes and are crucial for many biotechnological applications. During the last few years the development of high-throughput technologies has produced several large-scale protein-protein interaction data sets for various organisms. It is important to develop tools for dissecting their content and for analysing the information they embed through data integration and computational methods.

RESULTS

Interactions can be mediated by the presence of specific features, such as motifs, surface patches and domains. The co-occurrence of these features on proteins interacting with the same protein can indicate mutually exclusive interactions and, therefore, can be used for inferring the involvement of the proteins in common biological processes. We present here a publicly available server that allows the user to investigate protein interaction data in light of other biological information, such as their sequences, presence of specific domains, process and component ontologies. The server can be effectively used to construct a high-confidence set of mutually exclusive interactions by identifying similar features in groups of proteins sharing a common interaction partner. As an example, we describe here the identification of common motifs, function, cellular localization and domains in different datasets of yeast interactions.

CONCLUSIONS

The server can be used to analyse user-supplied datasets. It also contains pre-processed data for four yeast protein-protein interaction datasets and the results of their statistical analysis. These show that the presence of common motifs in proteins interacting with the same partner is a valuable source of information: it can be used to investigate the properties of the interacting proteins, and it can be effectively integrated with other sources. As more experimental interaction data become available, this tool will become increasingly useful for building a more detailed picture of the interactome.
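The core idea in this abstract, looking for features shared by proteins that interact with the same partner, can be illustrated with a minimal Python sketch. This is not the MoVIN implementation: the function name `shared_features_by_partner`, the `min_partners` threshold, and the toy protein and feature labels are all assumptions made for illustration, and the statistical analysis mentioned in the abstract is not reproduced.

```python
from collections import defaultdict


def shared_features_by_partner(interactions, features, min_partners=2):
    """For each protein, list features carried by at least `min_partners`
    of its interaction partners.

    interactions: iterable of (protein_a, protein_b) pairs (undirected).
    features: dict mapping protein -> set of feature labels
              (e.g. motifs or domains; labels here are hypothetical).
    """
    # Build the undirected neighbour sets.
    partners = defaultdict(set)
    for a, b in interactions:
        partners[a].add(b)
        partners[b].add(a)

    result = {}
    for hub, neighbours in partners.items():
        # Map each feature to the partners of `hub` that carry it.
        carriers = defaultdict(set)
        for protein in neighbours:
            for feature in features.get(protein, ()):
                carriers[feature].add(protein)
        shared = {f: ps for f, ps in carriers.items() if len(ps) >= min_partners}
        if shared:
            result[hub] = shared
    return result


# Toy data, invented for illustration only.
toy_interactions = [("H", "A"), ("H", "B"), ("H", "C")]
toy_features = {"A": {"SH3"}, "B": {"SH3", "PDZ"}, "C": {"kinase"}}
candidates = shared_features_by_partner(toy_interactions, toy_features)
```

Partners of the same protein that share a feature (here `A` and `B`) are only candidates for mutually exclusive binding; a real analysis would still need to test whether the co-occurrence is more frequent than expected by chance.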


285

Aguilar D, Oliva B. Topological comparison of methods for predicting transcriptional cooperativity in yeast. BMC Genomics 2008;9:137. [PMID: 18366726 PMCID: PMC2315657 DOI: 10.1186/1471-2164-9-137] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2007] [Accepted: 03/25/2008] [Indexed: 11/10/2022] Open

286

Ramsey SA, Klemm SL, Zak DE, Kennedy KA, Thorsson V, Li B, Gilchrist M, Gold ES, Johnson CD, Litvak V, Navarro G, Roach JC, Rosenberger CM, Rust AG, Yudkovsky N, Aderem A, Shmulevich I. Uncovering a macrophage transcriptional program by integrating evidence from motif scanning and expression dynamics. PLoS Comput Biol 2008;4:e1000021. [PMID: 18369420 PMCID: PMC2265556 DOI: 10.1371/journal.pcbi.1000021] [Citation(s) in RCA: 132] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2007] [Accepted: 02/04/2008] [Indexed: 01/04/2023] Open

Abstract

Macrophages are versatile immune cells that can detect a variety of pathogen-associated molecular patterns through their Toll-like receptors (TLRs). In response to microbial challenge, the TLR-stimulated macrophage undergoes an activation program controlled by a dynamically inducible transcriptional regulatory network. Mapping a complex mammalian transcriptional network poses significant challenges and requires the integration of multiple experimental data types. In this work, we inferred a transcriptional network underlying TLR-stimulated murine macrophage activation. Microarray-based expression profiling and transcription factor binding site motif scanning were used to infer a network of associations between transcription factor genes and clusters of co-expressed target genes. The time-lagged correlation was used to analyze temporal expression data in order to identify potential causal influences in the network. A novel statistical test was developed to assess the significance of the time-lagged correlation. Several associations in the resulting inferred network were validated using targeted ChIP-on-chip experiments. The network incorporates known regulators and gives insight into the transcriptional control of macrophage activation. Our analysis identified a novel regulator (TGIF1) that may have a role in macrophage activation.

Macrophages play a vital role in host defense against infection by recognizing pathogens through pattern recognition receptors, such as the Toll-like receptors (TLRs), and mounting an immune response. Stimulation of TLRs initiates a complex transcriptional program in which induced transcription factor genes dynamically regulate downstream genes. Microarray-based transcriptional profiling has proved useful for mapping such transcriptional programs in simpler model organisms; however, mammalian systems present difficulties such as post-translational regulation of transcription factors, combinatorial gene regulation, and a paucity of available gene-knockout expression data. Additional evidence sources, such as DNA sequence-based identification of transcription factor binding sites, are needed. In this work, we computationally inferred a transcriptional network for TLR-stimulated murine macrophages. Our approach combined sequence scanning with time-course expression data in a probabilistic framework. Expression data were analyzed using the time-lagged correlation. A novel, unbiased method was developed to assess the significance of the time-lagged correlation. The inferred network of associations between transcription factor genes and co-expressed gene clusters was validated with targeted ChIP-on-chip experiments, and yielded insights into the macrophage activation program, including a potential novel regulator. Our general approach could be used to analyze other complex mammalian systems for which time-course expression data are available.
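As a rough illustration of the time-lagged correlation mentioned in the abstract, the following Python sketch correlates a transcription factor's expression at time t with a target's expression at time t + lag. It assumes evenly spaced time points and equal-length series, which may not match the study's sampling scheme; the function names are hypothetical, and the novel significance test described by the authors is not reproduced here.

```python
from math import sqrt


def pearson(x, y):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


def time_lagged_correlation(tf, target, lag):
    """Correlate tf[t] with target[t + lag] (lag in sampling steps, lag >= 0)."""
    if len(tf) != len(target):
        raise ValueError("series must have the same length")
    if lag < 0 or len(tf) - lag < 2:
        raise ValueError("lag leaves fewer than two overlapping points")
    return pearson(tf[: len(tf) - lag], target[lag:])


def best_lag(tf, target, max_lag):
    """Return (lag, correlation) with the largest absolute correlation."""
    scores = {lag: time_lagged_correlation(tf, target, lag)
              for lag in range(max_lag + 1)}
    lag = max(scores, key=lambda k: abs(scores[k]))
    return lag, scores[lag]


# Toy series, invented for illustration: the target repeats the
# transcription factor profile two sampling steps later.
tf_profile = [0.0, 1.0, 4.0, 9.0, 3.0, 2.0, 7.0, 5.0, 1.0, 0.0]
target_profile = [0.0, 0.0] + tf_profile[:-2]
lag, r = best_lag(tf_profile, target_profile, max_lag=3)
```

A high correlation at a positive lag is only suggestive of a causal influence; with few time points many gene pairs will correlate strongly by chance, which is why a dedicated significance test is needed.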


287

Linking entries in protein interaction database to structured text: the FEBS Letters experiment. FEBS Lett 2008;582:1171-7. [PMID: 18328820 DOI: 10.1016/j.febslet.2008.02.071] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

288

Brayer KJ, Kulshreshtha S, Segal DJ. The protein-binding potential of C2H2 zinc finger domains. Cell Biochem Biophys 2008;51:9-19. [PMID: 18286240 DOI: 10.1007/s12013-008-9007-6] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2007] [Accepted: 12/28/2007] [Indexed: 12/22/2022]

289

Hancock AM, Witonsky DB, Gordon AS, Eshel G, Pritchard JK, Coop G, Di Rienzo A. Adaptations to climate in candidate genes for common metabolic disorders. PLoS Genet 2008;4:e32. [PMID: 18282109 PMCID: PMC2242814 DOI: 10.1371/journal.pgen.0040032] [Citation(s) in RCA: 201] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2007] [Accepted: 12/26/2007] [Indexed: 12/25/2022] Open

290

Lee I, Lehner B, Crombie C, Wong W, Fraser AG, Marcotte EM. A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans. Nat Genet 2008;40:181-8. [PMID: 18223650 DOI: 10.1038/ng.2007.70] [Citation(s) in RCA: 224] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2007] [Accepted: 11/06/2007] [Indexed: 11/09/2022]

291

Carter P, Lee D, Orengo C. Chapter 1. Target selection in structural genomics projects to increase knowledge of protein structure and function space. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2008;75:1-52. [PMID: 20731988 DOI: 10.1016/s0065-3233(07)75001-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

292

Yao L, Rzhetsky A. Quantitative systems-level determinants of human genes targeted by successful drugs. Genome Res 2007;18:206-13. [PMID: 18083776 DOI: 10.1101/gr.6888208] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

293

Tsui IFL, Chari R, Buys TPH, Lam WL. Public databases and software for the pathway analysis of cancer genomes. Cancer Inform 2007;3:379-97. [PMID: 19455256 PMCID: PMC2410087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

294

Betel D, Breitkreuz KE, Isserlin R, Dewar-Darch D, Tyers M, Hogue CWV. Structure-templated predictions of novel protein interactions from sequence information. PLoS Comput Biol 2007;3:1783-9. [PMID: 17892321 PMCID: PMC1988853 DOI: 10.1371/journal.pcbi.0030182] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2007] [Accepted: 08/02/2007] [Indexed: 11/18/2022] Open

Abstract

The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information.

Many functions performed within a living cell are mediated by specific interactions between proteins. Precise geometric and chemical matches between segments of the protein structures facilitate those interactions. Such binding surfaces are often evolutionarily conserved elements of protein structures known as conserved domains that recognize specific binding elements on the interacting proteins. Binding domains and their corresponding interacting profiles constitute basic interacting modules that are replicated in multiple protein pairs, where they mediate similar interactions. Although many conserved domains are identified, only a handful have known, well-characterized binding elements. This paper describes a computational method that aims to elucidate the binding specificity of many domains. The utility of the derived binding specificity is demonstrated by predicting new interactions between yeast proteins. The predictions are based solely on sequence information by identifying the conserved domains and their corresponding binding sequences. A number of the predicted interactions were confirmed experimentally, demonstrating the feasibility of this approach.


295

Genes and (common) pathways underlying drug addiction. PLoS Comput Biol 2007;4:e2. [PMID: 18179280 PMCID: PMC2174978 DOI: 10.1371/journal.pcbi.0040002] [Citation(s) in RCA: 148] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2007] [Accepted: 11/19/2007] [Indexed: 11/19/2022] Open

296

Sanz R, Aragüés R, Stresing V, Martín B, Landemaine T, Oliva B, Driouch K, Lidereau R, Sierra A. Functional pathways shared by liver and lung metastases: a mitochondrial chaperone machine is up-regulated in soft-tissue breast cancer metastasis. Clin Exp Metastasis 2007;24:673-83. [DOI: 10.1007/s10585-007-9124-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2007] [Accepted: 10/12/2007] [Indexed: 12/19/2022]

297

Evlampiev K, Isambert H. Modeling protein network evolution under genome duplication and domain shuffling. BMC SYSTEMS BIOLOGY 2007;1:49. [PMID: 17999763 PMCID: PMC2245809 DOI: 10.1186/1752-0509-1-49] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2007] [Accepted: 11/13/2007] [Indexed: 12/26/2022]

Abstract

BACKGROUND

Successive whole genome duplications have recently been firmly established in all major eukaryote kingdoms. Such exponential evolutionary processes must have contributed substantially to shaping the topology of protein-protein interaction (PPI) networks by outweighing, in particular, all time-linear network growths modeled so far.

RESULTS

We propose and solve a mathematical model of PPI network evolution under successive genome duplications. This demonstrates, from first principles, that evolutionary conservation and scale-free topology are intrinsically linked properties of PPI networks and emerge from i) prevailing exponential network dynamics under duplication and ii) asymmetric divergence of gene duplicates. While required, we argue that this asymmetric divergence arises, in fact, spontaneously at the level of protein-binding sites. This supports a refined model of PPI network evolution in terms of protein domains under exponential and asymmetric duplication/divergence dynamics, with multidomain proteins underlying the combinatorial formation of protein complexes. Genome duplication then provides a powerful source of PPI network innovation by promoting local rearrangements of multidomain proteins on a genome wide scale. Yet, we show that the overall conservation and topology of PPI networks are robust to extensive domain shuffling of multidomain proteins as well as to finer details of protein interaction and evolution. Finally, large scale features of direct and indirect PPI networks of S. cerevisiae are well reproduced numerically with only two adjusted parameters of clear biological significance (i.e. network effective growth rate and average number of protein-binding domains per protein).

CONCLUSION

This study demonstrates the statistical consequences of genome duplication and domain shuffling on the conservation and topology of PPI networks over a broad evolutionary scale across eukaryote kingdoms. In particular, scale-free topologies of PPI networks, which are found to be robust to extensive shuffling of protein domains, appear to be a simple consequence of the conservation of protein-binding domains under asymmetric duplication/divergence dynamics in the course of evolution.
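The duplication/divergence dynamics described in this abstract can be pictured with a toy simulation in Python. This is a simplified sketch, not the authors' mathematical model: each round duplicates every protein, and each copy of an interaction is retained with a probability that depends on whether it joins two old copies, one old and one new copy, or two new copies. The parameter names `keep_oo`, `keep_on`, and `keep_nn` and the starting network are assumptions chosen for illustration.

```python
import random


def duplicate_genome(edges, n_nodes, keep_oo, keep_on, keep_nn, rng):
    """One whole-genome duplication: node i gains a new copy i + n_nodes.

    Each original edge (u, v) yields four candidate edges, each kept
    independently with a probability that depends on the copies involved.
    Assumes the input has no self-interactions.
    """
    new_edges = set()
    for u, v in edges:
        candidates = [
            ((u, v), keep_oo),                      # old-old
            ((u, v + n_nodes), keep_on),            # old-new
            ((u + n_nodes, v), keep_on),            # new-old
            ((u + n_nodes, v + n_nodes), keep_nn),  # new-new
        ]
        for (a, b), p in candidates:
            if rng.random() < p:
                new_edges.add((min(a, b), max(a, b)))
    return new_edges, 2 * n_nodes


def simulate(edges, n_nodes, rounds, keep_oo, keep_on, keep_nn, seed=0):
    """Apply `rounds` successive duplications; returns (edges, n_nodes)."""
    rng = random.Random(seed)
    edges = {(min(a, b), max(a, b)) for a, b in edges}
    for _ in range(rounds):
        edges, n_nodes = duplicate_genome(
            edges, n_nodes, keep_oo, keep_on, keep_nn, rng)
    return edges, n_nodes


def degrees(edges, n_nodes):
    """Number of interaction partners for each node."""
    deg = [0] * n_nodes
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return deg


# Hypothetical starting network: a triangle of three proteins.
triangle = {(0, 1), (1, 2), (0, 2)}
# Asymmetric divergence: old copies keep their links, new copies lose many.
net, size = simulate(triangle, 3, rounds=6,
                     keep_oo=1.0, keep_on=0.4, keep_nn=0.1, seed=1)
degree_list = degrees(net, size)
```

The node count doubles every round, so growth is exponential in the number of duplications, and choosing `keep_oo` larger than `keep_nn` is one simple way to express asymmetric divergence between gene duplicates. Inspecting `degree_list` for different parameter choices gives a feel for how retention probabilities shape the degree distribution, without claiming to reproduce the paper's results.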


298

Fehrmann RSN, Li XY, van der Zee AGJ, de Jong S, Te Meerman GJ, de Vries EGE, Crijns APG. Profiling studies in ovarian cancer: a review. Oncologist 2007;12:960-6. [PMID: 17766655 DOI: 10.1634/theoncologist.12-8-960] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Abstract

Ovarian cancer is a heterogeneous disease with respect to histopathology, molecular biology, and clinical outcome. In advanced stages, surgery and chemotherapy result in an approximately 25% overall 5-year survival rate, pointing to a strong need to identify subgroups of patients that may benefit from targeted innovative molecular therapy. This review summarizes: (a) microarray research identifying gene-expression profiles in ovarian cancer; (b) the methodological flaws in the available microarray studies; and (c) applications of pathway analysis to define new molecular subgroups. Microarray technology now permits the analysis of expression levels of thousands of genes. So far seven studies have aimed to identify a genetic profile that can predict survival/clinical outcome and/or response to platinum-based therapy. To date, the clinical evidence of prognostic microarray studies has only reached the level of small retrospective studies, and there are other issues that may explain the nonreproducibility among the reported prognostic profiles, such as overfitting, technical platform differences, and accuracy of measurements. We consider pathway analysis a promising new strategy. The accumulation of small differential expressions within a meaningful molecular regulatory network might lead to a critical threshold level, resulting in ovarian cancer. Microarray technologies have already provided valuable expression data for classifying ovarian cancer and the first clues about which molecular changes in ovarian cancer could be exploited in new treatment strategies. Further improvements in technology as well as in study design, combined with pathway analysis, will allow us to detect even more subtle tumor expression differences among subgroups of ovarian cancer patients. Disclosure of potential conflicts of interest is found at the end of this article.


299

Chuang HY, Lee E, Liu YT, Lee D, Ideker T. Network-based classification of breast cancer metastasis. Mol Syst Biol 2007;3:140. [PMID: 17940530 PMCID: PMC2063581 DOI: 10.1038/msb4100180] [Citation(s) in RCA: 1019] [Impact Index Per Article: 56.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2007] [Accepted: 08/20/2007] [Indexed: 12/19/2022] Open

300

Kerrien S, Orchard S, Montecchi-Palazzi L, Aranda B, Quinn AF, Vinod N, Bader GD, Xenarios I, Wojcik J, Sherman D, Tyers M, Salama JJ, Moore S, Ceol A, Chatr-Aryamontri A, Oesterheld M, Stümpflen V, Salwinski L, Nerothin J, Cerami E, Cusick ME, Vidal M, Gilson M, Armstrong J, Woollard P, Hogue C, Eisenberg D, Cesareni G, Apweiler R, Hermjakob H. Broadening the horizon--level 2.5 of the HUPO-PSI format for molecular interactions. BMC Biol 2007;5:44. [PMID: 17925023 PMCID: PMC2189715 DOI: 10.1186/1741-7007-5-44] [Citation(s) in RCA: 205] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2007] [Accepted: 10/09/2007] [Indexed: 11/15/2022] Open

Abstract

Background

Molecular interaction information is a key resource in modern biomedical research. Publicly available data have previously been provided in a broad array of diverse formats, making access to them very difficult. The publication and wide implementation of the Human Proteome Organisation Proteomics Standards Initiative Molecular Interactions (HUPO PSI-MI) format in 2004 was a major step towards the establishment of a single, unified format by which molecular interactions should be presented, but it focused purely on protein-protein interactions.

Results

The HUPO-PSI has further developed the PSI-MI XML schema to enable the description of interactions between a wider range of molecular types, for example nucleic acids, chemical entities, and molecular complexes. Extensive details about each supported molecular interaction can now be captured, including the biological role of each molecule within that interaction, detailed description of interacting domains, and the kinetic parameters of the interaction. The format is supported by data management and analysis tools and has been adopted by major interaction data providers. Additionally, a simpler, tab-delimited format MITAB2.5 has been developed for the benefit of users who require only minimal information in an easy to access configuration.

Conclusion

The PSI-MI XML2.5 and MITAB2.5 formats have been jointly developed by interaction data producers and providers from both the academic and commercial sector, and are already widely implemented and well supported by an active development community. PSI-MI XML2.5 enables the description of highly detailed molecular interaction data and facilitates data exchange between databases and users without loss of information. MITAB2.5 is a simpler format appropriate for fast Perl parsing or loading into Microsoft Excel.
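Because MITAB2.5 is tab-delimited, it can be read with a few lines of code in most languages. The following Python sketch (the abstract mentions Perl; Python is used here only for illustration) assumes the standard 15-column MITAB2.5 layout, with `|` separating multiple values in a cell and `-` marking an empty cell. The dictionary keys are descriptive labels chosen for this example, not official field names, and the sample record is invented placeholder data.

```python
import csv
import io

# Descriptive labels for the 15 MITAB2.5 columns (labels are this sketch's own).
MITAB25_COLUMNS = [
    "id_a", "id_b", "alt_ids_a", "alt_ids_b", "aliases_a", "aliases_b",
    "detection_methods", "first_authors", "publication_ids",
    "taxid_a", "taxid_b", "interaction_types", "source_databases",
    "interaction_ids", "confidence_scores",
]


def parse_mitab25(handle):
    """Yield one dict per interaction; every value is a list of strings."""
    # QUOTE_NONE keeps double quotes inside cells as literal characters.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and a commented header line
        if len(row) < len(MITAB25_COLUMNS):
            raise ValueError(f"expected 15 columns, got {len(row)}")
        # Naive split: assumes '|' never occurs inside a quoted value.
        yield {
            name: ([] if cell == "-" else cell.split("|"))
            for name, cell in zip(MITAB25_COLUMNS, row)
        }


# Invented placeholder record, for illustration only.
SAMPLE = "\t".join([
    "uniprotkb:P00001", "uniprotkb:P00002", "-", "-", "-", "-",
    "example-method", "Doe J (2000)", "pubmed:0000000",
    "taxid:4932", "taxid:4932", "example-type", "example-db",
    "example:1|example:2", "-",
]) + "\n"

records = list(parse_mitab25(io.StringIO(SAMPLE)))
```

Keeping every cell as a list, even when it holds a single value, means downstream code does not have to distinguish between single-valued and multi-valued columns.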
