Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Murali TM, Wu CJ, Kasif S. The art of gene function prediction. Nat Biotechnol 2007;24:1474-5; author reply 1475-6. [PMID: 17160037 DOI: 10.1038/nbt1206-1474] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Number

Cited by Other Article(s)

Zhu YH, Zhang C, Liu Y, Omenn GS, Freddolino PL, Yu DJ, Zhang Y. TripletGO: Integrating Transcript Expression Profiles with Protein Homology Inferences for Gene Function Prediction. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022;20:1013-1027. [PMID: 35568117 PMCID: PMC10025770 DOI: 10.1016/j.gpb.2022.03.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 03/02/2022] [Accepted: 04/16/2022] [Indexed: 01/13/2023]

Law JN, Akers K, Tasnina N, Santina CMD, Deutsch S, Kshirsagar M, Klein-Seetharaman J, Crovella M, Rajagopalan P, Kasif S, Murali TM. Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-2. Gigascience 2021;10:giab082. [PMID: 34966926 PMCID: PMC8716363 DOI: 10.1093/gigascience/giab082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 09/21/2021] [Accepted: 11/28/2021] [Indexed: 01/02/2023] Open

We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data. PLoS Biol 2020;18:e3000999. [PMID: 33253151 PMCID: PMC7728211 DOI: 10.1371/journal.pbio.3000999] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 12/10/2020] [Indexed: 01/18/2023] Open

Pournoor E, Mousavian Z, Dalini AN, Masoudi-Nejad A. Identification of Key Components in Colon Adenocarcinoma Using Transcriptome to Interactome Multilayer Framework. Sci Rep 2020;10:4991. [PMID: 32193399 PMCID: PMC7081269 DOI: 10.1038/s41598-020-59605-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Accepted: 01/31/2020] [Indexed: 12/21/2022] Open

Havugimana PC, Hu P, Emili A. Protein complexes, big data, machine learning and integrative proteomics: lessons learned over a decade of systematic analysis of protein interaction networks. Expert Rev Proteomics 2017;14:845-855. [PMID: 28918672 DOI: 10.1080/14789450.2017.1374179] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Li J, Li X, Zhu B. User opinion classification in social media: A global consistency maximization approach. INFORMATION & MANAGEMENT 2016. [DOI: 10.1016/j.im.2016.06.004] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Panwar B, Menon R, Eksi R, Li HD, Omenn GS, Guan Y. Genome-Wide Functional Annotation of Human Protein-Coding Splice Variants Using Multiple Instance Learning. J Proteome Res 2016;15:1747-53. [PMID: 27142340 DOI: 10.1021/acs.jproteome.5b00883] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Pushing the annotation of cellular activities to a higher resolution: Predicting functions at the isoform level. Methods 2015;93:110-8. [PMID: 26238263 DOI: 10.1016/j.ymeth.2015.07.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 07/20/2015] [Accepted: 07/29/2015] [Indexed: 12/23/2022] Open

Frasca M, Bassis S, Valentini G. Learning node labels with multi-category Hopfield networks. Neural Comput Appl 2015. [DOI: 10.1007/s00521-015-1965-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Wang S, Cho H, Zhai C, Berger B, Peng J. Exploiting ontology graph for predicting sparsely annotated gene function. Bioinformatics 2015;31:i357-64. [PMID: 26072504 PMCID: PMC4542782 DOI: 10.1093/bioinformatics/btv260] [Citation(s) in RCA: 74] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open

Sedaghat N, Saegusa T, Randolph T, Shojaie A. Comparative study of computational methods for reconstructing genetic networks of cancer-related pathways. Cancer Inform 2014;13:55-66. [PMID: 25288880 PMCID: PMC4179645 DOI: 10.4137/cin.s13781] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Revised: 05/08/2014] [Accepted: 05/10/2014] [Indexed: 12/16/2022] Open

Mulder NJ, Akinola RO, Mazandu GK, Rapanoel H. Using biological networks to improve our understanding of infectious diseases. Comput Struct Biotechnol J 2014;11:1-10. [PMID: 25379138 PMCID: PMC4212278 DOI: 10.1016/j.csbj.2014.08.006] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Yu D, Kim M, Xiao G, Hwang TH. Review of biological network data and its applications. Genomics Inform 2013;11:200-10. [PMID: 24465231 PMCID: PMC3897847 DOI: 10.5808/gi.2013.11.4.200] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Revised: 11/20/2013] [Accepted: 11/21/2013] [Indexed: 12/16/2022] Open

Anton BP, Chang YC, Brown P, Choi HP, Faller LL, Guleria J, Hu Z, Klitgord N, Levy-Moonshine A, Maksad A, Mazumdar V, McGettrick M, Osmani L, Pokrzywa R, Rachlin J, Swaminathan R, Allen B, Housman G, Monahan C, Rochussen K, Tao K, Bhagwat AS, Brenner SE, Columbus L, de Crécy-Lagard V, Ferguson D, Fomenkov A, Gadda G, Morgan RD, Osterman AL, Rodionov DA, Rodionova IA, Rudd KE, Söll D, Spain J, Xu SY, Bateman A, Blumenthal RM, Bollinger JM, Chang WS, Ferrer M, Friedberg I, Galperin MY, Gobeill J, Haft D, Hunt J, Karp P, Klimke W, Krebs C, Macelis D, Madupu R, Martin MJ, Miller JH, O'Donovan C, Palsson B, Ruch P, Setterdahl A, Sutton G, Tate J, Yakunin A, Tchigvintsev D, Plata G, Hu J, Greiner R, Horn D, Sjölander K, Salzberg SL, Vitkup D, Letovsky S, Segrè D, DeLisi C, Roberts RJ, Steffen M, Kasif S. The COMBREX project: design, methodology, and initial results. PLoS Biol 2013;11:e1001638. [PMID: 24013487 PMCID: PMC3754883 DOI: 10.1371/journal.pbio.1001638] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Affiliation(s)

Brian P. Anton New England Biolabs, Ipswich, Massachusetts, United States of America * E-mail: (BPA); (SK)
Yi-Chien Chang Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Peter Brown Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Han-Pil Choi Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Lina L. Faller Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Jyotsna Guleria Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Zhenjun Hu Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Niels Klitgord Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Ami Levy-Moonshine Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Almaz Maksad Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Varun Mazumdar Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Mark McGettrick Diatom Software LLC, Holliston, Massachusetts, United States of America
Lais Osmani Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Revonda Pokrzywa Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
John Rachlin Diatom Software LLC, Holliston, Massachusetts, United States of America
Rajeswari Swaminathan Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Benjamin Allen Program for Evolutionary Dynamics, Harvard University, Cambridge, Massachusetts, United States of America Department of Mathematics, Emmanuel College, Boston, Massachusetts, United States of America
Genevieve Housman Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Caitlin Monahan Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Krista Rochussen Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Kevin Tao Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Ashok S. Bhagwat Department of Chemistry, Wayne State University, Detroit, Michigan, United States of America
Steven E. Brenner Department of Plant and Microbial Biology, University of California, Berkeley, California, United States of America
Linda Columbus Department of Chemistry, University of Virginia, Charlottesville, Virginia, United States of America
Valérie de Crécy-Lagard Department of Microbiology and Cell Science, University of Florida, Gainesville, Florida, United States of America
Donald Ferguson Department of Microbiology, Miami University, Oxford, Ohio, United States of America
Alexey Fomenkov New England Biolabs, Ipswich, Massachusetts, United States of America
Giovanni Gadda Department of Chemistry, Georgia State University, Atlanta, Georgia, United States of America
Richard D. Morgan New England Biolabs, Ipswich, Massachusetts, United States of America
Andrei L. Osterman Bioinformatics and Systems Biology, Sanford Burnham Medical Research Institute, La Jolla, California, United States of America
Dmitry A. Rodionov Bioinformatics and Systems Biology, Sanford Burnham Medical Research Institute, La Jolla, California, United States of America
Irina A. Rodionova Bioinformatics and Systems Biology, Sanford Burnham Medical Research Institute, La Jolla, California, United States of America
Kenneth E. Rudd Department of Biochemistry and Molecular Biology, University of Miami, Miami, Florida, United States of America
Dieter Söll Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
James Spain School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
Shuang-yong Xu New England Biolabs, Ipswich, Massachusetts, United States of America
Alex Bateman European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom
Robert M. Blumenthal Department of Medical Microbiology and Immunology, and Program in Bioinformatics, University of Toledo, Toledo, Ohio, United States of America
J. Martin Bollinger Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
Woo-Suk Chang Department of Biology, University of Texas-Arlington, Arlington, Texas, United States of America
Manuel Ferrer Spanish National Research Council (CSIC), Institute of Catalysis, Madrid, Spain
Iddo Friedberg Department of Microbiology, Miami University, Oxford, Ohio, United States of America
Michael Y. Galperin National Center for Biotechnology Information (NCBI), National Institutes of Health (NIH), Bethesda, Maryland, United States of America
Julien Gobeill Department of Library and Information Sciences, University of Applied Sciences Western Switzerland, Geneva, Switzerland Bibliomics and Text Mining Group, Swiss Institute of Bioinformatics, Geneva, Switzerland
Daniel Haft J. Craig Venter Institute, Rockville, Maryland, United States of America
John Hunt Biological Sciences, Columbia University, New York, New York, United States of America
Peter Karp Bioinformatics Research Group, Artificial Intelligence Center, SRI International, Menlo Park, California, United States of America
William Klimke National Center for Biotechnology Information (NCBI), National Institutes of Health (NIH), Bethesda, Maryland, United States of America
Carsten Krebs Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
Dana Macelis New England Biolabs, Ipswich, Massachusetts, United States of America
Ramana Madupu J. Craig Venter Institute, Rockville, Maryland, United States of America
Maria J. Martin European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom
Jeffrey H. Miller Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
Claire O'Donovan European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom
Bernhard Palsson Department of Bioengineering, University of California, San Diego, La Jolla, California, United States of America
Patrick Ruch Department of Library and Information Sciences, University of Applied Sciences Western Switzerland, Geneva, Switzerland Bibliomics and Text Mining Group, Swiss Institute of Bioinformatics, Geneva, Switzerland
Aaron Setterdahl Department of Chemistry, Indiana University Southeast, New Albany, Indiana, United States of America
Granger Sutton J. Craig Venter Institute, Rockville, Maryland, United States of America
John Tate Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom
Alexander Yakunin Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
Dmitri Tchigvintsev Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
Germán Plata Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America Integrated Program in Cellular, Molecular, Structural, and Genetic Studies, Columbia University, New York, New York, United States of America
Jie Hu Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
Russell Greiner Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
David Horn School of Physics and Astronomy, Tel Aviv University, Tel Aviv, Israel
Kimmen Sjölander Berkeley Phylogenomics Group, University of California, Berkeley, California, United States of America
Steven L. Salzberg Departments of Medicine and Biostatistics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
Dennis Vitkup Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
Stanley Letovsky Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Daniel Segrè Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Charles DeLisi Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Richard J. Roberts New England Biolabs, Ipswich, Massachusetts, United States of America Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
Martin Steffen Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
Simon Kasif Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America * E-mail: (BPA); (SK)

Collapse

Frasca M, Bertoni A, Re M, Valentini G. A neural network algorithm for semi-supervised node label learning from unbalanced data. Neural Netw 2013;43:84-98. [DOI: 10.1016/j.neunet.2013.01.021] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2012] [Revised: 01/28/2013] [Accepted: 01/29/2013] [Indexed: 01/03/2023]

Zhang J, Li L, Peng L, Sun Y, Li J. An efficient weighted graph strategy to identify differentiation associated genes in embryonic stem cells. PLoS One 2013;8:e62716. [PMID: 23638139 PMCID: PMC3637163 DOI: 10.1371/journal.pone.0062716] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Accepted: 03/25/2013] [Indexed: 11/18/2022] Open

Hu P, Jiang H, Emili A. Incorporating Correlations among Gene Ontology Terms into Predicting Protein Functions. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open

Chua HN, Wong L. Predicting Protein Functions from Protein Interaction Networks. ACTA ACUST UNITED AC 2012. [DOI: 10.4018/ijkdb.2012100104] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Chu LH, Rivera CG, Popel AS, Bader JS. Constructing the angiome: a global angiogenesis protein interaction network. Physiol Genomics 2012;44:915-24. [PMID: 22911453 DOI: 10.1152/physiolgenomics.00181.2011] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

A Resource of Quantitative Functional Annotation for Homo sapiens Genes. G3-GENES GENOMES GENETICS 2012;2:223-33. [PMID: 22384401 PMCID: PMC3284330 DOI: 10.1534/g3.111.000828] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2011] [Accepted: 11/23/2011] [Indexed: 01/31/2023]

WANG JINGYAN, LI YONGPING. SEQUENTIAL LINEAR NEIGHBORHOOD PROPAGATION FOR SEMI-SUPERVISED PROTEIN FUNCTION PREDICTION. J Bioinform Comput Biol 2012;9:663-79. [DOI: 10.1142/s0219720011005550] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2010] [Revised: 10/14/2010] [Accepted: 03/21/2011] [Indexed: 11/18/2022]

Mazandu GK, Mulder NJ. Using the underlying biological organization of the Mycobacterium tuberculosis functional network for protein function prediction. INFECTION GENETICS AND EVOLUTION 2011;12:922-32. [PMID: 22085822 DOI: 10.1016/j.meegid.2011.10.027] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2011] [Revised: 10/25/2011] [Accepted: 10/28/2011] [Indexed: 10/15/2022]

Abstract

Despite ever-increasing amounts of sequence and functional genomics data, there is still a deficiency of functional annotation for many newly sequenced proteins. For Mycobacterium tuberculosis (MTB), more than half of its genome is still uncharacterized, which hampers the search for new drug targets within the bacterial pathogen and limits our understanding of its pathogenicity. As for many other genomes, the annotations of proteins in the MTB proteome were generally inferred from sequence homology, which is effective but its applicability has limitations. We have carried out large-scale biological data integration to produce an MTB protein functional interaction network. Protein functional relationships were extracted from the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database, and additional functional interactions from microarray, sequence and protein signature data. The confidence level of protein relationships in the additional functional interaction data was evaluated using a dynamic data-driven scoring system. This functional network has been used to predict functions of uncharacterized proteins using Gene Ontology (GO) terms, and the semantic similarity between these terms measured using a state-of-the-art GO similarity metric. To achieve better trade-off between improvement of quality, genomic coverage and scalability, this prediction is done by observing the key principles driving the biological organization of the functional network. This study yields a new functionally characterized MTB strain CDC1551 proteome, consisting of 3804 and 3698 proteins out of 4195 with annotations in terms of the biological process and molecular function ontologies, respectively. These data can contribute to research into the Development of effective anti-tubercular drugs with novel biological mechanisms of action.

Collapse

Murali TM, Dyer MD, Badger D, Tyler BM, Katze MG. Network-based prediction and analysis of HIV dependency factors. PLoS Comput Biol 2011;7:e1002164. [PMID: 21966263 PMCID: PMC3178628 DOI: 10.1371/journal.pcbi.1002164] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2010] [Accepted: 06/30/2011] [Indexed: 01/27/2023] Open

Abstract

HIV Dependency Factors (HDFs) are a class of human proteins that are essential for HIV replication, but are not lethal to the host cell when silenced. Three previous genome-wide RNAi experiments identified HDF sets with little overlap. We combine data from these three studies with a human protein interaction network to predict new HDFs, using an intuitive algorithm called SinkSource and four other algorithms published in the literature. Our algorithm achieves high precision and recall upon cross validation, as do the other methods. A number of HDFs that we predict are known to interact with HIV proteins. They belong to multiple protein complexes and biological processes that are known to be manipulated by HIV. We also demonstrate that many predicted HDF genes show significantly different programs of expression in early response to SIV infection in two non-human primate species that differ in AIDS progression. Our results suggest that many HDFs are yet to be discovered and that they have potential value as prognostic markers to determine pathological outcome and the likelihood of AIDS development. More generally, if multiple genome-wide gene-level studies have been performed at independent labs to study the same biological system or phenomenon, our methodology is applicable to interpret these studies simultaneously in the context of molecular interaction networks and to ask if they reinforce or contradict each other.

Medicines to cure infectious diseases usually target proteins in the pathogens. Since pathogens have short life cycles, the targeted proteins can rapidly evolve and make the medicines ineffective, especially in viruses such as HIV. However, since viruses have very small genomes, they must exploit the cellular machinery of the host to propagate. Therefore, disrupting the activity of selected host proteins may impede viruses. Three recent experiments have discovered hundreds of such proteins in human cells that HIV depends upon. Surprisingly, these three sets have very little overlap. In this work, we demonstrate that this discrepancy can be explained by considering physical interactions between the human proteins in these studies. Moreover, we exploit these interactions to predict new dependency factors for HIV. Our predictions show very significant overlaps with human proteins that are known to interact with HIV proteins and with human cellular processes that are known to be subverted by the virus. Most importantly, we show that proteins predicted by us may play a prominent role in affecting HIV-related disease progression in lymph nodes. Therefore, our predictions constitute a powerful resource for experimentalists who desire to discover new human proteins that can control the spread of HIV.

Collapse

Rivera CG, Mellberg S, Claesson-Welsh L, Bader JS, Popel AS. Analysis of VEGF--a regulated gene expression in endothelial cells to identify genes linked to angiogenesis. PLoS One 2011;6:e24887. [PMID: 21931866 PMCID: PMC3172305 DOI: 10.1371/journal.pone.0024887] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2011] [Accepted: 08/23/2011] [Indexed: 02/06/2023] Open

Mazandu GK, Mulder NJ. Scoring protein relationships in functional interaction networks predicted from sequence data. PLoS One 2011;6:e18607. [PMID: 21526183 PMCID: PMC3079720 DOI: 10.1371/journal.pone.0018607] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2010] [Accepted: 03/07/2011] [Indexed: 11/21/2022] Open

Smoot M, Ono K, Ideker T, Maere S. PiNGO: a Cytoscape plugin to find candidate genes in biological networks. Bioinformatics 2011;27:1030-1. [PMID: 21278188 DOI: 10.1093/bioinformatics/btr045] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open

Mostafavi S, Goldenberg A, Morris Q. Predicting node characteristics from molecular networks. Methods Mol Biol 2011;781:399-414. [PMID: 21877293 DOI: 10.1007/978-1-61779-276-2_20] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2022]

Heo HS, Oh SJ, Kim JM, Kim HS, Chung HY. TREP_DB: transcriptional regulatory elements pattern database. Biochem Biophys Res Commun 2010;394:309-316. [PMID: 20206134 DOI: 10.1016/j.bbrc.2010.02.169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2010] [Accepted: 02/26/2010] [Indexed: 05/28/2023]

Bradford JR, Needham CJ, Tedder P, Care MA, Bulpitt AJ, Westhead DR. GO-At: in silico prediction of gene function in Arabidopsis thaliana by combining heterogeneous data. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2010;61:713-721. [PMID: 19947983 DOI: 10.1111/j.1365-313x.2009.04097.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]

Hu P, Jiang H, Emili A. Predicting protein functions by relaxation labelling protein interaction network. BMC Bioinformatics 2010;11 Suppl 1:S64. [PMID: 20122240 PMCID: PMC3009538 DOI: 10.1186/1471-2105-11-s1-s64] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Biochemical networks: the evolution of gene annotation. Nat Chem Biol 2010;6:4-5. [PMID: 20016491 DOI: 10.1038/nchembio.288] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Li X, Chen H, Li J, Zhang Z. Gene function prediction with gene interaction networks: a context graph kernel approach. ACTA ACUST UNITED AC 2009;14:119-28. [PMID: 19789115 DOI: 10.1109/titb.2009.2033116] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Hu P, Janga SC, Babu M, Díaz-Mejía JJ, Butland G, Yang W, Pogoutse O, Guo X, Phanse S, Wong P, Chandran S, Christopoulos C, Nazarians-Armavil A, Nasseri NK, Musso G, Ali M, Nazemof N, Eroukova V, Golshani A, Paccanaro A, Greenblatt JF, Moreno-Hagelsieb G, Emili A. Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins. PLoS Biol 2009;7:e96. [PMID: 19402753 PMCID: PMC2672614 DOI: 10.1371/journal.pbio.1000096] [Citation(s) in RCA: 268] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2008] [Accepted: 03/16/2009] [Indexed: 12/28/2022] Open

Affiliation(s)

Pingzhao Hu Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Sarath Chandra Janga Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
Mohan Babu Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
J. Javier Díaz-Mejía Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada Department of Biology, Wilfrid Laurier University, Waterloo, Ontario, Canada
Gareth Butland Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Wenhong Yang Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Oxana Pogoutse Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Xinghua Guo Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Sadhna Phanse Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Peter Wong Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Shamanta Chandran Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Constantine Christopoulos Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Anaies Nazarians-Armavil Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Negin Karimi Nasseri Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Gabriel Musso Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Mehrab Ali Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Nazila Nazemof Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Canada
Veronika Eroukova Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Canada
Ashkan Golshani Department of Biology and Ottawa Institute of Systems Biology, Carleton University, Ottawa, Canada
Alberto Paccanaro Department of Computer Science, Royal Holloway, University of London, Egham, United Kingdom
Jack F Greenblatt Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Gabriel Moreno-Hagelsieb Department of Biology, Wilfrid Laurier University, Waterloo, Ontario, Canada * To whom correspondence should be addressed. E-mail: (GM-H); (AE)
Andrew Emili Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada * To whom correspondence should be addressed. E-mail: (GM-H); (AE)

Collapse

Huttenhower C, Haley EM, Hibbs MA, Dumeaux V, Barrett DR, Coller HA, Troyanskaya OG. Exploring the human genome with functional maps. Genes Dev 2009;19:1093-106. [PMID: 19246570 PMCID: PMC2694471 DOI: 10.1101/gr.082214.108] [Citation(s) in RCA: 166] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2008] [Accepted: 02/09/2009] [Indexed: 11/24/2022]

Hess DC, Myers CL, Huttenhower C, Hibbs MA, Hayes AP, Paw J, Clore JJ, Mendoza RM, Luis BS, Nislow C, Giaever G, Costanzo M, Troyanskaya OG, Caudy AA. Computationally driven, quantitative experiments discover genes required for mitochondrial biogenesis. PLoS Genet 2009;5:e1000407. [PMID: 19300474 PMCID: PMC2648979 DOI: 10.1371/journal.pgen.1000407] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2008] [Accepted: 02/05/2009] [Indexed: 01/09/2023] Open

Abstract

Mitochondria are central to many cellular processes including respiration, ion homeostasis, and apoptosis. Using computational predictions combined with traditional quantitative experiments, we have identified 100 proteins whose deficiency alters mitochondrial biogenesis and inheritance in Saccharomyces cerevisiae. In addition, we used computational predictions to perform targeted double-mutant analysis detecting another nine genes with synthetic defects in mitochondrial biogenesis. This represents an increase of about 25% over previously known participants. Nearly half of these newly characterized proteins are conserved in mammals, including several orthologs known to be involved in human disease. Mutations in many of these genes demonstrate statistically significant mitochondrial transmission phenotypes more subtle than could be detected by traditional genetic screens or high-throughput techniques, and 47 have not been previously localized to mitochondria. We further characterized a subset of these genes using growth profiling and dual immunofluorescence, which identified genes specifically required for aerobic respiration and an uncharacterized cytoplasmic protein required for normal mitochondrial motility. Our results demonstrate that by leveraging computational analysis to direct quantitative experimental assays, we have characterized mutants with subtle mitochondrial defects whose phenotypes were undetected by high-throughput methods.

Mitochondria are the proverbial powerhouses of the cell, running the fundamental biochemical processes that produce energy from nutrients using oxygen. These processes are conserved in all eukaryotes, from humans to model organisms such as baker's yeast. In humans, mitochondrial dysfunction plays a role in a variety of diseases, including diabetes, neuromuscular disorders, and aging. In order to better understand fundamental mitochondrial biology, we studied genes involved in mitochondrial biogenesis in the yeast S. cerevisiae, discovering over 100 proteins with novel roles in this process. These experiments assigned function to 5% of the genes whose function was not known. In order to achieve this rapid rate of discovery, we developed a system incorporating highly quantitative experimental assays and an integrated, iterative process of computational protein function prediction. Beginning from relatively little prior knowledge, we found that computational predictions achieved about 60% accuracy and rapidly guided our laboratory work towards hundreds of promising candidate genes. Thus, in addition to providing a more thorough understanding of mitochondrial biology, this study establishes a framework for successfully integrating computation and experimentation to drive biological discovery. A companion manuscript, published in PLoS Computational Biology (doi:10.1371/journal.pcbi.1000322), discusses observations and conclusions important for the computational community.

Collapse

Affiliation(s)

David C. Hess Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
Chad L. Myers Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota, United States of America
Curtis Huttenhower Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
Matthew A. Hibbs Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
Alicia P. Hayes Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
Jadine Paw Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
John J. Clore Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
Rosa M. Mendoza Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
Bryan San Luis Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Corey Nislow Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Guri Giaever Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Michael Costanzo Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
Olga G. Troyanskaya Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America * E-mail: (OGT); (AAC)
Amy A. Caudy Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America * E-mail: (OGT); (AAC)

Collapse

Discovering biological networks from diverse functional genomic data. Methods Mol Biol 2009;563:157-75. [PMID: 19597785 DOI: 10.1007/978-1-60761-175-2_9] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Global alignment of multiple protein interaction networks with application to functional orthology detection. Proc Natl Acad Sci U S A 2008;105:12763-8. [PMID: 18725631 DOI: 10.1073/pnas.0806627105] [Citation(s) in RCA: 211] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Jiang X, Nariai N, Steffen M, Kasif S, Kolaczyk ED. Integration of relational and hierarchical network information for protein function prediction. BMC Bioinformatics 2008;9:350. [PMID: 18721473 PMCID: PMC2535605 DOI: 10.1186/1471-2105-9-350] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2008] [Accepted: 08/22/2008] [Indexed: 11/22/2022] Open

Abstract

Background

In the current climate of high-throughput computational biology, the inference of a protein's function from related measurements, such as protein-protein interaction relations, has become a canonical task. Most existing technologies pursue this task as a classification problem, on a term-by-term basis, for each term in a database, such as the Gene Ontology (GO) database, a popular rigorous vocabulary for biological functions. However, ontology structures are essentially hierarchies, with certain top to bottom annotation rules which protein function predictions should in principle follow. Currently, the most common approach to imposing these hierarchical constraints on network-based classifiers is through the use of transitive closure to predictions.

Results

We propose a probabilistic framework to integrate information in relational data, in the form of a protein-protein interaction network, and a hierarchically structured database of terms, in the form of the GO database, for the purpose of protein function prediction. At the heart of our framework is a factorization of local neighborhood information in the protein-protein interaction network across successive ancestral terms in the GO hierarchy. We introduce a classifier within this framework, with computationally efficient implementation, that produces GO-term predictions that naturally obey a hierarchical 'true-path' consistency from root to leaves, without the need for further post-processing.

Conclusion

A cross-validation study, using data from the yeast Saccharomyces cerevisiae, shows our method offers substantial improvements over both standard 'guilt-by-association' (i.e., Nearest-Neighbor) and more refined Markov random field methods, whether in their original form or when post-processed to artificially impose 'true-path' consistency. Further analysis of the results indicates that these improvements are associated with increased predictive capabilities (i.e., increased positive predictive value), and that this increase is consistent uniformly with GO-term depth. Additional in silico validation on a collection of new annotations recently added to GO confirms the advantages suggested by the cross-validation study. Taken as a whole, our results show that a hierarchical approach to network-based protein function prediction, that exploits the ontological structure of protein annotation databases in a principled manner, can offer substantial advantages over the successive application of 'flat' network-based methods.

Collapse

Aguilar-Ruiz JS, Moore JH, Ritchie MD. Filling the gap between biology and computer science. BioData Min 2008;1:1. [PMID: 18822148 PMCID: PMC2547862 DOI: 10.1186/1756-0381-1-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2008] [Accepted: 07/17/2008] [Indexed: 01/07/2023] Open

Peña-Castillo L, Tasan M, Myers CL, Lee H, Joshi T, Zhang C, Guan Y, Leone M, Pagnani A, Kim WK, Krumpelman C, Tian W, Obozinski G, Qi Y, Mostafavi S, Lin GN, Berriz GF, Gibbons FD, Lanckriet G, Qiu J, Grant C, Barutcuoglu Z, Hill DP, Warde-Farley D, Grouios C, Ray D, Blake JA, Deng M, Jordan MI, Noble WS, Morris Q, Klein-Seetharaman J, Bar-Joseph Z, Chen T, Sun F, Troyanskaya OG, Marcotte EM, Xu D, Hughes TR, Roth FP. A critical assessment of Mus musculus gene function prediction using integrated genomic evidence. Genome Biol 2008;9 Suppl 1:S2. [PMID: 18613946 PMCID: PMC2447536 DOI: 10.1186/gb-2008-9-s1-s2] [Citation(s) in RCA: 197] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Evidence-based annotation of the malaria parasite's genome using comparative expression profiling. PLoS One 2008;3:e1570. [PMID: 18270564 PMCID: PMC2215772 DOI: 10.1371/journal.pone.0001570] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2007] [Accepted: 01/09/2008] [Indexed: 11/19/2022] Open

Chua HN, Sung WK, Wong L. An efficient strategy for extensive integration of diverse biological data for protein function prediction. Bioinformatics 2007;23:3364-73. [DOI: 10.1093/bioinformatics/btm520] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open