Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Liu Y, Wei X, Chen W, Hu L, He Z. A graph-traversal approach to identify influential nodes in a network. PATTERNS 2021;2:100321. [PMID: 34553168 PMCID: PMC8441579 DOI: 10.1016/j.patter.2021.100321] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 05/16/2021] [Accepted: 07/07/2021] [Indexed: 11/19/2022]

Abstract

Influential node identification plays a significant role in understanding network structure and functions. Here we propose a general method for detecting influential nodes in a graph-traversal framework. We evaluate the influence of each node by constructing a breadth-first search (BFS) tree in which the target node is the root node. From the BFS tree, we generate a curve in which the x axis is the level number and the y axis is the cumulative scores of all nodes visited so far. We use the area under the curve value as the final influence score of the target node. Experimental results on various networks across different domains demonstrate that our method can be significantly superior to widely used centrality measures on the task of influential node detection.

•

We propose an influential node detection method, TARank, in a graph-traversal framework

•

We evaluate the influence of each node by constructing a breadth-first search tree

•

TARank is capable of enhancing existing centrality measures

•

TARank can yield new, yet effective, centrality measures as well

The discovery of influential nodes is a fundamental research issue in network science. To quantify the influence of each node in a network, various methods have been presented in the literature. To the best of our knowledge, no previous research efforts address the influential node identification problem from a graph-traversal perspective. To fulfill this void, we propose the TARank method that integrates the information collected from the breadth-first search tree to identify influential nodes. The formulation under the graph-traversal framework opens the door to a fundamentally new type of method of influential node identification. In the future, more effective recognition methods can be expected to be constructed based on this general framework. Since empirical studies have validated the effectiveness of TARank, it would be plausible to employ this method in different applications to reveal new findings.

Collapse

Walch P, Selkrig J, Knodler LA, Rettel M, Stein F, Fernandez K, Viéitez C, Potel CM, Scholzen K, Geyer M, Rottner K, Steele-Mortimer O, Savitski MM, Holden DW, Typas A. Global mapping of Salmonella enterica-host protein-protein interactions during infection. Cell Host Microbe 2021;29:1316-1332.e12. [PMID: 34237247 PMCID: PMC8561747 DOI: 10.1016/j.chom.2021.06.004] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 02/24/2021] [Accepted: 05/21/2021] [Indexed: 11/16/2022]

Affiliation(s)

Philipp Walch European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany; Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
Joel Selkrig European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
Leigh A Knodler Paul G. Allen School for Global Health, College of Veterinary Medicine, Washington State University, Pullman, USA; Laboratory of Intracellular Parasites, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Hamilton, MT, USA
Mandy Rettel EMBL, Proteomics Core Facility, Heidelberg, Germany
Frank Stein EMBL, Proteomics Core Facility, Heidelberg, Germany
Keith Fernandez European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
Cristina Viéitez European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany; EMBL European Bioinformatics Institute, (EMBL-EBI), Hinxton, UK
Clément M Potel European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
Karoline Scholzen European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
Matthias Geyer Institute of Structural Biology, University of Bonn, Bonn, Germany
Klemens Rottner Division of Molecular Cell Biology, Zoological Institute, TU Braunschweig, Braunschweig, Germany; Molecular Cell Biology Group, Helmholtz Centre for Infection Research, Braunschweig, Germany
Olivia Steele-Mortimer Laboratory of Intracellular Parasites, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Hamilton, MT, USA
Mikhail M Savitski European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany; EMBL, Proteomics Core Facility, Heidelberg, Germany
David W Holden MRC Centre for Molecular Bacteriology and Infection, Imperial College, London, UK
Athanasios Typas European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany.

Collapse

He Z, Zhao C, Liang H, Xu B, Zou Q. Protein Complexes Identification with Family-Wise Error Rate Control. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:2062-2073. [PMID: 31027047 DOI: 10.1109/tcbb.2019.2912602] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Wu Z, Liao Q, Liu B. A comprehensive review and evaluation of computational methods for identifying protein complexes from protein–protein interaction networks. Brief Bioinform 2019;21:1531-1548. [DOI: 10.1093/bib/bbz085] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 06/17/2019] [Accepted: 06/17/2019] [Indexed: 02/04/2023] Open

Tian B, Duan Q, Zhao C, Teng B, He Z. Reinforce: An Ensemble Approach for Inferring PPI Network from AP-MS Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019;16:365-376. [PMID: 28534782 DOI: 10.1109/tcbb.2017.2705060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Su Y, Zhao C, Chen Z, Tian B, He Z. On the statistical significance of protein complex. QUANTITATIVE BIOLOGY 2018. [DOI: 10.1007/s40484-018-0153-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Alvarez-Ponce D. Recording negative results of protein-protein interaction assays: an easy way to deal with the biases and errors of interactomic data sets. Brief Bioinform 2018;18:1017-1020. [PMID: 27542401 DOI: 10.1093/bib/bbw075] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Indexed: 11/13/2022] Open

Tian B, Zhao C, Gu F, He Z. A two-step framework for inferring direct protein-protein interaction network from AP-MS data. BMC SYSTEMS BIOLOGY 2017;11:82. [PMID: 28950876 PMCID: PMC5615237 DOI: 10.1186/s12918-017-0452-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]

Ou-Yang L, Zhang XF, Dai DQ, Wu MY, Zhu Y, Liu Z, Yan H. Protein complex detection based on partially shared multi-view clustering. BMC Bioinformatics 2016;17:371. [PMID: 27623844 PMCID: PMC5022186 DOI: 10.1186/s12859-016-1164-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 07/23/2016] [Indexed: 01/05/2023] Open

Abstract

Background

Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection.

Results

In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques.

Conclusions

In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1164-9) contains supplementary material, which is available to authorized users.

Collapse

PIPINO: A Software Package to Facilitate the Identification of Protein-Protein Interactions from Affinity Purification Mass Spectrometry Data. BIOMED RESEARCH INTERNATIONAL 2016;2016:2891918. [PMID: 26966684 PMCID: PMC4761381 DOI: 10.1155/2016/2891918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Revised: 11/28/2015] [Accepted: 11/29/2015] [Indexed: 11/17/2022]

Ou-Yang L, Wu M, Zhang XF, Dai DQ, Li XL, Yan H. A two-layer integration framework for protein complex detection. BMC Bioinformatics 2016;17:100. [PMID: 26911324 PMCID: PMC4765032 DOI: 10.1186/s12859-016-0939-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 01/27/2016] [Indexed: 01/05/2023] Open

Abstract

Background

Protein complexes carry out nearly all signaling and functional processes within cells. The study of protein complexes is an effective strategy to analyze cellular functions and biological processes. With the increasing availability of proteomics data, various computational methods have recently been developed to predict protein complexes. However, different computational methods are based on their own assumptions and designed to work on different data sources, and various biological screening methods have their unique experiment conditions, and are often different in scale and noise level. Therefore, a single computational method on a specific data source is generally not able to generate comprehensive and reliable prediction results.

Results

In this paper, we develop a novel Two-layer INtegrative Complex Detection (TINCD) model to detect protein complexes, leveraging the information from both clustering results and raw data sources. In particular, we first integrate various clustering results to construct consensus matrices for proteins to measure their overall co-complex propensity. Second, we combine these consensus matrices with the co-complex score matrix derived from Tandem Affinity Purification/Mass Spectrometry (TAP) data and obtain an integrated co-complex similarity network via an unsupervised metric fusion method. Finally, a novel graph regularized doubly stochastic matrix decomposition model is proposed to detect overlapping protein complexes from the integrated similarity network.

Conclusions

Extensive experimental results demonstrate that TINCD performs much better than 21 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-0939-3) contains supplementary material, which is available to authorized users.

Collapse

Zhang XF, Ou-Yang L, Hu X, Dai DQ. Identifying binary protein-protein interactions from affinity purification mass spectrometry data. BMC Genomics 2015;16:745. [PMID: 26438428 PMCID: PMC4595009 DOI: 10.1186/s12864-015-1944-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Accepted: 09/22/2015] [Indexed: 02/04/2023] Open

Abstract

Background

The identification of protein-protein interactions contributes greatly to the understanding of functional organization within cells. With the development of affinity purification-mass spectrometry (AP-MS) techniques, several computational scoring methods have been proposed to detect protein interactions from AP-MS data. However, most of the current methods focus on the detection of co-complex interactions and do not discriminate between direct physical interactions and indirect interactions. Consequently, less is known about the precise physical wiring diagram within cells.

Results

In this paper, we develop a Binary Interaction Network Model (BINM) to computationally identify direct physical interactions from co-complex interactions which can be inferred from purification data using previous scoring methods. This model provides a mathematical framework for capturing topological relationships between direct physical interactions and observed co-complex interactions. It reassigns a confidence score to each observed interaction to indicate its propensity to be a direct physical interaction. Then observed interactions with high confidence scores are predicted as direct physical interactions. We run our model on two yeast co-complex interaction networks which are constructed by two different scoring methods on a same combined AP-MS data. The direct physical interactions identified by various methods are comprehensively benchmarked against different reference sets that provide both direct and indirect evidence for physical contacts. Experiment results show that our model has a competitive performance over the state-of-the-art methods.

Conclusions

According to the results obtained in this study, BINM is a powerful scoring method that can solely use network topology to predict direct physical interactions from AP-MS data. This study provides us an alternative approach to explore the information inherent in AP-MS data. The software can be downloaded from https://github.com/Zhangxf-ccnu/BINM.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1944-z) contains supplementary material, which is available to authorized users.

Collapse