1
|
Aref S, Mostajabdaveh M, Chheda H. Bayan algorithm: Detecting communities in networks through exact and approximate optimization of modularity. Phys Rev E 2024; 110:044315. [PMID: 39562863 DOI: 10.1103/physreve.110.044315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Accepted: 09/24/2024] [Indexed: 11/21/2024]
Abstract
Community detection is a classic network problem with extensive applications in various fields. Its most common method is using modularity maximization heuristics which rarely return an optimal partition or anything similar. Partitions with globally optimal modularity are difficult to compute, and therefore have been underexplored. Using structurally diverse networks, we compare 30 community detection methods including our proposed algorithm that offers optimality and approximation guarantees: the Bayan algorithm. Unlike existing methods, Bayan globally maximizes modularity or approximates it within a factor. Our results show the distinctive accuracy and stability of maximum-modularity partitions in retrieving planted partitions at rates higher than most alternatives for a wide range of parameter settings in two standard benchmarks. Compared to the partitions from 29 other algorithms, maximum-modularity partitions have the best medians for description length, coverage, performance, average conductance, and well clusteredness. These advantages come at the cost of additional computations which Bayan makes possible for small networks (networks that have up to 3000 edges in their largest connected component). Bayan is several times faster than using open-source and commercial solvers for modularity maximization, making it capable of finding optimal partitions for instances that cannot be optimized by any other existing method. Our results point to a few well-performing algorithms, among which Bayan stands out as the most reliable method for small networks. A python implementation of the Bayan algorithm (bayanpy) is publicly available through the package installer for python.
Collapse
|
2
|
Kajihara KT, Hynson NA. Networks as tools for defining emergent properties of microbiomes and their stability. MICROBIOME 2024; 12:184. [PMID: 39342398 PMCID: PMC11439251 DOI: 10.1186/s40168-024-01868-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 07/04/2024] [Indexed: 10/01/2024]
Abstract
The potential promise of the microbiome to ameliorate a wide range of societal and ecological challenges, from disease prevention and treatment to the restoration of entire ecosystems, hinges not only on microbiome engineering but also on the stability of beneficial microbiomes. Yet the properties of microbiome stability remain elusive and challenging to discern due to the complexity of interactions and often intractable diversity within these communities of bacteria, archaea, fungi, and other microeukaryotes. Networks are powerful tools for the study of complex microbiomes, with the potential to elucidate structural patterns of stable communities and generate testable hypotheses for experimental validation. However, the implementation of these analyses introduces a cascade of dichotomies and decision trees due to the lack of consensus on best practices. Here, we provide a road map for network-based microbiome studies with an emphasis on discerning properties of stability. We identify important considerations for data preparation, network construction, and interpretation of network properties. We also highlight remaining limitations and outstanding needs for this field. This review also serves to clarify the varying schools of thought on the application of network theory for microbiome studies and to identify practices that enhance the reproducibility and validity of future work. Video Abstract.
Collapse
Affiliation(s)
- Kacie T Kajihara
- Pacific Biosciences Research Center, University of Hawai'i at Mānoa, Honolulu, HI, 96822, USA.
| | - Nicole A Hynson
- Pacific Biosciences Research Center, University of Hawai'i at Mānoa, Honolulu, HI, 96822, USA
| |
Collapse
|
3
|
Peel L, Schaub MT. Detectability of hierarchical communities in networks. Phys Rev E 2024; 110:034306. [PMID: 39425354 DOI: 10.1103/physreve.110.034306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Accepted: 08/15/2024] [Indexed: 10/21/2024]
Abstract
We study the problem of recovering a planted hierarchy of partitions in a network. The detectability of a single planted partition has previously been analyzed in detail and a phase transition has been identified below which the partition cannot be detected. Here we show that, in the hierarchical setting, there exist additional phases in which the presence of multiple consistent partitions can either help or hinder detection. Accordingly, the detectability limit for nonhierarchical partitions typically provides insufficient information about the detectability of the complete hierarchical structure, as we highlight with several constructive examples.
Collapse
|
4
|
Gadár L, Abonyi J. Finding multifaceted communities in multiplex networks. Sci Rep 2024; 14:14521. [PMID: 38914589 PMCID: PMC11196740 DOI: 10.1038/s41598-024-65049-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 06/17/2024] [Indexed: 06/26/2024] Open
Abstract
Identifying communities in multilayer networks is crucial for understanding the structural dynamics of complex systems. Traditional community detection algorithms often overlook the presence of overlapping edges within communities, despite the potential significance of such relationships. In this work, we introduce a novel modularity measure designed to uncover communities where nodes share specific multiple facets of connectivity. Our approach leverages a null network, an empirical layer of the multiplex network, not a random network, that can be one of the network layers or a complement graph of that, depending on the objective. By analyzing real-world social networks, we validate the effectiveness of our method in identifying meaningful communities with overlapping edges. The proposed approach offers valuable insights into the structural dynamics of multiplex systems, shedding light on nodes that share similar multifaceted connections.
Collapse
Affiliation(s)
- László Gadár
- HUN-REN-PE Complex Systems Monitoring Research Group, University of Pannonia, Veszprém, Hungary.
| | - János Abonyi
- HUN-REN-PE Complex Systems Monitoring Research Group, University of Pannonia, Veszprém, Hungary
| |
Collapse
|
5
|
Gaiteri C, Connell DR, Sultan FA, Iatrou A, Ng B, Szymanski BK, Zhang A, Tasaki S. Robust, scalable, and informative clustering for diverse biological networks. Genome Biol 2023; 24:228. [PMID: 37828545 PMCID: PMC10571258 DOI: 10.1186/s13059-023-03062-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 09/19/2023] [Indexed: 10/14/2023] Open
Abstract
Clustering molecular data into informative groups is a primary step in extracting robust conclusions from big data. However, due to foundational issues in how they are defined and detected, such clusters are not always reliable, leading to unstable conclusions. We compare popular clustering algorithms across thousands of synthetic and real biological datasets, including a new consensus clustering algorithm-SpeakEasy2: Champagne. These tests identify trends in performance, show no single method is universally optimal, and allow us to examine factors behind variation in performance. Multiple metrics indicate SpeakEasy2 generally provides robust, scalable, and informative clusters for a range of applications.
Collapse
Affiliation(s)
- Chris Gaiteri
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA.
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA.
- Department of Neurological Sciences, Rush University Medical Center, Chicago, IL, USA.
| | - David R Connell
- Rush University Graduate College, Rush University Medical Center, Chicago, IL, USA
| | - Faraz A Sultan
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Artemis Iatrou
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
- Department of Psychiatry, McLean Hospital, Harvard Medical School, Harvard University, Belmont, MA, USA
| | - Bernard Ng
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Boleslaw K Szymanski
- Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, USA
- Network Science and Technology Center, Rensselaer Polytechnic Institute, Troy, NY, USA
- Academy of Social Sciences, Łódź, Poland
| | - Ada Zhang
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Shinya Tasaki
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
- Department of Neurological Sciences, Rush University Medical Center, Chicago, IL, USA
| |
Collapse
|
6
|
Peixoto TP, Kirkley A. Implicit models, latent compression, intrinsic biases, and cheap lunches in community detection. Phys Rev E 2023; 108:024309. [PMID: 37723811 DOI: 10.1103/physreve.108.024309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 08/02/2023] [Indexed: 09/20/2023]
Abstract
The task of community detection, which aims to partition a network into clusters of nodes to summarize its large-scale structure, has spawned the development of many competing algorithms with varying objectives. Some community detection methods are inferential, explicitly deriving the clustering objective through a probabilistic generative model, while other methods are descriptive, dividing a network according to an objective motivated by a particular application, making it challenging to compare these methods on the same scale. Here we present a solution to this problem that associates any community detection objective, inferential or descriptive, with its corresponding implicit network generative model. This allows us to compute the description length of a network and its partition under arbitrary objectives, providing a principled measure to compare the performance of different algorithms without the need for "ground-truth" labels. Our approach also gives access to instances of the community detection problem that are optimal to any given algorithm and in this way reveals intrinsic biases in popular descriptive methods, explaining their tendency to overfit. Using our framework, we compare a number of community detection methods on artificial networks and on a corpus of over 500 structurally diverse empirical networks. We find that more expressive community detection methods exhibit consistently superior compression performance on structured data instances, without having degraded performance on a minority of situations where more specialized algorithms perform optimally. Our results undermine the implications of the "no free lunch" theorem for community detection, both conceptually and in practice, since it is confined to unstructured data instances, unlike relevant community detection problems which are structured by requirement.
Collapse
Affiliation(s)
- Tiago P Peixoto
- Department of Network and Data Science, Central European University, 1100 Vienna, Austria
| | - Alec Kirkley
- Institute of Data Science, University of Hong Kong, Hong Kong; Department of Urban Planning and Design, University of Hong Kong, Hong Kong; and Urban Systems Institute, University of Hong Kong, Hong Kong
| |
Collapse
|
7
|
Garrels T, Khodabakhsh A, Renard BY, Baum K. LazyFox: fast and parallelized overlapping community detection in large graphs. PeerJ Comput Sci 2023; 9:e1291. [PMID: 37346513 PMCID: PMC10280410 DOI: 10.7717/peerj-cs.1291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 02/20/2023] [Indexed: 06/23/2023]
Abstract
The detection of communities in graph datasets provides insight about a graph's underlying structure and is an important tool for various domains such as social sciences, marketing, traffic forecast, and drug discovery. While most existing algorithms provide fast approaches for community detection, their results usually contain strictly separated communities. However, most datasets would semantically allow for or even require overlapping communities that can only be determined at much higher computational cost. We build on an efficient algorithm, Fox, that detects such overlapping communities. Fox measures the closeness of a node to a community by approximating the count of triangles which that node forms with that community. We propose LazyFox, a multi-threaded adaptation of the Fox algorithm, which provides even faster detection without an impact on community quality. This allows for the analyses of significantly larger and more complex datasets. LazyFox enables overlapping community detection on complex graph datasets with millions of nodes and billions of edges in days instead of weeks. As part of this work, LazyFox's implementation was published and is available as a tool under an MIT licence at https://github.com/TimGarrels/LazyFox.
Collapse
Affiliation(s)
- Tim Garrels
- Hasso Plattner Institute for Digital Engineering gGmbH, Potsdam, Germany
- Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
| | - Athar Khodabakhsh
- Hasso Plattner Institute for Digital Engineering gGmbH, Potsdam, Germany
| | - Bernhard Y. Renard
- Hasso Plattner Institute for Digital Engineering gGmbH, Potsdam, Germany
- Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
- Department of Mathematics and Computer Science, Free University Berlin, Berlin, Germany
| | - Katharina Baum
- Hasso Plattner Institute for Digital Engineering gGmbH, Potsdam, Germany
- Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
- Windreich Department of Artificial Intelligence and Human Health, Icahn School of Medicine at Mount Sinai, New York, USA
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, USA
| |
Collapse
|
8
|
Pathak A, Menon SN, Sinha S. Mesoscopic architecture enhances communication across the macaque connectome revealing structure-function correspondence in the brain. Phys Rev E 2022; 106:054304. [PMID: 36559437 DOI: 10.1103/physreve.106.054304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 09/13/2022] [Indexed: 06/17/2023]
Abstract
Analyzing the brain in terms of organizational structures at intermediate scales provides an approach to unravel the complexity arising from interactions between its large number of components. Focusing on a wiring diagram that spans the cortex, basal ganglia, and thalamus of the macaque brain, we identify robust modules in the network that provide a mesoscopic-level description of its topological architecture. Surprisingly, we find that the modular architecture facilitates rapid communication across the network, instead of localizing activity as is typically expected in networks having community organization. By considering processes of diffusive spreading and coordination, we demonstrate that the specific pattern of inter- and intramodular connectivity in the network allows propagation to be even faster than equivalent randomized networks with or without modular structure. This pattern of connectivity is seen at different scales and is conserved across principal cortical divisions, as well as subcortical structures. Furthermore, we find that the physical proximity between areas is insufficient to explain the modular organization, as similar mesoscopic structures can be obtained even after factoring out the effect of distance constraints on the connectivity. By supplementing the topological information about the macaque connectome with physical locations, volumes, and functions of the constituent areas and analyzing this augmented dataset, we reveal a counterintuitive role played by the modular architecture of the brain in promoting global coordination of its activity. It suggests a possible explanation for the ubiquity of modularity in brain networks.
Collapse
Affiliation(s)
- Anand Pathak
- The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600113, India
- Homi Bhabha National Institute, Anushaktinagar, Mumbai 400 094, India
| | - Shakti N Menon
- The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600113, India
| | - Sitabhra Sinha
- The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600113, India
- Homi Bhabha National Institute, Anushaktinagar, Mumbai 400 094, India
| |
Collapse
|
9
|
Bara J, Lev O, Turrini P. Predicting voting outcomes in the presence of communities, echo chambers and multiple parties. ARTIF INTELL 2022. [DOI: 10.1016/j.artint.2022.103773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
10
|
Roy UK, Muhuri PK, Biswas SK. NeSiFC: Neighbors' Similarity-Based Fuzzy Community Detection Using Modified Local Random Walk. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:10014-10026. [PMID: 34166209 DOI: 10.1109/tcyb.2021.3071542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
This article proposes a neighbors' similarity-based fuzzy community detection (FCD) method, which we call "NeSiFC." In the proposed NeSiFC approach, we compute the similarity between two neighbors by introducing a modified local random walk (mLRW). Basically, in a network, a node and its' neighbors with noticeable similarities among them construct a community. To measure this similarity, we introduce a new metric, called the peripheral similarity index (PSI). This PSI is used to construct the transition probability matrix for the mLRW. The mLRW is applied for each node until it meets a parameter called step coefficient. The mLRW gives better neighbors' similarity for community detection. Finally, a fuzzy membership function is used iteratively to compute the membership degrees for all nodes with reference to existing communities. The proposed NeSiFC has no dependence on the network characteristics, and no adjustment or fine tuning of more than one parameter is needed. To show the efficacy of the proposed NeSiFC approach, we provide a thorough comparative performance analysis considering a set of well-known FCD algorithms viz., the genetic algorithm for fuzzy community detection, membership degree propagation, center-based fuzzy graph clustering, FMM/H2, and FuzAg on a set of popular benchmarks, as well as real-world datasets. For both disjoint and overlapping community structures, results of various accuracy and quality metrics indicate the outstanding performance of our proposed NeSiFC approach. The asymptotic complexity of the proposed NeSiFC is found as O(n2).
Collapse
|
11
|
Jesan T, Sinha S. Modular organization of gene–tumor association network allows identification of key molecular players in cancer. J Biosci 2022. [PMID: 36222154 DOI: 10.1007/s12038-022-00292-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
12
|
Detecting overlapping communities in complex networks using non-cooperative games. Sci Rep 2022; 12:11054. [PMID: 35773382 PMCID: PMC9247049 DOI: 10.1038/s41598-022-15095-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 06/17/2022] [Indexed: 11/08/2022] Open
Abstract
Detecting communities in complex networks is of paramount importance, and its wide range of real-life applications in various areas has caused a lot of attention to be paid to it, and many efforts have been made to have efficient and accurate algorithms for this purpose. In this paper, we proposed a non-cooperative game theoretic-based algorithm that is able to detect overlapping communities. In this algorithm, nodes are regarded as players, and communities are assumed to be groups of players with similar strategies. Our two-phase algorithm detects communities and the overlapping nodes in separate phases that, while increasing the accuracy, especially in detecting overlapping nodes, brings about higher algorithm speed. Moreover, there is no need for setting parameters regarding the size or number of communities, and the absence of any stochastic process caused this algorithm to be stable. By appropriately adjusting stop criteria, our algorithm can be categorized among those with linear time complexity, making it highly scalable for large networks. Experiments on synthetic and real-world networks demonstrate our algorithm's good performance compared to similar algorithms in terms of detected overlapping nodes, detected communities size distribution, modularity, and normalized mutual information.
Collapse
|
13
|
Martínez-González JU, Riascos AP. Activity of vehicles in the bus rapid transit system Metrobús in Mexico City. Sci Rep 2022; 12:98. [PMID: 34997045 PMCID: PMC8742109 DOI: 10.1038/s41598-021-04037-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 12/10/2021] [Indexed: 12/27/2022] Open
Abstract
In this paper, we analyze a massive dataset with registers of the movement of vehicles in the bus rapid transit system Metrobús in Mexico City from February 2020 to April 2021. With these records and a division of the system into 214 geographical regions (segments), we characterize the vehicles' activity through the statistical analysis of speeds in each zone. We use the Kullback-Leibler distance to compare the movement of vehicles in each segment and its evolution. The results for the dynamics in different zones are represented as a network where nodes define segments of the system Metrobús and edges describe similarity in the activity of vehicles. Community detection algorithms in this network allow the identification of patterns considering different levels of similarity in the distribution of speeds providing a framework for unsupervised classification of the movement of vehicles. The methods developed in this research are general and can be implemented to describe the activity of different transportation systems with detailed records of the movement of users or vehicles.
Collapse
Affiliation(s)
- Jaspe U Martínez-González
- Instituto de Física, Universidad Nacional Autónoma de México, Ciudad Universitaria, 04510, Mexico City, Mexico
| | - Alejandro P Riascos
- Instituto de Física, Universidad Nacional Autónoma de México, Ciudad Universitaria, 04510, Mexico City, Mexico.
| |
Collapse
|
14
|
Klein B, Holmér L, Smith KM, Johnson MM, Swain A, Stolp L, Teufel AI, Kleppe AS. A computational exploration of resilience and evolvability of protein-protein interaction networks. Commun Biol 2021; 4:1352. [PMID: 34857859 PMCID: PMC8639913 DOI: 10.1038/s42003-021-02867-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 11/03/2021] [Indexed: 11/09/2022] Open
Abstract
Protein-protein interaction (PPI) networks represent complex intra-cellular protein interactions, and the presence or absence of such interactions can lead to biological changes in an organism. Recent network-based approaches have shown that a phenotype's PPI network's resilience to environmental perturbations is related to its placement in the tree of life; though we still do not know how or why certain intra-cellular factors can bring about this resilience. Here, we explore the influence of gene expression and network properties on PPI networks' resilience. We use publicly available data of PPIs for E. coli, S. cerevisiae, and H. sapiens, where we compute changes in network resilience as new nodes (proteins) are added to the networks under three node addition mechanisms-random, degree-based, and gene-expression-based attachments. By calculating the resilience of the resulting networks, we estimate the effectiveness of these node addition mechanisms. We demonstrate that adding nodes with gene-expression-based preferential attachment (as opposed to random or degree-based) preserves and can increase the original resilience of PPI network in all three species, regardless of gene expression distribution or network structure. These findings introduce a general notion of prospective resilience, which highlights the key role of network structures in understanding the evolvability of phenotypic traits.
Collapse
Affiliation(s)
- Brennan Klein
- Network Science Institute, Northeastern University, Boston, MA, USA. .,Laboratory for the Modeling of Biological and Socio-Technical Systems, Northeastern University, Boston, MA, USA.
| | - Ludvig Holmér
- grid.419684.60000 0001 1214 1861Center for Data Analytics, Stockholm School of Economics, Stockholm, Sweden
| | - Keith M. Smith
- grid.12361.370000 0001 0727 0669Department of Physics and Mathematics, Nottingham Trent University, Nottingham, UK
| | - Mackenzie M. Johnson
- grid.89336.370000 0004 1936 9924Department of Integrative Biology, University of Texas at Austin, Austin, TX USA
| | - Anshuman Swain
- grid.164295.d0000 0001 0941 7177Department of Biology, University of Maryland, College Park, MD USA
| | - Laura Stolp
- grid.7177.60000000084992262Graduate School of Science, University of Amsterdam, Amsterdam, The Netherlands
| | - Ashley I. Teufel
- grid.89336.370000 0004 1936 9924Department of Integrative Biology, University of Texas at Austin, Austin, TX USA ,grid.209665.e0000 0001 1941 1940Santa Fe Institute, Santa Fe, NM USA ,grid.469272.c0000 0001 0180 5693Texas A&M University, San Antonio, San Antonio, TX USA
| | - April S. Kleppe
- grid.5949.10000 0001 2172 9288Institute for Evolution and Biodiversity, University of Münster, Münster, Germany ,grid.7048.b0000 0001 1956 2722Department of Clinical Medicine (MOMA), Aarhus University, Aarhus, Denmark
| |
Collapse
|
15
|
Faccin M, Schaub MT, Delvenne JC. State Aggregations in Markov Chains and Block Models of Networks. PHYSICAL REVIEW LETTERS 2021; 127:078301. [PMID: 34459654 DOI: 10.1103/physrevlett.127.078301] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 06/17/2021] [Accepted: 07/15/2021] [Indexed: 06/13/2023]
Abstract
We consider state-aggregation schemes for Markov chains from an information-theoretic perspective. Specifically, we consider aggregating the states of a Markov chain such that the mutual information of the aggregated states separated by T time steps is maximized. We show that for T=1 this recovers the maximum-likelihood estimator of the degree-corrected stochastic block model as a particular case, which enables us to explain certain features of the likelihood landscape of this generative network model from a dynamical lens. We further highlight how we can uncover coherent, long-range dynamical modules for which considering a timescale T≫1 is essential. We demonstrate our results using synthetic flows and real-world ocean currents, where we are able to recover the fundamental features of the surface currents of the oceans.
Collapse
Affiliation(s)
- Mauro Faccin
- ICTEAM, Université catholique de Louvain, 1348 Louvain-la-Neuve, Belgium
| | - Michael T Schaub
- Department of Engineering Science, University of Oxford, Oxford OX1 2JD, United Kingdom
- Department of Computer Science, RWTH Aachen University, 52074 Aachen, Germany
| | - Jean-Charles Delvenne
- ICTEAM, Université catholique de Louvain, 1348 Louvain-la-Neuve, Belgium
- CORE, Université catholique de Louvain, 1348 Louvain-la-Neuve, Belgium
| |
Collapse
|
16
|
Korhonen O, Zanin M, Papo D. Principles and open questions in functional brain network reconstruction. Hum Brain Mapp 2021; 42:3680-3711. [PMID: 34013636 PMCID: PMC8249902 DOI: 10.1002/hbm.25462] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 03/11/2021] [Accepted: 04/10/2021] [Indexed: 12/12/2022] Open
Abstract
Graph theory is now becoming a standard tool in system-level neuroscience. However, endowing observed brain anatomy and dynamics with a complex network representation involves often covert theoretical assumptions and methodological choices which affect the way networks are reconstructed from experimental data, and ultimately the resulting network properties and their interpretation. Here, we review some fundamental conceptual underpinnings and technical issues associated with brain network reconstruction, and discuss how their mutual influence concurs in clarifying the organization of brain function.
Collapse
Affiliation(s)
- Onerva Korhonen
- Department of Computer ScienceAalto University, School of ScienceHelsinki
- Centre for Biomedical TechnologyUniversidad Politécnica de MadridPozuelo de Alarcón
| | - Massimiliano Zanin
- Instituto de Física Interdisciplinar y Sistemas Complejos IFISC (CSIC‐UIB), Campus UIBPalma de MallorcaSpain
| | - David Papo
- Fondazione Istituto Italiano di TecnologiaFerrara
- Department of Neuroscience and Rehabilitation, Section of PhysiologyUniversity of FerraraFerrara
| |
Collapse
|
17
|
Akbar S, Saritha SK. Quantum inspired community detection for analysis of biodiversity change driven by land-use conversion and climate change. Sci Rep 2021; 11:14332. [PMID: 34253748 PMCID: PMC8275618 DOI: 10.1038/s41598-021-93122-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 06/21/2021] [Indexed: 02/06/2023] Open
Abstract
Community detection remains little explored in the analysis of biodiversity change. The challenges linked with global biodiversity change have also multiplied manifold in the past few decades. Moreover, most studies concerning biodiversity change lack the quantitative treatment central to species distribution modeling. Empirical analysis of species distribution and abundance is thus integral to the study of biodiversity loss and biodiversity alterations. Community detection is therefore expected to efficiently model the topological aspect of biodiversity change driven by land-use conversion and climate change; given that it has already proven superior for diverse problems in the domain of social network analysis and subgroup discovery in complex systems. Thus, quantum inspired community detection is proposed as a novel technique to predict biodiversity change considering tiger population in eighteen states of India; leading to benchmarking of two novel datasets. Elements of land-use conversion and climate change are explored to design these datasets viz.-Landscape based distribution and Number of tiger reserves based distribution respectively; for predicting regions expected to maximize Tiger population growth. Furthermore, validation of the proposed framework on the said datasets is performed using standard community detection metrics like-Modularity, Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), Degree distribution, Degree centrality and Edge-betweenness centrality. Quantum inspired community detection has also been successful in demonstrating an association between biodiversity change, land-use conversion and climate change; validated statistically by Pearson's correlation coefficient and p value test. Finally, modularity distribution based on parameter tuning establishes the superiority of the second dataset based on the number of Tiger reserves-in predicting regions maximizing Tiger population growth fostering species distribution and abundance; apart from scripting a stronger correlation of biodiversity change with land-use conversion.
Collapse
Affiliation(s)
- Sana Akbar
- Department of CSE, MANIT, Bhopal, India.
| | | |
Collapse
|
18
|
Abstract
AbstractIn this paper we utilize an opportunity to construct ground truths for topics in the field of atomic, molecular and optical physics. Our research questions in this paper focus on (i) how to construct a ground truth for topics and (ii) the suitability of common algorithms applied to bibliometric networks to reconstruct these topics. We use the ground truths to test two data models (direct citation and bibliographic coupling) with two algorithms (the Leiden algorithm and the Infomap algorithm). Our results are discomforting: none of the four combinations leads to a consistent reconstruction of the ground truths. No combination of data model and algorithm simultaneously reconstructs all micro-level topics at any resolution level. Meso-level topics are not reconstructed at all. This suggests (a) that we are currently unable to predict which combination of data model, algorithm and parameter setting will adequately reconstruct which (types of) topics, and (b) that a combination of several data models, algorithms and parameter settings appears to be necessary to reconstruct all or most topics in a set of papers.
Collapse
|
19
|
Verma P, Goyal R. Influence propagation based community detection in complex networks. MACHINE LEARNING WITH APPLICATIONS 2021. [DOI: 10.1016/j.mlwa.2020.100019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
20
|
Correspondence analysis-based network clustering and importance of degenerate solutions unification of spectral clustering and modularity maximization. SOCIAL NETWORK ANALYSIS AND MINING 2020. [DOI: 10.1007/s13278-020-00686-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
21
|
The modular organization of brain cortical connectivity across the human lifespan. Neuroimage 2020; 218:116974. [DOI: 10.1016/j.neuroimage.2020.116974] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 02/17/2020] [Accepted: 05/17/2020] [Indexed: 11/19/2022] Open
|
22
|
Škrlj B, Kralj J, Lavrač N. Embedding-based Silhouette community detection. Mach Learn 2020; 109:2161-2193. [PMID: 33191975 PMCID: PMC7652809 DOI: 10.1007/s10994-020-05882-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 12/22/2019] [Accepted: 05/07/2020] [Indexed: 11/29/2022]
Abstract
Mining complex data in the form of networks is of increasing interest in many scientific disciplines. Network communities correspond to densely connected subnetworks, and often represent key functional parts of real-world systems. This paper proposes the embedding-based Silhouette community detection (SCD), an approach for detecting communities, based on clustering of network node embeddings, i.e. real valued representations of nodes derived from their neighborhoods. We investigate the performance of the proposed SCD approach on 234 synthetic networks, as well as on a real-life social network. Even though SCD is not based on any form of modularity optimization, it performs comparably or better than state-of-the-art community detection algorithms, such as the InfoMap and Louvain. Further, we demonstrate that SCD's outputs can be used along with domain ontologies in semantic subgroup discovery, yielding human-understandable explanations of communities detected in a real-life protein interaction network. Being embedding-based, SCD is widely applicable and can be tested out-of-the-box as part of many existing network learning and exploration pipelines.
Collapse
Affiliation(s)
- Blaž Škrlj
- Jožef Stefan Institute, Jamova 39, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, Jamova 39, 1000 Ljubljana, Slovenia
| | - Jan Kralj
- Jožef Stefan Institute, Jamova 39, 1000 Ljubljana, Slovenia
- CosyLab, Gerbičeva ulica 64, 1000 Ljubljana, Slovenia
| | - Nada Lavrač
- Jožef Stefan Institute, Jamova 39, 1000 Ljubljana, Slovenia
- University of Nova Gorica, Vipavska 13, 5000 Nova Gorica, Slovenia
| |
Collapse
|
23
|
Pascual‐García A, Bell T. functionInk: An efficient method to detect functional groups in multidimensional networks reveals the hidden structure of ecological communities. Methods Ecol Evol 2020. [DOI: 10.1111/2041-210x.13377] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | - Thomas Bell
- Department of Life Sciences Imperial College London Ascot UK
| |
Collapse
|
24
|
Haneef F, Abbasi RA, Sindhu MA, Khattak AS, Noor MN, Aljohani NR, Daud A, Arafat S. Using network science to understand the link between subjects and professions. COMPUTERS IN HUMAN BEHAVIOR 2020. [DOI: 10.1016/j.chb.2019.106228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
25
|
Patelli A, Gabrielli A, Cimini G. Generalized Markov stability of network communities. Phys Rev E 2020; 101:052301. [PMID: 32575290 DOI: 10.1103/physreve.101.052301] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Accepted: 03/24/2020] [Indexed: 11/07/2022]
Abstract
We address the problem of community detection in networks by introducing a general definition of Markov stability, based on the difference between the probability fluxes of a Markov chain on the network at different timescales. The specific implementation of the quality function and the resulting optimal community structure thus become dependent both on the type of Markov process and on the specific Markov times considered. For instance, if we use a natural Markov chain dynamics and discount its stationary distribution (that is, we take as reference process the dynamics at infinite time) we obtain the standard formulation of the Markov stability. Notably, the possibility to use finite-time transition probabilities to define the reference process naturally allows detecting communities at different resolutions, without the need to consider a continuous-time Markov chain in the small time limit. The main advantage of our general formulation of Markov stability based on dynamical flows is that we work with lumped Markov chains on network partitions, having the same stationary distribution of the original process. In this way the form of the quality function becomes invariant under partitioning, leading to a self-consistent definition of community structures at different aggregation scales.
Collapse
Affiliation(s)
- Aurelio Patelli
- Istituto dei Sistemi Complessi (CNR), UoS Dipartimento di Fisica, "Sapienza" Università di Roma, 00185 Rome, Italy.,Service de Physique de l'Etat Condensé, UMR 3680 CEA-CNRS, Université Paris-Saclay, CEA-Saclay, 91191 Gif-sur-Yvette, France
| | - Andrea Gabrielli
- Istituto dei Sistemi Complessi (CNR), UoS Dipartimento di Fisica, "Sapienza" Università di Roma, 00185 Rome, Italy.,Dipartimento di Ingegneria, Università Roma 3, 00146 Rome, Italy
| | - Giulio Cimini
- Istituto dei Sistemi Complessi (CNR), UoS Dipartimento di Fisica, "Sapienza" Università di Roma, 00185 Rome, Italy.,Dipartimento di Fisica, Università di Roma Tor Vergata, 00133 Rome, Italy
| |
Collapse
|
26
|
Ioannoni V, Vitale T, Costa C, Elliott I. Depicting communities of Romani studies: on the who, when and where of Roma related scientific publications. Scientometrics 2020. [DOI: 10.1007/s11192-020-03352-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
27
|
Faskowitz J, Sporns O. Mapping the community structure of the rat cerebral cortex with weighted stochastic block modeling. Brain Struct Funct 2020; 225:71-84. [PMID: 31760493 PMCID: PMC11220483 DOI: 10.1007/s00429-019-01984-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 11/09/2019] [Indexed: 01/01/2023]
Abstract
The anatomical architecture of the mammalian brain can be modeled as the connectivity between functionally distinct areas of cortex and sub-cortex, which we refer to as the connectome. The community structure of the connectome describes how the network can be parsed into meaningful groups of nodes. This process, called community detection, is commonly carried out to find internally densely connected communities-a modular topology. However, other community structure patterns are possible. Here we employ the weighted stochastic block model (WSBM), which can identify a wide range of topologies, to the rat cerebral cortex connectome, to probe the network for evidence of modular, core, periphery, and disassortative organization. Despite its algorithmic flexibility, the WSBM identifies substantial modular and assortative topology throughout the rat cerebral cortex connectome, significantly aligning to the modular approach in some parts of the network. Significant deviations from modular partitions include the identification of communities that are highly enriched in core (rich club) areas. A comparison of the WSBM and modular models demonstrates that the former, when applied as a generative model, more closely captures several nodal network attributes. An analysis of variation across an ensemble of partitions reveals that certain parts of the network participate in multiple topological regimes. Overall, our findings demonstrate the potential benefits of adopting the WSBM, which can be applied to a single weighted and directed matrix such as the rat cerebral cortex connectome, to identify community structure with a broad definition that transcends the common modular approach.
Collapse
Affiliation(s)
- Joshua Faskowitz
- Program in Neuroscience, Indiana University, Bloomington, IN, USA.
- Department of Psychological and Brain Sciences, Indiana University, 1101 E. 10th Street, Bloomington, IN, 47405, USA.
| | - Olaf Sporns
- Program in Neuroscience, Indiana University, Bloomington, IN, USA
- Department of Psychological and Brain Sciences, Indiana University, 1101 E. 10th Street, Bloomington, IN, 47405, USA
- Indiana University Network Science Institute, Indiana University, Bloomington, IN, USA
| |
Collapse
|
28
|
|
29
|
Guo J, Singh P, Bassler KE. Reduced network extremal ensemble learning (RenEEL) scheme for community detection in complex networks. Sci Rep 2019; 9:14234. [PMID: 31578406 PMCID: PMC6775136 DOI: 10.1038/s41598-019-50739-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 09/17/2019] [Indexed: 11/30/2022] Open
Abstract
We introduce an ensemble learning scheme for community detection in complex networks. The scheme uses a Machine Learning algorithmic paradigm we call Extremal Ensemble Learning. It uses iterative extremal updating of an ensemble of network partitions, which can be found by a conventional base algorithm, to find a node partition that maximizes modularity. At each iteration, core groups of nodes that are in the same community in every ensemble partition are identified and used to form a reduced network. Partitions of the reduced network are then found and used to update the ensemble. The smaller size of the reduced network makes the scheme efficient. We use the scheme to analyze the community structure in a set of commonly studied benchmark networks and find that it outperforms all other known methods for finding the partition with maximum modularity.
Collapse
Affiliation(s)
- Jiahao Guo
- Department of Physics, University of Houston, Houston, Texas, 77204, USA.,Texas Center for Superconductivity, University of Houston, Houston, Texas, 77204, USA
| | - Pramesh Singh
- Department of Physics, University of Houston, Houston, Texas, 77204, USA.,Texas Center for Superconductivity, University of Houston, Houston, Texas, 77204, USA
| | - Kevin E Bassler
- Department of Physics, University of Houston, Houston, Texas, 77204, USA. .,Texas Center for Superconductivity, University of Houston, Houston, Texas, 77204, USA. .,Department of Mathematics, University of Houston, Houston, Texas, 77204, USA.
| |
Collapse
|
30
|
Li X, Wu X, Xu S, Qing S, Chang PC. A novel complex network community detection approach using discrete particle swarm optimization with particle diversity and mutation. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2019.05.003] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
31
|
Schaub MT, Delvenne JC, Lambiotte R, Barahona M. Multiscale dynamical embeddings of complex networks. Phys Rev E 2019; 99:062308. [PMID: 31330590 DOI: 10.1103/physreve.99.062308] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Indexed: 11/07/2022]
Abstract
Complex systems and relational data are often abstracted as dynamical processes on networks. To understand, predict, and control their behavior, a crucial step is to extract reduced descriptions of such networks. Inspired by notions from control theory, we propose a time-dependent dynamical similarity measure between nodes, which quantifies the effect a node-input has on the network. This dynamical similarity induces an embedding that can be employed for several analysis tasks. Here we focus on (i) dimensionality reduction, i.e., projecting nodes onto a low-dimensional space that captures dynamic similarity at different timescales, and (ii) how to exploit our embeddings to uncover functional modules. We exemplify our ideas through case studies focusing on directed networks without strong connectivity and signed networks. We further highlight how certain ideas from community detection can be generalized and linked to control theory, by using the here developed dynamical perspective.
Collapse
Affiliation(s)
- Michael T Schaub
- Institute for Data, Systems and Society, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.,Department of Engineering Science, University of Oxford, Oxford, United Kingdom
| | - Jean-Charles Delvenne
- ICTEAM, Université catholique de Louvain, B-1348 Louvain-la-Neuve, Belgium.,CORE, Université catholique de Louvain, B-1348 Louvain-la-Neuve, Belgium
| | - Renaud Lambiotte
- Mathematical Institute, University of Oxford, Oxford, United Kingdom
| | - Mauricio Barahona
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
32
|
Mu C, Zhang J, Liu Y, Qu R, Huang T. Multi-objective ant colony optimization algorithm based on decomposition for community detection in complex networks. Soft comput 2019. [DOI: 10.1007/s00500-019-03820-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
33
|
Altuncu MT, Mayer E, Yaliraki SN, Barahona M. From free text to clusters of content in health records: an unsupervised graph partitioning approach. APPLIED NETWORK SCIENCE 2019; 4:2. [PMID: 30906850 PMCID: PMC6400329 DOI: 10.1007/s41109-018-0109-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 11/06/2018] [Indexed: 05/07/2023]
Abstract
Electronic healthcare records contain large volumes of unstructured data in different forms. Free text constitutes a large portion of such data, yet this source of richly detailed information often remains under-used in practice because of a lack of suitable methodologies to extract interpretable content in a timely manner. Here we apply network-theoretical tools to the analysis of free text in Hospital Patient Incident reports in the English National Health Service, to find clusters of reports in an unsupervised manner and at different levels of resolution based directly on the free text descriptions contained within them. To do so, we combine recently developed deep neural network text-embedding methodologies based on paragraph vectors with multi-scale Markov Stability community detection applied to a similarity graph of documents obtained from sparsified text vector similarities. We showcase the approach with the analysis of incident reports submitted in Imperial College Healthcare NHS Trust, London. The multiscale community structure reveals levels of meaning with different resolution in the topics of the dataset, as shown by relevant descriptive terms extracted from the groups of records, as well as by comparing a posteriori against hand-coded categories assigned by healthcare personnel. Our content communities exhibit good correspondence with well-defined hand-coded categories, yet our results also provide further medical detail in certain areas as well as revealing complementary descriptors of incidents beyond the external classification. We also discuss how the method can be used to monitor reports over time and across different healthcare providers, and to detect emerging trends that fall outside of pre-existing categories.
Collapse
Affiliation(s)
- M. Tarik Altuncu
- Department of Mathematics, Imperial College London, South Kensington campus, London, SW7 2AZ UK
- EPSRC Centre for Mathematics of Precision Healthcare, Imperial College London, South Kensington campus, London, SW7 2AZ UK
| | - Erik Mayer
- Centre for Health Policy, Institute of Global Health Innovation, Imperial College London, St Mary’s campus, London, W2 1NY UK
- EPSRC Centre for Mathematics of Precision Healthcare, Imperial College London, South Kensington campus, London, SW7 2AZ UK
| | - Sophia N. Yaliraki
- Department of Chemistry, Imperial College London, South Kensington campus, London, SW7 2AZ UK
- EPSRC Centre for Mathematics of Precision Healthcare, Imperial College London, South Kensington campus, London, SW7 2AZ UK
| | - Mauricio Barahona
- Department of Mathematics, Imperial College London, South Kensington campus, London, SW7 2AZ UK
- EPSRC Centre for Mathematics of Precision Healthcare, Imperial College London, South Kensington campus, London, SW7 2AZ UK
| |
Collapse
|
34
|
Faskowitz J, Yan X, Zuo XN, Sporns O. Weighted Stochastic Block Models of the Human Connectome across the Life Span. Sci Rep 2018; 8:12997. [PMID: 30158553 PMCID: PMC6115421 DOI: 10.1038/s41598-018-31202-1] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 08/14/2018] [Indexed: 01/19/2023] Open
Abstract
The human brain can be described as a complex network of anatomical connections between distinct areas, referred to as the human connectome. Fundamental characteristics of connectome organization can be revealed using the tools of network science and graph theory. Of particular interest is the network's community structure, commonly identified by modularity maximization, where communities are conceptualized as densely intra-connected and sparsely inter-connected. Here we adopt a generative modeling approach called weighted stochastic block models (WSBM) that can describe a wider range of community structure topologies by explicitly considering patterned interactions between communities. We apply this method to the study of changes in the human connectome that occur across the life span (between 6-85 years old). We find that WSBM communities exhibit greater hemispheric symmetry and are spatially less compact than those derived from modularity maximization. We identify several network blocks that exhibit significant linear and non-linear changes across age, with the most significant changes involving subregions of prefrontal cortex. Overall, we show that the WSBM generative modeling approach can be an effective tool for describing types of community structure in brain networks that go beyond modularity.
Collapse
Affiliation(s)
- Joshua Faskowitz
- Program in Neuroscience, Indiana University, Bloomington, IN, USA
- Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN, USA
| | - Xiaoran Yan
- Indiana University Network Science Institute, Indiana University, Bloomington, IN, USA
| | - Xi-Nian Zuo
- CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing, China
- Research Center for Lifespan Development of Mind and Brain (CLIMB), Institute of Psychology, Beijing, China
- Key Laboratory for Brain and Education Sciences, Nanning Normal University, Nanning, Guangxi, 530001, China
| | - Olaf Sporns
- Program in Neuroscience, Indiana University, Bloomington, IN, USA.
- Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN, USA.
- Indiana University Network Science Institute, Indiana University, Bloomington, IN, USA.
| |
Collapse
|
35
|
Šubelj L. Convex skeletons of complex networks. J R Soc Interface 2018; 15:20180422. [PMID: 30111666 PMCID: PMC6127167 DOI: 10.1098/rsif.2018.0422] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Accepted: 07/13/2018] [Indexed: 11/12/2022] Open
Abstract
A convex network can be defined as a network such that every connected induced subgraph includes all the shortest paths between its nodes. A fully convex network would therefore be a collection of cliques stitched together in a tree. In this paper, we study the largest high-convexity part of empirical networks obtained by removing the least number of edges, which we call a convex skeleton. A convex skeleton is a generalization of a network spanning tree in which each edge can be replaced by a clique of arbitrary size. We present different approaches for extracting convex skeletons and apply them to social collaboration and protein interactions networks, autonomous systems graphs and food webs. We show that the extracted convex skeletons retain the degree distribution, clustering, connectivity, distances, node position and also community structure, while making the shortest paths between the nodes largely unique. Moreover, in the Slovenian computer scientists coauthorship network, a convex skeleton retains the strongest ties between the authors, differently from a spanning tree or high-betweenness backbone and high-salience skeleton. A convex skeleton thus represents a simple definition of a network backbone with applications in coauthorship and other social collaboration networks.
Collapse
Affiliation(s)
- Lovro Šubelj
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
36
|
Gandica Y, Geraci MV, Béreau S, Gnabo JY. Fragmentation, integration and macroprudential surveillance of the US financial industry: Insights from network science. PLoS One 2018; 13:e0195110. [PMID: 29694415 PMCID: PMC5919003 DOI: 10.1371/journal.pone.0195110] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2017] [Accepted: 03/13/2018] [Indexed: 11/25/2022] Open
Abstract
Drawing on recent contributions inferring financial interconnectedness from market data, our paper provides new insights on the evolution of the US financial industry over a long period of time by using several tools coming from network science. Relying on a Time-Varying Parameter Vector AutoRegressive (TVP-VAR) approach on stock market returns to retrieve unobserved directed links among financial institutions, we reconstruct a fully dynamic network in the sense that connections are let to evolve through time. The financial system analysed consists of a large set of 155 financial institutions that are all the banks, broker-dealers, insurance and real estate companies listed in the Standard & Poors’ 500 index over the 1993–2014 period. Looking alternatively at the individual, then sector-, community- and system-wide levels, we show that network sciences’ tools are able to support well-known features of the financial markets such as the dramatic fall of connectivity following Lehman Brothers’ collapse. More importantly, by means of less traditional metrics, such as sectoral interface or measurements based on contagion processes, our results document the co-existence of both fragmentation and integration phases between firms independently from the sectors they belong to, and doing so, question the relevance of existing macroprudential surveillance frameworks which have been mostly developed on a sectoral basis. Overall, our results improve our understanding of the US financial landscape and may have important implications for risk monitoring as well as macroprudential policy design.
Collapse
Affiliation(s)
- Yerali Gandica
- CeReFiM (DeFiPP), Université de Namur, Namur, Belgium
- Namur Center for Complex Systems - naXys, Université de Namur, Namur, Belgium
- * E-mail:
| | - Marco Valerio Geraci
- CeReFiM (DeFiPP), Université de Namur, Namur, Belgium
- ECARES, Université libre de Bruxelles, Brussels, Belgium
| | - Sophie Béreau
- CeReFiM (DeFiPP), Université de Namur, Namur, Belgium
- Namur Center for Complex Systems - naXys, Université de Namur, Namur, Belgium
- CORE, Université catholique de Louvain, Louvain-la-Neuve, Belgium
| | - Jean-Yves Gnabo
- CeReFiM (DeFiPP), Université de Namur, Namur, Belgium
- Namur Center for Complex Systems - naXys, Université de Namur, Namur, Belgium
| |
Collapse
|
37
|
Abstract
Assortative mixing in networks is the tendency for nodes with the same attributes, or metadata, to link to each other. It is a property often found in social networks, manifesting as a higher tendency of links occurring between people of the same age, race, or political belief. Quantifying the level of assortativity or disassortativity (the preference of linking to nodes with different attributes) can shed light on the organization of complex networks. It is common practice to measure the level of assortativity according to the assortativity coefficient, or modularity in the case of categorical metadata. This global value is the average level of assortativity across the network and may not be a representative statistic when mixing patterns are heterogeneous. For example, a social network spanning the globe may exhibit local differences in mixing patterns as a consequence of differences in cultural norms. Here, we introduce an approach to localize this global measure so that we can describe the assortativity, across multiple scales, at the node level. Consequently, we are able to capture and qualitatively evaluate the distribution of mixing patterns in the network. We find that, for many real-world networks, the distribution of assortativity is skewed, overdispersed, and multimodal. Our method provides a clearer lens through which we can more closely examine mixing patterns in networks.
Collapse
Affiliation(s)
- Leto Peel
- Institute of Information and Communication Technologies, Electronics and Applied Mathematics (ICTEAM), Université Catholique de Louvain, Louvain-la-Neuve B-1348, Belgium;
- Namur Institute for Complex Systems (naXys), Université de Namur, Namur B-5000, Belgium
| | - Jean-Charles Delvenne
- Institute of Information and Communication Technologies, Electronics and Applied Mathematics (ICTEAM), Université Catholique de Louvain, Louvain-la-Neuve B-1348, Belgium;
- Center for Operations Research and Econometrics (CORE), Université Catholique de Louvain, Louvain-la-Neuve B-1348, Belgium
| | - Renaud Lambiotte
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom
| |
Collapse
|
38
|
Mondragon RJ, Iacovacci J, Bianconi G. Multilink communities of multiplex networks. PLoS One 2018; 13:e0193821. [PMID: 29558471 PMCID: PMC5860749 DOI: 10.1371/journal.pone.0193821] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2017] [Accepted: 02/20/2018] [Indexed: 11/19/2022] Open
Abstract
Multiplex networks describe a large number of complex social, biological and transportation networks where a set of nodes is connected by links of different nature and connotation. Here we uncover the rich community structure of multiplex networks by associating a community to each multilink where the multilinks characterize the connections existing between any two nodes of the multiplex network. Our community detection method reveals the rich interplay between the mesoscale structure of the multiplex networks and their multiplexity. For instance some nodes can belong to many layers and few communities while others can belong to few layers but many communities. Moreover the multilink communities can be formed by a different number of relevant layers. These results point out that mesoscopically there can be large differences in the compressibility of multiplex networks.
Collapse
Affiliation(s)
- Raul J. Mondragon
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom
- * E-mail:
| | - Jacopo Iacovacci
- School of Mathematical Sciences, Queen Mary University of London, London, United Kingdom
| | - Ginestra Bianconi
- School of Mathematical Sciences, Queen Mary University of London, London, United Kingdom
| |
Collapse
|
39
|
Schmidt C, Piper D, Pester B, Mierau A, Witte H. Tracking the Reorganization of Module Structure in Time-Varying Weighted Brain Functional Connectivity Networks. Int J Neural Syst 2018; 28:1750051. [DOI: 10.1142/s0129065717500514] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Identification of module structure in brain functional networks is a promising way to obtain novel insights into neural information processing, as modules correspond to delineated brain regions in which interactions are strongly increased. Tracking of network modules in time-varying brain functional networks is not yet commonly considered in neuroscience despite its potential for gaining an understanding of the time evolution of functional interaction patterns and associated changing degrees of functional segregation and integration. We introduce a general computational framework for extracting consensus partitions from defined time windows in sequences of weighted directed edge-complete networks and show how the temporal reorganization of the module structure can be tracked and visualized. Part of the framework is a new approach for computing edge weight thresholds for individual networks based on multiobjective optimization of module structure quality criteria as well as an approach for matching modules across time steps. By testing our framework using synthetic network sequences and applying it to brain functional networks computed from electroencephalographic recordings of healthy subjects that were exposed to a major balance perturbation, we demonstrate the framework’s potential for gaining meaningful insights into dynamic brain function in the form of evolving network modules. The precise chronology of the neural processing inferred with our framework and its interpretation helps to improve the currently incomplete understanding of the cortical contribution for the compensation of such balance perturbations.
Collapse
Affiliation(s)
- Christoph Schmidt
- Bernstein Group for Computational Neuroscience Jena, Institute of Medical Statistics, Computer Sciences and Documentation, Jena University Hospital, Friedrich Schiller University Jena, Bachstrasse 18, 07743 Jena, Germany
| | - Diana Piper
- Bernstein Group for Computational Neuroscience Jena, Institute of Medical Statistics, Computer Sciences and Documentation, Jena University Hospital, Friedrich Schiller University Jena, Bachstrasse 18, 07743 Jena, Germany
| | - Britta Pester
- Bernstein Group for Computational Neuroscience Jena, Institute of Medical Statistics, Computer Sciences and Documentation, Jena University Hospital, Friedrich Schiller University Jena, Bachstrasse 18, 07743 Jena, Germany
| | - Andreas Mierau
- Institute of Movement and Neurosciences, German Sport University Cologne, Am Sportpark Muengersdorf 6, 50933 Cologne, Germany
| | - Herbert Witte
- Bernstein Group for Computational Neuroscience Jena, Institute of Medical Statistics, Computer Sciences and Documentation, Jena University Hospital, Friedrich Schiller University Jena, Bachstrasse 18, 07743 Jena, Germany
| |
Collapse
|
40
|
Jeub LGS, Sporns O, Fortunato S. Multiresolution Consensus Clustering in Networks. Sci Rep 2018; 8:3259. [PMID: 29459635 PMCID: PMC5818657 DOI: 10.1038/s41598-018-21352-7] [Citation(s) in RCA: 79] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Accepted: 02/02/2018] [Indexed: 11/16/2022] Open
Abstract
Networks often exhibit structure at disparate scales. We propose a method for identifying community structure at different scales based on multiresolution modularity and consensus clustering. Our contribution consists of two parts. First, we propose a strategy for sampling the entire range of possible resolutions for the multiresolution modularity quality function. Our approach is directly based on the properties of modularity and, in particular, provides a natural way of avoiding the need to increase the resolution parameter by several orders of magnitude to break a few remaining small communities, necessitating the introduction of ad-hoc limits to the resolution range with standard sampling approaches. Second, we propose a hierarchical consensus clustering procedure, based on a modified modularity, that allows one to construct a hierarchical consensus structure given a set of input partitions. While here we are interested in its application to partitions sampled using multiresolution modularity, this consensus clustering procedure can be applied to the output of any clustering algorithm. As such, we see many potential applications of the individual parts of our multiresolution consensus clustering procedure in addition to using the procedure itself to identify hierarchical structure in networks.
Collapse
Affiliation(s)
- Lucas G S Jeub
- School of Informatics, Computing and Engineering, Indiana University, Indiana, United States.
| | - Olaf Sporns
- Department of Psychological and Brain Sciences, Indiana University, Indiana, United States
- Network Science Institute (IUNI), Indiana University, Indiana, United States
| | - Santo Fortunato
- School of Informatics, Computing and Engineering, Indiana University, Indiana, United States
- Network Science Institute (IUNI), Indiana University, Indiana, United States
| |
Collapse
|
41
|
Betzel RF, Bassett DS. Generative models for network neuroscience: prospects and promise. J R Soc Interface 2017; 14:20170623. [PMID: 29187640 PMCID: PMC5721166 DOI: 10.1098/rsif.2017.0623] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 11/06/2017] [Indexed: 12/22/2022] Open
Abstract
Network neuroscience is the emerging discipline concerned with investigating the complex patterns of interconnections found in neural systems, and identifying principles with which to understand them. Within this discipline, one particularly powerful approach is network generative modelling, in which wiring rules are algorithmically implemented to produce synthetic network architectures with the same properties as observed in empirical network data. Successful models can highlight the principles by which a network is organized and potentially uncover the mechanisms by which it grows and develops. Here, we review the prospects and promise of generative models for network neuroscience. We begin with a primer on network generative models, with a discussion of compressibility and predictability, and utility in intuiting mechanisms, followed by a short history on their use in network science, broadly. We then discuss generative models in practice and application, paying particular attention to the critical need for cross-validation. Next, we review generative models of biological neural networks, both at the cellular and large-scale level, and across a variety of species including Caenorhabditis elegans, Drosophila, mouse, rat, cat, macaque and human. We offer a careful treatment of a few relevant distinctions, including differences between generative models and null models, sufficiency and redundancy, inferring and claiming mechanism, and functional and structural connectivity. We close with a discussion of future directions, outlining exciting frontiers both in empirical data collection efforts as well as in method and theory development that, together, further the utility of the generative network modelling approach for network neuroscience.
Collapse
Affiliation(s)
- Richard F Betzel
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Danielle S Bassett
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
42
|
Weir WH, Emmons S, Gibson R, Taylor D, Mucha PJ. Post-Processing Partitions to Identify Domains of Modularity Optimization. ALGORITHMS 2017; 10. [PMID: 29046743 PMCID: PMC5642987 DOI: 10.3390/a10030093] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We introduce the Convex Hull of Admissible Modularity Partitions (CHAMP) algorithm to prune and prioritize different network community structures identified across multiple runs of possibly various computational heuristics. Given a set of partitions, CHAMP identifies the domain of modularity optimization for each partition—i.e., the parameter-space domain where it has the largest modularity relative to the input set—discarding partitions with empty domains to obtain the subset of partitions that are “admissible” candidate community structures that remain potentially optimal over indicated parameter domains. Importantly, CHAMP can be used for multi-dimensional parameter spaces, such as those for multilayer networks where one includes a resolution parameter and interlayer coupling. Using the results from CHAMP, a user can more appropriately select robust community structures by observing the sizes of domains of optimization and the pairwise comparisons between partitions in the admissible subset. We demonstrate the utility of CHAMP with several example networks. In these examples, CHAMP focuses attention onto pruned subsets of admissible partitions that are 20-to-1785 times smaller than the sets of unique partitions obtained by community detection heuristics that were input into CHAMP.
Collapse
Affiliation(s)
- William H. Weir
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC 27599, USA
- Correspondence:
| | - Scott Emmons
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Ryan Gibson
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Dane Taylor
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Peter J. Mucha
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|