1
|
Nguyen KC, Jameson CD, Baldwin SA, Nardini JT, Smith RC, Haugh JM, Flores KB. Quantifying collective motion patterns in mesenchymal cell populations using topological data analysis and agent-based modeling. Math Biosci 2024; 370:109158. [PMID: 38373479 DOI: 10.1016/j.mbs.2024.109158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 02/06/2024] [Accepted: 02/11/2024] [Indexed: 02/21/2024]
Abstract
Fibroblasts in a confluent monolayer are known to adopt elongated morphologies in which cells are oriented parallel to their neighbors. We collected and analyzed new microscopy movies to show that confluent fibroblasts are motile and that neighboring cells often move in anti-parallel directions in a collective motion phenomenon we refer to as "fluidization" of the cell population. We used machine learning to perform cell tracking for each movie and then leveraged topological data analysis (TDA) to show that time-varying point-clouds generated by the tracks contain significant topological information content that is driven by fluidization, i.e., the anti-parallel movement of individual neighboring cells and neighboring groups of cells over long distances. We then utilized the TDA summaries extracted from each movie to perform Bayesian parameter estimation for the D'Orsgona model, an agent-based model (ABM) known to produce a wide array of different patterns, including patterns that are qualitatively similar to fluidization. Although the D'Orsgona ABM is a phenomenological model that only describes inter-cellular attraction and repulsion, the estimated region of D'Orsogna model parameter space was consistent across all movies, suggesting that a specific level of inter-cellular repulsion force at close range may be a mechanism that helps drive fluidization patterns in confluent mesenchymal cell populations.
Collapse
Affiliation(s)
- Kyle C Nguyen
- Biomathematics Graduate Program, North Carolina State University, Raleigh, NC 27607, USA; Center for Research in Scientific Computation, North Carolina State University, Raleigh, NC 27607, USA.
| | | | - Scott A Baldwin
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA
| | - John T Nardini
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ 08628, USA
| | - Ralph C Smith
- Department of Mathematics, North Carolina State University, Raleigh, NC 27607, USA
| | - Jason M Haugh
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA
| | - Kevin B Flores
- Center for Research in Scientific Computation, North Carolina State University, Raleigh, NC 27607, USA; Department of Mathematics, North Carolina State University, Raleigh, NC 27607, USA
| |
Collapse
|
2
|
Sudhan Kumar H. Characterization of stability of dynamic particle ensemble systems using topological data analysis. CHAOS (WOODBURY, N.Y.) 2024; 34:013119. [PMID: 38231180 DOI: 10.1063/5.0177180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/14/2023] [Indexed: 01/18/2024]
Abstract
Holes are ubiquitous structures in phase space, and their time evolution could indicate an instability in the dynamics of the system. However, the properties of these holes are difficult to study directly due to their theoretical complexity and lack of computational tools. This study proposes the use of persistent homology (PH), a technique from topological data analysis, as a computational tool for analyzing the properties of these phase-space holes, or more formally the H1 homology class according to PH. Initially, by using a toy data set, it is shown that the time evolution and the growth rate of a H1 class in phase space could be obtained by PH. For further validation, PH is applied to particle ensemble systems, such as the Hamiltonian flow and the two-stream instability (TSI). Both the stable case, where no H1 forms, and the unstable case, where H1 forms, were analyzed. It was shown that PH can distinguish between the stable and unstable cases purely from the phase-space time evolution plots. In unstable TSI, the PH also distinguished the transition of the H1 class from linear to non-linear growth. The growth rate, thus, obtained is in excellent agreement with the growth rate of the particle energy in the TSI system.
Collapse
|
3
|
Tan P, Miles CE. Intrinsic statistical separation of subpopulations in heterogeneous collective motion via dimensionality reduction. Phys Rev E 2024; 109:014403. [PMID: 38366514 DOI: 10.1103/physreve.109.014403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 12/12/2023] [Indexed: 02/18/2024]
Abstract
Collective motion of locally interacting agents is found ubiquitously throughout nature. The inability to probe individuals has driven longstanding interest in the development of methods for inferring the underlying interactions. In the context of heterogeneous collectives, where the population consists of individuals driven by different interactions, existing approaches require some knowledge about the heterogeneities or underlying interactions. Here, we investigate the feasibility of identifying the identities in a heterogeneous collective without such prior knowledge. We numerically explore the behavior of a heterogeneous Vicsek model and find sufficiently long trajectories intrinsically cluster in a principal component analysis-based dimensionally reduced model-agnostic description of the data. We identify how heterogeneities in each parameter in the model (interaction radius, noise, population proportions) dictate this clustering. Finally, we show the generality of this phenomenon by finding similar behavior in a heterogeneous D'Orsogna model. Altogether, our results establish and quantify the intrinsic model-agnostic statistical disentanglement of identities in heterogeneous collectives.
Collapse
Affiliation(s)
- Pei Tan
- Mathematical, Computational, and Systems Biology Graduate Program, University of California, Irvine 92697, USA
| | | |
Collapse
|
4
|
Bhaskar D, Zhang WY, Volkening A, Sandstede B, Wong IY. Topological data analysis of spatial patterning in heterogeneous cell populations: clustering and sorting with varying cell-cell adhesion. NPJ Syst Biol Appl 2023; 9:43. [PMID: 37709793 PMCID: PMC10502054 DOI: 10.1038/s41540-023-00302-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 08/14/2023] [Indexed: 09/16/2023] Open
Abstract
Different cell types aggregate and sort into hierarchical architectures during the formation of animal tissues. The resulting spatial organization depends (in part) on the strength of adhesion of one cell type to itself relative to other cell types. However, automated and unsupervised classification of these multicellular spatial patterns remains challenging, particularly given their structural diversity and biological variability. Recent developments based on topological data analysis are intriguing to reveal similarities in tissue architecture, but these methods remain computationally expensive. In this article, we show that multicellular patterns organized from two interacting cell types can be efficiently represented through persistence images. Our optimized combination of dimensionality reduction via autoencoders, combined with hierarchical clustering, achieved high classification accuracy for simulations with constant cell numbers. We further demonstrate that persistence images can be normalized to improve classification for simulations with varying cell numbers due to proliferation. Finally, we systematically consider the importance of incorporating different topological features as well as information about each cell type to improve classification accuracy. We envision that topological machine learning based on persistence images will enable versatile and robust classification of complex tissue architectures that occur in development and disease.
Collapse
Affiliation(s)
- Dhananjay Bhaskar
- School of Engineering, Brown University, Providence, RI, USA
- Center for Biomedical Engineering, Brown University, Providence, RI, USA
- Data Science Institute, Brown University, Providence, RI, USA
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | - William Y Zhang
- Data Science Institute, Brown University, Providence, RI, USA
- Division of Applied Mathematics, Brown University, Providence, RI, USA
- Operations Research Center, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - Björn Sandstede
- Data Science Institute, Brown University, Providence, RI, USA
- Division of Applied Mathematics, Brown University, Providence, RI, USA
| | - Ian Y Wong
- School of Engineering, Brown University, Providence, RI, USA.
- Center for Biomedical Engineering, Brown University, Providence, RI, USA.
- Data Science Institute, Brown University, Providence, RI, USA.
- Legorreta Cancer Center, Brown University, Providence, RI, USA.
| |
Collapse
|
5
|
McDonald RA, Neuhausler R, Robinson M, Larsen LG, Harrington HA, Bruna M. Zigzag persistence for coral reef resilience using a stochastic spatial model. J R Soc Interface 2023; 20:20230280. [PMID: 37608713 PMCID: PMC10445017 DOI: 10.1098/rsif.2023.0280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 08/04/2023] [Indexed: 08/24/2023] Open
Abstract
A complex interplay between species governs the evolution of spatial patterns in ecology. An open problem in the biological sciences is characterizing spatio-temporal data and understanding how changes at the local scale affect global dynamics/behaviour. Here, we extend a well-studied temporal mathematical model of coral reef dynamics to include stochastic and spatial interactions and generate data to study different ecological scenarios. We present descriptors to characterize patterns in heterogeneous spatio-temporal data surpassing spatially averaged measures. We apply these descriptors to simulated coral data and demonstrate the utility of two topological data analysis techniques-persistent homology and zigzag persistence-for characterizing mechanisms of reef resilience. We show that the introduction of local competition between species leads to the appearance of coral clusters in the reef. We use our analyses to distinguish temporal dynamics stemming from different initial configurations of coral, showing that the neighbourhood composition of coral sites determines their long-term survival. Using zigzag persistence, we determine which spatial configurations protect coral from extinction in different environments. Finally, we apply this toolkit of multi-scale methods to empirical coral reef data, which distinguish spatio-temporal reef dynamics in different locations, and demonstrate the applicability to a range of datasets.
Collapse
Affiliation(s)
- R. A. McDonald
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK
| | - R. Neuhausler
- Department of Geography, University of California, Berkeley, CA 94720, USA
| | - M. Robinson
- Computer Science Department, University of Oxford, Oxford OX1 3QG, UK
| | - L. G. Larsen
- Department of Geography, University of California, Berkeley, CA 94720, USA
| | - H. A. Harrington
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - M. Bruna
- Department of Applied Mathematics and Theoretical Physics, Centre for Mathematical Sciences, University of Cambridge, Cambridge CB3 0WA, UK
| |
Collapse
|
6
|
Gu K, Yan L, Li X, Duan X, Liang J. Change point detection in multi-agent systems based on higher-order features. CHAOS (WOODBURY, N.Y.) 2022; 32:111102. [PMID: 36456327 DOI: 10.1063/5.0126848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Accepted: 10/20/2022] [Indexed: 06/17/2023]
Abstract
Change point detection (CPD) for multi-agent systems helps one to evaluate the state and better control the system. Multivariate CPD methods solve the d × T time series well; however, the multi-agent systems often produce the N × d × T dimensional data, where d is the dimension of multivariate observations, T is the total observation time, and N is the number of agents. In this paper, we propose two valid approaches based on higher-order features, namely, the Betti number feature extraction and the Persistence feature extraction, to compress the d-dimensional features into one dimension so that general CPD methods can be applied to higher-dimensional data. First, a topological structure based on the Vietoris-Rips complex is constructed on each time-slice snapshot. Then, the Betti number and persistence of the topological structures are obtained to separately constitute two feature matrices for change point estimates. Higher-order features primarily describe the data distribution on each snapshot and are, therefore, independent of the node correspondence cross snapshots, which gives our methods unique advantages in processing missing data. Experiments in multi-agent systems demonstrate the significant performance of our methods. We believe that our methods not only provide a new tool for dimensionality reduction and missing data in multi-agent systems but also have the potential to be applied to a wider range of fields, such as complex networks.
Collapse
Affiliation(s)
- Kongjing Gu
- College of Science, National University of Defense Technology, Changsha, Hunan 410073, People's Republic of China
| | - Liang Yan
- College of Science, National University of Defense Technology, Changsha, Hunan 410073, People's Republic of China
| | - Xiang Li
- College of Science, National University of Defense Technology, Changsha, Hunan 410073, People's Republic of China
| | - Xiaojun Duan
- College of Science, National University of Defense Technology, Changsha, Hunan 410073, People's Republic of China
| | - Jingjie Liang
- College of Science, National University of Defense Technology, Changsha, Hunan 410073, People's Republic of China
| |
Collapse
|
7
|
Messenger DA, Wheeler GE, Liu X, Bortz DM. Learning anisotropic interaction rules from individual trajectories in a heterogeneous cellular population. J R Soc Interface 2022; 19:20220412. [PMCID: PMC9554727 DOI: 10.1098/rsif.2022.0412] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Interacting particle system (IPS) models have proven to be highly successful for describing the spatial movement of organisms. However, it is challenging to infer the interaction rules directly from data. In the field of equation discovery, the weak-form sparse identification of nonlinear dynamics (WSINDy) methodology has been shown to be computationally efficient for identifying the governing equations of complex systems from noisy data. Motivated by the success of IPS models to describe the spatial movement of organisms, we develop WSINDy for the second-order IPS to learn equations for communities of cells. Our approach learns the directional interaction rules for each individual cell that in aggregate govern the dynamics of a heterogeneous population of migrating cells. To sort a cell according to the active classes present in its model, we also develop a novel ad hoc classification scheme (which accounts for the fact that some cells do not have enough evidence to accurately infer a model). Aggregated models are then constructed hierarchically to simultaneously identify different species of cells present in the population and determine best-fit models for each species. We demonstrate the efficiency and proficiency of the method on several test scenarios, motivated by common cell migration experiments.
Collapse
Affiliation(s)
- Daniel A. Messenger
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
| | - Graycen E. Wheeler
- Department of Biochemistry, University of Colorado, Boulder, CO 80309-0526, USA
| | - Xuedong Liu
- Department of Biochemistry, University of Colorado, Boulder, CO 80309-0526, USA
| | - David M. Bortz
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
| |
Collapse
|
8
|
Güzel İ, Munch E, Khasawneh FA. Detecting bifurcations in dynamical systems with CROCKER plots. CHAOS (WOODBURY, N.Y.) 2022; 32:093111. [PMID: 36182371 DOI: 10.1063/5.0102421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 08/05/2022] [Indexed: 06/16/2023]
Abstract
Existing tools for bifurcation detection from signals of dynamical systems typically are either limited to a special class of systems or they require carefully chosen input parameters and a significant expertise to interpret the results. Therefore, we describe an alternative method based on persistent homology-a tool from topological data analysis-that utilizes Betti numbers and CROCKER plots. Betti numbers are topological invariants of topological spaces, while the CROCKER plot is a coarsened but easy to visualize data representation of a one-parameter varying family of persistence barcodes. The specific bifurcations we investigate are transitions from periodic to chaotic behavior or vice versa in a one-parameter collection of differential equations. We validate our methods using numerical experiments on ten dynamical systems and contrast the results with existing tools that use the maximum Lyapunov exponent. We further prove the relationship between the Wasserstein distance to the empty diagram and the norm of the Betti vector, which shows that an even more simplified version of the information has the potential to provide insight into the bifurcation parameter. The results show that our approach reveals more information about the shape of the periodic attractor than standard tools, and it has more favorable computational time in comparison with the Rösenstein algorithm for computing the maximum Lyapunov exponent.
Collapse
Affiliation(s)
- İsmail Güzel
- Department of Mathematics Engineering, İstanbul Technical University, Maslak, İstanbul 34469, Turkey
| | - Elizabeth Munch
- Department of Computational Mathematics, Science and Engineering and Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, USA
| | - Firas A Khasawneh
- Department of Mechanical Engineering, Michigan State University, East Lansing, Michigan 48824, USA
| |
Collapse
|
9
|
Leykam D, Rondón I, Angelakis DG. Dark soliton detection using persistent homology. CHAOS (WOODBURY, N.Y.) 2022; 32:073133. [PMID: 35907713 DOI: 10.1063/5.0097053] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 07/01/2022] [Indexed: 06/15/2023]
Abstract
Classifying images often requires manual identification of qualitative features. Machine learning approaches including convolutional neural networks can achieve accuracy comparable to human classifiers but require extensive data and computational resources to train. We show how a topological data analysis technique, persistent homology, can be used to rapidly and reliably identify qualitative features in experimental image data. The identified features can be used as inputs to simple supervised machine learning models, such as logistic regression models, which are easier to train. As an example, we consider the identification of dark solitons using a dataset of 6257 labeled atomic Bose-Einstein condensate density images.
Collapse
Affiliation(s)
- Daniel Leykam
- Centre for Quantum Technologies, National University of Singapore, 3 Science Drive 2, Singapore 117543
| | - Irving Rondón
- School of Computational Sciences, Korea Institute for Advanced Study, 85 Hoegi-ro, Seoul 02455, Republic of Korea
| | - Dimitris G Angelakis
- Centre for Quantum Technologies, National University of Singapore, 3 Science Drive 2, Singapore 117543
| |
Collapse
|
10
|
Systematic Analysis of Emergent Collective Motion Produced by a 3D Hybrid Zonal Model. Bull Math Biol 2021; 84:16. [PMID: 34921628 DOI: 10.1007/s11538-021-00977-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 11/18/2021] [Indexed: 10/19/2022]
Abstract
Emergent patterns of collective motion are thought to arise from local rules of interaction that govern how individuals adjust their velocity in response to the relative locations and velocities of near neighbours. Many models of collective motion apply rules of interaction over a metric scale, based on the distances to neighbouring group members. However, empirical work suggests that some species apply interactions over a topological scale, based on distance determined neighbour rank. Here, we modify an important metric model of collective motion (Couzin et al. in J Theor Biol 218(1):1-11, 2002), so that interactions relating to orienting movements with neighbours and attraction towards more distant neighbours operate over topological scales. We examine the emergent group movement patterns generated by the model as the numbers of neighbours that contribute to orientation- and attraction-based velocity adjustments vary. Like the metric form of the model, simulated groups can fragment (when interactions are influenced by less than 10-15% of the group), swarm and move in parallel, but milling does not occur. The model also generates other cohesive group movements including cases where groups exhibit directed motion without strong overall alignment of individuals. Multiple emergent states are possible for the same set of underlying model parameters in some cases, suggesting sensitivity to initial conditions, and there is evidence that emergent states of the system depend on the history of the system. Groups that do not fragment tend to stay relatively compact in terms of neighbour distances. Even if a group does fragment, individuals remain relatively close to near neighbours, avoiding complete isolation.
Collapse
|
11
|
Li L, Thompson C, Henselman-Petrusek G, Giusti C, Ziegelmeier L. Minimal Cycle Representatives in Persistent Homology Using Linear Programming: An Empirical Study With User's Guide. Front Artif Intell 2021; 4:681117. [PMID: 34708196 PMCID: PMC8544243 DOI: 10.3389/frai.2021.681117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 05/14/2021] [Indexed: 12/24/2022] Open
Abstract
Cycle representatives of persistent homology classes can be used to provide descriptions of topological features in data. However, the non-uniqueness of these representatives creates ambiguity and can lead to many different interpretations of the same set of classes. One approach to solving this problem is to optimize the choice of representative against some measure that is meaningful in the context of the data. In this work, we provide a study of the effectiveness and computational cost of severalℓ 1 minimization optimization procedures for constructing homological cycle bases for persistent homology with rational coefficients in dimension one, including uniform-weighted and length-weighted edge-loss algorithms as well as uniform-weighted and area-weighted triangle-loss algorithms. We conduct these optimizations via standard linear programming methods, applying general-purpose solvers to optimize over column bases of simplicial boundary matrices. Our key findings are: 1) optimization is effective in reducing the size of cycle representatives, though the extent of the reduction varies according to the dimension and distribution of the underlying data, 2) the computational cost of optimizing a basis of cycle representatives exceeds the cost of computing such a basis, in most data sets we consider, 3) the choice of linear solvers matters a lot to the computation time of optimizing cycles, 4) the computation time of solving an integer program is not significantly longer than the computation time of solving a linear program for most of the cycle representatives, using the Gurobi linear solver, 5) strikingly, whether requiring integer solutions or not, we almost always obtain a solution with the same cost and almost all solutions found have entries in{ - 1,0,1 } and therefore, are also solutions to a restrictedℓ 0 optimization problem, and 6) we obtain qualitatively different results for generators in Erdős-Rényi random clique complexes than in real-world and synthetic point cloud data.
Collapse
Affiliation(s)
- Lu Li
- Mathematics, Statistics, and Computer Science Department, Macalester College, Saint Paul, MN, United States
| | - Connor Thompson
- Department of Mathematics, Purdue University, West Lafayette, IN, United States
| | | | - Chad Giusti
- Department of Mathematical Sciences, University of Delaware, Newark, DE, United States
| | - Lori Ziegelmeier
- Mathematics, Statistics, and Computer Science Department, Macalester College, Saint Paul, MN, United States
| |
Collapse
|
12
|
Nardini JT, Stolz BJ, Flores KB, Harrington HA, Byrne HM. Topological data analysis distinguishes parameter regimes in the Anderson-Chaplain model of angiogenesis. PLoS Comput Biol 2021; 17:e1009094. [PMID: 34181657 PMCID: PMC8270459 DOI: 10.1371/journal.pcbi.1009094] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 07/09/2021] [Accepted: 05/18/2021] [Indexed: 12/27/2022] Open
Abstract
Angiogenesis is the process by which blood vessels form from pre-existing vessels. It plays a key role in many biological processes, including embryonic development and wound healing, and contributes to many diseases including cancer and rheumatoid arthritis. The structure of the resulting vessel networks determines their ability to deliver nutrients and remove waste products from biological tissues. Here we simulate the Anderson-Chaplain model of angiogenesis at different parameter values and quantify the vessel architectures of the resulting synthetic data. Specifically, we propose a topological data analysis (TDA) pipeline for systematic analysis of the model. TDA is a vibrant and relatively new field of computational mathematics for studying the shape of data. We compute topological and standard descriptors of model simulations generated by different parameter values. We show that TDA of model simulation data stratifies parameter space into regions with similar vessel morphology. The methodologies proposed here are widely applicable to other synthetic and experimental data including wound healing, development, and plant biology.
Collapse
Affiliation(s)
- John T. Nardini
- Department of Mathematics, North Carolina State University, Raleigh, North Carolina, United States of America
| | | | - Kevin B. Flores
- Department of Mathematics, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Heather A. Harrington
- Mathematical Institute, University of Oxford, Oxford, United Kingdom
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Helen M. Byrne
- Mathematical Institute, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
13
|
Adams H, Moy M. Topology Applied to Machine Learning: From Global to Local. Front Artif Intell 2021; 4:668302. [PMID: 34056580 PMCID: PMC8160457 DOI: 10.3389/frai.2021.668302] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 04/15/2021] [Indexed: 11/24/2022] Open
Abstract
Through the use of examples, we explain one way in which applied topology has evolved since the birth of persistent homology in the early 2000s. The first applications of topology to data emphasized the global shape of a dataset, such as the three-circle model for 3 × 3 pixel patches from natural images, or the configuration space of the cyclo-octane molecule, which is a sphere with a Klein bottle attached via two circles of singularity. In these studies of global shape, short persistent homology bars are disregarded as sampling noise. More recently, however, persistent homology has been used to address questions about the local geometry of data. For instance, how can local geometry be vectorized for use in machine learning problems? Persistent homology and its vectorization methods, including persistence landscapes and persistence images, provide popular techniques for incorporating both local geometry and global topology into machine learning. Our meta-hypothesis is that the short bars are as important as the long bars for many machine learning tasks. In defense of this claim, we survey applications of persistent homology to shape recognition, agent-based modeling, materials science, archaeology, and biology. Additionally, we survey work connecting persistent homology to geometric features of spaces, including curvature and fractal dimension, and various methods that have been used to incorporate persistent homology into machine learning.
Collapse
Affiliation(s)
- Henry Adams
- Department of Mathematics, Colorado State University, Fort Collins, CO, United States
| | - Michael Moy
- Department of Mathematics, Colorado State University, Fort Collins, CO, United States
| |
Collapse
|
14
|
Bhaskar D, Zhang WY, Wong IY. Topological data analysis of collective and individual epithelial cells using persistent homology of loops. SOFT MATTER 2021; 17:4653-4664. [PMID: 33949592 PMCID: PMC8276269 DOI: 10.1039/d1sm00072a] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Interacting, self-propelled particles such as epithelial cells can dynamically self-organize into complex multicellular patterns, which are challenging to classify without a priori information. Classically, different phases and phase transitions have been described based on local ordering, which may not capture structural features at larger length scales. Instead, topological data analysis (TDA) determines the stability of spatial connectivity at varying length scales (i.e. persistent homology), and can compare different particle configurations based on the "cost" of reorganizing one configuration into another. Here, we demonstrate a topology-based machine learning approach for unsupervised profiling of individual and collective phases based on large-scale loops. We show that these topological loops (i.e. dimension 1 homology) are robust to variations in particle number and density, particularly in comparison to connected components (i.e. dimension 0 homology). We use TDA to map out phase diagrams for simulated particles with varying adhesion and propulsion, at constant population size as well as when proliferation is permitted. Next, we use this approach to profile our recent experiments on the clustering of epithelial cells in varying growth factor conditions, which are compared to our simulations. Finally, we characterize the robustness of this approach at varying length scales, with sparse sampling, and over time. Overall, we envision TDA will be broadly applicable as a model-agnostic approach to analyze active systems with varying population size, from cytoskeletal motors to motile cells to flocking or swarming animals.
Collapse
Affiliation(s)
- Dhananjay Bhaskar
- School of Engineering, Center for Biomedical Engineering, Brown University, 184 Hope St Box D, Providence, RI 02912, USA. and Data Science Initiative, Brown University, 184 Hope St Box D, Providence, RI 02912, USA
| | - William Y Zhang
- Department of Computer Science, Brown University, 184 Hope St Box D, Providence, RI 02912, USA
| | - Ian Y Wong
- School of Engineering, Center for Biomedical Engineering, Brown University, 184 Hope St Box D, Providence, RI 02912, USA. and Data Science Initiative, Brown University, 184 Hope St Box D, Providence, RI 02912, USA
| |
Collapse
|
15
|
Mudaliar RK, Schaerf TM. Examination of an averaging method for estimating repulsion and attraction interactions in moving groups. PLoS One 2020; 15:e0243631. [PMID: 33296438 PMCID: PMC7725364 DOI: 10.1371/journal.pone.0243631] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 11/24/2020] [Indexed: 02/07/2023] Open
Abstract
Groups of animals coordinate remarkable, coherent, movement patterns during periods of collective motion. Such movement patterns include the toroidal mills seen in fish shoals, highly aligned parallel motion like that of flocks of migrating birds, and the swarming of insects. Since the 1970's a wide range of collective motion models have been studied that prescribe rules of interaction between individuals, and that are capable of generating emergent patterns that are visually similar to those seen in real animal group. This does not necessarily mean that real animals apply exactly the same interactions as those prescribed in models. In more recent work, researchers have sought to infer the rules of interaction of real animals directly from tracking data, by using a number of techniques, including averaging methods. In one of the simplest formulations, the averaging methods determine the mean changes in the components of the velocity of an individual over time as a function of the relative coordinates of group mates. The averaging methods can also be used to estimate other closely related quantities including the mean relative direction of motion of group mates as a function of their relative coordinates. Since these methods for extracting interaction rules and related quantities from trajectory data are relatively new, the accuracy of these methods has had limited inspection. In this paper, we examine the ability of an averaging method to reveal prescribed rules of interaction from data generated by two individual based models for collective motion. Our work suggests that an averaging method can capture the qualitative features of underlying interactions from trajectory data alone, including repulsion and attraction effects evident in changes in speed and direction of motion, and the presence of a blind zone. However, our work also illustrates that the output from a simple averaging method can be affected by emergent group level patterns of movement, and the sizes of the regions over which repulsion and attraction effects are apparent can be distorted depending on how individuals combine interactions with multiple group mates.
Collapse
Affiliation(s)
- Rajnesh K. Mudaliar
- School of Science and Technology, University of New England, Armidale, NSW, Australia
- School of Mathematical and Computing Science, Fiji National University, Suva, Fiji
| | - Timothy M. Schaerf
- School of Science and Technology, University of New England, Armidale, NSW, Australia
| |
Collapse
|
16
|
Buttenschön A, Edelstein-Keshet L. Bridging from single to collective cell migration: A review of models and links to experiments. PLoS Comput Biol 2020; 16:e1008411. [PMID: 33301528 PMCID: PMC7728230 DOI: 10.1371/journal.pcbi.1008411] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Mathematical and computational models can assist in gaining an understanding of cell behavior at many levels of organization. Here, we review models in the literature that focus on eukaryotic cell motility at 3 size scales: intracellular signaling that regulates cell shape and movement, single cell motility, and collective cell behavior from a few cells to tissues. We survey recent literature to summarize distinct computational methods (phase-field, polygonal, Cellular Potts, and spherical cells). We discuss models that bridge between levels of organization, and describe levels of detail, both biochemical and geometric, included in the models. We also highlight links between models and experiments. We find that models that span the 3 levels are still in the minority.
Collapse
Affiliation(s)
- Andreas Buttenschön
- Department of Mathematics, University of British Columbia, Vancouver, Canada
| | | |
Collapse
|