1
|
Ramos RH, Bardelotte YA, de Oliveira Lage Ferreira C, Simao A. Identifying key genes in cancer networks using persistent homology. Sci Rep 2025; 15:2751. [PMID: 39838168 PMCID: PMC11751331 DOI: 10.1038/s41598-025-87265-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Accepted: 01/17/2025] [Indexed: 01/23/2025] Open
Abstract
Identifying driver genes is crucial for understanding oncogenesis and developing targeted cancer therapies. Driver discovery methods using protein or pathway networks rely on traditional network science measures, focusing on nodes, edges, or community metrics. These methods can overlook the high-dimensional interactions that cancer genes have within cancer networks. This study presents a novel method using Persistent Homology to analyze the role of driver genes in higher-order structures within Cancer Consensus Networks derived from main cellular pathways. We integrate mutation data from six cancer types and three biological functions: DNA Repair, Chromatin Organization, and Programmed Cell Death. We systematically evaluated the impact of gene removal on topological voids ([Formula: see text] structures) within the Cancer Consensus Networks. Our results reveal that only known driver genes and cancer-associated genes influence these structures, while passenger genes do not. Although centrality measures alone proved insufficient to fully characterize impact genes, combining higher-order topological analysis with traditional network metrics can improve the precision of distinguishing between drivers and passengers. This work shows that cancer genes play an important role in higher-order structures, going beyond pairwise measures, and provides an approach to distinguish drivers and cancer-associated genes from passenger genes.
Collapse
Affiliation(s)
- Rodrigo Henrique Ramos
- University of São Paulo, ICMC, São Carlos, 13566-590, Brazil.
- Federal Institute of São Paulo, São Carlos, 13565-820, Brazil.
| | | | | | - Adenilso Simao
- University of São Paulo, ICMC, São Carlos, 13566-590, Brazil
| |
Collapse
|
2
|
Jiang C, Jiang Z, Zhang X, Qu L, Fu K, Teng Y, Lai R, Guo R, Ding C, Li K, Tian R. Robust and interpretable deep learning system for prognostic stratification of extranodal natural killer/T-cell lymphoma. Eur J Nucl Med Mol Imaging 2024:10.1007/s00259-024-07024-x. [PMID: 39714634 DOI: 10.1007/s00259-024-07024-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 12/04/2024] [Indexed: 12/24/2024]
Abstract
PURPOSE Extranodal natural killer/T-cell lymphoma (ENKTCL) is an hematologic malignancy with prognostic heterogeneity. We aimed to develop and validate DeepENKTCL, an interpretable deep learning prediction system for prognosis risk stratification in ENKTCL. METHODS A total of 562 patients from four centers were divided into the training cohort, validation cohort and test cohort. DeepENKTCL combined a tumor segmentation model, a PET/CT fusion model, and prognostic prediction models. RadScore and TopoScore were constructed using radiomics and topological features derived from fused images, with SHapley Additive exPlanations (SHAP) analysis enhancing interpretability. The final prognostic models, termed FusionScore, were developed for predicting progression-free survival (PFS) and overall survival (OS). Performance was assessed using area under the receiver operator characteristic curve (AUC), time-dependent C-index, clinical decision curves (DCA), and Kaplan-Meier (KM) curves. RESULTS The tumor segmentation model accurately delineated the tumor lesions. RadScore (AUC: 0.908 for PFS, 0.922 for OS in validation; 0.822 for PFS, 0.867 for OS in test) and TopoScore (AUC: 0.756 for PFS, 0.805 for OS in validation; 0.689 for PFS, 0.769 for OS in test) both exhibited potential prognostic capability. The time-dependent C-index (0.897 for PFS, 0.928 for OS in validation; 0.894 for PFS, 0.868 for OS in test) and DCA indicated that FusionScore offers significant prognostic performance and superior net clinical benefits compared to existing models. KM survival analysis showed that higher FusionScores correlated with poorer PFS and OS across all cohorts. CONCLUSION DeepENKTCL provided a robust and interpretable framework for ENKTCL prognosis, with the potential to improve patient outcomes and guide personalized treatment.
Collapse
Affiliation(s)
- Chong Jiang
- Department of Nuclear Medicine, West China Hospital, Sichuan University, No.37, Guoxue Alley, Chengdu, Sichuan, 610041, China
| | - Zekun Jiang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Xinyu Zhang
- Department of Nuclear Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, No. 197 Ruijin Second Road, Shanghai, 200025, China
| | - Linhao Qu
- School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Kexue Fu
- School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Yue Teng
- Department of Nuclear Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Ruihe Lai
- Department of Nuclear Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Rui Guo
- Department of Nuclear Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, No. 197 Ruijin Second Road, Shanghai, 200025, China.
| | - Chongyang Ding
- Department of Nuclear Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, No.321, Zhongshan Road, Nanjing, Jiangsu, 210008, China.
| | - Kang Li
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China.
- Med-X Center for Informatics, Sichuan University, Chengdu, China.
| | - Rong Tian
- Department of Nuclear Medicine, West China Hospital, Sichuan University, No.37, Guoxue Alley, Chengdu, Sichuan, 610041, China.
| |
Collapse
|
3
|
Nardi G, Torcq L, Schmidt AA, Olivo-Marin JC. Topology-based segmentation of 3D confocal images of emerging hematopoietic stem cells in the zebrafish embryo. BIOLOGICAL IMAGING 2024; 4:e11. [PMID: 39776612 PMCID: PMC11704129 DOI: 10.1017/s2633903x24000102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 06/08/2024] [Accepted: 06/18/2024] [Indexed: 01/11/2025]
Abstract
We develop a novel method for image segmentation of 3D confocal microscopy images of emerging hematopoietic stem cells. The method is based on the theory of persistent homology and uses an optimal threshold to select the most persistent cycles in the persistence diagram. This enables the segmentation of the image's most contrasted and representative shapes. Coupling this segmentation method with a meshing algorithm, we define a pipeline for 3D reconstruction of confocal volumes. Compared to related methods, this approach improves shape segmentation, is more ergonomic to automatize, and has fewer parameters. We apply it to the segmentation of membranes, at subcellular resolution, of cells involved in the endothelial-to-hematopoietic transition (EHT) in the zebrafish embryos.
Collapse
Affiliation(s)
- G. Nardi
- Biological Image Analysis Unit, Institut Pasteur, Université Paris Cité, Paris, France
- CNRS UMR3691, Paris, France
| | - L. Torcq
- Department of Developmental and Stem Cell Biology, Institut Pasteur, Université Paris Cité, Paris, France
- CNRS UMR3738, Paris, France
- Collège doctoral, Sorbonne Université, Paris, France
| | - A. A. Schmidt
- Department of Developmental and Stem Cell Biology, Institut Pasteur, Université Paris Cité, Paris, France
- CNRS UMR3738, Paris, France
| | - J.-C. Olivo-Marin
- Biological Image Analysis Unit, Institut Pasteur, Université Paris Cité, Paris, France
- CNRS UMR3691, Paris, France
| |
Collapse
|
4
|
Chung YM, Hu CS, Sun E, Tseng HC. Morphological multiparameter filtration and persistent homology in mitochondrial image analysis. PLoS One 2024; 19:e0310157. [PMID: 39302926 DOI: 10.1371/journal.pone.0310157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Accepted: 08/25/2024] [Indexed: 09/22/2024] Open
Abstract
The complexity of branching and curvilinear morphology of a complete mitochondrial network within each cell is challenging to analyze and quantify. To address this challenge, we developed an image analysis technique using persistent homology with a multiparameter filtration framework, combining image processing techniques in mathematical morphology. We show that such filtrations contain both topological and geometric information about complex cellular organelle structures, which allows a software program to extract meaningful features. Using this information, we also develop a connectivity index that describes the morphology of the branching patterns. As proof of concept, we utilize this approach to study how mitochondrial networks are altered by genetic changes in the Optineurin gene. Mutations in the autophagy gene Optineurin (OPTN) are associated with primary open-angle glaucoma (POAG), amyotrophic lateral sclerosis (ALS), and Paget's disease of the bone, but the pathophysiological mechanism is unclear. We utilized the proposed mathematical morphology-based multiparameter filtration and persistent homology approach to analyze and quantitatively compare how changes in the OPTN gene alter mitochondrial structures from their normal interconnected, tubular morphology into scattered, fragmented pieces.
Collapse
Affiliation(s)
- Yu-Min Chung
- Eli Lilly and Company, Indianapolis, IN, United States of America
| | - Chuan-Shen Hu
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, Singapore
| | - Emily Sun
- Columbia Ophthalmology, Columbia University Irving Medical Center, New York, NY, United States of America
| | - Henry C Tseng
- Duke Eye Center, Department of Ophthalmology, Duke University Medical Center, Durham, NC, United States of America
| |
Collapse
|
5
|
Stolz BJ, Dhesi J, Bull JA, Harrington HA, Byrne HM, Yoon IHR. Relational Persistent Homology for Multispecies Data with Application to the Tumor Microenvironment. Bull Math Biol 2024; 86:128. [PMID: 39287883 PMCID: PMC11408586 DOI: 10.1007/s11538-024-01353-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 08/29/2024] [Indexed: 09/19/2024]
Abstract
Topological data analysis (TDA) is an active field of mathematics for quantifying shape in complex data. Standard methods in TDA such as persistent homology (PH) are typically focused on the analysis of data consisting of a single entity (e.g., cells or molecular species). However, state-of-the-art data collection techniques now generate exquisitely detailed multispecies data, prompting a need for methods that can examine and quantify the relations among them. Such heterogeneous data types arise in many contexts, ranging from biomedical imaging, geospatial analysis, to species ecology. Here, we propose two methods for encoding spatial relations among different data types that are based on Dowker complexes and Witness complexes. We apply the methods to synthetic multispecies data of a tumor microenvironment and analyze topological features that capture relations between different cell types, e.g., blood vessels, macrophages, tumor cells, and necrotic cells. We demonstrate that relational topological features can extract biological insight, including the dominant immune cell phenotype (an important predictor of patient prognosis) and the parameter regimes of a data-generating model. The methods provide a quantitative perspective on the relational analysis of multispecies spatial data, overcome the limits of traditional PH, and are readily computable.
Collapse
Affiliation(s)
- Bernadette J Stolz
- Laboratory for Topology and Neuroscience, EPFL, Station 8, Lausanne, 1015, Switzerland
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Woodstock Rd, Oxford, OX2 6GG, UK
| | - Jagdeep Dhesi
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Woodstock Rd, Oxford, OX2 6GG, UK
| | - Joshua A Bull
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Woodstock Rd, Oxford, OX2 6GG, UK
| | - Heather A Harrington
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Woodstock Rd, Oxford, OX2 6GG, UK
- Wellcome Centre for Human Genetics, University of Oxford, Roosevelt Dr, Headington, Headington, Oxford, OX3 7BN, UK
| | - Helen M Byrne
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Woodstock Rd, Oxford, OX2 6GG, UK
- Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Build, Roosevelt Dr, Headington, Oxford, OX3 7DQ, UK
| | - Iris H R Yoon
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Woodstock Rd, Oxford, OX2 6GG, UK.
- Department of Mathematics and Computer Science, Wesleyan University, 265 Church Street, Middletown, 06459, USA.
| |
Collapse
|
6
|
Manrique-Castano D, Bhaskar D, ElAli A. Dissecting glial scar formation by spatial point pattern and topological data analysis. Sci Rep 2024; 14:19035. [PMID: 39152163 PMCID: PMC11329771 DOI: 10.1038/s41598-024-69426-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 08/05/2024] [Indexed: 08/19/2024] Open
Abstract
Glial scar formation represents a fundamental response to central nervous system (CNS) injuries. It is mainly characterized by a well-defined spatial rearrangement of reactive astrocytes and microglia. The mechanisms underlying glial scar formation have been extensively studied, yet quantitative descriptors of the spatial arrangement of reactive glial cells remain limited. Here, we present a novel approach using point pattern analysis (PPA) and topological data analysis (TDA) to quantify spatial patterns of reactive glial cells after experimental ischemic stroke in mice. We provide open and reproducible tools using R and Julia to quantify spatial intensity, cell covariance and conditional distribution, cell-to-cell interactions, and short/long-scale arrangement, which collectively disentangle the arrangement patterns of the glial scar. This approach unravels a substantial divergence in the distribution of GFAP+ and IBA1+ cells after injury that conventional analysis methods cannot fully characterize. PPA and TDA are valuable tools for studying the complex spatial arrangement of reactive glia and other nervous cells following CNS injuries and have potential applications for evaluating glial-targeted restorative therapies.
Collapse
Affiliation(s)
- Daniel Manrique-Castano
- Neuroscience Axis, Research Center of CHU de Québec-Université Laval, Quebec City, QC, Canada.
- Department of Psychiatry and Neuroscience, Faculty of Medicine, Université Laval, Quebec City, QC, Canada.
| | | | - Ayman ElAli
- Neuroscience Axis, Research Center of CHU de Québec-Université Laval, Quebec City, QC, Canada.
- Department of Psychiatry and Neuroscience, Faculty of Medicine, Université Laval, Quebec City, QC, Canada.
| |
Collapse
|
7
|
Aggarwal M, Periwal V. Dory: Computation of persistence diagrams up to dimension two for Vietoris-Rips filtrations of large data sets. JOURNAL OF COMPUTATIONAL SCIENCE 2024; 79:102290. [PMID: 38774487 PMCID: PMC11105796 DOI: 10.1016/j.jocs.2024.102290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2024]
Abstract
Persistent homology (PH) is an approach to topological data analysis (TDA) that computes multi-scale topologically invariant properties of high-dimensional data that are robust to noise. While PH has revealed useful patterns across various applications, computational requirements have limited applications to small data sets of a few thousand points. We present Dory, an efficient and scalable algorithm that can compute the persistent homology of sparse Vietoris-Rips complexes on larger data sets, up to and including dimension two and over the field Z 2 . As an application, we compute the PH of the human genome at high resolution as revealed by a genome-wide Hi-C data set containing approximately three million points. Extant algorithms were unable to process it, whereas Dory processed it within five minutes, using less than five GB of memory. Results show that the topology of the human genome changes significantly upon treatment with auxin, a molecule that degrades cohesin, corroborating the hypothesis that cohesin plays a crucial role in loop formation in DNA.
Collapse
Affiliation(s)
- Manu Aggarwal
- Laboratory of Biological Modeling, NIDDK, National Institutes of Health, 31 Center Dr, Bethesda, 20892, MD, United States
| | - Vipul Periwal
- Laboratory of Biological Modeling, NIDDK, National Institutes of Health, 31 Center Dr, Bethesda, 20892, MD, United States
| |
Collapse
|
8
|
Levenson RM, Singh Y, Rieck B, Hathaway QA, Farrelly C, Rozenblit J, Prasanna P, Erickson B, Choudhary A, Carlsson G, Sarkar D. Advancing Precision Medicine: Algebraic Topology and Differential Geometry in Radiology and Computational Pathology. J Transl Med 2024; 104:102060. [PMID: 38626875 DOI: 10.1016/j.labinv.2024.102060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 04/08/2024] [Accepted: 04/10/2024] [Indexed: 05/19/2024] Open
Abstract
Precision medicine aims to provide personalized care based on individual patient characteristics, rather than guideline-directed therapies for groups of diseases or patient demographics. Images-both radiology- and pathology-derived-are a major source of information on presence, type, and status of disease. Exploring the mathematical relationship of pixels in medical imaging ("radiomics") and cellular-scale structures in digital pathology slides ("pathomics") offers powerful tools for extracting both qualitative and, increasingly, quantitative data. These analytical approaches, however, may be significantly enhanced by applying additional methods arising from fields of mathematics such as differential geometry and algebraic topology that remain underexplored in this context. Geometry's strength lies in its ability to provide precise local measurements, such as curvature, that can be crucial for identifying abnormalities at multiple spatial levels. These measurements can augment the quantitative features extracted in conventional radiomics, leading to more nuanced diagnostics. By contrast, topology serves as a robust shape descriptor, capturing essential features such as connected components and holes. The field of topological data analysis was initially founded to explore the shape of data, with functional network connectivity in the brain being a prominent example. Increasingly, its tools are now being used to explore organizational patterns of physical structures in medical images and digitized pathology slides. By leveraging tools from both differential geometry and algebraic topology, researchers and clinicians may be able to obtain a more comprehensive, multi-layered understanding of medical images and contribute to precision medicine's armamentarium.
Collapse
Affiliation(s)
- Richard M Levenson
- Department of Pathology and Laboratory Medicine, University of California Davis, Davis, California.
| | - Yashbir Singh
- Department of Radiology, Mayo Clinic, Rochester, Minnesota.
| | - Bastian Rieck
- Helmholtz Munich and Technical University of Munich, Munich, Germany
| | - Quincy A Hathaway
- Department of Medical Education, West Virginia University, Morgantown, West Virginia
| | | | | | - Prateek Prasanna
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, New York
| | | | | | - Gunnar Carlsson
- Department of Mathematics, Stanford University, Stanford, California
| | - Deepa Sarkar
- Institute of Genomic Health, Ichan school of Medicine, Mount Sinai, New York
| |
Collapse
|
9
|
Bou Dagher L, Madern D, Malbos P, Brochier-Armanet C. Persistent homology reveals strong phylogenetic signal in 3D protein structures. PNAS NEXUS 2024; 3:pgae158. [PMID: 38689707 PMCID: PMC11058471 DOI: 10.1093/pnasnexus/pgae158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 04/01/2024] [Indexed: 05/02/2024]
Abstract
Changes that occur in proteins over time provide a phylogenetic signal that can be used to decipher their evolutionary history and the relationships between organisms. Sequence comparison is the most common way to access this phylogenetic signal, while those based on 3D structure comparisons are still in their infancy. In this study, we propose an effective approach based on Persistent Homology Theory (PH) to extract the phylogenetic information contained in protein structures. PH provides efficient and robust algorithms for extracting and comparing geometric features from noisy datasets at different spatial resolutions. PH has a growing number of applications in the life sciences, including the study of proteins (e.g. classification, folding). However, it has never been used to study the phylogenetic signal they may contain. Here, using 518 protein families, representing 22,940 protein sequences and structures, from 10 major taxonomic groups, we show that distances calculated with PH from protein structures correlate strongly with phylogenetic distances calculated from protein sequences, at both small and large evolutionary scales. We test several methods for calculating PH distances and propose some refinements to improve their relevance for addressing evolutionary questions. This work opens up new perspectives in evolutionary biology by proposing an efficient way to access the phylogenetic signal contained in protein structures, as well as future developments of topological analysis in the life sciences.
Collapse
Affiliation(s)
- Léa Bou Dagher
- Université Claude Bernard Lyon 1, CNRS, VetAgro Sup, Laboratoire de Biométrie et BiologieÉvolutive, UMR5558, F-69622 Villeurbanne, France
- Université Claude Bernard Lyon 1, CNRS, Institut Camille Jordan, UMR5208, F-69622 Villeurbanne, France
- Université Libanaise, Laboratoire de Mathématiques, École Doctorale en Science et Technologie, PO BOX 5 Hadath, Liban
| | - Dominique Madern
- University Grenoble Alpes, CEA, CNRS, IBS, 38000 Grenoble, France
| | - Philippe Malbos
- Université Claude Bernard Lyon 1, CNRS, Institut Camille Jordan, UMR5208, F-69622 Villeurbanne, France
| | - Céline Brochier-Armanet
- Université Claude Bernard Lyon 1, CNRS, VetAgro Sup, Laboratoire de Biométrie et BiologieÉvolutive, UMR5558, F-69622 Villeurbanne, France
| |
Collapse
|
10
|
Wang H, Huang G, Zhao Z, Cheng L, Juncker-Jensen A, Nagy ML, Lu X, Zhang X, Chen DZ. CCF-GNN: A Unified Model Aggregating Appearance, Microenvironment, and Topology for Pathology Image Classification. IEEE TRANSACTIONS ON MEDICAL IMAGING 2023; 42:3179-3193. [PMID: 37027573 DOI: 10.1109/tmi.2023.3249343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Pathology images contain rich information of cell appearance, microenvironment, and topology features for cancer analysis and diagnosis. Among such features, topology becomes increasingly important in analysis for cancer immunotherapy. By analyzing geometric and hierarchically structured cell distribution topology, oncologists can identify densely-packed and cancer-relevant cell communities (CCs) for making decisions. Compared to commonly-used pixel-level Convolution Neural Network (CNN) features and cell-instance-level Graph Neural Network (GNN) features, CC topology features are at a higher level of granularity and geometry. However, topological features have not been well exploited by recent deep learning (DL) methods for pathology image classification due to lack of effective topological descriptors for cell distribution and gathering patterns. In this paper, inspired by clinical practice, we analyze and classify pathology images by comprehensively learning cell appearance, microenvironment, and topology in a fine-to-coarse manner. To describe and exploit topology, we design Cell Community Forest (CCF), a novel graph that represents the hierarchical formulation process of big-sparse CCs from small-dense CCs. Using CCF as a new geometric topological descriptor of tumor cells in pathology images, we propose CCF-GNN, a GNN model that successively aggregates heterogeneous features (e.g., appearance, microenvironment) from cell-instance-level, cell-community-level, into image-level for pathology image classification. Extensive cross-validation experiments show that our method significantly outperforms alternative methods on H&E-stained and immunofluorescence images for disease grading tasks with multiple cancer types. Our proposed CCF-GNN establishes a new topological data analysis (TDA) based method, which facilitates integrating multi-level heterogeneous features of point clouds (e.g., for cells) into a unified DL framework.
Collapse
|
11
|
Ferrà A, Cecchini G, Nobbe Fisas FP, Casacuberta C, Cos I. A topological classifier to characterize brain states: When shape matters more than variance. PLoS One 2023; 18:e0292049. [PMID: 37782651 PMCID: PMC10545107 DOI: 10.1371/journal.pone.0292049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 08/04/2023] [Indexed: 10/04/2023] Open
Abstract
Despite the remarkable accuracies attained by machine learning classifiers to separate complex datasets in a supervised fashion, most of their operation falls short to provide an informed intuition about the structure of data, and, what is more important, about the phenomena being characterized by the given datasets. By contrast, topological data analysis (TDA) is devoted to study the shape of data clouds by means of persistence descriptors and provides a quantitative characterization of specific topological features of the dataset under scrutiny. Here we introduce a novel TDA-based classifier that works on the principle of assessing quantifiable changes on topological metrics caused by the addition of new input to a subset of data. We used this classifier with a high-dimensional electro-encephalographic (EEG) dataset recorded from eleven participants during a previous decision-making experiment in which three motivational states were induced through a manipulation of social pressure. We calculated silhouettes from persistence diagrams associated with each motivated state with a ready-made band-pass filtered version of these signals, and classified unlabeled signals according to their impact on each reference silhouette. Our results show that in addition to providing accuracies within the range of those of a nearest neighbour classifier, the TDA classifier provides formal intuition of the structure of the dataset as well as an estimate of its intrinsic dimension. Towards this end, we incorporated variance-based dimensionality reduction methods to our dataset and found that in most cases the accuracy of our TDA classifier remains essentially invariant beyond a certain dimension.
Collapse
Affiliation(s)
- Aina Ferrà
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Gloria Cecchini
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Fritz-Pere Nobbe Fisas
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Carles Casacuberta
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Matemàtica de la Universitat de Barcelona (IMUB), Barcelona, Spin
| | - Ignasi Cos
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Matemàtica de la Universitat de Barcelona (IMUB), Barcelona, Spin
- Serra-Húnter Fellow Programme, Barcelona, Catalonia, Spain
| |
Collapse
|
12
|
Chulián S, Stolz BJ, Martínez-Rubio Á, Blázquez Goñi C, Rodríguez Gutiérrez JF, Caballero Velázquez T, Molinos Quintana Á, Ramírez Orellana M, Castillo Robleda A, Fuster Soler JL, Minguela Puras A, Martínez Sánchez MV, Rosa M, Pérez-García VM, Byrne HM. The shape of cancer relapse: Topological data analysis predicts recurrence in paediatric acute lymphoblastic leukaemia. PLoS Comput Biol 2023; 19:e1011329. [PMID: 37578973 PMCID: PMC10468039 DOI: 10.1371/journal.pcbi.1011329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 08/30/2023] [Accepted: 07/05/2023] [Indexed: 08/16/2023] Open
Abstract
Although children and adolescents with acute lymphoblastic leukaemia (ALL) have high survival rates, approximately 15-20% of patients relapse. Risk of relapse is routinely estimated at diagnosis by biological factors, including flow cytometry data. This high-dimensional data is typically manually assessed by projecting it onto a subset of biomarkers. Cell density and "empty spaces" in 2D projections of the data, i.e. regions devoid of cells, are then used for qualitative assessment. Here, we use topological data analysis (TDA), which quantifies shapes, including empty spaces, in data, to analyse pre-treatment ALL datasets with known patient outcomes. We combine these fully unsupervised analyses with Machine Learning (ML) to identify significant shape characteristics and demonstrate that they accurately predict risk of relapse, particularly for patients previously classified as 'low risk'. We independently confirm the predictive power of CD10, CD20, CD38, and CD45 as biomarkers for ALL diagnosis. Based on our analyses, we propose three increasingly detailed prognostic pipelines for analysing flow cytometry data from ALL patients depending on technical and technological availability: 1. Visual inspection of specific biological features in biparametric projections of the data; 2. Computation of quantitative topological descriptors of such projections; 3. A combined analysis, using TDA and ML, in the four-parameter space defined by CD10, CD20, CD38 and CD45. Our analyses readily extend to other haematological malignancies.
Collapse
Affiliation(s)
- Salvador Chulián
- Department of Mathematics, Universidad de Cádiz, Puerto Real (Cádiz), Spain
- Biomedical Research and Innovation Institute of Cádiz (INiBICA), Hospital Universitario Puerta del Mar, Cádiz, Spain
- Department of Mathematics, Mathematical Oncology Laboratory (MOLAB), Universidad de Castilla-La Mancha, Ciudad Real, Spain
| | - Bernadette J. Stolz
- Mathematical Institute, University of Oxford, Oxford, United Kingdom
- Laboratory for Topology and Neuroscience, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Álvaro Martínez-Rubio
- Department of Mathematics, Universidad de Cádiz, Puerto Real (Cádiz), Spain
- Biomedical Research and Innovation Institute of Cádiz (INiBICA), Hospital Universitario Puerta del Mar, Cádiz, Spain
- Department of Mathematics, Mathematical Oncology Laboratory (MOLAB), Universidad de Castilla-La Mancha, Ciudad Real, Spain
| | - Cristina Blázquez Goñi
- Biomedical Research and Innovation Institute of Cádiz (INiBICA), Hospital Universitario Puerta del Mar, Cádiz, Spain
- Department of Paediatric Haematology and Oncology, Hospital Universitario de Jerez, Jerez de la Frontera (Cádiz), Spain
- Department of Haematology, Hospital Universitario Vírgen del Rocío, Instituto de Biomedicina de Sevilla (IBIS), Sevilla, Spain
| | - Juan F. Rodríguez Gutiérrez
- Biomedical Research and Innovation Institute of Cádiz (INiBICA), Hospital Universitario Puerta del Mar, Cádiz, Spain
- Department of Paediatric Haematology and Oncology, Hospital Universitario de Jerez, Jerez de la Frontera (Cádiz), Spain
| | - Teresa Caballero Velázquez
- Department of Haematology, Hospital Universitario Vírgen del Rocío, Instituto de Biomedicina de Sevilla (IBIS), Sevilla, Spain
- CSIC, University of Sevilla, Sevilla, Spain
| | - Águeda Molinos Quintana
- Department of Haematology, Hospital Universitario Vírgen del Rocío, Instituto de Biomedicina de Sevilla (IBIS), Sevilla, Spain
- CSIC, University of Sevilla, Sevilla, Spain
| | - Manuel Ramírez Orellana
- Department of Paediatric Haematology and Oncology, Hospital Infantil Universitario Niño Jesús - Instituto Investigación Sanitaria La Princesa, Madrid, Spain
| | - Ana Castillo Robleda
- Department of Paediatric Haematology and Oncology, Hospital Infantil Universitario Niño Jesús - Instituto Investigación Sanitaria La Princesa, Madrid, Spain
| | - José Luis Fuster Soler
- Department of Paediatric Haematology and Oncology, Hospital Clínico Universitario Virgen de la Arrixaca - Instituto Murciano de Investigación Biosanitaria (IMIB), Murcia, Spain
| | - Alfredo Minguela Puras
- Immunology Service, Hospital Clínico Universitario Virgen de la Arrixaca - Instituto Murciano de Investigación Biosanitaria (IMIB), Murcia, Spain
| | - María V. Martínez Sánchez
- Immunology Service, Hospital Clínico Universitario Virgen de la Arrixaca - Instituto Murciano de Investigación Biosanitaria (IMIB), Murcia, Spain
| | - María Rosa
- Department of Mathematics, Universidad de Cádiz, Puerto Real (Cádiz), Spain
- Biomedical Research and Innovation Institute of Cádiz (INiBICA), Hospital Universitario Puerta del Mar, Cádiz, Spain
- Department of Mathematics, Mathematical Oncology Laboratory (MOLAB), Universidad de Castilla-La Mancha, Ciudad Real, Spain
| | - Víctor M. Pérez-García
- Department of Mathematics, Mathematical Oncology Laboratory (MOLAB), Universidad de Castilla-La Mancha, Ciudad Real, Spain
- Instituto de Matemática Aplicada a la Ciencia y la Ingeniería (IMACI), Universidad de Castilla-La Mancha, Ciudad Real, Spain
- ETSI Industriales, Universidad de Castilla-La Mancha, Ciudad Real, Spain
| | - Helen M. Byrne
- Mathematical Institute, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
13
|
Lang H, Noble KV, Barth JL, Rumschlag JA, Jenkins TR, Storm SL, Eckert MA, Dubno JR, Schulte BA. The Stria Vascularis in Mice and Humans Is an Early Site of Age-Related Cochlear Degeneration, Macrophage Dysfunction, and Inflammation. J Neurosci 2023; 43:5057-5075. [PMID: 37268417 PMCID: PMC10324995 DOI: 10.1523/jneurosci.2234-22.2023] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 04/19/2023] [Accepted: 05/25/2023] [Indexed: 06/04/2023] Open
Abstract
Age-related hearing loss, or presbyacusis, is a common degenerative disorder affecting communication and quality of life for millions of older adults. Multiple pathophysiologic manifestations, along with many cellular and molecular alterations, have been linked to presbyacusis; however, the initial events and causal factors have not been clearly established. Comparisons of the transcriptome in the lateral wall (LW) with other cochlear regions in a mouse model (of both sexes) of "normal" age-related hearing loss revealed that early pathophysiological alterations in the stria vascularis (SV) are associated with increased macrophage activation and a molecular signature indicative of inflammaging, a common form of immune dysfunction. Structure-function correlation analyses in mice across the lifespan showed that the age-dependent increase in macrophage activation in the stria vascularis is associated with a decline in auditory sensitivity. High-resolution imaging analysis of macrophage activation in middle-aged and aged mouse and human cochleas, along with transcriptomic analysis of age-dependent changes in mouse cochlear macrophage gene expression, support the hypothesis that aberrant macrophage activity is an important contributor to age-dependent strial dysfunction, cochlear pathology, and hearing loss. Thus, this study highlights the SV as a primary site of age-related cochlear degeneration and aberrant macrophage activity and dysregulation of the immune system as early indicators of age-related cochlear pathology and hearing loss. Importantly, novel new imaging methods described here now provide a means to analyze human temporal bones in a way that had not previously been feasible and thereby represent a significant new tool for otopathological evaluation.SIGNIFICANCE STATEMENT Age-related hearing loss is a common neurodegenerative disorder affecting communication and quality of life. Current interventions (primarily hearing aids and cochlear implants) offer imperfect and often unsuccessful therapeutic outcomes. Identification of early pathology and causal factors is crucial for the development of new treatments and early diagnostic tests. Here, we find that the SV, a nonsensory component of the cochlea, is an early site of structural and functional pathology in mice and humans that is characterized by aberrant immune cell activity. We also establish a new technique for evaluating cochleas from human temporal bones, an important but understudied area of research because of a lack of well-preserved human specimens and difficult tissue preparation and processing approaches.
Collapse
Affiliation(s)
- Hainan Lang
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Kenyaria V Noble
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Jeremy L Barth
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Jeffrey A Rumschlag
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Tyreek R Jenkins
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Shelby L Storm
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Mark A Eckert
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Judy R Dubno
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, Charleston, South Carolina 29425
| | - Bradley A Schulte
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina 29425
| |
Collapse
|
14
|
Dłotko P, Gurnari D. Euler characteristic curves and profiles: a stable shape invariant for big data problems. Gigascience 2022; 12:giad094. [PMID: 37966428 PMCID: PMC10646871 DOI: 10.1093/gigascience/giad094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 09/25/2023] [Accepted: 10/09/2023] [Indexed: 11/16/2023] Open
Abstract
Tools of topological data analysis provide stable summaries encapsulating the shape of the considered data. Persistent homology, the most standard and well-studied data summary, suffers a number of limitations; its computations are hard to distribute, and it is hard to generalize to multifiltrations and is computationally prohibitive for big datasets. In this article, we study the concept of Euler characteristics curves for 1-parameter filtrations and Euler characteristic profiles for multiparameter filtrations. While being a weaker invariant in one dimension, we show that Euler characteristic-based approaches do not possess some handicaps of persistent homology; we show efficient algorithms to compute them in a distributed way, their generalization to multifiltrations, and practical applicability for big data problems. In addition, we show that the Euler curves and profiles enjoy a certain type of stability, which makes them robust tools for data analysis. Lastly, to show their practical applicability, multiple use cases are considered.
Collapse
Affiliation(s)
- Paweł Dłotko
- Dioscuri Centre in Topological Data Analysis, Mathematical Institute, Polish Academy of Sciences, Warsaw, 00-656, Poland
| | - Davide Gurnari
- Dioscuri Centre in Topological Data Analysis, Mathematical Institute, Polish Academy of Sciences, Warsaw, 00-656, Poland
| |
Collapse
|
15
|
Rammal A, Assaf R, Goupil A, Kacim M, Vrabie V. Machine learning techniques on homological persistence features for prostate cancer diagnosis. BMC Bioinformatics 2022; 23:476. [DOI: 10.1186/s12859-022-04992-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 10/18/2022] [Indexed: 11/14/2022] Open
Abstract
AbstractThe rapid evolution of image processing equipment and techniques ensures the development of novel picture analysis methodologies. One of the most powerful yet computationally possible algebraic techniques for measuring the topological characteristics of functions is persistent homology. It's an algebraic invariant that can capture topological details at different spatial resolutions. Persistent homology investigates the topological features of a space using a set of sampled points, such as pixels. It can track the appearance and disappearance of topological features caused by changes in the nested space created by an operation known as filtration, in which a parameter scale, in our case the intensity of pixels, is increased to detect changes in the studied space over a range of varying scales. In addition, at the level of machine learning there were many studies and articles witnessing recently the combination between homological persistence and machine learning algorithms. On another level, prostate cancer is diagnosed referring to a scoring criterion describing the severity of the cancer called Gleason score. The classical Gleason system defines five histological growth patterns (grades). In our study we propose to study the Gleason score on some glands issued from a new optical microscopy technique called SLIM. This new optical microscopy technique that combines two classic ideas in light imaging: Zernike’s phase contrast microscopy and Gabor’s holography. Persistent homology features are computed on these images. We suggested machine learning methods to classify these images into the corresponding Gleason score. Machine learning techniques applied on homological persistence features was very effective in the detection of the right Gleason score of the prostate cancer in these kinds of images and showed an accuracy of above 95%.
Collapse
|
16
|
Alsaleh L, Li C, Couetil JL, Ye Z, Huang K, Zhang J, Chen C, Johnson TS. Spatial Transcriptomic Analysis Reveals Associations between Genes and Cellular Topology in Breast and Prostate Cancers. Cancers (Basel) 2022; 14:4856. [PMID: 36230778 PMCID: PMC9562681 DOI: 10.3390/cancers14194856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 09/26/2022] [Accepted: 09/28/2022] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Cancer is the leading cause of death worldwide with breast and prostate cancer the most common among women and men, respectively. Gene expression and image features are independently prognostic of patient survival; but until the advent of spatial transcriptomics (ST), it was not possible to determine how gene expression of cells was tied to their spatial relationships (i.e., topology). METHODS We identify topology-associated genes (TAGs) that correlate with 700 image topological features (ITFs) in breast and prostate cancer ST samples. Genes and image topological features are independently clustered and correlated with each other. Themes among genes correlated with ITFs are investigated by functional enrichment analysis. RESULTS Overall, topology-associated genes (TAG) corresponding to extracellular matrix (ECM) and Collagen Type I Trimer gene ontology terms are common to both prostate and breast cancer. In breast cancer specifically, we identify the ZAG-PIP Complex as a TAG. In prostate cancer, we identify distinct TAGs that are enriched for GI dysmotility and the IgA immunoglobulin complex. We identified TAGs in every ST slide regardless of cancer type. CONCLUSIONS These TAGs are enriched for ontology terms, illustrating the biological relevance to our image topology features and their potential utility in diagnostic and prognostic models.
Collapse
Affiliation(s)
- Lujain Alsaleh
- Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, IN 46202, USA
| | - Chen Li
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Justin L. Couetil
- Department of Medical and Molecular Genetics, Indiana University, Indianapolis, IN 46202, USA
| | - Ze Ye
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Kun Huang
- Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, IN 46202, USA
- Department of Medical and Molecular Genetics, Indiana University, Indianapolis, IN 46202, USA
- Regenstrief Institute, Indiana University, Indianapolis, IN 46202, USA
- Melvin and Bren Simon Comprehensive Cancer Center, Indiana University, Indianapolis, IN 46202, USA
| | - Jie Zhang
- Department of Medical and Molecular Genetics, Indiana University, Indianapolis, IN 46202, USA
- Melvin and Bren Simon Comprehensive Cancer Center, Indiana University, Indianapolis, IN 46202, USA
| | - Chao Chen
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Travis S. Johnson
- Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, IN 46202, USA
- Melvin and Bren Simon Comprehensive Cancer Center, Indiana University, Indianapolis, IN 46202, USA
- Indiana Biosciences Research Institute, Indianapolis, IN 46202, USA
| |
Collapse
|
17
|
Skaf Y, Laubenbacher R. Topological data analysis in biomedicine: A review. J Biomed Inform 2022; 130:104082. [PMID: 35508272 DOI: 10.1016/j.jbi.2022.104082] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/20/2022] [Accepted: 04/23/2022] [Indexed: 01/22/2023]
Abstract
Significant technological advances made in recent years have shepherded a dramatic increase in utilization of digital technologies for biomedicine- everything from the widespread use of electronic health records to improved medical imaging capabilities and the rising ubiquity of genomic sequencing contribute to a "digitization" of biomedical research and clinical care. With this shift toward computerized tools comes a dramatic increase in the amount of available data, and current tools for data analysis capable of extracting meaningful knowledge from this wealth of information have yet to catch up. This article seeks to provide an overview of emerging mathematical methods with the potential to improve the abilities of clinicians and researchers to analyze biomedical data, but may be hindered from doing so by a lack of conceptual accessibility and awareness in the life sciences research community. In particular, we focus on topological data analysis (TDA), a set of methods grounded in the mathematical field of algebraic topology that seeks to describe and harness features related to the "shape" of data. We aim to make such techniques more approachable to non-mathematicians by providing a conceptual discussion of their theoretical foundations followed by a survey of their published applications to scientific research. Finally, we discuss the limitations of these methods and suggest potential avenues for future work integrating mathematical tools into clinical care and biomedical informatics.
Collapse
Affiliation(s)
- Yara Skaf
- University of Florida, Department of Mathematics, Gainesville, FL, USA; University of Florida, Department of Medicine, Division of Pulmonary, Critical Care, & Sleep Medicine, Gainesville, FL, USA.
| | - Reinhard Laubenbacher
- University of Florida, Department of Mathematics, Gainesville, FL, USA; University of Florida, Department of Medicine, Division of Pulmonary, Critical Care, & Sleep Medicine, Gainesville, FL, USA.
| |
Collapse
|
18
|
Branco S, Carvalho JG, Reis MS, Lopes NV, Cabral J. 0-Dimensional Persistent Homology Analysis Implementation in Resource-Scarce Embedded Systems. SENSORS (BASEL, SWITZERLAND) 2022; 22:3657. [PMID: 35632064 PMCID: PMC9144123 DOI: 10.3390/s22103657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 05/02/2022] [Accepted: 05/06/2022] [Indexed: 06/15/2023]
Abstract
Persistent Homology (PH) analysis is a powerful tool for understanding many relevant topological features from a given dataset. PH allows finding clusters, noise, and relevant connections in the dataset. Therefore, it can provide a better view of the problem and a way of perceiving if a given dataset is equal to another, if a given sample is relevant, and how the samples occupy the feature space. However, PH involves reducing the problem to its simplicial complex space, which is computationally expensive and implementing PH in such Resource-Scarce Embedded Systems (RSES) is an essential add-on for them. However, due to its complexity, implementing PH in such tiny devices is considerably complicated due to the lack of memory and processing power. The following paper shows the implementation of 0-Dimensional Persistent Homology Analysis in a set of well-known RSES, using a technique that reduces the memory footprint and processing power needs of the 0-Dimensional PH algorithm. The results are positive and show that RSES can be equipped with this real-time data analysis tool.
Collapse
Affiliation(s)
- Sérgio Branco
- Algoritmi Center, University of Minho, 4800-058 Guimarães, Portugal; (S.B.); (J.G.C.)
- CEiiA—Centro de Engenharia, Av. D. Afonso Henriques 1825, 4450-017 Matosinhos, Portugal
| | - João G. Carvalho
- Algoritmi Center, University of Minho, 4800-058 Guimarães, Portugal; (S.B.); (J.G.C.)
- DTx—Digital Transformation CoLab, University of Minho, 4800-058 Guimarães, Portugal;
| | - Marco S. Reis
- CIEPQPF, Department of Chemical Engineering, University of Coimbra, Rua Sílvio Lima, Pólo II—Pinhal de Marrocos, 3030-790 Coimbra, Portugal;
| | - Nuno V. Lopes
- DTx—Digital Transformation CoLab, University of Minho, 4800-058 Guimarães, Portugal;
| | - Jorge Cabral
- Algoritmi Center, University of Minho, 4800-058 Guimarães, Portugal; (S.B.); (J.G.C.)
- CEiiA—Centro de Engenharia, Av. D. Afonso Henriques 1825, 4450-017 Matosinhos, Portugal
| |
Collapse
|
19
|
Prabhu S, Prasad K, Robels-Kelly A, Lu X. AI-based carcinoma detection and classification using histopathological images: A systematic review. Comput Biol Med 2022; 142:105209. [DOI: 10.1016/j.compbiomed.2022.105209] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 01/01/2022] [Accepted: 01/01/2022] [Indexed: 02/07/2023]
|
20
|
Minciuna CE, Tanase M, Manuc TE, Tudor S, Herlea V, Dragomir MP, Calin GA, Vasilescu C. The seen and the unseen: Molecular classification and image based-analysis of gastrointestinal cancers. Comput Struct Biotechnol J 2022; 20:5065-5075. [PMID: 36187924 PMCID: PMC9489806 DOI: 10.1016/j.csbj.2022.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 09/07/2022] [Accepted: 09/07/2022] [Indexed: 11/13/2022] Open
Abstract
Gastrointestinal cancers account for 22.5% of cancer related deaths worldwide and represent circa 20% of all cancers. In the last decades, we have witnessed a shift from histology-based to molecular-based classifications using genomic, epigenomic, and transcriptomic data. The molecular based classification revealed new prognostic markers and may aid the therapy selection. Because of the high-costs to perform a molecular classification, in recent years immunohistochemistry-based surrogate classification were developed which permit the stratification of patients, and in parallel multiple groups developed hematoxylin and eosin whole slide image analysis for sub-classifying these entities. Hence, we are witnessing a return to an image-based classification with the purpose to infer hidden information from routine histology images that would permit to detect the patients that respond to specific therapies and would be able to predict their outcome. In this review paper, we will discuss the current histological, molecular, and immunohistochemical classifications of the most common gastrointestinal cancers, gastric adenocarcinoma, and colorectal adenocarcinoma, and will present key aspects for developing a new artificial intelligence aided image-based classification of these malignancies.
Collapse
|
21
|
Qin Y, Fasy BT, Wenk C, Summa B. A Domain-Oblivious Approach for Learning Concise Representations of Filtered Topological Spaces for Clustering. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2022; 28:302-312. [PMID: 34587087 DOI: 10.1109/tvcg.2021.3114872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Persistence diagrams have been widely used to quantify the underlying features of filtered topological spaces in data visualization. In many applications, computing distances between diagrams is essential; however, computing these distances has been challenging due to the computational cost. In this paper, we propose a persistence diagram hashing framework that learns a binary code representation of persistence diagrams, which allows for fast computation of distances. This framework is built upon a generative adversarial network (GAN) with a diagram distance loss function to steer the learning process. Instead of using standard representations, we hash diagrams into binary codes, which have natural advantages in large-scale tasks. The training of this model is domain-oblivious in that it can be computed purely from synthetic, randomly created diagrams. As a consequence, our proposed method is directly applicable to various datasets without the need for retraining the model. These binary codes, when compared using fast Hamming distance, better maintain topological similarity properties between datasets than other vectorized representations. To evaluate this method, we apply our framework to the problem of diagram clustering and we compare the quality and performance of our approach to the state-of-the-art. In addition, we show the scalability of our approach on a dataset with 10k persistence diagrams, which is not possible with current techniques. Moreover, our experimental results demonstrate that our method is significantly faster with the potential of less memory usage, while retaining comparable or better quality comparisons.
Collapse
|
22
|
韩 继, 谢 嘉, 顾 松, 闫 朝, 李 建, 张 志, 徐 军. [Automated grading of glioma based on density and atypia analysis in whole slide images]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2021; 38:1062-1071. [PMID: 34970888 PMCID: PMC9927119 DOI: 10.7507/1001-5515.202103050] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 08/05/2021] [Indexed: 06/14/2023]
Abstract
Glioma is the most common malignant brain tumor and classification of low grade glioma (LGG) and high grade glioma (HGG) is an important reference of making decisions on patient treatment options and prognosis. This work is largely done manually by pathologist based on an examination of whole slide image (WSI), which is arduous and heavily dependent on doctors' experience. In the World Health Organization (WHO) criteria, grade of glioma is closely related to hypercellularity, nuclear atypia and necrosis. Inspired by this, this paper designed and extracted cell density and atypia features to classify LGG and HGG. First, regions of interest (ROI) were located by analyzing cell density and global density features were extracted as well. Second, local density and atypia features were extracted in ROI. Third, balanced support vector machine (SVM) classifier was trained and tested using 10 selected features. The area under the curve (AUC) and accuracy (ACC) of 5-fold cross validation were 0.92 ± 0.01 and 0.82 ± 0.01 respectively. The results demonstrate that the proposed method of locating ROI is effective and the designed features of density and atypia can be used to predict glioma grade accurately, which can provide reliable basis for clinical diagnosis.
Collapse
Affiliation(s)
- 继能 韩
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
- 南京大学医学院附属金陵医院 放射诊断科(南京 210002)Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, Nanjing 210002, P.R.China
| | - 嘉伟 谢
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
- 南京大学医学院附属金陵医院 放射诊断科(南京 210002)Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, Nanjing 210002, P.R.China
| | - 松 顾
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
- 南京大学医学院附属金陵医院 放射诊断科(南京 210002)Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, Nanjing 210002, P.R.China
| | - 朝阳 闫
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
- 南京大学医学院附属金陵医院 放射诊断科(南京 210002)Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, Nanjing 210002, P.R.China
| | - 建瑞 李
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
| | - 志强 张
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
| | - 军 徐
- 南京信息工程大学 自动化学院(南京 210044)School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, P.R.China
- 南京大学医学院附属金陵医院 放射诊断科(南京 210002)Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, Nanjing 210002, P.R.China
| |
Collapse
|
23
|
Somasundaram E, Litzler A, Wadhwa R, Barker-Clarke R, Scott J. Persistent homology of tumor CT scans is associated with survival in lung cancer. Med Phys 2021; 48:7043-7051. [PMID: 34587294 DOI: 10.1002/mp.15255] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2021] [Revised: 09/09/2021] [Accepted: 09/13/2021] [Indexed: 11/07/2022] Open
Abstract
PURPOSE Radiomics, the objective study of nonvisual features in clinical imaging, has been useful in informing decisions in clinical oncology. However, radiomics currently lacks the ability to characterize the overall topological structure of the data. This niche can be filled by persistent homology, a form of topological data analysis that analyzes high-level structure. We hypothesized that persistent homology features quantified using cubical complexes could be extracted from lung tumor scans and related to survival. METHODS We obtained segmented computed tomography (CT) lung scans (n = 565) from the NSCLC-Radiomics and NSCLC-Radiogenomics datasets in The Cancer Imaging Archive. These scans are three-dimensional images whose pixel intensity corresponds to a number of Hounsfield units. Cubical complexes are a topological image analysis method that effectively analyzes the number of topological features in an image as the image is thresholded at different intensities. We calculated a novel output called a feature curve by plotting the number of zero-dimensional (0D) topological features counted from the cubical complex filtration against each Hounsfield value. This curve's first moment of distribution was utilized as a summary statistic to show association with survival in a Cox proportional hazards model. We hypothesized that persistent homology features quantified using cubical complexes could be extracted from lung tumor scans and related to survival. RESULTS After controlling for tumor image size, age, and stage, the first moment of the 0D topological feature curve was associated with poorer survival (HR = 1.118; 95% CI = 1.026-1.218; p = 0.01). The patients in our study with the lowest first moment scores had significantly better survival (1238 days; 95% CI = 936-1599) compared to the patients with the highest first moment scores (429 days; 95% CI = 326-601; p = 0.0015). CONCLUSIONS We have shown that persistent homology can generate useful clinical correlates from tumor CT scans. Our 0D topological feature curve statistic predicts survival in lung cancer patients. This novel statistic may be used in tandem with standard radiomics variables to better inform clinical oncology decisions.
Collapse
Affiliation(s)
| | - Adam Litzler
- University of Colorado Boulder, Department of Applied Mathematics, Boulder, Colorado, USA
| | - Raoul Wadhwa
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
| | - Rowan Barker-Clarke
- Lerner Research Institute, Department of Translational Hematology and Oncology Research, Cleveland, Ohio, USA
| | - Jacob Scott
- Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
- Lerner Research Institute, Department of Translational Hematology and Oncology Research, Cleveland, Ohio, USA
- Taussig Cancer Institute, Department of Radiation Oncology, Cleveland, Ohio, USA
| |
Collapse
|
24
|
Vipond O, Bull JA, Macklin PS, Tillmann U, Pugh CW, Byrne HM, Harrington HA. Multiparameter persistent homology landscapes identify immune cell spatial patterns in tumors. Proc Natl Acad Sci U S A 2021; 118:e2102166118. [PMID: 34625491 PMCID: PMC8522280 DOI: 10.1073/pnas.2102166118] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/24/2021] [Indexed: 12/29/2022] Open
Abstract
Highly resolved spatial data of complex systems encode rich and nonlinear information. Quantification of heterogeneous and noisy data-often with outliers, artifacts, and mislabeled points-such as those from tissues, remains a challenge. The mathematical field that extracts information from the shape of data, topological data analysis (TDA), has expanded its capability for analyzing real-world datasets in recent years by extending theory, statistics, and computation. An extension to the standard theory to handle heterogeneous data is multiparameter persistent homology (MPH). Here we provide an application of MPH landscapes, a statistical tool with theoretical underpinnings. MPH landscapes, computed for (noisy) data from agent-based model simulations of immune cells infiltrating into a spheroid, are shown to surpass existing spatial statistics and one-parameter persistent homology. We then apply MPH landscapes to study immune cell location in digital histology images from head and neck cancer. We quantify intratumoral immune cells and find that infiltrating regulatory T cells have more prominent voids in their spatial patterns than macrophages. Finally, we consider how TDA can integrate and interrogate data of different types and scales, e.g., immune cell locations and regions with differing levels of oxygenation. This work highlights the power of MPH landscapes for quantifying, characterizing, and comparing features within the tumor microenvironment in synthetic and real datasets.
Collapse
Affiliation(s)
- Oliver Vipond
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom
| | - Joshua A Bull
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom
| | - Philip S Macklin
- Nuffield Department of Medicine Research Building, University of Oxford, Oxford OX3 7FZ, United Kingdom
| | - Ulrike Tillmann
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom;
| | - Christopher W Pugh
- Nuffield Department of Medicine Research Building, University of Oxford, Oxford OX3 7FZ, United Kingdom;
| | - Helen M Byrne
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom;
| | - Heather A Harrington
- Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom;
- Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| |
Collapse
|
25
|
Sritharan D, Wang S, Hormoz S. Computing the Riemannian curvature of image patch and single-cell RNA sequencing data manifolds using extrinsic differential geometry. Proc Natl Acad Sci U S A 2021; 118:e2100473118. [PMID: 34272279 PMCID: PMC8307776 DOI: 10.1073/pnas.2100473118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Most high-dimensional datasets are thought to be inherently low-dimensional-that is, data points are constrained to lie on a low-dimensional manifold embedded in a high-dimensional ambient space. Here, we study the viability of two approaches from differential geometry to estimate the Riemannian curvature of these low-dimensional manifolds. The intrinsic approach relates curvature to the Laplace-Beltrami operator using the heat-trace expansion and is agnostic to how a manifold is embedded in a high-dimensional space. The extrinsic approach relates the ambient coordinates of a manifold's embedding to its curvature using the Second Fundamental Form and the Gauss-Codazzi equation. We found that the intrinsic approach fails to accurately estimate the curvature of even a two-dimensional constant-curvature manifold, whereas the extrinsic approach was able to handle more complex toy models, even when confounded by practical constraints like small sample sizes and measurement noise. To test the applicability of the extrinsic approach to real-world data, we computed the curvature of a well-studied manifold of image patches and recapitulated its topological classification as a Klein bottle. Lastly, we applied the extrinsic approach to study single-cell transcriptomic sequencing (scRNAseq) datasets of blood, gastrulation, and brain cells to quantify the Riemannian curvature of scRNAseq manifolds.
Collapse
Affiliation(s)
- Duluxan Sritharan
- Harvard Graduate Program in Biophysics, Harvard University, Boston, MA 02115
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA 02215
| | - Shu Wang
- Harvard Graduate Program in Biophysics, Harvard University, Boston, MA 02115
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115
| | - Sahand Hormoz
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA 02215;
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| |
Collapse
|
26
|
Ding Z, Song D, Wu H, Tian H, Ye X, Liang W, Xu J, Dong F. Development and validation of a nomogram based on multiparametric magnetic resonance imaging and elastography-derived data for the stratification of patients with prostate cancer. Quant Imaging Med Surg 2021; 11:3252-3262. [PMID: 34249651 DOI: 10.21037/qims-20-978] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2020] [Accepted: 04/01/2021] [Indexed: 01/08/2023]
Abstract
Background This study sought to develop and validate a nomogram combining the elastographic Q-analysis score (EQS), the Prostate Imaging Reporting and Data System (PI-RADS) score, and clinical parameters for the stratification of patients with prostate cancer (PCa). Methods A retrospective study was conducted of 375 patients with 375 lesions who underwent volume-navigation transrectal ultrasound (TRUS) and multiparametric magnetic resonance imaging (MP-MRI)-fusion targeted biopsies between April 2017 and January 2020. Based on a multivariate logistic regression model, a nomogram was created to assess any PCa and high-risk PCa [Gleason score (GS) ≥4+3] using data from patients diagnosed between April 2017 and June 2019 (n=271), and was validated in patients diagnosed after July 2019 (n=104). The nomogram's performance was evaluated based on its discrimination, calibration, and clinical usefulness. Results The areas under the curve (AUCs) of the nomogram for predicting any PCa and high-risk PCa were 0.949 [95% confidence interval (CI), 0.921 to 0.978] and 0.936 (95% CI, 0.906 to 0.965), respectively, in the training cohort, and 0.946 (95% CI, 0.894 to 0.997) and 0.971 (95% CI, 0.9331 to 1), respectively, in the validation cohort. The nomogram was well calibrated, and no significant difference was found between the predicted and observed probabilities. A decision curve analysis (DCA) for the nomogram with and without the EQS showed that the threshold probability of for any PCa was <67%. Conclusions The nomogram that combined elastography-derived and MP-MRI data was more clinically useful than the model based on PI-RADS and clinical parameters alone. Our nomogram could aid urologists to make decisions and avoid unnecessary biopsies.
Collapse
Affiliation(s)
- Zhimin Ding
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Di Song
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Huaiyu Wu
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Hongtian Tian
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Xiuqin Ye
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Weiyu Liang
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Jinfeng Xu
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Fajin Dong
- Department of Ultrasound, Shenzhen Medical Ultrasound Engineering Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| |
Collapse
|
27
|
Bukkuri A, Andor N, Darcy IK. Applications of Topological Data Analysis in Oncology. Front Artif Intell 2021; 4:659037. [PMID: 33928240 PMCID: PMC8076640 DOI: 10.3389/frai.2021.659037] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 03/16/2021] [Indexed: 12/12/2022] Open
Abstract
The emergence of the information age in the last few decades brought with it an explosion of biomedical data. But with great power comes great responsibility: there is now a pressing need for new data analysis algorithms to be developed to make sense of the data and transform this information into knowledge which can be directly translated into the clinic. Topological data analysis (TDA) provides a promising path forward: using tools from the mathematical field of algebraic topology, TDA provides a framework to extract insights into the often high-dimensional, incomplete, and noisy nature of biomedical data. Nowhere is this more evident than in the field of oncology, where patient-specific data is routinely presented to clinicians in a variety of forms, from imaging to single cell genomic sequencing. In this review, we focus on applications involving persistent homology, one of the main tools of TDA. We describe some recent successes of TDA in oncology, specifically in predicting treatment responses and prognosis, tumor segmentation and computer-aided diagnosis, disease classification, and cellular architecture determination. We also provide suggestions on avenues for future research including utilizing TDA to analyze cancer time-series data such as gene expression changes during pathogenesis, investigation of the relation between angiogenic vessel structure and treatment efficacy from imaging data, and experimental confirmation that geometric and topological connectivity implies functional connectivity in the context of cancer.
Collapse
Affiliation(s)
- Anuraag Bukkuri
- Department of Integrated Mathematical Oncology, Moffitt Cancer Center, Tampa, FL, United States
| | - Noemi Andor
- Department of Integrated Mathematical Oncology, Moffitt Cancer Center, Tampa, FL, United States
| | - Isabel K. Darcy
- Department of Mathematics, University of Iowa, Iowa City, IA, United States
| |
Collapse
|
28
|
Abstract
The homological scaffold leverages persistent homology to construct a topologically sound summary of a weighted network. However, its crucial dependency on the choice of representative cycles hinders the ability to trace back global features onto individual network components, unless one provides a principled way to make such a choice. In this paper, we apply recent advances in the computation of minimal homology bases to introduce a quasi-canonical version of the scaffold, called minimal, and employ it to analyze data both real and in silico. At the same time, we verify that, statistically, the standard scaffold is a good proxy of the minimal one for sufficiently complex networks.
Collapse
|
29
|
Ninomiya K, Arimura H, Chan WY, Tanaka K, Mizuno S, Muhammad Gowdh NF, Yaakup NA, Liam CK, Chai CS, Ng KH. Robust radiogenomics approach to the identification of EGFR mutations among patients with NSCLC from three different countries using topologically invariant Betti numbers. PLoS One 2021; 16:e0244354. [PMID: 33428651 PMCID: PMC7799813 DOI: 10.1371/journal.pone.0244354] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2020] [Accepted: 12/09/2020] [Indexed: 12/24/2022] Open
Abstract
OBJECTIVES To propose a novel robust radiogenomics approach to the identification of epidermal growth factor receptor (EGFR) mutations among patients with non-small cell lung cancer (NSCLC) using Betti numbers (BNs). MATERIALS AND METHODS Contrast enhanced computed tomography (CT) images of 194 multi-racial NSCLC patients (79 EGFR mutants and 115 wildtypes) were collected from three different countries using 5 manufacturers' scanners with a variety of scanning parameters. Ninety-nine cases obtained from the University of Malaya Medical Centre (UMMC) in Malaysia were used for training and validation procedures. Forty-one cases collected from the Kyushu University Hospital (KUH) in Japan and fifty-four cases obtained from The Cancer Imaging Archive (TCIA) in America were used for a test procedure. Radiomic features were obtained from BN maps, which represent topologically invariant heterogeneous characteristics of lung cancer on CT images, by applying histogram- and texture-based feature computations. A BN-based signature was determined using support vector machine (SVM) models with the best combination of features that maximized a robustness index (RI) which defined a higher total area under receiver operating characteristics curves (AUCs) and lower difference of AUCs between the training and the validation. The SVM model was built using the signature and optimized in a five-fold cross validation. The BN-based model was compared to conventional original image (OI)- and wavelet-decomposition (WD)-based models with respect to the RI between the validation and the test. RESULTS The BN-based model showed a higher RI of 1.51 compared with the models based on the OI (RI: 1.33) and the WD (RI: 1.29). CONCLUSION The proposed model showed higher robustness than the conventional models in the identification of EGFR mutations among NSCLC patients. The results suggested the robustness of the BN-based approach against variations in image scanner/scanning parameters.
Collapse
Affiliation(s)
- Kenta Ninomiya
- Division of Medical Quantum Science, Department of Health Sciences, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Hidetaka Arimura
- Faculty of Medical Sciences, Division of Medical Quantum Science, Department of Health Sciences, Kyushu University, Fukuoka, Japan
| | - Wai Yee Chan
- Faculty of Medicine, Department of Biomedical Imaging, University of Malaya, Kuala Lumpur, Malaysia
| | - Kentaro Tanaka
- Department of Respiratory Medicine, Kyushu University Hospital, Fukuoka, Japan
| | - Shinichi Mizuno
- Division of Medical Sciences and Technology, Department of Health Sciences, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | | | - Nur Adura Yaakup
- Faculty of Medicine, Department of Biomedical Imaging, University of Malaya, Kuala Lumpur, Malaysia
| | - Chong-Kin Liam
- Faculty of Medicine, Department of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Chee-Shee Chai
- Faculty of Medicine and Health Science, Department of Medicine, University Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Kwan Hoong Ng
- Faculty of Medicine, Department of Biomedical Imaging, University of Malaya, Kuala Lumpur, Malaysia
| |
Collapse
|
30
|
Levy J, Haudenschild C, Barwick C, Christensen B, Vaickus L. Topological Feature Extraction and Visualization of Whole Slide Images using Graph Neural Networks. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2021; 26:285-296. [PMID: 33691025 PMCID: PMC7959046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Whole-slide images (WSI) are digitized representations of thin sections of stained tissue from various patient sources (biopsy, resection, exfoliation, fluid) and often exceed 100,000 pixels in any given spatial dimension. Deep learning approaches to digital pathology typically extract information from sub-images (patches) and treat the sub-images as independent entities, ignoring contributing information from vital large-scale architectural relationships. Modeling approaches that can capture higher-order dependencies between neighborhoods of tissue patches have demonstrated the potential to improve predictive accuracy while capturing the most essential slide-level information for prognosis, diagnosis and integration with other omics modalities. Here, we review two promising methods for capturing macro and micro architecture of histology images, Graph Neural Networks, which contextualize patch level information from their neighbors through message passing, and Topological Data Analysis, which distills contextual information into its essential components. We introduce a modeling framework, WSI-GTFE that integrates these two approaches in order to identify and quantify key pathogenic information pathways. To demonstrate a simple use case, we utilize these topological methods to develop a tumor invasion score to stage colon cancer.
Collapse
Affiliation(s)
- Joshua Levy
- Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, NH 03756, USA* To whom correspondence should be addressed.,
| | | | | | | | | |
Collapse
|
31
|
Amézquita EJ, Quigley MY, Ophelders T, Munch E, Chitwood DH. The shape of things to come: Topological data analysis and biology, from molecules to organisms. Dev Dyn 2020; 249:816-833. [PMID: 32246730 PMCID: PMC7383827 DOI: 10.1002/dvdy.175] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2020] [Revised: 03/29/2020] [Accepted: 03/29/2020] [Indexed: 11/11/2022] Open
Abstract
Shape is data and data is shape. Biologists are accustomed to thinking about how the shape of biomolecules, cells, tissues, and organisms arise from the effects of genetics, development, and the environment. Less often do we consider that data itself has shape and structure, or that it is possible to measure the shape of data and analyze it. Here, we review applications of topological data analysis (TDA) to biology in a way accessible to biologists and applied mathematicians alike. TDA uses principles from algebraic topology to comprehensively measure shape in data sets. Using a function that relates the similarity of data points to each other, we can monitor the evolution of topological features-connected components, loops, and voids. This evolution, a topological signature, concisely summarizes large, complex data sets. We first provide a TDA primer for biologists before exploring the use of TDA across biological sub-disciplines, spanning structural biology, molecular biology, evolution, and development. We end by comparing and contrasting different TDA approaches and the potential for their use in biology. The vision of TDA, that data are shape and shape is data, will be relevant as biology transitions into a data-driven era where the meaningful interpretation of large data sets is a limiting factor.
Collapse
Affiliation(s)
- Erik J Amézquita
- Department of Computational Mathematics, Science & Engineering, Michigan State University, East Lansing, Michigan, USA
| | - Michelle Y Quigley
- Department of Horticulture, Michigan State University, East Lansing, Michigan, USA
| | - Tim Ophelders
- Department of Computational Mathematics, Science & Engineering, Michigan State University, East Lansing, Michigan, USA
| | - Elizabeth Munch
- Department of Computational Mathematics, Science & Engineering, Michigan State University, East Lansing, Michigan, USA.,Department of Mathematics, Michigan State University, East Lansing, Michigan, USA
| | - Daniel H Chitwood
- Department of Computational Mathematics, Science & Engineering, Michigan State University, East Lansing, Michigan, USA.,Department of Horticulture, Michigan State University, East Lansing, Michigan, USA
| |
Collapse
|
32
|
Hu B, Li G, Brown JQ. Enhanced resolution 3D digital cytology and pathology with dual-view inverted selective plane illumination microscopy. BIOMEDICAL OPTICS EXPRESS 2019; 10:3833-3846. [PMID: 31452978 PMCID: PMC6701541 DOI: 10.1364/boe.10.003833] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Revised: 06/14/2019] [Accepted: 06/26/2019] [Indexed: 05/11/2023]
Abstract
The current gold-standard histopathology for tissue analysis is destructive, time consuming, and limited to 2D slices. Light sheet microscopy has emerged as a powerful tool for 3D imaging of tissue biospecimens with its fast speed and low photo-damage, but usually with worse axial resolution and complicated configuration for sample imaging. Here, we utilized inverted selective plane illumination microscopy for easy sample mounting and imaging, and dual-view imaging and deconvolution to overcome the anisotropic resolution. We have rendered 3D images of fresh cytology cell blocks and millimeter- to centimeter-sized fixed tissue samples with high resolution in both lateral and axial directions. More accurate cellular quantification, higher image sharpness, and more image details have been achieved with the dual-view method compared with single-view imaging.
Collapse
|