1
|
Lai Y, Guan W, Luo L, Guo Y, Song H, Meng H. Bayesian Estimation of Inverted Beta Mixture Models With Extended Stochastic Variational Inference for Positive Vector Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6948-6962. [PMID: 36279334 DOI: 10.1109/tnnls.2022.3213518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The finite inverted beta mixture model (IBMM) has been proven to be efficient in modeling positive vectors. Under the traditional variational inference framework, the critical challenge in Bayesian estimation of the IBMM is that the computational cost of performing inference with large datasets is prohibitively expensive, which often limits the use of Bayesian approaches to small datasets. An efficient alternative provided by the recently proposed stochastic variational inference (SVI) framework allows for efficient inference on large datasets. Nevertheless, when using the SVI framework to address the non-Gaussian statistical models, the evidence lower bound (ELBO) cannot be explicitly calculated due to the intractable moment computation. Therefore, the algorithm under the SVI framework cannot directly use stochastic optimization to optimize the ELBO, and an analytically tractable solution cannot be derived. To address this problem, we propose an extended version of the SVI framework with more flexibility, namely, the extended SVI (ESVI) framework. This framework can be used in many non-Gaussian statistical models. First, some approximation strategies are applied to further lower the ELBO to avoid intractable moment calculations. Then, stochastic optimization with noisy natural gradients is used to optimize the lower bound. The excellent performance and effectiveness of the proposed method are verified in real data evaluation.
Collapse
|
2
|
Aznarez-Sanado M, Romero-Garcia R, Li C, Morris RC, Price SJ, Manly T, Santarius T, Erez Y, Hart MG, Suckling J. Brain tumour microstructure is associated with post-surgical cognition. Sci Rep 2024; 14:5646. [PMID: 38454017 PMCID: PMC10920778 DOI: 10.1038/s41598-024-55130-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 02/20/2024] [Indexed: 03/09/2024] Open
Abstract
Brain tumour microstructure is potentially predictive of changes following treatment to cognitive functions subserved by the functional networks in which they are embedded. To test this hypothesis, intra-tumoural microstructure was quantified from diffusion-weighted MRI to identify which tumour subregions (if any) had a greater impact on participants' cognitive recovery after surgical resection. Additionally, we studied the role of tumour microstructure in the functional interaction between the tumour and the rest of the brain. Sixteen patients (22-56 years, 7 females) with brain tumours located in or near speech-eloquent areas of the brain were included in the analyses. Two different approaches were adopted for tumour segmentation from a multishell diffusion MRI acquisition: the first used a two-dimensional four group partition of feature space, whilst the second used data-driven clustering with Gaussian mixture modelling. For each approach, we assessed the capability of tumour microstructure to predict participants' cognitive outcomes after surgery and the strength of association between the BOLD signal of individual tumour subregions and the global BOLD signal. With both methodologies, the volumes of partially overlapped subregions within the tumour significantly predicted cognitive decline in verbal skills after surgery. We also found that these particular subregions were among those that showed greater functional interaction with the unaffected cortex. Our results indicate that tumour microstructure measured by MRI multishell diffusion is associated with cognitive recovery after surgery.
Collapse
Affiliation(s)
- Maite Aznarez-Sanado
- School of Education and Psychology, University of Navarra, 31009, Pamplona, Spain
| | - Rafael Romero-Garcia
- Department of Medical Physiology and Biophysics, Instituto de Biomedicina de Sevilla (IBiS), HUVR/CSIC/Universidad de Sevilla/CIBERSAM, ISCIII, 41013, Sevilla, Spain.
- Department of Psychiatry, University of Cambridge, Herchel Smith Bldg, Robinson Way, Cambridge, CB2 0SZ, UK.
| | - Chao Li
- Cambridge Brain Tumour Imaging Laboratory, Division of Neurosurgery, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
- Department of Applied Mathematics and Theoretical Physics, The Centre for Mathematical Imaging in Healthcare, Cambridge, CB3 0WA, UK
- School of Medicine & School of Science and Engineering, University of Dundee, Dundee, DD1 4HN, UK
| | - Rob C Morris
- Academic Neurosurgery Division, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Stephen J Price
- Cambridge Brain Tumour Imaging Laboratory, Division of Neurosurgery, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Thomas Manly
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, CB2 7EF, UK
| | - Thomas Santarius
- Academic Neurosurgery Division, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Yaara Erez
- Faculty of Engineering, Bar-Ilan University, 5290002, Ramat Gan, Israel
- The Gonda Multidisciplinary Brain Research Center, Bar-Ilan University, Ramat Gan, Israel
| | - Michael G Hart
- St George's, University of London and St George's University Hospitals NHS Foundation Trust, Institute of Molecular and Clinical Sciences, Neurosciences Research Centre, Cranmer Terrace, London, SW17 0RE, UK
| | - John Suckling
- Department of Psychiatry, University of Cambridge, Herchel Smith Bldg, Robinson Way, Cambridge, CB2 0SZ, UK
- Cambridge and Peterborough NHS Foundation Trust, Cambridge, CB21 5EF, UK
| |
Collapse
|
3
|
Luo Z, Amayri M, Fan W, Bouguila N. Cross-collection latent Beta-Liouville allocation model training with privacy protection and applications. APPL INTELL 2023; 53:1-25. [PMID: 36685642 PMCID: PMC9838479 DOI: 10.1007/s10489-022-04378-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/29/2022] [Indexed: 01/15/2023]
Abstract
Cross-collection topic models extend previous single-collection topic models, such as Latent Dirichlet Allocation (LDA), to multiple collections. The purpose of cross-collection topic modeling is to model document-topic representations and reveal similarities between each topic and differences among groups. However, the restriction of Dirichlet prior and the significant privacy risk have hampered those models' performance and utility. Training those cross-collection topic models may, in particular, leak sensitive information from the training dataset. To address the two issues mentioned above, we propose a novel model, cross-collection latent Beta-Liouville allocation (ccLBLA), which operates a more powerful prior, Beta-Liouville distribution with a more general covariance structure that enhances topic correlation analysis. To provide privacy protection for the ccLBLA model, we leverage the inherent differential privacy guarantee of the Collapsed Gibbs Sampling (CGS) inference scheme and then propose a hybrid privacy protection algorithm for the ccLBLA model (HPP-ccLBLA) that prevents inferring data from intermediate statistics during the CGS training process without sacrificing its utility. More crucially, our technique is the first attempt to use the cross-collection topic model in image classification applications and investigate the cross-collection topic model's capabilities beyond text analysis. The experimental results for comparative text mining and image classification will show the merits of our proposed approach.
Collapse
Affiliation(s)
- Zhiwen Luo
- The Concordia Institute for Information Systems Engineering (CIISE), Concordia University, Montréal, H3H 1M8 Québec Canada
| | - Manar Amayri
- The Concordia Institute for Information Systems Engineering (CIISE), Concordia University, Montréal, H3H 1M8 Québec Canada
- G-SCOP Lab, Grenoble Institute of Technology, Grenoble, 38031 France
| | - Wentao Fan
- Department of Computer Science, Beijing Normal University-Hong Kong Baptist University United International College (UIC), Zhuhai, Guangdong 519088 China
| | - Nizar Bouguila
- The Concordia Institute for Information Systems Engineering (CIISE), Concordia University, Montréal, H3H 1M8 Québec Canada
| |
Collapse
|
4
|
Ma Z, Lai Y, Xie J, Meng D, Kleijn WB, Guo J, Yu J. Dirichlet Process Mixture of Generalized Inverted Dirichlet Distributions for Positive Vector Data With Extended Variational Inference. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:6089-6102. [PMID: 34086578 DOI: 10.1109/tnnls.2021.3072209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
A Bayesian nonparametric approach for estimation of a Dirichlet process (DP) mixture of generalized inverted Dirichlet distributions [i.e., an infinite generalized inverted Dirichlet mixture model (InGIDMM)] has been proposed. The generalized inverted Dirichlet distribution has been proven to be efficient in modeling the vectors that contain only positive elements. Under the classical variational inference (VI) framework, the key challenge in the Bayesian estimation of InGIDMM is that the expectation of the joint distribution of data and variables cannot be explicitly calculated. Therefore, numerical methods are usually applied to simulate the optimal posterior distributions. With the recently proposed extended VI (EVI) framework, we introduce lower bound approximations to the original variational objective function in the VI framework such that an analytically tractable solution can be derived. Hence, the problem in numerical simulation has been overcome. By applying the DP mixture technique, an InGIDMM can automatically determine the number of mixture components from the observed data. Moreover, the DP mixture model with an infinite number of mixture components also avoids the problems of underfitting and overfitting. The performance of the proposed approach is demonstrated with both synthesized data and real-life data applications.
Collapse
|
5
|
Bregu O, Zamzami N, Bouguila N. Online mixture-based clustering for high dimensional count data using Neerchal–Morel distribution. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
6
|
Bregu O, Zamzami N, Bouguila N. Mixture‐based clustering for count data using approximated Fisher Scoring and Minorization–Maximization approaches. Comput Intell 2020. [DOI: 10.1111/coin.12429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Ornela Bregu
- Concordia Institute for Information Systems Engineering (CIISE) Concordia University Montreal Quebec Canada
| | - Nuha Zamzami
- Concordia Institute for Information Systems Engineering (CIISE) Concordia University Montreal Quebec Canada
- Department of Computer Science and Artificial Intelligence University of Jeddah Jeddah Saudi Arabia
| | - Nizar Bouguila
- Concordia Institute for Information Systems Engineering (CIISE) Concordia University Montreal Quebec Canada
| |
Collapse
|
7
|
Automated Nuclear Lamina Network Recognition and Quantitative Analysis in Structured Illumination Super-Resolution Microscope Images Using a Gaussian Mixture Model and Morphological Processing. PHOTONICS 2020. [DOI: 10.3390/photonics7040119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Studying the architecture of nuclear lamina networks is significantly important in biomedicine owing not only to their influence on the genome, but also because they are associated with several diseases. To save labor and time, an automated method for nuclear lamina network recognition and quantitative analysis is proposed for use with lattice structured illumination super-resolution microscope images in this study. This method is based on a Gaussian mixture model and morphological processing. It includes steps for target region generation, bias field correction, image segmentation, network connection, meshwork generation, and meshwork analysis. The effectiveness of the proposed method was confirmed by recognizing and quantitatively analyzing nuclear lamina networks in five images that are presented to show the method’s performance. The experimental results show that our algorithm achieved high accuracy in nuclear lamina network recognition and quantitative analysis, and the median face areas size of lamina networks from U2OS osteosarcoma cells are 0.3184 μm2.
Collapse
|
8
|
Ihou KE, Bouguila N, Bouachir W. Efficient integration of generative topic models into discriminative classifiers using robust probabilistic kernels. Pattern Anal Appl 2020. [DOI: 10.1007/s10044-020-00917-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
9
|
Ma Z, Xie J, Lai Y, Taghia J, Xue JH, Guo J. Insights Into Multiple/Single Lower Bound Approximation for Extended Variational Inference in Non-Gaussian Structured Data Modeling. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:2240-2254. [PMID: 30908264 DOI: 10.1109/tnnls.2019.2899613] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
For most of the non-Gaussian statistical models, the data being modeled represent strongly structured properties, such as scalar data with bounded support (e.g., beta distribution), vector data with unit length (e.g., Dirichlet distribution), and vector data with positive elements (e.g., generalized inverted Dirichlet distribution). In practical implementations of non-Gaussian statistical models, it is infeasible to find an analytically tractable solution to estimating the posterior distributions of the parameters. Variational inference (VI) is a widely used framework in Bayesian estimation. Recently, an improved framework, namely, the extended VI (EVI), has been introduced and applied successfully to a number of non-Gaussian statistical models. EVI derives analytically tractable solutions by introducing lower bound approximations to the variational objective function. In this paper, we compare two approximation strategies, namely, the multiple lower bounds (MLBs) approximation and the single lower bound (SLB) approximation, which can be applied to carry out the EVI. For implementation, two different conditions, the weak and the strong conditions, are discussed. Convergence of the EVI depends on the selection of the lower bound, regardless of the choice of weak or strong condition. We also discuss the convergence properties to clarify the differences between MLB and SLB. Extensive comparisons are made based on some EVI-based non-Gaussian statistical models. Theoretical analysis is conducted to demonstrate the differences between the weak and strong conditions. Experimental results based on real data show advantages of the SLB approximation over the MLB approximation.
Collapse
|
10
|
Zamzami N, Bouguila N. High-dimensional count data clustering based on an exponential approximation to the multinomial Beta-Liouville distribution. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.03.028] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
11
|
Xia Y. Correlation and association analyses in microbiome study integrating multiomics in health and disease. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2020; 171:309-491. [PMID: 32475527 DOI: 10.1016/bs.pmbts.2020.04.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Correlation and association analyses are one of the most widely used statistical methods in research fields, including microbiome and integrative multiomics studies. Correlation and association have two implications: dependence and co-occurrence. Microbiome data are structured as phylogenetic tree and have several unique characteristics, including high dimensionality, compositionality, sparsity with excess zeros, and heterogeneity. These unique characteristics cause several statistical issues when analyzing microbiome data and integrating multiomics data, such as large p and small n, dependency, overdispersion, and zero-inflation. In microbiome research, on the one hand, classic correlation and association methods are still applied in real studies and used for the development of new methods; on the other hand, new methods have been developed to target statistical issues arising from unique characteristics of microbiome data. Here, we first provide a comprehensive view of classic and newly developed univariate correlation and association-based methods. We discuss the appropriateness and limitations of using classic methods and demonstrate how the newly developed methods mitigate the issues of microbiome data. Second, we emphasize that concepts of correlation and association analyses have been shifted by introducing network analysis, microbe-metabolite interactions, functional analysis, etc. Third, we introduce multivariate correlation and association-based methods, which are organized by the categories of exploratory, interpretive, and discriminatory analyses and classification methods. Fourth, we focus on the hypothesis testing of univariate and multivariate regression-based association methods, including alpha and beta diversities-based, count-based, and relative abundance (or compositional)-based association analyses. We demonstrate the characteristics and limitations of each approaches. Fifth, we introduce two specific microbiome-based methods: phylogenetic tree-based association analysis and testing for survival outcomes. Sixth, we provide an overall view of longitudinal methods in analysis of microbiome and omics data, which cover standard, static, regression-based time series methods, principal trend analysis, and newly developed univariate overdispersed and zero-inflated as well as multivariate distance/kernel-based longitudinal models. Finally, we comment on current association analysis and future direction of association analysis in microbiome and multiomics studies.
Collapse
Affiliation(s)
- Yinglin Xia
- Department of Medicine, University of Illinois at Chicago, Chicago, IL, United States.
| |
Collapse
|
12
|
|
13
|
Fan W, Bouguila N, Du JX, Liu X. Axially Symmetric Data Clustering Through Dirichlet Process Mixture Models of Watson Distributions. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:1683-1694. [PMID: 30369452 DOI: 10.1109/tnnls.2018.2872986] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
This paper proposes a Bayesian nonparametric framework for clustering axially symmetric data. Our approach is based on a Dirichlet processes mixture model with Watson distributions, which can also be considered as the infinite Watson mixture model. In this paper, first, we extend the finite Watson mixture model into its infinite counterpart based on the framework of truncated Dirichlet process mixture model with a stick-breaking representation. Second, we propose a coordinate ascent mean-field variational inference algorithm that can effectively learn the parameters of our model with closed-form solutions; Third, to cope with a massive data set, we develop a stochastic variational inference algorithm to learn the proposed model through the method of stochastic gradient ascent; Finally, the proposed nonparametric Bayesian model is evaluated through simulated axially symmetric data sets and a real-world application, namely, gene expression data clustering.
Collapse
|
14
|
Hybrid generative discriminative approaches based on Multinomial Scaled Dirichlet mixture models. APPL INTELL 2019. [DOI: 10.1007/s10489-019-01437-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
15
|
Ihou KE, Bouguila N. Variational-based latent generalized Dirichlet allocation model in the collapsed space and applications. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.12.046] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
16
|
Zamzami N, Bouguila N. Model selection and application to high-dimensional count data clustering. APPL INTELL 2018. [DOI: 10.1007/s10489-018-1333-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
17
|
|
18
|
Nakada Y. Improving on Deterministic Approximate Bayesian Inferences for Mixture Distributions. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2016; 27:2282-2300. [PMID: 26452291 DOI: 10.1109/tnnls.2015.2477361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
This paper presents the branching approach, which can improve deterministic implementation methods (such as variational Bayesian inference and expectation propagation method) for Bayesian mixture distributions. This proposed approach utilizes a set of artificial conditions defined by using the latent variables of the mixture distribution. This condition set is updated iteratively by branching based on a condition selected from the previous condition set. The target approximate Bayesian inference is obtained by merging the approximate conditional inferences under each condition in the condition set. The proposed approach is compared with several standard implementation methods by using a numerical example and a real-world example.
Collapse
|
19
|
Perina A, Jojic N. Capturing Spatial Interdependence in Image Features: The Counting Grid, an Epitomic Representation for Bags of Features. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2015; 37:2374-2387. [PMID: 26539844 DOI: 10.1109/tpami.2015.2424864] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
In recent scene recognition research images or large image regions are often represented as disorganized "bags" of features which can then be analyzed using models originally developed to capture co-variation of word counts in text. However, image feature counts are likely to be constrained in different ways than word counts in text. For example, as a camera pans upwards from a building entrance over its first few floors and then further up into the sky Fig. 1 Fig. 1. Feature counts change slightly as the field of view moves. For example, the abundance of the "car" features is reduced, but the counts of the features found on building facades are increased. The counting grid model accounts for such changes naturally, and it can also account for images of different scenes.
Collapse
|
20
|
Probabilistic approach for QoS-aware recommender system for trustworthy web service selection. APPL INTELL 2014. [DOI: 10.1007/s10489-014-0537-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
21
|
Fan W, Bouguila N. Online learning of a Dirichlet process mixture of Beta-Liouville distributions via variational inference. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2013; 24:1850-1862. [PMID: 24808617 DOI: 10.1109/tnnls.2013.2268461] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
A large class of problems can be formulated in terms of the clustering process. Mixture models are an increasingly important tool in statistical pattern recognition and for analyzing and clustering complex data. Two challenging aspects that should be addressed when considering mixture models are how to choose between a set of plausible models and how to estimate the model's parameters. In this paper, we address both problems simultaneously within a unified online nonparametric Bayesian framework that we develop to learn a Dirichlet process mixture of Beta-Liouville distributions (i.e., an infinite Beta-Liouville mixture model). The proposed infinite model is used for the online modeling and clustering of proportional data for which the Beta-Liouville mixture has been shown to be effective. We propose a principled approach for approximating the intractable model's posterior distribution by a tractable one-which we develop-such that all the involved mixture's parameters can be estimated simultaneously and effectively in a closed form. This is done through variational inference that enjoys important advantages, such as handling of unobserved attributes and preventing under or overfitting; we explain that in detail. The effectiveness of the proposed work is evaluated on three challenging real applications, namely facial expression recognition, behavior modeling and recognition, and dynamic textures clustering.
Collapse
|
22
|
Zhang H, Wu QMJ, Nguyen TM. Incorporating mean template into finite mixture model for image segmentation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2013; 24:328-335. [PMID: 24808286 DOI: 10.1109/tnnls.2012.2228227] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The well-known finite mixture model (FMM) has been regarded as a useful tool for image segmentation application. However, the pixels in FMM are considered independent of each other and the spatial relationship between neighboring pixels is not taken into account. These limitations make the FMM more sensitive to noise. In this brief, we propose a simple and effective method to make the traditional FMM more robust to noise with the help of a mean template. FMM can be considered a linear combination of prior and conditional probability from the expression of its mathematical formula. We calculate these probabilities with two mean templates: a weighted arithmetic mean template and a weighted geometric mean template. Thus, in our model, the prior probability (or conditional probability) of an image pixel is influenced by the probabilities of pixels in its immediate neighborhood to incorporate the local spatial and intensity information for eliminating the noise. Finally, our algorithm is general enough and can be extended to any other FMM-based models to achieve super performance. Experimental results demonstrate the improved robustness and effectiveness of our approach.
Collapse
|
23
|
Holmes I, Harris K, Quince C. Dirichlet multinomial mixtures: generative models for microbial metagenomics. PLoS One 2012; 7:e30126. [PMID: 22319561 PMCID: PMC3272020 DOI: 10.1371/journal.pone.0030126] [Citation(s) in RCA: 483] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2011] [Accepted: 12/12/2011] [Indexed: 12/12/2022] Open
Abstract
We introduce Dirichlet multinomial mixtures (DMM) for the probabilistic modelling of microbial metagenomics data. This data can be represented as a frequency matrix giving the number of times each taxa is observed in each sample. The samples have different size, and the matrix is sparse, as communities are diverse and skewed to rare taxa. Most methods used previously to classify or cluster samples have ignored these features. We describe each community by a vector of taxa probabilities. These vectors are generated from one of a finite number of Dirichlet mixture components each with different hyperparameters. Observed samples are generated through multinomial sampling. The mixture components cluster communities into distinct 'metacommunities', and, hence, determine envirotypes or enterotypes, groups of communities with a similar composition. The model can also deduce the impact of a treatment and be used for classification. We wrote software for the fitting of DMM models using the 'evidence framework' (http://code.google.com/p/microbedmm/). This includes the Laplace approximation of the model evidence. We applied the DMM model to human gut microbe genera frequencies from Obese and Lean twins. From the model evidence four clusters fit this data best. Two clusters were dominated by Bacteroides and were homogenous; two had a more variable community composition. We could not find a significant impact of body mass on community structure. However, Obese twins were more likely to derive from the high variance clusters. We propose that obesity is not associated with a distinct microbiota but increases the chance that an individual derives from a disturbed enterotype. This is an example of the 'Anna Karenina principle (AKP)' applied to microbial communities: disturbed states having many more configurations than undisturbed. We verify this by showing that in a study of inflammatory bowel disease (IBD) phenotypes, ileal Crohn's disease (ICD) is associated with a more variable community.
Collapse
Affiliation(s)
- Ian Holmes
- Department of Bioengineering, University of California, Berkeley, California, United States of America
| | - Keith Harris
- School of Engineering, University of Glasgow, Glasgow, United Kingdom
| | - Christopher Quince
- School of Engineering, University of Glasgow, Glasgow, United Kingdom
- * E-mail:
| |
Collapse
|
24
|
Cheng J, Sayeh MR, Zargham MR, Cheng Q. Real-time vector quantization and clustering based on ordinary differential equations. IEEE TRANSACTIONS ON NEURAL NETWORKS 2011; 22:2143-8. [PMID: 22057062 DOI: 10.1109/tnn.2011.2172627] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
This brief presents a dynamical system approach to vector quantization or clustering based on ordinary differential equations with the potential for real-time implementation. Two examples of different pattern clusters demonstrate that the model can successfully quantize different types of input patterns. Furthermore, we analyze and study the stability of our dynamical system. By discovering the equilibrium points for certain input patterns and analyzing their stability, we have shown the quantizing behavior of the system with respect to its vigilance parameter. The proposed system is applied to two real-world problems, providing comparable results to the best reported findings. This validates the effectiveness of our proposed approach.
Collapse
Affiliation(s)
- Jie Cheng
- Department of Computer Science, University of Hawaii, Hilo, HI 96720, USA.
| | | | | | | |
Collapse
|