1
|
Feng J, Zhang SW, Chen L, Zuo C. Detection of Alzheimer’s Disease Using Features of Brain Region-of-Interest-Based Individual Network Constructed with the sMRI Image. Comput Med Imaging Graph 2022; 98:102057. [DOI: 10.1016/j.compmedimag.2022.102057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 02/18/2022] [Accepted: 03/17/2022] [Indexed: 10/18/2022]
|
2
|
Khasawneh AM, Bukhari A, Al-Khasawneh MA. Early Detection of Medical Image Analysis by Using Machine Learning Method. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:3041811. [PMID: 38170039 PMCID: PMC10761224 DOI: 10.1155/2022/3041811] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 02/02/2022] [Accepted: 02/07/2022] [Indexed: 01/05/2024]
Abstract
We develop effective medical image classification techniques, with an emphasis on histopathology and magnetic resonance imaging (MRI). The trainer utilized the curriculum as a starting point for a set of data and a restricted number of samples, and we used it as a starting point for a set of data. As calibrating a machine learning model is difficult, we used alternative methods as unsupervised feature extracts or weight-conditioning factors for identifying pathological histology pictures. As a result, the pretrained models will be trained on 3-channel RGB pictures, while the MRI sample has more slices. To alter the working model using the MRI data, the convolutional neural network (CNN) must be fine-tuned. Pretrained models are placed and then used as feature snippets. However, there is a scarcity of well-done medical photos, making training machine learning models a difficult endeavor to begin with. In any case, data augmentation aids in the generation of sufficient training samples; however, it is unclear if data augmentation aids in the prediction of unknown data samples. As a result, we fine-tuned machine learning models without using any additional data. Furthermore, rather than utilizing a standard machine learning classifier for the MRI data, we created a unique CNN that uses both 3D shear descriptors and deep features as input. This custom network identifies the MRI sample after processing our representation of the characteristics from beginning to end. On the hidden MRI dataset, our bespoke CNN outperforms traditional machine learning. Our CNN model is less prone to overfitting as a result of this. Furthermore, we have given cutting-edge outcomes employing machine learning.
Collapse
Affiliation(s)
| | - Amal Bukhari
- College of Computer Science and Engineering, University of Jeddah, Saudi Arabia
| | - Mahmoud Ahmad Al-Khasawneh
- School of Information Technology, Skyline University College, University City Sharjah, 1797 Sharjah, UAE
| |
Collapse
|
3
|
Baruah HG, Nath VK, Hazarika D, Hatibaruah R. Local bit-plane neighbour dissimilarity pattern in non-subsampled shearlet transform domain for bio-medical image retrieval. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:1609-1632. [PMID: 35135220 DOI: 10.3934/mbe.2022075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
This paper introduces a novel descriptor non-subsampled shearlet transform (NSST) local bit-plane neighbour dissimilarity pattern (NSST-LBNDP) for biomedical image retrieval based on NSST, bit-plane slicing and local pattern based features. In NSST-LBNDP, the input image is first decomposed by NSST, followed by introduction of non-linearity on the NSST coefficients by computing local energy features. The local energy features are next normalized into 8-bit values. The multiscale NSST is used to provide translational invariance and has flexible directional sensitivity to catch more anisotropic information of an image. The normalised NSST subband features are next decomposed into bit-plane slices in order to capture very fine to coarse subband details. Then each bit-plane slices of all the subbands are encoded by exploiting the dissimilarity relationship between each neighbouring pixel and its adjacent neighbours. Experiments on two computed tomography (CT) and one magnetic resonance imaging (MRI) image datasets confirms the superior results of NSST-LBNDP when compared to many recent well known relevant descriptors both in terms of average retrieval precision (ARP) and average retrieval recall (ARR).
Collapse
Affiliation(s)
- Hilly Gohain Baruah
- Department of Electronics and Communication Engineering, School of Engineering, Tezpur University, Napaam, Tezpur, Assam 784028, India
| | - Vijay Kumar Nath
- Department of Electronics and Communication Engineering, School of Engineering, Tezpur University, Napaam, Tezpur, Assam 784028, India
| | - Deepika Hazarika
- Department of Electronics and Communication Engineering, School of Engineering, Tezpur University, Napaam, Tezpur, Assam 784028, India
| | - Rakcinpha Hatibaruah
- Department of Electronics and Communication Engineering, School of Engineering, Tezpur University, Napaam, Tezpur, Assam 784028, India
| |
Collapse
|
4
|
Intelligent vision based wear forecasting on surfaces of machine tool elements. SN APPLIED SCIENCES 2021. [DOI: 10.1007/s42452-021-04839-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
Abstract
Abstract
To realize autonomous production machines it is necessary that machines are able to automatically and autonomously predict their condition. Although many classical as well as Deep Learning based approaches have shown the ability to classify faults, so far there are no approaches that go beyond the basic detection of faults. A complete, image based predictive maintenance approach for machine tool components has to the best of our knowledge not been investigated so far. In this paper it is shown how defects on a Ball Screw Drive (BSD) can be automatically detected by using a machine learning based detection module, quantified by using an intelligent defect quantification module and finally forecasted by a prognosis module in a combined approach. To optimize the presented method, it is shown how existing domain knowledge can be formalized in an expert system and combined with the predictions of the machine learning model to aid quality of the prediction system. This enables the practitioner to forecast the severity of failures on BSD and prevent machine breakdowns. The work is intended to set new accents for the development of practical predictive maintenance systems and bridging the fields of machine learning and production engineering. The code is available under: https://github.com/2Obe/Pitting_Pred_Maintenance.
Collapse
|
5
|
YOT-Net: YOLOv3 Combined Triplet Loss Network for Copper Elbow Surface Defect Detection. SENSORS 2021; 21:s21217260. [PMID: 34770569 PMCID: PMC8586934 DOI: 10.3390/s21217260] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Revised: 10/25/2021] [Accepted: 10/26/2021] [Indexed: 11/29/2022]
Abstract
Copper elbows are an important product in industry. They are used to connect pipes for transferring gas, oil, and liquids. Defective copper elbows can lead to serious industrial accidents. In this paper, a novel model named YOT-Net (YOLOv3 combined triplet loss network) is proposed to automatically detect defective copper elbows. To increase the defect detection accuracy, triplet loss function is employed in YOT-Net. The triplet loss function is introduced into the loss module of YOT-Net, which utilizes image similarity to enhance feature extraction ability. The proposed method of YOT-Net shows outstanding performance in copper elbow surface defect detection.
Collapse
|
6
|
Alinsaif S, Lang J. 3D shearlet-based descriptors combined with deep features for the classification of Alzheimer's disease based on MRI data. Comput Biol Med 2021; 138:104879. [PMID: 34598071 DOI: 10.1016/j.compbiomed.2021.104879] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Revised: 09/15/2021] [Accepted: 09/15/2021] [Indexed: 01/14/2023]
Abstract
Alzheimer's disease (AD) is a neurodegenerative disease that afflicts millions of people worldwide. Early detection of AD is critical, as drug trials show a promising advantage to those patients with early diagnoses. In this study, magnetic resonance imaging (MRI) datasets from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and The Open Access Series of Imaging Studies are used. Our method for performing the classification of AD is to combine a set of shearlet-based descriptors with deep features. A major challenge in classifying such MRI datasets is the high dimensionality of feature vectors because of the large number of slices of each MRI sample. Given the volumetric nature of the MRI data, we propose using the 3D shearlet transform (3D-ST), but we obtain the average of all directionalities, which reduces the dimensionality. On the other hand, we propose to leverage the capabilities of convolutional neural networks (CNN) to learn feature maps from stacked MRI slices, which generate a very compact feature vector for each MRI sample. The 3D-ST and CNN feature vectors are combined for the classification of AD. After the concatenation of the feature vectors, they are used to train a classifier. Alternatively, a custom CNN model is utilized, in which the descriptors are further processed end to end to obtain the classification model. Our experimental results show that the fusion of shearlet-based descriptors and deep features improves classification performance, especially on the ADNI dataset.
Collapse
Affiliation(s)
- Sadiq Alinsaif
- School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, K1N 6N5, Canada; College of Computer Science and Engineering, University of Hafr Al Batin, Al Jamiah, Hafar Al Batin, 39524, Saudi Arabia.
| | - Jochen Lang
- School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, K1N 6N5, Canada.
| | | |
Collapse
|
7
|
Ghalati MK, Nunes A, Ferreira H, Serranho P, Bernardes R. Texture Analysis and its Applications in Biomedical Imaging: A Survey. IEEE Rev Biomed Eng 2021; 15:222-246. [PMID: 34570709 DOI: 10.1109/rbme.2021.3115703] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Texture analysis describes a variety of image analysis techniques that quantify the variation in intensity and pattern. This paper provides an overview of several texture analysis approaches addressing the rationale supporting them, their advantages, drawbacks, and applications. This surveys emphasis is in collecting and categorising over five decades of active research on texture analysis. Brief descriptions of different approaches are presented along with application examples. From a broad range of texture analysis applications, this surveys final focus is on biomedical image analysis. An up-to-date list of biological tissues and organs in which disorders produce texture changes that may be used to spot disease onset and progression is provided. Finally, the role of texture analysis methods as biomarkers of disease is summarised.
Collapse
|
8
|
Zhang J, Liu M, Lu K, Gao Y. Group-Wise Learning for Aurora Image Classification With Multiple Representations. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:4112-4124. [PMID: 30932858 DOI: 10.1109/tcyb.2019.2903591] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In conventional aurora image classification methods, it is general to employ only one single feature representation to capture the morphological characteristics of aurora images, which is difficult to describe the complicated morphologies of different aurora categories. Although several studies have proposed to use multiple feature representations, the inherent correlation among these representations are usually neglected. To address this problem, we propose a group-wise learning (GWL) method for the automatic aurora image classification using multiple representations. Specifically, we first extract the multiple feature representations for aurora images, and then construct a graph in each of multiple feature spaces. To model the correlation among different representations, we partition multiple graphs into several groups via a clustering algorithm. We further propose a GWL model to automatically estimate class labels for aurora images and optimal weights for the multiple representations in a data-driven manner. Finally, we develop a label fusion approach to make a final classification decision for new testing samples. The proposed GWL method focuses on the diverse properties of multiple feature representations, by clustering the correlated representations into the same group. We evaluate our method on an aurora image data set that contains 12 682 aurora images from 19 days. The experimental results demonstrate that the proposed GWL method achieves approximately 6% improvement in terms of classification accuracy, compared to the methods using a single feature representation.
Collapse
|
9
|
Liu M, Jiao L, Liu X, Li L, Liu F, Yang S. C-CNN: Contourlet Convolutional Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:2636-2649. [PMID: 32692683 DOI: 10.1109/tnnls.2020.3007412] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Extracting effective features is always a challenging problem for texture classification because of the uncertainty of scales and the clutter of textural patterns. For texture classification, spectral analysis is traditionally employed in the frequency domain. Recent studies have shown the potential of convolutional neural networks (CNNs) when dealing with the texture classification task in the spatial domain. In this article, we try combining both approaches in different domains for more abundant information and proposed a novel network architecture named contourlet CNN (C-CNN). The network aims to learn sparse and effective feature representations for images. First, the contourlet transform is applied to get the spectral features from an image. Second, the spatial-spectral feature fusion strategy is designed to incorporate the spectral features into CNN architecture. Third, the statistical features are integrated into the network by the statistical feature fusion. Finally, the results are obtained by classifying the fusion features. We also investigated the behavior of the parameters in contourlet decomposition. Experiments on the widely used three texture data sets (kth-tips2-b, DTD, and CUReT) and five remote sensing data sets (UCM, WHU-RS, AID, RSSCN7, and NWPU-RESISC45) demonstrate that the proposed approach outperforms several well-known classification methods in terms of classification accuracy with fewer trainable parameters.
Collapse
|
10
|
Yao Y, Chen H, Yao X. Discriminative Learning in the Model Space for Symbolic Sequence Classification. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2021. [DOI: 10.1109/tetci.2019.2914266] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
11
|
Chen L, Li M, Su W, Wu M, Hirota K, Pedrycz W. Adaptive Feature Selection-Based AdaBoost-KNN With Direct Optimization for Dynamic Emotion Recognition in Human–Robot Interaction. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2021. [DOI: 10.1109/tetci.2019.2909930] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
12
|
Feng J, Zhang SW, Chen L, Xia J. Alzheimer’s disease classification using features extracted from nonsubsampled contourlet subband-based individual networks. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.09.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
13
|
Alinsaif S, Lang J. Texture features in the Shearlet domain for histopathological image classification. BMC Med Inform Decis Mak 2020; 20:312. [PMID: 33323118 PMCID: PMC7739509 DOI: 10.1186/s12911-020-01327-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Background A various number of imaging modalities are available (e.g., magnetic resonance, x-ray, ultrasound, and biopsy) where each modality can reveal different structural aspects of tissues. However, the analysis of histological slide images that are captured using a biopsy is considered the gold standard to determine whether cancer exists. Furthermore, it can reveal the stage of cancer. Therefore, supervised machine learning can be used to classify histopathological tissues. Several computational techniques have been proposed to study histopathological images with varying levels of success. Often handcrafted techniques based on texture analysis are proposed to classify histopathological tissues which can be used with supervised machine learning.
Methods In this paper, we construct a novel feature space to automate the classification of tissues in histology images. Our feature representation is to integrate various features sets into a new texture feature representation. All of our descriptors are computed in the complex Shearlet domain. With complex coefficients, we investigate not only the use of magnitude coefficients, but also study the effectiveness of incorporating the relative phase (RP) coefficients to create the input feature vector. In our study, four texture-based descriptors are extracted from the Shearlet coefficients: co-occurrence texture features, Local Binary Patterns, Local Oriented Statistic Information Booster, and segmentation-based Fractal Texture Analysis. Each set of these attributes captures significant local and global statistics. Therefore, we study them individually, but additionally integrate them to boost the accuracy of classifying the histopathology tissues while being fed to classical classifiers. To tackle the problem of high-dimensionality, our proposed feature space is reduced using principal component analysis. In our study, we use two classifiers to indicate the success of our proposed feature representation: Support Vector Machine (SVM) and Decision Tree Bagger (DTB). Results Our feature representation delivered high performance when used on four public datasets. As such, the best achieved accuracy: multi-class Kather (i.e., 92.56%), BreakHis (i.e., 91.73%), Epistroma (i.e., 98.04%), Warwick-QU (i.e., 96.29%). Conclusions Our proposed method in the Shearlet domain for the classification of histopathological images proved to be effective when it was investigated on four different datasets that exhibit different levels of complexity.
Collapse
|
14
|
Li X, Song D, Dong Y. Hierarchical Feature Fusion Network for Salient Object Detection. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; PP:9165-9175. [PMID: 32946391 DOI: 10.1109/tip.2020.3023774] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Convolutional Neural Network (CNN) has shown their advantages in salient object detection. CNN can generate great saliency maps because it can obtain high-level semantic information. And the semantic information is usually achieved by stacking multiple convolutional layers and pooling layers. However, multiple pooling operations will reduce the size of the feature map and easily blur the boundary of the salient object. Therefore, such operations are not beneficial to generate great saliency results. To alleviate this issue, we propose a novel edge information-guided hierarchical feature fusion network (HFFNet). Our network fuses features hierarchically and retains accurate semantic information and clear edge information effectively. Specifically, we extract image features from different levels of VGG. Then, we fuse the features hierarchically to generate high-level semantic information and low-level edge information. In order to retain better information at different levels, we adopt a one-to-one hierarchical supervision strategy to supervise the generation of low-level information and high-level information respectively. Finally, we use low-level edge information to guide the saliency map generation, and the edge guidance fusion is able to identify saliency regions effectively. The proposed HFFNet has been extensively evaluated on five traditional benchmark datasets. The experimental results demonstrate that the proposed model is fairly effective in salient object detection compared with 10 state-of-the-art models under different evaluation indicators, and it is superior to most of the comparison models.
Collapse
|
15
|
Yuan C, Xia Z, Sun X, Wu QMJ. Deep Residual Network With Adaptive Learning Framework for Fingerprint Liveness Detection. IEEE Trans Cogn Dev Syst 2020. [DOI: 10.1109/tcds.2019.2920364] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
16
|
Ali H, Sharif M, Yasmin M, Rehmani MH, Riaz F. A survey of feature extraction and fusion of deep learning for detection of abnormalities in video endoscopy of gastrointestinal-tract. Artif Intell Rev 2019. [DOI: 10.1007/s10462-019-09743-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
17
|
Synchronization correction-based robust digital image watermarking approach using Bessel K-form PDF. Pattern Anal Appl 2019. [DOI: 10.1007/s10044-019-00828-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
18
|
Bhunia AK, Bhattacharyya A, Banerjee P, Roy PP, Murala S. A novel feature descriptor for image retrieval by combining modified color histogram and diagonally symmetric co-occurrence texture pattern. Pattern Anal Appl 2019. [DOI: 10.1007/s10044-019-00827-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
19
|
Deng C, Li Z, Gao X, Tao D. Deep Multi-scale Discriminative Networks for Double JPEG Compression Forensics. ACM T INTEL SYST TEC 2019. [DOI: 10.1145/3301274] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
As JPEG is the most widely used image format, the importance of tampering detection for JPEG images in blind forensics is self-evident. In this area, extracting effective statistical characteristics from a JPEG image for classification remains a challenge. Effective features are designed manually in traditional methods, suggesting that extensive labor-consuming research and derivation is required. In this article, we propose a novel image tampering detection method based on deep multi-scale discriminative networks (MSD-Nets). The multi-scale module is designed to automatically extract multiple features from the discrete cosine transform (DCT) coefficient histograms of the JPEG image. This module can capture the characteristic information in different scale spaces. In addition, a discriminative module is also utilized to improve the detection effect of the networks in those difficult situations when the first compression quality (
QF
1) is higher than the second one (
QF
2). A special network in this module is designed to distinguish the small statistical difference between authentic and tampered regions in these cases. Finally, a probability map can be obtained and the specific tampering area is located using the last classification results. Extensive experiments demonstrate the superiority of our proposed method in both quantitative and qualitative metrics when compared with state-of-the-art approaches.
Collapse
Affiliation(s)
| | - Zhao Li
- Xidian University, Xi’an, China
| | | | - Dacheng Tao
- University of Sydney, Darlington NSW, Australia
| |
Collapse
|
20
|
|
21
|
Li X, Cui G, Dong Y. Discriminative and Orthogonal Subspace Constraints-Based Nonnegative Matrix Factorization. ACM T INTEL SYST TEC 2018. [DOI: 10.1145/3229051] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Nonnegative matrix factorization (NMF) is one widely used feature extraction technology in the tasks of image clustering and image classification. For the former task, various unsupervised NMF methods based on the data distribution structure information have been proposed. While for the latter task, the label information of the dataset is one very important guiding. However, most previous proposed supervised NMF methods emphasis on imposing the discriminant constraints on the coefficient matrix. When dealing with new coming samples, the transpose or the pseudoinverse of the basis matrix is used to project these samples to the low dimension space. In this way, the label influence to the basis matrix is indirect. Although, there are also some methods trying to constrain the basis matrix in NMF framework, either they only restrict within-class samples or impose improper constraint on the basis matrix. To address these problems, in this article a novel NMF framework named discriminative and orthogonal subspace constraints-based nonnegative matrix factorization (DOSNMF) is proposed. In DOSNMF, the discriminative constraints are imposed on the projected subspace instead of the directly learned representation. In this manner, the discriminative information is directly connected with the projected subspace. At the same time, an orthogonal term is incorporated in DOSNMF to adjust the orthogonality of the learned basis matrix, which can ensure the orthogonality of the learned subspace and improve the sparseness of the basis matrix at the same time. This framework can be implemented in two ways. The first way is based on the manifold learning theory. In this way, two graphs, i.e., the intrinsic graph and the penalty graph, are constructed to capture the intra-class structure and the inter-class distinctness. With this design, both the manifold structure information and the discriminative information of the dataset are utilized. For convenience, we name this method as the name of the framework, i.e., DOSNMF. The second way is based on the Fisher’s criterion, we name it Fisher’s criterion-based DOSNMF (FDOSNMF). The objective functions of DOSNMF and FDOSNMF can be easily optimized using multiplicative update (MU) rules. The new methods are tested on five datasets and compared with several supervised and unsupervised variants of NMF. The experimental results reveal the effectiveness of the proposed methods.
Collapse
Affiliation(s)
- Xuelong Li
- Chinese Academy of Sciences, Shaanxi, P. R. China
| | - Guosheng Cui
- Chinese Academy of Sciences, Shaanxi, P. R. China
| | | |
Collapse
|
22
|
Ji L, Ren Y, Liu G, Pu X. Training-Based Gradient LBP Feature Models for Multiresolution Texture Classification. IEEE TRANSACTIONS ON CYBERNETICS 2018; 48:2683-2696. [PMID: 28922134 DOI: 10.1109/tcyb.2017.2748500] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Local binary pattern (LBP) is a simple, yet efficient coding model for extracting texture features. To improve texture classification, this paper designs a median sampling regulation, defines a group of gradient LBP (gLBP) descriptors, proposes a training-based feature model mapping method, and then develops a texture classification frame using the multiresolution feature fusion of four gLBP descriptors. Cooperated by median sampling, four descriptors encode a pixel respectively by central gradient, radial gradient, magnitude gradient and tangent gradient to generate initial gLBP patterns. The feature mapping models of gLBP descriptors are constructed by the maximal relative-variation rate (mr2) of rotation-invariant patterns, and then prestored as mapping lookup files. By mapping, initial patterns can be transformed into low-dimensional ones. And then it generates multiresolution texture features via the joint and concatenation of gLBP descriptors on different sampling parameters. A trained nearest neighbor classifier with chi-square distance is applied to classify textures by feature histograms. The experimental results of simulation on five public texture databases show that the proposed method is reliable and efficient in texture classification. In comparison with nine other similar approaches, including two state-of-the-art ones, the proposed method runs faster than most of them and also outperforms all of them in terms of classification accuracy and noise robustness. It achieves higher accuracy and has also better robustness to the Salt&Pepper and Gaussian noise added artificially into texture images.
Collapse
|
23
|
|
24
|
|
25
|
Jia S, Shen L, Zhu J, Li Q. A 3-D Gabor Phase-Based Coding and Matching Framework for Hyperspectral Imagery Classification. IEEE TRANSACTIONS ON CYBERNETICS 2018; 48:1176-1188. [PMID: 28368844 DOI: 10.1109/tcyb.2017.2682846] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
As manual labeling is very difficult and time-consuming, the labeled samples used to train a supervised classifier are generally limited, which become one of the biggest challenge for hyperspectral imagery classification. In order to tackle this issue, a recent trend is to exploit the structure information of materials, as which reflects the region homogeneity in the spatial domain and offers an invaluable complement to the spectral information. In this respect, 3-D Gabor wavelets have been introduced to extract joint spectral-spatial features for hyperspectral images. One the one hand, the features extracted by 3-D Gabor wavelets lead to very good performance for classification. On the other hand, its drawbacks, i.e., big number of features and high computational cost, limit its applicability. In this paper, a 3-D Gabor-wavelet-based phase coding and Hamming distance-based matching (3DGPC-HDM) framework is developed for hyperspectral imagery classification. The proposed method, instead of taking into account the large volume of Gabor magnitude features, exploits the Gabor phase features with certain orientations (i.e., the direction parallel to the spectral axis), which are then encoded by a simple quadrant bit coding scheme. After that, a normalized Hamming distance matching (HDM) method is adopted to determine the similarity of two samples, and the nearest neighbor classifier is routinely utilized for pixelwise recognition. Finally, experiments on three real hyperspectral data sets show that the proposed 3DGPC-HDM leads to very good performance. Comparisons with the state-of-the-art methods in the literature, in terms of both classifier complexity and generalization ability from very small training sets, are also included.
Collapse
|
26
|
Li W, Wang Y, Xu Z, Jiang Y, Lu Z, Liao Q. Weighted contourlet binary patterns and image-based fisher linear discriminant for face recognition. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.06.045] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
27
|
Li X, Cui G, Dong Y. Graph Regularized Non-Negative Low-Rank Matrix Factorization for Image Clustering. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:3840-3853. [PMID: 27448379 DOI: 10.1109/tcyb.2016.2585355] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Non-negative matrix factorization (NMF) has been one of the most popular methods for feature learning in the field of machine learning and computer vision. Most existing works directly apply NMF on high-dimensional image datasets for computing the effective representation of the raw images. However, in fact, the common essential information of a given class of images is hidden in their low rank parts. For obtaining an effective low-rank data representation, we in this paper propose a non-negative low-rank matrix factorization (NLMF) method for image clustering. For the purpose of improving its robustness for the data in a manifold structure, we further propose a graph regularized NLMF by incorporating the manifold structure information into our proposed objective function. Finally, we develop an efficient alternating iterative algorithm to learn the low-dimensional representation of low-rank parts of images for clustering. Alternatively, we also incorporate robust principal component analysis into our proposed scheme. Experimental results on four image datasets reveal that our proposed methods outperform four representative methods.
Collapse
|
28
|
Hu Z, Wen Y, Liu L, Jiang J, Hong R, Wang M, Yan S. Visual Classification of Furniture Styles. ACM T INTEL SYST TEC 2017. [DOI: 10.1145/3065951] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Furniture style describes the discriminative appearance characteristics of furniture. It plays an important role in real-world indoor decoration. In this article, we explore the furniture style features and study the problem of furniture style classification. Differing from traditional object classification, furniture style classification aims at classifying different furniture in terms of the “style” that describes its appearance (e.g., American style, Gothic style, Rococo style, etc.) rather than the “kind” that is more related to its functional structure (e.g., bed, desk, etc.). To pursue efficient furniture style features, we construct a novel dataset of furniture styles that contains 16 common style categories and implement three strategies with respect to two categories of classification, that is, handcrafted classification and learning-based classification. First, we follow the typical image classification pipeline to extract the handcrafted features and train the classifier by support vector machine. Then we use the convolutional neural network to extract learning-based features from training images. To obtain comprehensive furniture style features, we finally combine the handcrafted image classification pipeline and the learning-based network. We experimentally evaluate the performances of handcrafted features and learning-based features of each strategy, and the results show the superiority of learning-based features and also the comprehensiveness of handcrafted features.
Collapse
Affiliation(s)
- Zhenhen Hu
- National University of Singapore, Nanyang Avenue, Singapore
| | - Yonggang Wen
- National University of Singapore, Nanyang Avenue, Singapore
| | - Luoqi Liu
- National University of Singapore, Singapore
| | | | | | - Meng Wang
- Hefei University of Technology, Hefei, China
| | | |
Collapse
|
29
|
|
30
|
Ren R, Hung T, Tan KC. Automatic Microstructure Defect Detection of Ti-6Al-4V Titanium Alloy by Regions-Based Graph. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2017. [DOI: 10.1109/tetci.2017.2669523] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
31
|
Multiresolution LDBP descriptors for texture classification using anisotropic diffusion with an application to wood texture analysis. Pattern Recognit Lett 2017. [DOI: 10.1016/j.patrec.2017.01.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
32
|
Javed M, Nagabhushan P, Chaudhuri BB. A review on document image analysis techniques directly in the compressed domain. Artif Intell Rev 2017. [DOI: 10.1007/s10462-017-9551-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
33
|
Liu H, Li X, Zhang S. Learning Instance Correlation Functions for Multilabel Classification. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:499-510. [PMID: 26887023 DOI: 10.1109/tcyb.2016.2519683] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Multilabel learning has a wide range of potential applications in reality. It attracts a great deal of attention during the past years and has been extensively studied in many fields including image annotation and text categorization. Although many efforts have been made for multilabel learning, there are two challenging issues remaining, i.e., how to exploit the correlations and how to tackle the high-dimensional problems of multilabel data. In this paper, an effective algorithm is developed for multilabel classification with utilizing those data that are relevant to the targets. The key is the construction of a coefficient-based mapping between training and test instances, where the mapping relationship exploits the correlations among the instances, rather than the explicit relationship between the variables and the class labels of data. Further, a constraint, ℓ¹-norm penalty, is performed on the mapping relationship to make the model sparse, weakening the impacts of noisy data. Our empirical study on eight public datasets shows that the proposed method is more effective in comparing with the state-of-the-art multilabel classifiers.
Collapse
|
34
|
Exploring Deep Learning and Transfer Learning for Colonic Polyp Classification. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2016; 2016:6584725. [PMID: 27847543 PMCID: PMC5101370 DOI: 10.1155/2016/6584725] [Citation(s) in RCA: 99] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 10/04/2016] [Indexed: 12/26/2022]
Abstract
Recently, Deep Learning, especially through Convolutional Neural Networks (CNNs) has been widely used to enable the extraction of highly representative features. This is done among the network layers by filtering, selecting, and using these features in the last fully connected layers for pattern classification. However, CNN training for automated endoscopic image classification still provides a challenge due to the lack of large and publicly available annotated databases. In this work we explore Deep Learning for the automated classification of colonic polyps using different configurations for training CNNs from scratch (or full training) and distinct architectures of pretrained CNNs tested on 8-HD-endoscopic image databases acquired using different modalities. We compare our results with some commonly used features for colonic polyp classification and the good results suggest that features learned by CNNs trained from scratch and the “off-the-shelf” CNNs features can be highly relevant for automated classification of colonic polyps. Moreover, we also show that the combination of classical features and “off-the-shelf” CNNs features can be a good approach to further improve the results.
Collapse
|
35
|
Wimmer G, Tamaki T, Tischendorf JJW, Häfner M, Yoshida S, Tanaka S, Uhl A. Directional wavelet based features for colonic polyp classification. Med Image Anal 2016; 31:16-36. [PMID: 26948110 DOI: 10.1016/j.media.2016.02.001] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Revised: 02/08/2016] [Accepted: 02/09/2016] [Indexed: 01/27/2023]
Abstract
In this work, various wavelet based methods like the discrete wavelet transform, the dual-tree complex wavelet transform, the Gabor wavelet transform, curvelets, contourlets and shearlets are applied for the automated classification of colonic polyps. The methods are tested on 8 HD-endoscopic image databases, where each database is acquired using different imaging modalities (Pentax's i-Scan technology combined with or without staining the mucosa), 2 NBI high-magnification databases and one database with chromoscopy high-magnification images. To evaluate the suitability of the wavelet based methods with respect to the classification of colonic polyps, the classification performances of 3 wavelet transforms and the more recent curvelets, contourlets and shearlets are compared using a common framework. Wavelet transforms were already often and successfully applied to the classification of colonic polyps, whereas curvelets, contourlets and shearlets have not been used for this purpose so far. We apply different feature extraction techniques to extract the information of the subbands of the wavelet based methods. Most of the in total 25 approaches were already published in different texture classification contexts. Thus, the aim is also to assess and compare their classification performance using a common framework. Three of the 25 approaches are novel. These three approaches extract Weibull features from the subbands of curvelets, contourlets and shearlets. Additionally, 5 state-of-the-art non wavelet based methods are applied to our databases so that we can compare their results with those of the wavelet based methods. It turned out that extracting Weibull distribution parameters from the subband coefficients generally leads to high classification results, especially for the dual-tree complex wavelet transform, the Gabor wavelet transform and the Shearlet transform. These three wavelet based transforms in combination with Weibull features even outperform the state-of-the-art methods on most of the databases. We will also show that the Weibull distribution is better suited to model the subband coefficient distribution than other commonly used probability distributions like the Gaussian distribution and the generalized Gaussian distribution. So this work gives a reasonable summary of wavelet based methods for colonic polyp classification and the huge amount of endoscopic polyp databases used for our experiments assures a high significance of the achieved results.
Collapse
Affiliation(s)
- Georg Wimmer
- University of Salzburg, Department of Computer Sciences, Jakob Haringerstrasse 2, 5020 Salzburg, Austria.
| | - Toru Tamaki
- Hiroshima University, Department of Information Engineering, Graduate School of Engineering, 1-4-1 Kagamiyama, Higashi-hiroshima, Hiroshima 739-8527, Japan
| | - J J W Tischendorf
- Medical Department III (Gastroenterology, Hepatology and Metabolic Diseases), RWTH Aachen University Hospital, Paulwelsstr. 30, 52072 Aachen, Germany
| | - Michael Häfner
- St. Elisabeth Hospital, Landstraßer Hauptstraße 4a, A-1030 Vienna, Austria
| | - Shigeto Yoshida
- Hiroshima University Hospital, Department of Endoscopy, 1-2-3 Kasumi, Minami-ku, Hiroshima 734-8551, Japan
| | - Shinji Tanaka
- Hiroshima University Hospital, Department of Endoscopy, 1-2-3 Kasumi, Minami-ku, Hiroshima 734-8551, Japan
| | - Andreas Uhl
- University of Salzburg, Department of Computer Sciences, Jakob Haringerstrasse 2, 5020 Salzburg, Austria.
| |
Collapse
|
36
|
Stephens KD, Hoffman DD. On Visual Texture Preference: Can an Ecological Model Explain Why People Like Some Textures More Than Others? Perception 2016; 45:527-551. [DOI: 10.1177/0301006616629026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
What visual textures do people like and why? Here, we test whether the ecological valence theory proposed for color preferences can also predict people’s preferences for visual texture. According to the theory, people should like visual textures associated with positive objects or entities and dislike visual textures associated with negative objects or entities. We compare the results for the ecological model with a more traditional texture-preference model based on computational features and find that the ecological model performs reasonably well considering its lower complexity, explaining 63% of the variance in the human preference data.
Collapse
|