1
|
Nie H, Pang H, Ma M, Zheng R. A Lightweight Remote Sensing Small Target Image Detection Algorithm Based on Improved YOLOv8. SENSORS (BASEL, SWITZERLAND) 2024; 24:2952. [PMID: 38733059 PMCID: PMC11086322 DOI: 10.3390/s24092952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 04/30/2024] [Accepted: 05/04/2024] [Indexed: 05/13/2024]
Abstract
In response to the challenges posed by small objects in remote sensing images, such as low resolution, complex backgrounds, and severe occlusions, this paper proposes a lightweight improved model based on YOLOv8n. During the detection of small objects, the feature fusion part of the YOLOv8n algorithm retrieves relatively fewer features of small objects from the backbone network compared to large objects, resulting in low detection accuracy for small objects. To address this issue, firstly, this paper adds a dedicated small object detection layer in the feature fusion network to better integrate the features of small objects into the feature fusion part of the model. Secondly, the SSFF module is introduced to facilitate multi-scale feature fusion, enabling the model to capture more gradient paths and further improve accuracy while reducing model parameters. Finally, the HPANet structure is proposed, replacing the Path Aggregation Network with HPANet. Compared to the original YOLOv8n algorithm, the recognition accuracy of mAP@0.5 on the VisDrone data set and the AI-TOD data set has increased by 14.3% and 17.9%, respectively, while the recognition accuracy of mAP@0.5:0.95 has increased by 17.1% and 19.8%, respectively. The proposed method reduces the parameter count by 33% and the model size by 31.7% compared to the original model. Experimental results demonstrate that the proposed method can quickly and accurately identify small objects in complex backgrounds.
Collapse
Affiliation(s)
| | - Huanli Pang
- School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China; (H.N.); (M.M.); (R.Z.)
| | | | | |
Collapse
|
2
|
Yildirim M, Bingol H, Cengil E, Aslan S, Baykara M. Automatic Classification of Particles in the Urine Sediment Test with the Developed Artificial Intelligence-Based Hybrid Model. Diagnostics (Basel) 2023; 13:diagnostics13071299. [PMID: 37046517 PMCID: PMC10093318 DOI: 10.3390/diagnostics13071299] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 03/23/2023] [Accepted: 03/29/2023] [Indexed: 04/14/2023] Open
Abstract
Urine sediment examination is one of the main tests used in the diagnosis of many diseases. Thanks to this test, many diseases can be detected in advance. Examining the results of this test is an intensive and time-consuming process. Therefore, it is very important to automatically interpret the urine sediment test results using computer-aided systems. In this study, a data set consisting of eight classes was used. The data set used in the study consists of 8509 particle images obtained by examining the particles in the urine sediment. A hybrid model based on textural and Convolutional Neural Networks (CNN) was developed to classify the images in the related data set. The features obtained using textural-based methods and the features obtained from CNN-based architectures were combined after optimizing using the Minimum Redundancy Maximum Relevance (mRMR) method. In this way, we aimed to extract different features of the same image. This increased the performance of the proposed model. The CNN-based ResNet50 architecture and textural-based Local Binary Pattern (LBP) method were used for feature extraction. Finally, the optimized and combined feature map was classified at different machine learning classifiers. In order to compare the performance of the model proposed in the study, results were also obtained from different CNN architectures. A high accuracy value of 96.0% was obtained in the proposed model.
Collapse
Affiliation(s)
- Muhammed Yildirim
- Department of Computer Engineering, Malatya Turgut Ozal University, Malatya 44200, Turkey
| | - Harun Bingol
- Department of Software Engineering, Malatya Turgut Ozal University, Malatya 44200, Turkey
| | - Emine Cengil
- Department of Computer Engineering, Bitlis Eren University, Bitlis 13100, Turkey
| | - Serpil Aslan
- Department of Software Engineering, Malatya Turgut Ozal University, Malatya 44200, Turkey
| | - Muhammet Baykara
- Department of Software Engineering, Firat University, Elazig 23100, Turkey
| |
Collapse
|
3
|
A Coal Gangue Identification Method Based on HOG Combined with LBP Features and Improved Support Vector Machine. Symmetry (Basel) 2023. [DOI: 10.3390/sym15010202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Identification of coal and gangue is one of the important problems in the coal industry. To improve the accuracy of coal gangue identification in the coal mining process, a coal gangue identification method based on histogram of oriented gradient (HOG) combined with local binary pattern (LBP) features and improved support vector machine (SVM) was proposed. First, according to the actual underground working environment of the mine, a machine vision platform for coal gangue identification was built and the coal gangue image acquisition experiment was carried out. Then, the images of coal and gangue were denoised by median filtering, and the coal and gangue features were extracted by using the HOG combined with LBP feature extraction algorithm, and these features were normalized and principal component analysis (PCA) reduced dimension to remove the correlation and redundancy between the features. Finally, SVM, SVM optimized by genetic algorithm (GA-SVM), SVM optimized by particle swarm optimization (PSO-SVM) algorithm, and SVM optimized by grey wolf optimization (GWO-SVM) algorithm were used as classifiers for identification and classification, respectively. The experimental results show that the GWO-SVM classification model has the highest accuracy, and the average classification accuracies were 96.49% and 94.82% of the training set and test set, respectively, which shows that grey wolf algorithm to optimize support vector machine has a good effect on classification of coal gangue images, which proves the feasibility and accuracy of the proposed method.
Collapse
|
4
|
Ranjeeth Kumar C, Kalaiarasu M. Multiple vehicles tracking and detection using weight high order singular value decomposition dimensionality reduction and double classifiers. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-222795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Controlling and managing city traffic is one of them. In order to use image processing to prevent accidents on the road, vehicle tracking and detection are essential. By following moving objects, surveillance video monitoring and human activity recording are carried out. By taking this into account, a useful technique for image processing that detects automobiles from the image is suggested. For numerous vehicle tracking and detection systems, the ECNN-SVM (Enhanced Convolution Neural Network with Support Vector Machine) has just been introduced. However, the larger dimensional data space and inaccurate edge recognition make this system’s performance difficult. The WHOSVD (Weight High Order Singular Value Decomposition) approach, which reduces the dimension and breaks up the positive and negative training picture samples, is established to improve training speed and visual vehicle recognition. To effectively identify the edges at corners, improved canny edge detection is used for edge detection. Mean Kernel Fuzzy C Means (MKFCM) clustering algorithm-based three-dimensional bounding box estimation is used to identify the vehicle items. By merging the feature value of samples with their class labels, the Speed Factor Based Cuckoo Search Algorithm (SFCSA) is introduced for feature selection. The WHOSVD algorithm was used as the input for the enhanced convolutional neural network (ECNN), which is introduced for low-dimensional space and is used for vehicle detection and tracking. Occlusion problems are resolved and target features are further identified using a machine learning classifier. For common algorithms like CNN+SVM, Support Vector Machine (SVM), and the proposed technique, experimentation is done in regards to the metrics of accuracy, f-measure, precision, and recall for performance evaluation.
Collapse
Affiliation(s)
- C. Ranjeeth Kumar
- Department of Information Technology, Sri Ramakrishna Engineering College, Coimbatore
| | - M. Kalaiarasu
- Department of Information Technology, Sri Ramakrishna Engineering College, Coimbatore
| |
Collapse
|
5
|
Feng Q, Wang S, Wang H, Qin Z, Wang H. Circle Fitting Based Image Segmentation and Multi-Scale Block Local Binary Pattern Based Distinction of Ring Rot and Anthracnose on Apple Fruits. FRONTIERS IN PLANT SCIENCE 2022; 13:884891. [PMID: 35755697 PMCID: PMC9218820 DOI: 10.3389/fpls.2022.884891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Accepted: 05/11/2022] [Indexed: 06/15/2023]
Abstract
Ring rot caused by Botryosphaeria dothidea and anthracnose caused by Colletotrichum gloeosporioides are two important apple fruit diseases. It is critical to conduct timely and accurate distinction and diagnosis of the two diseases for apple disease management and apple quality control. The automatic distinction between the two diseases was investigated based on image processing technology in this study. The acquired disease images were preprocessed via image scaling, color image contrast stretching, and morphological opening and closing reconstruction. Then, two lesion segmentation methods based on circle fitting were proposed and used to conduct lesion segmentation. After comparison with the manual segmentation results obtained via the software Adobe Photoshop CC, Lesion segmentation method 1 was chosen for further disease image processing. The gray images on the nine components in the RGB, HSI, and L*a*b* color spaces of the segmented lesion images were filtered by using multi-scale block local binary pattern operators with the sizes of pixel blocks of 1 × 1, 2 × 2, and 3 × 3, respectively, and the corresponding local binary pattern (LBP) histogram vectors were calculated as the features of the lesion images. Subsequently, support vector machine (SVM) models and random forest models were built based on individual LBP histogram features or different LBP histogram feature combinations for distinguishing the diseases. The optimal SVM model with the distinction accuracies of the training and testing sets equal to 100 and 95.12% and the optimal random forest model with the distinction accuracies of the training and testing sets equal to 100 and 90.24% were achieved. The results indicated that the distinction between the two diseases could be implemented with high accuracy by using the proposed method. In this study, a method based on image processing technology was provided for the distinction of ring rot and anthracnose on apple fruits.
Collapse
Affiliation(s)
- Qin Feng
- College of Plant Protection, China Agricultural University, Beijing, China
| | - Shutong Wang
- College of Plant Protection, Hebei Agricultural University, Baoding, China
| | - He Wang
- Forest Pest Management and Quarantine Station of Beijing, Beijing, China
| | - Zhilin Qin
- College of Plant Protection, China Agricultural University, Beijing, China
| | - Haiguang Wang
- College of Plant Protection, China Agricultural University, Beijing, China
| |
Collapse
|
6
|
Milad A, Yurtkan K. An integrated 3D model based face recognition method using synthesized facial expressions and poses for single image applications. APPLIED NANOSCIENCE 2022. [DOI: 10.1007/s13204-021-02123-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
7
|
Sanchez-Moreno AS, Olivares-Mercado J, Hernandez-Suarez A, Toscano-Medina K, Sanchez-Perez G, Benitez-Garcia G. Efficient Face Recognition System for Operating in Unconstrained Environments. J Imaging 2021; 7:jimaging7090161. [PMID: 34460797 PMCID: PMC8466208 DOI: 10.3390/jimaging7090161] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/14/2021] [Accepted: 08/24/2021] [Indexed: 11/17/2022] Open
Abstract
Facial recognition is fundamental for a wide variety of security systems operating in real-time applications. Recently, several deep neural networks algorithms have been developed to achieve state-of-the-art performance on this task. The present work was conceived due to the need for an efficient and low-cost processing system, so a real-time facial recognition system was proposed using a combination of deep learning algorithms like FaceNet and some traditional classifiers like SVM, KNN, and RF using moderate hardware to operate in an unconstrained environment. Generally, a facial recognition system involves two main tasks: face detection and recognition. The proposed scheme uses the YOLO-Face method for the face detection task which is a high-speed real-time detector based on YOLOv3, while, for the recognition stage, a combination of FaceNet with a supervised learning algorithm, such as the support vector machine (SVM), is proposed for classification. Extensive experiments on unconstrained datasets demonstrate that YOLO-Face provides better performance when the face under an analysis presents partial occlusion and pose variations; besides that, it can detect small faces. The face detector was able to achieve an accuracy of over 89.6% using the Honda/UCSD dataset which runs at 26 FPS with darknet-53 to VGA-resolution images for classification tasks. The experimental results have demonstrated that the FaceNet+SVM model was able to achieve an accuracy of 99.7% using the LFW dataset. On the same dataset, FaceNet+KNN and FaceNet+RF achieve 99.5% and 85.1%, respectively; on the other hand, the FaceNet was able to achieve 99.6%. Finally, the proposed system provides a recognition accuracy of 99.1% and 49 ms runtime when both the face detection and classifications stages operate together.
Collapse
Affiliation(s)
- Alejandra Sarahi Sanchez-Moreno
- Sección de Estudios de Posgrado e Investigación, Instituto Politécnico Nacional, Av. Santa Ana 1000, San Francisco Culhuacan, Mexico City 04440, Mexico; (A.S.S.-M.); (J.O.-M.); (A.H.-S.); (K.T.-M.); (G.S.-P.)
| | - Jesus Olivares-Mercado
- Sección de Estudios de Posgrado e Investigación, Instituto Politécnico Nacional, Av. Santa Ana 1000, San Francisco Culhuacan, Mexico City 04440, Mexico; (A.S.S.-M.); (J.O.-M.); (A.H.-S.); (K.T.-M.); (G.S.-P.)
| | - Aldo Hernandez-Suarez
- Sección de Estudios de Posgrado e Investigación, Instituto Politécnico Nacional, Av. Santa Ana 1000, San Francisco Culhuacan, Mexico City 04440, Mexico; (A.S.S.-M.); (J.O.-M.); (A.H.-S.); (K.T.-M.); (G.S.-P.)
| | - Karina Toscano-Medina
- Sección de Estudios de Posgrado e Investigación, Instituto Politécnico Nacional, Av. Santa Ana 1000, San Francisco Culhuacan, Mexico City 04440, Mexico; (A.S.S.-M.); (J.O.-M.); (A.H.-S.); (K.T.-M.); (G.S.-P.)
| | - Gabriel Sanchez-Perez
- Sección de Estudios de Posgrado e Investigación, Instituto Politécnico Nacional, Av. Santa Ana 1000, San Francisco Culhuacan, Mexico City 04440, Mexico; (A.S.S.-M.); (J.O.-M.); (A.H.-S.); (K.T.-M.); (G.S.-P.)
| | - Gibran Benitez-Garcia
- Graduate School of Informatics and Engineering, The University of Electro-Communications, Chofu-shi 182-8585, Japan
- Correspondence:
| |
Collapse
|
8
|
Giambelluca FL, Cappelletti MA, Osio JR, Giambelluca LA. Novel automatic scorpion-detection and -recognition system based on machine-learning techniques. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abd51d] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
All species of scorpions can inject venom, some of them even with the possibility of killing a human. Therefore, early detection and identification are essential to minimize scorpion stings. In this paper, we propose a novel automatic system for the detection and recognition of scorpions using computer vision and machine learning (ML) approaches. Two complementary image-processing techniques were used for the proposed detection method to accurately and reliably detect the presence of scorpions. The first is based on the fluorescent characteristics of scorpions when exposed to ultraviolet light, and the second on the shape features of the scorpions. Also, three models based on ML algorithms for the image recognition and classification of scorpions are compared. In particular, the three species of scorpions found in La Plata city (Argentina): Bothriurus bonariensis (of no sanitary importance), Tityus trivittatus, and Tityus confluence (both of sanitary importance) have been researched using a local binary-pattern histogram algorithm and deep neural networks with transfer learning (DNNs with TL) and data augmentation (DNNs with TL and DA) approaches. A confusion matrix and a receiver operating characteristic curve were used to evaluate the quality of these models. The results obtained show that the model of DNN with TL and DA is the most efficient at simultaneously differentiating between Tityus and Bothriurus (for health security) and between T. trivittatus and T. confluence (for biological research purposes).
Collapse
|
9
|
Tabejamaat M, Mousavi A, Gavrilova ML. Local Comparative Decimal Pattern for Face Recognition. INT J PATTERN RECOGN 2020. [DOI: 10.1142/s0218001420560066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Rapid growth of social networks has provided an extraordinary medium to share a large volume of photographs online. This calls for designing efficient face recognition techniques that are applicable to images with low resolutions and arbitrary poses. This paper proposes a new pose invariant face recognition method for low resolution images using only a single training sample. A 3D model, reconstructed using Generic Elastic Model (3D GEM) from a frontal view training sample, is used to generate a set of nonfrontal gallery face images. The face region of the nonfrontal query sample is then extracted using the same landmark detection technique as in the 3D GEM algorithm. Afterwards, a novel texture representation technique called Local Comparative Decimal Pattern (LCDP) is proposed to extract features from each of the training and query samples. A set of experimental results on the ORL, Georgia Tech (GT), and LFW face databases demonstrates the efficiency of the proposed method compared to other state-of-the-art approaches.
Collapse
Affiliation(s)
- Mohsen Tabejamaat
- Department of Electrical Engineering, Lorestan University, Khorramabad, Lorestan
| | - Abdolmajid Mousavi
- Department of Electrical Engineering, Lorestan University, Khorramabad, Lorestan
| | - Marina L. Gavrilova
- Department of Computer Science, University of Calgary, Calgary, AB T2N 1N4, Canada
| |
Collapse
|
10
|
Awais M, Ghayvat H, Krishnan Pandarathodiyil A, Nabillah Ghani WM, Ramanathan A, Pandya S, Walter N, Saad MN, Zain RB, Faye I. Healthcare Professional in the Loop (HPIL): Classification of Standard and Oral Cancer-Causing Anomalous Regions of Oral Cavity Using Textural Analysis Technique in Autofluorescence Imaging. SENSORS (BASEL, SWITZERLAND) 2020; 20:E5780. [PMID: 33053886 PMCID: PMC7601168 DOI: 10.3390/s20205780] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 10/05/2020] [Accepted: 10/08/2020] [Indexed: 02/07/2023]
Abstract
Oral mucosal lesions (OML) and oral potentially malignant disorders (OPMDs) have been identified as having the potential to transform into oral squamous cell carcinoma (OSCC). This research focuses on the human-in-the-loop-system named Healthcare Professionals in the Loop (HPIL) to support diagnosis through an advanced machine learning procedure. HPIL is a novel system approach based on the textural pattern of OML and OPMDs (anomalous regions) to differentiate them from standard regions of the oral cavity by using autofluorescence imaging. An innovative method based on pre-processing, e.g., the Deriche-Canny edge detector and circular Hough transform (CHT); a post-processing textural analysis approach using the gray-level co-occurrence matrix (GLCM); and a feature selection algorithm (linear discriminant analysis (LDA)), followed by k-nearest neighbor (KNN) to classify OPMDs and the standard region, is proposed in this paper. The accuracy, sensitivity, and specificity in differentiating between standard and anomalous regions of the oral cavity are 83%, 85%, and 84%, respectively. The performance evaluation was plotted through the receiver operating characteristics of periodontist diagnosis with the HPIL system and without the system. This method of classifying OML and OPMD areas may help the dental specialist to identify anomalous regions for performing their biopsies more efficiently to predict the histological diagnosis of epithelial dysplasia.
Collapse
Affiliation(s)
- Muhammad Awais
- Center for Intelligent Medical Electronics, Department of Electronic Engineering, School of Information Science and Technology, Fudan University, Shanghai 200433, China;
| | - Hemant Ghayvat
- Innovation Division Technical University of Denmark, 2800 Lyngby, Denmark;
| | - Anitha Krishnan Pandarathodiyil
- Oral Diagnostic Sciences, Faculty of Dentistry, SEGi University, Jalan Teknologi, Kota Damansara, Petaling Jaya 47810, Selangor, Malaysia;
| | - Wan Maria Nabillah Ghani
- Oral Cancer Research and Coordinating Centre, Faculty of Dentistry, University of Malaya, Kuala Lumpur 50603, Malaysia; (W.M.N.G.); (A.R.); (R.B.Z.)
| | - Anand Ramanathan
- Oral Cancer Research and Coordinating Centre, Faculty of Dentistry, University of Malaya, Kuala Lumpur 50603, Malaysia; (W.M.N.G.); (A.R.); (R.B.Z.)
- Department of Oral and Maxillofacial Clinical Sciences, Faculty of Dentistry, University of Malaya, Kuala Lumpur 50603, Malaysia
| | - Sharnil Pandya
- Symbiosis Centre for Applied Artificial Intelligence and CSE Dept, Symbiosis International (Deemed) University, Pune 412115, Maharashtra, India;
| | - Nicolas Walter
- Department of Electrical and Electronic Engineering, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, Perak, Malaysia; (N.W.); (M.N.S.)
| | - Mohamad Naufal Saad
- Department of Electrical and Electronic Engineering, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, Perak, Malaysia; (N.W.); (M.N.S.)
| | - Rosnah Binti Zain
- Oral Cancer Research and Coordinating Centre, Faculty of Dentistry, University of Malaya, Kuala Lumpur 50603, Malaysia; (W.M.N.G.); (A.R.); (R.B.Z.)
- MAHSA University, Dean Office, Level 9, Dental Block, Bandar Saujana Putra, Jenjarom 42610, Selangor, Malaysia
| | - Ibrahima Faye
- Department of Fundamental and Applied Sciences, Universiti Teknologi PETRONAS, Bandar Seri Iskandar 32610, Perak, Malaysia
| |
Collapse
|
11
|
Kaplan K, Kaya Y, Kuncan M, Ertunç HM. Brain tumor classification using modified local binary patterns (LBP) feature extraction methods. Med Hypotheses 2020; 139:109696. [PMID: 32234609 DOI: 10.1016/j.mehy.2020.109696] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 03/19/2020] [Accepted: 03/23/2020] [Indexed: 11/16/2022]
Abstract
Automatic classification of brain tumor types is very important for accelerating the treatment process, planning and increasing the patient's survival rate. Today, MR images are used to determine the type of brain tumor. Manual diagnosis of brain tumor type depends on the experience and sensitivity of radiologists. Therefore, researchers have developed many brain tumor classification models to minimize the human factor. In this study, two different feature extraction (nLBP and αLBP) approaches were used to classify the most common brain tumor types; Glioma, Meningioma, and Pituitary brain tumors. nLBP is formed based on the relationship for each pixel around the neighbors. The nLBP method has a d parameter that specifies the distance between consecutive neighbors for comparison. Different patterns are obtained for different d parameter values. The αLBP operator calculates the value of each pixel based on an angle value. The angle values used for calculation are 0, 45, 90 and 135. To test the proposed methods, it was applied to images obtained from the brain tumor database collected from Nanfang Hospital, Guangzhou, China, and Tianjin Medical University General Hospital between the years of 2005 and 2010. The classification process was performed by using K-Nearest Neighbor (Knn) and Artificial Neural Networks (ANN), Random Forest (RF), A1DE, Linear Discriminant Analysis (LDA) classification methods, with the feature matrices obtained with nLBP, αLBP and classical LBP from the images in the data set. The highest success rate in brain tumor classification was 95.56% with the nLBPd = 1 feature extraction method and Knn model.
Collapse
Affiliation(s)
- Kaplan Kaplan
- Kocaeli University, Mechatronics Engineering, 41380, Turkey.
| | - Yılmaz Kaya
- Siirt University, Computer Engineering, 56100, Turkey.
| | - Melih Kuncan
- Siirt University, Electrical and Electronics Engineering, 56100, Turkey.
| | - H Metin Ertunç
- Kocaeli University, Mechatronics Engineering, 41380, Turkey.
| |
Collapse
|
12
|
Face Recognition with Triangular Fuzzy Set-Based Local Cross Patterns in Wavelet Domain. Symmetry (Basel) 2019. [DOI: 10.3390/sym11060787] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
In this study, a new face recognition architecture is proposed using fuzzy-based Discrete Wavelet Transform (DWT) and fuzzy with two novel local graph descriptors. These graph descriptors are called Local Cross Pattern (LCP). The proposed fuzzy wavelet-based face recognition architecture consists of DWT, Triangular fuzzy set transformation, and textural feature extraction with local descriptors and classification phases. Firstly, the LL (Low-Low) sub-band is obtained by applying the 2 Dimensions Discrete Wavelet Transform (2D DWT) to face images. After that, the triangular fuzzy transformation is applied to this band in order to obtain A, B, and C images. The proposed LCP is then applied to the B image. LCP consists of two types of descriptors: Vertical Local Cross Pattern (VLCP) and Horizontal Local Cross Pattern (HLCP). Linear discriminant analysis, quadratic discriminant, analysis, quadratic kernel-based support vector machine (QKSVM), and K-nearest neighbors (KNN) were ultimately used to classify the extracted features. Ten widely used descriptors in the literature are applied to the fuzzy wavelet architecture. AT&T, CIE, Face94, and FERET databases are used for performance evaluation of the proposed methods. Experimental results show that the LCP descriptors have high face recognition performance, and the fuzzy wavelet-based model significantly improves the performances of the textural descriptors-based face recognition methods. Moreover, the proposed fuzzy-based domain and LCP method achieved classification accuracy rates of 97.3%, 100.0%, 100.0%, and 96.3% for AT&T, CIE, Face94, and FERET datasets, respectively.
Collapse
|
13
|
Multiple feature descriptors based model for individual identification in group photos. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2019. [DOI: 10.1016/j.jksuci.2017.02.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
14
|
Moujahid A, Dornaika F. Multi-scale multi-block covariance descriptor with feature selection. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04135-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
15
|
|
16
|
Pae DS, Choi IH, Kang TK, Lim MT. Vehicle detection framework for challenging lighting driving environment based on feature fusion method using adaptive neuro-fuzzy inference system. INT J ADV ROBOT SYST 2018. [DOI: 10.1177/1729881418770545] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Affiliation(s)
- Dong Sung Pae
- School of Electrical Engineering, Korea University, Seoul, Korea
| | - In Hwan Choi
- School of Electrical Engineering, Korea University, Seoul, Korea
| | - Tae Koo Kang
- Department of Human Intelligence and Robot Engineering, Sangmyung University, Cheonan, Korea
| | - Myo Taeg Lim
- School of Electrical Engineering, Korea University, Seoul, Korea
| |
Collapse
|
17
|
Moujahid A, Dornaika F. Feature selection for spatially enhanced LBP: application to face recognition. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2017. [DOI: 10.1007/s41060-017-0083-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
18
|
|
19
|
|
20
|
Person Recognition System Based on a Combination of Body Images from Visible Light and Thermal Cameras. SENSORS 2017; 17:s17030605. [PMID: 28300783 PMCID: PMC5375891 DOI: 10.3390/s17030605] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Revised: 03/03/2017] [Accepted: 03/14/2017] [Indexed: 11/17/2022]
Abstract
The human body contains identity information that can be used for the person recognition (verification/recognition) problem. In this paper, we propose a person recognition method using the information extracted from body images. Our research is novel in the following three ways compared to previous studies. First, we use the images of human body for recognizing individuals. To overcome the limitations of previous studies on body-based person recognition that use only visible light images for recognition, we use human body images captured by two different kinds of camera, including a visible light camera and a thermal camera. The use of two different kinds of body image helps us to reduce the effects of noise, background, and variation in the appearance of a human body. Second, we apply a state-of-the art method, called convolutional neural network (CNN) among various available methods, for image features extraction in order to overcome the limitations of traditional hand-designed image feature extraction methods. Finally, with the extracted image features from body images, the recognition task is performed by measuring the distance between the input and enrolled samples. The experimental results show that the proposed method is efficient for enhancing recognition accuracy compared to systems that use only visible light or thermal images of the human body.
Collapse
|
21
|
|
22
|
Pixel-Level and Robust Vibration Source Sensing in High-Frame-Rate Video Analysis. SENSORS 2016; 16:s16111842. [PMID: 27827860 PMCID: PMC5134501 DOI: 10.3390/s16111842] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2016] [Revised: 10/13/2016] [Accepted: 10/26/2016] [Indexed: 11/16/2022]
Abstract
We investigate the effect of appearance variations on the detectability of vibration feature extraction with pixel-level digital filters for high-frame-rate videos. In particular, we consider robust vibrating object tracking, which is clearly different from conventional appearance-based object tracking with spatial pattern recognition in a high-quality image region of a certain size. For 512 × 512 videos of a rotating fan located at different positions and orientations and captured at 2000 frames per second with different lens settings, we verify how many pixels are extracted as vibrating regions with pixel-level digital filters. The effectiveness of dynamics-based vibration features is demonstrated by examining the robustness against changes in aperture size and the focal condition of the camera lens, the apparent size and orientation of the object being tracked, and its rotational frequency, as well as complexities and movements of background scenes. Tracking experiments for a flying multicopter with rotating propellers are also described to verify the robustness of localization under complex imaging conditions in outside scenarios.
Collapse
|
23
|
Weng L, Dornaika F, Jin Z. Graph construction based on data self-representativeness and Laplacian smoothness. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.05.021] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
24
|
Patil H, Kothari A, Bhurchandi K. Expression invariant face recognition using semidecimated DWT, Patch-LDSMT, feature and score level fusion. APPL INTELL 2015. [DOI: 10.1007/s10489-015-0735-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
25
|
|
26
|
Yang M, Zhu P, Liu F, Shen L. Joint representation and pattern learning for robust face recognition. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.06.013] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
27
|
Luo Y, Guan YP. Enhanced facial texture illumination normalization for face recognition. APPLIED OPTICS 2015; 54:6887-6894. [PMID: 26368106 DOI: 10.1364/ao.54.006887] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
An uncontrolled lighting condition is one of the most critical challenges for practical face recognition applications. An enhanced facial texture illumination normalization method is put forward to resolve this challenge. An adaptive relighting algorithm is developed to improve the brightness uniformity of face images. Facial texture is extracted by using an illumination estimation difference algorithm. An anisotropic histogram-stretching algorithm is proposed to minimize the intraclass distance of facial skin and maximize the dynamic range of facial texture distribution. Compared with the existing methods, the proposed method can more effectively eliminate the redundant information of facial skin and illumination. Extensive experiments show that the proposed method has superior performance in normalizing illumination variation and enhancing facial texture features for illumination-insensitive face recognition.
Collapse
|
28
|
Improved local ternary patterns for automatic target recognition in infrared imagery. SENSORS 2015; 15:6399-418. [PMID: 25785311 PMCID: PMC4435149 DOI: 10.3390/s150306399] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Revised: 12/25/2014] [Accepted: 02/16/2015] [Indexed: 11/17/2022]
Abstract
This paper presents an improved local ternary pattern (LTP) for automatic target recognition (ATR) in infrared imagery. Firstly, a robust LTP (RLTP) scheme is proposed to overcome the limitation of the original LTP for achieving the invariance with respect to the illumination transformation. Then, a soft concave-convex partition (SCCP) is introduced to add some flexibility to the original concave-convex partition (CCP) scheme. Referring to the orthogonal combination of local binary patterns (OC_LBP), the orthogonal combination of LTP (OC_LTP) is adopted to reduce the dimensionality of the LTP histogram. Further, a novel operator, called the soft concave-convex orthogonal combination of robust LTP (SCC_OC_RLTP), is proposed by combing RLTP, SCCP and OC_LTP. Finally, the new operator is used for ATR along with a blocking schedule to improve its discriminability and a feature selection technique to enhance its efficiency. Experimental results on infrared imagery show that the proposed features can achieve competitive ATR results compared with the state-of-the-art methods.
Collapse
|
29
|
Uzair M, Mahmood A, Mian A, McDonald C. Periocular region-based person identification in the visible, infrared and hyperspectral imagery. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.07.049] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|