1
|
Isik H, Tasdemir S, Taspinar YS, Kursun R, Cinar I, Yasar A, Yasin ET, Koklu M. Maize seeds forecasting with hybrid directional and bi-directional long short-term memory models. Food Sci Nutr 2024; 12:786-803. [PMID: 38370035 PMCID: PMC10867492 DOI: 10.1002/fsn3.3783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 10/05/2023] [Accepted: 10/06/2023] [Indexed: 02/20/2024] Open
Abstract
The purity of the seeds is one of the important factors that increase the yield. For this reason, the classification of maize cultivars constitutes a significant problem. Within the scope of this study, six different classification models were designed to solve this problem. A special dataset was created to be used in the models designed for the study. The dataset contains a total of 14,469 images in four classes. Images belong to four different maize types, BT6470, CALIPOS, ES_ARMANDI, and HIVA, taken from the BIOTEK company. AlexNet and ResNet50 architectures, with the transfer learning method, were used in the models created for the image classification. In order to improve the classification success, LSTM (Directional Long Short-Term Memory) and BiLSTM (Bi-directional Long Short-Term Memory) algorithms and AlexNet and ResNet50 architectures were hybridized. As a result of the classifications, the highest classification success was obtained from the ResNet50+BiLSTM model with 98.10%.
Collapse
Affiliation(s)
- Hakan Isik
- Department of Electric‐Electronic EngineeringSelcuk UniversityKonyaTurkey
| | - Sakir Tasdemir
- Department of Computer EngineeringSelcuk UniversityKonyaTurkey
| | | | - Ramazan Kursun
- Guneysinir Vocational SchoolSelcuk UniversityKonyaTurkey
| | - Ilkay Cinar
- Department of Computer EngineeringSelcuk UniversityKonyaTurkey
| | - Ali Yasar
- Department of Mechatronic EngineeringSelcuk UniversityKonyaTurkey
| | | | - Murat Koklu
- Department of Computer EngineeringSelcuk UniversityKonyaTurkey
| |
Collapse
|
2
|
Eman M, Mahmoud TM, Ibrahim MM, Abd El-Hafeez T. Innovative Hybrid Approach for Masked Face Recognition Using Pretrained Mask Detection and Segmentation, Robust PCA, and KNN Classifier. SENSORS (BASEL, SWITZERLAND) 2023; 23:6727. [PMID: 37571511 PMCID: PMC10422420 DOI: 10.3390/s23156727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 07/15/2023] [Accepted: 07/25/2023] [Indexed: 08/13/2023]
Abstract
Face masks are widely used in various industries and jobs, such as healthcare, food service, construction, manufacturing, retail, hospitality, transportation, education, and public safety. Masked face recognition is essential to accurately identify and authenticate individuals wearing masks. Masked face recognition has emerged as a vital technology to address this problem and enable accurate identification and authentication in masked scenarios. In this paper, we propose a novel method that utilizes a combination of deep-learning-based mask detection, landmark and oval face detection, and robust principal component analysis (RPCA) for masked face recognition. Specifically, we use pretrained ssd-MobileNetV2 for detecting the presence and location of masks on a face and employ landmark and oval face detection to identify key facial features. The proposed method also utilizes RPCA to separate occluded and non-occluded components of an image, making it more reliable in identifying faces with masks. To optimize the performance of our proposed method, we use particle swarm optimization (PSO) to optimize both the KNN features and the number of k for KNN. Experimental results demonstrate that our proposed method outperforms existing methods in terms of accuracy and robustness to occlusion. Our proposed method achieves a recognition rate of 97%, which is significantly higher than the state-of-the-art methods. Our proposed method represents a significant improvement over existing methods for masked face recognition, providing high accuracy and robustness to occlusion.
Collapse
Affiliation(s)
- Mohammed Eman
- Computer Science Department, Faculty of Computing and Artificial Intelligence, Beni Suef University, Beni-Suef 62511, Egypt
| | - Tarek M. Mahmoud
- Computer Science Department, Faculty of Science, Minia University, Minia 61519, Egypt
- Computer Science Department, Faculty of Computers and Artificial Intelligence, University of Sadat City, Sadat City 32897, Egypt;
| | - Mostafa M. Ibrahim
- Electrical Engineering Department, Faculty of Engineering, Minia University, Minia 61519, Egypt;
| | - Tarek Abd El-Hafeez
- Computer Science Department, Faculty of Science, Minia University, Minia 61519, Egypt
- Computer Science Unit, Deraya University, Minia 61765, Egypt
| |
Collapse
|
3
|
Zhao S, Wang P, Heidari AA, Zhao X, Chen H. Boosted crow search algorithm for handling multi-threshold image problems with application to X-ray images of COVID-19. EXPERT SYSTEMS WITH APPLICATIONS 2023; 213:119095. [PMID: 36313263 PMCID: PMC9595503 DOI: 10.1016/j.eswa.2022.119095] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 10/11/2022] [Accepted: 10/18/2022] [Indexed: 06/16/2023]
Abstract
COVID-19 is pervasive and threatens the safety of people around the world. Therefore, now, a method is needed to diagnose COVID-19 accurately. The identification of COVID-19 by X-ray images is a common method. The target area is extracted from the X-ray images by image segmentation to improve classification efficiency and help doctors make a diagnosis. In this paper, we propose an improved crow search algorithm (CSA) based on variable neighborhood descent (VND) and information exchange mutation (IEM) strategies, called VMCSA. The original CSA quickly falls into the local optimum, and the possibility of finding the best solution is significantly reduced. Therefore, to help the algorithm avoid falling into local optimality and improve the global search capability of the algorithm, we introduce VND and IEM into CSA. Comparative experiments are conducted at CEC2014 and CEC'21 to demonstrate the better performance of the proposed algorithm in optimization. We also apply the proposed algorithm to multi-level thresholding image segmentation using Renyi's entropy as the objective function to find the optimal threshold, where we construct 2-D histograms with grayscale images and non-local mean images and maximize the Renyi's entropy on top of the 2-D histogram. The proposed segmentation method is evaluated on X-ray images of COVID-19 and compared with some algorithms. VMCSA has a significant advantage in segmentation results and obtains better robustness than other algorithms. The available extra info can be found at https://github.com/1234zsw/VMCSA.
Collapse
Affiliation(s)
- Songwei Zhao
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, Zhejiang 325035, China
| | - Pengjun Wang
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou 325035, China
| | - Ali Asghar Heidari
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, Zhejiang 325035, China
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Xuehua Zhao
- School of Digital Media, Shenzhen Institute of Information Technology, Shenzhen 518172, China
| | - Huiling Chen
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, Zhejiang 325035, China
| |
Collapse
|
4
|
Taspinar YS, Dogan M, Cinar I, Kursun R, Ozkan IA, Koklu M. Computer vision classification of dry beans (Phaseolus vulgaris L.) based on deep transfer learning techniques. Eur Food Res Technol 2022. [DOI: 10.1007/s00217-022-04080-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
5
|
Application of Pre-Trained Deep Convolutional Neural Networks for Coffee Beans Species Detection. FOOD ANAL METHOD 2022. [DOI: 10.1007/s12161-022-02362-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
6
|
Albashish D. Ensemble of adapted convolutional neural networks (CNN) methods for classifying colon histopathological images. PeerJ Comput Sci 2022; 8:e1031. [PMID: 35875641 PMCID: PMC9299234 DOI: 10.7717/peerj-cs.1031] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 06/17/2022] [Indexed: 06/15/2023]
Abstract
Deep convolutional neural networks (CNN) manifest the potential for computer-aided diagnosis systems (CADs) by learning features directly from images rather than using traditional feature extraction methods. Nevertheless, due to the limited sample sizes and heterogeneity in tumor presentation in medical images, CNN models suffer from training issues, including training from scratch, which leads to overfitting. Alternatively, a pre-trained neural network's transfer learning (TL) is used to derive tumor knowledge from medical image datasets using CNN that were designed for non-medical activations, alleviating the need for large datasets. This study proposes two ensemble learning techniques: E-CNN (product rule) and E-CNN (majority voting). These techniques are based on the adaptation of the pretrained CNN models to classify colon cancer histopathology images into various classes. In these ensembles, the individuals are, initially, constructed by adapting pretrained DenseNet121, MobileNetV2, InceptionV3, and VGG16 models. The adaptation of these models is based on a block-wise fine-tuning policy, in which a set of dense and dropout layers of these pretrained models is joined to explore the variation in the histology images. Then, the models' decisions are fused via product rule and majority voting aggregation methods. The proposed model was validated against the standard pretrained models and the most recent works on two publicly available benchmark colon histopathological image datasets: Stoean (357 images) and Kather colorectal histology (5,000 images). The results were 97.20% and 91.28% accurate, respectively. The achieved results outperformed the state-of-the-art studies and confirmed that the proposed E-CNNs could be extended to be used in various medical image applications.
Collapse
Affiliation(s)
- Dheeb Albashish
- Computer Science Department/ Prince Abdullah bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Alsalt, Jordan
| |
Collapse
|
7
|
A cascaded deep-learning-based model for face mask detection. DATA TECHNOLOGIES AND APPLICATIONS 2022. [DOI: 10.1108/dta-02-2022-0076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
PurposeThis work aims to present a deep learning model for face mask detection in surveillance environments such as automatic teller machines (ATMs), banks, etc. to identify persons wearing face masks. In surveillance environments, complete visibility of the face area is a guideline, and criminals and law offenders commit crimes by hiding their faces behind a face mask. The face mask detector model proposed in this work can be used as a tool and integrated with surveillance cameras in autonomous surveillance environments to identify and catch law offenders and criminals.Design/methodology/approachThe proposed face mask detector is developed by integrating the residual network (ResNet)34 feature extractor on top of three You Only Look Once (YOLO) detection layers along with the usage of the spatial pyramid pooling (SPP) layer to extract a rich and dense feature map. Furthermore, at the training time, data augmentation operations such as Mosaic and MixUp have been applied to the feature extraction network so that it can get trained with images of varying complexities. The proposed detector is trained and tested over a custom face mask detection dataset consisting of 52,635 images. For validation, comparisons have been provided with the performance of YOLO v1, v2, tiny YOLO v1, v2, v3 and v4 and other benchmark work present in the literature by evaluating performance metrics such as precision, recall, F1 score, mean average precision (mAP) for the overall dataset and average precision (AP) for each class of the dataset.FindingsThe proposed face mask detector achieved 4.75–9.75 per cent higher detection accuracy in terms of mAP, 5–31 per cent higher AP for detection of faces with masks and, specifically, 2–30 per cent higher AP for detection of face masks on the face region as compared to the tested baseline variants of YOLO. Furthermore, the usage of the ResNet34 feature extractor and SPP layer in the proposed detection model reduced the training time and the detection time. The proposed face mask detection model can perform detection over an image in 0.45 s, which is 0.2–0.15 s lesser than that for other tested YOLO variants, thus making the proposed detection model perform detections at a higher speed.Research limitations/implicationsThe proposed face mask detector model can be utilized as a tool to detect persons with face masks who are a potential threat to the automatic surveillance environments such as ATMs, banks, airport security checks, etc. The other research implication of the proposed work is that it can be trained and tested for other object detection problems such as cancer detection in images, fish species detection, vehicle detection, etc.Practical implicationsThe proposed face mask detector can be integrated with automatic surveillance systems and used as a tool to detect persons with face masks who are potential threats to ATMs, banks, etc. and in the present times of COVID-19 to detect if the people are following a COVID-appropriate behavior of wearing a face mask or not in the public areas.Originality/valueThe novelty of this work lies in the usage of the ResNet34 feature extractor with YOLO detection layers, which makes the proposed model a compact and powerful convolutional neural-network-based face mask detector model. Furthermore, the SPP layer has been applied to the ResNet34 feature extractor to make it able to extract a rich and dense feature map. The other novelty of the present work is the implementation of Mosaic and MixUp data augmentation in the training network that provided the feature extractor with 3× images of varying complexities and orientations and further aided in achieving higher detection accuracy. The proposed model is novel in terms of extracting rich features, performing augmentation at the training time and achieving high detection accuracy while maintaining the detection speed.
Collapse
|
8
|
Classification and Analysis of Pistachio Species with Pre-Trained Deep Learning Models. ELECTRONICS 2022. [DOI: 10.3390/electronics11070981] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Pistachio is a shelled fruit from the anacardiaceae family. The homeland of pistachio is the Middle East. The Kirmizi pistachios and Siirt pistachios are the major types grown and exported in Turkey. Since the prices, tastes, and nutritional values of these types differs, the type of pistachio becomes important when it comes to trade. This study aims to identify these two types of pistachios, which are frequently grown in Turkey, by classifying them via convolutional neural networks. Within the scope of the study, images of Kirmizi and Siirt pistachio types were obtained through the computer vision system. The pre-trained dataset includes a total of 2148 images, 1232 of Kirmizi type and 916 of Siirt type. Three different convolutional neural network models were used to classify these images. Models were trained by using the transfer learning method, with AlexNet and the pre-trained models VGG16 and VGG19. The dataset is divided as 80% training and 20% test. As a result of the performed classifications, the success rates obtained from the AlexNet, VGG16, and VGG19 models are 94.42%, 98.84%, and 98.14%, respectively. Models’ performances were evaluated through sensitivity, specificity, precision, and F-1 score metrics. In addition, ROC curves and AUC values were used in the performance evaluation. The highest classification success was achieved with the VGG16 model. The obtained results reveal that these methods can be used successfully in the determination of pistachio types.
Collapse
|