1
|
Choudhary V, Tanwar S, Choudhury T, Kotecha K. Towards secure IoT networks: A comprehensive study of metaheuristic algorithms in conjunction with CNN using a self-generated dataset. MethodsX 2024; 12:102747. [PMID: 38774685 PMCID: PMC11107349 DOI: 10.1016/j.mex.2024.102747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 05/03/2024] [Indexed: 05/24/2024] Open
Abstract
The Internet of Things (IoT) has radically reformed various sectors and industries, enabling unprecedented levels of connectivity and automation. However, the surge in the number of IoT devices has also widened the attack surface, rendering IoT networks potentially susceptible to a plethora of security risks. Addressing the critical challenge of enhancing security in IoT networks is of utmost importance. Moreover, there is a considerable lack of datasets designed exclusively for IoT applications. To bridge this gap, a customized dataset that accurately mimics real-world IoT scenarios impacted by four different types of attacks-blackhole, sinkhole, flooding, and version number attacks was generated using the Contiki-OS Cooja Simulator in this study. The resulting dataset is then consequently employed to evaluate the efficacy of several metaheuristic algorithms, in conjunction with Convolutional Neural Network (CNN) for IoT networks. •The proposed study's goal is to identify optimal hyperparameters for CNNs, ensuring their peak performance in intrusion detection tasks.•This study not only intensifies our comprehension of IoT network security but also provides practical guidance for implementation of the robust security measures in real-world IoT applications.
Collapse
Affiliation(s)
- Vandana Choudhary
- Amity Institute of Information Technology, Amity University, Noida 201313, India
| | - Sarvesh Tanwar
- Amity Institute of Information Technology, Amity University, Noida 201313, India
| | - Tanupriya Choudhury
- Research Professor, CSE Department, Graphic Era Deemed to be University, Dehradun, Uttarakhand 248002, India
- Adjunct Professor, CSE Department, Symbiosis Institute of Technology, Symbiosis International (Deemed University) (SIU), Lavale Campus, Pune, Maharashtra 412115, India
| | - Ketan Kotecha
- Symbiosis Centre for Applied Artificial Intelligence (SCAAI), Symbiosis International (Deemed University) (SIU), Symbiosis Institute of Technology, Lavale Campus, Pune 412115, India
| |
Collapse
|
2
|
Wang D, Zhai L, Fang J, Li Y, Xu Z. psoResNet: An improved PSO-based residual network search algorithm. Neural Netw 2024; 172:106104. [PMID: 38219681 DOI: 10.1016/j.neunet.2024.106104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 10/26/2023] [Accepted: 01/04/2024] [Indexed: 01/16/2024]
Abstract
Neural Architecture Search (NAS) methods are widely employed to address the time-consuming and costly challenges associated with manual operation and design of deep convolutional neural networks (DCNNs). Nonetheless, prevailing methods still encounter several pressing obstacles, including limited network architecture design, excessively lengthy search periods, and insufficient utilization of the search space. In light of these concerns, this study proposes an optimization strategy for residual networks that leverages an enhanced Particle swarm optimization algorithm. Primarily, low-complexity residual architecture block is employed as the foundational unit for architecture exploration, facilitating a more diverse investigation into network architectures while minimizing parameters. Additionally, we employ a depth initialization strategy to confine the search space within a reasonable range, thereby mitigating unnecessary particle exploration. Lastly, we present a novel approach for computing particle differences and updating velocity mechanisms to enhance the exploration of updated trajectories. This method significantly contributes to the improved utilization of the search space and the augmentation of particle diversity. Moreover, we constructed a crime-dataset comprising 13 classes to assess the effectiveness of the proposed algorithm. Experimental results demonstrate that our algorithm can design lightweight networks with superior classification performance on both benchmark datasets and the crime-dataset.
Collapse
Affiliation(s)
- Dianwei Wang
- School of Telecommunication and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an 710121, PR China.
| | - Leilei Zhai
- School of Telecommunication and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an 710121, PR China
| | - Jie Fang
- School of Telecommunication and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an 710121, PR China
| | - Yuanqing Li
- School of Telecommunication and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an 710121, PR China
| | - Zhijie Xu
- School of Computing and Engineering, University of Huddersfield, Huddersfield HD1 3DH, UK
| |
Collapse
|
3
|
Zhang C, Xue Y, Neri F, Cai X, Slowik A. Multi-Objective Self-Adaptive Particle Swarm Optimization for Large-Scale Feature Selection in Classification. Int J Neural Syst 2024; 34:2450014. [PMID: 38352979 DOI: 10.1142/s012906572450014x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Feature selection (FS) is recognized for its role in enhancing the performance of learning algorithms, especially for high-dimensional datasets. In recent times, FS has been framed as a multi-objective optimization problem, leading to the application of various multi-objective evolutionary algorithms (MOEAs) to address it. However, the solution space expands exponentially with the dataset's dimensionality. Simultaneously, the extensive search space often results in numerous local optimal solutions due to a large proportion of unrelated and redundant features [H. Adeli and H. S. Park, Fully automated design of super-high-rise building structures by a hybrid ai model on a massively parallel machine, AI Mag. 17 (1996) 87-93]. Consequently, existing MOEAs struggle with local optima stagnation, particularly in large-scale multi-objective FS problems (LSMOFSPs). Different LSMOFSPs generally exhibit unique characteristics, yet most existing MOEAs rely on a single candidate solution generation strategy (CSGS), which may be less efficient for diverse LSMOFSPs [H. S. Park and H. Adeli, Distributed neural dynamics algorithms for optimization of large steel structures, J. Struct. Eng. ASCE 123 (1997) 880-888; M. Aldwaik and H. Adeli, Advances in optimization of highrise building structures, Struct. Multidiscip. Optim. 50 (2014) 899-919; E. G. González, J. R. Villar, Q. Tan, J. Sedano and C. Chira, An efficient multi-robot path planning solution using a* and coevolutionary algorithms, Integr. Comput. Aided Eng. 30 (2022) 41-52]. Moreover, selecting an appropriate MOEA and determining its corresponding parameter values for a specified LSMOFSP is time-consuming. To address these challenges, a multi-objective self-adaptive particle swarm optimization (MOSaPSO) algorithm is proposed, combined with a rapid nondominated sorting approach. MOSaPSO employs a self-adaptive mechanism, along with five modified efficient CSGSs, to generate new solutions. Experiments were conducted on ten datasets, and the results demonstrate that the number of features is effectively reduced by MOSaPSO while lowering the classification error rate. Furthermore, superior performance is observed in comparison to its counterparts on both the training and test sets, with advantages becoming increasingly evident as the dimensionality increases.
Collapse
Affiliation(s)
- Chenyi Zhang
- School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, P. R. China
| | - Yu Xue
- School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, P. R. China
| | - Ferrante Neri
- NICE Research Group, School of Computer Science and Electronic Engineering, University of Surrey Guildford, GU2 7XS, UK
| | - Xu Cai
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, P. R. China
| | - Adam Slowik
- Department of Electronics and Computer Science, Koszalin University of Technology, Koszalin 75-453, Poland
| |
Collapse
|
4
|
Zhang Y, Jiang H, Tian Y, Ma H, Zhang X. Multigranularity Surrogate Modeling for Evolutionary Multiobjective Optimization With Expensive Constraints. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:2956-2968. [PMID: 37527320 DOI: 10.1109/tnnls.2023.3297624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/03/2023]
Abstract
Multiobjective optimization problems (MOPs) with expensive constraints pose stiff challenges to existing surrogate-assisted evolutionary algorithms (SAEAs) in a very limited computational cost, due to the fact that the number of expensive constraints for an MOP is often large. For existing SAEAs, they always approximate constraint functions in a single granularity, namely, approximating the constraint violation (CV, coarse-grained) or each constraint (fine-grained). However, the landscape of CV is often too complex to be accurately approximated by a surrogate model. Although the modeling of each constraint function may be simpler than that of CV, approximating all the constraint functions independently may result in tremendous cumulative errors and high computational costs. To address this issue, in this article, we develop a multigranularity surrogate modeling framework for evolutionary algorithms (EAs), where the approximation granularity of constraint surrogates is adaptively determined by the position of the population in the fitness landscape. Moreover, a dedicated model management strategy is also developed to reduce the impact resulting from the errors introduced by constraint surrogates and prevent the population from trapping into local optima. To evaluate the performance of the proposed framework, an implementation called K-MGSAEA is proposed, and the experimental results on a large number of test problems show that the proposed framework is superior to seven state-of-the-art competitors.
Collapse
|
5
|
Najaran MHT. A genetic programming-based convolutional deep learning algorithm for identifying COVID-19 cases via X-ray images. Artif Intell Med 2023; 142:102571. [PMID: 37316095 DOI: 10.1016/j.artmed.2023.102571] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 03/07/2023] [Accepted: 04/27/2023] [Indexed: 06/16/2023]
Abstract
Evolutionary algorithms have been successfully employed to find the best structure for many learning algorithms including neural networks. Due to their flexibility and promising results, Convolutional Neural Networks (CNNs) have found their application in many image processing applications. The structure of CNNs greatly affects the performance of these algorithms both in terms of accuracy and computational cost, thus, finding the best architecture for these networks is a crucial task before they are employed. In this paper, we develop a genetic programming approach for the optimization of CNN structure in diagnosing COVID-19 cases via X-ray images. A graph representation for CNN architecture is proposed and evolutionary operators including crossover and mutation are specifically designed for the proposed representation. The proposed architecture of CNNs is defined by two sets of parameters, one is the skeleton which determines the arrangement of the convolutional and pooling operators and their connections and one is the numerical parameters of the operators which determine the properties of these operators like filter size and kernel size. The proposed algorithm in this paper optimizes the skeleton and the numerical parameters of the CNN architectures in a co-evolutionary scheme. The proposed algorithm is used to identify covid-19 cases via X-ray images.
Collapse
|
6
|
Li JY, Zhan ZH, Xu J, Kwong S, Zhang J. Surrogate-Assisted Hybrid-Model Estimation of Distribution Algorithm for Mixed-Variable Hyperparameters Optimization in Convolutional Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:2338-2352. [PMID: 34543206 DOI: 10.1109/tnnls.2021.3106399] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The performance of a convolutional neural network (CNN) heavily depends on its hyperparameters. However, finding a suitable hyperparameters configuration is difficult, challenging, and computationally expensive due to three issues, which are 1) the mixed-variable problem of different types of hyperparameters; 2) the large-scale search space of finding optimal hyperparameters; and 3) the expensive computational cost for evaluating candidate hyperparameters configuration. Therefore, this article focuses on these three issues and proposes a novel estimation of distribution algorithm (EDA) for efficient hyperparameters optimization, with three major contributions in the algorithm design. First, a hybrid-model EDA is proposed to efficiently deal with the mixed-variable difficulty. The proposed algorithm uses a mixed-variable encoding scheme to encode the mixed-variable hyperparameters and adopts an adaptive hybrid-model learning (AHL) strategy to efficiently optimize the mixed-variables. Second, an orthogonal initialization (OI) strategy is proposed to efficiently deal with the challenge of large-scale search space. Third, a surrogate-assisted multi-level evaluation (SME) method is proposed to reduce the expensive computational cost. Based on the above, the proposed algorithm is named s urrogate-assisted hybrid-model EDA (SHEDA). For experimental studies, the proposed SHEDA is verified on widely used classification benchmark problems, and is compared with various state-of-the-art methods. Moreover, a case study on aortic dissection (AD) diagnosis is carried out to evaluate its performance. Experimental results show that the proposed SHEDA is very effective and efficient for hyperparameters optimization, which can find a satisfactory hyperparameters configuration for the CIFAR10, CIFAR100, and AD diagnosis with only 0.58, 0.97, and 1.18 GPU days, respectively.
Collapse
|
7
|
Vlahek D, Mongus D. An Efficient Iterative Approach to Explainable Feature Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:2606-2618. [PMID: 34478388 DOI: 10.1109/tnnls.2021.3107049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
This article introduces a new iterative approach to explainable feature learning. During each iteration, new features are generated, first by applying arithmetic operations on the input set of features. These are then evaluated in terms of probability distribution agreements between values of samples belonging to different classes. Finally, a graph-based approach for feature selection is proposed, which allows for selecting high-quality and uncorrelated features to be used in feature generation during the next iteration. As shown by the results, the proposed method improved the accuracy of all tested classifiers, where the best accuracies were achieved using random forest. In addition, the method turned out to be insensitive to both of the input parameters, while superior performances in comparison to the state of the art were demonstrated on nine out of 15 test sets and achieving comparable results in the others. Finally, we demonstrate the explainability of the learned feature representation for knowledge discovery.
Collapse
|
8
|
Dan Y, Li Z. Particle Swarm Optimization-Based Convolutional Neural Network for Handwritten Chinese Character Recognition. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2023. [DOI: 10.20965/jaciii.2023.p0165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2023]
Abstract
Recently, handwritten Chinese character recognition has become an important research field in computer vision. With the development of deep learning, convolutional neural networks (CNNs) have demonstrated excellent performance in computer vision. However, CNNs are typically designed manually, which requires extensive experience and may lead to redundant computations. To solve these problems, in this study, the particle swarm optimization approach is incorporated into the design of a CNN for handwritten Chinese character recognition, reducing redundant computations in the network. In this approach, each network architecture is represented by a particle, and the optimal network architecture is determined by continuously updating the particles until a global particle is identified. The experimental validation resulted in a network accuracy of 97.24% with only 1.43 million network parameters. Therefore, it is demonstrated that the proposed particle swarm optimization method can quickly and accurately find the optimal network architecture.
Collapse
Affiliation(s)
- Yongping Dan
- School of Electronic and Information, Zhongyuan University of Technology, No.41 Zhongyuan Road, Zhengzhou 450007, China
| | - Zhuo Li
- School of Electronic and Information, Zhongyuan University of Technology, No.41 Zhongyuan Road, Zhengzhou 450007, China
| |
Collapse
|
9
|
Liu Y, Sun Y, Xue B, Zhang M, Yen GG, Tan KC. A Survey on Evolutionary Neural Architecture Search. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:550-570. [PMID: 34357870 DOI: 10.1109/tnnls.2021.3100554] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Deep neural networks (DNNs) have achieved great success in many applications. The architectures of DNNs play a crucial role in their performance, which is usually manually designed with rich expertise. However, such a design process is labor-intensive because of the trial-and-error process and also not easy to realize due to the rare expertise in practice. Neural architecture search (NAS) is a type of technology that can design the architectures automatically. Among different methods to realize NAS, the evolutionary computation (EC) methods have recently gained much attention and success. Unfortunately, there has not yet been a comprehensive summary of the EC-based NAS algorithms. This article reviews over 200 articles of most recent EC-based NAS methods in light of the core components, to systematically discuss their design principles and justifications on the design. Furthermore, current challenges and issues are also discussed to identify future research in this emerging field.
Collapse
|
10
|
Cheng B, Fu H, Li T, Zhang H, Huang J, Peng Y, Chen H, Fan C. Evolutionary computation-based multitask learning network for railway passenger comfort evaluation from EEG signals. Appl Soft Comput 2023. [DOI: 10.1016/j.asoc.2023.110079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
|
11
|
Bhuvaneshwari M, Grace Mary Kanaga E, George ST. Classification of SSVEP-EEG signals using CNN and Red Fox Optimization for BCI applications. Proc Inst Mech Eng H 2023; 237:134-143. [PMID: 36398685 DOI: 10.1177/09544119221135714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Classification of electroencephalography (EEG) signals associated with Steady-state visually evoked potential (SSVEP) is prominent because of its potential in restoring the communication and controlling capability of paralytic people. However, SSVEP signals classification is a challenging task for researchers because of its low signal-to-noise ratio, non-stationary and high dimensional properties. A proficient technique has to be evolved to classify the SSVEP-based EEG data. In recent times, convolutional neural network (CNN) has reached a quantum leap in EEG signal classification. Therefore, the proposed system employs CNN to classify the SSVEP-based EEG signals. Though CNN has proved its proficiency in handling EEG signal classification problems, the calibration of hyperparameters is required to enhance the performance of the model. The calibration of a hyperparameter is a time-consuming task, hence proposed an automated hyperparameter optimization technique using the Red Fox Optimization Algorithm (RFO). The effectiveness of the algorithm is evaluated by comparing it with the performance of Harris Hawk Optimization (HHO), Flower Pollination Algorithm (FPA), Grey Wolf Optimization Algorithm (GWO) and Whale Optimization Algorithm (WOA) based hyperparameter optimized CNN applied to the SSVEP based EEG signals multiclass dataset. The experimental results infer that the proposed algorithm can achieve a testing accuracy of 88.91% which is higher than other comparative algorithms like HHO, FPA, GWO and WOA. The above-mentioned values clearly show that the proposed algorithm achieved competitive performance when compared to the other reported algorithm.
Collapse
Affiliation(s)
- M Bhuvaneshwari
- Department of Computer Science and Engineering, Karunya Institute of Technology and Sciences, Coimbatore, India
| | - E Grace Mary Kanaga
- Department of Biomedical Engineering, Karunya Institute of Technology and Sciences, Coimbatore, India
| | - S Thomas George
- Department of Computer Science and Engineering, Karunya Institute of Technology and Sciences, Coimbatore, India
| |
Collapse
|
12
|
Convolution-layer parameters optimization in Convolutional Neural Networks. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
13
|
Chen H, Chen Y, Wang Q, Chen T, Zhao H. A New SCAE-MT Classification Model for Hyperspectral Remote Sensing Images. SENSORS (BASEL, SWITZERLAND) 2022; 22:8881. [PMID: 36433480 PMCID: PMC9694134 DOI: 10.3390/s22228881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 11/15/2022] [Accepted: 11/15/2022] [Indexed: 06/16/2023]
Abstract
Hyperspectral remote sensing images (HRSI) have the characteristics of foreign objects with the same spectrum. As it is difficult to label samples manually, the hyperspectral remote sensing images are understood to be typical "small sample" datasets. Deep neural networks can effectively extract the deep features from the HRSI, but the classification accuracy mainly depends on the training label samples. Therefore, the stacked convolutional autoencoder network and transfer learning strategy are employed in order to design a new stacked convolutional autoencoder network model transfer (SCAE-MT) for the purposes of classifying the HRSI in this paper. In the proposed classification method, the stacked convolutional au-to-encoding network is employed in order to effectively extract the deep features from the HRSI. Then, the transfer learning strategy is applied to design a stacked convolutional autoencoder network model transfer under the small and limited training samples. The SCAE-MT model is used to propose a new HRSI classification method in order to solve the small samples of the HRSI. In this study, in order to prove the effectiveness of the proposed classification method, two HRSI datasets were chosen. In order to verify the effectiveness of the methods, the overall classification accuracy (OA) of the convolutional self-coding network classification method (CAE), the stack convolutional self-coding network classification method (SCAE), and the SCAE-MT method under 5%, 10%, and 15% training sets are calculated. When compared with the CAE and SCAE models in 5%, 10%, and 15% training datasets, the overall accuracy (OA) of the SCAE-MT method was improved by 2.71%, 3.33%, and 3.07% (on average), respectively. The SCAE-MT method is, thus, clearly superior to the other methods and also shows a good classification performance.
Collapse
Affiliation(s)
- Huayue Chen
- School of Computer Science, China West Normal University, Nanchong 637002, China
| | - Ye Chen
- School of Computer Science, China West Normal University, Nanchong 637002, China
| | - Qiuyue Wang
- School of Computer Science, China West Normal University, Nanchong 637002, China
| | - Tao Chen
- School of Computer Science, China West Normal University, Nanchong 637002, China
| | - Huimin Zhao
- School of Electronic Information and Automation, Civil Aviation University of China, Tianjin 300300, China
- Traction Power State Key Laboratory, Southwest Jiaotong University, Chengdu 610031, China
| |
Collapse
|
14
|
Optimal Design of Convolutional Neural Network Architectures Using Teaching–Learning-Based Optimization for Image Classification. Symmetry (Basel) 2022. [DOI: 10.3390/sym14112323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Convolutional neural networks (CNNs) have exhibited significant performance gains over conventional machine learning techniques in solving various real-life problems in computational intelligence fields, such as image classification. However, most existing CNN architectures were handcrafted from scratch and required significant amounts of problem domain knowledge from designers. A novel deep learning method abbreviated as TLBOCNN is proposed in this paper by leveraging the excellent global search ability of teaching–learning-based optimization (TLBO) to obtain an optimal design of network architecture for a CNN based on the given dataset with symmetrical distribution of each class of data samples. A variable-length encoding scheme is first introduced in TLBOCNN to represent each learner as a potential CNN architecture with different layer parameters. During the teacher phase, a new mainstream architecture computation scheme is designed to compute the mean parameter values of CNN architectures by considering the information encoded into the existing population members with variable lengths. The new mechanisms of determining the differences between two learners with variable lengths and updating their positions are also devised in both the teacher and learner phases to obtain new learners. Extensive simulation studies report that the proposed TLBOCNN achieves symmetrical performance in classifying the majority of MNIST-variant datasets, displays the highest accuracy, and produces CNN models with the lowest complexity levels compared to other state-of-the-art methods due to its promising search ability.
Collapse
|
15
|
Autonomous CNN (AutoCNN): A data-driven approach to network architecture determination. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.05.100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
16
|
Zhang H, Hu X, Ma D, Wang R, Xie X. Insufficient Data Generative Model for Pipeline Network Leak Detection Using Generative Adversarial Networks. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:7107-7120. [PMID: 33296325 DOI: 10.1109/tcyb.2020.3035518] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
In terms of pipeline leak detection, the unavoidable fact is that existing data could not provide enough effective leak data to train a high accuracy model. To address this issue, this article proposes mixed generative adversarial networks (mixed-GANs) as a practical way to provide additional data, ensuring data reliability. First, multitype generative networks with heterogeneous parameter-updating mechanisms are designed to explore a variety of different solutions and eliminate the potential risks of instable training and scenario collapse. Then, based on expert experience, two data constraints are proposed to describe leak characteristics and further evaluate the quality of generated leak data in the training process. Through integrating the particle swarm optimization algorithm into generative model training, mixed-GAN has better generation performance than the conventional gradient descent algorithm. Based on the above-mentioned contents, the proposed model is able to provide satisfactory leak data with different scenarios, contributing to data quantity expansion, data credibility enhancement, and data variety enrichment. Finally, extensive experiments are given to illustrate the effectiveness of the proposed generative model for pipeline network leak detection.
Collapse
|
17
|
Early Diagnosis of Retinal Blood Vessel Damage via Deep Learning-Powered Collective Intelligence Models. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:3571364. [PMID: 35785142 PMCID: PMC9246601 DOI: 10.1155/2022/3571364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 05/30/2022] [Indexed: 11/17/2022]
Abstract
Early diagnosis of retinal diseases such as diabetic retinopathy has had the attention of many researchers. Deep learning through the introduction of convolutional neural networks has become a prominent solution for image-related tasks such as classification and segmentation. Most tasks in image classification are handled by deep CNNs pretrained and evaluated on imagenet dataset. However, these models do not always translate to the best result on other datasets. Devising a neural network manually from scratch based on heuristics may not lead to an optimal model as there are numerous hyperparameters in play. In this paper, we use two nature-inspired swarm algorithms: particle swarm optimization (PSO) and ant colony optimization (ACO) to obtain TDCN models to perform classification of fundus images into severity classes. The power of swarm algorithms is used to search for various combinations of convolutional, pooling, and normalization layers to provide the best model for the task. It is observed that TDCN-PSO outperforms imagenet models and existing literature, while TDCN-ACO achieves faster architecture search. The best TDCN model achieves an accuracy of 90.3%, AUC ROC of 0.956, and a Cohen's kappa score of 0.967. The results were compared with the previous studies to show that the proposed TDCN models exhibit superior performance.
Collapse
|
18
|
Vaiyapuri T, Dutta AK, Punithavathi ISH, Duraipandy P, Alotaibi SS, Alsolai H, Mohamed A, Mahgoub H. Intelligent Deep-Learning-Enabled Decision-Making Medical System for Pancreatic Tumor Classification on CT Images. Healthcare (Basel) 2022; 10:healthcare10040677. [PMID: 35455854 PMCID: PMC9027672 DOI: 10.3390/healthcare10040677] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 03/26/2022] [Accepted: 03/28/2022] [Indexed: 12/13/2022] Open
Abstract
Decision-making medical systems (DMS) refer to the design of decision techniques in the healthcare sector. They involve a procedure of employing ideas and decisions related to certain processes such as data acquisition, processing, judgment, and conclusion. Pancreatic cancer is a lethal type of cancer, and its prediction is ineffective with current techniques. Automated detection and classification of pancreatic tumors can be provided by the computer-aided diagnosis (CAD) model using radiological images such as computed tomography (CT) and magnetic resonance imaging (MRI). The recently developed machine learning (ML) and deep learning (DL) models can be utilized for the automated and timely detection of pancreatic cancer. In light of this, this article introduces an intelligent deep-learning-enabled decision-making medical system for pancreatic tumor classification (IDLDMS-PTC) using CT images. The major intention of the IDLDMS-PTC technique is to examine the CT images for the existence of pancreatic tumors. The IDLDMS-PTC model derives an emperor penguin optimizer (EPO) with multilevel thresholding (EPO-MLT) technique for pancreatic tumor segmentation. Additionally, the MobileNet model is applied as a feature extractor with optimal auto encoder (AE) for pancreatic tumor classification. In order to optimally adjust the weight and bias values of the AE technique, the multileader optimization (MLO) technique is utilized. The design of the EPO algorithm for optimal threshold selection and the MLO algorithm for parameter tuning shows the novelty. A wide range of simulations was executed on benchmark datasets, and the outcomes reported the promising performance of the IDLDMS-PTC model on the existing methods.
Collapse
Affiliation(s)
- Thavavel Vaiyapuri
- Department of Computer Sciences, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia;
| | - Ashit Kumar Dutta
- Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh 13713, Saudi Arabia;
| | - I. S. Hephzi Punithavathi
- Department of Computer Science and Engineering, Sphoorthy Engineering College, Telangana, Hyderabad 501510, India;
| | - P. Duraipandy
- Department of Electrical and Electronics Engineering, J. B. Institute of Engineering and Technology, Telangana, Hyderabad 500075, India;
| | - Saud S. Alotaibi
- Department of Information Systems, College of Computing and Information System, Umm Al-Qura University, Mecca 21911, Saudi Arabia;
| | - Hadeel Alsolai
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia;
| | - Abdullah Mohamed
- Research Centre, Future University in Egypt, New Cairo, Cairo 11745, Egypt;
| | - Hany Mahgoub
- Department of Computer Science, College of Science & Art at Mahayil, King Khalid University, Abha 61421, Saudi Arabia
- Correspondence:
| |
Collapse
|
19
|
|
20
|
Lai J, Wang X, Xiang Q, Li R, Song Y. FVAE: a regularized variational autoencoder using the Fisher criterion. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03422-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
21
|
Optimization of deep learning based segmentation method. Soft comput 2022. [DOI: 10.1007/s00500-021-06711-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
22
|
Fernandes FE, Yen GG. Automatic Searching and Pruning of Deep Neural Networks for Medical Imaging Diagnostic. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:5664-5674. [PMID: 33048758 DOI: 10.1109/tnnls.2020.3027308] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The field of medical imaging diagnostic makes use of a modality of imaging tests, e.g., X-rays, ultrasounds, computed tomographies, and magnetic resonance imaging, to assist physicians with the diagnostic of patients' illnesses. Due to their state-of-the-art results in many challenging image classification tasks, deep neural networks (DNNs) are suitable tools for use by physicians to provide diagnostic support when dealing with medical images. To further advance the field, the present work proposes a two-phase algorithm capable of automatically generating compact DNN architectures given a database, called here DNNDeepeningPruning. In the first phase, also called the deepening phase, the algorithm grows a DNN by adding blocks of residual layers one after another until the model overfits the given data. In the second phase, called the pruning phase, the algorithm prunes the created DNN model from the first phase to produce a DNN with a small amount of floating-point operations guided by some preference given by the user. The proposed algorithm unifies the two separate fields of DNN architecture searching and pruning under a single framework, and it is tested in two medical imaging data sets with satisfactory results.
Collapse
|
23
|
Liu J, Jin Y. Multi-objective search of robust neural architectures against multiple types of adversarial attacks. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.04.111] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
24
|
Xue Y, Jiang P, Neri F, Liang J. A Multi-Objective Evolutionary Approach Based on Graph-in-Graph for Neural Architecture Search of Convolutional Neural Networks. Int J Neural Syst 2021; 31:2150035. [PMID: 34304718 DOI: 10.1142/s0129065721500350] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the development of deep learning, the design of an appropriate network structure becomes fundamental. In recent years, the successful practice of Neural Architecture Search (NAS) has indicated that an automated design of the network structure can efficiently replace the design performed by human experts. Most NAS algorithms make the assumption that the overall structure of the network is linear and focus solely on accuracy to assess the performance of candidate networks. This paper introduces a novel NAS algorithm based on a multi-objective modeling of the network design problem to design accurate Convolutional Neural Networks (CNNs) with a small structure. The proposed algorithm makes use of a graph-based representation of the solutions which enables a high flexibility in the automatic design. Furthermore, the proposed algorithm includes novel ad-hoc crossover and mutation operators. We also propose a mechanism to accelerate the evaluation of the candidate solutions. Experimental results demonstrate that the proposed NAS approach can design accurate neural networks with limited size.
Collapse
Affiliation(s)
- Yu Xue
- School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, P. R. China.,Engineering Research Center of Digital Forensics, Ministry of Education, Nanjing University of Information Science and Technology, Nanjing, P. R. China
| | - Pengcheng Jiang
- School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, P. R. China
| | - Ferrante Neri
- COL Laboratory, School of Computer Science, University of Nottingham, Nottingham, UK
| | - Jiayu Liang
- Tianjin Key Laboratory of Autonomous Intelligent Technology and System, Tiangong University, Tianjin, P. R. China
| |
Collapse
|
25
|
Al-Andoli M, Tan SC, Cheah WP. Parallel stacked autoencoder with particle swarm optimization for community detection in complex networks. APPL INTELL 2021. [DOI: 10.1007/s10489-021-02589-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
26
|
Optimization of Convolutional Neural Networks Architectures Using PSO for Sign Language Recognition. AXIOMS 2021. [DOI: 10.3390/axioms10030139] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
This paper presents an approach to design convolutional neural network architectures, using the particle swarm optimization algorithm. The adjustment of the hyper-parameters and finding the optimal network architecture of convolutional neural networks represents an important challenge. Network performance and achieving efficient learning models for a particular problem depends on setting hyper-parameter values and this implies exploring a huge and complex search space. The use of heuristic-based searches supports these types of problems; therefore, the main contribution of this research work is to apply the PSO algorithm to find the optimal parameters of the convolutional neural networks which include the number of convolutional layers, the filter size used in the convolutional process, the number of convolutional filters, and the batch size. This work describes two optimization approaches; the first, the parameters obtained by PSO are kept under the same conditions in each convolutional layer, and the objective function evaluated by PSO is given by the classification rate; in the second, the PSO generates different parameters per layer, and the objective function is composed of the recognition rate in conjunction with the Akaike information criterion, the latter helps to find the best network performance but with the minimum parameters. The optimized architectures are implemented in three study cases of sign language databases, in which are included the Mexican Sign Language alphabet, the American Sign Language MNIST, and the American Sign Language alphabet. According to the results, the proposed methodologies achieved favorable results with a recognition rate higher than 99%, showing competitive results compared to other state-of-the-art approaches.
Collapse
|
27
|
Song W, Li W, Hua Z, Zhu F. A new deep auto-encoder using multiscale reconstruction errors and weight update correlation. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.01.064] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
28
|
Li X, Zhang S, Wong KC. Deep embedded clustering with multiple objectives on scRNA-seq data. Brief Bioinform 2021; 22:6209682. [PMID: 33822877 DOI: 10.1093/bib/bbab090] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 02/17/2021] [Accepted: 02/25/2021] [Indexed: 12/19/2022] Open
Abstract
In recent years, single-cell RNA sequencing (scRNA-seq) technologies have been widely adopted to interrogate gene expression of individual cells; it brings opportunities to understand the underlying processes in a high-throughput manner. Deep embedded clustering (DEC) was demonstrated successful in high-dimensional sparse scRNA-seq data by joint feature learning and cluster assignment for identifying cell types simultaneously. However, the deep network architecture for embedding clustering is not trivial to optimize. Therefore, we propose an evolutionary multiobjective DEC by synergizing the multiobjective evolutionary optimization to simultaneously evolve the hyperparameters and architectures of DEC in an automatic manner. Firstly, a denoising autoencoder is integrated into the DEC to project the high-dimensional sparse scRNA-seq data into a low-dimensional space. After that, to guide the evolution, three objective functions are formulated to balance the model's generality and clustering performance for robustness. Meanwhile, migration and mutation operators are proposed to optimize the objective functions to select the suitable hyperparameters and architectures of DEC in the multiobjective framework. Multiple comparison analyses are conducted on twenty synthetic data and eight real data from different representative single-cell sequencing platforms to validate the effectiveness. The experimental results reveal that the proposed algorithm outperforms other state-of-the-art clustering methods under different metrics. Meanwhile, marker genes identification, gene ontology enrichment and pathology analysis are conducted to reveal novel insights into the cell type identification and characterization mechanisms.
Collapse
Affiliation(s)
- Xiangtao Li
- School of Artificial Intelligence Jilin University, Jilin, China
| | - Shixiong Zhang
- Department of Computer science City University of Hong Kong, Hong Kong SAR
| | - Ka-Chun Wong
- Department of Computer science City University of Hong Kong, Hong Kong SAR
| |
Collapse
|
29
|
|
30
|
Soniya, Paul S, Singh L. Application and Need-Based Architecture Design of Deep Neural Networks. INT J PATTERN RECOGN 2020. [DOI: 10.1142/s021800142052014x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This paper applies a hybrid evolutionary approach to a convolutional neural network (CNN) and determines the number of layers and filters based on the application and user need. It integrates compact genetic algorithm with stochastic gradient descent (SGD) for simultaneously evolving structure and parameters of the CNN. It defines an effectual string representation for combining structure and parameters of the CNN. The compact genetic algorithm helps in the evolution of network structure by optimizing the number of convolutional layers and number of filters in each convolutional layer. At the same time, an optimal set of weight parameters of the network is obtained using the SGD law. This approach amalgamates exploration in network space by compact genetic algorithm and exploitation in weight space with SGD in an effective manner. The proposed approach also incorporates user-defined parameters in the cost function in an elegant manner which controls the network structure and hence the performance of the network based on the users need. The effectiveness of the proposed approach has been demonstrated on four benchmark datasets, namely MNIST, COIL-100, CIFAR-10 and CIFAR-100. The obtained results clearly demonstrate the potential of the proposed approach by evolving architectures based on the nature of the application and the need of the user.
Collapse
Affiliation(s)
- Soniya
- Dayalbagh Educational Institute, Dayalbagh, Agra, India
| | - Sandeep Paul
- Dayalbagh Educational Institute, Dayalbagh, Agra, India
| | - Lotika Singh
- Dayalbagh Educational Institute, Dayalbagh, Agra, India
| |
Collapse
|
31
|
Lan K, Liu L, Li T, Chen Y, Fong S, Marques JAL, Wong RK, Tang R. Multi-view convolutional neural network with leader and long-tail particle swarm optimizer for enhancing heart disease and breast cancer detection. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-04769-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
32
|
An Improved LightGBM Algorithm for Online Fault Detection of Wind Turbine Gearboxes. ENERGIES 2020. [DOI: 10.3390/en13040807] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
It is widely accepted that conventional boost algorithms are of low efficiency and accuracy in dealing with big data collected from wind turbine operations. To address this issue, this paper is devoted to the application of an adaptive LightGBM method for wind turbine fault detections. To this end, the realization of feature selection for fault detection is firstly achieved by utilizing the maximum information coefficient to analyze the correlation among features in supervisory control and data acquisition (SCADA) of wind turbines. After that, a performance evaluation criterion is proposed for the improved LightGBM model to support fault detections. In this scheme, by embedding the confusion matrix as a performance indicator, an improved LightGBM fault detection approach is then developed. Based on the adaptive LightGBM fault detection model, a fault detection strategy for wind turbine gearboxes is investigated. To demonstrate the applications of the proposed algorithms and methods, a case study with a three-year SCADA dataset obtained from a wind farm sited in Southern China is conducted. Results indicate that the proposed approaches established a fault detection framework of wind turbine systems with either lower false alarm rate or lower missing detection rate.
Collapse
|
33
|
Darwish A, Hassanien AE, Das S. A survey of swarm and evolutionary computing approaches for deep learning. Artif Intell Rev 2019. [DOI: 10.1007/s10462-019-09719-2] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|