1
|
Choi K, Choe Y, Park H. Reinforcement Learning May Demystify the Limited Human Motor Learning Efficacy Due to Visual-Proprioceptive Mismatch. Int J Neural Syst 2024; 34:2450037. [PMID: 38655914 DOI: 10.1142/s0129065724500370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Vision and proprioception have fundamental sensory mismatches in delivering locational information, and such mismatches are critical factors limiting the efficacy of motor learning. However, it is still not clear how and to what extent this mismatch limits motor learning outcomes. To further the understanding of the effect of sensory mismatch on motor learning outcomes, a reinforcement learning algorithm and the simplified biomechanical elbow joint model were employed to mimic the motor learning process in a computational environment. By applying a reinforcement learning algorithm to the motor learning of elbow joint flexion task, simulation results successfully explained how visual-proprioceptive mismatch limits motor learning outcomes in terms of motor control accuracy and task completion speed. The larger the perceived angular offset between the two sensory modalities, the lower the motor control accuracy. Also, the more similar the peak reward amplitude of the two sensory modalities, the lower the motor control accuracy. In addition, simulation results suggest that insufficient exploration rate limits task completion speed, and excessive exploration rate limits motor control accuracy. Such a speed-accuracy trade-off shows that a moderate exploration rate could serve as another important factor in motor learning.
Collapse
Affiliation(s)
- Kyungrak Choi
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Yoonsuck Choe
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Hangue Park
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, USA
- Department of Biomedical Engineering, Sungkyunkwan University, Suwon, South Korea
- Department of Intelligent Precision Healthcare Convergence, Sungkyunkwan University, Suwon, South Korea
| |
Collapse
|
2
|
Zhang L, Xu F, Neri F. An Asynchronous Spiking Neural Membrane System for Edge Detection. Int J Neural Syst 2024; 34:2450023. [PMID: 38490956 DOI: 10.1142/s0129065724500230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2024]
Abstract
Spiking neural membrane systems (SN P systems) are a class of bio-inspired models inspired by the activities and connectivity of neurons. Extensive studies have been made on SN P systems with synchronization-based communication, while further efforts are needed for the systems with rhythm-based communication. In this work, we design an asynchronous SN P system with resonant connections where all the enabled neurons in the same group connected by resonant connections should instantly produce spikes with the same rhythm. In the designed system, each of the three modules implements one type of the three operations associated with the edge detection of digital images, and they collaborate each other through the resonant connections. An algorithm called EDSNP for edge detection is proposed to simulate the working of the designed asynchronous SN P system. A quantitative analysis of EDSNP and the related methods for edge detection had been conducted to evaluate the performance of EDSNP. The performance of the EDSNP in processing the testing images is superior to the compared methods, based on the quantitative metrics of accuracy, error rate, mean square error, peak signal-to-noise ratio and true positive rate. The results indicate the potential of the temporal firing and the proper neuronal connections in the SN P system to achieve good performance in edge detection.
Collapse
Affiliation(s)
- Luping Zhang
- Jiangxi Engineering Technology Research Center of Nuclear, Geoscience Data Science and System, Jiangxi Engineering Laboratory on Radioactive Geoscience and Big Data Technology, School of Information Engineering, East China University of Technology, Nanchang 330013, Jiangxi, P. R. China
| | - Fei Xu
- Key Laboratory of Image Information Processing and Intelligent Control of Education Ministry of China, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, P. R. China
| | - Ferrante Neri
- NICE Research Group, School of Computer Science and Electronic Engineering, University of Surrey, Guildford, Surrey GU2 7XH, UK
| |
Collapse
|
3
|
Gill TS, Zaidi SSH, Shirazi MA. Attention-based deep convolutional neural network for classification of generalized and focal epileptic seizures. Epilepsy Behav 2024; 155:109732. [PMID: 38636140 DOI: 10.1016/j.yebeh.2024.109732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 02/03/2024] [Accepted: 02/27/2024] [Indexed: 04/20/2024]
Abstract
Epilepsy affects over 50 million people globally. Electroencephalography is critical for epilepsy diagnosis, but manual seizure classification is time-consuming and requires extensive expertise. This paper presents an automated multi-class seizure classification model using EEG signals from the Temple University Hospital Seizure Corpus ver. 1.5.2. 11 features including time-based correlation, time-based eigenvalues, power spectral density, frequency-based correlation, frequency-based eigenvalues, sample entropy, spectral entropy, logarithmic sum, standard deviation, absolute mean, and ratio of Daubechies D4 wavelet transformed coefficients were extracted from 10-second sliding windows across channels. The model combines multi-head self-attention mechanism with a deep convolutional neural network (CNN) to classify seven subtypes of generalized and focal epileptic seizures. The model achieved 0.921 weighted accuracy and 0.902 weighted F1 score in classifying focal onset non-motor, generalized onset non-motor, simple partial, complex partial, absence, tonic, and tonic-clonic seizures. In comparison, a CNN model without multi-head attention achieved 0.767 weighted accuracy. Ablation studies were conducted to validate the importance of transformer encoders and attention. The promising classification results demonstrate the potential of deep learning for handling EEG complexity and improving epilepsy diagnosis. This seizure classification model could enable timely interventions when translated into clinical practice.
Collapse
Affiliation(s)
- Taimur Shahzad Gill
- Department of Electronics and Power Engineering, National University of Sciences and Technology, Islamabad 44000, Pakistan.
| | - Syed Sajjad Haider Zaidi
- Department of Electronics and Power Engineering, National University of Sciences and Technology, Islamabad 44000, Pakistan.
| | - Muhammad Ayaz Shirazi
- Department of Electronics and Power Engineering, National University of Sciences and Technology, Islamabad 44000, Pakistan.
| |
Collapse
|
4
|
Ermini I, Zandron C. Modular Spiking Neural Membrane Systems for Image Classification. Int J Neural Syst 2024; 34:2450021. [PMID: 38453666 DOI: 10.1142/s0129065724500217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
A variant of membrane computing models called Spiking Neural P systems (SNP systems) closely mimics the structure and behavior of biological neurons. As third-generation neural networks, SNP systems have flexible architectures allowing the design of bio-inspired machine learning algorithms. This paper proposes Modular Spiking Neural P (MSNP) systems to solve image classification problems, a novel SNP system to be applied in scenarios where hundreds or even thousands of different classes are considered. A main issue to face in such situations is related to the structural complexity of the network. MSNP systems devised in this work allow to approach the general classification problem by dividing it in smaller parts, that are then faced by single entities of the network. As a benchmark dataset, the Oxford Flowers 102 dataset is considered, consisting of more than 8000 pictures of flowers belonging to the 102 species commonly found in the UK. These classes sometimes present large variations within them, may be also very similar to one another, and different images of the same subject may differ a lot. The work describes the architecture of the MSNP system, based on modules focusing on a specific class, their training phase, and the evaluation of the model both concerning result accuracy as well as energy consumption. Experimental results on image classification problems show that the model achieves good results, but is strongly connected to image quality, mainly depending on the frequency of images, remarkable changes of pose, images not centered, and subject mostly not shown.
Collapse
Affiliation(s)
- Iris Ermini
- Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, Viale Sarca 336/14 Milano 20126, Italy
| | - Claudio Zandron
- Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, Viale Sarca 336/14 Milano 20126, Italy
| |
Collapse
|
5
|
Avola D, Cinque L, Mambro AD, Fagioli A, Marini MR, Pannone D, Fanini B, Foresti GL. Spatio-Temporal Image-Based Encoded Atlases for EEG Emotion Recognition. Int J Neural Syst 2024; 34:2450024. [PMID: 38533631 DOI: 10.1142/s0129065724500242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2024]
Abstract
Emotion recognition plays an essential role in human-human interaction since it is a key to understanding the emotional states and reactions of human beings when they are subject to events and engagements in everyday life. Moving towards human-computer interaction, the study of emotions becomes fundamental because it is at the basis of the design of advanced systems to support a broad spectrum of application areas, including forensic, rehabilitative, educational, and many others. An effective method for discriminating emotions is based on ElectroEncephaloGraphy (EEG) data analysis, which is used as input for classification systems. Collecting brain signals on several channels and for a wide range of emotions produces cumbersome datasets that are hard to manage, transmit, and use in varied applications. In this context, the paper introduces the Empátheia system, which explores a different EEG representation by encoding EEG signals into images prior to their classification. In particular, the proposed system extracts spatio-temporal image encodings, or atlases, from EEG data through the Processing and transfeR of Interaction States and Mappings through Image-based eNcoding (PRISMIN) framework, thus obtaining a compact representation of the input signals. The atlases are then classified through the Empátheia architecture, which comprises branches based on convolutional, recurrent, and transformer models designed and tuned to capture the spatial and temporal aspects of emotions. Extensive experiments were conducted on the Shanghai Jiao Tong University (SJTU) Emotion EEG Dataset (SEED) public dataset, where the proposed system significantly reduced its size while retaining high performance. The results obtained highlight the effectiveness of the proposed approach and suggest new avenues for data representation in emotion recognition from EEG signals.
Collapse
Affiliation(s)
- Danilo Avola
- Department of Computer Science, Sapienza University of Rome, Via Salaria 113, Rome 00198, Italy
| | - Luigi Cinque
- Department of Computer Science, Sapienza University of Rome, Via Salaria 113, Rome 00198, Italy
| | - Angelo Di Mambro
- Department of Computer Science, Sapienza University of Rome, Via Salaria 113, Rome 00198, Italy
| | - Alessio Fagioli
- Department of Computer Science, Sapienza University of Rome, Via Salaria 113, Rome 00198, Italy
| | - Marco Raoul Marini
- Department of Computer Science, Sapienza University of Rome, Via Salaria 113, Rome 00198, Italy
| | - Daniele Pannone
- Department of Computer Science, Sapienza University of Rome, Via Salaria 113, Rome 00198, Italy
| | - Bruno Fanini
- Institute of Heritage Science, National Research Council, Area della Ricerca Roma 1, SP35d, 9, Montelibretti 00010, Italy
| | - Gian Luca Foresti
- Department of Computer Science, Mathematics and Physics, University of Udine, Via delle Scienze 206, Udine 33100, Italy
| |
Collapse
|
6
|
Madni HA, Umer RM, Foresti GL. Robust Federated Learning for Heterogeneous Model and Data. Int J Neural Syst 2024; 34:2450019. [PMID: 38414421 DOI: 10.1142/s0129065724500199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
Data privacy and security is an essential challenge in medical clinical settings, where individual hospital has its own sensitive patients data. Due to recent advances in decentralized machine learning in Federated Learning (FL), each hospital has its own private data and learning models to collaborate with other trusted participating hospitals. Heterogeneous data and models among different hospitals raise major challenges in robust FL, such as gradient leakage, where participants can exploit model weights to infer data. Here, we proposed a robust FL method to efficiently tackle data and model heterogeneity, where we train our model using knowledge distillation and a novel weighted client confidence score on hematological cytomorphology data in clinical settings. In the knowledge distillation, each participant learns from other participants by a weighted confidence score so that knowledge from clean models is distributed other than the noisy clients possessing noisy data. Moreover, we use symmetric loss to reduce the negative impact of data heterogeneity and label diversity by reducing overfitting the model to noisy labels. In comparison to the current approaches, our proposed method performs the best, and this is the first demonstration of addressing both data and model heterogeneity in end-to-end FL that lays the foundation for robust FL in laboratories and clinical applications.
Collapse
Affiliation(s)
- Hussain Ahmad Madni
- Department of Mathematics, Computer Science and Physics (DMIF), University of Udine, Udine 33100, Italy
| | - Rao Muhammad Umer
- Institute of AI for Health, Helmholtz Zentrum München - German Research, Center for Environmental Health, Neuherberg 85764, Germany
| | - Gian Luca Foresti
- Department of Mathematics, Computer Science and Physics (DMIF), University of Udine, Udine 33100, Italy
| |
Collapse
|
7
|
Chen L, Leng L, Yang Z, Teoh ABJ. Enhanced Multitask Learning for Hash Code Generation of Palmprint Biometrics. Int J Neural Syst 2024; 34:2450020. [PMID: 38414422 DOI: 10.1142/s0129065724500205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
This paper presents a novel multitask learning framework for palmprint biometrics, which optimizes classification and hashing branches jointly. The classification branch within our framework facilitates the concurrent execution of three distinct tasks: identity recognition and classification of soft biometrics, encompassing gender and chirality. On the other hand, the hashing branch enables the generation of palmprint hash codes, optimizing for minimal storage as templates and efficient matching. The hashing branch derives the complementary information from these tasks by amalgamating knowledge acquired from the classification branch. This approach leads to superior overall performance compared to individual tasks in isolation. To enhance the effectiveness of multitask learning, two additional modules, an attention mechanism module and a customized gate control module, are introduced. These modules are vital in allocating higher weights to crucial channels and facilitating task-specific expert knowledge integration. Furthermore, an automatic weight adjustment module is incorporated to optimize the learning process further. This module fine-tunes the weights assigned to different tasks, improving performance. Integrating the three modules above has shown promising accuracies across various classification tasks and has notably improved authentication accuracy. The extensive experimental results validate the efficacy of our proposed framework.
Collapse
Affiliation(s)
- Lin Chen
- Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, Jiangxi, P. R. China
| | - Lu Leng
- Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, Jiangxi, P. R. China
| | - Ziyuan Yang
- College of Computer Science, Sichuan University, Chengdu, Sichuan, P. R. China
| | - Andrew Beng Jin Teoh
- School of Electrical and Electronic Engineering, College of Engineering, Yonsei University Seoul, Republic of Korea
| |
Collapse
|
8
|
Niu H, Yi Z, He T. A Bidirectional Feedforward Neural Network Architecture Using the Discretized Neural Memory Ordinary Differential Equation. Int J Neural Syst 2024; 34:2450015. [PMID: 38318709 DOI: 10.1142/s0129065724500151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
Deep Feedforward Neural Networks (FNNs) with skip connections have revolutionized various image recognition tasks. In this paper, we propose a novel architecture called bidirectional FNN (BiFNN), which utilizes skip connections to aggregate features between its forward and backward paths. The BiFNN accepts any FNN as a plugin that can incorporate any general FNN model into its forward path, introducing only a few additional parameters in the cross-path connections. The backward path is implemented as a nonparameter layer, utilizing a discretized form of the neural memory Ordinary Differential Equation (nmODE), which is named [Formula: see text]-net. We provide a proof of convergence for the [Formula: see text]-net and evaluate its initial value problem. Our proposed architecture is evaluated on diverse image recognition datasets, including Fashion-MNIST, SVHN, CIFAR-10, CIFAR-100, and Tiny-ImageNet. The results demonstrate that BiFNNs offer significant improvements compared to embedded models such as ConvMixer, ResNet, ResNeXt, and Vision Transformer. Furthermore, BiFNNs can be fine-tuned to achieve comparable performance with embedded models on Tiny-ImageNet and ImageNet-1K datasets by loading the same pretrained parameters.
Collapse
Affiliation(s)
- Hao Niu
- College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| | - Zhang Yi
- College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| | - Tao He
- College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| |
Collapse
|
9
|
Li R, Li J, Wang C, Liu H, Liu T, Wang X, Zou T, Huang W, Yan H, Chen H. Multi-Semantic Decoding of Visual Perception with Graph Neural Networks. Int J Neural Syst 2024; 34:2450016. [PMID: 38372016 DOI: 10.1142/s0129065724500163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Constructing computational decoding models to account for the cortical representation of semantic information plays a crucial role in understanding visual perception. The human visual system processes interactive relationships among different objects when perceiving the semantic contents of natural visions. However, the existing semantic decoding models commonly regard categories as completely separate and independent visually and semantically and rarely consider the relationships from prior information. In this work, a novel semantic graph learning model was proposed to decode multiple semantic categories of perceived natural images from brain activity. The proposed model was validated on the functional magnetic resonance imaging data collected from five normal subjects while viewing 2750 natural images comprising 52 semantic categories. The results showed that the Graph Neural Network-based decoding model achieved higher accuracies than other deep neural network models. Moreover, the co-occurrence probability among semantic categories showed a significant correlation with the decoding accuracy. Additionally, the results suggested that semantic content organized in a hierarchical way with higher visual areas was more closely related to the internal visual experience. Together, this study provides a superior computational framework for multi-semantic decoding that supports the visual integration mechanism of semantic processing.
Collapse
Affiliation(s)
- Rong Li
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Jiyi Li
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Chong Wang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Haoxiang Liu
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Tao Liu
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Xuyang Wang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Ting Zou
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Wei Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Hongmei Yan
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| | - Huafu Chen
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
- MOE Key Lab for Neuroinformation, High-Field Magnetic Resonance Brain Imaging, Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu 611731, P. R. China
| |
Collapse
|
10
|
Wang M, Yu J, Leng H, Du X, Liu Y. Bearing fault detection by using graph autoencoder and ensemble learning. Sci Rep 2024; 14:5206. [PMID: 38433237 PMCID: PMC10909884 DOI: 10.1038/s41598-024-55620-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/26/2024] [Indexed: 03/05/2024] Open
Abstract
The research and application of bearing fault diagnosis techniques are crucial for enhancing equipment reliability, extending bearing lifespan, and reducing maintenance expenses. Nevertheless, most existing methods encounter challenges in discriminating between signals from machines operating under normal and faulty conditions, leading to unstable detection results. To tackle this issue, the present study proposes a novel approach for bearing fault detection based on graph neural networks and ensemble learning. Our key contribution is a novel stochasticity-based compositional method that transforms Euclidean-structured data into a graph format for processing by graph neural networks, with feature fusion and a newly proposed ensemble learning strategy for outlier detection specifically designed for bearing fault diagnosis. This approach marks a significant advancement in accurately identifying bearing faults, highlighting our study's pivotal role in enhancing diagnostic methodologies.
Collapse
Affiliation(s)
- Meng Wang
- School of Information Science and Engineering, Xinjiang University, Urumqi, 830046, China.
| | - Jiong Yu
- School of Information Science and Engineering, Xinjiang University, Urumqi, 830046, China
| | - Hongyong Leng
- School of Software, Xinjiang University, Urumqi, 830046, China.
| | - Xusheng Du
- School of Information Science and Engineering, Xinjiang University, Urumqi, 830046, China
| | - Yiran Liu
- School of Information Science and Engineering, Xinjiang University, Urumqi, 830046, China
| |
Collapse
|
11
|
Zhang C, Xue Y, Neri F, Cai X, Slowik A. Multi-Objective Self-Adaptive Particle Swarm Optimization for Large-Scale Feature Selection in Classification. Int J Neural Syst 2024; 34:2450014. [PMID: 38352979 DOI: 10.1142/s012906572450014x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Feature selection (FS) is recognized for its role in enhancing the performance of learning algorithms, especially for high-dimensional datasets. In recent times, FS has been framed as a multi-objective optimization problem, leading to the application of various multi-objective evolutionary algorithms (MOEAs) to address it. However, the solution space expands exponentially with the dataset's dimensionality. Simultaneously, the extensive search space often results in numerous local optimal solutions due to a large proportion of unrelated and redundant features [H. Adeli and H. S. Park, Fully automated design of super-high-rise building structures by a hybrid ai model on a massively parallel machine, AI Mag. 17 (1996) 87-93]. Consequently, existing MOEAs struggle with local optima stagnation, particularly in large-scale multi-objective FS problems (LSMOFSPs). Different LSMOFSPs generally exhibit unique characteristics, yet most existing MOEAs rely on a single candidate solution generation strategy (CSGS), which may be less efficient for diverse LSMOFSPs [H. S. Park and H. Adeli, Distributed neural dynamics algorithms for optimization of large steel structures, J. Struct. Eng. ASCE 123 (1997) 880-888; M. Aldwaik and H. Adeli, Advances in optimization of highrise building structures, Struct. Multidiscip. Optim. 50 (2014) 899-919; E. G. González, J. R. Villar, Q. Tan, J. Sedano and C. Chira, An efficient multi-robot path planning solution using a* and coevolutionary algorithms, Integr. Comput. Aided Eng. 30 (2022) 41-52]. Moreover, selecting an appropriate MOEA and determining its corresponding parameter values for a specified LSMOFSP is time-consuming. To address these challenges, a multi-objective self-adaptive particle swarm optimization (MOSaPSO) algorithm is proposed, combined with a rapid nondominated sorting approach. MOSaPSO employs a self-adaptive mechanism, along with five modified efficient CSGSs, to generate new solutions. Experiments were conducted on ten datasets, and the results demonstrate that the number of features is effectively reduced by MOSaPSO while lowering the classification error rate. Furthermore, superior performance is observed in comparison to its counterparts on both the training and test sets, with advantages becoming increasingly evident as the dimensionality increases.
Collapse
Affiliation(s)
- Chenyi Zhang
- School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, P. R. China
| | - Yu Xue
- School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, P. R. China
| | - Ferrante Neri
- NICE Research Group, School of Computer Science and Electronic Engineering, University of Surrey Guildford, GU2 7XS, UK
| | - Xu Cai
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, P. R. China
| | - Adam Slowik
- Department of Electronics and Computer Science, Koszalin University of Technology, Koszalin 75-453, Poland
| |
Collapse
|
12
|
Zhao R, Xie Z, Zhuang Y, L H Yu P. Automated Quality Evaluation of Large-Scale Benchmark Datasets for Vision-Language Tasks. Int J Neural Syst 2024; 34:2450009. [PMID: 38318751 DOI: 10.1142/s0129065724500096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
Large-scale benchmark datasets are crucial in advancing research within the computer science communities. They enable the development of more sophisticated AI models and serve as "golden" benchmarks for evaluating their performance. Thus, ensuring the quality of these datasets is of utmost importance for academic research and the progress of AI systems. For the emerging vision-language tasks, some datasets have been created and frequently used, such as Flickr30k, COCO, and NoCaps, which typically contain a large number of images paired with their ground-truth textual descriptions. In this paper, an automatic method is proposed to assess the quality of large-scale benchmark datasets designed for vision-language tasks. In particular, a new cross-modal matching model is developed, which is capable of automatically scoring the textual descriptions of visual images. Subsequently, this model is employed to evaluate the quality of vision-language datasets by automatically assigning a score to each 'ground-truth' description for every image picture. With a good agreement between manual and automated scoring results on the datasets, our findings reveal significant disparities in the quality of the ground-truth descriptions included in the benchmark datasets. Even more surprising, it is evident that a small portion of the descriptions are unsuitable for serving as reliable ground-truth references. These discoveries emphasize the need for careful utilization of these publicly accessible benchmark databases.
Collapse
Affiliation(s)
- Ruibin Zhao
- Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong SAR, P. R. China
- School of Computer Science and Information Engineering, Chuzhou University, Chuzhou, P. R. China
| | - Zhiwei Xie
- Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong SAR, P. R. China
| | - Yipeng Zhuang
- Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong SAR, P. R. China
| | - Philip L H Yu
- Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong SAR, P. R. China
| |
Collapse
|
13
|
Zhu H, Xu Y, Wu Y, Shen N, Wang L, Chen C, Chen W. A Sequential End-to-End Neonatal Sleep Staging Model with Squeeze and Excitation Blocks and Sequential Multi-Scale Convolution Neural Networks. Int J Neural Syst 2024; 34:2450013. [PMID: 38369905 DOI: 10.1142/s0129065724500138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Automatic sleep staging offers a quick and objective assessment for quantitatively interpreting sleep stages in neonates. However, most of the existing studies either do not encompass any temporal information, or simply apply neural networks to exploit temporal information at the expense of high computational overhead and modeling ambiguity. This limits the application of these methods to multiple scenarios. In this paper, a sequential end-to-end sleep staging model, SeqEESleepNet, which is competent for parallelly processing sequential epochs and has a fast training rate to adapt to different scenarios, is proposed. SeqEESleepNet consists of a sequence epoch generation (SEG) module, a sequential multi-scale convolution neural network (SMSCNN) and squeeze and excitation (SE) blocks. The SEG module expands independent epochs into sequential signals, enabling the model to learn the temporal information between sleep stages. SMSCNN is a multi-scale convolution neural network that can extract both multi-scale features and temporal information from the signal. Subsequently, the followed SE block can reassign the weights of features through mapping and pooling. Experimental results exhibit that in a clinical dataset, the proposed method outperforms the state-of-the-art approaches, achieving an overall accuracy, F1-score, and Kappa coefficient of 71.8%, 71.8%, and 0.684 on a three-class classification task with a single channel EEG signal. Based on our overall results, we believe the proposed method could pave the way for convenient multi-scenario neonatal sleep staging methods.
Collapse
Affiliation(s)
- Hangyu Zhu
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| | - Yan Xu
- Department of Neurology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, P. R. China
| | - Yonglin Wu
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| | - Ning Shen
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| | - Laishuan Wang
- Department of Neurology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, P. R. China
| | - Chen Chen
- Human Phenome Institute, Fudan University, 825 Zhangheng Road, Shanghai 201203, P. R. China
| | - Wei Chen
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| |
Collapse
|
14
|
Mammone N, Ieracitano C, Spataro R, Guger C, Cho W, Morabito FC. A Few-Shot Transfer Learning Approach for Motion Intention Decoding from Electroencephalographic Signals. Int J Neural Syst 2024; 34:2350068. [PMID: 38073546 DOI: 10.1142/s0129065723500685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
In this study, a few-shot transfer learning approach was introduced to decode movement intention from electroencephalographic (EEG) signals, allowing to recognize new tasks with minimal adaptation. To this end, a dataset of EEG signals recorded during the preparation of complex sub-movements was created from a publicly available data collection. The dataset was divided into two parts: the source domain dataset (including 5 classes) and the support (target domain) dataset, (including 2 classes) with no overlap between the two datasets in terms of classes. The proposed methodology consists in projecting EEG signals into the space-frequency-time domain, in processing such projections (rearranged in channels × frequency frames) by means of a custom EEG-based deep neural network (denoted as EEGframeNET5), and then adapting the system to recognize new tasks through a few-shot transfer learning approach. The proposed method achieved an average accuracy of 72.45 ± 4.19% in the 5-way classification of samples from the source domain dataset, outperforming comparable studies in the literature. In the second phase of the study, a few-shot transfer learning approach was proposed to adapt the neural system and make it able to recognize new tasks in the support dataset. The results demonstrated the system's ability to adapt and recognize new tasks with an average accuracy of 80 ± 0.12% in discriminating hand opening/closing preparation and outperforming reported results in the literature. This study suggests the effectiveness of EEG in capturing information related to the motor preparation of complex movements, potentially paving the way for BCI systems based on motion planning decoding. The proposed methodology could be straightforwardly extended to advanced EEG signal processing in other scenarios, such as motor imagery or neural disorder classification.
Collapse
Affiliation(s)
- Nadia Mammone
- DICEAM, University Mediterranea of Reggio Calabria Via Zehender, Loc. Feo di Vito, Reggio Calabria, 89122, Italy
| | - Cosimo Ieracitano
- DICEAM, University Mediterranea of Reggio Calabria Via Zehender, Loc. Feo di Vito, Reggio Calabria, 89122, Italy
| | - Rossella Spataro
- ALS Clinical Research Center, BiND, University of Palermo, Palermo, Italy
- Intensive Rehabilitation Unit, Villa delle Ginestre Hospital, Palermo, Italy
| | | | - Woosang Cho
- g.tec Medical Engineering GmbH, 4521, Schiedlberg, Austria
| | - Francesco Carlo Morabito
- DICEAM, University Mediterranea of Reggio Calabria Via Zehender, Loc. Feo di Vito, Reggio Calabria, 89122, Italy
| |
Collapse
|
15
|
Rafiei MH, Gauthier LV, Adeli H, Takabi D. Self-Supervised Learning for Electroencephalography. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:1457-1471. [PMID: 35867362 DOI: 10.1109/tnnls.2022.3190448] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Decades of research have shown machine learning superiority in discovering highly nonlinear patterns embedded in electroencephalography (EEG) records compared with conventional statistical techniques. However, even the most advanced machine learning techniques require relatively large, labeled EEG repositories. EEG data collection and labeling are costly. Moreover, combining available datasets to achieve a large data volume is usually infeasible due to inconsistent experimental paradigms across trials. Self-supervised learning (SSL) solves these challenges because it enables learning from EEG records across trials with variable experimental paradigms, even when the trials explore different phenomena. It aggregates multiple EEG repositories to increase accuracy, reduce bias, and mitigate overfitting in machine learning training. In addition, SSL could be employed in situations where there is limited labeled training data, and manual labeling is costly. This article: 1) provides a brief introduction to SSL; 2) describes some SSL techniques employed in recent studies, including EEG; 3) proposes current and potential SSL techniques for future investigations in EEG studies; 4) discusses the cons and pros of different SSL techniques; and 5) proposes holistic implementation tips and potential future directions for EEG SSL practices.
Collapse
|
16
|
Tang X, Zhu Y. Enhancing bank marketing strategies with ensemble learning: Empirical analysis. PLoS One 2024; 19:e0294759. [PMID: 38206947 PMCID: PMC10783788 DOI: 10.1371/journal.pone.0294759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 11/08/2023] [Indexed: 01/13/2024] Open
Abstract
In order to enhance market share and competitiveness, large banks are increasingly focusing on promoting marketing strategies. However, the traditional bank marketing strategy often leads to the homogenization of customer demand, making it challenging to distinguish among various products. To address this issue, this paper presents a customer demand learning model based on financial datasets and optimizes the distribution model of bank big data channels through induction to rectify the imbalance in bank customer transaction data. By comparing the prediction models of random forest model and support vector machine (SVM), this paper analyzes the ability of the prediction model based on ensemble learning to significantly enhance the market segmentation of e-commerce banks. The empirical results reveal that the accuracy of random forest model reaches 92%, while the accuracy of SVM model reaches 87%. This indicates that the ensemble learning model has higher accuracy and forecasting ability than the single model. It enables the bank marketing system to implement targeted marketing, effectively maintain the relationship between customers and banks, and significantly improve the success probability of product marketing. Meanwhile, the marketing model based on ensemble learning has achieved a sales growth rate of 20% and improved customer satisfaction by 30%. This demonstrates that the implementation of the ensemble learning model has also significantly elevated the overall marketing level of bank e-commerce services. Therefore, this paper offers valuable academic guidance for bank marketing decision-making and holds important academic and practical significance in predicting bank customer demand and optimizing product marketing strategy.
Collapse
Affiliation(s)
- Xing Tang
- Institute of Traffic Engineering, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, China
| | - Yusi Zhu
- School of Mathematics, Sichuan University, Chengdu, Sichuan, China
| |
Collapse
|
17
|
Ma Y, Zhang W, Du M, Jing H, Zheng N. Hierarchical Bayesian Causality Network to Extract High-Level Semantic Information in Visual Cortex. Int J Neural Syst 2024; 34:2450002. [PMID: 38084473 DOI: 10.1142/s0129065724500023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
Functional MRI (fMRI) is a brain signal with high spatial resolution, and visual cognitive processes and semantic information in the brain can be represented and obtained through fMRI. In this paper, we design single-graphic and matched/unmatched double-graphic visual stimulus experiments and collect 12 subjects' fMRI data to explore the brain's visual perception processes. In the double-graphic stimulus experiment, we focus on the high-level semantic information as "matching", and remove tail-to-tail conjunction by designing a model to screen the matching-related voxels. Then, we perform Bayesian causal learning between fMRI voxels based on the transfer entropy, establish a hierarchical Bayesian causal network (HBcausalNet) of the visual cortex, and use the model for visual stimulus image reconstruction. HBcausalNet achieves an average accuracy of 70.57% and 53.70% in single- and double-graphic stimulus image reconstruction tasks, respectively, higher than HcorrNet and HcasaulNet. The results show that the matching-related voxel screening and causality analysis method in this paper can extract the "matching" information in fMRI, obtain a direct causal relationship between matching information and fMRI, and explore the causal inference process in the brain. It suggests that our model can effectively extract high-level semantic information in brain signals and model effective connections and visual perception processes in the visual cortex of the brain.
Collapse
Affiliation(s)
- Yongqiang Ma
- National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, P. R. China
| | - Wen Zhang
- National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, P. R. China
| | - Ming Du
- National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, P. R. China
| | - Haodong Jing
- National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, P. R. China
| | - Nanning Zheng
- National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, P. R. China
| |
Collapse
|
18
|
Xu L, Guo C, Liu M. A weighted distance-based dynamic ensemble regression framework for gastric cancer survival time prediction. Artif Intell Med 2024; 147:102740. [PMID: 38184344 DOI: 10.1016/j.artmed.2023.102740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Revised: 10/28/2023] [Accepted: 11/28/2023] [Indexed: 01/08/2024]
Abstract
Accurate prediction of gastric cancer patient survival time is essential for clinical decision-making. However, unified static models lack specificity and flexibility in predictions owing to the varying survival outcomes among gastric cancer patients. We address these problems by using an ensemble learning approach and adaptively assigning greater weights to similar patients to make more targeted predictions when predicting an individual's survival time. We treat these problems as regression problems and introduce a weighted dynamic ensemble regression framework. To better identify similar patients, we devise a method to measure patient similarity, considering the diverse impacts of features. Subsequently, we use this measure to design both a weighted K-means clustering method and a fuzzy K-means sampling technique to group patients and train corresponding base regressors. To achieve more targeted predictions, we calculate the weight of each base regressor based on the similarity between the patient to be predicted and the patient clusters, culminating in the integration of the results. The model is validated on a dataset of 7791 patients, outperforming other models in terms of three evaluation metrics, namely, the root mean square error, mean absolute error, and the coefficient of determination. The weighted dynamic ensemble regression strategy can improve the baseline model by 1.75%, 2.12%, and 13.45% in terms of the three respective metrics while also mitigating the imbalanced survival time distribution issue. This enhanced performance has been statistically validated, even when tested on six public datasets with different sizes. By considering feature variations, patients with distinct survival profiles can be effectively differentiated, and the model predictive performance can be enhanced. The results generated by our proposed model can be invaluable in guiding decisions related to treatment plans and resource allocation. Furthermore, the model has the potential for broader applications in prognosis for other types of cancers or similar regression problems in various domains.
Collapse
Affiliation(s)
- Liangchen Xu
- Institute of Systems Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Chonghui Guo
- Institute of Systems Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Mucan Liu
- Institute of Systems Engineering, Dalian University of Technology, Dalian 116024, China.
| |
Collapse
|
19
|
Ganjali M, Mehridehnavi A, Rakhshani S, Khorasani A. Unsupervised Neural Manifold Alignment for Stable Decoding of Movement from Cortical Signals. Int J Neural Syst 2024; 34:2450006. [PMID: 38063378 DOI: 10.1142/s0129065724500060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
The stable decoding of movement parameters using neural activity is crucial for the success of brain-machine interfaces (BMIs). However, neural activity can be unstable over time, leading to changes in the parameters used for decoding movement, which can hinder accurate movement decoding. To tackle this issue, one approach is to transfer neural activity to a stable, low-dimensional manifold using dimensionality reduction techniques and align manifolds across sessions by maximizing correlations of the manifolds. However, the practical use of manifold stabilization techniques requires knowledge of the true subject intentions such as target direction or behavioral state. To overcome this limitation, an automatic unsupervised algorithm is proposed that determines movement target intention before manifold alignment in the presence of manifold rotation and scaling across sessions. This unsupervised algorithm is combined with a dimensionality reduction and alignment method to overcome decoder instabilities. The effectiveness of the BMI stabilizer method is represented by decoding the two-dimensional (2D) hand velocity of two rhesus macaque monkeys during a center-out-reaching movement task. The performance of the proposed method is evaluated using correlation coefficient and R-squared measures, demonstrating higher decoding performance compared to a state-of-the-art unsupervised BMI stabilizer. The results offer benefits for the automatic determination of movement intents in long-term BMI decoding. Overall, the proposed method offers a promising automatic solution for achieving stable and accurate movement decoding in BMI applications.
Collapse
Affiliation(s)
- Mohammadali Ganjali
- Department of Biomedical Engineering, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Alireza Mehridehnavi
- Department of Biomedical Engineering, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Sajed Rakhshani
- Medical Image and Signal Processing Research Center, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Abed Khorasani
- Department of Neurology, Northwestern University, Chicago, IL, 60611, USA
- Neuroscience Research Center, Institute of Neuropharmacology, Kerman University of Medical Sciences, Kerman, Iran
| |
Collapse
|
20
|
Dózsa T, Deuschle F, Cornelis B, Kovács P. Variable Projection Support Vector Machines and Some Applications Using Adaptive Hermite Expansions. Int J Neural Syst 2024; 34:2450004. [PMID: 38073547 DOI: 10.1142/s0129065724500047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
In this paper, we develop the so-called variable projection support vector machine (VP-SVM) algorithm that is a generalization of the classical SVM. In fact, the VP block serves as an automatic feature extractor to the SVM, which are trained simultaneously. We consider the primal form of the arising optimization task and investigate the use of nonlinear kernels. We show that by choosing the so-called adaptive Hermite function system as the basis of the orthogonal projections in our classification scheme, several real-world signal processing problems can be successfully solved. In particular, we test the effectiveness of our method in two case studies corresponding to anomaly detection. First, we consider the detection of abnormal peaks in accelerometer data caused by sensor malfunction. Then, we show that the proposed classification algorithm can be used to detect abnormalities in ECG data. Our experiments show that the proposed method produces comparable results to the state-of-the-art while retaining desired properties of SVM classification such as light weight architecture and interpretability. We implement the proposed method on a microcontroller and demonstrate its ability to be used for real-time applications. To further minimize computational cost, discrete orthogonal adaptive Hermite functions are introduced for the first time.
Collapse
Affiliation(s)
- Tamás Dózsa
- Department of Numerical Analysis, HUN-REN Institute for Computer Science and Control, Eötvös Loránd University, Budapest H-1111, Hungary
| | - Federico Deuschle
- Siemens Digital Industries Software, 68 Interleuvenlaan KU Leuven, Department of Mechanical Engineering, Leuven B-3001, Belgium
| | - Bram Cornelis
- Siemens Digital Industries Software, 68 Interleuvenlaan KU Leuven, Department of Mechanical Engineering, Leuven B-3001, Belgium
| | - Péter Kovács
- Department of Numerical Analysis, Eötvös Loránd University, Pázmány Péter sétány 1/C Budapest 1117, Hungary
| |
Collapse
|
21
|
Leng J, Zhu J, Yan Y, Yu X, Liu M, Lou Y, Liu Y, Gao L, Sun Y, He T, Yang Q, Feng C, Wang D, Zhang Y, Xu Q, Xu F. Multilevel Laser-Induced Pain Measurement with Wasserstein Generative Adversarial Network - Gradient Penalty Model. Int J Neural Syst 2024; 34:2350067. [PMID: 38149912 DOI: 10.1142/s0129065723500673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
Pain is an experience of unpleasant sensations and emotions associated with actual or potential tissue damage. In the global context, billions of people are affected by pain disorders. There are particular challenges in the measurement and assessment of pain, and the commonly used pain measuring tools include traditional subjective scoring methods and biomarker-based measures. The main tools for biomarker-based analysis are electroencephalography (EEG), electrocardiography and functional magnetic resonance. The EEG-based quantitative pain measurements are of immense value in clinical pain management and can provide objective assessments of pain intensity. The assessment of pain is now primarily limited to the identification of the presence or absence of pain, with less research on multilevel pain. High power laser stimulation pain experimental paradigm and five pain level classification methods based on EEG data augmentation are presented. First, the EEG features are extracted using modified S-transform, and the time-frequency information of the features is retained. Based on the pain recognition effect, the 20-40[Formula: see text]Hz frequency band features are optimized. Afterwards the Wasserstein generative adversarial network with gradient penalty is used for feature data augmentation. It can be inferred from the good classification performance of features in the parietal region of the brain that the sensory function of the parietal lobe region is effectively activated during the occurrence of pain. By comparing the latest data augmentation methods and classification algorithms, the proposed method has significant advantages for the five-level pain dataset. This research provides new ways of thinking and research methods related to pain recognition, which is essential for the study of neural mechanisms and regulatory mechanisms of pain.
Collapse
Affiliation(s)
- Jiancai Leng
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Jianqun Zhu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yihao Yan
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Xin Yu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Ming Liu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yitai Lou
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yanbing Liu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Licai Gao
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yuan Sun
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Tianzheng He
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Qingbo Yang
- School of Mathematics and Statistics, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Chao Feng
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Dezheng Wang
- Rehabilitation Center, Qilu Hospital of Shandong University, Jinan 250012, P. R. China
| | - Yang Zhang
- Rehabilitation Center, Qilu Hospital of Shandong University, Jinan 250012, P. R. China
| | - Qing Xu
- Shandong Institute of Scientific and Technical Information, Jinan 250101, P. R. China
| | - Fangzhou Xu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| |
Collapse
|
22
|
Villarrubia-Martin EA, Rodriguez-Benitez L, Jimenez-Linares L, Muñoz-Valero D, Liu J. A Hybrid Online Off-Policy Reinforcement Learning Agent Framework Supported by Transformers. Int J Neural Syst 2023; 33:2350065. [PMID: 37857407 DOI: 10.1142/s012906572350065x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2023]
Abstract
Reinforcement learning (RL) is a powerful technique that allows agents to learn optimal decision-making policies through interactions with an environment. However, traditional RL algorithms suffer from several limitations such as the need for large amounts of data and long-term credit assignment, i.e. the problem of determining which actions actually produce a certain reward. Recently, Transformers have shown their capacity to address these constraints in this area of learning in an offline setting. This paper proposes a framework that uses Transformers to enhance the training of online off-policy RL agents and address the challenges described above through self-attention. The proposal introduces a hybrid agent with a mixed policy that combines an online off-policy agent with an offline Transformer agent using the Decision Transformer architecture. By sequentially exchanging the experience replay buffer between the agents, the agent's learning training efficiency is improved in the first iterations and so is the training of Transformer-based RL agents in situations with limited data availability or unknown environments.
Collapse
Affiliation(s)
- Enrique Adrian Villarrubia-Martin
- Department of Technologies and Information Systems, Universidad de Castilla-La Mancha, Paseo de la Universidad 4, 13005 Ciudad Real, Spain
| | - Luis Rodriguez-Benitez
- Department of Technologies and Information Systems, Universidad de Castilla-La Mancha, Paseo de la Universidad 4, 13005 Ciudad Real, Spain
| | - Luis Jimenez-Linares
- Department of Technologies and Information Systems, Universidad de Castilla-La Mancha, Paseo de la Universidad 4, 13005 Ciudad Real, Spain
| | - David Muñoz-Valero
- Department of Technologies and Information Systems, Universidad de Castilla-La Mancha, Avenida Carlos III, s/n, 45004 Toledo, Spain
| | - Jun Liu
- School of Computing, University of Ulster, Northern Ireland, UK
| |
Collapse
|
23
|
Hu J, Yu C, Yi Z, Zhang H. Enhancing Robustness of Medical Image Segmentation Model with Neural Memory Ordinary Differential Equation. Int J Neural Syst 2023; 33:2350060. [PMID: 37743765 DOI: 10.1142/s0129065723500600] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Deep neural networks (DNNs) have emerged as a prominent model in medical image segmentation, achieving remarkable advancements in clinical practice. Despite the promising results reported in the literature, the effectiveness of DNNs necessitates substantial quantities of high-quality annotated training data. During experiments, we observe a significant decline in the performance of DNNs on the test set when there exists disruption in the labels of the training dataset, revealing inherent limitations in the robustness of DNNs. In this paper, we find that the neural memory ordinary differential equation (nmODE), a recently proposed model based on ordinary differential equations (ODEs), not only addresses the robustness limitation but also enhances performance when trained by the clean training dataset. However, it is acknowledged that the ODE-based model tends to be less computationally efficient compared to the conventional discrete models due to the multiple function evaluations required by the ODE solver. Recognizing the efficiency limitation of the ODE-based model, we propose a novel approach called the nmODE-based knowledge distillation (nmODE-KD). The proposed method aims to transfer knowledge from the continuous nmODE to a discrete layer, simultaneously enhancing the model's robustness and efficiency. The core concept of nmODE-KD revolves around enforcing the discrete layer to mimic the continuous nmODE by minimizing the KL divergence between them. Experimental results on 18 organs-at-risk segmentation tasks demonstrate that nmODE-KD exhibits improved robustness compared to ODE-based models while also mitigating the efficiency limitation.
Collapse
Affiliation(s)
- Junjie Hu
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| | - Chengrong Yu
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| | - Zhang Yi
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| | - Haixian Zhang
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| |
Collapse
|
24
|
Laport F, Dapena A, Castro PM, Iglesias DI, Vazquez-Araujo FJ. Eye State Detection Using Frequency Features from 1 or 2-Channel EEG. Int J Neural Syst 2023; 33:2350062. [PMID: 37822240 DOI: 10.1142/s0129065723500624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Brain-computer interfaces (BCIs) establish a direct communication channel between the human brain and external devices. Among various methods, electroencephalography (EEG) stands out as the most popular choice for BCI design due to its non-invasiveness, ease of use, and cost-effectiveness. This paper aims to present and compare the accuracy and robustness of an EEG system employing one or two channels. We present both hardware and algorithms for the detection of open and closed eyes. Firstly, we utilize a low-cost hardware device to capture EEG activity from one or two channels. Next, we apply the discrete Fourier transform to analyze the signals in the frequency domain, extracting features from each channel. For classification, we test various well-known techniques, including Linear Discriminant Analysis (LDA), Support Vector Machine (SVM), Decision Tree (DT), or Logistic Regression (LR). To evaluate the system, we conduct experiments, acquiring signals associated with open and closed eyes, and compare the performance between one and two channels. The results demonstrate that employing a system with two channels and using SVM, DT, or LR classifiers enhances robustness compared to a single-channel setup and allows us to achieve an accuracy percentage greater than 95% for both eye states.
Collapse
Affiliation(s)
- Francisco Laport
- CITIC Research Centre & University of A Coruña, Campus de Elviña, s/n A Coruña, 15071, Spain
| | - Adriana Dapena
- CITIC Research Centre & University of A Coruña, Campus de Elviña, s/n A Coruña, 15071, Spain
| | - Paula M Castro
- CITIC Research Centre & University of A Coruña, Campus de Elviña, s/n A Coruña, 15071, Spain
| | - Daniel I Iglesias
- CITIC Research Centre & University of A Coruña, Campus de Elviña, s/n A Coruña, 15071, Spain
| | | |
Collapse
|
25
|
Teran-Pineda D, Thurnhofer-Hemsi K, Domínguez E. Human Gait Activity Recognition Using Multimodal Sensors. Int J Neural Syst 2023; 33:2350058. [PMID: 37779221 DOI: 10.1142/s0129065723500582] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]
Abstract
Human activity recognition is an application of machine learning with the aim of identifying activities from the gathered activity raw data acquired by different sensors. In medicine, human gait is commonly analyzed by doctors to detect abnormalities and determine possible treatments for the patient. Monitoring the patient's activity is paramount in evaluating the treatment's evolution. This type of classification is still not enough precise, which may lead to unfavorable reactions and responses. A novel methodology that reduces the complexity of extracting features from multimodal sensors is proposed to improve human activity classification based on accelerometer data. A sliding window technique is used to demarcate the first dominant spectral amplitude, decreasing dimensionality and improving feature extraction. In this work, we compared several state-of-art machine learning classifiers evaluated on the HuGaDB dataset and validated on our dataset. Several configurations to reduce features and training time were analyzed using multimodal sensors: all-axis spectrum, single-axis spectrum, and sensor reduction.
Collapse
Affiliation(s)
- Diego Teran-Pineda
- Department of Computer Languages and Computer Science, University of Málaga Bulevar Louis Pasteur, 35, 29071, Málaga, Spain
- Biomedical Research Institute of Málaga (IBIMA), C/ Doctor Miguel Díaz Recio, 28, 29010, Málaga, Spain
| | - Karl Thurnhofer-Hemsi
- Department of Computer Languages and Computer Science, University of Málaga Bulevar Louis Pasteur, 35, 29071, Málaga, Spain
- Biomedical Research Institute of Málaga (IBIMA), C/ Doctor Miguel Díaz Recio, 28, 29010, Málaga, Spain
| | - Enrique Domínguez
- Department of Computer Languages and Computer Science, University of Málaga Bulevar Louis Pasteur, 35, 29071, Málaga, Spain
- Biomedical Research Institute of Málaga (IBIMA), C/ Doctor Miguel Díaz Recio, 28, 29010, Málaga, Spain
| |
Collapse
|
26
|
Bakasa W, Viriri S. Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting. Front Artif Intell 2023; 6:1232640. [PMID: 37876961 PMCID: PMC10591225 DOI: 10.3389/frai.2023.1232640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 09/04/2023] [Indexed: 10/26/2023] Open
Abstract
Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are the most advanced solution to many machine learning problems. These techniques entail training multiple models and combining their predictions to improve the predictive performance of a single model. This article introduces the idea of Stacked Ensemble Deep Learning (SEDL), a pipeline for classifying pancreas CT medical images. The weak learners are Inception V3, VGG16, and ResNet34, and we employed a stacking ensemble. By combining the first-level predictions, an input train set for XGBoost, the ensemble model at the second level of prediction, is created. Extreme Gradient Boosting (XGBoost), employed as a strong learner, will make the final classification. Our findings showed that SEDL performed better, with a 98.8% ensemble accuracy, after some adjustments to the hyperparameters. The Cancer Imaging Archive (TCIA) public access dataset consists of 80 pancreas CT scans with a resolution of 512 * 512 pixels, from 53 male and 27 female subjects. A sample of two hundred and twenty-two images was used for training and testing data. We concluded that implementing the SEDL technique is an effective way to strengthen the robustness and increase the performance of the pipeline for classifying pancreas CT medical images. Interestingly, grouping like-minded or talented learners does not make a difference.
Collapse
Affiliation(s)
| | - Serestina Viriri
- School of Mathematics Statistics & Computer Science, College of Agriculture, Engineering and Science, University of KwaZulu-Natal, Durban, South Africa
| |
Collapse
|
27
|
Vernikos I, Spyrou E, Kostis IA, Mathe E, Mylonas P. A Deep Regression Approach for Human Activity Recognition Under Partial Occlusion. Int J Neural Syst 2023; 33:2350047. [PMID: 37602705 DOI: 10.1142/s0129065723500478] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/22/2023]
Abstract
In real-life scenarios, Human Activity Recognition (HAR) from video data is prone to occlusion of one or more body parts of the human subjects involved. Although it is common sense that the recognition of the majority of activities strongly depends on the motion of some body parts, which when occluded compromise the performance of recognition approaches, this problem is often underestimated in contemporary research works. Currently, training and evaluation is based on datasets that have been shot under laboratory (ideal) conditions, i.e. without any kind of occlusion. In this work, we propose an approach for HAR in the presence of partial occlusion, in cases wherein up to two body parts are involved. We assume that human motion is modeled using a set of 3D skeletal joints and also that occluded body parts remain occluded during the whole duration of the activity. We solve this problem using regression, performed by a novel deep Convolutional Recurrent Neural Network (CRNN). Specifically, given a partially occluded skeleton, we attempt to reconstruct the missing information regarding the motion of its occluded part(s). We evaluate our approach using four publicly available human motion datasets. Our experimental results indicate a significant increase of performance, when compared to baseline approaches, wherein networks that have been trained using only nonoccluded or both occluded and nonoccluded samples are evaluated using occluded samples. To the best of our knowledge, this is the first research work that formulates and copes with the problem of HAR under occlusion as a regression task.
Collapse
Affiliation(s)
- Ioannis Vernikos
- Department of Informatics and Telecommunications, University of Thessaly, 3rd Km Old National Road Lamia-Athens, Lamia 35132, Greece
| | - Evaggelos Spyrou
- Department of Informatics and Telecommunications, University of Thessaly, 3rd Km Old National Road Lamia-Athens, Lamia 35132, Greece
| | - Ioannis-Aris Kostis
- Department of Informatics and Telecommunications, University of Thessaly, 3rd Km Old National Road Lamia-Athens, Lamia 35132, Greece
| | - Eirini Mathe
- Department of Informatics, Ionian University, 7 Tsirigoti Square, Corfu 49100, Greece
| | - Phivos Mylonas
- Department of Informatics and Computer Engineering, University of West Attica, Egaleo Park, Agiou Spyridonos Street, 12243 Egaleo, Athens, Greece
| |
Collapse
|
28
|
Aversano L, Bernardi ML, Cimitile M, Maiellaro A, Pecori R. A systematic review on artificial intelligence techniques for detecting thyroid diseases. PeerJ Comput Sci 2023; 9:e1394. [PMID: 37346658 PMCID: PMC10280452 DOI: 10.7717/peerj-cs.1394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 04/21/2023] [Indexed: 06/23/2023]
Abstract
The use of artificial intelligence approaches in health-care systems has grown rapidly over the last few years. In this context, early detection of diseases is the most common area of application. In this scenario, thyroid diseases are an example of illnesses that can be effectively faced if discovered quite early. Detecting thyroid diseases is crucial in order to treat patients effectively and promptly, by saving lives and reducing healthcare costs. This work aims at systematically reviewing and analyzing the literature on various artificial intelligence-related techniques applied to the detection and identification of various diseases related to the thyroid gland. The contributions we reviewed are classified according to different viewpoints and taxonomies in order to highlight pros and cons of the most recent research in the field. After a careful selection process, we selected and reviewed 72 papers, analyzing them according to three main research questions, i.e., which diseases of the thyroid gland are detected by different artificial intelligence techniques, which datasets are used to perform the aforementioned detection, and what types of data are used to perform the detection. The review demonstrates that the majority of the considered papers deal with supervised methods to detect hypo- and hyperthyroidism. The average accuracy of detection is high (96.84%), but the usage of private and outdated datasets with a majority of clinical data is very common. Finally, we discuss the outcomes of the systematic review, pointing out advantages, disadvantages, and future developments in the application of artificial intelligence for thyroid diseases detection.
Collapse
Affiliation(s)
- Lerina Aversano
- Department of Engineering, University of Sannio, Benevento, Italy
| | | | - Marta Cimitile
- Dept. of Law and Digital Society, UnitelmaSapienza University, Rome, Italy
| | - Andrea Maiellaro
- Department of Engineering, University of Sannio, Benevento, Italy
| | - Riccardo Pecori
- Institute of Materials for Electronics and Magnetism, National Research Council, Parma, Italy
- SMARTEST Research Centre, eCampus University, Novedrate (CO), Italy
| |
Collapse
|
29
|
Manikandan P, Durga U, Ponnuraja C. An integrative machine learning framework for classifying SEER breast cancer. Sci Rep 2023; 13:5362. [PMID: 37005484 PMCID: PMC10067827 DOI: 10.1038/s41598-023-32029-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 03/21/2023] [Indexed: 04/04/2023] Open
Abstract
Breast cancer is the commonest type of cancer in women worldwide and the leading cause of mortality for females. The aim of this research is to classify the alive and death status of breast cancer patients using the Surveillance, Epidemiology, and End Results dataset. Due to its capacity to handle enormous data sets systematically, machine learning and deep learning has been widely employed in biomedical research to answer diverse classification difficulties. Pre-processing the data enables its visualization and analysis for use in making important decisions. This research presents a feasible machine learning-based approach for categorizing SEER breast cancer dataset. Moreover, a two-step feature selection method based on Variance Threshold and Principal Component Analysis was employed to select the features from the SEER breast cancer dataset. After selecting the features, the classification of the breast cancer dataset is carried out using Supervised and Ensemble learning techniques such as Ada Boosting, XG Boosting, Gradient Boosting, Naive Bayes and Decision Tree. Utilizing the train-test split and k-fold cross-validation approaches, the performance of various machine learning algorithms is examined. The accuracy of Decision Tree for both train-test split and cross validation achieved as 98%. In this study, it is observed that the Decision Tree algorithm outperforms other supervised and ensemble learning approaches for the SEER Breast Cancer dataset.
Collapse
Affiliation(s)
- P Manikandan
- Department of Data Science, Loyola College, Chennai, 600 034, India.
| | - U Durga
- Department of Data Science, Loyola College, Chennai, 600 034, India
| | - C Ponnuraja
- ICMR-National Institute for Research in Tuberculosis, Chennai, 600 031, India.
| |
Collapse
|
30
|
Qin X, Niu Y, Zhou H, Li X, Jia W, Zheng Y. Driver Drowsiness EEG Detection Based on Tree Federated Learning and Interpretable Network. Int J Neural Syst 2023; 33:2350009. [PMID: 36655401 DOI: 10.1142/s0129065723500090] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Accurate identification of driver's drowsiness state through Electroencephalogram (EEG) signals can effectively reduce traffic accidents, but EEG signals are usually stored in various clients in the form of small samples. This study attempts to construct an efficient and accurate privacy-preserving drowsiness monitoring system, and proposes a fusion model based on tree Federated Learning (FL) and Convolutional Neural Network (CNN), which can not only identify and explain the driver's drowsiness state, but also integrate the information of different clients under the premise of privacy protection. Each client uses CNN with the Global Average Pooling (GAP) layer and shares model parameters. The tree FL transforms communication relationships into a graph structure, and model parameters are transmitted in parallel along connected branches of the graph. Moreover, the Class Activation Mapping (CAM) is used to find distinctive EEG features for representing specific classes. On EEG data of 11 subjects, it is found that this method has higher average accuracy, F1-score and AUC than the traditional classification method, reaching 73.56%, 73.26% and 78.23%, respectively. Compared with the traditional FL algorithm, this method better protects the driver's privacy and improves communication efficiency.
Collapse
Affiliation(s)
- Xue Qin
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, P. R. China
| | - Yi Niu
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, P. R. China
| | - Huiyu Zhou
- School of Computing and Mathematical Sciences, University of Leicester, Leicester, LE1 7RH, UK
| | - Xiaojie Li
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, P. R. China
| | - Weikuan Jia
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, P. R. China
| | - Yuanjie Zheng
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, P. R. China
| |
Collapse
|
31
|
Maisano R, Foresti GL. A Sentiment Analysis Anomaly Detection System for Cyber Intelligence. Int J Neural Syst 2023; 33:2350003. [PMID: 36585854 DOI: 10.1142/s012906572350003x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Considering the 2030 United Nations intent of world connection, Cyber Intelligence becomes the main area of the human dimension able of inflicting changes in geopolitical dynamics. In cyberspace, the new battlefield is the mind of people including new weapons like abuse of social media with information manipulation, deception by activists and misinformation. In this paper, a Sentiment Analysis system with Anomaly Detection (SAAD) capability is proposed. The system, scalable and modular, uses an OSINT-Deep Learning approach to investigate on social media sentiment in order to predict suspicious anomaly trend in Twitter posts. Anomaly detection is investigated with a new semi-supervised process that is able to detect potentially dangerous situations in critical areas. The main contributions of the paper are the system suitability for working in different areas and domains, the anomaly detection procedure in sentiment context and a time-dependent confusion matrix to address model evaluation with unbalanced dataset. Real experiments and tests were performed on Sahel Region. The detected anomalies in negative sentiment have been checked by experts of Sahel area, proving true links between the models results and real situations observable from the tweets.
Collapse
Affiliation(s)
- Roberta Maisano
- Computer Science Centre, University of Messina, Piazza Antonello, 2, 98122 Messina, Italy
| | - Gian Luca Foresti
- Department of Mathematics, Computer Science and Physics, University of Udine, Viale delle Scienze, 206, 33100 Udine, Italy
| |
Collapse
|
32
|
Kumoi G, Yagi H, Kobayashi M, Goto M, Hirasawa S. Performance Evaluation of Error-Correcting Output Coding Based on Noisy and Noiseless Binary Classifiers. Int J Neural Syst 2023; 33:2350004. [PMID: 36624957 DOI: 10.1142/s0129065723500041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Error-correcting output coding (ECOC) is a method for constructing a multi-valued classifier using a combination of given binary classifiers. ECOC can estimate the correct category by other binary classifiers even if the output of some binary classifiers is incorrect based on the framework of the coding theory. The code word table representing the combination of these binary classifiers is important in ECOC. ECOC is known to perform well experimentally on real data. However, the complexity of the classification problem makes it difficult to analyze the classification performance in detail. For this reason, theoretical analysis of ECOC has not been conducted. In this study, if a binary classifier outputs the estimated posterior probability with errors, then this binary classifier is said to be noisy. In contrast, if a binary classifier outputs the true posterior probability, then this binary classifier is said to be noiseless. For a theoretical analysis of ECOC, we discuss the optimality for the code word table with noiseless binary classifiers and the error rate for one with noisy binary classifiers. This evaluation result shows that the Hamming distance of the code word table is an important indicator.
Collapse
Affiliation(s)
- Gendo Kumoi
- Center for Data Science, Waseda University, 1-6-1, Nishiwaseda, Shinjuku-ku, Tokyo 169-8050, Japan
| | - Hideki Yagi
- Department of Computer and Network Engineering, The University of Electro-Communications, 1-5-1, Chofugaoka, Chofu, Tokyo 182-8585, Japan
| | - Manabu Kobayashi
- Center for Data Science, Waseda University, 1-6-1, Nishiwaseda, Shinjuku-ku, Tokyo 169-8050, Japan
| | - Masayuki Goto
- School of Creative Science and Engineering, Waseda University, 3-4-1, Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
| | - Shigeichi Hirasawa
- Center for Data Science, Waseda University, 1-6-1, Nishiwaseda, Shinjuku-ku, Tokyo 169-8050, Japan
| |
Collapse
|
33
|
Fei N, Li R, Cui H, Hu Y. A Prediction Model for Normal Variation of Somatosensory Evoked Potential During Scoliosis Surgery. Int J Neural Syst 2023; 33:2350005. [PMID: 36581320 DOI: 10.1142/s0129065723500053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Somatosensory evoked potential (SEP) has been commonly used as intraoperative monitoring to detect the presence of neurological deficits during scoliosis surgery. However, SEP usually presents an enormous variation in response to patient-specific factors such as physiological parameters leading to the false warning. This study proposes a prediction model to quantify SEP amplitude variation due to noninjury-related physiological changes of the patient undergoing scoliosis surgery. Based on a hybrid network of attention-based long-short-term memory (LSTM) and convolutional neural networks (CNNs), we develop a deep learning-based framework for predicting the SEP value in response to variation of physiological variables. The training and selection of model parameters were based on a 5-fold cross-validation scheme using mean square error (MSE) as evaluation metrics. The proposed model obtained MSE of 0.027[Formula: see text][Formula: see text] on left cortical SEP, MSE of 0.024[Formula: see text][Formula: see text] on left subcortical SEP, MSE of 0.031[Formula: see text][Formula: see text] on right cortical SEP, and MSE of 0.025[Formula: see text][Formula: see text] on right subcortical SEP based on the test set. The proposed model could quantify the affection from physiological parameters to the SEP amplitude in response to normal variation of physiology during scoliosis surgery. The prediction of SEP amplitude provides a potential varying reference for intraoperative SEP monitoring.
Collapse
Affiliation(s)
- Ningbo Fei
- Department of Orthopaedics and Traumatology, The University of Hong Kong - Shenzhen Hospital, Shenzhen 518058, Guangdong, P. R. China.,Department of Orthopeadics and Traumatology, The University of Hong Kong, Pokfulam, Hong Kong
| | - Rong Li
- Department of Orthopaedics and Traumatology, The University of Hong Kong - Shenzhen Hospital, Shenzhen 518058, Guangdong, P. R. China.,Department of Orthopeadics and Traumatology, The University of Hong Kong, Pokfulam, Hong Kong
| | - Hongyan Cui
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300192, P. R. China
| | - Yong Hu
- Department of Orthopaedics and Traumatology, The University of Hong Kong - Shenzhen Hospital, Shenzhen 518058, Guangdong, P. R. China.,Department of Orthopeadics and Traumatology, The University of Hong Kong, Pokfulam, Hong Kong
| |
Collapse
|
34
|
An Extension Network of Dendritic Neurons. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2023; 2023:7037124. [PMID: 36726357 PMCID: PMC9886486 DOI: 10.1155/2023/7037124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 12/08/2022] [Accepted: 01/07/2023] [Indexed: 01/24/2023]
Abstract
Deep learning (DL) has achieved breakthrough successes in various tasks, owing to its layer-by-layer information processing and sufficient model complexity. However, DL suffers from the issues of both redundant model complexity and low interpretability, which are mainly because of its oversimplified basic McCulloch-Pitts neuron unit. A widely recognized biologically plausible dendritic neuron model (DNM) has demonstrated its effectiveness in alleviating the aforementioned issues, but it can only solve binary classification tasks, which significantly limits its applicability. In this study, a novel extended network based on the dendritic structure is innovatively proposed, thereby enabling it to solve multiple-class classification problems. Also, for the first time, an efficient error-back-propagation learning algorithm is derived. In the extensive experimental results, the effectiveness and superiority of the proposed method in comparison with other nine state-of-the-art classifiers on ten datasets are demonstrated, including a real-world quality of web service application. The experimental results suggest that the proposed learning algorithm is competent and reliable in terms of classification performance and stability and has a notable advantage in small-scale disequilibrium data. Additionally, aspects of network structure constrained by scale are examined.
Collapse
|
35
|
Nguyen DK, Nguyen TP, Ngamkhanong C, Keawsawasvong S, Lai VQ. Bearing capacity of ring footings in anisotropic clays: FELA and ANN. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08278-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
36
|
Koutrintzes D, Spyrou E, Mathe E, Mylonas P. A Multimodal Fusion Approach for Human Activity Recognition. Int J Neural Syst 2023; 33:2350002. [PMID: 36573880 DOI: 10.1142/s0129065723500028] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The problem of human activity recognition (HAR) has been increasingly attracting the efforts of the research community, having several applications. It consists of recognizing human motion and/or behavior within a given image or a video sequence, using as input raw sensor measurements. In this paper, a multimodal approach addressing the task of video-based HAR is proposed. It is based on 3D visual data that are collected using an RGB + depth camera, resulting to both raw video and 3D skeletal sequences. These data are transformed into six different 2D image representations; four of them are in the spectral domain, another is a pseudo-colored image. The aforementioned representations are based on skeletal data. The last representation is a "dynamic" image which is actually an artificially created image that summarizes RGB data of the whole video sequence, in a visually comprehensible way. In order to classify a given activity video, first, all the aforementioned 2D images are extracted and then six trained convolutional neural networks are used so as to extract visual features. The latter are fused so as to form a single feature vector and are fed into a support vector machine for classification into human activities. For evaluation purposes, a challenging motion activity recognition dataset is used, while single-view, cross-view and cross-subject experiments are performed. Moreover, the proposed approach is compared to three other state-of-the-art methods, demonstrating superior performance in most experiments.
Collapse
Affiliation(s)
- Dimitrios Koutrintzes
- Institute of Informatics and Telecommunications, National Center for Scientific Research - "Demokritos", Athens, Greece
| | - Evaggelos Spyrou
- Department of Informatics and Telecommunication, University of Thessaly, Lamia, Greece
| | - Eirini Mathe
- Department of Informatics, Ionian University, Corfu, Greece
| | - Phivos Mylonas
- Department of Informatics, Ionian University, Corfu, Greece
| |
Collapse
|
37
|
Sbrollini A, Barocci M, Mancinelli M, Paris M, Raffaelli S, Marcantoni I, Morettini M, Swenne CA, Burattini L. Automatic diagnosis of newly emerged heart failure from serial electrocardiography by repeated structuring & learning procedure. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
38
|
Wang J, Ge X, Shi Y, Sun M, Gong Q, Wang H, Huang W. Dual-Modal Information Bottleneck Network for Seizure Detection. Int J Neural Syst 2023; 33:2250061. [PMID: 36599663 DOI: 10.1142/s0129065722500617] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
In recent years, deep learning has shown very competitive performance in seizure detection. However, most of the currently used methods either convert electroencephalogram (EEG) signals into spectral images and employ 2D-CNNs, or split the one-dimensional (1D) features of EEG signals into many segments and employ 1D-CNNs. Moreover, these investigations are further constrained by the absence of consideration for temporal links between time series segments or spectrogram images. Therefore, we propose a Dual-Modal Information Bottleneck (Dual-modal IB) network for EEG seizure detection. The network extracts EEG features from both time series and spectrogram dimensions, allowing information from different modalities to pass through the Dual-modal IB, requiring the model to gather and condense the most pertinent information in each modality and only share what is necessary. Specifically, we make full use of the information shared between the two modality representations to obtain key information for seizure detection and to remove irrelevant feature between the two modalities. In addition, to explore the intrinsic temporal dependencies, we further introduce a bidirectional long-short-term memory (BiLSTM) for Dual-modal IB model, which is used to model the temporal relationships between the information after each modality is extracted by convolutional neural network (CNN). For CHB-MIT dataset, the proposed framework can achieve an average segment-based sensitivity of 97.42%, specificity of 99.32%, accuracy of 98.29%, and an average event-based sensitivity of 96.02%, false detection rate (FDR) of 0.70/h. We release our code at https://github.com/LLLL1021/Dual-modal-IB.
Collapse
Affiliation(s)
- Jiale Wang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, P. R. China
| | - Xinting Ge
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, P. R. China
| | - Yunfeng Shi
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, P. R. China
| | - Mengxue Sun
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, P. R. China
| | - Qingtao Gong
- Ulsan Ship and Ocean College, Ludong University, Yantai 264025, P. R. China
| | - Haipeng Wang
- Institute of Information Fusion, Naval, Aviation University, Yantai 264001, P. R. China
| | - Wenhui Huang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, P. R. China
| |
Collapse
|
39
|
Cruz YJ, Rivas M, Quiza R, Haber RE, Castaño F, Villalonga A. A two-step machine learning approach for dynamic model selection: A case study on a micro milling process. COMPUT IND 2022. [DOI: 10.1016/j.compind.2022.103764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
40
|
JSMix: a holistic algorithm for learning with label noise. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07770-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
41
|
Wang K, Wang Y, Zhan B, Yang Y, Zu C, Wu X, Zhou J, Nie D, Zhou L. An Efficient Semi-Supervised Framework with Multi-Task and Curriculum Learning for Medical Image Segmentation. Int J Neural Syst 2022; 32:2250043. [DOI: 10.1142/s0129065722500435] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
42
|
Sanchez-Reolid R, Martinez-Saez MC, Garcia-Martinez B, Fernandez-Aguilar L, Segura LR, Latorre JM, Fernandez-Caballero A. Emotion Classification from EEG with a Low-Cost BCI Versus a High-End Equipment. Int J Neural Syst 2022; 32:2250041. [DOI: 10.1142/s0129065722500411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
43
|
Khan MM, Tazin T, Zunaid Hussain M, Mostakim M, Rehman T, Singh S, Gupta V, Alomeir O. Breast Tumor Detection Using Robust and Efficient Machine Learning and Convolutional Neural Network Approaches. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6333573. [PMID: 35712068 PMCID: PMC9197653 DOI: 10.1155/2022/6333573] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 04/19/2022] [Indexed: 01/02/2023]
Abstract
Breast cancer develops when cells in the breast expand and divide uncontrollably, resulting in a lump of tissue known as a tumor. This lump of tissue is called a tumor. After skin cancer, breast cancer is the second most common cancer among women. It is more common in women over the age of 50. Men may also acquire breast cancer, albeit it is uncommon. Each year, approximately 2,600 men in the United States are diagnosed with breast cancer, accounting for less than 1% of all cases. Transgender women are more likely than cisgender men to acquire breast cancer. Additionally, transgender males are less likely than cisgender women to acquire breast cancer. Breast cancer is more common in women over the age of 50, although it can affect anyone at any age. Early detection of a breast tumor may significantly lower the risk of developing breast cancer. A public dataset of breast tumor features was used instead to build models for identifying breast tumors through machine learning and deep learning. Prediction models were built using logistic regression (LR), decision tree (DT), random forest (RF), voting classifier (VC), support vector machine (SVM), and a proprietary convolutional neural network (CNN). These models were used to find critical prognostic indicators linked to breast cancer. The proposed network performs far better, with an average accuracy of 99%. This study has six types of models: LR, RF, SVM, VC, DT, and a custom CNN model. They all had 96% to 99% accuracy in this study. CNN, LR, RF, SVM, VC, and DT achieved 99%, 96%, 98%, 97%, 97%, and 96% F1 score, respectively. There were many machine learning algorithms used in this study that were very accurate, which means that these techniques could be used as alternative prognostic tools in breast tumor detection studies in Asia.
Collapse
Affiliation(s)
- Mohammad Monirujjaman Khan
- Department of Electrical and Computer Engineering, North South University, Bashundhara R/A, Dhaka 1229, Bangladesh
| | - Tahia Tazin
- Department of Electrical and Computer Engineering, North South University, Bashundhara R/A, Dhaka 1229, Bangladesh
| | - Mohammad Zunaid Hussain
- Department of Electrical and Computer Engineering, North South University, Bashundhara R/A, Dhaka 1229, Bangladesh
| | - Monira Mostakim
- Department of Electrical and Computer Engineering, North South University, Bashundhara R/A, Dhaka 1229, Bangladesh
| | - Taeefur Rehman
- Department of Electrical and Computer Engineering, North South University, Bashundhara R/A, Dhaka 1229, Bangladesh
| | | | - Vaishali Gupta
- Computer Science & Engineering, Galgotias University, Greater Nodia, India
| | - Othman Alomeir
- Department of Pharmacy Practice, College of Pharmacy, Shaqra University, Shaqra, Saudi Arabia
| |
Collapse
|
44
|
Chen BW, Yang SH, Kuo CH, Chen JW, Lo YC, Kuo YT, Lin YC, Chang HC, Lin SH, Yu X, Qu B, Ro SCV, Lai HY, Chen YY. Neuro-Inspired Reinforcement Learning To Improve Trajectory Prediction In Reward-Guided Behavior. Int J Neural Syst 2022; 32:2250038. [DOI: 10.1142/s0129065722500381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
45
|
Wu T, Neri F, Pan L. On the tuning of the computation capability of spiking neural membrane systems with communication on request. Int J Neural Syst 2022; 32:2250037. [DOI: 10.1142/s012906572250037x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
46
|
Personalized Hybrid Education Framework Based on Neuroevolution Methodologies. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6925668. [PMID: 35634078 PMCID: PMC9135544 DOI: 10.1155/2022/6925668] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 04/08/2022] [Accepted: 04/15/2022] [Indexed: 12/29/2022]
Abstract
The future pedagogical systems need anthropocentric inclusive educational programs in which the goal should be adjustable according to the knowledge requirements, intelligence, and learning objective of each student. Prioritizing these needs, innovative AI methods are required to assist and ensure the making of conscious educational decisions, in terms of clear identification and categorization with high accuracy of various forms of skills and knowledge of each student. This paper proposes a neuroevolution emerging technique that combines the searchability of evolutionary computation and the learning capability of a hybrid artificial neural networks method. Specifically, the proposed growing semiorganizing neural gas (GsONG) is a practical AI methodology utilizing advanced clustering techniques to enhance the learning experience by categorizing the true abilities, skills, and needs of learners, in an inclusive differentiated learning framework. It is a neural network architecture that includes competing and cooperating neurons with an unstructured mode whereby a cooperation-competition process delimits the topological neighborhood of neurons in a grid to identify patterns for which their classes are not known. To optimize the above process, a heuristic method was used that investigates the space of an objective function by regulating the optimal topologies of neurons that form pathway segments in a semi-contemplative manner. Based on the extensive experiments and results obtained from the GsONG clustering approach, the proposed algorithm can compensate with high accuracy for difficulties in multicriteria grouping and differentiation of uncertainty structures such as in small or tiny data sets.
Collapse
|
47
|
Dong Q. Leakage Prediction in Machine Learning Models When Using Data from Sports Wearable Sensors. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:5314671. [PMID: 35619770 PMCID: PMC9129943 DOI: 10.1155/2022/5314671] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 04/19/2022] [Accepted: 04/25/2022] [Indexed: 11/17/2022]
Abstract
One of the major problems in machine learning is data leakage, which can be directly related to adversarial type attacks, raising serious concerns about the validity and reliability of artificial intelligence. Data leakage occurs when the independent variables used to teach the machine learning algorithm include either the dependent variable itself or a variable that contains clear information that the model is trying to predict. This data leakage results in unreliable and poor predictive results after the development and use of the model. It prevents the model from generalizing, which is required in a machine learning problem and thus causes false assumptions about its performance. To have a solid and generalized forecasting model, which will be able to produce remarkable forecasting results, we must pay great attention to detecting and preventing data leakage. This study presents an innovative system of leakage prediction in machine learning models, which is based on Bayesian inference to produce a thorough approach to calculating the reverse probability of unseen variables in order to make statistical conclusions about the relevant correlated variables and to calculate accordingly a lower limit on the marginal likelihood of the observed variables being derived from some coupling method. The main notion is that a higher marginal probability for a set of variables suggests a better fit of the data and thus a greater likelihood of a data leak in the model. The methodology is evaluated in a specialized dataset derived from sports wearable sensors.
Collapse
Affiliation(s)
- Qizheng Dong
- Zhengzhou University of Science and Technology, Zhengzhou, Henan 450000, China
| |
Collapse
|
48
|
Mitigating Bias and Error in Machine Learning to Protect Sports Data. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:4777010. [PMID: 35602627 PMCID: PMC9117075 DOI: 10.1155/2022/4777010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 04/09/2022] [Indexed: 11/17/2022]
Abstract
One of the essential processes in modern sports is doping control. In recent years, specialized methods of artificial intelligence and large-scale data analysis have been used to make faster and simpler detection of violations of international regulations on the use of banned substances. The smart systems in question depend directly on the quality of the data used, as high-quality data will produce algorithmic approaches of correspondingly high quality and accuracy. It is evident that there are many sources of errors in data collections and intentional algorithmic interventions that may result from cyber-attacks, so end-users of artificial intelligence technologies should be able to know the exact origins of data and analytical methods of these data at an algorithmic level. Given that artificial intelligence systems based on incomplete or discriminatory data can lead to inaccurate results that violate the fundamental rights of athletes, this paper presents an advanced model for mitigating bias and error in machine learning to protect sports data, using convolutional neural network (ConvNet) with high-precise class activation maps (HiPrCAM). It is an innovative neural network interpretability technique, wherewith the addition of Bellman reinforcement learning (BRL) and Broyden–Fletcher–Goldfarb–Shanno (BFGS) optimization; it can produce high-precision maps that deliver high definition, clarity, and the input and output capture when the algorithm makes a prediction. The evaluation of the proposed system uses the Shapley value solution from the cooperative game theory to provide algorithmic performance propositions for each of the produced results, assigning partial responsibility to parts of the architecture based on the impact that the efforts have on the relative success measurement, which it has been preset.
Collapse
|
49
|
Zhao S, Chen B, Wang H, Luo Z, Zhang T. A Feed-Forward Neural Network for Increasing the Hopfield-Network Storage Capacity. Int J Neural Syst 2022; 32:2250027. [PMID: 35534937 DOI: 10.1142/s0129065722500277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In the hippocampal dentate gyrus (DG), pattern separation mainly depends on the concepts of 'expansion recoding', meaning random mixing of different DG input channels. However, recent advances in neurophysiology have challenged the theory of pattern separation based on these concepts. In this study, we propose a novel feed-forward neural network, inspired by the structure of the DG and neural oscillatory analysis, to increase the Hopfield-network storage capacity. Unlike the previously published feed-forward neural networks, our bio-inspired neural network is designed to take advantage of both biological structure and functions of the DG. To better understand the computational principles of pattern separation in the DG, we have established a mouse model of environmental enrichment. We obtained a possible computational model of the DG, associated with better pattern separation ability, by using neural oscillatory analysis. Furthermore, we have developed a new algorithm based on Hebbian learning and coupling direction of neural oscillation to train the proposed neural network. The simulation results show that our proposed network significantly expands the storage capacity of Hopfield network, and more effective pattern separation is achieved. The storage capacity rises from 0.13 for the standard Hopfield network to 0.32 using our model when the overlap in patterns is 10%.
Collapse
Affiliation(s)
- Shaokai Zhao
- College of Life Sciences, Nankai University, 300071 Tianjin, P. R. China
| | - Bin Chen
- College of Life Sciences, Nankai University, 300071 Tianjin, P. R. China
| | - Hui Wang
- College of Life Sciences, Nankai University, 300071 Tianjin, P. R. China
| | - Zhiyuan Luo
- Department of Computer Science, Royal Holloway, University of London, Egham, Surrey TW20 0EX, UK
| | - Tao Zhang
- College of Life Sciences, Nankai University, 300071 Tianjin, P. R. China
| |
Collapse
|
50
|
De Nardin A, Mishra P, Foresti GL, Piciarelli C. Masked Transformer for image Anomaly Localization. Int J Neural Syst 2022; 32:2250030. [DOI: 10.1142/s0129065722500307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|