1
|
Jiang P, Neri F, Xue Y, Maulik U. A Generalized Attention Mechanism to Enhance the Accuracy Performance of Neural Networks. Int J Neural Syst 2024; 34:2450063. [PMID: 39212940 DOI: 10.1142/s0129065724500631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
In many modern machine learning (ML) models, attention mechanisms (AMs) play a crucial role in processing data and identifying significant parts of the inputs, whether these are text or images. This selective focus enables subsequent stages of the model to achieve improved classification performance. Traditionally, AMs are applied as a preprocessing substructure before a neural network, such as in encoder/decoder architectures. In this paper, we extend the application of AMs to intermediate stages of data propagation within ML models. Specifically, we propose a generalized attention mechanism (GAM), which can be integrated before each layer of a neural network for classification tasks. The proposed GAM allows for at each layer/step of the ML architecture identification of the most relevant sections of the intermediate results. Our experimental results demonstrate that incorporating the proposed GAM into various ML models consistently enhances the accuracy of these models. This improvement is achieved with only a marginal increase in the number of parameters, which does not significantly affect the training time.
Collapse
Affiliation(s)
- Pengcheng Jiang
- School of Software, Nanjing University of Information Science and Technology, Nanjing 210044, P. R. China
| | - Ferrante Neri
- NICE Research Group, School of Computer Science and Electronic Engineering, University of Surrey, Guildford GU2 7XS, UK
| | - Yu Xue
- School of Software, Nanjing University of Information Science and Technology, Nanjing 210044, P. R. China
| | - Ujjwal Maulik
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| |
Collapse
|
2
|
Al Fahoum A, Zyout A. Wavelet Transform, Reconstructed Phase Space, and Deep Learning Neural Networks for EEG-Based Schizophrenia Detection. Int J Neural Syst 2024; 34:2450046. [PMID: 39010724 DOI: 10.1142/s0129065724500461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
This study proposes an innovative expert system that uses exclusively EEG signals to diagnose schizophrenia in its early stages. For diagnosing psychiatric/neurological disorders, electroencephalogram (EEG) testing is considered a financially viable, safe, and reliable alternative. Using the reconstructed phase space (RPS) and the continuous wavelet transform, the researchers maximized the differences between the EEG nonstationary signals of normal and schizophrenia individuals, which cannot be observed in the time, frequency, or time-frequency domains. This reveals significant information, highlighting more distinguishable features. Then, a deep learning network was trained to enhance the accuracy of the resulting image classification. The algorithm's efficacy was confirmed through three distinct methods: employing 70% of the dataset for training, 15% for validation, and the remaining 15% for testing. This was followed by a 5-fold cross-validation technique and a leave-one-out classification approach. Each method was iterated 100 times to ascertain the algorithm's robustness. The performance metrics derived from these tests - accuracy, precision, sensitivity, F1 score, Matthews correlation coefficient, and Kappa - indicated remarkable outcomes. The algorithm demonstrated steady performance across all evaluation strategies, underscoring its relevance and reliability. The outcomes validate the system's accuracy, precision, sensitivity, and robustness by showcasing its capability to autonomously differentiate individuals diagnosed with schizophrenia from those in a state of normal health.
Collapse
Affiliation(s)
- Amjed Al Fahoum
- Biomedical Systems and Informatics Engineering Department, Yarmouk University, Irbid 21163, Jordan
| | - Ala'a Zyout
- Biomedical Systems and Informatics Engineering Department, Yarmouk University, Irbid 21163, Jordan
| |
Collapse
|
3
|
Safari M, Shalbaf R, Bagherzadeh S, Shalbaf A. Classification of mental workload with EEG analysis by using effective connectivity and a hybrid model of CNN and LSTM. Comput Methods Biomech Biomed Engin 2024:1-15. [PMID: 39086252 DOI: 10.1080/10255842.2024.2386325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 06/15/2024] [Accepted: 07/25/2024] [Indexed: 08/02/2024]
Abstract
Estimation of mental workload from electroencephalogram (EEG) signals aims to accurately measure the cognitive demands placed on an individual during multitasking mental activities. By analyzing the brain activity of the subject, we can determine the level of mental effort required to perform a task and optimize the workload to prevent cognitive overload or underload. This information can be used to enhance performance and productivity in various fields such as healthcare, education, and aviation. In this paper, we propose a method that uses EEG and deep neural networks to estimate the mental workload of human subjects during multitasking mental activities. Notably, our proposed method employs subject-independent classification. We use the "STEW" dataset, which consists of two tasks, namely "No task" and "simultaneous capacity (SIMKAP)-based multitasking activity". We estimate the different workload levels of two tasks using a composite framework consisting of brain connectivity and deep neural networks. After the initial preprocessing of EEG signals, an analysis of the relationships between the 14 EEG channels is conducted to evaluate effective brain connectivity. This assessment illustrates the information flow between various brain regions, utilizing the direct Directed Transfer Function (dDTF) method. Then, we propose a deep hybrid model based on pre-trained Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) for the classification of workload levels. The accuracy of the proposed deep model achieved 83.12% according to the subject-independent leave-subject-out (LSO) approach. The pre-trained CNN + LSTM approaches to EEG data have been found to be an accurate method for assessing the mental workload.
Collapse
Affiliation(s)
| | - Reza Shalbaf
- Institute for Cognitive Science Studies, Tehran, Iran
| | - Sara Bagherzadeh
- Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Ahmad Shalbaf
- Department of Biomedical Engineering and Medical Physics, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
4
|
Jin MX, Qin PP, Xia AWL, Kan RLD, Zhang BBB, Tang AHP, Li ASM, Lin TTZ, Giron CG, Pei JJ, Kranz GS. Neurophysiological and neuroimaging markers of repetitive transcranial magnetic stimulation treatment response in major depressive disorder: A systematic review and meta-analysis of predictive modeling studies. Neurosci Biobehav Rev 2024; 162:105695. [PMID: 38710424 DOI: 10.1016/j.neubiorev.2024.105695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 04/10/2024] [Accepted: 04/26/2024] [Indexed: 05/08/2024]
Abstract
Predicting repetitive transcranial magnetic stimulation (rTMS) treatment outcomes in major depressive disorder (MDD) could reduce the financial and psychological risks of treatment failure. We systematically reviewed and meta-analyzed studies that leveraged neurophysiological and neuroimaging markers to predict rTMS response in MDD. Five databases were searched from inception to May 25, 2023. The primary meta-analytic outcome was predictive accuracy pooled from classification models. Regression models were summarized qualitatively. A promising marker was identified if it showed a sensitivity and specificity of 80% or higher in at least two independent studies. Searching yielded 36 studies. Twenty-two classification modeling studies produced an estimated area under the summary receiver operating characteristic curve of 0.87 (95% CI = 0.83-0.92), with 86.8% sensitivity (95% CI = 80.6-91.2%) and 81.9% specificity (95% CI = 76.1-86.4%). Frontal theta cordance measured by electroencephalography is closest to proof of concept. Predicting rTMS response using neurophysiological and neuroimaging markers is promising for clinical decision-making. However, replications by different research groups are needed to establish rigorous markers.
Collapse
Affiliation(s)
- Min Xia Jin
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China; Shanghai YangZhi Rehabilitation Hospital (Shanghai Sunshine Rehabilitation Center), School of Medicine, Tongji University, Shanghai, China
| | - Penny Ping Qin
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Adam Wei Li Xia
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Rebecca Lai Di Kan
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Bella Bing Bing Zhang
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Alvin Hong Pui Tang
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Ami Sin Man Li
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Tim Tian Ze Lin
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Cristian G Giron
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China
| | - Jun Jie Pei
- Department of Rehabilitation Medicine, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Zhejiang, China
| | - Georg S Kranz
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China; Mental Health Research Center, The Hong Kong Polytechnic University, Hong Kong, Special Administrative Region of China; Department of Psychiatry and Psychotherapy, Comprehensive Center for Clinical Neurosciences and Mental Health, Medical University of Vienna, Austria.
| |
Collapse
|
5
|
Zhong C, Xiong Y, Tang W, Guo J. A Stage-Wise Residual Attention Generation Adversarial Network for Mandibular Defect Repairing and Reconstruction. Int J Neural Syst 2024; 34:2450033. [PMID: 38623651 DOI: 10.1142/s0129065724500333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]
Abstract
Surgical reconstruction of mandibular defects is a clinical routine manner for the rehabilitation of patients with deformities. The mandible plays a crucial role in maintaining the facial contour and ensuring the speech and mastication functions. The repairing and reconstruction of mandible defects is a significant yet challenging task in oral-maxillofacial surgery. Currently, the mainly available methods are traditional digitalized design methods that suffer from substantial artificial operations, limited applicability and high reconstruction error rates. An automated, precise, and individualized method is imperative for maxillofacial surgeons. In this paper, we propose a Stage-wise Residual Attention Generative Adversarial Network (SRA-GAN) for mandibular defect reconstruction. Specifically, we design a stage-wise residual attention mechanism for generator to enhance the extraction capability of mandibular remote spatial information, making it adaptable to various defects. For the discriminator, we propose a multi-field perceptual network, consisting of two parallel discriminators with different perceptual fields, to reduce the cumulative reconstruction errors. Furthermore, we design a self-encoder perceptual loss function to ensure the correctness of mandibular anatomical structures. The experimental results on a novel custom-built mandibular defect dataset demonstrate that our method has a promising prospect in clinical application, achieving the best Dice Similarity Coefficient (DSC) of 94.238% and 95% Hausdorff Distance (HD95) of 4.787.
Collapse
Affiliation(s)
- Chenglan Zhong
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| | - Yutao Xiong
- State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases & Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, P. R. China
| | - Wei Tang
- State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases & Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, P. R. China
| | - Jixiang Guo
- Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610065, P. R. China
| |
Collapse
|
6
|
Colonnese F, Di Luzio F, Rosato A, Panella M. Bimodal Feature Analysis with Deep Learning for Autism Spectrum Disorder Detection. Int J Neural Syst 2024; 34:2450005. [PMID: 38063381 DOI: 10.1142/s0129065724500059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
Autism Spectrum Disorder (ASD) is a complex and heterogeneous neurodevelopmental disorder which affects a significant proportion of the population, with estimates suggesting that about 1 in 100 children worldwide are affected by ASD. This study introduces a new Deep Neural Network for identifying ASD in children through gait analysis, using features extracted from frames composing video recordings of their walking patterns. The innovative method presented herein is based on imagery and combines gait analysis and deep learning, offering a noninvasive and objective assessment of neurodevelopmental disorders while delivering high accuracy in ASD detection. Our model proposes a bimodal approach based on the concatenation of two distinct Convolutional Neural Networks processing two feature sets extracted from the same videos. The features obtained from the convolutions of both networks are subsequently flattened and merged into a single vector, serving as input for the fully connected layers in the binary classification process. This approach demonstrates the potential for effective ASD detection in children through the combination of gait analysis and deep learning techniques.
Collapse
Affiliation(s)
- Federica Colonnese
- Department of Information Engineering, Electronics and Telecommunications, University of Rome "La Sapienza" Via Eudossiana 18, 00184 Rome, Italy
| | - Francesco Di Luzio
- Department of Information Engineering, Electronics and Telecommunications, University of Rome "La Sapienza" Via Eudossiana 18, 00184 Rome, Italy
| | - Antonello Rosato
- Department of Information Engineering, Electronics and Telecommunications, University of Rome "La Sapienza" Via Eudossiana 18, 00184 Rome, Italy
| | - Massimo Panella
- Department of Information Engineering, Electronics and Telecommunications, University of Rome "La Sapienza" Via Eudossiana 18, 00184 Rome, Italy
| |
Collapse
|
7
|
Greco C, Raimo G, Amorese T, Cuciniello M, Mcconvey G, Cordasco G, Faundez-Zanuy M, Vinciarelli A, Callejas-Carrion Z, Esposito A. Discriminative Power of Handwriting and Drawing Features in Depression. Int J Neural Syst 2024; 34:2350069. [PMID: 38009869 DOI: 10.1142/s0129065723500697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
This study contributes knowledge on the detection of depression through handwriting/drawing features, to identify quantitative and noninvasive indicators of the disorder for implementing algorithms for its automatic detection. For this purpose, an original online approach was adopted to provide a dynamic evaluation of handwriting/drawing performance of healthy participants with no history of any psychiatric disorders ([Formula: see text]), and patients with a clinical diagnosis of depression ([Formula: see text]). Both groups were asked to complete seven tasks requiring either the writing or drawing on a paper while five handwriting/drawing features' categories (i.e. pressure on the paper, time, ductus, space among characters, and pen inclination) were recorded by using a digitalized tablet. The collected records were statistically analyzed. Results showed that, except for pressure, all the considered features, successfully discriminate between depressed and nondepressed subjects. In addition, it was observed that depression affects different writing/drawing functionalities. These findings suggest the adoption of writing/drawing tasks in the clinical practice as tools to support the current depression detection methods. This would have important repercussions on reducing the diagnostic times and treatment formulation.
Collapse
Affiliation(s)
- Claudia Greco
- Department of Psychology, Università della Campania "Luigi Vanvitelli", Viale Ellittico 31 Caserta, 81000, Italy
| | - Gennaro Raimo
- Department of Psychology, Università della Campania "Luigi Vanvitelli", Viale Ellittico 31 Caserta, 81000, Italy
| | - Terry Amorese
- Department of Psychology, Università della Campania "Luigi Vanvitelli", Viale Ellittico 31 Caserta, 81000, Italy
| | - Marialucia Cuciniello
- Department of Psychology, Università della Campania "Luigi Vanvitelli", Viale Ellittico 31 Caserta, 81000, Italy
| | - Gavin Mcconvey
- Action Mental Health, 27 Jubilee Rd, BT23 4YH, Newtownards, UK
| | - Gennaro Cordasco
- Department of Psychology, Università della Campania "Luigi Vanvitelli", Viale Ellittico 31 Caserta, 81000, Italy
| | - Marcos Faundez-Zanuy
- Tecnocampus Universitat Pompeu Fabra, Carrer d'Ernest Lluch 32 Mataro, Barcelona 08302, Spain
| | - Alessandro Vinciarelli
- University of Glasgow, School of Computing Science, 18 Lilybank Gardens Glasgow, G12,8RZ, Scotland
| | - Zoraida Callejas-Carrion
- Department of Languages and Computer Systems, Universidad de Granada, Periodista Daniel Saucedo Aranda Granada, 18071, Spain
| | - Anna Esposito
- Department of Psychology, Università della Campania "Luigi Vanvitelli", Viale Ellittico 31 Caserta, 81000, Italy
| |
Collapse
|
8
|
Cotronei M, Giuffrè S, Marcianò A, Rosaci D, Sarnè GML. Improving the Effectiveness of Eigentrust in Computing the Reputation of Social Agents in Presence of Collusion. Int J Neural Syst 2024; 34:2350063. [PMID: 37806781 DOI: 10.1142/s0129065723500636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]
Abstract
The introduction of trust-based approaches in social scenarios modeled as multi-agent systems (MAS) has been recognized as a valid solution to improve the effectiveness of these communities. In fact, they make interactions taking place in social scenarios much fruitful as possible, limiting or even avoiding malicious or fraudulent behaviors, including collusion. This is also the case of multi-layered neural networks (NN), which can face limited, incomplete, misleading, controversial or noisy datasets, produced by untrustworthy agents. Many strategies to deal with malicious agents in social networks have been proposed in the literature. One of the most effective is represented by Eigentrust, often adopted as a benchmark. It can be seen as a variation of PageRank, an algorithm for determining result rankings used by search engines like Google. Moreover, Eigentrust can also be viewed as a linear neural network whose architecture is represented by the graph of Web pages. A major drawback of Eigentrust is that it uses some additional information about agents that can be a priori considered particularly trustworthy, rewarding them in terms of reputation, while the non pre-trusted agents are penalized. In this paper, we propose a different strategy to detect malicious agents which does not modify the real reputation values of the honest ones. We introduce a measure of effectiveness when computing reputation in presence of malicious agents. Moreover, we define a metric of error useful to quantitatively determine how much an algorithm for the identification of malicious agents modifies the reputation scores of the honest ones. We have performed an experimental campaign of mathematical simulations on a dynamic multi-agent environment. The obtained results show that our method is more effective than Eigentrust in determining reputation values, presenting an error which is about a thousand times lower than the error produced by Eigentrust on medium-sized social networks.
Collapse
Affiliation(s)
- Mariantonia Cotronei
- Department of Information Engineering, Infrastructure and Sustainable Energy (DIIES), University Mediterranea of Reggio Calabria, Feo di Vito, Reggio Calabria 89122, Italy
| | - Sofia Giuffrè
- Department of Information Engineering, Infrastructure and Sustainable Energy (DIIES), University Mediterranea of Reggio Calabria, Feo di Vito, Reggio Calabria 89122, Italy
| | - Attilio Marcianò
- Department of Information Engineering, Infrastructure and Sustainable Energy (DIIES), University Mediterranea of Reggio Calabria, Feo di Vito, Reggio Calabria 89122, Italy
| | - Domenico Rosaci
- Department of Information Engineering, Infrastructure and Sustainable Energy (DIIES), University Mediterranea of Reggio Calabria, Feo di Vito, Reggio Calabria 89122, Italy
| | - Giuseppe M L Sarnè
- Department of Psychology, University of Milan Bicocca, Piazza dell'Ateneo Nuovo, 1 Milano 20126, Italy
| |
Collapse
|