1
|
Fu B, Yu X, Jiang G, Sun N, Liu Y. Enhancing local representation learning through global-local integration with functional connectivity for EEG-based emotion recognition. Comput Biol Med 2024; 179:108857. [PMID: 39018882 DOI: 10.1016/j.compbiomed.2024.108857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 06/21/2024] [Accepted: 07/06/2024] [Indexed: 07/19/2024]
Abstract
Emotion recognition based on electroencephalogram (EEG) signals is crucial in understanding human affective states. Current research has limitations in extracting local features. The representation capabilities of local features are limited, making it difficult to comprehensively capture emotional information. In this study, a novel approach is proposed to enhance local representation learning through global-local integration with functional connectivity for EEG-based emotion recognition. By leveraging the functional connectivity of brain regions, EEG signals are divided into global embeddings that represent comprehensive brain connectivity patterns throughout the entire process and local embeddings that reflect dynamic interactions within specific brain functional networks at particular moments. Firstly, a convolutional feature extraction branch based on the residual network is designed to extract local features from the global embedding. To further improve the representation ability and accuracy of local features, a multidimensional collaborative attention (MCA) module is introduced. Secondly, the local features and patch embedded local embeddings are integrated into the feature coupling module (FCM), which utilizes hierarchical connections and enhanced cross-attention to couple region-level features, thereby enhancing local representation learning. Experimental results on three public datasets show that compared with other methods, this method improves accuracy by 4.92% on the DEAP, by 1.11% on the SEED, and by 7.76% on the SEED-IV, demonstrating its superior performance in emotion recognition tasks.
Collapse
Affiliation(s)
- Baole Fu
- School of Automation, Qingdao University, Qingdao 266071, China; Institute for Future, Qingdao University, Qingdao 266071, China.
| | - Xiangkun Yu
- College of Computer Science & Technology, Qingdao University, Qingdao 266071, China; Institute for Future, Qingdao University, Qingdao 266071, China.
| | - Guijie Jiang
- College of Computer Science & Technology, Qingdao University, Qingdao 266071, China.
| | - Ninghao Sun
- College of Computer Science & Technology, Qingdao University, Qingdao 266071, China.
| | - Yinhua Liu
- School of Automation, Qingdao University, Qingdao 266071, China; Shandong Key Laboratory of Industrial Control Technology, Qingdao 266071, China; Institute for Future, Qingdao University, Qingdao 266071, China.
| |
Collapse
|
2
|
Shao Y, Zhou Y, Gong P, Sun Q, Zhang D. A Dual-Adversarial Model for Cross-Time and Cross-Subject Cognitive Workload Decoding. IEEE Trans Neural Syst Rehabil Eng 2024; 32:2324-2335. [PMID: 38885097 DOI: 10.1109/tnsre.2024.3415364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024]
Abstract
Electroencephalogram (EEG) signals are widely utilized in the field of cognitive workload decoding (CWD). However, when the recognition scenario is shifted from subject-dependent to subject-independent or spans a long period, the accuracy of CWD deteriorates significantly. Current solutions are either dependent on extensive training datasets or fail to maintain clear distinctions between categories, additionally lacking a robust feature extraction mechanism. In this paper, we tackle these issues by proposing a Bi-Classifier Joint Domain Adaptation (BCJDA) model for EEG-based cross-time and cross-subject CWD. Specifically, the model consists of a feature extractor, a domain discriminator, and a Bi-Classifier, containing two sets of adversarial processes for domain-wise alignment and class-wise alignment. In the adversarial domain adaptation, the feature extractor is forced to learn the common domain features deliberately. The Bi-Classifier also fosters the feature extractor to retain the category discrepancies of the unlabeled domain, so that its classification boundary is consistent with the labeled domain. Furthermore, different adversarial distance functions of the Bi-Classifier are adopted and evaluated in this model. We conduct classification experiments on a publicly available BCI competition dataset for recognizing low, medium, and high cognitive workload levels. The experimental results demonstrate that our proposed BCJDA model based on cross-gradient difference maximization achieves the best performance.
Collapse
|
3
|
Zhang F, Wu H, Guo Y. Semi-supervised multi-source transfer learning for cross-subject EEG motor imagery classification. Med Biol Eng Comput 2024; 62:1655-1672. [PMID: 38324109 DOI: 10.1007/s11517-024-03032-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Accepted: 12/27/2023] [Indexed: 02/08/2024]
Abstract
Electroencephalogram (EEG) motor imagery (MI) classification refers to the use of EEG signals to identify and classify subjects' motor imagery activities; this task has received increasing attention with the development of brain-computer interfaces (BCIs). However, the collection of EEG data is usually time-consuming and labor-intensive, which makes it difficult to obtain sufficient labeled data from the new subject to train a new model. Moreover, the EEG signals of different individuals exhibit significant differences, leading to a significant drop in the performance of a model trained on the existing subjects when directly classifying EEG signals acquired from new subjects. Therefore, it is crucial to make full use of the EEG data of the existing subjects and the unlabeled EEG data of the new target subject to improve the MI classification performance achieved for the target subject. This research study proposes a semi-supervised multi-source transfer (SSMT) learning model to address the above problems; the model learns informative and domain-invariant representations to address cross-subject MI-EEG classification tasks. In particular, a dynamic transferred weighting schema is presented to obtain the final predictions by integrating the weighted features derived from multi-source domains. The average accuracies achieved on two publicly available EEG datasets reach 83.57 % and 85.09 % , respectively, validating the effectiveness of the SSMT process. The SSMT process reveals the importance of informative and domain-invariant representations in MI classification tasks, as they make full use of the domain-invariant information acquired from each subject.
Collapse
Affiliation(s)
| | - Hanliang Wu
- Liwan District People's Hospital of Guangzhou, Guangzhou, China.
| | - Yuxin Guo
- Guangzhou Institute of Science and Technology, Guangzhou, China
| |
Collapse
|
4
|
Ma W, Zheng Y, Li T, Li Z, Li Y, Wang L. A comprehensive review of deep learning in EEG-based emotion recognition: classifications, trends, and practical implications. PeerJ Comput Sci 2024; 10:e2065. [PMID: 38855206 PMCID: PMC11157589 DOI: 10.7717/peerj-cs.2065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 04/25/2024] [Indexed: 06/11/2024]
Abstract
Emotion recognition utilizing EEG signals has emerged as a pivotal component of human-computer interaction. In recent years, with the relentless advancement of deep learning techniques, using deep learning for analyzing EEG signals has assumed a prominent role in emotion recognition. Applying deep learning in the context of EEG-based emotion recognition carries profound practical implications. Although many model approaches and some review articles have scrutinized this domain, they have yet to undergo a comprehensive and precise classification and summarization process. The existing classifications are somewhat coarse, with insufficient attention given to the potential applications within this domain. Therefore, this article systematically classifies recent developments in EEG-based emotion recognition, providing researchers with a lucid understanding of this field's various trajectories and methodologies. Additionally, it elucidates why distinct directions necessitate distinct modeling approaches. In conclusion, this article synthesizes and dissects the practical significance of EEG signals in emotion recognition, emphasizing its promising avenues for future application.
Collapse
Affiliation(s)
- Weizhi Ma
- School of Information Science and Technology, North China University of Technology, Beijing, China
| | - Yujia Zheng
- School of Information Science and Technology, North China University of Technology, Beijing, China
| | - Tianhao Li
- School of Information Science and Technology, North China University of Technology, Beijing, China
| | - Zhengping Li
- School of Information Science and Technology, North China University of Technology, Beijing, China
| | - Ying Li
- School of Information Science and Technology, North China University of Technology, Beijing, China
| | - Lijun Wang
- School of Information Science and Technology, North China University of Technology, Beijing, China
| |
Collapse
|
5
|
Pang M, Wang H, Huang J, Vong CM, Zeng Z, Chen C. Multi-Scale Masked Autoencoders for Cross-Session Emotion Recognition. IEEE Trans Neural Syst Rehabil Eng 2024; 32:1637-1646. [PMID: 38619940 DOI: 10.1109/tnsre.2024.3389037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]
Abstract
Affective brain-computer interfaces (aBCIs) have garnered widespread applications, with remarkable advancements in utilizing electroencephalogram (EEG) technology for emotion recognition. However, the time-consuming process of annotating EEG data, inherent individual differences, non-stationary characteristics of EEG data, and noise artifacts in EEG data collection pose formidable challenges in developing subject-specific cross-session emotion recognition models. To simultaneously address these challenges, we propose a unified pre-training framework based on multi-scale masked autoencoders (MSMAE), which utilizes large-scale unlabeled EEG signals from multiple subjects and sessions to extract noise-robust, subject-invariant, and temporal-invariant features. We subsequently fine-tune the obtained generalized features with only a small amount of labeled data from a specific subject for personalization and enable cross-session emotion recognition. Our framework emphasizes: 1) Multi-scale representation to capture diverse aspects of EEG signals, obtaining comprehensive information; 2) An improved masking mechanism for robust channel-level representation learning, addressing missing channel issues while preserving inter-channel relationships; and 3) Invariance learning for regional correlations in spatial-level representation, minimizing inter-subject and inter-session variances. Under these elaborate designs, the proposed MSMAE exhibits a remarkable ability to decode emotional states from a different session of EEG data during the testing phase. Extensive experiments conducted on the two publicly available datasets, i.e., SEED and SEED-IV, demonstrate that the proposed MSMAE consistently achieves stable results and outperforms competitive baseline methods in cross-session emotion recognition.
Collapse
|
6
|
Ju X, Li M, Tian W, Hu D. EEG-based emotion recognition using a temporal-difference minimizing neural network. Cogn Neurodyn 2024; 18:405-416. [PMID: 38699602 PMCID: PMC11061074 DOI: 10.1007/s11571-023-10004-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 07/25/2023] [Accepted: 08/21/2023] [Indexed: 05/05/2024] Open
Abstract
Electroencephalogram (EEG) emotion recognition plays an important role in human-computer interaction. An increasing number of algorithms for emotion recognition have been proposed recently. However, it is still challenging to make efficient use of emotional activity knowledge. In this paper, based on prior knowledge that emotion varies slowly across time, we propose a temporal-difference minimizing neural network (TDMNN) for EEG emotion recognition. We use maximum mean discrepancy (MMD) technology to evaluate the difference in EEG features across time and minimize the difference by a multibranch convolutional recurrent network. State-of-the-art performances are achieved using the proposed method on the SEED, SEED-IV, DEAP and DREAMER datasets, demonstrating the effectiveness of including prior knowledge in EEG emotion recognition.
Collapse
Affiliation(s)
- Xiangyu Ju
- College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
| | - Ming Li
- College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
| | - Wenli Tian
- College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
| | - Dewen Hu
- College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
| |
Collapse
|
7
|
Zhang R, Guo H, Xu Z, Hu Y, Chen M, Zhang L. MGFKD: A semi-supervised multi-source domain adaptation algorithm for cross-subject EEG emotion recognition. Brain Res Bull 2024; 208:110901. [PMID: 38355058 DOI: 10.1016/j.brainresbull.2024.110901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/31/2023] [Accepted: 02/11/2024] [Indexed: 02/16/2024]
Abstract
Currently, most models rarely consider the negative transfer problem in the research field of cross-subject EEG emotion recognition. To solve this problem, this paper proposes a semi-supervised domain adaptive algorithm based on few labeled samples of target subject, which called multi-domain geodesic flow kernel dynamic distribution alignment (MGFKD). It consists of three modules: 1) GFK common feature extractor: projects the feature distribution of source and target subjects to the Grassmann manifold space, and obtains the latent common features of the two feature distributions through GFK method. 2) Source domain selector: obtains pseudo-labels of the target subject through weak classifier, finds "golden source subjects" by using few known labels of target subjects. 3) Label corrector: uses a dynamic distribution balance strategy to correct the pseudo-labels of the target subject. We conducted comparison experiments on the SEED and SEED-IV datasets, and the results show that MGFKD outperforms unsupervised and semi-supervised domain adaptation algorithms, achieving an average accuracy of 87.51±7.68% and 68.79±8.25% on the SEED and SEED-IV datasets with only one labeled sample per video for target subject. Especially when the number of source domains is set as 6 and the number of known labels is set as 5, the accuracy increase to 90.20±7.57% and 69.99±7.38%, respectively. The above results prove that our proposed algorithm can efficiently and quickly improve the cross-subject EEG emotion classification performance. Since it only need a small number of labeled samples of new subjects, making it has strong application value in future EEG-based emotion recognition applications.
Collapse
Affiliation(s)
- Rui Zhang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Huifeng Guo
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Zongxin Xu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Yuxia Hu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Mingming Chen
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Lipeng Zhang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China.
| |
Collapse
|
8
|
Wang X, Li B, Lin Y, Gao X. Multi-source domain adaptation based tempo-spatial convolution network for cross-subject EEG classification in RSVP task. J Neural Eng 2024; 21:016025. [PMID: 38324909 DOI: 10.1088/1741-2552/ad2710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/07/2024] [Indexed: 02/09/2024]
Abstract
Objective.Many subject-dependent methods were proposed for electroencephalogram (EEG) classification in rapid serial visual presentation (RSVP) task, which required a large amount of data from new subject and were time-consuming to calibrate system. Cross-subject classification can realize calibration reduction or zero calibration. However, cross-subject classification in RSVP task is still a challenge.Approach.This study proposed a multi-source domain adaptation based tempo-spatial convolution (MDA-TSC) network for cross-subject RSVP classification. The proposed network consisted of three modules. First, the common feature extraction with multi-scale tempo-spatial convolution was constructed to extract domain-invariant features across all subjects, which could improve generalization of the network. Second, the multi-branch domain-specific feature extraction and alignment was conducted to extract and align domain-specific feature distributions of source and target domains in pairs, which could consider feature distribution differences among source domains. Third, the domain-specific classifier was exploited to optimize the network through loss functions and obtain prediction for the target domain.Main results.The proposed network was evaluated on the benchmark RSVP dataset, and the cross-subject classification results showed that the proposed MDA-TSC network outperformed the reference methods. Moreover, the effectiveness of the MDA-TSC network was verified through both ablation studies and visualization.Significance.The proposed network could effectively improve cross-subject classification performance in RSVP task, and was helpful to reduce system calibration time.
Collapse
Affiliation(s)
- Xuepu Wang
- School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Bowen Li
- School of Medicine, Tsinghua University, Beijing 100084, People's Republic of China
| | - Yanfei Lin
- School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Xiaorong Gao
- School of Medicine, Tsinghua University, Beijing 100084, People's Republic of China
| |
Collapse
|
9
|
Gong M, Zhong W, Ye L, Zhang Q. MISNet: multi-source information-shared EEG emotion recognition network with two-stream structure. Front Neurosci 2024; 18:1293962. [PMID: 38419660 PMCID: PMC10899343 DOI: 10.3389/fnins.2024.1293962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 01/26/2024] [Indexed: 03/02/2024] Open
Abstract
Introduction When constructing machine learning and deep neural networks, the domain shift problem on different subjects complicates the subject independent electroencephalography (EEG) emotion recognition. Most of the existing domain adaptation methods either treat all source domains as equivalent or train source-specific learners directly, misleading the network to acquire unreasonable transfer knowledge and thus resulting in negative transfer. Methods This paper incorporates the individual difference and group commonality of distinct domains and proposes a multi-source information-shared network (MISNet) to enhance the performance of subject independent EEG emotion recognition models. The network stability is enhanced by employing a two-stream training structure with loop iteration strategy to alleviate outlier sources confusing the model. Additionally, we design two auxiliary loss functions for aligning the marginal distributions of domain-specific and domain shared features, and then optimize the convergence process by constraining gradient penalty on these auxiliary loss functions. Furthermore, the pre-training strategy is also proposed to ensure that the initial mapping of shared encoder contains sufficient emotional information. Results We evaluate the proposed MISNet to ascertain the impact of several hyper-parameters on the domain adaptation capability of network. The ablation experiments are conducted on two publically accessible datasets SEED and SEED-IV to assess the effectiveness of each loss function. Discussion The experimental results demonstrate that by disentangling private and shared emotional characteristics from differential entropy features of EEG signals, the proposed MISNet can gain robust subject independent performance and strong domain adaptability.
Collapse
Affiliation(s)
- Ming Gong
- Key Laboratory of Media Audio and Video (Communication University of China), Ministry of Education, Beijing, China
| | - Wei Zhong
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| | - Long Ye
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| | - Qin Zhang
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| |
Collapse
|
10
|
Zhu L, Yu F, Huang A, Ying N, Zhang J. Instance-representation transfer method based on joint distribution and deep adaptation for EEG emotion recognition. Med Biol Eng Comput 2024; 62:479-493. [PMID: 37914959 DOI: 10.1007/s11517-023-02956-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 10/20/2023] [Indexed: 11/03/2023]
Abstract
Electroencephalogram (EEG) emotion recognition technology is essential for improving human-computer interaction. However, the practical application of emotion recognition technology is limited due to the variety of subjects and sessions. Transfer learning has been applied to address this issue and has received extensive research and application. Studies mainly concentrate on either instance transfer or representation transfer methods. This paper proposes an emotion recognition method called Joint Distributed Instances Represent Transfer (JD-IRT), which includes two core components: Joint Distribution Deep Adaptation (JDDA) and Instance-Representation Transfer (I-RT). Specifically, JDDA is different from common representation transfer methods in transfer learning. It bridges the discrepancies of marginal and conditional distributions simultaneously and combines multiple adaptive layers and kernels for deep domain adaptation. On the other hand, I-RT utilizes instance transfer to select source domain data for better representation transfer. We performed experiments and compared them with other representative methods in the SEED, SEED-IV, and SEED-V datasets. In cross-subject experiments, our approach achieved an average accuracy of 83.21% in SEED, 52.12% in SEED-IV, and 60.17% in SEED-V. Similarly, in cross-session experiments, the accuracy was 91.29% in SEED, 59.02% in SEED-IV, and 65.91% in SEED-V. These results demonstrate the improvement in the accuracy of EEG emotion recognition using the proposed approach.
Collapse
Affiliation(s)
- Lei Zhu
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China.
| | - Fei Yu
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China
| | - Aiai Huang
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China
| | - Nanjiao Ying
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China
- Center for Drug Inspection of Zhejiang Province, Hangzhou, 310000, China
| | - Jianhai Zhang
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310000, China
- Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou, 310000, China
| |
Collapse
|
11
|
Jiang H, Shen F, Chen L, Peng Y, Guo H, Gao H. Joint domain symmetry and predictive balance for cross-dataset EEG emotion recognition. J Neurosci Methods 2023; 400:109978. [PMID: 37806390 DOI: 10.1016/j.jneumeth.2023.109978] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 09/16/2023] [Accepted: 09/26/2023] [Indexed: 10/10/2023]
Abstract
BACKGROUND Cross-dataset EEG emotion recognition is an extremely challenging task, since data distributions of EEG from different datasets are greatly different, which makes the universal models yield unsatisfactory results. Although there are many methods have been proposed to reduce cross-dataset distribution discrepancies, they still neglected the following two problems. (1) Label space inconsistency: emotional label spaces of subjects from different datasets are different; (2) Uncertainty propagation: the uncertainty of misclassified emotion samples will propagate between datasets. NEW METHOD To solve these problems, we propose a novel method called domain symmetry and predictive balance (DSPB). For the problem of label space inconsistency, a domain symmetry module is designed to make label spaces of source and target domain to be the same, which randomly selects samples from the source domain and put into the target domain. For the problem of uncertainty propagation, a predictive balance module is proposed to reduce the prediction score of incorrect samples and then effectively reduce distribution differences between EEG from different datasets. RESULTS Experimental results show that our method achieve 61.48% average accuracies on the three cross-dataset tasks. Moreover, we find that gamma is the most relevant to emotion recognition among the five frequency bands, and the prefrontal and temporal brain regions are the channels carrying the most emotional information among the 62 brain channels. COMPARISON WITH EXISTING METHODS Compared with the partial domain adaptation method (SPDA) and the unsupervised domain adaptation (MS-MDA), our method improves average accuracies by 15.60% and 23.11%, respectively. CONCLUSION Besides, data distributions of EEG from different datasets but with the same emotional labels have been well aligned, which demonstrates the effectiveness of DSPB.
Collapse
Affiliation(s)
- Haiting Jiang
- College of Physics and Electronic Information Engineering, Zhejiang Normal University, Jin Hua, 321004, China
| | - Fangyao Shen
- School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Normal University, Jin Hua, 321004, China
| | - Lina Chen
- School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Normal University, Jin Hua, 321004, China.
| | - Yong Peng
- School of Computer Science and Technology, Hangzhou Dianzi University, Hang Zhou, 310018, China
| | - Hongjie Guo
- School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Normal University, Jin Hua, 321004, China
| | - Hong Gao
- School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Normal University, Jin Hua, 321004, China
| |
Collapse
|
12
|
Lu W, Liu H, Ma H, Tan TP, Xia L. Hybrid transfer learning strategy for cross-subject EEG emotion recognition. Front Hum Neurosci 2023; 17:1280241. [PMID: 38034069 PMCID: PMC10687359 DOI: 10.3389/fnhum.2023.1280241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Accepted: 10/25/2023] [Indexed: 12/02/2023] Open
Abstract
Emotion recognition constitutes a pivotal research topic within affective computing, owing to its potential applications across various domains. Currently, emotion recognition methods based on deep learning frameworks utilizing electroencephalogram (EEG) signals have demonstrated effective application and achieved impressive performance. However, in EEG-based emotion recognition, there exists a significant performance drop in cross-subject EEG Emotion recognition due to inter-individual differences among subjects. In order to address this challenge, a hybrid transfer learning strategy is proposed, and the Domain Adaptation with a Few-shot Fine-tuning Network (DFF-Net) is designed for cross-subject EEG emotion recognition. The first step involves the design of a domain adaptive learning module specialized for EEG emotion recognition, known as the Emo-DA module. Following this, the Emo-DA module is utilized to pre-train a model on both the source and target domains. Subsequently, fine-tuning is performed on the target domain specifically for the purpose of cross-subject EEG emotion recognition testing. This comprehensive approach effectively harnesses the attributes of domain adaptation and fine-tuning, resulting in a noteworthy improvement in the accuracy of the model for the challenging task of cross-subject EEG emotion recognition. The proposed DFF-Net surpasses the state-of-the-art methods in the cross-subject EEG emotion recognition task, achieving an average recognition accuracy of 93.37% on the SEED dataset and 82.32% on the SEED-IV dataset.
Collapse
Affiliation(s)
- Wei Lu
- Henan High-speed Railway Operation and Maintenance Engineering Research Center, Zhengzhou Railway Vocational and Technical College, Zhengzhou, China
- School of Computer Sciences, Universiti Sains Malaysia, Penang, Malaysia
| | - Haiyan Liu
- Zhengzhou Railway Vocational and Technical College, Zhengzhou, China
| | - Hua Ma
- Henan High-speed Railway Operation and Maintenance Engineering Research Center, Zhengzhou Railway Vocational and Technical College, Zhengzhou, China
| | - Tien-Ping Tan
- School of Computer Sciences, Universiti Sains Malaysia, Penang, Malaysia
| | - Lingnan Xia
- Zhengzhou Railway Vocational and Technical College, Zhengzhou, China
| |
Collapse
|
13
|
Tao J, Dan Y, Zhou D. Local domain generalization with low-rank constraint for EEG-based emotion recognition. Front Neurosci 2023; 17:1213099. [PMID: 38027525 PMCID: PMC10662311 DOI: 10.3389/fnins.2023.1213099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 10/04/2023] [Indexed: 12/01/2023] Open
Abstract
As an important branch in the field of affective computing, emotion recognition based on electroencephalography (EEG) faces a long-standing challenge due to individual diversities. To conquer this challenge, domain adaptation (DA) or domain generalization (i.e., DA without target domain in the training stage) techniques have been introduced into EEG-based emotion recognition to eliminate the distribution discrepancy between different subjects. The preceding DA or domain generalization (DG) methods mainly focus on aligning the global distribution shift between source and target domains, yet without considering the correlations between the subdomains within the source domain and the target domain of interest. Since the ignorance of the fine-grained distribution information in the source may still bind the DG expectation on EEG datasets with multimodal structures, multiple patches (or subdomains) should be reconstructed from the source domain, on which multi-classifiers could be learned collaboratively. It is expected that accurately aligning relevant subdomains by excavating multiple distribution patterns within the source domain could further boost the learning performance of DG/DA. Therefore, we propose in this work a novel DG method for EEG-based emotion recognition, i.e., Local Domain Generalization with low-rank constraint (LDG). Specifically, the source domain is firstly partitioned into multiple local domains, each of which contains only one positive sample and its positive neighbors and k2 negative neighbors. Multiple subject-invariant classifiers on different subdomains are then co-learned in a unified framework by minimizing local regression loss with low-rank regularization for considering the shared knowledge among local domains. In the inference stage, the learned local classifiers are discriminatively selected according to their importance of adaptation. Extensive experiments are conducted on two benchmark databases (DEAP and SEED) under two cross-validation evaluation protocols, i.e., cross-subject within-dataset and cross-dataset within-session. The experimental results under the 5-fold cross-validation demonstrate the superiority of the proposed method compared with several state-of-the-art methods.
Collapse
Affiliation(s)
- Jianwen Tao
- Institute of Artificial Intelligence Application, Ningbo Polytechnic, Zhejiang, China
| | - Yufang Dan
- Institute of Artificial Intelligence Application, Ningbo Polytechnic, Zhejiang, China
| | - Di Zhou
- Industrial Technological Institute of Intelligent Manufacturing, Sichuan University of Arts and Science, Dazhou, China
| |
Collapse
|
14
|
Li W, Huan W, Shao S, Hou B, Song A. MS-FRAN: A Novel Multi-Source Domain Adaptation Method for EEG-Based Emotion Recognition. IEEE J Biomed Health Inform 2023; 27:5302-5313. [PMID: 37665703 DOI: 10.1109/jbhi.2023.3311338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2023]
Abstract
Electroencephalogram (EEG)-based emotion recognition has gradually become a research hotspot. However, the large distribution differences of EEG signals across subjects make the current research stuck in a dilemma. To resolve this problem, in this article, we propose a novel and effective method, Multi-Source Feature Representation and Alignment Network (MS-FRAN). The effectiveness of proposed method mainly comes from three new modules: Wide Feature Extractor (WFE) for feature learning, Random Matching Operation (RMO) for model training, and Top- h ranked domain classifier selection (TOP) for emotion classification. MS-FRAN is not only effective in aligning the distributions of each pair of source and target domains, but also capable of reducing the distributional differences among the multiple source domains. Experimental results on the public benchmark datasets SEED and DEAP have demonstrated the advantage of our method over the related competitive approaches for cross-subject EEG-based emotion recognition.
Collapse
|
15
|
Huang HY, Lin YP. Validation of Model-Basis Transfer Learning for a Personalized Electroencephalogram-Based Emotion-Classification Model . ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38082699 DOI: 10.1109/embc40787.2023.10340188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
The electroencephalogram (EEG)-based affective brain-computer interface (aBCI) has attracted extensive attention in multidisciplinary fields in the past decade. However, the inherent variability of emotional responses recorded in EEG signals increases the vulnerability of pre-trained machine-learning models and impedes the applicability of aBCIs with real-life settings. To overcome the shortcomings associated with the limited personal data in affective modeling, this study proposes a model-basis transfer learning (TL) approach and verifies its feasibility to construct a personalized model using less emotion-annotated data in a longitudinal eight-day dataset comprising data on 10 subjects. By performing daily reliability testing, the proposed TL approach outperformed the subject-dependent counterpart (using limited data only) by ~6% in binary valence classification after recycling a compact set of the eight most transferable models from other subjects. These empirical findings practically contribute to progress in applying TL in realistic aBCI applications.Clinical Relevance- The proposed model-basis TL approach overcomes the shortcoming of inherent variability in EEG signals, supporting realistic aBCI applications.
Collapse
|
16
|
Quan J, Li Y, Wang L, He R, Yang S, Guo L. EEG-based cross-subject emotion recognition using multi-source domain transfer learning. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
17
|
Ran S, Zhong W, Duan D, Ye L, Zhang Q. SSTM-IS: simplified STM method based on instance selection for real-time EEG emotion recognition. Front Hum Neurosci 2023; 17:1132254. [PMID: 37323929 PMCID: PMC10267366 DOI: 10.3389/fnhum.2023.1132254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 05/05/2023] [Indexed: 06/17/2023] Open
Abstract
Introduction EEG signals can non-invasively monitor the brain activities and have been widely used in brain-computer interfaces (BCI). One of the research areas is to recognize emotions objectively through EEG. In fact, the emotion of people changes over time, however, most of the existing affective BCIs process data and recognize emotions offline, and thus cannot be applied to real-time emotion recognition. Methods In order to solve this problem, we introduce the instance selection strategy into transfer learning and propose a simplified style transfer mapping algorithm. In the proposed method, the informative instances are firstly selected from the source domain data, and then the update strategy of hyperparameters is also simplified for style transfer mapping, making the model training more quickly and accurately for a new subject. Results To verify the effectiveness of our algorithm, we carry out the experiments on SEED, SEED-IV and the offline dataset collected by ourselves, and achieve the recognition accuracies up to 86.78%, 82.55% and 77.68% in computing time of 7s, 4s and 10s, respectively. Furthermore, we also develop a real-time emotion recognition system which integrates the modules of EEG signal acquisition, data processing, emotion recognition and result visualization. Discussion Both the results of offline and online experiments show that the proposed algorithm can accurately recognize emotions in a short time, meeting the needs of real-time emotion recognition applications.
Collapse
Affiliation(s)
- Shuang Ran
- Key Laboratory of Media Audio & Video, Ministry of Education, Communication University of China, Beijing, China
| | - Wei Zhong
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| | - Danting Duan
- Key Laboratory of Media Audio & Video, Ministry of Education, Communication University of China, Beijing, China
| | - Long Ye
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| | - Qin Zhang
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| |
Collapse
|
18
|
Yuan D, Yue J, Xiong X, Jiang Y, Zan P, Li C. A regression method for EEG-based cross-dataset fatigue detection. Front Physiol 2023; 14:1196919. [PMID: 37324376 PMCID: PMC10266210 DOI: 10.3389/fphys.2023.1196919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 05/17/2023] [Indexed: 06/17/2023] Open
Abstract
Introduction: Fatigue is dangerous for certain jobs requiring continuous concentration. When faced with new datasets, the existing fatigue detection model needs a large amount of electroencephalogram (EEG) data for training, which is resource-consuming and impractical. Although the cross-dataset fatigue detection model does not need to be retrained, no one has studied this problem previously. Therefore, this study will focus on the design of the cross-dataset fatigue detection model. Methods: This study proposes a regression method for EEG-based cross-dataset fatigue detection. This method is similar to self-supervised learning and can be divided into two steps: pre-training and the domain-specific adaptive step. To extract specific features for different datasets, a pretext task is proposed to distinguish data on different datasets in the pre-training step. Then, in the domain-specific adaptation stage, these specific features are projected into a shared subspace. Moreover, the maximum mean discrepancy (MMD) is exploited to continuously narrow the differences in the subspace so that an inherent connection can be built between datasets. In addition, the attention mechanism is introduced to extract continuous information on spatial features, and the gated recurrent unit (GRU) is used to capture time series information. Results: The accuracy and root mean square error (RMSE) achieved by the proposed method are 59.10% and 0.27, respectively, which significantly outperforms state-of-the-art domain adaptation methods. Discussion: In addition, this study discusses the effect of labeled samples. When the number of labeled samples is 10% of the total number, the accuracy of the proposed model can reach 66.21%. This study fills a vacancy in the field of fatigue detection. In addition, the EEG-based cross-dataset fatigue detection method can be used for reference by other EEG-based deep learning research practices.
Collapse
Affiliation(s)
- Duanyang Yuan
- Shanghai Key Laboratory of Power Station Automation, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China
| | - Jingwei Yue
- Beijing Institute of Radiation Medicine, Academy of Military Medical Sciences (AMMS), Beijing, China
| | - Xuefeng Xiong
- Shanghai Key Laboratory of Power Station Automation, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China
| | - Yibi Jiang
- Shanghai Key Laboratory of Power Station Automation, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China
| | - Peng Zan
- Shanghai Key Laboratory of Power Station Automation, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China
| | - Chunyong Li
- Beijing Institute of Radiation Medicine, Academy of Military Medical Sciences (AMMS), Beijing, China
| |
Collapse
|
19
|
Zhang W, Wang Z, Wu D. Multi-Source Decentralized Transfer for Privacy-Preserving BCIs. IEEE Trans Neural Syst Rehabil Eng 2022; 30:2710-2720. [PMID: 36112563 DOI: 10.1109/tnsre.2022.3207494] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Transfer learning, which utilizes labeled source domains to facilitate the learning in a target model, is effective in alleviating high intra- and inter-subject variations in electroencephalogram (EEG) based brain-computer interfaces (BCIs). Existing transfer learning approaches usually use the source subjects' EEG data directly, leading to privacy concerns. This paper considers a decentralized privacy-preserving transfer learning scenario: there are multiple source subjects, whose data and computations are kept local, and only the parameters or predictions of their pre-trained models can be accessed for privacy-protection; then, how to perform effective cross-subject transfer for a new subject with unlabeled EEG trials? We propose an offline unsupervised multi-source decentralized transfer (MSDT) approach, which first generates a pre-trained model from each source subject, and then performs decentralized transfer using the source model parameters (in gray-box settings) or predictions (in black-box settings). Experiments on two datasets from two BCI paradigms, motor imagery and affective BCI, demonstrated that MSDT outperformed several existing approaches, which do not consider privacy-protection at all. In other words, MSDT achieved both high privacy-protection and better classification performance.
Collapse
|
20
|
Guo W, Xu G, Wang Y. Horizontal and vertical features fusion network based on different brain regions for emotion recognition. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
21
|
Cao J, He X, Yang C, Chen S, Li Z, Wang Z. Multi-Source and Multi-Representation Adaptation for Cross-Domain Electroencephalography Emotion Recognition. Front Psychol 2022; 12:809459. [PMID: 35095696 PMCID: PMC8792438 DOI: 10.3389/fpsyg.2021.809459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 12/21/2021] [Indexed: 11/29/2022] Open
Abstract
Due to the non-invasiveness and high precision of electroencephalography (EEG), the combination of EEG and artificial intelligence (AI) is often used for emotion recognition. However, the internal differences in EEG data have become an obstacle to classification accuracy. To solve this problem, considering labeled data from similar nature but different domains, domain adaptation usually provides an attractive option. Most of the existing researches aggregate the EEG data from different subjects and sessions as a source domain, which ignores the assumption that the source has a certain marginal distribution. Moreover, existing methods often only align the representation distributions extracted from a single structure, and may only contain partial information. Therefore, we propose the multi-source and multi-representation adaptation (MSMRA) for cross-domain EEG emotion recognition, which divides the EEG data from different subjects and sessions into multiple domains and aligns the distribution of multiple representations extracted from a hybrid structure. Two datasets, i.e., SEED and SEED IV, are used to validate the proposed method in cross-session and cross-subject transfer scenarios, experimental results demonstrate the superior performance of our model to state-of-the-art models in most settings.
Collapse
Affiliation(s)
| | - Xueqin He
- School of Informatics, Xiamen University, Xiamen, China
| | - Chenhui Yang
- School of Informatics, Xiamen University, Xiamen, China
| | - Sifang Chen
- Department of Neurosurgery, The First Affiliated Hospital of Xiamen University, Xiamen, China
| | - Zhangyu Li
- Department of Neurosurgery, The First Affiliated Hospital of Xiamen University, Xiamen, China
| | - Zhanxiang Wang
- Department of Neurosurgery, Xiamen Key Laboratory of Brain Center, The First Affiliated Hospital of Xiamen University, Xiamen, China
- Department of Neuroscience, Institute of Neurosurgery, School of Medicine, Xiamen University, Xiamen, China
| |
Collapse
|