1. Wang J. Evaluation and analysis of visual perception using attention-enhanced computation in multimedia affective computing. Front Neurosci 2024;18:1449527. PMID: 39170679; PMCID: PMC11335721; DOI: 10.3389/fnins.2024.1449527.
Abstract
Facial expression recognition (FER) plays a crucial role in affective computing, enhancing human-computer interaction by enabling machines to understand and respond to human emotions. Despite advancements in deep learning, current FER systems often struggle with challenges such as occlusions, head pose variations, and motion blur in natural environments. These challenges highlight the need for more robust FER solutions. To address these issues, we propose the Attention-Enhanced Multi-Layer Transformer (AEMT) model, which integrates a dual-branch Convolutional Neural Network (CNN), an Attentional Selective Fusion (ASF) module, and a Multi-Layer Transformer Encoder (MTE) with transfer learning. The dual-branch CNN captures detailed texture and color information by processing RGB and Local Binary Pattern (LBP) features separately. The ASF module selectively enhances relevant features by applying global and local attention mechanisms to the extracted features. The MTE captures long-range dependencies and models the complex relationships between features, collectively improving feature representation and classification accuracy. Our model was evaluated on the RAF-DB and AffectNet datasets. Experimental results demonstrate that the AEMT model achieved an accuracy of 81.45% on RAF-DB and 71.23% on AffectNet, significantly outperforming existing state-of-the-art methods. These results indicate that our model effectively addresses the challenges of FER in natural environments, providing a more robust and accurate solution. The AEMT model significantly advances the field of FER by improving the robustness and accuracy of emotion recognition in complex real-world scenarios. This work not only enhances the capabilities of affective computing systems but also opens new avenues for future research in improving model efficiency and expanding multimodal data integration.
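To make the dual-branch idea concrete, the sketch below derives an LBP map with scikit-image and fuses it with an RGB branch by simple concatenation. It is a minimal illustration under assumed layer sizes, not the AEMT configuration; the paper's ASF and MTE stages are omitted.

```python
# Illustrative dual-branch RGB + LBP front end (PyTorch + scikit-image); sizes are assumptions.
import numpy as np
import torch
import torch.nn as nn
from skimage.feature import local_binary_pattern

def lbp_map(gray: np.ndarray, P: int = 8, R: int = 1) -> np.ndarray:
    """Uniform LBP codes rescaled to [0, 1] for network input."""
    codes = local_binary_pattern(gray, P, R, method="uniform")
    return (codes / codes.max()).astype(np.float32)

class DualBranchCNN(nn.Module):
    def __init__(self):
        super().__init__()
        def branch(in_ch):
            return nn.Sequential(
                nn.Conv2d(in_ch, 32, 3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),
                nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1),
            )
        self.rgb_branch = branch(3)   # colour and texture from RGB
        self.lbp_branch = branch(1)   # micro-texture from LBP codes

    def forward(self, rgb, lbp):
        f_rgb = self.rgb_branch(rgb).flatten(1)   # (B, 64)
        f_lbp = self.lbp_branch(lbp).flatten(1)   # (B, 64)
        return torch.cat([f_rgb, f_lbp], dim=1)   # fused descriptor for later stages

gray = np.random.rand(64, 64)
lbp = torch.from_numpy(lbp_map(gray))[None, None]   # (1, 1, 64, 64)
fused = DualBranchCNN()(torch.randn(1, 3, 64, 64), lbp)
```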
Affiliation(s)
- Jingyi Wang
- School of Mass-communication and Advertising, Tongmyong University, Busan, Republic of Korea
2. Aina J, Akinniyi O, Rahman MM, Odero-Marah V, Khalifa F. A Hybrid Learning-Architecture for Mental Disorder Detection Using Emotion Recognition. IEEE Access 2024;12:91410-91425. PMID: 39054996; PMCID: PMC11270886; DOI: 10.1109/access.2024.3421376.
Abstract
Mental illness has grown to become a prevalent global health concern that affects individuals across various demographics. Timely detection and accurate diagnosis of mental disorders are crucial for effective treatment and support, as late diagnosis can lead to suicidal or harmful behaviors and, ultimately, death. To this end, the present study introduces a novel pipeline for the analysis of facial expressions, leveraging both the AffectNet and 2013 Facial Emotion Recognition (FER) datasets. This research goes beyond traditional diagnostic methods by contributing a system capable of generating a comprehensive mental disorder dataset and concurrently predicting mental disorders based on facial emotional cues. In particular, we introduce a hybrid architecture for mental disorder detection that leverages the state-of-the-art object detection algorithm YOLOv8 to detect and classify visual cues associated with specific mental disorders. To achieve accurate predictions, an integrated learning architecture based on the fusion of Convolutional Neural Networks (CNNs) and Vision Transformer (ViT) models is developed to form an ensemble classifier that predicts the presence of mental illness (e.g., depression, anxiety, and other mental disorders). The overall accuracy is improved to about 81% using the proposed ensemble technique. To ensure transparency and interpretability, we integrate techniques such as Gradient-weighted Class Activation Mapping (Grad-CAM) and saliency maps to highlight the regions of the input image that contribute most to the model's predictions. This provides healthcare professionals with a clear understanding of the features influencing the system's decisions, enhancing trust and supporting a more informed diagnostic process.
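Grad-CAM, which the authors use for interpretability, fits in a few lines of PyTorch. The sketch below uses a ResNet-18 backbone and its last convolutional block as placeholders; the paper's YOLOv8 and CNN-ViT ensemble is not reproduced.

```python
# Minimal Grad-CAM sketch; the backbone and target layer are illustrative placeholders.
import torch
import torch.nn.functional as F
from torchvision import models

model = models.resnet18(weights=None).eval()
target_layer = model.layer4                    # last conv block
acts, grads = {}, {}
target_layer.register_forward_hook(lambda m, i, o: acts.update(v=o))
target_layer.register_full_backward_hook(lambda m, gi, go: grads.update(v=go[0]))

def grad_cam(x: torch.Tensor, class_idx: int) -> torch.Tensor:
    logits = model(x)
    model.zero_grad()
    logits[0, class_idx].backward()
    w = grads["v"].mean(dim=(2, 3), keepdim=True)     # channel importance weights
    cam = F.relu((w * acts["v"]).sum(dim=1))          # weighted activation map
    cam = F.interpolate(cam[None], size=x.shape[-2:], mode="bilinear")[0]
    return cam / (cam.max() + 1e-8)                   # normalised heat map

heat = grad_cam(torch.randn(1, 3, 224, 224), class_idx=0)
```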
Affiliation(s)
- Joseph Aina
- Electrical and Computer Engineering Department, School of Engineering, Morgan State University, Baltimore, MD 21251, USA
- Oluwatunmise Akinniyi
- Electrical and Computer Engineering Department, School of Engineering, Morgan State University, Baltimore, MD 21251, USA
- Md Mahmudur Rahman
- Department of Computer Science, School of Computer, Mathematical and Natural Sciences, Morgan State University, Baltimore, MD 21251, USA
- Valerie Odero-Marah
- Center for Urban Health Disparities Research and Innovation, Department of Biology, Morgan State University, Baltimore, MD 21251, USA
- Fahmi Khalifa
- Electrical and Computer Engineering Department, School of Engineering, Morgan State University, Baltimore, MD 21251, USA
- Electronics and Communications Engineering Department, Mansoura University, Mansoura 35516, Egypt
3. Li N, Huang Y, Wang Z, Fan Z, Li X, Xiao Z. Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition. Sensors (Basel) 2024;24:4153. PMID: 39000930; PMCID: PMC11243949; DOI: 10.3390/s24134153.
Abstract
Convolutional neural networks (CNNs) have made significant progress in the field of facial expression recognition (FER). However, due to challenges such as occlusion, lighting variations, and changes in head pose, facial expression recognition in real-world environments remains highly challenging. At the same time, methods based solely on CNNs rely heavily on local spatial features, lack global information, and struggle to balance computational complexity against recognition accuracy, so CNN-based models still fall short of addressing FER adequately. To address these issues, we propose a lightweight facial expression recognition method based on a hybrid vision transformer. This method captures multi-scale facial features through an improved attention module, achieving richer feature integration, enhancing the network's perception of key facial expression regions, and improving feature extraction capabilities. Additionally, to further enhance the model's performance, we designed the patch dropping (PD) module. This module emulates the attention allocation mechanism of the human visual system for local features, guiding the network to focus on the most discriminative features, reducing the influence of irrelevant features, and directly lowering computational costs. Extensive experiments demonstrate that our approach significantly outperforms other methods, achieving an accuracy of 86.51% on RAF-DB and nearly 70% on FER2013, with a model size of only 3.64 MB. These results show that our method provides a new perspective for the field of facial expression recognition.
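The patch-dropping idea, keeping only the most salient patch tokens so later layers do less work, can be sketched as a top-k selection. The saliency scores and keep ratio below are illustrative assumptions, not the paper's PD module.

```python
# Illustrative patch dropping: keep the k most salient tokens per image.
import torch

def drop_patches(tokens: torch.Tensor, scores: torch.Tensor, keep_ratio: float = 0.7):
    """tokens: (B, N, D) patch embeddings; scores: (B, N) saliency, e.g. CLS attention."""
    B, N, D = tokens.shape
    k = max(1, int(N * keep_ratio))
    idx = scores.topk(k, dim=1).indices           # most discriminative patches
    idx = idx.unsqueeze(-1).expand(-1, -1, D)     # (B, k, D) gather index
    return torch.gather(tokens, 1, idx)           # (B, k, D): cheaper downstream attention

kept = drop_patches(torch.randn(2, 196, 64), torch.rand(2, 196))   # 137 of 196 survive
```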
Affiliation(s)
- Nianfeng Li
- College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China
- Yongyuan Huang
- College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China
- Zhenyan Wang
- College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China
- Ziyao Fan
- College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China
- Xinyuan Li
- College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China
- Zhiguo Xiao
- College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China
- School of Computer Science Technology, Beijing Institute of Technology, Beijing 100811, China
4. Ramzani Shahrestani M, Motamed S, Yamaghani M. Recognition of facial emotion based on SOAR model. Front Neurosci 2024;18:1374112. PMID: 38826778; PMCID: PMC11140482; DOI: 10.3389/fnins.2024.1374112.
Abstract
Introduction Expressing emotions plays a special role in daily communication, and one of the most essential ways of detecting emotion is to detect facial emotional states. The recognition of facial expressions, and the creation of feedback according to the perceived emotion, is therefore a crucial aspect of natural human-machine interaction. This article presents an efficient method for recognizing emotional states from facial images based on a mixed deep learning and cognitive model called SOAR. Methods The model is implemented in two main steps. The first step reads the video, converts it to images, and preprocesses them. The next step uses a combination of a 3D convolutional neural network (3DCNN) and learning automata (LA) to classify facial emotions and measure the recognition rate. We chose a 3DCNN because no dimension is removed from the images, and incorporating the temporal information in dynamic images leads to more efficient and better classification. In addition, the backpropagation error used to train the 3DCNN is adjusted by the LA, which both increases the efficiency of the proposed model and implements the working-memory part of the SOAR model. Results and discussion The objectives of the proposed model include learning the temporal order of frames in a video and better representing visual features, thereby increasing the recognition rate. The proposed model recognizes facial emotional states with an accuracy of 85.3%. Comparisons with competing models show that the proposed model performs better than the alternatives.
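A 3D CNN of the kind described, convolving jointly over time and space so that neither dimension is discarded, might look like the minimal sketch below. The layer sizes are assumptions, and the learning-automata adjustment of backpropagation is not reproduced.

```python
# Minimal 3D-CNN sketch for clip-level expression classification; sizes are assumptions.
import torch
import torch.nn as nn

class Tiny3DCNN(nn.Module):
    def __init__(self, n_classes: int = 7):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv3d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool3d((1, 2, 2)),                    # pool space, keep time
            nn.Conv3d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1),
        )
        self.classifier = nn.Linear(32, n_classes)

    def forward(self, clip):                            # clip: (B, 3, T, H, W)
        return self.classifier(self.features(clip).flatten(1))

logits = Tiny3DCNN()(torch.randn(2, 3, 16, 64, 64))     # two 16-frame clips
```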
Affiliation(s)
- Sara Motamed
- Department of Computer Engineering, Fouman and Shaft Branch, Islamic Azad University, Fouman, Iran
- Mohammadreza Yamaghani
- Department of Computer Engineering, Lahijan Branch, Islamic Azad University, Lahijan, Iran
5. Xie W, Peng Z, Shen L, Lu W, Zhang Y, Song S. Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition. IEEE Trans Image Process 2024;33:2514-2529. PMID: 38530732; DOI: 10.1109/tip.2024.3378459.
Abstract
Convolutional neural networks (CNNs) have achieved significant improvement on the task of facial expression recognition. However, current training still suffers from inconsistent learning intensities among different layers: the feature representations in the shallow layers are not sufficiently learned compared with those in deep layers. To this end, this work proposes a contrastive learning framework to align the feature semantics of shallow and deep layers, followed by an attention module that represents the multi-scale features in a weight-adaptive manner. The proposed algorithm has three main merits. First, the learning intensity, defined as the magnitude of the backpropagation gradient, of the features in the shallow layers is enhanced by cross-layer contrastive learning. Second, the latent semantics in the shallow-layer and deep-layer features are explored and aligned during contrastive learning, so the fine-grained characteristics of expressions can be taken into account in representation learning. Third, by integrating the multi-scale features from multiple layers with an attention module, our algorithm achieves state-of-the-art performance (92.21%, 89.50%, and 62.82%) on three in-the-wild expression databases (RAF-DB, FERPlus, and SFEW) and the second-best performance (65.29%) on the AffectNet dataset. Our codes will be made publicly available.
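The cross-layer alignment can be illustrated with an InfoNCE-style loss that treats the shallow and deep projections of the same image as a positive pair. This is a generic sketch, not the paper's exact formulation; the temperature is an assumed hyperparameter.

```python
# Sketch of a cross-layer InfoNCE loss aligning shallow- and deep-layer embeddings.
import torch
import torch.nn.functional as F

def cross_layer_nce(shallow: torch.Tensor, deep: torch.Tensor, tau: float = 0.1):
    """shallow, deep: (B, D) projections of the same images; positives on the diagonal."""
    s = F.normalize(shallow, dim=1)
    d = F.normalize(deep, dim=1)
    logits = s @ d.t() / tau                     # (B, B) similarity matrix
    targets = torch.arange(s.size(0), device=s.device)
    return F.cross_entropy(logits, targets)      # pull matching layer pairs together

loss = cross_layer_nce(torch.randn(8, 128), torch.randn(8, 128))
```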
6. Lim H, Joo Y, Ha E, Song Y, Yoon S, Shin T. Brain Age Prediction Using Multi-Hop Graph Attention Combined with Convolutional Neural Network. Bioengineering (Basel) 2024;11:265. PMID: 38534539; DOI: 10.3390/bioengineering11030265.
Abstract
Convolutional neural networks (CNNs) have been used widely to predict biological brain age based on brain magnetic resonance (MR) images. However, CNNs focus mainly on spatially local features and their aggregates and barely on the connective information between distant regions. To overcome this issue, we propose a novel multi-hop graph attention (MGA) module that exploits both the local and global connections of image features when combined with CNNs. After insertion between convolutional layers, MGA first converts the convolution-derived feature map into graph-structured data by using patch embedding and embedding-distance-based scoring. Multi-hop connections between the graph nodes are modeled by using the Markov chain process. After performing multi-hop graph attention, MGA re-converts the graph into an updated feature map and transfers it to the next convolutional layer. We combined the MGA module with sSE (spatial squeeze and excitation)-ResNet18 for our final prediction model (MGA-sSE-ResNet18) and performed various hyperparameter evaluations to identify the optimal parameter combinations. With 2788 three-dimensional T1-weighted MR images of healthy subjects, we verified the effectiveness of MGA-sSE-ResNet18 with comparisons to four established, general-purpose CNNs and two representative brain age prediction models. The proposed model yielded an optimal performance with a mean absolute error of 2.822 years and Pearson's correlation coefficient (PCC) of 0.968, demonstrating the potential of the MGA module to improve the accuracy of brain age prediction.
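The Markov-chain view of multi-hop connectivity can be sketched with powers of a row-stochastic transition matrix built from embedding similarities. Attention heads and the re-conversion to a feature map are omitted, and all names below are illustrative.

```python
# Sketch of multi-hop mixing via powers of a row-stochastic transition matrix.
import torch
import torch.nn.functional as F

def multi_hop_mix(nodes: torch.Tensor, n_hops: int = 3) -> torch.Tensor:
    """nodes: (N, D) patch embeddings; similarity -> transition matrix -> k-hop context."""
    sim = nodes @ nodes.t()                      # embedding-similarity scoring
    P = F.softmax(sim, dim=1)                    # one-step transition probabilities
    out, P_k = nodes, P
    for _ in range(n_hops):                      # accumulate 1..k hop neighbourhoods
        out = out + P_k @ nodes
        P_k = P_k @ P                            # next hop of the Markov chain
    return out / (n_hops + 1)

mixed = multi_hop_mix(torch.randn(49, 64))       # e.g. a 7x7 patch grid
```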
Affiliation(s)
- Heejoo Lim
- Division of Mechanical and Biomedical Engineering, Ewha W. University, Seoul 03760, Republic of Korea
- Graduate Program in Smart Factory, Ewha W. University, Seoul 03760, Republic of Korea
- Yoonji Joo
- Ewha Brain Institute, Ewha W. University, Seoul 03760, Republic of Korea
- Eunji Ha
- Ewha Brain Institute, Ewha W. University, Seoul 03760, Republic of Korea
- Yumi Song
- Ewha Brain Institute, Ewha W. University, Seoul 03760, Republic of Korea
- Department of Brain and Cognitive Sciences, Ewha W. University, Seoul 03760, Republic of Korea
- Sujung Yoon
- Ewha Brain Institute, Ewha W. University, Seoul 03760, Republic of Korea
- Department of Brain and Cognitive Sciences, Ewha W. University, Seoul 03760, Republic of Korea
- Taehoon Shin
- Division of Mechanical and Biomedical Engineering, Ewha W. University, Seoul 03760, Republic of Korea
- Graduate Program in Smart Factory, Ewha W. University, Seoul 03760, Republic of Korea
7. Tao H, Duan Q. Hierarchical attention network with progressive feature fusion for facial expression recognition. Neural Netw 2024;170:337-348. PMID: 38006736; DOI: 10.1016/j.neunet.2023.11.033.
Abstract
Facial expression recognition (FER) in the wild is challenging due to the disturbing factors including pose variation, occlusions, and illumination variation. The attention mechanism can relieve these issues by enhancing expression-relevant information and suppressing expression-irrelevant information. However, most methods utilize the same attention mechanism on feature tensors with varying spatial and channel sizes across different network layers, disregarding the dynamically changing sizes of these tensors. To solve this issue, this paper proposes a hierarchical attention network with progressive feature fusion for FER. Specifically, first, to aggregate diverse complementary features, a diverse feature extraction module based on several feature aggregation blocks is designed to exploit both local context and global context features, both low-level and high-level features, as well as the gradient features that are robust to illumination variation. Second, to effectively fuse the above diverse features, a hierarchical attention module (HAM) is designed to progressively enhance discriminative features from key parts of the facial images and suppress task-irrelevant features from disturbing facial regions. Extensive experiments show that our model achieves the best performance among existing FER methods.
Affiliation(s)
- Huanjie Tao
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, PR China; Engineering Research Center of Embedded System Integration, Ministry of Education, Xi'an 710129, PR China; National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, Xi'an 710129, PR China
- Qianyue Duan
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, PR China
8. B A, Sarkar A, Behera PR, Shukla J. Multi-source transfer learning for facial emotion recognition using multivariate correlation analysis. Sci Rep 2023;13:21004. PMID: 38017241; PMCID: PMC10684585; DOI: 10.1038/s41598-023-48250-x.
Abstract
Deep learning techniques have proven effective in solving the facial emotion recognition (FER) problem. However, they demand a significant amount of supervised data, which is often unavailable due to privacy and ethical concerns. In this paper, we present a novel approach to the FER problem using multi-source transfer learning. The proposed method leverages knowledge from multiple data sources of similar domains to inform the model on a related task. The approach involves optimizing the aggregate multivariate correlation among the source tasks trained on the source dataset, thus controlling the transfer of information to the target task. The hypothesis is validated on benchmark datasets for facial emotion recognition and image classification tasks, and the results demonstrate the effectiveness of the proposed method in capturing the group correlation among features, as well as its robustness to negative transfer and good performance in few-shot multi-source adaptation. With respect to the state-of-the-art methods MCW and DECISION, our approach shows improvements of 7% and 15%, respectively.
Affiliation(s)
- Ashwini B
- Human-Machine Interaction Lab, Indraprastha Institute of Information Technology, New Delhi, India
- Arka Sarkar
- Human-Machine Interaction Lab, Indraprastha Institute of Information Technology, New Delhi, India
- Pruthivi Raj Behera
- Human-Machine Interaction Lab, Indraprastha Institute of Information Technology, New Delhi, India
- Jainendra Shukla
- Human-Machine Interaction Lab, Indraprastha Institute of Information Technology, New Delhi, India
9. Wen H. Webcast marketing platform optimization via 6G R&D and the impact on brand content creation. PLoS One 2023;18:e0292394. PMID: 37856448; PMCID: PMC10586639; DOI: 10.1371/journal.pone.0292394.
Abstract
This work aims to investigate the development and management of cosmetics webcast marketing platforms, offering novel approaches for building and sustaining commercial brands. Firstly, an analysis of the current utilization of cosmetics webcast marketing platforms is conducted, identifying operational challenges associated with these platforms. Secondly, optimization strategies are proposed to address the identified issues by leveraging advancements in 6th Generation (6G) communication technology. Subsequently, a conceptual framework is established, employing big data interaction to examine the influence of webcast marketing platform experiences on brand fit. Multiple hypotheses are formulated to explore the relationship between platform experiences and brand fit. Finally, empirical analysis is performed within the context of the 5th Generation (5G) Mobile Communication Technology and extended to incorporate the 6G Mobile Communication Technology landscape. The results of the validation indicate the following: (1) the content generated by the webcast marketing platform has a positive impact on brand fit (β = 0.46, p<0.01; β = 0.31, p<0.05); (2) in the 6G network environment, a webcast marketing platform with high traffic transmission rates may enhance brand fit (β = 0.51, p<0.001); (3) the content generated by the webcast marketing platform exhibits significant positive regulatory effects on information-based and co-generated content (β = 0.42, p<0.01; β = 0.02, p<0.001). The findings of this work offer valuable insights for other scholars and researchers seeking to optimize webcast marketing platforms.
Affiliation(s)
- Hui Wen
- School of Management, Henan Institute of Economics and Trade, Zhengzhou, Henan, China
10. Li Y, Huang J, Lu S, Zhang Z, Lu G. Cross-Domain Facial Expression Recognition via Contrastive Warm up and Complexity-Aware Self-Training. IEEE Trans Image Process 2023;32:5438-5450. PMID: 37773906; DOI: 10.1109/tip.2023.3318955.
Abstract
Unsupervised cross-domain Facial Expression Recognition (FER) aims to transfer the knowledge from a labeled source domain to an unlabeled target domain. Existing methods strive to reduce the discrepancy between the source and target domains but cannot effectively explore the abundant semantic information of the target domain due to the absence of target labels. To this end, we propose a novel framework via Contrastive Warm up and Complexity-aware Self-Training (CWCST), which facilitates source knowledge transfer and target semantic learning jointly. Specifically, we formulate a contrastive warm-up strategy via features, momentum features, and learnable category centers to concurrently learn discriminative representations and narrow the domain gap, which benefits domain adaptation by generating more accurate target pseudo labels. Moreover, to deal with the inevitable noise in pseudo labels, we develop complexity-aware self-training with a label selection module based on prediction entropy, which iteratively generates pseudo labels and adaptively chooses the reliable ones for training, ultimately yielding effective target semantics exploration. By jointly using these two components, our framework effectively exploits both source knowledge and target semantic information through source-target co-training. In addition, our framework can be easily incorporated into other baselines with consistent performance improvements. Extensive experimental results on seven databases show the superior performance of the proposed method against various baselines.
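The entropy-based label selection can be illustrated as below; the entropy threshold is an assumed hyperparameter, not a value from the paper.

```python
# Sketch of complexity-aware pseudo-label selection by prediction entropy.
import torch
import torch.nn.functional as F

def select_pseudo_labels(logits: torch.Tensor, max_entropy: float = 0.5):
    """logits: (B, C) target-domain predictions; keep only low-entropy (reliable) samples."""
    probs = F.softmax(logits, dim=1)
    entropy = -(probs * probs.clamp_min(1e-8).log()).sum(dim=1)   # per-sample uncertainty
    keep = entropy < max_entropy
    return probs.argmax(dim=1)[keep], keep                        # labels + reliability mask

labels, mask = select_pseudo_labels(torch.randn(16, 7))
```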
11. Zhao Y, Zhu H, Chen X, Luo F, Li M, Zhou J, Chen S, Pan Y. Pose-invariant and occlusion-robust neonatal facial pain assessment. Comput Biol Med 2023;165:107462. PMID: 37716244; DOI: 10.1016/j.compbiomed.2023.107462.
Abstract
Neonatal Facial Pain Assessment (NFPA) is essential to improving neonatal pain management. Pose variation and occlusion, which can significantly alter facial appearance, are two major and still unstudied barriers to NFPA. We bridge this gap in terms of both method and dataset. Techniques that tackle these challenges in other tasks either design pose/occlusion-invariant deep learning methods or first generate a normalized version of the input image before feature extraction. Combining these ideas, we argue that it is more effective to perform adversarial learning and end-to-end classification jointly, for their mutual benefit. To this end, we propose a Pose-invariant Occlusion-robust Pain Assessment (POPA) framework with two novelties. First, we incorporate adversarial-learning-based disturbance mitigation for end-to-end pain-level classification and propose a novel composite loss function for facial representation learning. Second, in contrast to the vanilla discriminator that determines occlusion and pose conditions implicitly, we propose a multi-scale discriminator that determines them explicitly, incorporating local discriminators to enhance the discrimination of key regions. For a comprehensive evaluation, we built the first neonatal pain dataset with disturbance annotations, involving 1091 neonates, and also applied the proposed POPA to the facial expression recognition task. Extensive qualitative and quantitative experiments prove the superiority of POPA.
Affiliation(s)
- Yisheng Zhao
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China
- Huaiyu Zhu
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China
- Xiaofei Chen
- Nursing Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou 310052, China
- Feixiang Luo
- Nursing Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou 310052, China
- Mengting Li
- Nursing Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou 310052, China
- Jinyan Zhou
- Nursing Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou 310052, China
- Shuohui Chen
- Hospital Infection-Control Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou 310052, China
- Yun Pan
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China
12. Chen Y, Liu S, Zhao D, Ji W. Occlusion facial expression recognition based on feature fusion residual attention network. Front Neurorobot 2023;17:1250706. PMID: 37663762; PMCID: PMC10472272; DOI: 10.3389/fnbot.2023.1250706.
Abstract
Recognizing occluded facial expressions in the wild poses a significant challenge. Most previous approaches rely solely on either global or local feature-based methods, leading to the loss of relevant expression features. To address these issues, a feature fusion residual attention network (FFRA-Net) is proposed. FFRA-Net consists of a multi-scale module, a local attention module, and a feature fusion module. The multi-scale module divides the intermediate feature map into several equal sub-feature maps along the channel dimension and applies a convolution operation to each, yielding diverse global features. The local attention module divides the intermediate feature map into several sub-feature maps along the spatial dimension and applies a convolution operation to each, extracting local key features through the attention mechanism. The feature fusion module integrates global and local expression features and establishes residual links between inputs and outputs to compensate for the loss of fine-grained features. Finally, two occlusion expression datasets (FM_RAF-DB and SG_RAF-DB) were constructed based on the RAF-DB dataset. Extensive experiments demonstrate that the proposed FFRA-Net achieves excellent results on four datasets: FM_RAF-DB, SG_RAF-DB, RAF-DB, and FERPlus, with accuracies of 77.87%, 79.50%, 88.66%, and 88.97%, respectively. The approach thus demonstrates strong applicability to occluded facial expression recognition (FER).
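The multi-scale module's equal channel split can be sketched as below. Using dilation to vary the receptive field per chunk is an illustrative choice, since the abstract only specifies a convolution per sub-feature map.

```python
# Sketch of a channel-split multi-scale block: equal chunks, per-chunk convs, concat.
import torch
import torch.nn as nn

class ChannelSplitMultiScale(nn.Module):
    def __init__(self, channels: int = 64, n_groups: int = 4):
        super().__init__()
        c = channels // n_groups
        # one conv per chunk; growing dilation gives each chunk a different scale
        self.convs = nn.ModuleList(
            nn.Conv2d(c, c, 3, padding=d, dilation=d) for d in range(1, n_groups + 1)
        )

    def forward(self, x):
        chunks = x.chunk(len(self.convs), dim=1)   # equal split along channels
        return torch.cat([conv(c) for conv, c in zip(self.convs, chunks)], dim=1)

y = ChannelSplitMultiScale()(torch.randn(1, 64, 28, 28))   # same spatial size out
```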
Affiliation(s)
- Shuaishi Liu
- School of Electrical and Electronic Engineering, Changchun University of Technology, Changchun, China
13. Fang B, Zhao Y, Han G, He J. Expression-Guided Deep Joint Learning for Facial Expression Recognition. Sensors (Basel) 2023;23:7148. PMID: 37631685; PMCID: PMC10457757; DOI: 10.3390/s23167148.
Abstract
In recent years, convolutional neural networks (CNNs) have played a dominant role in facial expression recognition. While CNN-based methods have achieved remarkable success, they are notorious for having an excessive number of parameters and for relying on a large amount of manually annotated data. To address this challenge, we expand the number of training samples by learning expressions from a face recognition dataset, reducing the impact of small sample sizes on network training. In the proposed deep joint learning framework, the deep features of the face recognition dataset are clustered while the parameters of an efficient CNN are learned simultaneously, thereby labeling the data for network training automatically and efficiently. Specifically, we first develop a new efficient CNN based on the proposed affinity convolution (AC) module, with much lower computational overhead, for deep feature learning and expression classification. Then, we develop an expression-guided deep facial clustering approach to cluster the deep features and generate abundant expression labels from the face recognition dataset. Finally, the AC-based CNN is fine-tuned using an updated training set and a combined loss function. Our framework is evaluated on several challenging facial expression recognition datasets as well as a self-collected dataset. In the context of facial expression recognition applied to education, our proposed method achieved an impressive accuracy of 95.87% on the self-collected dataset, surpassing existing methods.
Affiliation(s)
- Bei Fang
- Key Laboratory of Modern Teaching Technology, Ministry of Education, Shaanxi Normal University, Xi’an 710062, China
- Yujie Zhao
- Department of Information Construction and Management, Shaanxi Normal University, Xi’an 710061, China
- Guangxin Han
- Key Laboratory of Modern Teaching Technology, Ministry of Education, Shaanxi Normal University, Xi’an 710062, China
- Juhou He
- Key Laboratory of Modern Teaching Technology, Ministry of Education, Shaanxi Normal University, Xi’an 710062, China
14. Bellamkonda S, Gopalan NP, Mala C, Settipalli L. Facial expression recognition on partially occluded faces using component based ensemble stacked CNN. Cogn Neurodyn 2023;17:985-1008. PMID: 37522034; PMCID: PMC10374495; DOI: 10.1007/s11571-022-09879-y.
Abstract
Facial Expression Recognition (FER) is the basis for many applications, including human-computer interaction and surveillance. While developing such applications, it is imperative to understand human emotions for better interaction with machines. Among the many FER models developed so far, Ensemble Stacked Convolutional Neural Networks (ES-CNN) have shown an empirical impact in improving the performance of FER on static images. However, existing ES-CNN-based FER models, trained with features extracted from the entire face, are unable to address ambient factors such as pose, illumination, and occlusion. To mitigate the reduced performance of ES-CNN on partially occluded faces, a Component-based ES-CNN (CES-CNN) is proposed. CES-CNN applies ES-CNN to action units of individual face components, such as the eyes, eyebrows, nose, cheeks, mouth, and glabella, each as one subnet of the network. A max-voting-based ensemble classifier combines the decisions of the subnets to obtain optimized recognition accuracy. The proposed CES-CNN is validated by experiments on benchmark datasets, and its performance is compared with state-of-the-art models. The experimental results show that the proposed model significantly enhances recognition accuracy compared to existing models.
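Max-voting over component subnets reduces to a mode over per-subnet class predictions, as in this sketch (the subnet count and outputs are placeholders):

```python
# Sketch of max-voting over per-component subnet predictions.
import torch

def max_vote(subnet_logits: list) -> torch.Tensor:
    """subnet_logits: list of (B, C) outputs, one per facial component (eyes, nose, ...)."""
    votes = torch.stack([l.argmax(dim=1) for l in subnet_logits], dim=1)  # (B, n_subnets)
    return votes.mode(dim=1).values                                       # majority class

preds = max_vote([torch.randn(4, 7) for _ in range(6)])   # six component subnets
```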
Affiliation(s)
- Sivaiah Bellamkonda
- Department of Computer Applications, National Institute of Technology, Tiruchirappalli, Tamilnadu 620015, India
- N. P. Gopalan
- Department of Computer Applications, National Institute of Technology, Tiruchirappalli, Tamilnadu 620015, India
- C. Mala
- Department of Computer Science and Engineering, National Institute of Technology, Tiruchirappalli, Tamilnadu 620015, India
- Lavanya Settipalli
- Department of Computer Applications, National Institute of Technology, Tiruchirappalli, Tamilnadu 620015, India
15. Yao H, Yang X, Chen D, Wang Z, Tian Y. Facial Expression Recognition Based on Fine-Tuned Channel-Spatial Attention Transformer. Sensors (Basel) 2023;23:6799. PMID: 37571582; PMCID: PMC10422316; DOI: 10.3390/s23156799.
Abstract
Facial expressions help individuals convey their emotions. In recent years, thanks to the development of computer vision technology, facial expression recognition (FER) has become a research hotspot and made remarkable progress. However, human faces in real-world environments are affected by various unfavorable factors, such as facial occlusion and head pose changes, which are seldom encountered in controlled laboratory settings and often reduce expression recognition accuracy. Inspired by the recent success of transformers in many computer vision tasks, we propose the fine-tuned channel-spatial attention transformer (FT-CSAT) to improve FER accuracy in the wild. FT-CSAT consists of two crucial components: a channel-spatial attention module and a fine-tuning module. In the channel-spatial attention module, the feature map passes through the channel attention module and the spatial attention module sequentially, so the final output feature map effectively incorporates both channel and spatial information. Consequently, the network becomes adept at focusing on relevant and meaningful features associated with facial expressions. To further improve performance while controlling the number of additional parameters, we employ a fine-tuning method. Extensive experimental results demonstrate that FT-CSAT outperforms state-of-the-art methods on two benchmark datasets, RAF-DB and FERPlus, with recognition accuracies of 88.61% and 89.26%, respectively. Furthermore, to evaluate the robustness of FT-CSAT under facial occlusion and head pose changes, we test on the Occlusion-RAF-DB and Pose-RAF-DB datasets; the results also show the superior recognition performance of the proposed method under such conditions.
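A sequential channel-then-spatial attention block of the kind described can be sketched as follows; the reduction ratio and kernel size are conventional CBAM-style defaults, assumed rather than taken from FT-CSAT.

```python
# Sketch of sequential channel -> spatial attention; hyperparameters are assumptions.
import torch
import torch.nn as nn

class ChannelSpatialAttention(nn.Module):
    def __init__(self, channels: int = 64, reduction: int = 8):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels),
        )
        self.spatial = nn.Conv2d(2, 1, kernel_size=7, padding=3)

    def forward(self, x):
        # channel attention from average- and max-pooled descriptors
        gate = torch.sigmoid(self.mlp(x.mean(dim=(2, 3))) + self.mlp(x.amax(dim=(2, 3))))
        x = x * gate[:, :, None, None]
        # spatial attention from channel-pooled maps
        s = torch.cat([x.mean(dim=1, keepdim=True), x.amax(dim=1, keepdim=True)], dim=1)
        return x * torch.sigmoid(self.spatial(s))

y = ChannelSpatialAttention()(torch.randn(2, 64, 14, 14))
```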
Affiliation(s)
- Yuan Tian
- Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan 430079, China
16. Chen X, Zheng X, Sun K, Liu W, Zhang Y. Self-supervised vision transformer-based few-shot learning for facial expression recognition. Inf Sci (N Y) 2023. DOI: 10.1016/j.ins.2023.03.105.
17. Raimundo A, Pavia JP, Sebastião P, Postolache O. YOLOX-Ray: An Efficient Attention-Based Single-Staged Object Detector Tailored for Industrial Inspections. Sensors (Basel) 2023;23:4681. PMID: 37430595; DOI: 10.3390/s23104681.
Abstract
Industrial inspection is crucial for maintaining quality and safety in industrial processes. Deep learning models have recently demonstrated promising results in such tasks. This paper proposes YOLOX-Ray, an efficient new deep learning architecture tailored for industrial inspection. YOLOX-Ray is based on the You Only Look Once (YOLO) object detection algorithms and integrates the SimAM attention mechanism for improved feature extraction in the Feature Pyramid Network (FPN) and Path Aggregation Network (PAN). Moreover, it also employs the Alpha-IoU cost function for enhanced small-scale object detection. YOLOX-Ray's performance was assessed in three case studies: hotspot detection, infrastructure crack detection and corrosion detection. The architecture outperforms all other configurations, achieving mAP50 values of 89%, 99.6% and 87.7%, respectively. For the most challenging metric, mAP50:95, the achieved values were 44.7%, 66.1% and 51.8%, respectively. A comparative analysis demonstrated the importance of combining the SimAM attention mechanism with Alpha-IoU loss function for optimal performance. In conclusion, YOLOX-Ray's ability to detect and to locate multi-scale objects in industrial environments presents new opportunities for effective, efficient and sustainable inspection processes across various industries, revolutionizing the field of industrial inspections.
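SimAM is parameter-free and compact enough to sketch in full; the code below follows the published SimAM energy formula, with the commonly used λ default assumed.

```python
# Parameter-free SimAM attention (sketch of the published formula).
import torch

def simam(x: torch.Tensor, lam: float = 1e-4) -> torch.Tensor:
    """x: (B, C, H, W); weights each activation by an inverse energy score."""
    n = x.shape[2] * x.shape[3] - 1
    d = (x - x.mean(dim=(2, 3), keepdim=True)).pow(2)   # deviation from channel mean
    v = d.sum(dim=(2, 3), keepdim=True) / n             # per-channel variance estimate
    e_inv = d / (4 * (v + lam)) + 0.5                   # inverse energy per position
    return x * torch.sigmoid(e_inv)

y = simam(torch.randn(1, 32, 40, 40))
```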
Affiliation(s)
- António Raimundo
- Instituto de Telecomunicações (IT), Av. Rovisco Pais, 1, 1049-001 Lisboa, Portugal
- Department of Information Science and Technology, Iscte-Instituto Universitário de Lisboa, Av. das Forças Armadas, 1649-026 Lisboa, Portugal
- João Pedro Pavia
- Instituto de Telecomunicações (IT), Av. Rovisco Pais, 1, 1049-001 Lisboa, Portugal
- COPELABS, Universidade Lusófona, Campo Grande 376, 1749-024 Lisboa, Portugal
- Pedro Sebastião
- Instituto de Telecomunicações (IT), Av. Rovisco Pais, 1, 1049-001 Lisboa, Portugal
- Department of Information Science and Technology, Iscte-Instituto Universitário de Lisboa, Av. das Forças Armadas, 1649-026 Lisboa, Portugal
- Octavian Postolache
- Instituto de Telecomunicações (IT), Av. Rovisco Pais, 1, 1049-001 Lisboa, Portugal
- Department of Information Science and Technology, Iscte-Instituto Universitário de Lisboa, Av. das Forças Armadas, 1649-026 Lisboa, Portugal
18. Qu Z, Niu D. Leveraging ResNet and label distribution in advanced intelligent systems for facial expression recognition. Math Biosci Eng 2023;20:11101-11115. PMID: 37322973; DOI: 10.3934/mbe.2023491.
Abstract
With the development of artificial intelligence (AI), facial expression recognition (FER) is a hot topic in computer vision. Many existing works employ a single label for FER and therefore do not consider the label distribution problem; in addition, some discriminative features cannot be captured well. To overcome these problems, we propose a novel framework, ResFace, for FER. It has the following modules: 1) a local feature extraction module, in which ResNet-18 and ResNet-50 are used to extract local features for the subsequent feature aggregation; 2) a channel feature aggregation module, in which a channel-spatial feature aggregation method is adopted to learn high-level features for FER; 3) a compact feature aggregation module, in which several convolutional operations are used to learn the label distributions that interact with the softmax layer. Extensive experiments conducted on the FER+ and Real-world Affective Faces databases demonstrate that the proposed approach obtains comparable performance: 89.87% and 88.38%, respectively.
Affiliation(s)
- Zhenggeng Qu
- College of Mathematics and Computer Application, Shangluo University, Shaanxi 726000, China
- Engineering Research Center of Qinling Health Welfare Big Data, Shaanxi 726000, China
- Danying Niu
- Shangluo Central Hospital, Shaanxi 726000, China
19. Liao J, Lin Y, Ma T, He S, Liu X, He G. Facial Expression Recognition Methods in the Wild Based on Fusion Feature of Attention Mechanism and LBP. Sensors (Basel) 2023;23:4204. PMID: 37177408; PMCID: PMC10180539; DOI: 10.3390/s23094204.
Abstract
Facial expression recognition methods play a vital role in human-computer interaction and other fields, but recognition in the wild must contend with occlusion, illumination, and pose changes, as well as category imbalance across datasets, which lead to large variations in recognition rates and low accuracy for some expression categories. This study introduces RCL-Net, a method for recognizing facial expressions in the wild based on an attention mechanism and LBP feature fusion. The structure consists of two main branches: a ResNet-CBAM residual attention branch and a local binary pattern (LBP) feature extraction branch. First, by merging the residual network with a hybrid attention mechanism, the residual attention network emphasizes the local detail information of facial expressions; significant characteristics are retrieved from both the channel and spatial dimensions to build the residual attention classification model. Second, we present a locally improved residual network attention model: LBP features are introduced in the feature extraction stage to capture texture information from expression images, emphasizing facial feature information and enhancing the recognition accuracy of the model. Finally, experimental validation is performed on the FER2013, FERPlus, CK+, and RAF-DB datasets, and the results demonstrate that the proposed method has better generalization capability and robustness in both laboratory-controlled and in-the-wild environments than recent methods.
Affiliation(s)
- Jun Liao
- Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- College of Mechanical Engineering, Chongqing University of Technology, Chongqing 400054, China
- Chongqing Key Laboratory of Artificial Intelligence and Service Robot Control Technology, Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Yuanchang Lin
- Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Chongqing Key Laboratory of Artificial Intelligence and Service Robot Control Technology, Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Tengyun Ma
- Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Chongqing Key Laboratory of Artificial Intelligence and Service Robot Control Technology, Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Songxiying He
- Chongqing Key Laboratory of Artificial Intelligence and Service Robot Control Technology, Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Xiaofang Liu
- Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Chongqing Key Laboratory of Artificial Intelligence and Service Robot Control Technology, Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Guotian He
- Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
- Chongqing Key Laboratory of Artificial Intelligence and Service Robot Control Technology, Chongqing Institute of Green Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
20. Rasmussen SHR, Ludeke SG, Klemmensen R. Using deep learning to predict ideology from facial photographs: expressions, beauty, and extra-facial information. Sci Rep 2023;13:5257. PMID: 37002240; PMCID: PMC10066183; DOI: 10.1038/s41598-023-31796-1.
Abstract
Deep learning techniques can use public data such as facial photographs to predict sensitive personal information, but little is known about what information contributes to the predictive success of these techniques. This lack of knowledge limits both the public's ability to protect against revealing unintended information as well as the scientific utility of deep learning results. We combine convolutional neural networks, heat maps, facial expression coding, and classification of identifiable features such as masculinity and attractiveness in our study of political ideology in 3323 Danes. Predictive accuracy from the neural network was 61% in each gender. Model-predicted ideology correlated with aspects of both facial expressions (happiness vs neutrality) and morphology (specifically, attractiveness in females). Heat maps highlighted the informativeness of areas both on and off the face, pointing to methodological refinements and the need for future research to better understand the significance of certain facial areas.
Affiliation(s)
- Steven G. Ludeke
- Department of Psychology, University of Southern Denmark, Odense, Denmark
- Robert Klemmensen
- Department of Political Science, Lund University, Lund, Sweden
21. Qiu S, Zhao G, Li X, Wang X. Facial Expression Recognition Using Local Sliding Window Attention. Sensors (Basel) 2023;23:3424. PMID: 37050483; PMCID: PMC10098964; DOI: 10.3390/s23073424.
Abstract
There are problems associated with facial expression recognition (FER), such as facial occlusion and head pose variations. These two problems lead to incomplete facial information in images, making feature extraction extremely difficult. Most current methods use prior knowledge or fixed-size patches to perform local cropping, thereby enhancing the ability to acquire fine-grained features. However, the former requires extra data processing work and is prone to errors, while the latter destroys the integrity of local features. In this paper, we propose a local Sliding Window Attention Network (SWA-Net) for FER. Specifically, we propose a sliding window strategy for feature-level cropping, which preserves the integrity of local features and does not require complex preprocessing. Moreover, a local feature enhancement module mines fine-grained features with intraclass semantics through a multiscale deep network, and an adaptive local feature selection module prompts the model to find the most essential local features. Extensive experiments demonstrate that our SWA-Net model achieves performance comparable to that of state-of-the-art methods, with scores of 90.03% on RAF-DB, 89.22% on FERPlus, and 63.97% on AffectNet.
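Feature-level sliding-window cropping maps naturally onto torch.Tensor.unfold; the window size and stride below are illustrative assumptions.

```python
# Sketch of feature-level sliding-window cropping with Tensor.unfold.
import torch

def sliding_windows(fmap: torch.Tensor, win: int = 4, stride: int = 2) -> torch.Tensor:
    """fmap: (B, C, H, W) -> (B, n_windows, C, win, win) overlapping local crops."""
    patches = fmap.unfold(2, win, stride).unfold(3, win, stride)  # (B, C, nH, nW, win, win)
    B, C, nH, nW, _, _ = patches.shape
    return patches.permute(0, 2, 3, 1, 4, 5).reshape(B, nH * nW, C, win, win)

crops = sliding_windows(torch.randn(2, 64, 14, 14))   # 36 windows per feature map
```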
Affiliation(s)
- Shuang Qiu
- School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
- Beijing Key Laboratory of Robot Bionics and Function Research, Beijing 100044, China
- Guangzhe Zhao
- School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
- Beijing Key Laboratory of Robot Bionics and Function Research, Beijing 100044, China
- Xiao Li
- School of Electronics and Information Engineering, Zhongyuan University of Technology, Zhengzhou 450007, China
- Xueping Wang
- School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
- Beijing Key Laboratory of Robot Bionics and Function Research, Beijing 100044, China
22. Shahid AR, Yan H. SqueezExpNet: Dual-stage convolutional neural network for accurate facial expression recognition with attention mechanism. Knowl Based Syst 2023. DOI: 10.1016/j.knosys.2023.110451.
23. Kim J, Lee D. Facial Expression Recognition Robust to Occlusion and to Intra-Similarity Problem Using Relevant Subsampling. Sensors (Basel) 2023;23:2619. PMID: 36904823; PMCID: PMC10007059; DOI: 10.3390/s23052619.
Abstract
This paper proposes a facial expression recognition (FER) method for in-the-wild datasets. In particular, it addresses two issues: occlusion and the intra-similarity problem. The attention mechanism enables the network to use the most relevant areas of facial images for specific expressions, and the triplet loss function addresses the intra-similarity problem, in which the same expression on different faces sometimes fails to be aggregated (and vice versa). The proposed approach is robust to occlusion, using a spatial transformer network (STN) with an attention mechanism to focus on the facial regions that contribute most to particular expressions, e.g., anger, contempt, disgust, fear, joy, sadness, and surprise. In addition, the STN model is combined with the triplet loss function to improve the recognition rate, outperforming existing approaches that employ cross-entropy or that rely solely on deep neural networks or classical methods. The triplet loss module alleviates the limitations of the intra-similarity problem, leading to further improvement in classification. Experimental results substantiate the proposed approach, which outperforms existing recognition rates in more practical cases such as occlusion: its accuracy is more than 2.09% higher than existing FER results on the CK+ dataset and 0.48% higher than a modified ResNet model on the FER2013 dataset.
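Triplet-loss training of the kind described is supported directly by PyTorch; the embeddings below are random placeholders for STN outputs.

```python
# Sketch of triplet-loss training: same expression pulled together, different pushed apart.
import torch
import torch.nn as nn

triplet = nn.TripletMarginLoss(margin=0.2)           # margin is an assumed hyperparameter

anchor   = torch.randn(8, 128, requires_grad=True)   # embeddings of an expression
positive = torch.randn(8, 128)                       # same expression, different face
negative = torch.randn(8, 128)                       # different expression
loss = triplet(anchor, positive, negative)
loss.backward()                                      # gradients flow to the embedding network
```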
24. SoftClusterMix: learning soft boundaries for empirical risk minimization. Neural Comput Appl 2023. DOI: 10.1007/s00521-023-08338-x.
25. Gupta S, Kumar P, Tekchandani R. A multimodal facial cues based engagement detection system in e-learning context using deep learning approach. Multimed Tools Appl 2023;82:1-27. PMID: 36789011; PMCID: PMC9911959; DOI: 10.1007/s11042-023-14392-3.
Abstract
Due to the COVID-19 crisis, the education sector has shifted to a virtual environment. Monitoring engagement levels and providing regular feedback during e-classes is a major concern, as this facility is missing in the e-learning environment, where the teacher cannot physically observe students. The present study proposes an engagement detection system that ensures students receive immediate feedback during e-learning. The proposed system analyses the student's behaviour throughout the e-learning session. The novel approach evaluates three modalities based on the student's behaviour, namely facial expression, eye blink count, and head movement, from live video streams to predict student engagement. The system is implemented with deep learning approaches such as VGG-19 and ResNet-50 for facial emotion recognition and a facial-landmark approach for eye-blink and head-movement detection. The results from the different modalities are combined to determine an engagement index (EI), from which an engaged or disengaged state is predicted. The study suggests that the proposed facial-cues-based multimodal system accurately determines student engagement in real time. The experiments achieved an accuracy of 92.58%, showing that the proposed engagement detection approach significantly outperforms existing approaches.
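Blink counting from facial landmarks is commonly done with the eye aspect ratio (EAR); assuming that is the landmark measure intended here (the abstract does not name one), a sketch:

```python
# Sketch of blink counting via the eye aspect ratio (EAR) on six eye landmarks.
import numpy as np

def eye_aspect_ratio(eye: np.ndarray) -> float:
    """eye: (6, 2) landmark coordinates ordered around the eye contour."""
    v1 = np.linalg.norm(eye[1] - eye[5])   # vertical distances
    v2 = np.linalg.norm(eye[2] - eye[4])
    h = np.linalg.norm(eye[0] - eye[3])    # horizontal distance
    return (v1 + v2) / (2.0 * h)           # drops sharply when the eye closes

def count_blinks(ear_series, closed_thresh: float = 0.21) -> int:
    closed = [e < closed_thresh for e in ear_series]
    # count open -> closed transitions as blinks
    return sum(1 for a, b in zip(closed, closed[1:]) if b and not a)

blinks = count_blinks([0.3, 0.3, 0.18, 0.17, 0.3, 0.3, 0.19, 0.3])   # -> 2
```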
Collapse
Affiliation(s)
- Swadha Gupta
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, 147001 Punjab India
| | - Parteek Kumar
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, 147001 Punjab India
| | - Rajkumar Tekchandani
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, 147001 Punjab India
| |
Collapse
|
26
|
Zhang Z, Tian X, Zhang Y, Guo K, Xu X. Enhanced Discriminative Global-Local Feature Learning with Priority for Facial Expression Recognition. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
|
27
|
Zhang X, Yan X. Predicting collision cases at unsignalized intersections using EEG metrics and driving simulator platform. ACCIDENT; ANALYSIS AND PREVENTION 2023; 180:106910. [PMID: 36525717 DOI: 10.1016/j.aap.2022.106910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 10/16/2022] [Accepted: 11/25/2022] [Indexed: 06/17/2023]
Abstract
Collisions at unsignalized intersections are among the most dangerous accidents worldwide. Identifying road hazards and predicting potential intersection collisions ahead are challenging problems in traffic safety. This paper studies the feasibility of EEG metrics for forecasting road hazards and presents an improved neural network model to predict intersection collisions based on EEG metrics and driving behavior. EEG metrics show significant differences between collision and non-collision cases, indicating that they can serve as effective indicators of collision probability. Drivers with higher relative power in the fast frequency bands (alpha and beta) and lower relative power in the slow frequency bands (delta and theta) are more likely to have conflicts. Predictions from three machine learning models (multi-layer perceptron (MLP), logistic regression (LR), and random forest (RF)) based on three input datasets (EEG metrics only, driving behavior only, and EEG metrics combined with driving behavior) are compared. The results show that for single-time-point prediction, the MLP model has the highest accuracy of the three, and the model based solely on EEG metrics is more accurate than those based on driving behavior or the combined dataset. For multi-time-point prediction, however, the accuracy of the MLP is only 73.9%, worse than LR and RF. We improved the MLP model by adding an attention mechanism layer and using a random forest model to select important features; as a consequence, the accuracy is greatly improved and reaches 88%. This study demonstrates the importance and feasibility of EEG signals for identifying unsafe drivers ahead of time. The improved neural network model can help reduce intersection accidents and improve traffic safety.
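The two reported improvements, random-forest feature selection feeding a neural classifier, can be sketched as follows; the synthetic data, feature count, and hyperparameters are placeholders rather than the study's settings, and the attention layer itself is omitted for brevity.

```python
# A rough sketch: random-forest feature selection followed by an MLP,
# on synthetic stand-in data (NOT the study's EEG recordings).
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 32))    # 32 EEG-band + driving-behavior features
y = rng.integers(0, 2, size=400)  # collision vs. non-collision labels

# Step 1: rank features by random-forest importance, keep the top 10
rf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)
top = np.argsort(rf.feature_importances_)[::-1][:10]

# Step 2: train the MLP on the reduced feature set
Xtr, Xte, ytr, yte = train_test_split(X[:, top], y, random_state=0)
mlp = MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500,
                    random_state=0).fit(Xtr, ytr)
print("held-out accuracy:", mlp.score(Xte, yte))
```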
Collapse
Affiliation(s)
- Xinran Zhang
- China North Artificial Intelligence & Innovation Research Institute, Beijing 100072, China.
| | - Xuedong Yan
- School of Traffic and Transportation, Beijing Jiaotong University, Beijing 100044, China.
| |
Collapse
|
28
|
Mixing Global and Local Features for Long-Tailed Expression Recognition. INFORMATION 2023. [DOI: 10.3390/info14020083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Large-scale facial expression datasets are primarily composed of real-world facial expressions. Expression occlusion and large-angle faces are two important problems affecting the accuracy of expression recognition. Moreover, because facial expression data in natural scenes commonly follow a long-tailed distribution, trained models tend to recognize the majority classes well while recognizing the minority classes with low accuracy. To improve the robustness and accuracy of expression recognition networks in uncontrolled environments, this paper proposes an efficient network structure based on an attention mechanism that fuses global and local features (AM-FGL). We use a channel-spatial model and local-feature convolutional neural networks to perceive the global and local features of the face, respectively. Because in-the-wild expression datasets commonly follow a long-tailed distribution in which neutral and happy expressions dominate the head classes, trained models exhibit low recognition accuracy for tail expressions such as fear and disgust. Building on CutMix, a novel data enhancement method proposed in other fields, a simple and effective data-balancing method is proposed (BC-EDB). The key idea is to paste key pixels (around the eyes, mouth, and nose), which reduces the influence of overfitting. Our proposed method focuses on the recognition of tail expressions, occluded expressions, and large-angle faces, and achieves state-of-the-art results on Occlusion-RAF-DB, 30° Pose-RAF-DB, and 45° Pose-RAF-DB with accuracies of 86.96%, 89.74%, and 88.53%, respectively.
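The BC-EDB balancing step can be illustrated with a small CutMix-style sketch: a key facial region from a tail-class sample is pasted onto another image and labels are mixed by area. The fixed region coordinates below are assumptions for an aligned 48x48 crop, not the paper's settings.

```python
# An illustrative CutMix-style paste of a key facial region from a
# tail-class sample; the eye-region box is an assumption.
import numpy as np

def paste_key_region(base_img, tail_img, box=(10, 22, 14, 34)):
    """Copy the pixels inside `box` (y0, y1, x0, x1) from the tail-class
    image onto the base image, returning the mixed sample and the area
    ratio that CutMix-style label mixing would use."""
    y0, y1, x0, x1 = box
    mixed = base_img.copy()
    mixed[y0:y1, x0:x1] = tail_img[y0:y1, x0:x1]
    lam = 1.0 - ((y1 - y0) * (x1 - x0)) / float(base_img.shape[0] * base_img.shape[1])
    return mixed, lam  # label = lam * base_label + (1 - lam) * tail_label

base = np.zeros((48, 48), dtype=np.uint8)
tail = np.full((48, 48), 255, dtype=np.uint8)
mixed, lam = paste_key_region(base, tail)
print(lam)  # fraction of the base image retained (~0.896 here)
```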
Collapse
|
29
|
Eyiokur FI, Kantarcı A, Erakın ME, Damer N, Ofli F, Imran M, Križaj J, Salah AA, Waibel A, Štruc V, Ekenel HK. A survey on computer vision based human analysis in the COVID-19 era. IMAGE AND VISION COMPUTING 2023; 130:104610. [PMID: 36540857 PMCID: PMC9755265 DOI: 10.1016/j.imavis.2022.104610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 12/11/2022] [Indexed: 06/17/2023]
Abstract
The emergence of COVID-19 has had a global and profound impact, not only on society as a whole, but also on the lives of individuals. Various prevention measures were introduced around the world to limit the transmission of the disease, including face masks, mandates for social distancing and regular disinfection in public spaces, and the use of screening applications. These developments also triggered the need for novel and improved computer vision techniques capable of (i) providing support to the prevention measures through an automated analysis of visual data, on the one hand, and (ii) facilitating normal operation of existing vision-based services, such as biometric authentication schemes, on the other. Especially important here are computer vision techniques that focus on the analysis of people and faces in visual data and have been affected the most by the partial occlusions introduced by the mandates for facial masks. Such computer vision based human analysis techniques include face and face-mask detection approaches, face recognition techniques, crowd counting solutions, age and expression estimation procedures, models for detecting face-hand interactions and many others, and have seen considerable attention over recent years. The goal of this survey is to provide an introduction to the problems induced by COVID-19 into such research and to present a comprehensive review of the work done in the computer vision based human analysis field. Particular attention is paid to the impact of facial masks on the performance of various methods and recent solutions to mitigate this problem. Additionally, a detailed review of existing datasets useful for the development and evaluation of methods for COVID-19 related applications is also provided. Finally, to help advance the field further, a discussion on the main open challenges and future research directions is given at the end of the survey. This work is intended to have a broad appeal and be useful not only for computer vision researchers but also for the general public.
Collapse
Affiliation(s)
- Fevziye Irem Eyiokur
- Institute for Anthropomatics and Robotics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Alperen Kantarcı
- Department of Computer Engineering, Istanbul Technical University, Istanbul, Turkey
| | - Mustafa Ekrem Erakın
- Department of Computer Engineering, Istanbul Technical University, Istanbul, Turkey
| | - Naser Damer
- Fraunhofer Institute for Computer Graphics Research IGD, Darmstadt, Germany
- Department of Computer Science, TU Darmstadt, Darmstadt, Germany
| | - Ferda Ofli
- Qatar Computing Research Institute, HBKU, Doha, Qatar
| | | | - Janez Križaj
- Faculty of Electrical Engineering, University of Ljubljana, Tržaška cesta 25, 1000 Ljubljana, Slovenia
| | - Albert Ali Salah
- Department of Information and Computing Sciences, Utrecht University, Utrecht, The Netherlands
- Department of Computer Engineering, Boğaziçi University, Istanbul, Turkey
| | - Alexander Waibel
- Institute for Anthropomatics and Robotics, Karlsruhe Institute of Technology, Karlsruhe, Germany
- Carnegie Mellon University, Pittsburgh, United States
| | - Vitomir Štruc
- Faculty of Electrical Engineering, University of Ljubljana, Tržaška cesta 25, 1000 Ljubljana, Slovenia
| | - Hazım Kemal Ekenel
- Department of Computer Engineering, Istanbul Technical University, Istanbul, Turkey
| |
Collapse
|
30
|
Fang J, Lin X, Liu W, An Y, Sun H. Triple attention feature enhanced pyramid network for facial expression recognition. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-222252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The purpose of facial expression recognition is to capture facial expression features from static pictures or videos and to provide the most intuitive information about changes in human emotion for artificial intelligence devices to use effectively in human-computer interaction. The main current challenges are the excessive loss of locally valid information and the irreversible degradation of information at different expression semantic scales as network depth increases. To address these problems, an enhanced pyramidal network model combining triple attention mechanisms is designed in this paper. First, three attention mechanism modules, i.e., CBAM, SK, and SE, are embedded into the backbone network in stages, and key features are sensed by mining spatial or channel information, which effectively reduces the information loss caused by network depth. Then, a pyramid network is used as an extension of the backbone to obtain semantic information about expression features across scales. The recognition accuracy reaches 96.25% and 73.61% on the CK+ and FER2013 expression datasets, respectively. Furthermore, comparison with other current advanced methods shows that the proposed architecture, combining the triple attention mechanism with multi-scale cross-information fusion, can simultaneously maintain and improve the information-mining ability and recognition accuracy of the facial expression recognition model.
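Of the three attention modules named above, the squeeze-and-excitation (SE) block is the simplest to sketch; a minimal PyTorch version follows, with channel count and reduction ratio chosen for illustration rather than taken from the paper.

```python
# A compact sketch of an SE channel-attention block, one of the three
# attention modules named above; dimensions are illustrative.
import torch
import torch.nn as nn

class SEBlock(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)  # squeeze: global context
        self.fc = nn.Sequential(             # excitation: channel gates
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                          # reweight channels

x = torch.randn(2, 64, 12, 12)
print(SEBlock(64)(x).shape)  # torch.Size([2, 64, 12, 12])
```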
Collapse
Affiliation(s)
- Jian Fang
- School of Mechanical and Electrical Engineering, Changchun University of Technology, Changchun, China
- Jilin Communications Polytechnic, Changchun, China
| | - Xiaomei Lin
- School of Electronics and Electrical Engineering, Changchun University of Technology, Changchun, China
| | - Weida Liu
- School of Electrical and Information Engineering, Jilin Engineering Normal University, Changchun, China
| | - Yi An
- School of Electrical and Information Engineering, Jilin Engineering Normal University, Changchun, China
| | - Haoran Sun
- Jilin Communications Polytechnic, Changchun, China
| |
Collapse
|
31
|
Mukhiddinov M, Djuraev O, Akhmedov F, Mukhamadiyev A, Cho J. Masked Face Emotion Recognition Based on Facial Landmarks and Deep Learning Approaches for Visually Impaired People. SENSORS (BASEL, SWITZERLAND) 2023; 23:1080. [PMID: 36772117 PMCID: PMC9921901 DOI: 10.3390/s23031080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 01/10/2023] [Accepted: 01/15/2023] [Indexed: 06/18/2023]
Abstract
Current artificial intelligence systems for determining a person's emotions rely heavily on lip and mouth movement and on other facial features such as the eyebrows, eyes, and forehead. Furthermore, low-light images are typically classified incorrectly because of the dark region around the eyes and eyebrows. In this work, we propose a facial emotion recognition method for masked facial images that uses low-light image enhancement and feature analysis of the upper face with a convolutional neural network. The proposed approach employs the AffectNet image dataset, which includes eight types of facial expressions and 420,299 images. Initially, the lower part of the input facial image is covered by a synthetic mask. Boundary and regional representation methods are used to indicate the head and the upper features of the face. Secondly, we adopt a feature extraction strategy based on facial landmark detection using the features of the partially covered masked face. Finally, the extracted features, the coordinates of the identified landmarks, and histograms of oriented gradients are incorporated into the classification procedure using a convolutional neural network. An experimental evaluation shows that the proposed method surpasses others, achieving an accuracy of 69.3% on the AffectNet dataset.
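The feature-assembly step described here, landmark coordinates concatenated with histogram-of-oriented-gradients descriptors, might look roughly like the sketch below; the random "landmarks" stand in for a real detector's output, and the HOG parameters are assumptions.

```python
# A hedged sketch of assembling landmark + HOG features for the visible
# upper face; the landmark array is a placeholder for a real detector.
import numpy as np
from skimage.feature import hog

upper_face = np.random.rand(24, 48)  # top half of a 48x48 face crop

# HOG descriptor of the visible (upper) region
hog_vec = hog(upper_face, orientations=8, pixels_per_cell=(8, 8),
              cells_per_block=(2, 2))

# placeholder for eye/eyebrow landmark (x, y) pairs from a landmark model
landmarks = np.random.rand(20, 2).ravel()

# concatenated vector fed to the downstream classifier
features = np.concatenate([landmarks, hog_vec])
print(features.shape)
```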
Collapse
Affiliation(s)
- Mukhriddin Mukhiddinov
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| | - Oybek Djuraev
- Department of Hardware and Software of Control Systems in Telecommunication, Tashkent University of Information Technologies Named after Muhammad al-Khwarizmi, Tashkent 100084, Uzbekistan
| | - Farkhod Akhmedov
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| | - Abdinabi Mukhamadiyev
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| | - Jinsoo Cho
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| |
Collapse
|
32
|
Zhou S, Wu X, Jiang F, Huang Q, Huang C. Emotion Recognition from Large-Scale Video Clips with Cross-Attention and Hybrid Feature Weighting Neural Networks. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:1400. [PMID: 36674161 PMCID: PMC9859118 DOI: 10.3390/ijerph20021400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 01/06/2023] [Accepted: 01/07/2023] [Indexed: 06/17/2023]
Abstract
Human emotion is an important indicator of mental state, e.g., satisfaction or stress, and recognizing emotion from different media is essential for sequence analysis and for applications such as mental health assessment, job stress estimation, and tourist satisfaction assessment. Emotion recognition based on computer vision techniques, an important method for detecting emotion from visual media (e.g., images or videos) of human behavior with plentiful emotional cues, has been extensively investigated because of its significant applications. However, most existing models neglect inter-feature interaction and use simple concatenation for feature fusion, failing to capture the crucial complementary gains between face and context information in video clips, which is significant for addressing emotion confusion and emotion misunderstanding. Accordingly, to fully exploit the complementary information between face and context features, we present a novel cross-attention and hybrid feature weighting network for accurate emotion recognition from large-scale video clips; the proposed model consists of a dual-branch encoding (DBE) network, a hierarchical-attention encoding (HAE) network, and a deep fusion (DF) block. Specifically, the face and context encoding blocks in the DBE network generate the respective shallow features. The HAE network then uses the cross-attention (CA) block to capture the complementarity between facial expression features and their contexts via a cross-channel attention operation. An element recalibration (ER) block is introduced to revise the feature map of each channel by embedding global information. Moreover, the adaptive-attention (AA) block in the HAE network infers optimal feature fusion weights and obtains adaptive emotion features via a hybrid feature weighting operation. Finally, the DF block integrates these adaptive emotion features to predict an individual's emotional state. Extensive experimental results on the CAER-S dataset demonstrate the effectiveness of our method, showing its potential for analyzing tourist reviews with video clips, estimating job stress levels from visual emotional evidence, and assessing mental health with visual media.
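The cross-attention (CA) idea, face tokens attending to context tokens, can be sketched with PyTorch's stock multi-head attention; the dimensions and token counts below are illustrative, not the paper's.

```python
# A minimal sketch of cross-attention between face and context branches,
# using PyTorch's built-in multi-head attention; sizes are illustrative.
import torch
import torch.nn as nn

dim, heads = 128, 4
cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)

face = torch.randn(2, 49, dim)      # 7x7 face feature map as 49 tokens
context = torch.randn(2, 196, dim)  # 14x14 scene-context tokens

# queries come from the face branch, keys/values from the context branch
fused, _ = cross_attn(query=face, key=context, value=context)
print(fused.shape)  # torch.Size([2, 49, 128])
```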
Collapse
Affiliation(s)
| | | | | | - Qionghao Huang
- Key Laboratory of Intelligent Education Technology and Application of Zhejiang Province, Zhejiang Normal University, Jinhua 321004, China
| | | |
Collapse
|
33
|
CLC-Net: Contextual and Local Collaborative Network for Lesion Segmentation in Diabetic Retinopathy Images. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2023.01.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
34
|
Zhong J, Chen T, Yi L. Face expression recognition based on NGO-BILSTM model. Front Neurorobot 2023; 17:1155038. [PMID: 37025255 PMCID: PMC10072256 DOI: 10.3389/fnbot.2023.1155038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 03/03/2023] [Indexed: 04/08/2023] Open
Abstract
Introduction: Facial expression recognition has always been a hot topic in computer vision and artificial intelligence. In recent years, deep learning models have achieved good results in accurately recognizing facial expressions; the BiLSTM network is one such model. However, the BiLSTM network's performance depends largely on its hyperparameters, which makes their optimization a challenge. Methods: In this paper, a Northern Goshawk optimization (NGO) algorithm is proposed to optimize the hyperparameters of the BiLSTM network for facial expression recognition. The proposed methods were evaluated and compared with other methods on the FER2013, FERPlus, and RAF-DB datasets, taking into account factors such as cultural background, race, and gender. Results: The results show that the recognition accuracy of the model on the FER2013 and FERPlus datasets is much higher than that of the traditional VGG16 network. The recognition accuracy is 89.72% on the RAF-DB dataset, which is 5.45%, 9.63%, 7.36%, and 3.18% higher than that of the facial expression recognition algorithms DLP-CNN, gACNN, pACNN, and LDL-ALSG proposed in the last two years, respectively. Discussion: In conclusion, the NGO algorithm effectively optimized the hyperparameters of the BiLSTM network, improved facial expression recognition performance, and provides a new method for hyperparameter optimization of BiLSTM networks for facial expression recognition.
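To make the search concrete, here is a heavily simplified stand-in for the hyperparameter optimization loop; the Northern Goshawk update rules are replaced by plain random search, and the searched BiLSTM settings and the toy fitness function are assumptions, not the paper's protocol.

```python
# A simplified stand-in for population-based BiLSTM hyperparameter search;
# NGO's exploration/exploitation updates are replaced by random search.
import random
import torch
import torch.nn as nn

def build_bilstm(hidden, layers, dropout):
    return nn.LSTM(input_size=64, hidden_size=hidden, num_layers=layers,
                   dropout=dropout if layers > 1 else 0.0,
                   bidirectional=True, batch_first=True)

def fitness(hidden, layers, dropout):
    # stand-in for validation accuracy after a short training run
    model = build_bilstm(hidden, layers, dropout)
    out, _ = model(torch.randn(4, 10, 64))
    return -out.var().item()  # placeholder score, NOT real accuracy

best, best_score = None, float("-inf")
for _ in range(10):  # one candidate per "goshawk" per iteration
    cand = (random.choice([64, 128, 256]),
            random.choice([1, 2]),
            random.uniform(0.0, 0.5))
    score = fitness(*cand)
    if score > best_score:
        best, best_score = cand, score
print("best (hidden, layers, dropout):", best)
```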
Collapse
|
35
|
Gao H, Wu M, Chen Z, Li Y, Wang X, An S, Li J, Liu C. SSA-ICL: Multi-domain adaptive attention with intra-dataset continual learning for Facial expression recognition. Neural Netw 2023; 158:228-238. [PMID: 36473290 DOI: 10.1016/j.neunet.2022.11.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 09/26/2022] [Accepted: 11/15/2022] [Indexed: 11/27/2022]
Abstract
Facial expression recognition (FER) is a kind of affective computing that identifies the emotional state represented in facial photographs. Various methods have been developed for this critical task. In spite of this progress, three significant obstacles are not well addressed: the interaction between spatial action units, the inadequacy of semantic information about spectral expressions, and the unbalanced data distribution. In this work, we propose SSA-ICL, a novel approach for FER that solves these three difficulties within a coherent framework. To address the first two challenges, we develop a Spectral and Spatial Attention (SSA) module that integrates spectral semantics with spatial locations to improve model performance. We provide an Intra-dataset Continual Learning (ICL) module to combat the long-tail distribution of FER datasets. By subdividing a single long-tail dataset into multiple sub-datasets, ICL repeatedly trains well-balanced representations on each subset and finally develops an independent classifier. We performed extensive experiments on two publicly available datasets, AffectNet and RAF-DB. In comparison to existing attention modules, our SSA achieves an accuracy improvement of 3.8%∼6.7% in testing. Meanwhile, the proposed SSA-ICL achieves performance superior or comparable to state-of-the-art FER methods (65.78% on AffectNet and 89.44% on RAF-DB).
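The ICL partitioning idea, splitting a long-tailed dataset into better-balanced sub-datasets, can be sketched as follows; the chunking rule below is an assumption, and the paper's exact partitioning scheme may differ.

```python
# An illustrative split of a long-tailed label set into sub-datasets whose
# per-class counts are capped, so each subset is better balanced.
from collections import defaultdict

def balanced_subsets(labels, cap=100):
    """Group sample indices so each subset holds at most `cap` samples per
    class; head classes spill into later subsets, tail classes appear early."""
    per_class = defaultdict(list)
    for idx, y in enumerate(labels):
        per_class[y].append(idx)

    subsets = []
    round_i = 0
    while any(len(v) > round_i * cap for v in per_class.values()):
        chunk = []
        for idxs in per_class.values():
            chunk.extend(idxs[round_i * cap:(round_i + 1) * cap])
        subsets.append(chunk)
        round_i += 1
    return subsets

labels = [0] * 500 + [1] * 120 + [2] * 30  # head, middle, tail classes
parts = balanced_subsets(labels)
print([len(p) for p in parts])  # [230, 120, 100, 100, 100]
```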
Collapse
Affiliation(s)
- Hongxiang Gao
- State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China; Institute for Infocomm Research, A*STAR, Singapore, 138632, Singapore
| | - Min Wu
- Institute for Infocomm Research, A*STAR, Singapore, 138632, Singapore.
| | - Zhenghua Chen
- Institute for Infocomm Research, A*STAR, Singapore, 138632, Singapore
| | - Yuwen Li
- State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Xingyao Wang
- State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Shan An
- State Key Lab of Software Development Environment, Beihang University, JD Health International Inc., Beijing, 100191, China
| | - Jianqing Li
- State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China; School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, China
| | - Chengyu Liu
- State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China.
| |
Collapse
|
36
|
Facial Expression Recognition Based on Dual-Channel Fusion with Edge Features. Symmetry (Basel) 2022. [DOI: 10.3390/sym14122651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
In the era of artificial intelligence, emotion recognition is key work in human–computer interaction. Expressions contain plentiful information about human emotion. We found that the Canny edge detector can significantly improve facial expression recognition performance. A Canny-edge-detector-based dual-channel network using the OI-Network and EI-Net is proposed, which adds no redundant network layers or additional training. We discuss the fusion parameters α and β through ablation experiments. The method was verified on the CK+, FER2013, and RAF-DB datasets and achieved good results.
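A hedged sketch of the dual-channel fusion follows: one branch sees the original image (OI), the other its Canny edge map (EI), and their outputs are blended with weights α and β. The toy classifiers and the weight values are assumptions; only the α/β fusion step follows the abstract.

```python
# An illustrative dual-channel fusion: original image vs. Canny edge map,
# with alpha/beta blending of the two branches' outputs.
import cv2
import numpy as np

img = (np.random.rand(48, 48) * 255).astype(np.uint8)   # grayscale face
edges = cv2.Canny(img, threshold1=100, threshold2=200)  # EI-Net input

def toy_logits(x, seed):
    # placeholder for a branch network's class scores (7 expressions)
    rng = np.random.default_rng(seed + int(x.sum()) % 1000)
    return rng.normal(size=7)

alpha, beta = 0.6, 0.4  # fusion weights, chosen by ablation in the paper
fused = alpha * toy_logits(img, 0) + beta * toy_logits(edges, 1)
print("predicted class:", int(fused.argmax()))
```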
Collapse
|
37
|
Zhu Y, Wei L, Lang C, Li S, Feng S, Li Y. Fine-grained facial expression recognition via relational reasoning and hierarchical relation optimization. Pattern Recognit Lett 2022. [DOI: 10.1016/j.patrec.2022.10.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
38
|
Yang B, Wu J, Ikeda K, Hattori G, Sugano M, Iwasawa Y, Matsuo Y. Face-mask-aware Facial Expression Recognition based on Face Parsing and Vision Transformer. Pattern Recognit Lett 2022; 164:173-182. [PMID: 36407855 PMCID: PMC9645067 DOI: 10.1016/j.patrec.2022.11.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 10/05/2022] [Accepted: 11/04/2022] [Indexed: 11/11/2022]
Abstract
As wearing face masks has become an everyday practice due to the COVID-19 pandemic, facial expression recognition (FER) that takes face masks into account is now a problem that needs to be solved. In this paper, we propose a face parsing and vision Transformer-based method to improve the accuracy of face-mask-aware FER. First, to more precisely distinguish the unobstructed facial region from the parts of the face covered by a mask, we re-train a face-mask-aware face parsing model on an existing face parsing dataset automatically relabeled with face-mask pixel labels. Second, we propose a vision Transformer with a cross-attention mechanism as the FER classifier, capable of taking both occluded and non-occluded facial regions into account and reweighting these two parts automatically to obtain the best recognition performance. The proposed method outperforms existing state-of-the-art face-mask-aware FER methods, as well as other occlusion-aware FER methods, on two datasets that contain three kinds of emotions (M-LFW-FER and M-KDDI-FER) and two datasets that contain seven kinds of emotions (M-FER-2013 and M-CK+).
Collapse
Affiliation(s)
- Bo Yang
- KDDI Research, Inc., 2-1-15 Ohara, Fujimino-shi, Saitama, 356-8502, Japan
- The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8654 Japan
| | - Jianming Wu
- KDDI Research, Inc., 2-1-15 Ohara, Fujimino-shi, Saitama, 356-8502, Japan
| | - Kazushi Ikeda
- KDDI Research, Inc., 2-1-15 Ohara, Fujimino-shi, Saitama, 356-8502, Japan
| | - Gen Hattori
- KDDI Research, Inc., 2-1-15 Ohara, Fujimino-shi, Saitama, 356-8502, Japan
| | - Masaru Sugano
- KDDI Research, Inc., 2-1-15 Ohara, Fujimino-shi, Saitama, 356-8502, Japan
| | - Yusuke Iwasawa
- The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8654 Japan
| | - Yutaka Matsuo
- The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8654 Japan
| |
Collapse
|
39
|
Zhang Z, Sun X, Li J, Wang M. MAN: Mining Ambiguity and Noise for Facial Expression Recognition in the Wild. Pattern Recognit Lett 2022. [DOI: 10.1016/j.patrec.2022.10.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
40
|
Liu P, Lin Y, Meng Z, Lu L, Deng W, Zhou JT, Yang Y. Point Adversarial Self-Mining: A Simple Method for Facial Expression Recognition. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:12649-12660. [PMID: 34197333 DOI: 10.1109/tcyb.2021.3085744] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
In this article, we propose a simple yet effective approach, called point adversarial self-mining (PASM), to improve recognition accuracy in facial expression recognition (FER). Unlike previous works that focus on designing specific architectures or loss functions to solve this problem, PASM boosts network capability by simulating human learning processes: providing updated learning materials and guidance from more capable teachers. Specifically, to generate new learning materials, PASM leverages a point adversarial attack method and a trained teacher network to locate the most informative position related to the target task, generating harder learning samples to refine the network. The searched position is highly adaptive since it considers both the statistical information of each sample and the capability of the teacher network. Besides receiving new learning materials, the student network also receives guidance from the teacher network. After the student network finishes training, it changes roles and acts as a teacher, generating new learning materials and providing stronger guidance to train a better student network. The adaptive generation of learning materials and the teacher/student update can be conducted more than once, improving network capability iteratively. Extensive experimental results validate the efficacy of our method over the existing state of the art for FER.
Collapse
|
41
|
Su C, Wei J, Lin D, Kong L. Using attention LSGB network for facial expression recognition. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01124-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
42
|
Huo H, Yu Y, Liu Z. Facial expression recognition based on improved depthwise separable convolutional network. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:18635-18652. [PMID: 36467439 PMCID: PMC9686458 DOI: 10.1007/s11042-022-14066-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 08/29/2022] [Accepted: 10/10/2022] [Indexed: 06/17/2023]
Abstract
A single network model cannot extract sufficiently complex and rich effective features. Meanwhile, such network structures are usually huge, with many parameters and large space consumption. Therefore, combining multiple network models to extract complementary features has attracted extensive attention. To solve the problems in the prior art, namely that network models cannot extract high spatial-depth features and suffer from redundant structural parameters and weak generalization ability, this paper builds a neural network from two components, the Xception module and the inverted residual structure. On this basis, a facial expression recognition method based on an improved depthwise separable convolutional network is proposed. Firstly, Gaussian filtering is performed with the Canny operator to remove noise, and the result is combined with two original pixel feature maps to form a three-channel image. Secondly, the inverted residual structure of the MobileNetV2 model is introduced into the network. Finally, the extracted features are classified by a Softmax classifier, and the entire network uses ReLU6 as the nonlinear activation function. The experimental results show a recognition rate of 70.76% on the FER2013 dataset (Facial Expression Recognition 2013) and 97.92% on the CK+ dataset (Extended Cohn-Kanade). This method not only effectively mines deeper and more abstract image features but also prevents network over-fitting and improves generalization ability.
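The inverted residual building block borrowed from MobileNetV2, with ReLU6 and a depthwise convolution, is easy to sketch in PyTorch; channel sizes and the expansion factor below are illustrative, not the paper's architecture.

```python
# A compact sketch of a MobileNetV2-style inverted residual built from a
# depthwise separable convolution and ReLU6; sizes are illustrative.
import torch
import torch.nn as nn

class InvertedResidual(nn.Module):
    def __init__(self, c, expand=4):
        super().__init__()
        hidden = c * expand
        self.block = nn.Sequential(
            nn.Conv2d(c, hidden, 1, bias=False),     # expand (pointwise)
            nn.BatchNorm2d(hidden), nn.ReLU6(inplace=True),
            nn.Conv2d(hidden, hidden, 3, padding=1,  # depthwise conv
                      groups=hidden, bias=False),
            nn.BatchNorm2d(hidden), nn.ReLU6(inplace=True),
            nn.Conv2d(hidden, c, 1, bias=False),     # project (linear)
            nn.BatchNorm2d(c),
        )

    def forward(self, x):
        return x + self.block(x)  # residual over the inverted bottleneck

x = torch.randn(1, 32, 24, 24)
print(InvertedResidual(32)(x).shape)  # torch.Size([1, 32, 24, 24])
```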
Collapse
Affiliation(s)
- Hua Huo
- Engineering Technology Research Center of Big Data and Computational Intelligence, Henan University of Science and Technology, Kaiyuan Avenue, Luoyang, 471003 Henan China
| | - YaLi Yu
- Engineering Technology Research Center of Big Data and Computational Intelligence, Henan University of Science and Technology, Kaiyuan Avenue, Luoyang, 471003 Henan China
| | - ZhongHua Liu
- Information Engineering College, Henan University of Science and Technology, Kaiyuan Avenue, Luoyang, 471003 Henan China
| |
Collapse
|
43
|
Gong W, Qian Y, Fan Y. MPCSAN: multi-head parallel channel-spatial attention network for facial expression recognition in the wild. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-08040-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
44
|
Patch Attention Convolutional Vision Transformer for Facial Expression Recognition with Occlusion. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.11.068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
45
|
Zhou J, Wang Y, Zhang C, Wu W, Ji Y, Zou Y. Eyebirds: Enabling the Public to Recognize Water Birds at Hand. Animals (Basel) 2022; 12:3000. [PMID: 36359124 PMCID: PMC9658372 DOI: 10.3390/ani12213000] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 10/25/2022] [Accepted: 10/26/2022] [Indexed: 09/29/2023] Open
Abstract
Enabling the public to easily recognize water birds has a positive effect on wetland bird conservation. However, classifying water birds requires advanced ornithological knowledge, which makes it very difficult for the public to recognize water bird species in daily life. To break the knowledge barrier of water bird recognition for the public, we construct a water bird recognition system (Eyebirds) using deep learning, implemented as a smartphone app. Eyebirds consists of three main modules: (1) a water bird image dataset; (2) an attention mechanism-based deep convolutional neural network for water bird recognition (AM-CNN); and (3) an app for smartphone users. The water bird image dataset currently covers 48 families, 203 genera and 548 species of water birds worldwide and is used to train our recognition model. The AM-CNN model employs an attention mechanism to enhance the shallow features of bird images and boost classification performance. Experimental results on the North American bird dataset (CUB200-2011) show that the AM-CNN model achieves an average classification accuracy of 85%. On our self-built water bird image dataset, the AM-CNN model also works well, with classification accuracies of 94.0%, 93.6% and 86.4% at the family, genus and species levels, respectively. The user-side app is a WeChat applet deployed on smartphones. With the app, users can easily recognize water birds on expeditions, while camping or sightseeing, or even in daily life. In summary, our system can bring not only fun but also water bird knowledge to the public, inspiring their interest and further promoting their participation in bird ecological conservation.
Collapse
Affiliation(s)
- Jiaogen Zhou
- Jiangsu Provincial Engineering Research Center for Intelligent Monitoring and Ecological Management of Pond and Reservoir Water Environment, Huaiyin Normal University, Huaian 223300, China
| | - Yang Wang
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
| | - Caiyun Zhang
- Jiangsu Provincial Engineering Research Center for Intelligent Monitoring and Ecological Management of Pond and Reservoir Water Environment, Huaiyin Normal University, Huaian 223300, China
| | - Wenbo Wu
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
| | - Yanzhu Ji
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Yeai Zou
- Dongting Lake Station for Wetland Ecosystem Research, Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha 410125, China
| |
Collapse
|
46
|
Xu X, Zong Y, Lu C, Jiang X. Enhanced Sample Self-Revised Network for Cross-Dataset Facial Expression Recognition. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1475. [PMID: 37420495 DOI: 10.3390/e24101475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Revised: 10/04/2022] [Accepted: 10/10/2022] [Indexed: 07/09/2023]
Abstract
Recently, cross-dataset facial expression recognition (FER) has attracted wide attention from researchers. Thanks to the emergence of large-scale facial expression datasets, cross-dataset FER has made great progress. Nevertheless, facial images in large-scale datasets can suffer from low quality, subjective annotation, severe occlusion, and rare subject identity, which leads to outlier samples in facial expression datasets. These outlier samples are usually far from the clustering center of the dataset in feature space, resulting in considerable differences in feature distribution that severely restrict the performance of most cross-dataset facial expression recognition methods. To eliminate the influence of outlier samples on cross-dataset FER, we propose the enhanced sample self-revised network (ESSRN) with a novel outlier-handling mechanism, which first seeks out outlier samples and then suppresses them when dealing with cross-dataset FER. To evaluate the proposed ESSRN, we conduct extensive cross-dataset experiments across the RAF-DB, JAFFE, CK+, and FER2013 datasets. Experimental results demonstrate that the proposed outlier-handling mechanism effectively reduces the negative impact of outlier samples on cross-dataset FER and that our ESSRN outperforms classic deep unsupervised domain adaptation (UDA) methods and recent state-of-the-art cross-dataset FER results.
Collapse
Affiliation(s)
- Xiaolin Xu
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing 210096, China
- School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
| | - Yuan Zong
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing 210096, China
- School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
| | - Cheng Lu
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing 210096, China
| | - Xingxun Jiang
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing 210096, China
- School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
| |
Collapse
|
47
|
CNN-LSTM Facial Expression Recognition Method Fused with Two-Layer Attention Mechanism. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7450637. [DOI: 10.1155/2022/7450637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 09/29/2022] [Indexed: 11/17/2022]
Abstract
In exploring facial expression recognition methods, we found that existing algorithms make insufficient use of information about the key facial regions that express emotion. To address this problem, on the basis of a convolutional neural network and long short-term memory (CNN-LSTM), we propose a facial expression recognition method that incorporates an attention mechanism (CNN-ALSTM). Compared with the general CNN-LSTM algorithm, it mines the information of important regions more effectively. Furthermore, a CNN-LSTM facial expression recognition method incorporating a two-layer attention mechanism (ACNN-ALSTM) is proposed. We conducted comparative experiments on the FER2013 and processed CK+ datasets with the CNN-ALSTM, ACNN-ALSTM, patch-based ACNN (pACNN), facial expression recognition with attention net (FERAtt), and other networks. The results show that the proposed ACNN-ALSTM hybrid neural network model is superior to related work in expression recognition.
Collapse
|
48
|
Kuruvayil S, Palaniswamy S. Emotion recognition from facial images with simultaneous occlusion, pose and illumination variations using meta-learning. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2022. [DOI: 10.1016/j.jksuci.2021.06.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
49
|
Gupta S, Kumar P, Tekchandani RK. Facial emotion recognition based real-time learner engagement detection system in online learning context using deep learning models. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:11365-11394. [PMID: 36105662 PMCID: PMC9461440 DOI: 10.1007/s11042-022-13558-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 05/14/2022] [Accepted: 07/14/2022] [Indexed: 06/15/2023]
Abstract
The dramatic impact of the COVID-19 pandemic has resulted in the closure of physical classrooms, with teaching shifted to the online medium. To make the online learning environment as interactive as traditional offline classrooms, it is essential to ensure proper student engagement during online learning sessions. This paper proposes a deep learning-based approach using facial emotions to detect the real-time engagement of online learners. This is done by analysing students' facial expressions to classify their emotions throughout the online learning session. The facial emotion recognition information is used to calculate an engagement index (EI) that predicts two engagement states, "Engaged" and "Disengaged". Different deep learning models, Inception-V3, VGG19 and ResNet-50, are evaluated and compared to find the best predictive classification model for real-time engagement detection. Varied benchmark datasets such as FER-2013, CK+ and RAF-DB are used to gauge the overall performance and accuracy of the proposed system. Experimental results showed that the proposed system achieves accuracies of 89.11%, 90.14% and 92.32% for Inception-V3, VGG19 and ResNet-50, respectively, on the benchmark datasets and our own created dataset. ResNet-50 outperforms all others with an accuracy of 92.32% for facial emotion classification in real-time learning scenarios.
Collapse
Affiliation(s)
- Swadha Gupta
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, India
| | - Parteek Kumar
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, India
| | - Raj Kumar Tekchandani
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, India
| |
Collapse
|
50
|
Cai Q, An JP, Li HY, Guo JY, Gao ZK. Cross-subject emotion recognition using visibility graph and genetic algorithm-based convolution neural network. CHAOS (WOODBURY, N.Y.) 2022; 32:093110. [PMID: 36182360 DOI: 10.1063/5.0098454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 08/01/2022] [Indexed: 06/16/2023]
Abstract
An efficient emotion recognition model is an important research branch in electroencephalogram (EEG)-based brain-computer interfaces. However, the input of the emotion recognition model is often a whole set of EEG channels obtained by electrodes placed on subjects. The unnecessary information produced by redundant channels affects the recognition rate and depletes computing resources, thereby hindering the practical applications of emotion recognition. In this work, we aim to optimize the input of EEG channels using a visibility graph (VG) and genetic algorithm-based convolutional neural network (GA-CNN). First, we design an experiment to evoke three types of emotion states using movies and collect the multi-channel EEG signals of each subject under different emotion states. Then, we construct VGs for each EEG channel and derive nonlinear features representing each EEG channel. We employ the genetic algorithm (GA) to find the optimal subset of EEG channels for emotion recognition and use the recognition results of the CNN as fitness values. The experimental results show that the recognition performance of the proposed method using a subset of EEG channels is superior to that of the CNN using all channels for each subject. Last, based on the subset of EEG channels searched by the GA-CNN, we perform cross-subject emotion recognition tasks employing leave-one-subject-out cross-validation. These results demonstrate the effectiveness of the proposed method in recognizing emotion states using fewer EEG channels and further enrich the methods of EEG classification using nonlinear features.
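The GA-driven channel selection can be sketched as a small evolutionary loop over binary channel masks; the population size, rates, and especially the stand-in fitness function (which replaces the CNN's validation accuracy) are assumptions for illustration.

```python
# A simplified sketch of GA-based EEG channel selection: binary masks
# evolve by crossover/mutation; a toy fitness replaces CNN accuracy.
import random

N_CHANNELS, POP, GENS = 32, 12, 20

def fitness(mask):
    # placeholder: real use would train/evaluate the CNN on these channels;
    # here we just prefer small subsets that keep "informative" channels
    informative = set(range(0, 8))
    hits = sum(1 for i, b in enumerate(mask) if b and i in informative)
    return hits - 0.05 * sum(mask)

def crossover(a, b):
    cut = random.randrange(1, N_CHANNELS)
    return a[:cut] + b[cut:]

def mutate(m, rate=0.05):
    return [1 - g if random.random() < rate else g for g in m]

pop = [[random.randint(0, 1) for _ in range(N_CHANNELS)] for _ in range(POP)]
for _ in range(GENS):
    pop.sort(key=fitness, reverse=True)
    parents = pop[:POP // 2]                 # keep the fitter half
    children = [mutate(crossover(random.choice(parents),
                                 random.choice(parents)))
                for _ in range(POP - len(parents))]
    pop = parents + children

best = max(pop, key=fitness)
print("selected channels:", [i for i, b in enumerate(best) if b])
```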
Collapse
Affiliation(s)
- Qing Cai
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Jian-Peng An
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Hao-Yu Li
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Jia-Yi Guo
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Zhong-Ke Gao
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| |
Collapse
|