1
Zhou S, Xu H, Bai Z, Du Z, Zeng J, Wang Y, Wang Y, Li S, Wang M, Li Y, Li J, Xu J. A multidimensional feature fusion network based on MGSE and TAAC for video-based human action recognition. Neural Netw 2023;168:496-507. PMID: 37827068. DOI: 10.1016/j.neunet.2023.09.031.
Abstract
With the maturity of intelligent technologies such as human-computer interaction, human action recognition (HAR) has been widely applied in virtual reality, video surveillance, and other fields. However, current video-based HAR methods still cannot fully extract abstract action features, and action collection and recognition remain scarce for special populations such as prisoners and elderly people living alone. To address these problems, this paper proposes a multidimensional feature fusion network, called P-MTSC3D, a parallel network based on context modeling and temporal adaptive attention modules. It consists of three branches. The first branch serves as the basic network branch and extracts basic feature information. The second branch consists of a feature pre-extraction layer and two multiscale-convolution-based global context modeling combined squeeze and excitation (MGSE) modules, which extract spatial and channel features. The third branch consists of two temporal adaptive attention units based on convolution (TAAC) that extract temporal features. To verify the validity of the proposed network, experiments are conducted on the University of Central Florida (UCF) 101 dataset and the human motion database (HMDB) 51 dataset. The recognition accuracy of the proposed P-MTSC3D network is 97.92% on UCF101 and 75.59% on HMDB51. The FLOPs of the P-MTSC3D network are 30.85G, and the test time is 2.83 s per 16 samples on UCF101. The experimental results demonstrate that the P-MTSC3D network achieves better overall performance than state-of-the-art networks. In addition, a prison action (PA) dataset is constructed to verify the network's effectiveness in real-world scenarios.
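As a rough illustration of the three-branch design this abstract describes, the sketch below builds a parallel 3D-CNN whose second and third branches apply channel attention and temporal attention before the branch outputs are fused. It is a minimal sketch, not the authors' implementation: the module internals, channel sizes, and summation fusion are assumptions standing in for the details of the MGSE and TAAC modules, which the abstract does not specify.

```python
import torch
import torch.nn as nn

class ChannelAttention3D(nn.Module):
    """Squeeze-and-excitation style channel attention (stand-in for MGSE)."""
    def __init__(self, channels, reduction=8):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels), nn.Sigmoid())

    def forward(self, x):                          # x: (N, C, T, H, W)
        w = x.mean(dim=(2, 3, 4))                  # squeeze over time and space
        w = self.fc(w).view(x.size(0), -1, 1, 1, 1)
        return x * w                               # excite: reweight channels

class TemporalAttention(nn.Module):
    """Convolutional attention over the frame axis (stand-in for TAAC)."""
    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv1d(channels, channels, kernel_size=3, padding=1)

    def forward(self, x):                          # x: (N, C, T, H, W)
        t = x.mean(dim=(3, 4))                     # (N, C, T) temporal descriptor
        w = torch.sigmoid(self.conv(t)).unsqueeze(-1).unsqueeze(-1)
        return x * w                               # reweight frames

class ParallelFusionNet(nn.Module):
    """Three parallel branches fused by summation, as in the abstract."""
    def __init__(self, in_ch=3, ch=64, num_classes=101):
        super().__init__()
        conv = lambda: nn.Sequential(              # shared stem pattern
            nn.Conv3d(in_ch, ch, 3, padding=1), nn.BatchNorm3d(ch), nn.ReLU())
        self.base = conv()                         # branch 1: basic features
        self.spatial = nn.Sequential(conv(),       # branch 2: two MGSE stand-ins
                                     ChannelAttention3D(ch),
                                     ChannelAttention3D(ch))
        self.temporal = nn.Sequential(conv(),      # branch 3: two TAAC stand-ins
                                      TemporalAttention(ch),
                                      TemporalAttention(ch))
        self.head = nn.Linear(ch, num_classes)

    def forward(self, x):                          # x: (N, 3, T, H, W)
        f = self.base(x) + self.spatial(x) + self.temporal(x)  # fuse branches
        return self.head(f.mean(dim=(2, 3, 4)))   # global pool + classify

clip = torch.randn(2, 3, 16, 112, 112)             # two 16-frame clips
print(ParallelFusionNet()(clip).shape)             # torch.Size([2, 101])
```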
Affiliation(s)
- Shuang Zhou, Hongji Xu, Zhiquan Bai, Zhengfeng Du, Jiaqi Zeng, Yang Wang, Yuhao Wang, Shijie Li, Mengmeng Wang, Yiran Li, Jianjun Li, Jie Xu
- School of Information Science and Engineering, Shandong University, 72 Binhai Road, Qingdao, 266237, Shandong, China (all authors)
3
Wang C, Wang Z. Unsupervised Facial Action Representation Learning by Temporal Prediction. Front Neurorobot 2022;16:851847. PMID: 35370591. PMCID: PMC8965886. DOI: 10.3389/fnbot.2022.851847.
Abstract
Due to the cumbersome and expensive data collection process, facial action unit (AU) datasets are generally much smaller than those in other computer vision fields, so AU detection models trained on such insufficient data tend to overfit. Despite recent progress in AU detection, deployment of these models has been impeded by their limited generalization to unseen subjects and facial poses. In this paper, we propose to learn discriminative facial AU representations in a self-supervised manner. Considering that facial AUs show temporal consistency and evolution across consecutive facial frames, we develop a self-supervised pseudo signal based on temporally predictive coding (TPC) to capture these temporal characteristics. To further learn per-frame discriminativeness between sibling facial frames, we naturally incorporate frame-wise temporal contrastive learning into the self-supervised paradigm. The proposed TPC can be trained without AU annotations, which allows us to use a large number of unlabeled facial videos to learn AU representations that are robust to undesired nuisances such as facial identity and pose. Contrary to previous AU detection works, our method requires neither manually selected key facial regions nor explicit modeling of AU relations. Experimental results show that TPC improves AU detection precision on several popular AU benchmark datasets compared with other self-supervised AU detection methods.
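The core training signal described here, predicting the embedding of the next frame and contrasting it against sibling frames, can be sketched as an InfoNCE-style loss. This is a minimal sketch under assumptions: the encoder output shape, the linear predictor, and the choice to treat all other frames in the batch as negatives are illustrative, not the paper's exact formulation of TPC.

```python
import torch
import torch.nn.functional as F

def tpc_loss(frame_feats, predictor, temperature=0.1):
    """frame_feats: (N, T, D) per-frame embeddings from any encoder.
    predictor: module mapping a frame embedding to the next one."""
    context = frame_feats[:, :-1, :]            # frames 0 .. T-2
    target = frame_feats[:, 1:, :]              # frames 1 .. T-1
    pred = predictor(context)                   # predict next-frame embedding
    pred = F.normalize(pred.flatten(0, 1), dim=-1)    # (N*(T-1), D)
    target = F.normalize(target.flatten(0, 1), dim=-1)
    logits = pred @ target.t() / temperature    # each prediction vs all frames
    labels = torch.arange(logits.size(0))       # positive = true next frame
    # InfoNCE: pull the matching frame closer, push sibling frames of the
    # same clip and frames from other clips away.
    return F.cross_entropy(logits, labels)

predictor = torch.nn.Linear(128, 128)           # simplest possible predictor
feats = torch.randn(4, 8, 128)                  # 4 clips x 8 frames x 128-d
print(tpc_loss(feats, predictor))
```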
Affiliation(s)
- Chongwen Wang
- School of Computer Science, Beijing Institute of Technology, Beijing, China
4
Wang C, Wang Z. Progressive Multi-Scale Vision Transformer for Facial Action Unit Detection. Front Neurorobot 2022;15:824592. PMID: 35095460. PMCID: PMC8790567. DOI: 10.3389/fnbot.2021.824592.
Abstract
Facial action unit (AU) detection is an important task in affective computing and has attracted extensive attention in computer vision and artificial intelligence. Previous studies on AU detection usually encode complex regional feature representations with manually defined facial landmarks and model the relationships among AUs via graph neural networks. Although some progress has been achieved, existing methods still struggle to capture the exclusive and concurrent relationships among different combinations of facial AUs. To circumvent this issue, we propose a new progressive multi-scale vision transformer (PMVT) that captures the complex relationships among different AUs for a wide range of expressions in a data-driven fashion. PMVT is based on a multi-scale self-attention mechanism that can flexibly attend to a sequence of image patches to encode the critical cues for AUs. Compared with previous AU detection methods, the benefits of PMVT are twofold: (i) PMVT does not rely on manually defined facial landmarks to extract regional representations, and (ii) PMVT encodes facial regions with adaptive receptive fields, thus representing different AUs flexibly. Experimental results show that PMVT improves AU detection accuracy on the popular BP4D and DISFA datasets and obtains consistent improvements over other state-of-the-art AU detection methods. Visualization results show that PMVT automatically perceives the discriminative facial regions for robust AU detection.
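A minimal sketch of the multi-scale self-attention idea follows: patches are embedded at two granularities, each scale runs its own transformer encoder, and the pooled features are fused to produce AU logits. The patch sizes, depths, and fusion by concatenation are assumptions, and the progressive aspect of PMVT is not modeled here.

```python
import torch
import torch.nn as nn

class MultiScaleViT(nn.Module):
    """Multi-scale attention over patches; a sketch, not PMVT itself.
    (Positional embeddings are omitted for brevity.)"""
    def __init__(self, patches=(16, 8), dim=128, num_aus=12):
        super().__init__()
        # One patch-embedding conv per scale: kernel = stride = patch size.
        self.embeds = nn.ModuleList(
            nn.Conv2d(3, dim, kernel_size=p, stride=p) for p in patches)
        encoder = lambda: nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True),
            num_layers=2)
        self.stages = nn.ModuleList(encoder() for _ in patches)
        self.head = nn.Linear(dim * len(patches), num_aus)

    def forward(self, x):                       # x: (N, 3, H, W)
        feats = []
        for embed, stage in zip(self.embeds, self.stages):
            tokens = embed(x).flatten(2).transpose(1, 2)  # (N, P, dim)
            feats.append(stage(tokens).mean(dim=1))       # pool tokens
        return self.head(torch.cat(feats, dim=-1))        # AU logits

x = torch.randn(2, 3, 112, 112)
print(MultiScaleViT()(x).shape)                 # torch.Size([2, 12])
```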
Affiliation(s)
- Chongwen Wang
- School of Computer Science, Beijing Institute of Technology, Beijing, China
5
Gao J, Zhao Y. TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition. Front Neurorobot 2021;15:763100. PMID: 34759808. PMCID: PMC8573424. DOI: 10.3389/fnbot.2021.763100.
Abstract
Facial expression recognition (FER) in uncontrolled environments is challenging due to various unconstrained conditions. Although existing deep learning-based FER approaches are quite promising at recognizing frontal faces, they still struggle to accurately identify expressions on faces that are partly occluded in unconstrained scenarios. To mitigate this issue, we propose a transformer-based FER method (TFE) that adaptively focuses on the most important and unoccluded facial regions. TFE is based on the multi-head self-attention mechanism, which can flexibly attend to a sequence of image patches to encode the critical cues for FER. Compared with the traditional transformer, the novelty of TFE is twofold: (i) to effectively select discriminative facial regions, we integrate the attention weights from all transformer layers into a single attention map that guides the network to the important facial regions; and (ii) given an occluded facial image, we use a decoder to reconstruct the corresponding non-occluded face, so TFE can infer the occluded regions and better recognize the expression. We evaluate the proposed TFE on two prevalent in-the-wild facial expression datasets (AffectNet and RAF-DB) and their modifications with artificial occlusions. Experimental results show that TFE improves recognition accuracy on both non-occluded and occluded faces and obtains consistent improvements over other state-of-the-art FER methods. Visualization results show that TFE automatically focuses on the discriminative, non-occluded facial regions for robust FER.
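The two novelties claimed here can be sketched independently: fusing per-layer attention weights into a single map, and adding a reconstruction loss for the non-occluded face. Attention rollout is used below as one plausible way to integrate the layer-wise weights; both it and the joint loss weighting are assumptions, not the paper's exact method.

```python
import torch

def fuse_attention(attn_per_layer):
    """attn_per_layer: list of (N, heads, P, P) attention matrices,
    one per transformer layer. Returns one fused (N, P, P) map."""
    fused = None
    for attn in attn_per_layer:
        a = attn.mean(dim=1)                    # average over heads
        a = a + torch.eye(a.size(-1))           # account for residual paths
        a = a / a.sum(dim=-1, keepdim=True)     # renormalize rows
        fused = a if fused is None else a @ fused  # compose across layers
    return fused

def tfe_losses(logits, labels, recon, clean_face, ce, l1, alpha=0.5):
    """Joint objective sketch: expression classification on the occluded
    input plus reconstruction of its non-occluded counterpart."""
    return ce(logits, labels) + alpha * l1(recon, clean_face)

# Six layers of softmax-normalized attention over 49 patches, 4 heads.
attn = [torch.softmax(torch.randn(2, 4, 49, 49), dim=-1) for _ in range(6)]
print(fuse_attention(attn).shape)               # torch.Size([2, 49, 49])
```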
Affiliation(s)
- Jixun Gao
- Department of Computer Science, Henan University of Engineering, Zhengzhou, China
- Yuanyuan Zhao
- Department of Computer Science, Zhengzhou University of Technology, Zhengzhou, China