1
Ahmedt-Aristizabal D, Armin MA, Hayder Z, Garcia-Cairasco N, Petersson L, Fookes C, Denman S, McGonigal A. Deep learning approaches for seizure video analysis: A review. Epilepsy Behav 2024; 154:109735. [PMID: 38522192] [DOI: 10.1016/j.yebeh.2024.109735]
Abstract
Seizure events can manifest as transient disruptions in the control of movements, which may be organized in distinct behavioral sequences and accompanied or not by other observable features such as altered facial expressions. The analysis of these clinical signs, referred to as semiology, is subject to observer variation when specialists evaluate video-recorded events in the clinical setting. To enhance the accuracy and consistency of evaluations, computer-aided video analysis of seizures has emerged as a natural avenue. In the field of medical applications, deep learning and computer vision approaches have driven substantial advancements. Historically, these approaches have been used for disease detection, classification, and prediction using diagnostic data; however, there has been limited exploration of their application to video-based motion detection in the clinical epileptology setting. While vision-based technologies do not aim to replace clinical expertise, they can significantly contribute to medical decision-making and patient care by providing quantitative evidence and decision support. Behavior monitoring tools offer several advantages, such as providing objective information, detecting challenging-to-observe events, reducing documentation efforts, and extending assessment capabilities to areas with limited expertise. The main applications of these tools are (1) improved seizure detection methods and (2) refined semiology analysis for predicting seizure type and cerebral localization. In this paper, we detail the foundation technologies used in vision-based systems for the analysis of seizure videos, highlighting their success in semiology detection and analysis, focusing on work published in the last 7 years. We systematically present these methods and indicate how the adoption of deep learning for the analysis of video recordings of seizures could be approached. Additionally, we illustrate how existing technologies can be interconnected through an integrated system for video-based semiology analysis. Each module can be customized and improved by adapting more accurate and robust deep learning approaches as these evolve. Finally, we discuss challenges and research directions for future studies.
Affiliation(s)
- David Ahmedt-Aristizabal
- Imaging and Computer Vision Group, CSIRO Data61, Australia; SAIVT Laboratory, Queensland University of Technology, Australia.
- Zeeshan Hayder
- Imaging and Computer Vision Group, CSIRO Data61, Australia.
- Norberto Garcia-Cairasco
- Physiology Department and Neuroscience and Behavioral Sciences Department, Ribeirão Preto Medical School, University of São Paulo, Brazil.
- Lars Petersson
- Imaging and Computer Vision Group, CSIRO Data61, Australia.
- Clinton Fookes
- SAIVT Laboratory, Queensland University of Technology, Australia.
- Simon Denman
- SAIVT Laboratory, Queensland University of Technology, Australia.
- Aileen McGonigal
- Neurosciences Centre, Mater Hospital, Australia; Queensland Brain Institute, The University of Queensland, Australia.
2
Camarena F, Gonzalez-Mendoza M, Chang L. Knowledge Distillation in Video-Based Human Action Recognition: An Intuitive Approach to Efficient and Flexible Model Training. J Imaging 2024; 10:85. [PMID: 38667983] [PMCID: PMC11051277] [DOI: 10.3390/jimaging10040085]
Abstract
Training a model to recognize human actions in videos is computationally intensive. While modern strategies employ transfer learning to make the process more efficient, they still face challenges regarding flexibility and efficiency. Existing solutions are limited in functionality and rely heavily on pretrained architectures, which can restrict their applicability to diverse scenarios. Our work explores knowledge distillation (KD) for enhancing the training of self-supervised video models in three aspects: improving classification accuracy, accelerating model convergence, and increasing model flexibility under regular and limited-data scenarios. We tested our method on the UCF101 dataset using differently balanced proportions: 100%, 50%, 25%, and 2%. We found that using knowledge distillation to guide the model's training outperforms traditional training, preserving classification accuracy while speeding up convergence in both standard and data-scarce settings. Additionally, knowledge distillation enables cross-architecture flexibility, allowing model customization for various applications, from resource-limited to high-performance scenarios.
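The soft-target objective that such distillation approaches build on can be sketched in a few lines. This is a minimal illustration, not the authors' implementation; the temperature value and toy logits are assumptions:

```python
import math

def softmax(logits, T=1.0):
    """Temperature-scaled softmax; a higher T softens the distribution."""
    exps = [math.exp(z / T) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, T=4.0):
    """KL divergence between temperature-softened teacher and student
    outputs, scaled by T^2 (the classic Hinton-style soft-target loss)."""
    p = softmax(teacher_logits, T)  # teacher's soft targets
    q = softmax(student_logits, T)  # student's softened predictions
    return T * T * sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# The loss vanishes when the student matches the teacher exactly
# and grows as their predictions diverge.
```

In practice this term is usually mixed with the ordinary cross-entropy on hard labels; the mixing weight, like T, is a tunable hyperparameter.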
Affiliation(s)
- Fernando Camarena
- School of Engineering and Science, Tecnologico de Monterrey, Nuevo León 64700, Mexico
3
Gao M, Ju B. Attention-enhanced gated recurrent unit for action recognition in tennis. PeerJ Comput Sci 2024; 10:e1804. [PMID: 38259901] [PMCID: PMC10803087] [DOI: 10.7717/peerj-cs.1804]
Abstract
Human Action Recognition (HAR) is an essential topic in computer vision and artificial intelligence, focused on the automatic identification and categorization of human actions or activities from video sequences or sensor data. The goal of HAR is to teach machines to comprehend and interpret human movements, gestures, and behaviors, enabling a wide range of applications in areas such as surveillance, healthcare, sports analysis, and human-computer interaction. HAR systems utilize a variety of techniques, including deep learning, motion analysis, and feature extraction, to capture and analyze the spatiotemporal characteristics of human actions. These systems can distinguish between various actions, whether simple ones like walking and waving or more complex activities such as playing a musical instrument or performing sports maneuvers. HAR continues to be an active area of research and development, with the potential to enhance numerous real-world applications by providing machines with the ability to understand and respond to human actions effectively. In our study, we developed a HAR system to recognize actions in tennis using an attention-based gated recurrent unit (GRU), a prevalent recurrent neural network. The combination of the GRU architecture and an attention mechanism showed a significant improvement in predictive power compared to two other deep learning models. Our models were trained on the THETIS dataset, one of the standard medium-sized datasets for fine-grained tennis actions. The effectiveness of the proposed model was confirmed with three different image encoders: InceptionV3, DenseNet, and EfficientNetB5. The models developed with InceptionV3, DenseNet, and EfficientNetB5 achieved average ROC-AUC values of 0.97, 0.98, and 0.81, respectively, and average PR-AUC values of 0.84, 0.87, and 0.49, respectively. The experimental results confirmed the applicability of our proposed method to recognizing actions in tennis, and it may be applied to other HAR problems.
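The attention mechanism paired with the GRU can be illustrated by a simple dot-product attention pool over per-frame feature vectors. This is a sketch with assumed toy features; the paper's exact attention formulation may differ:

```python
import math

def attention_pool(frames, query):
    """Score each frame feature against a query vector, softmax the
    scores into weights, and return the weighted average feature."""
    scores = [sum(q * x for q, x in zip(query, f)) for f in frames]
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]  # numerically stable softmax
    total = sum(exps)
    weights = [e / total for e in exps]
    pooled = [sum(w * f[i] for w, f in zip(weights, frames))
              for i in range(len(query))]
    return pooled, weights

# Frames that align with the query dominate the pooled representation,
# letting the classifier focus on the discriminative part of a stroke.
```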
Affiliation(s)
- Meng Gao
- College of Sports and Health Management, Henan Finance University, Zhengzhou, China
- Bingchun Ju
- College of Sports, Zhengzhou University of Light Industry, Zhengzhou, China
4
Guerra BMV, Torti E, Marenzi E, Schmid M, Ramat S, Leporati F, Danese G. Ambient assisted living for frail people through human activity recognition: state-of-the-art, challenges and future directions. Front Neurosci 2023; 17:1256682. [PMID: 37849892] [PMCID: PMC10577184] [DOI: 10.3389/fnins.2023.1256682]
Abstract
Ambient Assisted Living is a concept that focuses on using technology to support and enhance the quality of life and well-being of frail or elderly individuals in both indoor and outdoor environments. It aims to empower individuals to maintain their independence and autonomy while ensuring their safety and providing assistance when needed. Human Activity Recognition is widely regarded as the most popular methodology within the field of Ambient Assisted Living. Human Activity Recognition involves automatically detecting and classifying the activities performed by individuals using sensor-based systems. Researchers have employed various methodologies, utilizing wearable and/or non-wearable sensors, and applying algorithms ranging from simple threshold-based techniques to more advanced deep learning approaches. In this review, literature from the past decade is critically examined, specifically exploring the technological aspects of Human Activity Recognition in Ambient Assisted Living. An exhaustive analysis of the methodologies adopted is provided, highlighting their strengths and weaknesses. Finally, challenges encountered in the field of Human Activity Recognition for Ambient Assisted Living are thoroughly discussed. These challenges encompass issues related to data collection, model training, real-time performance, generalizability, and user acceptance. Miniaturization, unobtrusiveness, energy harvesting and communication efficiency will be the crucial factors for new wearable solutions.
Affiliation(s)
- Bruna Maria Vittoria Guerra
- Bioengineering Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Emanuele Torti
- Custom Computing and Programmable Systems Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Elisa Marenzi
- Custom Computing and Programmable Systems Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Micaela Schmid
- Bioengineering Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Stefano Ramat
- Bioengineering Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Francesco Leporati
- Custom Computing and Programmable Systems Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Giovanni Danese
- Custom Computing and Programmable Systems Laboratory, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
5
Maudsley-Barton S, Yap MH. KINECAL: A Dataset for Falls-Risk Assessment and Balance Impairment Analysis. Sci Data 2023; 10:633. [PMID: 37723189] [PMCID: PMC10507078] [DOI: 10.1038/s41597-023-02375-w]
Abstract
The field of human action recognition has made great strides in recent years, helped greatly by the availability of a wide variety of datasets that use Kinect to record human movement. Conversely, progress towards the use of Kinect in clinical practice has been hampered by the lack of appropriate data, in particular datasets that contain clinically significant movements and appropriate metadata. This paper proposes a dataset to address this issue, namely KINECAL. It contains recordings of 90 individuals carrying out 11 movements commonly used in the clinical assessment of balance. The dataset contains relevant metadata, including clinical labelling, falls-history labelling and postural sway metrics. KINECAL should be of interest to researchers working on the clinical use of motion capture and motion analysis.
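One of the postural sway metrics such a dataset enables is sway path length, the total distance travelled by a tracked point across frames. A minimal sketch follows; the choice of joint and the units are assumptions, not KINECAL's definition:

```python
import math

def sway_path_length(points):
    """Total distance travelled by a tracked point (e.g. a skeleton
    joint projected onto the ground plane) over consecutive frames."""
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))
```

Larger path lengths over a fixed recording period generally indicate poorer postural control.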
Affiliation(s)
- Sean Maudsley-Barton
- Department of Computing and Mathematics, Manchester Metropolitan University, Faculty of Science and Engineering, Manchester, M1 5GD, UK.
- Moi Hoon Yap
- Department of Computing and Mathematics, Manchester Metropolitan University, Faculty of Science and Engineering, Manchester, M1 5GD, UK.
6
Muaaz M, Waqar S, Pätzold M. Orientation-Independent Human Activity Recognition Using Complementary Radio Frequency Sensing. Sensors (Basel) 2023; 23:5810. [PMID: 37447660] [DOI: 10.3390/s23135810]
Abstract
RF sensing offers an unobtrusive, user-friendly, and privacy-preserving method for detecting accidental falls and recognizing human activities. Contemporary RF-based HAR systems generally employ a single monostatic radar to recognize human activities. However, a single monostatic radar cannot detect the motion of a target, e.g., a moving person, orthogonal to the boresight axis of the radar. Owing to this inherent physical limitation, a single monostatic radar fails to efficiently recognize orientation-independent human activities. In this work, we present a complementary RF sensing approach that overcomes the limitation of existing single monostatic radar-based HAR systems to robustly recognize orientation-independent human activities and falls. Our approach used a distributed mmWave MIMO radar system that was set up as two separate monostatic radars placed orthogonal to each other in an indoor environment. These two radars illuminated the moving person from two different aspect angles and consequently produced two time-variant micro-Doppler signatures. We first computed the mean Doppler shifts (MDSs) from the micro-Doppler signatures and then extracted statistical and time- and frequency-domain features. We adopted feature-level fusion techniques to fuse the extracted features and a support vector machine to classify orientation-independent human activities. To evaluate our approach, we used an orientation-independent human activity dataset, which was collected from six volunteers. The dataset consisted of more than 1350 activity trials of five different activities that were performed in different orientations. The proposed complementary RF sensing approach achieved an overall classification accuracy ranging from 98.31 to 98.54%. It overcame the inherent limitations of a conventional single monostatic radar-based HAR and outperformed it by 6%.
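Feature-level fusion of the two radars' mean-Doppler-shift traces can be sketched as computing per-radar statistical features and concatenating them for the downstream classifier. This is an illustration only; the paper's actual feature set is larger and includes time- and frequency-domain features:

```python
import math

def stat_features(mds):
    """A few statistical features of one radar's mean-Doppler-shift trace."""
    mu = sum(mds) / len(mds)
    var = sum((x - mu) ** 2 for x in mds) / len(mds)
    return [mu, math.sqrt(var), max(mds) - min(mds)]  # mean, std, peak-to-peak

def fuse_features(mds_radar_a, mds_radar_b):
    """Feature-level fusion: concatenate the per-radar feature vectors
    into one vector for the downstream SVM."""
    return stat_features(mds_radar_a) + stat_features(mds_radar_b)
```

Because the two radars view the person from orthogonal aspect angles, at least one of the concatenated halves carries a strong Doppler signature regardless of the person's orientation.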
Affiliation(s)
- Muhammad Muaaz
- Faculty of Engineering and Science, University of Agder, 4898 Grimstad, Norway
- Sahil Waqar
- Faculty of Engineering and Science, University of Agder, 4898 Grimstad, Norway
- Matthias Pätzold
- Faculty of Engineering and Science, University of Agder, 4898 Grimstad, Norway
7
Kulbacki M, Segen J, Chaczko Z, Rozenblit JW, Kulbacki M, Klempous R, Wojciechowski K. Intelligent Video Analytics for Human Action Recognition: The State of Knowledge. Sensors (Basel) 2023; 23:4258. [PMID: 37177461] [PMCID: PMC10181781] [DOI: 10.3390/s23094258]
Abstract
The paper presents a comprehensive overview of intelligent video analytics and human action recognition methods, surveying the current state of knowledge in human activity recognition across techniques such as pose-based, tracking-based, spatio-temporal, and deep learning-based approaches, including visual transformers. We also discuss the challenges and limitations of these techniques and the potential of modern edge AI architectures to enable real-time human action recognition in resource-constrained environments.
Affiliation(s)
- Marek Kulbacki
- Polish-Japanese Academy of Information Technology, 02-008 Warsaw, Poland
- DIVE IN AI, 53-307 Wroclaw, Poland
- Jakub Segen
- Polish-Japanese Academy of Information Technology, 02-008 Warsaw, Poland
- DIVE IN AI, 53-307 Wroclaw, Poland
- Zenon Chaczko
- DIVE IN AI, 53-307 Wroclaw, Poland
- School of Electrical and Data Engineering, University of Technology Sydney, Ultimo 2007, Australia
- Jerzy W Rozenblit
- Department of Electrical and Computer Engineering, The University of Arizona, Tucson, AZ 85721, USA
- Ryszard Klempous
- Wrocław University of Science and Technology, 50-370 Wroclaw, Poland
8
Jamshed A, Mallick B, Kumar Bharti R. Grey wolf optimization (GWO) with the convolution neural network (CNN)-based pattern recognition system. Imaging Sci J 2023. [DOI: 10.1080/13682199.2023.2166193]
Affiliation(s)
- Aatif Jamshed
- Department of Computer Science and Engineering, Veer Madho Singh Bhandari Uttarakhand Technical University, Dehradun, Uttarakhand, India
- Bhawna Mallick
- Meerut Institute of Engineering and Technology, Meerut, Uttar Pradesh, India
9
Toward human activity recognition: a survey. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07937-4]
10
A Comprehensive Review of Recent Deep Learning Techniques for Human Activity Recognition. Comput Intell Neurosci 2022; 2022:8323962. [PMID: 35498187] [PMCID: PMC9045967] [DOI: 10.1155/2022/8323962]
Abstract
Human action recognition is an important field in computer vision that has attracted remarkable attention from researchers. This survey aims to provide a comprehensive overview of recent human action recognition approaches based on deep learning using RGB video data. Our work divides recent deep learning-based methods into five different categories to provide a comprehensive overview for researchers who are interested in this field of computer vision. Moreover, pure-transformer (convolution-free) architectures have recently outperformed their convolutional counterparts in many fields of computer vision. Our work also covers recent convolution-free methods, which replace convolutional networks with transformers and have achieved state-of-the-art results on many human action recognition datasets. Firstly, we discuss proposed methods based on 2D convolutional neural networks. Then, methods based on recurrent neural networks, used to capture motion information, are discussed. 3D convolutional neural network-based methods are used in many recent approaches to capture both spatial and temporal information in videos. For long action videos, multistream approaches that use separate streams to encode different features are reviewed. We also compare the performance of recently proposed methods on four popular benchmark datasets and review 26 benchmark datasets for human action recognition. Some potential research directions are discussed to conclude this survey.
11
Neural Networks for Automatic Posture Recognition in Ambient-Assisted Living. Sensors (Basel) 2022; 22:2609. [PMID: 35408224] [PMCID: PMC9003043] [DOI: 10.3390/s22072609]
Abstract
Human Action Recognition (HAR) is a rapidly evolving field impacting numerous domains, among which is Ambient Assisted Living (AAL). In such a context, the aim of HAR is to meet the needs of frail individuals, whether elderly or disabled, and to promote autonomous, safe and secure living. To this end, we propose a monitoring system that detects dangerous situations by classifying human postures through Artificial Intelligence (AI) solutions. The developed algorithm works on a set of features computed from the skeleton data provided by four Kinect One systems simultaneously recording the scene from different angles, identifying the posture of the subject in an ecological context within each recorded frame. Here, we compare the recognition abilities of Multi-Layer Perceptron (MLP) and Long Short-Term Memory (LSTM) sequence networks. Starting from the set of previously selected features, we performed a further feature selection based on an SVM algorithm for the optimization of the MLP network, and used a genetic algorithm for selecting the features for the LSTM sequence model. We then optimized the architecture and hyperparameters of both models before comparing their performances. The best MLP model (3 hidden layers and a Softmax output layer) achieved 78.4%, while the best LSTM (2 bidirectional LSTM layers, 2 dropout layers and a fully connected layer) reached 85.7%. The analysis of the performances on individual classes highlights the better suitability of the LSTM approach.
12
Implementation of Sequence-Based Classification Methods for Motion Assessment and Recognition in a Traditional Chinese Sport (Baduanjin). Int J Environ Res Public Health 2022; 19:1744. [PMID: 35162767] [PMCID: PMC8834705] [DOI: 10.3390/ijerph19031744]
Abstract
This study aimed to assess the motion accuracy of Baduanjin and recognise the motions of Baduanjin based on sequence-based methods. Motion data of Baduanjin were measured by an inertial measurement unit (IMU) system. Fifty-four participants were recruited to capture motion data. Based on the motion data, various sequence-based methods, namely dynamic time warping (DTW) combined with classifiers, hidden Markov models (HMM), and recurrent neural networks (RNNs), were applied to assess motion accuracy and recognise the motions of Baduanjin. To assess motion accuracy, the scores for motion accuracy given by teachers were used as the standard to train the models with the different sequence-based methods. The effectiveness of Baduanjin motion recognition with different sequence-based methods was verified. Among the methods, DTW + k-NN had the highest average accuracy (83.03%) and the shortest average processing time (3.810 s) during assessment. In terms of motion recognition, three methods (DTW + k-NN, DTW + SVM, and HMM) achieved the highest accuracies (over 99%), which were not significantly different from each other. However, the processing time of DTW + k-NN was the shortest (3.823 s) of the three. The results show that the motions of Baduanjin can be recognised, and their accuracy assessed, through an appropriate sequence-based method applied to the motion data captured by the IMU.
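The best-performing combination, DTW + k-NN, reduces to a warping-invariant distance plus nearest-neighbour voting. The sketch below is one-dimensional with hypothetical templates; real IMU data are multi-channel:

```python
def dtw(a, b):
    """Dynamic time warping distance between two 1-D sequences."""
    INF = float("inf")
    n, m = len(a), len(b)
    D = [[INF] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i][j] = cost + min(D[i - 1][j],      # insertion
                                 D[i][j - 1],      # deletion
                                 D[i - 1][j - 1])  # match
    return D[n][m]

def knn_classify(query, labelled, k=1):
    """Label a motion sequence by majority vote of its k nearest
    DTW neighbours among labelled template sequences."""
    dists = sorted((dtw(query, seq), lbl) for seq, lbl in labelled)
    votes = [lbl for _, lbl in dists[:k]]
    return max(set(votes), key=votes.count)
```

DTW tolerates the tempo differences between practitioners that would defeat a plain Euclidean frame-by-frame comparison.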
13
Bossavit B, Arnedillo-Sánchez I. Using motion capture technology to assess locomotor development in children. Digit Health 2022; 8:20552076221144201. [PMID: 36532118] [PMCID: PMC9756361] [DOI: 10.1177/20552076221144201]
Abstract
OBJECTIVE Motor and cognitive development share a biological background within the prefrontal cortex and cerebellum. Monitoring motor development is relevant to identify children at risk of developmental delays. However, access to timely assessment is limited by availability and cost. Affordable motion capture technology may provide an alternative to human assessment. METHODS MotorSense uses this technology to guide and assess children executing age-related developmental motor tasks. It incorporates advanced heuristics informed by pattern recognition principles based on the developmental sequences of motor skills. MotorSense was evaluated with 16 children aged 4-6 years from a rural primary school. RESULTS A total of 506 jumps, 2415 steps and 831 hops were analysed. The analysis shows that MotorSense accuracy (MA) in recognising jump forward (89.96%), jump high (83.34%), jump sideways (85.63%), hop (74.58%) and jog (92.34%) is as good as the sensor's precision. The analysis of the tasks' execution shows a high level of agreement between human and MotorSense assessment on jump forward (91%), jump high (99%), jump sideways (93%), hop (94%) and jog (92%). CONCLUSIONS MotorSense helps address the shortage of affordable technologies to support the assessment of motor development using graded age-related developmental motor tasks. Furthermore, it could contribute towards the tele-detection of motor developmental delays.
Affiliation(s)
- Benoit Bossavit
- School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
- School of Computer Science and Languages, Universidad de Malaga, Malaga, Spain
14
Ramos RG, Domingo JD, Zalama E, Gómez-García-Bermejo J. Daily Human Activity Recognition Using Non-Intrusive Sensors. Sensors (Basel) 2021; 21:5270. [PMID: 34450709] [PMCID: PMC8401661] [DOI: 10.3390/s21165270]
Abstract
In recent years, Artificial Intelligence Technologies (AIT) have been developed to improve the quality of life of the elderly and their safety in the home. This work focuses on developing a system capable of recognising the most usual activities in the daily life of an elderly person in real time, enabling a specialist to monitor the person's habits, such as taking medication or eating the correct meals of the day. To this end, a prediction model has been developed based on recurrent neural networks, specifically on bidirectional LSTM networks, to obtain in real time the activity being carried out by individuals in their homes, based on the information provided by a set of different sensors installed at each person's home. The prediction model developed in this paper achieves a 95.42% accuracy rate, improving on the results of similar models currently in use. In order to obtain a reliable model with a high accuracy rate, a series of processing and filtering steps have been applied to the data, obtained from the public CASAS database, such as a sliding-window method and a stacking and re-ordering algorithm, before the data are used to train the neural network.
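The sliding-window preprocessing mentioned above can be sketched as splitting the time-ordered sensor event stream into fixed-length, overlapping windows before they are fed to the bidirectional LSTM. The window size and step below are illustrative choices, not the paper's values:

```python
def sliding_windows(events, size, step):
    """Split a time-ordered event stream into fixed-length,
    overlapping windows; each window becomes one training sample."""
    return [events[i:i + size]
            for i in range(0, len(events) - size + 1, step)]
```

Overlapping windows (step smaller than size) multiply the number of training samples and let the network see each transition between activities in several contexts.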
Affiliation(s)
- Raúl Gómez Ramos
- CARTIF Technological Center, 47151 Valladolid, Spain
- ITAP-DISA, University of Valladolid, 47002 Valladolid, Spain
- Jaime Duque Domingo
- CARTIF Technological Center, 47151 Valladolid, Spain
- Eduardo Zalama
- CARTIF Technological Center, 47151 Valladolid, Spain
- ITAP-DISA, University of Valladolid, 47002 Valladolid, Spain
- Jaime Gómez-García-Bermejo
- CARTIF Technological Center, 47151 Valladolid, Spain
- ITAP-DISA, University of Valladolid, 47002 Valladolid, Spain
15
Evaluating the Performance of Eigenface, Fisherface, and Local Binary Pattern Histogram-Based Facial Recognition Methods under Various Weather Conditions. Technologies 2021. [DOI: 10.3390/technologies9020031]
Abstract
Facial recognition (FR) in unconstrained weather is still challenging and has been surprisingly overlooked by many researchers and practitioners over the past few decades. Therefore, this paper aims to evaluate the performance of three existing popular facial recognition methods under different weather conditions. To this end, a new face dataset, the Lamar University database (LUDB), was developed, containing face images captured under various weather conditions such as foggy, cloudy, rainy, and sunny. Three very popular FR methods, Eigenface (EF), Fisherface (FF), and Local binary pattern histogram (LBPH), were evaluated on two other face datasets, AT&T and 5_Celebrity, along with LUDB, in terms of accuracy, precision, recall, and F1 score with 95% confidence intervals (CI). Computational results show a significant difference among the three FR techniques in terms of overall time complexity and accuracy. LBPH outperforms the other two FR algorithms on both the LUDB and 5_Celebrity datasets, achieving 40% and 95% accuracy, respectively. On the other hand, with minimum execution times of 1.37, 1.37, and 1.44 s per image on AT&T, 5_Celebrity, and LUDB, respectively, Fisherface was the fastest.
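The LBP operator at the heart of the best-performing method can be sketched as follows. This is a minimal grayscale version; practical LBPH implementations (e.g. OpenCV's) add radius, neighbour-count, and spatial-grid parameters:

```python
def lbp_code(img, r, c):
    """8-neighbour local binary pattern code for pixel (r, c):
    each neighbour >= centre contributes one bit."""
    centre = img[r][c]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (dr, dc) in enumerate(offsets):
        if img[r + dr][c + dc] >= centre:
            code |= 1 << bit
    return code

def lbp_histogram(img):
    """256-bin histogram of LBP codes over interior pixels; faces are
    matched by comparing such histograms (e.g. chi-square distance)."""
    hist = [0] * 256
    for r in range(1, len(img) - 1):
        for c in range(1, len(img[0]) - 1):
            hist[lbp_code(img, r, c)] += 1
    return hist
```

Because each code depends only on intensity ordering around a pixel, LBPH is relatively robust to the monotonic illumination changes that foggy or cloudy weather introduces, which is consistent with its strong showing here.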
16
Liang JM, Chung PL, Ye YJ, Mishra S. Applying Machine Learning Technologies Based on Historical Activity Features for Multi-Resident Activity Recognition. Sensors (Basel) 2021; 21:2520. [PMID: 33916549] [PMCID: PMC8038457] [DOI: 10.3390/s21072520]
Abstract
Due to the aging population, home care for the elderly has become very important. Many current studies focus on deploying various sensors in the house to recognize the home activities of the elderly, especially those living alone. Through these, the home situation of a person living alone can be detected and their living safety ensured. However, the living environment of the elderly includes not only people living alone but also multiple people living together. Traditional methods applied to a multi-resident environment cannot accurately identify the "individual" activities of each person; they fail to distinguish which person was involved in which activities and thus cannot provide personal care. Therefore, this research investigates how to recognize home activities in multi-resident living environments, in order to accurately associate residents with home activities. Specifically, we propose to use characteristics of residents' historical activity in a multi-person environment, including activity interaction, activity frequency, activity period length, and residential behaviors, and then apply a suite of machine learning methods for training and testing. Five traditional supervised learning models and two deep learning methods are explored to tackle this problem. Through experiments with real datasets, the proposed methods achieved higher precision, recall and accuracy with less training time. The best accuracy reaches up to 91% and 95%, by J48DT and LSTM, respectively, in different living environments.
Affiliation(s)
- Jia-Ming Liang
- Department of Electrical Engineering, National University of Tainan, Tainan 70005, Taiwan
- Ping-Lin Chung
- Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan 33302, Taiwan
- Yi-Jyun Ye
- Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan 33302, Taiwan
- Shashank Mishra
- Department of Electrical Engineering, National University of Tainan, Tainan 70005, Taiwan
17
Abstract
A reliable environment perception is a crucial task for autonomous driving, especially in dense traffic areas. Recent improvements and breakthroughs in scene understanding for intelligent transportation systems are mainly based on deep learning and the fusion of different modalities. In this context, we introduce OLIMP: A heterOgeneous Multimodal Dataset for Advanced EnvIronMent Perception. This is the first public, multimodal and synchronized dataset to include UWB radar data, acoustic data, narrow-band radar data and images. OLIMP comprises 407 scenes and 47,354 synchronized frames covering four categories: pedestrian, cyclist, car and tram. The dataset includes various challenges related to dense urban traffic, such as cluttered environments and different weather conditions. To demonstrate the usefulness of the introduced dataset, we propose a fusion framework that combines the four modalities for multi-object detection. The obtained results are promising and encourage future research.