1
|
Bonato P, Feipel V, Corniani G, Arin-Bal G, Leardini A. Position paper on how technology for human motion analysis and relevant clinical applications have evolved over the past decades: Striking a balance between accuracy and convenience. Gait Posture 2024; 113:191-203. [PMID: 38917666 DOI: 10.1016/j.gaitpost.2024.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 05/30/2024] [Accepted: 06/10/2024] [Indexed: 06/27/2024]
Abstract
BACKGROUND Over the past decades, tremendous technological advances have emerged in human motion analysis (HMA). RESEARCH QUESTION How has technology for analysing human motion evolved over the past decades, and what clinical applications has it enabled? METHODS The literature on HMA has been extensively reviewed, focusing on three main approaches: Fully-Instrumented Gait Analysis (FGA), Wearable Sensor Analysis (WSA), and Deep-Learning Video Analysis (DVA), considering both technical and clinical aspects. RESULTS FGA techniques relying on data collected using stereophotogrammetric systems, force plates, and electromyographic sensors have been dramatically improved providing highly accurate estimates of the biomechanics of motion. WSA techniques have been developed with the advances in data collection at home and in community settings. DVA techniques have emerged through artificial intelligence, which has marked the last decade. Some authors have considered WSA and DVA techniques as alternatives to "traditional" HMA techniques. They have suggested that WSA and DVA techniques are destined to replace FGA. SIGNIFICANCE We argue that FGA, WSA, and DVA complement each other and hence should be accounted as "synergistic" in the context of modern HMA and its clinical applications. We point out that DVA techniques are especially attractive as screening techniques, WSA methods enable data collection in the home and community for extensive periods of time, and FGA does maintain superior accuracy and should be the preferred technique when a complete and highly accurate biomechanical data is required. Accordingly, we envision that future clinical applications of HMA would favour screening patients using DVA in the outpatient setting. If deemed clinically appropriate, then WSA would be used to collect data in the home and community to derive relevant information. If accurate kinetic data is needed, then patients should be referred to specialized centres where an FGA system is available, together with medical imaging and thorough clinical assessments.
Collapse
Affiliation(s)
- Paolo Bonato
- Dept of PM&R, Harvard Medical School, Spaulding Rehabilitation Hospital, Boston, USA
| | - Véronique Feipel
- Laboratory of Functional Anatomy, Faculty of Motor Sciences, Laboratory of Anatomy, Biomechanics and Organogenesis, Faculty of Medicine, Université Libre de Bruxelles, Brussels, Belgium
| | - Giulia Corniani
- Dept of PM&R, Harvard Medical School, Spaulding Rehabilitation Hospital, Boston, USA
| | - Gamze Arin-Bal
- Faculty of Physical Therapy and Rehabilitation, Hacettepe University, Ankara, Turkey; Movement Analysis Laboratory, IRCCS Istituto Ortopedico Rizzoli, Bologna, Italy.
| | - Alberto Leardini
- Movement Analysis Laboratory, IRCCS Istituto Ortopedico Rizzoli, Bologna, Italy
| |
Collapse
|
2
|
Hulleck AA, AlShehhi A, El Rich M, Khan R, Katmah R, Mohseni M, Arjmand N, Khalaf K. BlazePose-Seq2Seq: Leveraging Regular RGB Cameras for Robust Gait Assessment. IEEE Trans Neural Syst Rehabil Eng 2024; 32:1715-1724. [PMID: 38648155 DOI: 10.1109/tnsre.2024.3391908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
Evaluation of human gait through smartphone-based pose estimation algorithms provides an attractive alternative to costly lab-bound instrumented assessment and offers a paradigm shift with real time gait capture for clinical assessment. Systems based on smart phones, such as OpenPose and BlazePose have demonstrated potential for virtual motion assessment but still lack the accuracy and repeatability standards required for clinical viability. Seq2seq architecture offers an alternative solution to conventional deep learning techniques for predicting joint kinematics during gait. This study introduces a novel enhancement to the low-powered BlazePose algorithm by incorporating a Seq2seq autoencoder deep learning model. To ensure data accuracy and reliability, synchronized motion capture involving an RGB camera and ten Vicon cameras were employed across three distinct self-selected walking speeds. This investigation presents a groundbreaking avenue for remote gait assessment, harnessing the potential of Seq2seq architectures inspired by natural language processing (NLP) to enhance pose estimation accuracy. When comparing BlazePose alone to the combination of BlazePose and 1D convolution Long Short-term Memory Network (1D-LSTM), Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM), the average mean absolute errors decreased from 13.4° to 5.3° for fast gait, from 16.3° to 7.5° for normal gait, and from 15.5° to 7.5° for slow gait at the left ankle joint angle respectively. The strategic utilization of synchronized data and rigorous testing methodologies further bolsters the robustness and credibility of these findings.
Collapse
|
3
|
Yang J, Park K. Improving Gait Analysis Techniques with Markerless Pose Estimation Based on Smartphone Location. Bioengineering (Basel) 2024; 11:141. [PMID: 38391625 PMCID: PMC10886083 DOI: 10.3390/bioengineering11020141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 01/25/2024] [Accepted: 01/29/2024] [Indexed: 02/24/2024] Open
Abstract
Marker-based 3D motion capture systems, widely used for gait analysis, are accurate but have disadvantages such as cost and accessibility. Whereas markerless pose estimation has emerged as a convenient and cost-effective alternative for gait analysis, challenges remain in achieving optimal accuracy. Given the limited research on the effects of camera location and orientation on data collection accuracy, this study investigates how camera placement affects gait assessment accuracy utilizing five smartphones. This study aimed to explore the differences in data collection accuracy between marker-based systems and pose estimation, as well as to assess the impact of camera location and orientation on accuracy in pose estimation. The results showed that the differences in joint angles between pose estimation and marker-based systems are below 5°, an acceptable level for gait analysis, with a strong correlation between the two datasets supporting the effectiveness of pose estimation in gait analysis. In addition, hip and knee angles were accurately measured at the front diagonal of the subject and ankle angle at the lateral side. This research highlights the significance of careful camera placement for reliable gait analysis using pose estimation, serving as a concise reference to guide future efforts in enhancing the quantitative accuracy of gait analysis.
Collapse
Affiliation(s)
- Junhyuk Yang
- Department of Mechatronics Engineering, Incheon National University, Incheon 22012, Republic of Korea
| | - Kiwon Park
- Department of Mechatronics Engineering, Incheon National University, Incheon 22012, Republic of Korea
| |
Collapse
|
4
|
Clemente C, Chambel G, Silva DCF, Montes AM, Pinto JF, da Silva HP. Feasibility of 3D Body Tracking from Monocular 2D Video Feeds in Musculoskeletal Telerehabilitation. SENSORS (BASEL, SWITZERLAND) 2023; 24:206. [PMID: 38203068 PMCID: PMC10781343 DOI: 10.3390/s24010206] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 12/20/2023] [Accepted: 12/21/2023] [Indexed: 01/12/2024]
Abstract
Musculoskeletal conditions affect millions of people globally; however, conventional treatments pose challenges concerning price, accessibility, and convenience. Many telerehabilitation solutions offer an engaging alternative but rely on complex hardware for body tracking. This work explores the feasibility of a model for 3D Human Pose Estimation (HPE) from monocular 2D videos (MediaPipe Pose) in a physiotherapy context, by comparing its performance to ground truth measurements. MediaPipe Pose was investigated in eight exercises typically performed in musculoskeletal physiotherapy sessions, where the Range of Motion (ROM) of the human joints was the evaluated parameter. This model showed the best performance for shoulder abduction, shoulder press, elbow flexion, and squat exercises. Results have shown a MAPE ranging between 14.9% and 25.0%, Pearson's coefficient ranging between 0.963 and 0.996, and cosine similarity ranging between 0.987 and 0.999. Some exercises (e.g., seated knee extension and shoulder flexion) posed challenges due to unusual poses, occlusions, and depth ambiguities, possibly related to a lack of training data. This study demonstrates the potential of HPE from monocular 2D videos, as a markerless, affordable, and accessible solution for musculoskeletal telerehabilitation approaches. Future work should focus on exploring variations of the 3D HPE models trained on physiotherapy-related datasets, such as the Fit3D dataset, and post-preprocessing techniques to enhance the model's performance.
Collapse
Affiliation(s)
- Carolina Clemente
- Instituto Superior Técnico (IST), Department of Bioengineering (DBE), Av. Rovisco Pais n. 1, 1049-001 Lisboa, Portugal
- CLYNXIO, LDA, Rua Augusto Macedo, n. 6, 5 Dto., 1600-794 Lisboa, Portugal
| | - Gonçalo Chambel
- CLYNXIO, LDA, Rua Augusto Macedo, n. 6, 5 Dto., 1600-794 Lisboa, Portugal
| | - Diogo C. F. Silva
- Department of Physiotherapy, Santa Maria Health School, Trav. Antero de Quental 173/175, 4049-024 Porto, Portugal; (D.C.F.S.); (A.M.M.)
- Department of Functional Sciences, School of Health, Polytechnic Institute of Porto, Rua Dr. António Bernardino de Almeida 400, 4200-072 Porto, Portugal
- Center for Rehabilitation Research, School of Health, Polytechnic Institute of Porto, Rua Dr. António Bernardino de Almeida 400, 4200-072 Porto, Portugal
| | - António Mesquita Montes
- Department of Physiotherapy, Santa Maria Health School, Trav. Antero de Quental 173/175, 4049-024 Porto, Portugal; (D.C.F.S.); (A.M.M.)
- Center for Rehabilitation Research, School of Health, Polytechnic Institute of Porto, Rua Dr. António Bernardino de Almeida 400, 4200-072 Porto, Portugal
- Department of Physiotherapy, School of Health, Polytechnic Institute of Porto, Rua Dr. António Bernardino de Almeida 400, 4200-072 Porto, Portugal
| | - Joana F. Pinto
- CLYNXIO, LDA, Rua Augusto Macedo, n. 6, 5 Dto., 1600-794 Lisboa, Portugal
| | - Hugo Plácido da Silva
- Instituto Superior Técnico (IST), Department of Bioengineering (DBE), Av. Rovisco Pais n. 1, 1049-001 Lisboa, Portugal
- Instituto de Telecomunicações (IT), Av. Rovisco Pais n. 1, Torre Norte—Piso 10, 1049-001 Lisboa, Portugal
- Lisbon Unit for Learning and Intelligent Systems (LUMLIS), European Laboratory for Learning and Intelligent Systems (ELLIS), Av. Rovisco Pais n. 1, 1049-001 Lisboa, Portugal
| |
Collapse
|
5
|
Li Y, Wu Y, Chen X, Chen H, Kong D, Tang H, Li S. Beyond Human Detection: A Benchmark for Detecting Common Human Posture. SENSORS (BASEL, SWITZERLAND) 2023; 23:8061. [PMID: 37836891 PMCID: PMC10574885 DOI: 10.3390/s23198061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 09/01/2023] [Accepted: 09/20/2023] [Indexed: 10/15/2023]
Abstract
Human detection is the task of locating all instances of human beings present in an image, which has a wide range of applications across various fields, including search and rescue, surveillance, and autonomous driving. The rapid advancement of computer vision and deep learning technologies has brought significant improvements in human detection. However, for more advanced applications like healthcare, human-computer interaction, and scene understanding, it is crucial to obtain information beyond just the localization of humans. These applications require a deeper understanding of human behavior and state to enable effective and safe interactions with humans and the environment. This study presents a comprehensive benchmark, the Common Human Postures (CHP) dataset, aimed at promoting a more informative and more encouraging task beyond mere human detection. The benchmark dataset comprises a diverse collection of images, featuring individuals in different environments, clothing, and occlusions, performing a wide range of postures and activities. The benchmark aims to enhance research in this challenging task by designing novel and precise methods specifically for it. The CHP dataset consists of 5250 human images collected from different scenes, annotated with bounding boxes for seven common human poses. Using this well-annotated dataset, we have developed two baseline detectors, namely CHP-YOLOF and CHP-YOLOX, building upon two identity-preserved human posture detectors: IPH-YOLOF and IPH-YOLOX. We evaluate the performance of these baseline detectors through extensive experiments. The results demonstrate that these baseline detectors effectively detect human postures on the CHP dataset. By releasing the CHP dataset, we aim to facilitate further research on human pose estimation and to attract more researchers to focus on this challenging task.
Collapse
Affiliation(s)
| | | | | | | | | | - Haihua Tang
- Guangxi Key Laboratory of Embedded Technology and Intelligent Information Processing, College of Information Science and Engineering, Guilin University of Technology, Guilin 541006, China
| | | |
Collapse
|
6
|
Tang W, van Ooijen PMA, Sival DA, Maurits NM. 2D Gait Skeleton Data Normalization for Quantitative Assessment of Movement Disorders from Freehand Single Camera Video Recordings. SENSORS (BASEL, SWITZERLAND) 2022; 22:4245. [PMID: 35684866 PMCID: PMC9185346 DOI: 10.3390/s22114245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 05/30/2022] [Accepted: 05/31/2022] [Indexed: 06/15/2023]
Abstract
Overlapping phenotypic features between Early Onset Ataxia (EOA) and Developmental Coordination Disorder (DCD) can complicate the clinical distinction of these disorders. Clinical rating scales are a common way to quantify movement disorders but in children these scales also rely on the observer's assessment and interpretation. Despite the introduction of inertial measurement units for objective and more precise evaluation, special hardware is still required, restricting their widespread application. Gait video recordings of movement disorder patients are frequently captured in routine clinical settings, but there is presently no suitable quantitative analysis method for these recordings. Owing to advancements in computer vision technology, deep learning pose estimation techniques may soon be ready for convenient and low-cost clinical usage. This study presents a framework based on 2D video recording in the coronal plane and pose estimation for the quantitative assessment of gait in movement disorders. To allow the calculation of distance-based features, seven different methods to normalize 2D skeleton keypoint data derived from pose estimation using deep neural networks applied to freehand video recording of gait were evaluated. In our experiments, 15 children (five EOA, five DCD and five healthy controls) were asked to walk naturally while being videotaped by a single camera in 1280 × 720 resolution at 25 frames per second. The high likelihood of the prediction of keypoint locations (mean = 0.889, standard deviation = 0.02) demonstrates the potential for distance-based features derived from routine video recordings to assist in the clinical evaluation of movement in EOA and DCD. By comparison of mean absolute angle error and mean variance of distance, the normalization methods using the Euclidean (2D) distance of left shoulder and right hip, or the average distance from left shoulder to right hip and from right shoulder to left hip were found to better perform for deriving distance-based features and further quantitative assessment of movement disorders.
Collapse
Affiliation(s)
- Wei Tang
- Department of Neurology, University Medical Center Groningen, University of Groningen, P.O. Box 30001, 9700 RB Groningen, The Netherlands;
| | - Peter M. A. van Ooijen
- Data Science Center in Health, University Medical Center Groningen, University of Groningen, P.O. Box 30001, 9700 RB Groningen, The Netherlands;
| | - Deborah A. Sival
- Department of Pediatric Neurology, University Medical Center Groningen, University of Groningen, P.O. Box 30001, 9700 RB Groningen, The Netherlands;
| | - Natasha M. Maurits
- Department of Neurology, University Medical Center Groningen, University of Groningen, P.O. Box 30001, 9700 RB Groningen, The Netherlands;
| |
Collapse
|