1
|
Egeonu D, Jia B. A systematic literature review of computer vision-based biomechanical models for physical workload estimation. ERGONOMICS 2024:1-24. [PMID: 38294701 DOI: 10.1080/00140139.2024.2308705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 01/17/2024] [Indexed: 02/01/2024]
Abstract
Ergonomic risks, driven by strenuous physical demands in complex work settings, are prevalent across industries. Addressing these challenges through detailed assessment and effective interventions enhances safety and employee well-being. Proper and timely measurement of physical workloads is the initial step towards holistic ergonomic control. This study comprehensively explores existing computer vision-based biomechanical analysis methods for workload assessment, assessing their performance against traditional techniques, and categorising them for easier use. Recent strides in artificial intelligence have revolutionised workload assessment, especially in realistic work settings where conventional methods fall short. However, understanding the accuracy, characteristics, and practicality of computer vision-based methods versus traditional approaches remains limited. To bridge this knowledge gap, a literature review along with a meta-analysis was completed in this study to illuminate model accuracy, advantages, and challenges, offering valuable insights for refined technology implementation in diverse work environments.
Collapse
Affiliation(s)
- Darlington Egeonu
- Industrial and Manufacturing Systems Engineering Department, University of Michigan, Dearborn, MI, USA
| | - Bochen Jia
- Industrial and Manufacturing Systems Engineering Department, University of Michigan, Dearborn, MI, USA
| |
Collapse
|
2
|
Li W, Liu X, An K, Qin C, Cheng Y. Table Tennis Track Detection Based on Temporal Feature Multiplexing Network. SENSORS (BASEL, SWITZERLAND) 2023; 23:1726. [PMID: 36772762 PMCID: PMC9921165 DOI: 10.3390/s23031726] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/30/2023] [Accepted: 02/01/2023] [Indexed: 06/18/2023]
Abstract
Recording the trajectory of table tennis balls in real-time enables the analysis of the opponent's attacking characteristics and weaknesses. The current analysis of the ball paths mainly relied on human viewing, which lacked certain theoretical data support. In order to solve the problem of the lack of objective data analysis in the research of table tennis competition, a target detection algorithm-based table tennis trajectory extraction network was proposed to record the trajectory of the table tennis movement in video. The network improved the feature reuse rate in order to achieve a lightweight network and enhance the detection accuracy. The core of the network was the "feature store & return" module, which could store the output of the current network layer and pass the features to the input of the network layer at the next moment to achieve efficient reuse of the features. In this module, the Transformer model was used to secondarily process the features, build the global association information, and enhance the feature richness of the feature map. According to the designed experiments, the detection accuracy of the network was 96.8% for table tennis and 89.1% for target localization. Moreover, the parameter size of the model was only 7.68 MB, and the detection frame rate could reach 634.19 FPS using the hardware for the tests. In summary, the network designed in this paper has the characteristics of both lightweight and high precision in table tennis detection, and the performance of the proposed model significantly outperforms that of the existing models.
Collapse
Affiliation(s)
- Wenjie Li
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 201418, China
| | - Xiangpeng Liu
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 201418, China
| | - Kang An
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 201418, China
| | - Chengjin Qin
- School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yuhua Cheng
- Shanghai Research Institute of Microelectronics, Peking University, Shanghai 201203, China
| |
Collapse
|
3
|
Mundt M, Born Z, Goldacre M, Alderson J. Estimating Ground Reaction Forces from Two-Dimensional Pose Data: A Biomechanics-Based Comparison of AlphaPose, BlazePose, and OpenPose. SENSORS (BASEL, SWITZERLAND) 2022; 23:s23010078. [PMID: 36616676 PMCID: PMC9823796 DOI: 10.3390/s23010078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 12/12/2022] [Accepted: 12/16/2022] [Indexed: 05/14/2023]
Abstract
The adoption of computer vision pose estimation approaches, used to identify keypoint locations which are intended to reflect the necessary anatomical landmarks relied upon by biomechanists for musculoskeletal modelling, has gained increasing traction in recent years. This uptake has been further accelerated by keypoint use as inputs into machine learning models used to estimate biomechanical parameters such as ground reaction forces (GRFs) in the absence of instrumentation required for direct measurement. This study first aimed to investigate the keypoint detection rate of three open-source pose estimation models (AlphaPose, BlazePose, and OpenPose) across varying movements, camera views, and trial lengths. Second, this study aimed to assess the suitability and interchangeability of keypoints detected by each pose estimation model when used as inputs into machine learning models for the estimation of GRFs. The keypoint detection rate of BlazePose was distinctly lower than that of AlphaPose and OpenPose. All pose estimation models achieved a high keypoint detection rate at the centre of an image frame and a lower detection rate in the true sagittal plane camera field of view, compared with slightly anteriorly or posteriorly located quasi-sagittal plane camera views. The three-dimensional ground reaction force, instantaneous loading rate, and peak force for running could be estimated using the keypoints of all three pose estimation models. However, only AlphaPose and OpenPose keypoints could be used interchangeably with a machine learning model trained to estimate GRFs based on AlphaPose keypoints resulting in a high estimation accuracy when OpenPose keypoints were used as inputs and vice versa. The findings of this study highlight the need for further evaluation of computer vision-based pose estimation models for application in biomechanical human modelling, and the limitations of machine learning-based GRF estimation models that rely on 2D keypoints. This is of particular relevance given that machine learning models informing athlete monitoring guidelines are being developed for application related to athlete well-being.
Collapse
Affiliation(s)
- Marion Mundt
- UWA Minderoo Tech & Policy Lab, Law School, The University of Western Australia, Crawley, WA 6009, Australia
- Correspondence:
| | - Zachery Born
- UWA Minderoo Tech & Policy Lab, Law School, The University of Western Australia, Crawley, WA 6009, Australia
| | - Molly Goldacre
- UWA Minderoo Tech & Policy Lab, Law School, The University of Western Australia, Crawley, WA 6009, Australia
| | - Jacqueline Alderson
- UWA Minderoo Tech & Policy Lab, Law School, The University of Western Australia, Crawley, WA 6009, Australia
- Sports Performance Research Institute New Zealand (SPRINZ), Auckland University of Technology, Auckland 1010, New Zealand
| |
Collapse
|
4
|
A Comprehensive Review of Computer Vision in Sports: Open Issues, Future Trends and Research Directions. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12094429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Recent developments in video analysis of sports and computer vision techniques have achieved significant improvements to enable a variety of critical operations. To provide enhanced information, such as detailed complex analysis in sports such as soccer, basketball, cricket, and badminton, studies have focused mainly on computer vision techniques employed to carry out different tasks. This paper presents a comprehensive review of sports video analysis for various applications: high-level analysis such as detection and classification of players, tracking players or balls in sports and predicting the trajectories of players or balls, recognizing the team’s strategies, and classifying various events in sports. The paper further discusses published works in a variety of application-specific tasks related to sports and the present researcher’s views regarding them. Since there is a wide research scope in sports for deploying computer vision techniques in various sports, some of the publicly available datasets related to a particular sport have been discussed. This paper reviews detailed discussion on some of the artificial intelligence (AI) applications, GPU-based work-stations and embedded platforms in sports vision. Finally, this review identifies the research directions, probable challenges, and future trends in the area of visual recognition in sports.
Collapse
|
5
|
A Systematic Review of the Application of Camera-Based Human Pose Estimation in the Field of Sport and Physical Exercise. SENSORS 2021; 21:s21185996. [PMID: 34577204 PMCID: PMC8472911 DOI: 10.3390/s21185996] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 08/30/2021] [Accepted: 09/03/2021] [Indexed: 11/16/2022]
Abstract
Human Pose Estimation (HPE) has received considerable attention during the past years, improving its performance thanks to the use of Deep Learning, and introducing new interesting uses, such as its application in Sport and Physical Exercise (SPE). The aim of this systematic review is to analyze the literature related to the application of HPE in SPE, the available data, methods, performance, opportunities, and challenges. One reviewer applied different inclusion and exclusion criteria, as well as quality metrics, to perform the paper filtering through the paper databases. The Association for Computing Machinery Digital Library, Web of Science, and dblp included more than 500 related papers after the initial filtering, finally resulting in 20. In addition, research was carried out regarding the publicly available data related to this topic. It can be concluded that even if related public data can be found, much more data is needed to be able to obtain good performance in different contexts. In relation with the methods of the authors, the use of general purpose systems as base, such as Openpose, combined with other methods and adaptations to the specific use case can be found. Finally, the limitations, opportunities, and challenges are presented.
Collapse
|
6
|
Javadiha M, Andujar C, Lacasa E, Ric A, Susin A. Estimating Player Positions from Padel High-Angle Videos: Accuracy Comparison of Recent Computer Vision Methods. SENSORS 2021; 21:s21103368. [PMID: 34066162 PMCID: PMC8151013 DOI: 10.3390/s21103368] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 04/27/2021] [Accepted: 05/05/2021] [Indexed: 11/26/2022]
Abstract
The estimation of player positions is key for performance analysis in sport. In this paper, we focus on image-based, single-angle, player position estimation in padel. Unlike tennis, the primary camera view in professional padel videos follows a de facto standard, consisting of a high-angle shot at about 7.6 m above the court floor. This camera angle reduces the occlusion impact of the mesh that stands over the glass walls, and offers a convenient view for judging the depth of the ball and the player positions and poses. We evaluate and compare the accuracy of state-of-the-art computer vision methods on a large set of images from both amateur videos and publicly available videos from the major international padel circuit. The methods we analyze include object detection, image segmentation and pose estimation techniques, all of them based on deep convolutional neural networks. We report accuracy and average precision with respect to manually-annotated video frames. The best results are obtained by top-down pose estimation methods, which offer a detection rate of 99.8% and a RMSE below 5 and 12 cm for horizontal/vertical court-space coordinates (deviations from predicted and ground-truth player positions). These results demonstrate the suitability of pose estimation methods based on deep convolutional neural networks for estimating player positions from single-angle padel videos. Immediate applications of this work include the player and team analysis of the large collection of publicly available videos from international circuits, as well as an inexpensive method to get player positional data in amateur padel clubs.
Collapse
Affiliation(s)
- Mohammadreza Javadiha
- ViRVIG, Universitat Politècnica de Catalunya-BarcelonaTech, Pau Gargallo 14, CS Dept, Edifici U, 08028 Barcelona, Spain;
| | - Carlos Andujar
- ViRVIG, Universitat Politècnica de Catalunya-BarcelonaTech, Jordi Girona 1-3, CS Dept, Edifici Omega, 08034 Barcelona, Spain
- Correspondence:
| | - Enrique Lacasa
- Complex Systems in Sport Research Group, Institut Nacional D’Educacio Fisica de Catalunya (INEFC), University of Lleida (UdL), 25192 Lleida, Spain; (E.L.); (A.R.)
| | - Angel Ric
- Complex Systems in Sport Research Group, Institut Nacional D’Educacio Fisica de Catalunya (INEFC), University of Lleida (UdL), 25192 Lleida, Spain; (E.L.); (A.R.)
| | - Antonio Susin
- Engineering School (ETSEIB), ViRVIG, Universitat Politècnica de Catalunya-BarcelonaTech, Avda. Diagonal 647, 08028 Barcelona, Spain;
| |
Collapse
|