1
|
Khaleel Sallam Ma'aitah M, Helwan A. 3D DenseNet with temporal transition layer for heart rate estimation from real-life RGB videos. Technol Health Care 2024:THC241104. [PMID: 39058471 DOI: 10.3233/thc-241104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2024]
Abstract
BACKGROUND Deep learning has demonstrated superior performance over traditional methods for the estimation of heart rates in controlled contexts. However, in less controlled scenarios this performance seems to vary based on the training dataset and the architecture of the deep learning models. OBJECTIVES In this paper, we develop a deep learning-based model leveraging the power of 3D convolutional neural networks (3DCNN) to extract temporal and spatial features that lead to an accurate heart rates estimation from RGB no pre-defined region of interest (ROI) videos. METHODS We propose a 3D DenseNet with a 3D temporal transition layer for the estimation of heart rates from a large-scale dataset of videos that appear more hospital-like and real-life than other existing facial video-based datasets. RESULTS Experimentally, our model was trained and tested on this less controlled dataset and showed heart rate estimation performance with root mean square error (RMSE) of 8.68 BPM and mean absolute error (MAE) of 3.34 BPM. CONCLUSION Moreover, we show that such a model can also achieve better results than the state-of-the-art models when tested on the VIPL-HR public dataset.
Collapse
|
2
|
Chen W, Yi Z, Lim LJR, Lim RQR, Zhang A, Qian Z, Huang J, He J, Liu B. Deep learning and remote photoplethysmography powered advancements in contactless physiological measurement. Front Bioeng Biotechnol 2024; 12:1420100. [PMID: 39104628 PMCID: PMC11298756 DOI: 10.3389/fbioe.2024.1420100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 06/27/2024] [Indexed: 08/07/2024] Open
Abstract
In recent decades, there has been ongoing development in the application of computer vision (CV) in the medical field. As conventional contact-based physiological measurement techniques often restrict a patient's mobility in the clinical environment, the ability to achieve continuous, comfortable and convenient monitoring is thus a topic of interest to researchers. One type of CV application is remote imaging photoplethysmography (rPPG), which can predict vital signs using a video or image. While contactless physiological measurement techniques have an excellent application prospect, the lack of uniformity or standardization of contactless vital monitoring methods limits their application in remote healthcare/telehealth settings. Several methods have been developed to improve this limitation and solve the heterogeneity of video signals caused by movement, lighting, and equipment. The fundamental algorithms include traditional algorithms with optimization and developing deep learning (DL) algorithms. This article aims to provide an in-depth review of current Artificial Intelligence (AI) methods using CV and DL in contactless physiological measurement and a comprehensive summary of the latest development of contactless measurement techniques for skin perfusion, respiratory rate, blood oxygen saturation, heart rate, heart rate variability, and blood pressure.
Collapse
Affiliation(s)
- Wei Chen
- Department of Hand Surgery, Beijing Jishuitan Hospital, Capital Medical University, Beijing, China
| | - Zhe Yi
- Department of Hand Surgery, Beijing Jishuitan Hospital, Capital Medical University, Beijing, China
| | - Lincoln Jian Rong Lim
- Department of Medical Imaging, Western Health, Footscray Hospital, Footscray, VIC, Australia
- Department of Surgery, The University of Melbourne, Melbourne, VIC, Australia
| | - Rebecca Qian Ru Lim
- Department of Hand & Reconstructive Microsurgery, Singapore General Hospital, Singapore, Singapore
| | - Aijie Zhang
- Department of Hand Surgery, Beijing Jishuitan Hospital, Capital Medical University, Beijing, China
| | - Zhen Qian
- Institute of Intelligent Diagnostics, Beijing United-Imaging Research Institute of Intelligent Imaging, Beijing, China
| | - Jiaxing Huang
- Institute of Automation, Chinese Academy of Sciences, Beijing, China
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
| | - Jia He
- Institute of Automation, Chinese Academy of Sciences, Beijing, China
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
| | - Bo Liu
- Department of Hand Surgery, Beijing Jishuitan Hospital, Capital Medical University, Beijing, China
- Beijing Research Institute of Traumatology and Orthopaedics, Beijing, China
| |
Collapse
|
3
|
Helwan A, Azar D, Ma'aitah MKS. Conventional and deep learning methods in heart rate estimation from RGB face videos. Physiol Meas 2024; 45:02TR01. [PMID: 38081130 DOI: 10.1088/1361-6579/ad1458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 12/11/2023] [Indexed: 02/10/2024]
Abstract
Contactless vital signs monitoring is a fast-advancing scientific field that aims to employ monitoring methods that do not necessitate the use of leads or physical attachments to the patient in order to overcome the shortcomings and limits of traditional monitoring systems. Several traditional methods have been applied to extract the heart rate (HR) signal from the face. Moreover, machine learning has recently contributed majorly to the development of such a field in which deep networks and other deep learning methods are employed to extract the HR signal from RGB face videos. In this paper, we evaluate the state-of-the-art conventional and deep learning methods for HR estimates, focusing on the limits of deep learning methods and the availability of less-controlled face video datasets. We aim to present an extensive review that helps the various approaches of remote photoplethysmography extraction and HR estimation to be understood, in addition to their drawbacks and benefits.
Collapse
Affiliation(s)
| | | | - Mohamad Khaleel Sallam Ma'aitah
- Department of Robotics and Artificial Intelligence Engineering, Faculty of Engineering & Technology, Applied Science Private University, Amman, Jordan
| |
Collapse
|
4
|
Joshi J, Wang K, Cho Y. PhysioKit: An Open-Source, Low-Cost Physiological Computing Toolkit for Single- and Multi-User Studies. SENSORS (BASEL, SWITZERLAND) 2023; 23:8244. [PMID: 37837074 PMCID: PMC10575364 DOI: 10.3390/s23198244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 09/12/2023] [Accepted: 09/29/2023] [Indexed: 10/15/2023]
Abstract
The proliferation of physiological sensors opens new opportunities to explore interactions, conduct experiments and evaluate the user experience with continuous monitoring of bodily functions. Commercial devices, however, can be costly or limit access to raw waveform data, while low-cost sensors are efforts-intensive to setup. To address these challenges, we introduce PhysioKit, an open-source, low-cost physiological computing toolkit. PhysioKit provides a one-stop pipeline consisting of (i) a sensing and data acquisition layer that can be configured in a modular manner per research needs, and (ii) a software application layer that enables data acquisition, real-time visualization and machine learning (ML)-enabled signal quality assessment. This also supports basic visual biofeedback configurations and synchronized acquisition for co-located or remote multi-user settings. In a validation study with 16 participants, PhysioKit shows strong agreement with research-grade sensors on measuring heart rate and heart rate variability metrics data. Furthermore, we report usability survey results from 10 small-project teams (44 individual members in total) who used PhysioKit for 4-6 weeks, providing insights into its use cases and research benefits. Lastly, we discuss the extensibility and potential impact of the toolkit on the research community.
Collapse
|
5
|
Mellouk W, Handouzi W. CNN-LSTM for automatic emotion recognition using contactless photoplythesmographic signals. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
6
|
Al-Zyoud I, Laamarti F, Ma X, Tobón D, El Saddik A. Towards a Machine Learning-Based Digital Twin for Non-Invasive Human Bio-Signal Fusion. SENSORS (BASEL, SWITZERLAND) 2022; 22:9747. [PMID: 36560115 PMCID: PMC9786606 DOI: 10.3390/s22249747] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 11/10/2022] [Accepted: 11/24/2022] [Indexed: 06/17/2023]
Abstract
Human bio-signal fusion is considered a critical technological solution that needs to be advanced to enable modern and secure digital health and well-being applications in the metaverse. To support such efforts, we propose a new data-driven digital twin (DT) system to fuse three human physiological bio-signals: heart rate (HR), breathing rate (BR), and blood oxygen saturation level (SpO2). To accomplish this goal, we design a computer vision technology based on the non-invasive photoplethysmography (PPG) technique to extract raw time-series bio-signal data from facial video frames. Then, we implement machine learning (ML) technology to model and measure the bio-signals. We accurately demonstrate the digital twin capability in the modelling and measuring of three human bio-signals, HR, BR, and SpO2, and achieve strong performance compared to the ground-truth values. This research sets the foundation and the path forward for realizing a holistic human health and well-being DT model for real-world medical applications.
Collapse
Affiliation(s)
- Izaldein Al-Zyoud
- Multimedia Communications Research Laboratory (MCRLab), School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Fedwa Laamarti
- Multimedia Communications Research Laboratory (MCRLab), School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, Canada
- Mohamed bin Zayed University of AI, Abu Dhabi, United Arab Emirates
| | - Xiaocong Ma
- Multimedia Communications Research Laboratory (MCRLab), School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Diana Tobón
- Faculty of Engineering, University of Medellín, Medellín 050010, Colombia
| | - Abdulmotaleb El Saddik
- Multimedia Communications Research Laboratory (MCRLab), School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, Canada
- Mohamed bin Zayed University of AI, Abu Dhabi, United Arab Emirates
| |
Collapse
|
7
|
Pagano TP, dos Santos LL, Santos VR, Sá PHM, Bonfim YDS, Paranhos JVD, Ortega LL, Nascimento LFS, Santos A, Rönnau MM, Winkler I, Nascimento EGS. Remote Heart Rate Prediction in Virtual Reality Head-Mounted Displays Using Machine Learning Techniques. SENSORS (BASEL, SWITZERLAND) 2022; 22:9486. [PMID: 36502188 PMCID: PMC9738680 DOI: 10.3390/s22239486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 11/22/2022] [Accepted: 11/30/2022] [Indexed: 06/17/2023]
Abstract
Head-mounted displays are virtual reality devices that may be equipped with sensors and cameras to measure a patient's heart rate through facial regions. Heart rate is an essential body signal that can be used to remotely monitor users in a variety of situations. There is currently no study that predicts heart rate using only highlighted facial regions; thus, an adaptation is required for beats per minute predictions. Likewise, there are no datasets containing only the eye and lower face regions, necessitating the development of a simulation mechanism. This work aims to remotely estimate heart rate from facial regions that can be captured by the cameras of a head-mounted display using state-of-the-art EVM-CNN and Meta-rPPG techniques. We developed a region of interest extractor to simulate a dataset from a head-mounted display device using stabilizer and video magnification techniques. Then, we combined support vector machine and FaceMash to determine the regions of interest and adapted photoplethysmography and beats per minute signal predictions to work with the other techniques. We observed an improvement of 188.88% for the EVM and 55.93% for the Meta-rPPG. In addition, both models were able to predict heart rate using only facial regions as input. Moreover, the adapted technique Meta-rPPG outperformed the original work, whereas the EVM adaptation produced comparable results for the photoplethysmography signal.
Collapse
Affiliation(s)
- Tiago Palma Pagano
- Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | - Lucas Lisboa dos Santos
- Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | - Victor Rocha Santos
- Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | - Paulo H. Miranda Sá
- Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | - Yasmin da Silva Bonfim
- Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | | | - Lucas Lemos Ortega
- Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | | | - Alexandre Santos
- HP Inc. Brazil R&D, Porto Alegre 90619-900, Rio Grande do Sul, Brazil
| | | | - Ingrid Winkler
- Department of Management and Industrial Technology, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
| | - Erick G. Sperandio Nascimento
- Department of Management and Industrial Technology, SENAI CIMATEC University Center, Salvador 41650-010, Bahia, Brazil
- Faculty of Engineering and Physical Sciences, School of Computer Science and Electronic Engineering, Surrey Institute for People-Centred AI, University of Surrey, Guildford GU2 7XH, UK
| |
Collapse
|
8
|
Shukla P, Akanbi O, Atuah AS, Aljaedi A, Bouye M, Sharma S. Cryptography-Based Medical Signal Securing Using Improved Variation Mode Decomposition with Machine Learning Techniques. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7307552. [PMID: 36131899 PMCID: PMC9484937 DOI: 10.1155/2022/7307552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 07/16/2022] [Accepted: 07/21/2022] [Indexed: 11/20/2022]
Abstract
There is no question about the value that digital signal processing brings to the area of biomedical research. DSP processors are used to sample and process the analog inputs that are received from a human organ. These inputs come from the organ itself. DSP processors, because of their multidimensional data processing nature, are the electrical components that take up the greatest space and use the most power. In this age of digital technology and electronic gizmos, portable biomedical devices represent an essential step forward in technological advancement. Electrocardiogram (ECG) units are among the most common types of biomedical equipment, and their functions are absolutely necessary to the process of saving human life. In the latter part of the 1990s, portable electrocardiogram (ECG) devices began to appear on the market, and research into their signal processing and electronics design capabilities continues today. System-on-chip (SoC) design refers to the process through which the separate computing components of a DSP unit are combined onto a single chip in order to achieve greater power and space efficiency. In the design of biomedical DSP devices, this body of research presents a number of different solutions for reducing power consumption and space requirements. Using serial or parallel data buses, which are often the region that consumes the most power, it is possible to send data between the system-on-chip (SoC) and other components. To cut down on the number of needless switching operations that take place during data transmission, a hybrid solution that makes use of the shift invert bus encoding scheme has been developed. Using a phase-encoded shift invert bus encoding approach, which embeds the two-bit indication lines into a single-bit encoded line, is one way to solve the issue of having two distinct indicator bits. This method reduces the problem. The PESHINV approach is compared to the SHINV method that already exists, and the comparison reveals that the suggested PESHINV method reduces the total power consumption of the encoding circuit by around 30 percent. The computing unit of the DSP processor is the target of further optimization efforts. Virtually, all signal processing methods need memory and multiplier circuits to function properly.
Collapse
Affiliation(s)
| | - Oluwatobi Akanbi
- Computer Science Department, University of Colorado, Colorado Springs, CO 80918, USA
| | - Asakipaam Simon Atuah
- Department of Telecommunication Engineering, KNUST (Kwame Nkrumah University of Science and Technology), Ghana
| | - Amer Aljaedi
- College of Computing and Information Technology, University of Tabuk, Tabuk 71491, Saudi Arabia
| | - Mohamed Bouye
- Department of Mathematics, College of Science, King Khalid University, Abha, Saudi Arabia
| | - Shakti Sharma
- School of Computer Science Engineering & Technology, Bennett University, India
| |
Collapse
|
9
|
Yang X, Zhang Z, Huang Y, Zheng Y, Shen Y. Using a graph-based image segmentation algorithm for remote vital sign estimation and monitoring. Sci Rep 2022; 12:15197. [PMID: 36071124 PMCID: PMC9451121 DOI: 10.1038/s41598-022-19198-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 08/25/2022] [Indexed: 12/02/2022] Open
Abstract
Reliable and contactless measurements of vital signs, such as respiration and heart rate, are still unmet needs in clinical and home settings. Mm-wave radar and video-based technologies are promising, but currently, the signal processing-based vital sign extraction methods are prone to body motion disruptions or illumination variations in the surrounding environment. Here we propose an image segmentation-based method to extract vital signs from the recorded video and mm-wave radar signals. The proposed method analyses time-frequency spectrograms obtained from Short-Time Fourier Transform rather than individual time-domain signals. This leads to much-improved robustness and accuracy of the heart rate and respiration rate extraction over existing methods. The experiments were conducted under pre- and post-exercise conditions and were repeated on multiple individuals. The results are evaluated by using four metrics against the gold standard contact-based measurements. Significant improvements were observed in terms of precision, accuracy, and stability. The performance was reflected by achieving an averaged Pearson correlation coefficient (PCC) of 93.8% on multiple subjects. We believe that the proposed estimation method will help address the needs for the increasingly popular remote cardiovascular sensing and diagnosing posed by Covid-19.
Collapse
Affiliation(s)
- Xingyu Yang
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3GJ, UK
| | - Zijian Zhang
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3GJ, UK
| | - Yi Huang
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3GJ, UK
| | - Yalin Zheng
- Department of Eye and Vision Science, University of Liverpool, Liverpool, L7 8TX, UK
| | - Yaochun Shen
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3GJ, UK.
| |
Collapse
|
10
|
Intelligent Remote Photoplethysmography-Based Methods for Heart Rate Estimation from Face Videos: A Survey. INFORMATICS 2022. [DOI: 10.3390/informatics9030057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Over the last few years, a rich amount of research has been conducted on remote vital sign monitoring of the human body. Remote photoplethysmography (rPPG) is a camera-based, unobtrusive technology that allows continuous monitoring of changes in vital signs and thereby helps to diagnose and treat diseases earlier in an effective manner. Recent advances in computer vision and its extensive applications have led to rPPG being in high demand. This paper specifically presents a survey on different remote photoplethysmography methods and investigates all facets of heart rate analysis. We explore the investigation of the challenges of the video-based rPPG method and extend it to the recent advancements in the literature. We discuss the gap within the literature and suggestions for future directions.
Collapse
|
11
|
Selvaraju V, Spicher N, Swaminathan R, Deserno TM. Unobtrusive Heart Rate Monitoring using Near-Infrared Imaging During Driving. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022; 2022:2967-2971. [PMID: 36085768 DOI: 10.1109/embc48229.2022.9871416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In-vehicle health monitoring allows for continuous vital sign measurement in everyday life. Eventually, this could lead to early detection of cardiovascular diseases. In this work, we propose non-contact heart rate (HR) monitoring utilizing near-infrared (NIR) camera technology. Ten healthy volunteers are monitored in a realistic driving simulator during resting (5 min) and driving (10 min). We synchronously acquire videos using an out-of-the-shelf, low-cost NIR camera and 3-lead electrocardiography (ECG) serves as ground truth. The MediaPipe face detector delivers the region of interest (ROI) and we determine the HR from the peak with maximum amplitude within the power spectrum of skin color changes. We compare video-based with ECG-based HR, resulting in a mean absolute error (MAE) of 7.8 bpm and 13.0 bpm in resting and driving condition, respectively. As we apply only a simple signal processing pipeline without sophisticated filtering, we conclude that NIR camera-based HR measurements enables unobtrusive and non-contact monitoring to a certain extent, but artifacts from subject movement pose a challenge. If these issues can be addressed, continuous vital sign measurement in everyday life could become reality.
Collapse
|
12
|
Hu M, Qian F, Wang X, He L, Guo D, Ren F. Robust Heart Rate Estimation With Spatial–Temporal Attention Network From Facial Videos. IEEE Trans Cogn Dev Syst 2022. [DOI: 10.1109/tcds.2021.3062370] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Min Hu
- School of Computer and Information, Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, China
| | - Fei Qian
- School of Computer and Information, Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, China
| | - Xiaohua Wang
- School of Computer and Information, Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, China
| | - Lei He
- School of Mathematics, Hefei University of Technology, Hefei, China
| | - Dong Guo
- School of Computer and Information, Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, China
| | - Fuji Ren
- Graduate School of Advanced Technology and Science, University of Tokushima, Tokushima, Japan
| |
Collapse
|
13
|
Gupta A, Ravelo-García AG, Dias FM. Availability and performance of face based non-contact methods for heart rate and oxygen saturation estimations: A systematic review. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 219:106771. [PMID: 35390724 DOI: 10.1016/j.cmpb.2022.106771] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 03/03/2022] [Accepted: 03/21/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Consumer-level cameras have provided an advantage of designing cost-effective, non-contact physiological parameters estimation approaches which is not possible with gold standard estimation techniques. This encourages the development of non-contact estimation methods using camera technology. Therefore, this work aims to present a systematic review summarizing the currently existing face-based non-contact methods along with their performance. METHODS This review includes all heart rate (HR) and oxygen saturation (SpO2) studies published in journals and a few reputed conferences, which have compared the proposed estimation methods with one or more standard reference devices. The articles were collected from the following research databases: Institute of Electrical and Electronics Engineers (IEEE), PubMed, Web of Science (WoS), Science Direct, and Association of Computer Machinery (ACM) digital library. All database searches were completed on May 20, 2021. Each study was assessed using a finite set of identified factors for reporting bias. RESULTS Out of 332 identified studies, 32 studies were selected for the final review. Additionally, 18 studies were included by thoroughly checking these studies. 3 out of 50 (6%) studies were performed in clinical conditions, while the remaining studies were carried out on a healthy population. 42 out of 50 (84%) studies have estimated HR, while 5/50 (10%) studies have measured SpO2 only. The remaining three studies have estimated both parameters. The majority of the studies have used 1-3 min videos for estimation. Among the estimation methods, Deep Learning and Independent component analysis (ICA) were used by 11/42 (26.19%) and 9/42 (21.42%) studies, respectively. According to the Bland-Altman analysis, only 8/45 (17.77%) HR studies achieved the clinically accepted error limits whereas, for SpO2, 4/5 (80%) studies have matched the industry standards (±3%). DISCUSSION Deep Learning and ICA have been predominantly used for HR estimations. Among deep learning estimation methods, convolutional neural networks have been employed till date due to their good generalization ability. Most non-contact HR estimation methods need significant improvements to implement these methods in a clinical environment. Furthermore, these methods need to be tested on the subjects suffering from any related disease. SpO2 estimation studies are challenging and need to be tested by conducting hypoxemic events. The authors would encourage reporting the detailed information about the study population, the use of longer videos, and appropriate performance metrics and testing under abnormal HR and SpO2 ranges for future estimation studies.
Collapse
Affiliation(s)
- Ankit Gupta
- Interactive Technologies Institute/Larsys/Madeira Interactive Technologies Institute, Caminho da Penteada, Funchal, 9020-105, Portugal; Universidade da Madeira, Caminho da Penteada, Funchal, 9020-105, Portugal.
| | - Antonio G Ravelo-García
- Interactive Technologies Institute/Larsys/Madeira Interactive Technologies Institute, Caminho da Penteada, Funchal, 9020-105, Portugal; Universidad de Las Palmas de Gran Canaria, C. Juan de Quesada, 30, Las Palmas, 35001, Spain.
| | - Fernando Morgado Dias
- Interactive Technologies Institute/Larsys/Madeira Interactive Technologies Institute, Caminho da Penteada, Funchal, 9020-105, Portugal; Universidade da Madeira, Caminho da Penteada, Funchal, 9020-105, Portugal.
| |
Collapse
|
14
|
Selvaraju V, Spicher N, Wang J, Ganapathy N, Warnecke JM, Leonhardt S, Swaminathan R, Deserno TM. Continuous Monitoring of Vital Signs Using Cameras: A Systematic Review. SENSORS (BASEL, SWITZERLAND) 2022; 22:4097. [PMID: 35684717 PMCID: PMC9185528 DOI: 10.3390/s22114097] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 05/18/2022] [Accepted: 05/18/2022] [Indexed: 02/04/2023]
Abstract
In recent years, noncontact measurements of vital signs using cameras received a great amount of interest. However, some questions are unanswered: (i) Which vital sign is monitored using what type of camera? (ii) What is the performance and which factors affect it? (iii) Which health issues are addressed by camera-based techniques? Following the preferred reporting items for systematic reviews and meta-analyses (PRISMA) statement, we conduct a systematic review of continuous camera-based vital sign monitoring using Scopus, PubMed, and the Association for Computing Machinery (ACM) databases. We consider articles that were published between January 2018 and April 2021 in the English language. We include five vital signs: heart rate (HR), respiratory rate (RR), blood pressure (BP), body skin temperature (BST), and oxygen saturation (SpO2). In total, we retrieve 905 articles and screened them regarding title, abstract, and full text. One hundred and four articles remained: 60, 20, 6, 2, and 1 of the articles focus on HR, RR, BP, BST, and SpO2, respectively, and 15 on multiple vital signs. HR and RR can be measured using red, green, and blue (RGB) and near-infrared (NIR) as well as far-infrared (FIR) cameras. So far, BP and SpO2 are monitored with RGB cameras only, whereas BST is derived from FIR cameras only. Under ideal conditions, the root mean squared error is around 2.60 bpm, 2.22 cpm, 6.91 mm Hg, 4.88 mm Hg, and 0.86 °C for HR, RR, systolic BP, diastolic BP, and BST, respectively. The estimated error for SpO2 is less than 1%, but it increases with movements of the subject and the camera-subject distance. Camera-based remote monitoring mainly explores intensive care, post-anaesthesia care, and sleep monitoring, but also explores special diseases such as heart failure. The monitored targets are newborn and pediatric patients, geriatric patients, athletes (e.g., exercising, cycling), and vehicle drivers. Camera-based techniques monitor HR, RR, and BST in static conditions within acceptable ranges for certain applications. The research gaps are large and heterogeneous populations, real-time scenarios, moving subjects, and accuracy of BP and SpO2 monitoring.
Collapse
Affiliation(s)
- Vinothini Selvaraju
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, D-38106 Braunschweig, Germany; (V.S.); (N.S.); (J.W.); (N.G.); (J.M.W.)
- Non-Invasive Imaging and Diagnostic Laboratory, Biomedical Engineering, Department of Applied Mechanics, Indian Institute of Technology Madras, Chennai 600036, India;
| | - Nicolai Spicher
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, D-38106 Braunschweig, Germany; (V.S.); (N.S.); (J.W.); (N.G.); (J.M.W.)
| | - Ju Wang
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, D-38106 Braunschweig, Germany; (V.S.); (N.S.); (J.W.); (N.G.); (J.M.W.)
| | - Nagarajan Ganapathy
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, D-38106 Braunschweig, Germany; (V.S.); (N.S.); (J.W.); (N.G.); (J.M.W.)
| | - Joana M. Warnecke
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, D-38106 Braunschweig, Germany; (V.S.); (N.S.); (J.W.); (N.G.); (J.M.W.)
| | - Steffen Leonhardt
- Chair for Medical Information Technology, Helmholtz-Institute for Biomedical Engineering, RWTH Aachen University, D-52074 Aachen, Germany;
| | - Ramakrishnan Swaminathan
- Non-Invasive Imaging and Diagnostic Laboratory, Biomedical Engineering, Department of Applied Mechanics, Indian Institute of Technology Madras, Chennai 600036, India;
| | - Thomas M. Deserno
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, D-38106 Braunschweig, Germany; (V.S.); (N.S.); (J.W.); (N.G.); (J.M.W.)
| |
Collapse
|
15
|
Molinaro N, Schena E, Silvestri S, Bonotti F, Aguzzi D, Viola E, Buccolini F, Massaroni C. Contactless Vital Signs Monitoring From Videos Recorded With Digital Cameras: An Overview. Front Physiol 2022; 13:801709. [PMID: 35250612 PMCID: PMC8895203 DOI: 10.3389/fphys.2022.801709] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 01/20/2022] [Indexed: 01/26/2023] Open
Abstract
The measurement of physiological parameters is fundamental to assess the health status of an individual. The contactless monitoring of vital signs may provide benefits in various fields of application, from healthcare and clinical setting to occupational and sports scenarios. Recent research has been focused on the potentiality of camera-based systems working in the visible range (380-750 nm) for estimating vital signs by capturing subtle color changes or motions caused by physiological activities but invisible to human eyes. These quantities are typically extracted from videos framing some exposed body areas (e.g., face, torso, and hands) with adequate post-processing algorithms. In this review, we provided an overview of the physiological and technical aspects behind the estimation of vital signs like respiratory rate, heart rate, blood oxygen saturation, and blood pressure from digital images as well as the potential fields of application of these technologies. Per each vital sign, we provided the rationale for the measurement, a classification of the different techniques implemented for post-processing the original videos, and the main results obtained during various applications or in validation studies. The available evidence supports the premise of digital cameras as an unobtrusive and easy-to-use technology for physiological signs monitoring. Further research is needed to promote the advancements of the technology, allowing its application in a wide range of population and everyday life, fostering a biometrical holistic of the human body (BHOHB) approach.
Collapse
Affiliation(s)
- Nunzia Molinaro
- Unit of Measurements and Biomedical Instrumentation, Departmental Faculty of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy
| | - Emiliano Schena
- Unit of Measurements and Biomedical Instrumentation, Departmental Faculty of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy
| | - Sergio Silvestri
- Unit of Measurements and Biomedical Instrumentation, Departmental Faculty of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy
| | | | - Damiano Aguzzi
- BHOHB – Biometrical Holistic of Human Body S.r.l., Rome, Italy
| | - Erika Viola
- BHOHB – Biometrical Holistic of Human Body S.r.l., Rome, Italy
| | - Fabio Buccolini
- BHOHB – Biometrical Holistic of Human Body S.r.l., Rome, Italy
| | - Carlo Massaroni
- Unit of Measurements and Biomedical Instrumentation, Departmental Faculty of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy
| |
Collapse
|
16
|
|
17
|
Rea T, Kudenchuk PJ, Sayre MR, Doll A, Eisenberg M. Out of hospital cardiac arrest: Past, present, and future. Resuscitation 2021; 165:101-109. [PMID: 34166740 DOI: 10.1016/j.resuscitation.2021.06.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 06/09/2021] [Accepted: 06/17/2021] [Indexed: 10/21/2022]
Abstract
Advances in resuscitation following out-of-hospital cardiac arrest (OHCA) provide an opportunity to improve public health. This review reflects on past developments, present status, and future possibilities using the science-education-implementation framework of the Utstein Formula and the clinical framework of the links in the chain of survival. With the discovery of CPR and defibrillation in the mid 20th century, resuscitation developed a scientific construct for progress. Systems of emergency community response provided operational efficiency to treat OHCA. Contemporary resuscitation involves integrated interventions in the chain of survival: early recognition, early CPR, early defibrillation, expert and timely advanced life support and hospital care, and multidimensional rehabilitation. Implementation of scientific advances is especially challenging given the unexpected nature of OHCA, the need for time-sensitive interventions, and the substantial collective of stakeholders involved in the chain of survival. Systematic measurement provides the foundation to evaluate performance and guide implementation initiatives. For many systems, telecommunicator CPR and high-performance CPR by emergency professionals are accessible, near-term programs to improve OHCA outcome. Smart technologies that activate, coordinate, and/or coach community "volunteers" to accelerate early CPR and defibrillation have conceptual promise, though robust implementation has been achieved by only a handful of systems. Longer-term strategies may leverage technology to develop a high-fidelity "life-detector" or engineer and disseminate a specialized consumer defibrillator designed to bridge care until arrival of professional response.
Collapse
Affiliation(s)
- Thomas Rea
- Department of Medicine, University of Washington, United States
| | | | - Michael R Sayre
- Department of Emergency Medicine, University of Washington, United States
| | - Ann Doll
- Resuscitation Academy, United States
| | - Mickey Eisenberg
- Department of Emergency Medicine, University of Washington, United States.
| |
Collapse
|
18
|
Hassan T, Seus D, Wollenberg J, Weitz K, Kunz M, Lautenbacher S, Garbas JU, Schmid U. Automatic Detection of Pain from Facial Expressions: A Survey. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021; 43:1815-1831. [PMID: 31825861 DOI: 10.1109/tpami.2019.2958341] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Pain sensation is essential for survival, since it draws attention to physical threat to the body. Pain assessment is usually done through self-reports. However, self-assessment of pain is not available in the case of noncommunicative patients, and therefore, observer reports should be relied upon. Observer reports of pain could be prone to errors due to subjective biases of observers. Moreover, continuous monitoring by humans is impractical. Therefore, automatic pain detection technology could be deployed to assist human caregivers and complement their service, thereby improving the quality of pain management, especially for noncommunicative patients. Facial expressions are a reliable indicator of pain, and are used in all observer-based pain assessment tools. Following the advancements in automatic facial expression analysis, computer vision researchers have tried to use this technology for developing approaches for automatically detecting pain from facial expressions. This paper surveys the literature published in this field over the past decade, categorizes it, and identifies future research directions. The survey covers the pain datasets used in the reviewed literature, the learning tasks targeted by the approaches, the features extracted from images and image sequences to represent pain-related information, and finally, the machine learning methods used.
Collapse
|
19
|
Waqar M, Zwiggelaar R, Tiddeman B. Contact-Free Pulse Signal Extraction from Human Face Videos: A Review and New Optimized Filtering Approach. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021; 1317:181-202. [PMID: 33945138 DOI: 10.1007/978-3-030-61125-5_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
In this chapter, we review methods for video-based heart monitoring, from classical signal processing approaches to modern deep learning methods. In addition, we propose a new method for learning an optimal filter that can overcome many of the problems that can affect classical approaches, such as light reflection and subject's movements, at a fraction of the training cost of deep learning approaches. Following the usual procedures for region of interest extraction and tracking, robust skin color estimation and signal pre-processing, we introduce a least-squares error optimal filter, learnt using an established training dataset to estimate the photoplethysmographic (PPG) signal more accurately from the measured color changes over time. This method not only improves the accuracy of heart rate measurement but also resulted in the extraction of a cleaner pulse signal, which could be integrated into many other useful applications such as human biometric recognition or recognition of emotional state. The method was tested on the DEAP dataset and showed improved performance over the best previous classical method on that dataset. The results obtained show that our proposed contact-free heart rate measurement method has significantly improved on existing methods.
Collapse
|
20
|
Stress levels estimation from facial video based on non-contact measurement of pulse wave. ARTIFICIAL LIFE AND ROBOTICS 2020. [DOI: 10.1007/s10015-020-00624-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
21
|
Cohoon TJ, Bhavnani SP. Toward precision health: applying artificial intelligence analytics to digital health biometric datasets. Per Med 2020; 17:307-316. [PMID: 32588726 DOI: 10.2217/pme-2019-0113] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The rapid development of digital health devices has enabled patients to engage in their care to an unprecedented degree and holds the possibility of significantly improving the diagnosis, treatment and monitoring of many medical conditions. Combined with the emergence of artificial intelligence algorithms, biometric datasets produced from these digital health devices present new opportunities to create precision-based, personalized approaches for healthcare delivery. For effective implementation of such innovations to patient care, clinicians will require an understanding of the types of datasets produced from digital health technologies; the types of analytic methods including feature selection, convolution neural networking, and deep learning that can be used to analyze digital data; and how the interpretation of these findings are best translated to patient care. In this perspective, we aim to provide the groundwork for clinicians to be able to apply artificial intelligence to this transformation of healthcare.
Collapse
Affiliation(s)
- Travis J Cohoon
- Department of Medicine, Scripps Clinic, San Diego, CA 92037, USA
| | - Sanjeev P Bhavnani
- Division of Cardiology, Healthcare Innovation & Practice Transformation Laboratory, Scripps Clinic, San Diego, CA 92037, USA
| |
Collapse
|
22
|
Wang D, Yang X, Liu X, Jing J, Fang S. Detail-preserving pulse wave extraction from facial videos using consumer-level camera. BIOMEDICAL OPTICS EXPRESS 2020; 11:1876-1891. [PMID: 32341854 PMCID: PMC7173900 DOI: 10.1364/boe.380646] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 01/29/2020] [Accepted: 02/04/2020] [Indexed: 05/11/2023]
Abstract
With the popularity of smart phones, non-contact video-based vital sign monitoring using a camera has gained increased attention over recent years. Especially, imaging photoplethysmography (IPPG), a technique for extracting pulse waves from videos, conduces to monitor physiological information on a daily basis, including heart rate, respiration rate, blood oxygen saturation, and so on. The main challenge for accurate pulse wave extraction from facial videos is that the facial color intensity change due to cardiovascular activities is subtle and is often badly disturbed by noise, such as illumination variation, facial expression changes, and head movements. Even a tiny interference could bring a big obstacle for pulse wave extraction and reduce the accuracy of the calculated vital signs. In recent years, many novel approaches have been proposed to eliminate noise such as filter banks, adaptive filters, Distance-PPG, and machine learning, but these methods mainly focus on heart rate detection and neglect the retention of useful details of pulse wave. For example, the pulse wave extracted by the filter bank method has no dicrotic wave and approaching sine wave, but dicrotic waves are essential for calculating vital signs like blood viscosity and blood pressure. Therefore, a new framework is proposed to achieve accurate pulse wave extraction that contains mainly two steps: 1) preprocessing procedure to remove baseline offset and high frequency random noise; and 2) a self-adaptive singular spectrum analysis algorithm to obtain cyclical components and remove aperiodic irregular noise. Experimental results show that the proposed method can extract detail-preserved pulse waves from facial videos under realistic situations and outperforms state-of-the-art methods in terms of detail-preserving and real time heart rate estimation. Furthermore, the pulse wave extracted by our approach enabled the non-contact estimation of atrial fibrillation, heart rate variability, blood pressure, as well as other physiological indices that require standard pulse wave.
Collapse
Affiliation(s)
- Dingliang Wang
- School of Computer and Information, Hefei University of Technology, Hefei, 230009, China
| | - Xuezhi Yang
- School of Software, Hefei University of Technology, Hefei, 230009, China
- Anhui Key Laboratory of Industry Safety and Emergency Technology, Hefei, 230009, China
| | - Xuenan Liu
- School of Computer and Information, Hefei University of Technology, Hefei, 230009, China
| | - Jin Jing
- School of Computer and Information, Hefei University of Technology, Hefei, 230009, China
| | - Shuai Fang
- School of Computer and Information, Hefei University of Technology, Hefei, 230009, China
| |
Collapse
|
23
|
Non-Contact Vital Signs Monitoring of Dog and Cat Using a UWB Radar. Animals (Basel) 2020; 10:ani10020205. [PMID: 31991803 PMCID: PMC7070589 DOI: 10.3390/ani10020205] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 01/21/2020] [Accepted: 01/23/2020] [Indexed: 11/17/2022] Open
Abstract
Keywords: cat; dog; vital signs monitoring; radar; ultra-wideband (UWB); variational mode decomposition (VMD).
Collapse
|
24
|
Zhang Y, Tsujikawa M, Onishi Y. Sleep/wake classification via remote PPG signals. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2020; 2019:3226-3230. [PMID: 31946573 DOI: 10.1109/embc.2019.8857097] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
This paper proposes a remote sleep/wake classification method by combining vision-based heart rate (HR) estimation and convolutional neural network (CNN). Instead of inputting the estimated HR with low temporal resolution, remote PPG (Photoplethysmogram) signals, which contain high-temporal-resolution HR information, are input into the CNN. To reduce noise in the remote PPG signals, we propose a dynamic HR filter. Evaluation results show that the dynamic HR filter works more effectively in comparison with the static filter, which helps improve the area under the ROC curve (AUC) to 0.70, which is almost as good as the reference 0.71 for HR from a wearable sensor.
Collapse
|
25
|
O'Reilly T, Webb A. Deconstructing and reconstructing MRI hardware. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2019; 306:134-138. [PMID: 31311711 DOI: 10.1016/j.jmr.2019.07.014] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 04/28/2019] [Accepted: 07/08/2019] [Indexed: 06/10/2023]
Abstract
Three topics concerning future hardware development for clinical MRI are discussed, with the general theme of operational simplification.
Collapse
Affiliation(s)
- Thomas O'Reilly
- C.J.Gorter Center for High Field MRI, Department of Radiology, Leiden University Medical Center, Leiden, the Netherlands
| | - Andrew Webb
- C.J.Gorter Center for High Field MRI, Department of Radiology, Leiden University Medical Center, Leiden, the Netherlands.
| |
Collapse
|