1
|
K M, Syed K. Arrhythmia classification for non-experts using infinite impulse response (IIR)-filter-based machine learning and deep learning models of the electrocardiogram. PeerJ Comput Sci 2024; 10:e1774. [PMID: 38435599 PMCID: PMC10909216 DOI: 10.7717/peerj-cs.1774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 12/04/2023] [Indexed: 03/05/2024]
Abstract
Arrhythmias are a leading cause of cardiovascular morbidity and mortality. Portable electrocardiogram (ECG) monitors have been used for decades to monitor patients with arrhythmias. These monitors provide real-time data on cardiac activity to identify irregular heartbeats. However, rhythm monitoring and wave detection, especially in the 12-lead ECG, make it difficult to interpret the ECG analysis by correlating it with the condition of the patient. Moreover, even experienced practitioners find ECG analysis challenging. All of this is due to the noise in ECG readings and the frequencies at which the noise occurs. The primary objective of this research is to remove noise and extract features from ECG signals using the proposed infinite impulse response (IIR) filter to improve ECG quality, which can be better understood by non-experts. For this purpose, this study used ECG signal data from the Massachusetts Institute of Technology Beth Israel Hospital (MIT-BIH) database. This allows the acquired data to be easily evaluated using machine learning (ML) and deep learning (DL) models and classified as rhythms. To achieve accurate results, we applied hyperparameter (HP)-tuning for ML classifiers and fine-tuning (FT) for DL models. This study also examined the categorization of arrhythmias using different filters and the changes in accuracy. As a result, when all models were evaluated, DenseNet-121 without FT achieved 99% accuracy, while FT showed better results with 99.97% accuracy.
Collapse
Affiliation(s)
- Mallikarjunamallu K
- School of Computer Science and Engineering, VIT-AP University, Amaravati, Andhra Pradesh, India
| | - Khasim Syed
- School of Computer Science and Engineering, VIT-AP University, Amaravati, Andhra Pradesh, India
| |
Collapse
|
2
|
Rahman S, Karmakar C, Natgunanathan I, Yearwood J, Palaniswami M. Robustness of electrocardiogram signal quality indices. J R Soc Interface 2022; 19:20220012. [PMID: 35414211 PMCID: PMC9006023 DOI: 10.1098/rsif.2022.0012] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Electrocardiogram (ECG) signal quality indices (SQIs) are essential for improving diagnostic accuracy and reliability of ECG analysis systems. In various practical applications, the ECG signals are corrupted by different types of noise. These corrupted ECG signals often provide insufficient and incorrect information regarding a patient’s health. To solve this problem, signal quality measurements should be made before an ECG signal is used for decision-making. This paper investigates the robustness of existing popular statistical signal quality indices (SSQIs): relative power of QRS complex (SQIp), skewness (SQIskew), signal-to-noise ratio (SQIsnr), higher order statistics SQI (SQIhos) and peakedness of kurtosis (SQIkur). We analysed the robustness of these SSQIs against different window sizes across diverse datasets. Results showed that the performance of SSQIs considerably fluctuates against varying datasets, whereas the impact of varying window sizes was minimal. This fluctuation occurred due to the use of a static threshold value for classifying noise-free ECG signals from the raw ECG signals. Another drawback of these SSQIs is the bias towards noise-free ECG signals, that limits their usefulness in clinical settings. In summary, the fixed threshold-based SSQIs cannot be used as a robust noise detection system. In order to solve this fixed threshold problem, other techniques can be developed using adaptive thresholds and machine-learning mechanisms.
Collapse
Affiliation(s)
- Saifur Rahman
- School of Information Technology, Deakin University, Geelong 3225, Australia
| | - Chandan Karmakar
- School of Information Technology, Deakin University, Geelong 3225, Australia
| | | | - John Yearwood
- School of Information Technology, Deakin University, Geelong 3225, Australia
| | - Marimuthu Palaniswami
- Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria 3010, Australia
| |
Collapse
|
3
|
Mehrang S, Jafari Tadi M, Knuutila T, Jaakkola J, Jaakola S, Kiviniemi T, Vasankari T, Airaksinen J, Koivisto T, Pänkäälä M. End-to-end sensor fusion and classification of atrial fibrillation using deep neural networks and smartphone mechanocardiography. Physiol Meas 2022; 43. [PMID: 35413698 DOI: 10.1088/1361-6579/ac66ba] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 04/12/2022] [Indexed: 11/12/2022]
Abstract
OBJECTIVE The purpose of this research is to develop a new deep learning framework for detecting atrial fibrillation (AFib), one of the most common heart arrhythmias, by analyzing the heart's mechanical functioning as reflected in seismocardiography (SCG) and gyrocardiography (GCG) signals. Jointly, SCG and GCG constitute the concept of mechanocardiography (MCG), a method used to measure precordial vibrations with the built-in inertial sensors of smartphones. APPROACH We present a modified deep residual neural network model for the classification of sinus rhythm (SR), AFib, and Noise categories from tri-axial SCG and GCG data derived from smartphones. In the model presented, pre-processing including automated early sensor fusion and spatial feature extraction are carried out using attention-based convolutional and residual blocks. Additionally, we use bidirectional long short-term memory layers on top of fully-connected layers to extract both spatial and spatiotemporal features of the multidimensional SCG and GCG signals. The dataset consisted of 728 short measurements recorded from 300 patients. Further, the measurements were divided into disjoint training, validation, and test sets, respectively, of 481 measurements, 140 measurements, and 107 measurements. Prior to ingestion by the model, measurements were split into 10-second segments with 75 percent overlap, pre-processed, and augmented. MAIN RESULTS On the unseen test set, the model delivered average micro- and macro-F1-score of 0.88 (0.87-0.89; 95% CI) and 0.83 (0.83-0.84; 95% CI) for the segment-wise classification as well as 0.95 (0.94-0.96; 95% CI) and 0.95 (0.94-0.96; 95% CI) for the measurement-wise classification, respectively. SIGNIFICANCE Our method not only can effectively fuse SCG and GCG signals but also can identify heart rhythms and abnormalities in the MCG signals with remarkable accuracy.
Collapse
Affiliation(s)
- Saeed Mehrang
- Department of Computing, Turun Yliopisto, Yliopistonmäki, 20500 Vesilinnantie 5, Turku, 20500, FINLAND
| | - Mojtaba Jafari Tadi
- Turun Yliopisto, Yliopistonmäki, 20500 Vesilinnantie 5, Turku, 20500, FINLAND
| | - Timo Knuutila
- Turun Yliopisto, Yliopistonmäki, 20500 Vesilinnantie 5, Turku, 20014, FINLAND
| | - Jussi Jaakkola
- TYKS Turku University Hospital, Hämeentie 11, Turku, Varsinais-Suomi, 20521, FINLAND
| | | | | | - Tuija Vasankari
- Department of Internal Medicine Division of Cardiology, TYKS Turku University Hospital, Hämeentie 11, Turku, Varsinais-Suomi, 20521, FINLAND
| | - Juhani Airaksinen
- Department of Internal Medicine Division of Cardiology, TYKS Turku University Hospital, Hämeentie 11, Turku, Varsinais-Suomi, 20521, FINLAND
| | - Tero Koivisto
- Turun Yliopisto, Yliopistonmäki, 20500 Vesilinnantie 5, Turku, 20500, FINLAND
| | - Mikko Pänkäälä
- Turun Yliopisto, Yliopistonmäki, 20500 Vesilinnantie 5, Turku, 20500, FINLAND
| |
Collapse
|
4
|
Patient contrastive learning: A performant, expressive, and practical approach to electrocardiogram modeling. PLoS Comput Biol 2022; 18:e1009862. [PMID: 35157695 PMCID: PMC8880931 DOI: 10.1371/journal.pcbi.1009862] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 02/25/2022] [Accepted: 01/25/2022] [Indexed: 11/19/2022] Open
Abstract
Supervised machine learning applications in health care are often limited due to a scarcity of labeled training data. To mitigate the effect of small sample size, we introduce a pre-training approach, Patient Contrastive Learning of Representations (PCLR), which creates latent representations of electrocardiograms (ECGs) from a large number of unlabeled examples using contrastive learning. The resulting representations are expressive, performant, and practical across a wide spectrum of clinical tasks. We develop PCLR using a large health care system with over 3.2 million 12-lead ECGs and demonstrate that training linear models on PCLR representations achieves a 51% performance increase, on average, over six training set sizes and four tasks (sex classification, age regression, and the detection of left ventricular hypertrophy and atrial fibrillation), relative to training neural network models from scratch. We also compared PCLR to three other ECG pre-training approaches (supervised pre-training, unsupervised pre-training with an autoencoder, and pre-training using a contrastive multi ECG-segment approach), and show significant performance benefits in three out of four tasks. We found an average performance benefit of 47% over the other models and an average of a 9% performance benefit compared to best model for each task. We release PCLR to enable others to extract ECG representations at https://github.com/broadinstitute/ml4h/tree/master/model_zoo/PCLR. ECGs are a rich source of cardiac health information. Many recent works have shown that deep learning can extract new information from ECGs when there are a sufficient number of labeled data. However, when there are not enough labeled data or a clinician scientist does not have the resources to train a deep learning model from scratch, options are limited. We introduce Patient Contrastive Learning of Representations (PCLR), an approach to train a neural network that extracts representations of ECGs. The only labels required to train PCLR are which ECG comes from which patient. The resulting ECG representations can be used directly in linear models for new tasks without needing to finetune the neural network. We show PCLR is better than using a set of handpicked features for four tasks, and better than three other deep learning approaches for three out of four tasks evaluated. Furthermore, PCLR is better than training a neural network from scratch when training data are limited. PCLR is one of the first attempts at releasing and evaluating a pre-trained ECG model with the purpose of accelerating deep learning ECG research.
Collapse
|
5
|
Designing an Optimum and Reduced Order Filter for Efficient ECG QRS Peak Detection and Classification of Arrhythmia Data. JOURNAL OF HEALTHCARE ENGINEERING 2021. [DOI: 10.1155/2021/6542290] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Electrocardiogram (ECG) is commonly used biological signals that show an important role in cardiac analysis. The interpretation and acquisition of QRS complex are significant measures of ECG data dispensation. The R wave has a vital character in the analysis of cardiac rhythm irregularities as well as in the determination of heart rate variability (HRV). This manuscript is proposed to design a new artificial-intelligence-based approach of QRS peak detection and classification of the ECG data. The design of reduced order IIR filter is proposed for the low pass smoothening of the ECG signal data. The min-max optimization is used for optimizing the filter coefficient to design the reduced order filter. In this research paper, elimination of baseline wondering and the power line interferences from the ECG signal is of main attention. The result presented that the accuracy is increased by around 13% over the basic Pan–Tompkins method and around 8% over the existing FIR-filter-based classification rules.
Collapse
|
6
|
Kozlowski M, Singh S, Ramage G, Rodriguez-Villegas E. Effects of denoising strategies on R-wave detection in ECG analysis. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:373-376. [PMID: 34891312 DOI: 10.1109/embc46164.2021.9629495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The use of ECG in cardiovascular health monitoring is well established. The signal is collected using specialised equipment, capturing the electrical discharge properties of the human heart. This produces a well-structured signal trace, which can be characterised through its peaks and troughs. The signal can then be used by clinicians to diagnose cardiac disorders. However, as with any measuring equipment, the ECG output signal can experience deterioration resulting from noise. This can happen due to environmental interference, human issues or measuring equipment failure, necessitating the development of various denoising strategies to reduce, or remove, the noise. In this paper, we study typically occurring types of noise and implement popular strategies used to rectify them. We also show, that the given strategy's denoising potential is directly related to R-wave detection, and provide best strategies to apply when faced with specific noise type.
Collapse
|
7
|
Niroshana SMI, Zhu X, Nakamura K, Chen W. A fused-image-based approach to detect obstructive sleep apnea using a single-lead ECG and a 2D convolutional neural network. PLoS One 2021; 16:e0250618. [PMID: 33901251 PMCID: PMC8075238 DOI: 10.1371/journal.pone.0250618] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 04/09/2021] [Indexed: 11/18/2022] Open
Abstract
Obstructive sleep apnea (OSA) is a common chronic sleep disorder that disrupts breathing during sleep and is associated with many other medical conditions, including hypertension, coronary heart disease, and depression. Clinically, the standard for diagnosing OSA involves nocturnal polysomnography (PSG). However, this requires expert human intervention and considerable time, which limits the availability of OSA diagnosis in public health sectors. Therefore, electrocardiogram (ECG)-based methods for OSA detection have been proposed to automate the polysomnography procedure and reduce its discomfort. So far, most of the proposed approaches rely on feature engineering, which calls for advanced expert knowledge and experience. This paper proposes a novel fused-image-based technique that detects OSA using only a single-lead ECG signal. In the proposed approach, a convolutional neural network extracts features automatically from images created with one-minute ECG segments. The proposed network comprises 37 layers, including four residual blocks, a dense layer, a dropout layer, and a soft-max layer. In this study, three time-frequency representations, namely the scalogram, the spectrogram, and the Wigner-Ville distribution, were used to investigate the effectiveness of the fused-image-based approach. We found that blending scalogram and spectrogram images further improved the system's discriminative characteristics. Seventy ECG recordings from the PhysioNet Apnea-ECG database were used to train and evaluate the proposed model using 10-fold cross validation. The results of this study demonstrated that the proposed classifier can perform OSA detection with an average accuracy, recall, and specificity of 92.4%, 92.3%, and 92.6%, respectively, for the fused spectral images.
Collapse
Affiliation(s)
- S. M. Isuru Niroshana
- Biomedical Information Engineering Lab, The University of Aizu, Aizuwakamatsu, Fukushima, Japan
| | - Xin Zhu
- Biomedical Information Engineering Lab, The University of Aizu, Aizuwakamatsu, Fukushima, Japan
| | - Keijiro Nakamura
- Division of Cardiovascular Medicine, Ohashi Medical Center, Toho University, Meguro, Tokyo, Japan
| | - Wenxi Chen
- Biomedical Information Engineering Lab, The University of Aizu, Aizuwakamatsu, Fukushima, Japan
| |
Collapse
|