1
|
Al-Zaben A, Al-Fahoum A, Ababneh M, Al-Naami B, Al-Omari G. Improved recovery of cardiac auscultation sounds using modified cosine transform and LSTM-based masking. Med Biol Eng Comput 2024; 62:2485-2497. [PMID: 38627355 DOI: 10.1007/s11517-024-03088-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 04/02/2024] [Indexed: 04/24/2024]
Abstract
Obtaining accurate cardiac auscultation signals, including basic heart sounds (S1 and S2) and subtle signs of disease, is crucial for improving cardiac diagnoses and making the most of telehealth. This research paper introduces an innovative approach that utilizes a modified cosine transform (MCT) and a masking strategy based on long short-term memory (LSTM) to effectively distinguish heart sounds and murmurs from background noise and interfering sounds. The MCT is used to capture the repeated pattern of the heart sounds, while the LSTMs are trained to construct masking based on the repeated MCT spectrum. The proposed strategy's performance in maintaining the clinical relevance of heart sounds continues to demonstrate effectiveness, even in environments marked by increased noise and complex disruptions. The present work highlights the clinical significance and reliability of the suggested methodology through in-depth signal visualization and rigorous statistical performance evaluations. In comparative assessments, the proposed approach has demonstrated superior performance compared to recent algorithms, such as LU-Net and PC-DAE. Furthermore, the system's adaptability to various datasets enhances its reliability and practicality. The suggested method is a potential way to improve the accuracy of cardiovascular diagnostics in an era of rapid advancement in medical signal processing. The proposed approach showed an enhancement in the average signal-to-noise ratio (SNR) by 9.6 dB at an input SNR of - 6 dB and by 3.3 dB at an input SNR of 10 dB. The average signal distortion ratio (SDR) achieved across a variety of input SNR values was 8.56 dB.
Collapse
Affiliation(s)
- Awad Al-Zaben
- Biomedical Engineering Department, Engineering Faculty, Hashemite University, Zarqa, Jordan.
- Biomedical Systems and Medical Informatics Department, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan.
| | - Amjad Al-Fahoum
- Biomedical Systems and Medical Informatics Department, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan
| | - Muhannad Ababneh
- Faculty of Medicine, Interventional Cardiologist, Jordan University of Science and Technology, Irbid, Jordan
| | - Bassam Al-Naami
- Biomedical Engineering Department, Engineering Faculty, Hashemite University, Zarqa, Jordan
| | - Ghadeer Al-Omari
- Biomedical Systems and Medical Informatics Department, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan
| |
Collapse
|
2
|
Fassina L, Muzio FPL, Berboth L, Ötvös J, Faragli A, Alogna A. Prediction of Left Ventricle Pressure Indices Via a Machine Learning Approach Combining ECG, Pulse Oximetry, and Cardiac Sounds: a Preclinical Feasibility Study. J Cardiovasc Transl Res 2024:10.1007/s12265-024-10546-2. [PMID: 39017912 DOI: 10.1007/s12265-024-10546-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 07/10/2024] [Indexed: 07/18/2024]
Abstract
Heart failure (HF) is defined as the inability of the heart to meet body oxygen demand requiring an elevation in left ventricular filling pressures (LVP) to compensate. LVP increase can be assessed in the cardiac catheterization laboratory, but this procedure is invasive and time-consuming to the extent that physicians rather rely on non-invasive diagnostic tools. In this work, we assess the feasibility to develop a novel machine-learning (ML) approach to predict clinically relevant LVP indices. Synchronized invasive (pressure-volume tracings) and non-invasive signals (ECG, pulse oximetry, and cardiac sounds) were collected from anesthetized, closed-chest Göttingen minipigs. Animals were either healthy or had HF with reduced ejection fraction and circa 500 heartbeats were included in the analysis for each animal. The ML algorithm showed excellent prediction of LVP indices estimating, for instance, the end-diastolic pressure with a R2 of 0.955. This novel ML algorithm could assist clinicians in the care of HF patients.
Collapse
Affiliation(s)
- Lorenzo Fassina
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Via Ferrata 5, Pavia, 27100, Italy.
| | - Francesco Paolo Lo Muzio
- Department of Cardiology, Deutsches Herzzentrum Der Charité, Angiology and Intensive Care Medicine, Campus Virchow-Klinikum, Augustenburgerplatz 1, Berlin, 13353, Germany
| | - Leonhard Berboth
- Department of Cardiology, Deutsches Herzzentrum Der Charité, Angiology and Intensive Care Medicine, Campus Virchow-Klinikum, Augustenburgerplatz 1, Berlin, 13353, Germany
| | - Jens Ötvös
- Department of Cardiology, Deutsches Herzzentrum Der Charité, Angiology and Intensive Care Medicine, Campus Virchow-Klinikum, Augustenburgerplatz 1, Berlin, 13353, Germany
| | - Alessandro Faragli
- Department of Cardiology, Deutsches Herzzentrum Der Charité, Angiology and Intensive Care Medicine, Campus Virchow-Klinikum, Augustenburgerplatz 1, Berlin, 13353, Germany
| | - Alessio Alogna
- Department of Cardiology, Deutsches Herzzentrum Der Charité, Angiology and Intensive Care Medicine, Campus Virchow-Klinikum, Augustenburgerplatz 1, Berlin, 13353, Germany.
- DZHK (German Centre for Cardiovascular Research), Partner Site, Berlin, 10785, Germany.
| |
Collapse
|
3
|
Zhou G, Chien C, Chen J, Luan L, Chen Y, Carroll S, Dayton J, Thanjan M, Bayle K, Flynn P. Identifying pediatric heart murmurs and distinguishing innocent from pathologic using deep learning. Artif Intell Med 2024; 153:102867. [PMID: 38723434 DOI: 10.1016/j.artmed.2024.102867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 04/02/2024] [Accepted: 04/03/2024] [Indexed: 06/11/2024]
Abstract
OBJECTIVE To develop a deep learning algorithm to perform multi-class classification of normal pediatric heart sounds, innocent murmurs, and pathologic murmurs. METHODS We prospectively enrolled children under age 18 being evaluated by the Division of Pediatric Cardiology. Parents provided consent for a deidentified recording of their child's heart sounds with a digital stethoscope. Innocent murmurs were validated by a pediatric cardiologist and pathologic murmurs were validated by echocardiogram. To augment our collection of normal heart sounds, we utilized a public database of pediatric heart sound recordings (Oliveira, 2022). We propose two novel approaches for this audio classification task. We train a vision transformer on either Markov transition field or Gramian angular field image representations of the frequency spectrum. We benchmark our results against a ResNet-50 CNN trained on spectrogram images. RESULTS Our final dataset consisted of 366 normal heart sounds, 175 innocent murmurs, and 216 pathologic murmurs. Innocent murmurs collected include Still's murmur, venous hum, and flow murmurs. Pathologic murmurs included ventricular septal defect, tetralogy of Fallot, aortic regurgitation, aortic stenosis, pulmonary stenosis, mitral regurgitation and stenosis, and tricuspid regurgitation. We find that the Vision Transformer consistently outperforms the ResNet-50 on all three image representations, and that the Gramian angular field is the superior image representation for pediatric heart sounds. We calculated a one-vs-rest multi-class ROC curve for each of the three classes. Our best model achieves an area under the curve (AUC) value of 0.92 ± 0.05, 0.83 ± 0.04, and 0.88 ± 0.04 for identifying normal heart sounds, innocent murmurs, and pathologic murmurs, respectively. CONCLUSION We present two novel methods for pediatric heart sound classification, which outperforms the current standard of using a convolutional neural network trained on spectrogram images. To our knowledge, we are the first to demonstrate multi-class classification of pediatric murmurs. Multiclass output affords a more explainable and interpretable model, which can facilitate further model improvement in the downstream model development cycle and enhance clinician trust and therefore adoption.
Collapse
Affiliation(s)
- George Zhou
- Weill Cornell Medicine, New York, NY 10021, USA.
| | - Candace Chien
- Children's Hospital Los Angeles, Los Angeles, CA 90027, USA
| | - Justin Chen
- Staten Island University Hospital, Northwell Health, Staten Island, NY 10305, USA
| | - Lucille Luan
- Teachers College, Columbia University, New York, NY 10027, USA
| | | | - Sheila Carroll
- Division of Pediatric Cardiology, NewYork-Presbyterian Hospital, New York, NY 10021, USA
| | - Jeffrey Dayton
- Division of Pediatric Cardiology, NewYork-Presbyterian Hospital, New York, NY 10021, USA
| | - Maria Thanjan
- Division of Pediatric Cardiology, NewYork-Presbyterian Hospital Queens, New York, NY 11355, USA
| | - Ken Bayle
- Division of Pediatric Cardiology, NewYork-Presbyterian Hospital Queens, New York, NY 11355, USA
| | - Patrick Flynn
- Division of Pediatric Cardiology, NewYork-Presbyterian Hospital, New York, NY 10021, USA
| |
Collapse
|
4
|
De Fazio R, Spongano L, De Vittorio M, Patrono L, Visconti P. Machine Learning Algorithms for Processing and Classifying Unsegmented Phonocardiographic Signals: An Efficient Edge Computing Solution Suitable for Wearable Devices. SENSORS (BASEL, SWITZERLAND) 2024; 24:3853. [PMID: 38931636 PMCID: PMC11207414 DOI: 10.3390/s24123853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 06/04/2024] [Accepted: 06/11/2024] [Indexed: 06/28/2024]
Abstract
The phonocardiogram (PCG) can be used as an affordable way to monitor heart conditions. This study proposes the training and testing of several classifiers based on SVMs (support vector machines), k-NN (k-Nearest Neighbor), and NNs (neural networks) to perform binary ("Normal"/"Pathologic") and multiclass ("Normal", "CAD" (coronary artery disease), "MVP" (mitral valve prolapse), and "Benign" (benign murmurs)) classification of PCG signals, without heart sound segmentation algorithms. Two datasets of 482 and 826 PCG signals from the Physionet/CinC 2016 dataset are used to train the binary and multiclass classifiers, respectively. Each PCG signal is pre-processed, with spike removal, denoising, filtering, and normalization; afterward, it is divided into 5 s frames with a 1 s shift. Subsequently, a feature set is extracted from each frame to train and test the binary and multiclass classifiers. Concerning the binary classification, the trained classifiers yielded accuracies ranging from 92.4 to 98.7% on the test set, with memory occupations from 92.7 kB to 11.1 MB. Regarding the multiclass classification, the trained classifiers achieved accuracies spanning from 95.3 to 98.6% on the test set, occupying a memory portion from 233 kB to 14.1 MB. The NNs trained and tested in this work offer the best trade-off between performance and memory occupation, whereas the trained k-NN models obtained the best performance at the cost of large memory occupation (up to 14.1 MB). The classifiers' performance slightly depends on the signal quality, since a denoising step is performed during pre-processing. To this end, the signal-to-noise ratio (SNR) was acquired before and after the denoising, indicating an improvement between 15 and 30 dB. The trained and tested models occupy relatively little memory, enabling their implementation in resource-limited systems.
Collapse
Affiliation(s)
- Roberto De Fazio
- Department of Innovation Engineering, University of Salento, 73100 Lecce, Italy; (R.D.F.); (L.S.); (M.D.V.); (L.P.)
| | - Lorenzo Spongano
- Department of Innovation Engineering, University of Salento, 73100 Lecce, Italy; (R.D.F.); (L.S.); (M.D.V.); (L.P.)
- Center for Biomolecular Nanotechnologies, Italian Institute of Technology, 73010 Arnesano, Italy
| | - Massimo De Vittorio
- Department of Innovation Engineering, University of Salento, 73100 Lecce, Italy; (R.D.F.); (L.S.); (M.D.V.); (L.P.)
- Center for Biomolecular Nanotechnologies, Italian Institute of Technology, 73010 Arnesano, Italy
| | - Luigi Patrono
- Department of Innovation Engineering, University of Salento, 73100 Lecce, Italy; (R.D.F.); (L.S.); (M.D.V.); (L.P.)
| | - Paolo Visconti
- Department of Innovation Engineering, University of Salento, 73100 Lecce, Italy; (R.D.F.); (L.S.); (M.D.V.); (L.P.)
- Center for Biomolecular Nanotechnologies, Italian Institute of Technology, 73010 Arnesano, Italy
| |
Collapse
|
5
|
Centracchio J, Parlato S, Esposito D, Andreozzi E. Accurate Localization of First and Second Heart Sounds via Template Matching in Forcecardiography Signals. SENSORS (BASEL, SWITZERLAND) 2024; 24:1525. [PMID: 38475062 DOI: 10.3390/s24051525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 02/21/2024] [Accepted: 02/23/2024] [Indexed: 03/14/2024]
Abstract
Cardiac auscultation is an essential part of physical examination and plays a key role in the early diagnosis of many cardiovascular diseases. The analysis of phonocardiography (PCG) recordings is generally based on the recognition of the main heart sounds, i.e., S1 and S2, which is not a trivial task. This study proposes a method for an accurate recognition and localization of heart sounds in Forcecardiography (FCG) recordings. FCG is a novel technique able to measure subsonic vibrations and sounds via small force sensors placed onto a subject's thorax, allowing continuous cardio-respiratory monitoring. In this study, a template-matching technique based on normalized cross-correlation was used to automatically recognize heart sounds in FCG signals recorded from six healthy subjects at rest. Distinct templates were manually selected from each FCG recording and used to separately localize S1 and S2 sounds, as well as S1-S2 pairs. A simultaneously recorded electrocardiography (ECG) trace was used for performance evaluation. The results show that the template matching approach proved capable of separately classifying S1 and S2 sounds in more than 96% of all heartbeats. Linear regression, correlation, and Bland-Altman analyses showed that inter-beat intervals were estimated with high accuracy. Indeed, the estimation error was confined within 10 ms, with negligible impact on heart rate estimation. Heart rate variability (HRV) indices were also computed and turned out to be almost comparable with those obtained from ECG. The preliminary yet encouraging results of this study suggest that the template matching approach based on normalized cross-correlation allows very accurate heart sounds localization and inter-beat intervals estimation.
Collapse
Affiliation(s)
- Jessica Centracchio
- Department of Electrical Engineering and Information Technologies, University of Naples Federico II, Via Claudio, 21, I-80125 Naples, Italy
| | - Salvatore Parlato
- Department of Electrical Engineering and Information Technologies, University of Naples Federico II, Via Claudio, 21, I-80125 Naples, Italy
| | - Daniele Esposito
- Department of Electrical Engineering and Information Technologies, University of Naples Federico II, Via Claudio, 21, I-80125 Naples, Italy
| | - Emilio Andreozzi
- Department of Electrical Engineering and Information Technologies, University of Naples Federico II, Via Claudio, 21, I-80125 Naples, Italy
| |
Collapse
|
6
|
Abbas S, Ojo S, Al Hejaili A, Sampedro GA, Almadhor A, Zaidi MM, Kryvinska N. Artificial intelligence framework for heart disease classification from audio signals. Sci Rep 2024; 14:3123. [PMID: 38326488 PMCID: PMC10850078 DOI: 10.1038/s41598-024-53778-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 02/05/2024] [Indexed: 02/09/2024] Open
Abstract
As cardiovascular disorders are prevalent, there is a growing demand for reliable and precise diagnostic methods within this domain. Audio signal-based heart disease detection is a promising area of research that leverages sound signals generated by the heart to identify and diagnose cardiovascular disorders. Machine learning (ML) and deep learning (DL) techniques are pivotal in classifying and identifying heart disease from audio signals. This study investigates ML and DL techniques to detect heart disease by analyzing noisy sound signals. This study employed two subsets of datasets from the PASCAL CHALLENGE having real heart audios. The research process and visually depict signals using spectrograms and Mel-Frequency Cepstral Coefficients (MFCCs). We employ data augmentation to improve the model's performance by introducing synthetic noise to the heart sound signals. In addition, a feature ensembler is developed to integrate various audio feature extraction techniques. Several machine learning and deep learning classifiers are utilized for heart disease detection. Among the numerous models studied and previous study findings, the multilayer perceptron model performed best, with an accuracy rate of 95.65%. This study demonstrates the potential of this methodology in accurately detecting heart disease from sound signals. These findings present promising opportunities for enhancing medical diagnosis and patient care.
Collapse
Affiliation(s)
- Sidra Abbas
- Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan.
| | - Stephen Ojo
- Department of Electrical and Computer Engineering, College of Engineering Anderson, Anderson, SC, 29621, USA
| | - Abdullah Al Hejaili
- Computer Science Department, Faculty of Computers and Information Technology, University of Tabuk, Tabuk, 71491, Saudi Arabia
| | - Gabriel Avelino Sampedro
- Faculty of Information and Communication Studies, University of the Philippines Open University, Los Baños, 4031, Philippines
- Center for Computational Imaging and Visual Innovations, De La Salle University, 2401 Taft Ave., Malate, 1004, Manila, Philippines
| | - Ahmad Almadhor
- Department of Computer Engineering and Networks, College of Computer and Information Sciences, Jouf University, 72388, Sakaka, Saudi Arabia
| | - Monji Mohamed Zaidi
- Department of Electrical Engineering, College of Engineering, King Khalid University, Abha, Saudi Arabia
| | - Natalia Kryvinska
- Information Systems Department, Faculty of Management, Comenius University in Bratislava, Odbojárov 10, 82005, Bratislava 25, Slovakia.
| |
Collapse
|
7
|
Netto AN, Abraham L, Philip S. HBNET: A blended ensemble model for the detection of cardiovascular anomalies using phonocardiogram. Technol Health Care 2024; 32:1925-1945. [PMID: 38393859 DOI: 10.3233/thc-231290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2024]
Abstract
BACKGROUND Cardiac diseases are highly detrimental illnesses, responsible for approximately 32% of global mortality [1]. Early diagnosis and prompt treatment can reduce deaths caused by cardiac diseases. In paediatric patients, it is challenging for paediatricians to identify functional murmurs and pathological murmurs from heart sounds. OBJECTIVE The study intends to develop a novel blended ensemble model using hybrid deep learning models and softmax regression to classify adult, and paediatric heart sounds into five distinct classes, distinguishing itself as a groundbreaking work in this domain. Furthermore, the research aims to create a comprehensive 5-class paediatric phonocardiogram (PCG) dataset. The dataset includes two critical pathological classes, namely atrial septal defects and ventricular septal defects, along with functional murmurs, pathological and normal heart sounds. METHODS The work proposes a blended ensemble model (HbNet-Heartbeat Network) comprising two hybrid models, CNN-BiLSTM and CNN-LSTM, as base models and Softmax regression as meta-learner. HbNet leverages the strengths of base models and improves the overall PCG classification accuracy. Mel Frequency Cepstral Coefficients (MFCC) capture the crucial audio signal characteristics relevant to the classification. The amalgamation of these two deep learning structures enhances the precision and reliability of PCG classification, leading to improved diagnostic results. RESULTS The HbNet model exhibited excellent results with an average accuracy of 99.72% and sensitivity of 99.3% on an adult dataset, surpassing all the existing state-of-the-art works. The researchers have validated the reliability of the HbNet model by testing it on a real-time paediatric dataset. The paediatric model's accuracy is 86.5%. HbNet detected functional murmur with 100% precision. CONCLUSION The results indicate that the HbNet model exhibits a high level of efficacy in the early detection of cardiac disorders. Results also imply that HbNet has the potential to serve as a valuable tool for the development of decision-support systems that aid medical practitioners in confirming their diagnoses. This method makes it easier for medical professionals to diagnose and initiate prompt treatment while performing preliminary auscultation and reduces unnecessary echocardiograms.
Collapse
Affiliation(s)
- Ann Nita Netto
- Department of Electronics and Communication Engineering, LBS Institute of Technology for Women, APJ Abdul Kalam Technological University, Trivandrum, India
| | - Lizy Abraham
- Department of Electronics and Communication Engineering, LBS Institute of Technology for Women, APJ Abdul Kalam Technological University, Trivandrum, India
| | - Saji Philip
- Department of Cardiology, Thiruvalla Medical Mission Hospital, Thiruvalla, India
| |
Collapse
|
8
|
Liu A, Zhang S, Wang Z, Tang Y, Zhang X, Wang Y. A learnable front-end based efficient channel attention network for heart sound classification. Physiol Meas 2023; 44:095003. [PMID: 37619586 DOI: 10.1088/1361-6579/acf3cf] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 08/24/2023] [Indexed: 08/26/2023]
Abstract
Objective. To enhance the accuracy of heart sound classification, this study aims to overcome the limitations of common models which rely on handcrafted feature extraction. These traditional methods may distort or discard crucial pathological information within heart sounds due to their requirement of tedious parameter settings.Approach.We propose a learnable front-end based Efficient Channel Attention Network (ECA-Net) for heart sound classification. This novel approach optimizes the transformation of waveform-to-spectrogram, enabling adaptive feature extraction from heart sound signals without domain knowledge. The features are subsequently fed into an ECA-Net based convolutional recurrent neural network, which emphasizes informative features and suppresses irrelevant information. To address data imbalance, Focal loss is employed in our model.Main results.Using the well-known public PhysioNet challenge 2016 dataset, our method achieved a classification accuracy of 97.77%, outperforming the majority of previous studies and closely rivaling the best model with a difference of just 0.57%.Significance.The learnable front-end facilitates end-to-end training by replacing the conventional heart sound feature extraction module. This provides a novel and efficient approach for heart sound classification research and applications, enhancing the practical utility of end-to-end models in this field.
Collapse
Affiliation(s)
- Aolei Liu
- School of Optical Information and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China
| | - Sunjie Zhang
- School of Optical Information and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China
| | - Zhe Wang
- School of Optical Information and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China
| | - Yiheng Tang
- School of Optical Information and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China
| | - Xiaoli Zhang
- School of Optical Information and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China
| | - Yongxiong Wang
- School of Optical Information and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China
| |
Collapse
|
9
|
Roy TS, Roy JK, Mandal N. Design of ear-contactless stethoscope and improvement in the performance of deep learning based on CNN to classify the heart sound. Med Biol Eng Comput 2023; 61:2417-2439. [PMID: 37103637 PMCID: PMC10133919 DOI: 10.1007/s11517-023-02827-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 03/18/2023] [Indexed: 04/28/2023]
Abstract
Cardiac-related disorders are rapidly growing throughout the world. Accurate classification of cardiovascular diseases is an important research topic in healthcare. During COVID-19, auscultating heart sounds was challenging as health workers and doctors wear protective clothing, and direct contact with patients can spread the outbreak. Thus, contactless auscultation of heart sound is necessary. In this paper, a low-cost ear contactless stethoscope is designed where auscultation is done with the help of a bluetooth-enabled micro speaker instead of an earpiece. The PCG recordings are further compared with other standard electronic stethoscopes like Littman 3 M. This work is made to improve the performance of deep learning-based classifiers like recurrent neural networks (RNN) and convolutional neural networks (CNN) for different valvular heart problems using tuning of hyperparameters like learning rate of optimizers, dropout rate, and hidden layer. Hyper-parameter tuning is used to optimize the performances of various deep learning models and their learning curves for real-time analysis. The acoustic, time, and frequency domain features are used in this research. The investigation is made on the heart sounds of normal and diseased patients available from the standard data repository to train the software models. The proposed CNN-based inception network model achieved an accuracy of 99.65 ± 0.06% on the test dataset with a sensitivity of 98.8 ± 0.05% and specificity of 98.2 ± 0.19%. The proposed hybrid CNN-RNN architecture attained 91.17 ± 0.03% accuracy on test data after hyperparameter optimization, whereas the LSTM-based RNN model achieved 82.32 ± 0.11% accuracy. Finally, the evaluated results were compared with machine learning algorithms, and the improved CNN-based Inception Net model is the most effective among others.
Collapse
Affiliation(s)
- Tanmay Sinha Roy
- Department of Electrical Engineering, Haldia Institute of Technology, Haldia, West Bengal, India.
| | | | - Nirupama Mandal
- Department of Electronics Engineering, Indian Institute of Technology (ISM), Dhanbad, India
| |
Collapse
|
10
|
Chen X, Li H, Huang Y, Han W, Yu X, Zhang P, Tao R. Heart sound classification based on equal scale frequency cepstral coefficients and deep learning. BIOMED ENG-BIOMED TE 2023:bmt-2021-0254. [PMID: 36780471 DOI: 10.1515/bmt-2021-0254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 01/17/2023] [Indexed: 02/15/2023]
Abstract
Heart diseases represent a serious medical condition that can be fatal. Therefore, it is critical to investigate the measures of its early prevention. The Mel-scale frequency cepstral coefficients (MFCC) feature has been widely used in the early diagnosis of heart abnormity and achieved promising results. During feature extraction, the Mel-scale triangular overlapping filter set is applied, which makes the frequency response more in line with the human auditory property. However, the frequency of the heart sound signals has no specific relationship with the human auditory system, which may not be suitable for processing of heart sound signals. To overcome this issue and obtain a more objective feature that can better adapt to practical use, in this work, we propose an equal scale frequency cepstral coefficients (EFCC) feature based on replacing the Mel-scale filter set with a set of equally spaced triangular overlapping filters. We further designed classifiers combining convolutional neural network (CNN), recurrent neural network (RNN) and random forest (RF) layers, which can extract both the spatial and temporal information of the input features. We evaluated the proposed algorithm on our database and the PhysioNet Computational Cardiology (CinC) 2016 Challenge Database. Results from ten-fold cross-validation reveal that the EFCC-based features show considerably better performance and robustness than the MFCC-based features on the task of classifying heart sounds from novel patients. Our algorithm can be further used in wearable medical devices to monitor the heart status of patients in real time with high precision, which is of great clinical importance.
Collapse
Affiliation(s)
- Xiaoqing Chen
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| | - Hongru Li
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| | - Youhe Huang
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| | - Weiwei Han
- Shijiazhuang First People's Hospital, Shijiazhuang, China
| | - Xia Yu
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| | - Pengfei Zhang
- Hebei Derui Health Technology Co., Ltd, Shijiazhuang, China
| | - Rui Tao
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| |
Collapse
|
11
|
Keikhosrokiani P, Naidu A/P Anathan AB, Iryanti Fadilah S, Manickam S, Li Z. Heartbeat sound classification using a hybrid adaptive neuro-fuzzy inferences system (ANFIS) and artificial bee colony. Digit Health 2023; 9:20552076221150741. [PMID: 36655183 PMCID: PMC9841877 DOI: 10.1177/20552076221150741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 12/26/2022] [Indexed: 01/13/2023] Open
Abstract
Cardiovascular disease is one of the main causes of death worldwide which can be easily diagnosed by listening to the murmur sound of heartbeat sounds using a stethoscope. The murmur sound happens at the Lub-Dub, which indicates there are abnormalities in the heart. However, using the stethoscope for listening to the heartbeat sound requires a long time of training then only the physician can detect the murmuring sound. The existing studies show that young physicians face difficulties in this heart sound detection. Use of computerized methods and data analytics for detection and classification of heartbeat sounds will improve the overall quality of sound detection. Many studies have been worked on classifying the heartbeat sound; however, they lack the method with high accuracy. Therefore, this research aims to classify the heartbeat sound using a novel optimized Adaptive Neuro-Fuzzy Inferences System (ANFIS) by artificial bee colony (ABC). The data is cleaned, pre-processed, and MFCC is extracted from the heartbeat sounds. Then the proposed ABC-ANFIS is used to run the pre-processed heartbeat sound, and accuracy is calculated for the model. The results indicate that the proposed ABC-ANFIS model achieved 93% accuracy for the murmur class. The proposed ABC-ANFIS has higher accuracy in compared to ANFIS, PSO ANFIS, SVM, KSTM, KNN, and other existing studies. Thus, this study can assist physicians to classify heartbeat sounds for detecting cardiovascular disease in the early stages.
Collapse
Affiliation(s)
- Pantea Keikhosrokiani
- School of Computer Sciences, 26689Universiti Sains Malaysia, Minden, Penang, Malaysia,Pantea Keikhosrokiani, School of Computer Sciences, Universiti Sains Malaysia, Minden 11800, Penang, Malaysia.
| | | | - Suzi Iryanti Fadilah
- School of Computer Sciences, 26689Universiti Sains Malaysia, Minden, Penang, Malaysia
| | - Selvakumar Manickam
- National Advanced IPv6 Centre, 26689Universiti Sains Malaysia, Minden, Penang, Malaysia
| | - Zuoyong Li
- College of Computer and Control Engineering, 26465Minjiang University, Fuzhou, China
| |
Collapse
|
12
|
Research of heart sound classification using two-dimensional features. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
13
|
Abbas Q, Hussain A, Baig AR. Automatic Detection and Classification of Cardiovascular Disorders Using Phonocardiogram and Convolutional Vision Transformers. Diagnostics (Basel) 2022; 12:diagnostics12123109. [PMID: 36553116 PMCID: PMC9777096 DOI: 10.3390/diagnostics12123109] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 12/07/2022] [Accepted: 12/08/2022] [Indexed: 12/14/2022] Open
Abstract
The major cause of death worldwide is due to cardiovascular disorders (CVDs). For a proper diagnosis of CVD disease, an inexpensive solution based on phonocardiogram (PCG) signals is proposed. (1) Background: Currently, a few deep learning (DL)-based CVD systems have been developed to recognize different stages of CVD. However, the accuracy of these systems is not up-to-the-mark, and the methods require high computational power and huge training datasets. (2) Methods: To address these issues, we developed a novel attention-based technique (CVT-Trans) on a convolutional vision transformer to recognize and categorize PCG signals into five classes. The continuous wavelet transform-based spectrogram (CWTS) strategy was used to extract representative features from PCG data. Following that, a new CVT-Trans architecture was created to categorize the CWTS signals into five groups. (3) Results: The dataset derived from our investigation indicated that the CVT-Trans system had an overall average accuracy ACC of 100%, SE of 99.00%, SP of 99.5%, and F1-score of 98%, based on 10-fold cross validation. (4) Conclusions: The CVD-Trans technique outperformed many state-of-the-art methods. The robustness of the constructed model was confirmed by 10-fold cross-validation. Cardiologists can use this CVT-Trans system to help patients with the diagnosis of heart valve problems.
Collapse
Affiliation(s)
- Qaisar Abbas
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia
| | - Ayyaz Hussain
- Department of Computer Science, Quaid-i-Azam University, Islamabad 44000, Pakistan
| | - Abdul Rauf Baig
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia
- Correspondence: ; Tel.: +966-563336816
| |
Collapse
|
14
|
Huang Y, Li H, Tao R, Han W, Zhang P, Yu X, Wu R. A customized framework for coronary artery disease detection using phonocardiogram signals. Biomed Signal Process Control 2022. [DOI: 10.1016/j.bspc.2022.103982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
15
|
Zheng Y, Guo X, Wang Y, Qin J, Lv F. A multi-scale and multi-domain heart sound feature-based machine learning model for ACC/AHA heart failure stage classification. Physiol Meas 2022; 43. [PMID: 35512699 DOI: 10.1088/1361-6579/ac6d40] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 05/05/2022] [Indexed: 11/11/2022]
Abstract
OBJECTIVE Heart sounds can reflect detrimental changes in cardiac mechanical activity that are common pathological characteristics of chronic heart failure (CHF). The ACC/AHA heart failure (HF) stage classification is essential for clinical decision-making and the management of CHF. Herein, a machine learning model that makes use of multi-scale and multi-domain heart sound features was proposed to provide an objective aid for ACC/AHA HF stage classification. APPROACH A dataset containing phonocardiogram (PCG) signals from 275 subjects was obtained from two medical institutions and used in this study. Complementary ensemble empirical mode decomposition and tunable-Q wavelet transform were used to construct self-adaptive sub-sequences and multi-level sub-band signals for PCG signals. Time-domain, frequency-domain and nonlinear feature extraction were then applied to the original PCG signal, heart sound sub-sequences and sub-band signals to construct multi-scale and multi-domain heart sound features. The features selected via the least absolute shrinkage and selection operator were fed into a machine learning classifier for ACC/AHA HF stage classification. Finally, mainstream machine learning classifiers, including least-squares support vector machine (LS-SVM), deep belief network (DBN) and random forest (RF), were compared to determine the optimal model. MAIN RESULTS The results showed that the LS-SVM, which utilized a combination of multi-scale and multi-domain features, achieved better classification performance than the DBN and RF using multi-scale or multi-domain features alone or together, with average sensitivity, specificity, and accuracy of 0.821, 0.955 and 0.820 on the testing set, respectively. SIGNIFICANCE PCG signal analysis provides efficient measurement information regarding CHF severity and is a promising noninvasive method for ACC/AHA HF stage classification.
Collapse
Affiliation(s)
- Yineng Zheng
- Chongqing Medical University, 1 Yixueyuan Road, Yuzhong District, Chongqing 400016, P.R.China, Chongqing, Chongqing, 400016, CHINA
| | - Xingming Guo
- Bioengineering College, Chongqing University, Chongqing 400044, Chongqing, 400044, CHINA
| | - Yingying Wang
- Chongqing Medical University, 1 Yixueyuan Road, Yuzhong District, Chongqing 400016, P.R.China, Chongqing, Chongqing, 400016, CHINA
| | - Jian Qin
- Chongqing Medical University, 1 Yixueyuan Road, Yuzhong District, Chongqing 400016, P.R.China, Chongqing, Chongqing, 400016, CHINA
| | - Fajin Lv
- Chongqing Medical University, 1 Yixueyuan Road, Yuzhong District, Chongqing 400016, P.R.China, Chongqing, 400016, CHINA
| |
Collapse
|
16
|
Classifier identification using Deep Learning and Machine Learning Algorithms for the detection of Valvular Heart diseases. BIOMEDICAL ENGINEERING ADVANCES 2022. [DOI: 10.1016/j.bea.2022.100035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
17
|
Automatic detection of heart valve disorders using Teager–Kaiser energy operator, rational-dilation wavelet transform and convolutional neural networks with PCG signals. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10184-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
18
|
Rossi M, Alessandrelli G, Dombrovschi A, Bovio D, Salito C, Mainardi L, Cerveri P. Identification of Characteristic Points in Multivariate Physiological Signals by Sensor Fusion and Multi-Task Deep Networks. SENSORS 2022; 22:s22072684. [PMID: 35408297 PMCID: PMC9003131 DOI: 10.3390/s22072684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 03/28/2022] [Accepted: 03/28/2022] [Indexed: 11/28/2022]
Abstract
Identification of characteristic points in physiological signals, such as the peak of the R wave in the electrocardiogram and the peak of the systolic wave of the photopletismogram, is a fundamental step for the quantification of clinical parameters, such as the pulse transit time. In this work, we presented a novel neural architecture, called eMTUnet, to automate point identification in multivariate signals acquired with a chest-worn device. The eMTUnet consists of a single deep network capable of performing three tasks simultaneously: (i) localization in time of characteristic points (labeling task), (ii) evaluation of the quality of signals (classification task); (iii) estimation of the reliability of classification (reliability task). Preliminary results in overnight monitoring showcased the ability to detect characteristic points in the four signals with a recall index of about 1.00, 0.90, 0.90, and 0.80, respectively. The accuracy of the signal quality classification was about 0.90, on average over four different classes. The average confidence of the correctly classified signals, against the misclassifications, was 0.93 vs. 0.52, proving the worthiness of the confidence index, which may better qualify the point identification. From the achieved outcomes, we point out that high-quality segmentation and classification are both ensured, which brings the use of a multi-modal framework, composed of wearable sensors and artificial intelligence, incrementally closer to clinical translation.
Collapse
Affiliation(s)
- Matteo Rossi
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy; (G.A.); (A.D.); (L.M.)
- Correspondence: (M.R.); (P.C.)
| | - Giulia Alessandrelli
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy; (G.A.); (A.D.); (L.M.)
| | - Andra Dombrovschi
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy; (G.A.); (A.D.); (L.M.)
| | - Dario Bovio
- Biocubica SRL, 20154 Milan, Italy; (D.B.); (C.S.)
| | | | - Luca Mainardi
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy; (G.A.); (A.D.); (L.M.)
| | - Pietro Cerveri
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy; (G.A.); (A.D.); (L.M.)
- Correspondence: (M.R.); (P.C.)
| |
Collapse
|
19
|
Khan KN, Khan FA, Abid A, Olmez T, Dokur Z, Khandakar A, Chowdhury MEH, Khan MS. Deep learning based classification of unsegmented phonocardiogram spectrograms leveraging transfer learning. Physiol Meas 2021; 42. [PMID: 34388736 DOI: 10.1088/1361-6579/ac1d59] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Accepted: 08/13/2021] [Indexed: 11/11/2022]
Abstract
OBJECTIVE Cardiovascular diseases (CVDs) are the main cause of deaths all over the world. This research focuses on computer-aided analysis based on deep learning of phonocardiogram (PCG) signals that can enable improved and timely detection of heart abnormalities. The two widely used publicly available PCG datasets are from the PhysioNet/CinC (2016) and PASCAL (2011) challenges. The datasets are significantly different in terms of the tools used for data acquisition, clinical protocols, digital storages and signal qualities, making it challenging to process and analyze. APPROACH In this work, we have used short-time Fourier transform (STFT) based spectrograms to learn the representative patterns of the normal and abnormal PCG signals. Spectrograms generated from both the datasets are utilized to perform four different studies: (i) train, validate and test different variants of convolutional neural network (CNN) models with PhysioNet dataset, (ii) train, validate and test the best performing CNN structure on PASCAL dataset as well as, (iii) on the combined PhysioNet-PASCAL dataset and (iv) finally, transfer learning technique is employed to train the best performing pre-trained network from the first study with PASCAL dataset. MAIN RESULTS The first study achieves an accuracy, sensitivity, specificity, precision and F1 score of 95.75%, 96.3%, 94.1%, 97.52% , and 96.93% respectively while the second study shows accuracy, sensitivity, specificity, precision and F1 score of 75.25%, 74.2%, 76.4%, 76.73%, and 75.42% respectively. The third study shows accuracy, sensitivity, specificity, precision and F1 score of 92.7%, 94.98%, 89.95%, 95.3% and 94.6% respectively. Finally, the fourth study shows a precision of 96.98% on the noisy PASCAL dataset with transfer learning approach. SIGNIFICANCE The proposed approach employs a less complex and relatively light custom CNN model that outperforms most of the recent competing studies by achieving comparatively high classification accuracy and precision, making it suitable for screening CVDs using PCG signals.
Collapse
Affiliation(s)
- Kaleem Nawaz Khan
- AI in Healthcare, Intelligent Information Processing Lab, National Center of Artificial Intelligence, University of Engineering and Technology, Peshawar, Khyber Pakhtunkhwa, PAKISTAN
| | - Faiq Ahmad Khan
- AI in healthcare, University of Engineering and Technology, Artificial Intelligence in Healthcare, Intelligent Information Processing Lab, National Center of Artificial Intelligence, Basement-New Academic Block, University of Engineering & Technology, Jamrud Road Peshawar, KPK, 25000, Pakistan, Peshawar, PAKISTAN
| | - Anam Abid
- Mechatronics Engineering, University of Engineering and Technology, Peshawar, PAKISTAN
| | - Tamer Olmez
- Department of Electronics and Communication Engineering, Istanbul Technical University, Istanbul, Istanbul, TURKEY
| | - Zümray Dokur
- Department of Electronics and Communication Engineering, Istanbul Technical University, Istanbul, Istanbul, TURKEY
| | - Amith Khandakar
- Electrical Engineering, Qatar University, POBOX 2713, Doha, Doha, Doha, QATAR
| | | | - Muhammad Salman Khan
- Electrical Engineering (JC), University of Engineering and Technology, UET Peshawar, KP, Peshawar, PAKISTAN
| |
Collapse
|
20
|
Chen W, Sun Q, Chen X, Xie G, Wu H, Xu C. Deep Learning Methods for Heart Sounds Classification: A Systematic Review. ENTROPY 2021; 23:e23060667. [PMID: 34073201 PMCID: PMC8229456 DOI: 10.3390/e23060667] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 05/11/2021] [Accepted: 05/14/2021] [Indexed: 01/14/2023]
Abstract
The automated classification of heart sounds plays a significant role in the diagnosis of cardiovascular diseases (CVDs). With the recent introduction of medical big data and artificial intelligence technology, there has been an increased focus on the development of deep learning approaches for heart sound classification. However, despite significant achievements in this field, there are still limitations due to insufficient data, inefficient training, and the unavailability of effective models. With the aim of improving the accuracy of heart sounds classification, an in-depth systematic review and an analysis of existing deep learning methods were performed in the present study, with an emphasis on the convolutional neural network (CNN) and recurrent neural network (RNN) methods developed over the last five years. This paper also discusses the challenges and expected future trends in the application of deep learning to heart sounds classification with the objective of providing an essential reference for further study.
Collapse
Affiliation(s)
- Wei Chen
- Medical School, Nantong University, Nantong 226001, China; (W.C.); (G.X.); (H.W.)
- School of Information Science and Technology, Nantong University, Nantong 226019, China;
| | - Qiang Sun
- School of Information Science and Technology, Nantong University, Nantong 226019, China;
- Correspondence: (Q.S.); (C.X.)
| | - Xiaomin Chen
- School of Information Science and Technology, Nantong University, Nantong 226019, China;
| | - Gangcai Xie
- Medical School, Nantong University, Nantong 226001, China; (W.C.); (G.X.); (H.W.)
| | - Huiqun Wu
- Medical School, Nantong University, Nantong 226001, China; (W.C.); (G.X.); (H.W.)
| | - Chen Xu
- School of Information Science and Technology, Nantong University, Nantong 226019, China;
- Correspondence: (Q.S.); (C.X.)
| |
Collapse
|
21
|
ARORA SHRUTI, JAIN SUSHMA, CHANA INDERVEER. A FUSION FRAMEWORK BASED ON CEPSTRAL DOMAIN FEATURES FROM PHONOCARDIOGRAM TO PREDICT HEART HEALTH STATUS. J MECH MED BIOL 2021. [DOI: 10.1142/s0219519421500342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
A great increase in the number of cardiovascular cases has been a cause of serious concern for the medical experts all over the world today. In order to achieve valuable risk stratification for patients, early prediction of heart health can benefit specialists to make effective decisions. Heart sound signals help to know about the condition of heart of a patient. Motivated by the success of cepstral features in speech signal classification, authors have used here three different cepstral features, viz. Mel-frequency cepstral coefficients (MFCCs), gammatone frequency cepstral coefficients (GFCCs), and Mel-spectrogram for classifying phonocardiogram into normal and abnormal. Existing research has explored only MFCCs and Mel-feature set extensively for classifying the phonocardiogram. However, in this work, the authors have used a fusion of GFCCs with MFCCs and Mel-spectrogram, and achieved a better accuracy score of 0.96 with sensitivity and specificity scores as 0.91 and 0.98, respectively. The proposed model has been validated on the publicly available benchmark dataset PhysioNet 2016.
Collapse
Affiliation(s)
- SHRUTI ARORA
- Computer Science & Engineering Department, Thapar Institute of Engineering & Technology, Patiala, Punjab, India
| | - SUSHMA JAIN
- Computer Science & Engineering Department, Thapar Institute of Engineering & Technology, Patiala, Punjab, India
| | - INDERVEER CHANA
- Computer Science & Engineering Department, Thapar Institute of Engineering & Technology, Patiala, Punjab, India
| |
Collapse
|
22
|
Zeng W, Lin Z, Yuan C, Wang Q, Liu F, Wang Y. Detection of heart valve disorders from PCG signals using TQWT, FA-MVEMD, Shannon energy envelope and deterministic learning. Artif Intell Rev 2021. [DOI: 10.1007/s10462-021-09969-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
23
|
Research on Segmentation and Classification of Heart Sound Signals Based on Deep Learning. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11020651] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The heart sound signal is one of the signals that reflect the health of the heart. Research on the heart sound signal contributes to the early diagnosis and prevention of cardiovascular diseases. As a commonly used deep learning network, convolutional neural network (CNN) has been widely used in images. In this paper, the method of analyzing heart sound through using CNN has been studied. Firstly, the original data set was preprocessed, and then the heart sounds were segmented on U-net, based on the deep CNN. Finally, the classification of heart sounds was completed through CNN. The data from 2016 PhysioNet/CinC Challenge was utilized for algorithm validation, and the following results were obtained. When the heart sound segmented, the overall accuracy rate was 0.991, the accuracy of the first heart sound was 0.991, the accuracy of the systolic period was 0.996, the accuracy of the second heart sound was 0.996, and the accuracy of the diastolic period was 0.997, and the average accuracy rate was 0.995; While in classification, the accuracy was 0.964, the sensitivity was 0.781, and the specificity was 0.873. These results show that deep learning based on CNN shows good performance in the segmentation and classification of the heart sound signal.
Collapse
|