1
|
Zhao Q, Geng S, Wang B, Sun Y, Nie W, Bai B, Yu C, Zhang F, Tang G, Zhang D, Zhou Y, Liu J, Hong S. Deep Learning in Heart Sound Analysis: From Techniques to Clinical Applications. HEALTH DATA SCIENCE 2024; 4:0182. [PMID: 39387057 PMCID: PMC11461928 DOI: 10.34133/hds.0182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 08/09/2024] [Accepted: 08/13/2024] [Indexed: 10/12/2024]
Abstract
Importance: Heart sound auscultation is a routinely used physical examination in clinical practice to identify potential cardiac abnormalities. However, accurate interpretation of heart sounds requires specialized training and experience, which limits its generalizability. Deep learning, a subset of machine learning, involves training artificial neural networks to learn from large datasets and perform complex tasks with intricate patterns. Over the past decade, deep learning has been successfully applied to heart sound analysis, achieving remarkable results and accumulating substantial heart sound data for model training. Although several reviews have summarized deep learning algorithms for heart sound analysis, there is a lack of comprehensive summaries regarding the available heart sound data and the clinical applications. Highlights: This review will compile the commonly used heart sound datasets, introduce the fundamentals and state-of-the-art techniques in heart sound analysis and deep learning, and summarize the current applications of deep learning for heart sound analysis, along with their limitations and areas for future improvement. Conclusions: The integration of deep learning into heart sound analysis represents a significant advancement in clinical practice. The growing availability of heart sound datasets and the continuous development of deep learning techniques contribute to the improvement and broader clinical adoption of these models. However, ongoing research is needed to address existing challenges and refine these technologies for broader clinical use.
Collapse
Affiliation(s)
- Qinghao Zhao
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | | | - Boya Wang
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Oncology,
Peking University Cancer Hospital and Institute, Beijing, China
| | - Yutong Sun
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | - Wenchang Nie
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | - Baochen Bai
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | - Chao Yu
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | - Feng Zhang
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | - Gongzheng Tang
- National Institute of Health Data Science,
Peking University, Beijing, China
- Institute of Medical Technology,
Health Science Center of Peking University, Beijing, China
| | | | - Yuxi Zhou
- Department of Computer Science,
Tianjin University of Technology, Tianjin, China
- DCST, BNRist, RIIT, Institute of Internet Industry,
Tsinghua University, Beijing, China
| | - Jian Liu
- Department of Cardiology,
Peking University People’s Hospital, Beijing, China
| | - Shenda Hong
- National Institute of Health Data Science,
Peking University, Beijing, China
- Institute of Medical Technology,
Health Science Center of Peking University, Beijing, China
| |
Collapse
|
2
|
Zeng Y, Li M, He Z, Zhou L. Segmentation of Heart Sound Signal Based on Multi-Scale Feature Fusion and Multi-Classification of Congenital Heart Disease. Bioengineering (Basel) 2024; 11:876. [PMID: 39329618 PMCID: PMC11428210 DOI: 10.3390/bioengineering11090876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 08/22/2024] [Accepted: 08/27/2024] [Indexed: 09/28/2024] Open
Abstract
Analyzing heart sound signals presents a novel approach for early diagnosis of pediatric congenital heart disease. The existing segmentation algorithms have limitations in accurately distinguishing the first (S1) and second (S2) heart sounds, limiting the diagnostic utility of cardiac cycle data for pediatric pathology assessment. This study proposes a time bidirectional long short-term memory network (TBLSTM) based on multi-scale analysis to segment pediatric heart sound signals according to different cardiac cycles. Mel frequency cepstral coefficients and dynamic characteristics of the heart sound fragments were extracted and input into random forest for multi-classification of congenital heart disease. The segmentation model achieved an overall F1 score of 94.15% on the verification set, with specific F1 scores of 90.25% for S1 and 86.04% for S2. In a situation where the number of cardiac cycles in the heart sound fragments was set to six, the results for multi-classification achieved stabilization. The performance metrics for this configuration were as follows: accuracy of 94.43%, sensitivity of 95.58%, and an F1 score of 94.51%. Furthermore, the segmentation model demonstrates robustness in accurately segmenting pediatric heart sound signals across different heart rates and in the presence of noise. Notably, the number of cardiac cycles in heart sound fragments directly impacts the multi-classification of these heart sound signals.
Collapse
Affiliation(s)
- Yuan Zeng
- Research Center of Fluid Machinery Engineering and Technology, Jiangsu University, Zhenjiang 212013, China; (Y.Z.); (M.L.); (Z.H.)
| | - Mingzhe Li
- Research Center of Fluid Machinery Engineering and Technology, Jiangsu University, Zhenjiang 212013, China; (Y.Z.); (M.L.); (Z.H.)
| | - Zhaoming He
- Research Center of Fluid Machinery Engineering and Technology, Jiangsu University, Zhenjiang 212013, China; (Y.Z.); (M.L.); (Z.H.)
- Department of Mechanical Engineering, Texas Tech University, Lubbock, TX 79411, USA
| | - Ling Zhou
- Research Center of Fluid Machinery Engineering and Technology, Jiangsu University, Zhenjiang 212013, China; (Y.Z.); (M.L.); (Z.H.)
| |
Collapse
|
3
|
Al-Zaben A, Al-Fahoum A, Ababneh M, Al-Naami B, Al-Omari G. Improved recovery of cardiac auscultation sounds using modified cosine transform and LSTM-based masking. Med Biol Eng Comput 2024; 62:2485-2497. [PMID: 38627355 DOI: 10.1007/s11517-024-03088-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 04/02/2024] [Indexed: 04/24/2024]
Abstract
Obtaining accurate cardiac auscultation signals, including basic heart sounds (S1 and S2) and subtle signs of disease, is crucial for improving cardiac diagnoses and making the most of telehealth. This research paper introduces an innovative approach that utilizes a modified cosine transform (MCT) and a masking strategy based on long short-term memory (LSTM) to effectively distinguish heart sounds and murmurs from background noise and interfering sounds. The MCT is used to capture the repeated pattern of the heart sounds, while the LSTMs are trained to construct masking based on the repeated MCT spectrum. The proposed strategy's performance in maintaining the clinical relevance of heart sounds continues to demonstrate effectiveness, even in environments marked by increased noise and complex disruptions. The present work highlights the clinical significance and reliability of the suggested methodology through in-depth signal visualization and rigorous statistical performance evaluations. In comparative assessments, the proposed approach has demonstrated superior performance compared to recent algorithms, such as LU-Net and PC-DAE. Furthermore, the system's adaptability to various datasets enhances its reliability and practicality. The suggested method is a potential way to improve the accuracy of cardiovascular diagnostics in an era of rapid advancement in medical signal processing. The proposed approach showed an enhancement in the average signal-to-noise ratio (SNR) by 9.6 dB at an input SNR of - 6 dB and by 3.3 dB at an input SNR of 10 dB. The average signal distortion ratio (SDR) achieved across a variety of input SNR values was 8.56 dB.
Collapse
Affiliation(s)
- Awad Al-Zaben
- Biomedical Engineering Department, Engineering Faculty, Hashemite University, Zarqa, Jordan.
- Biomedical Systems and Medical Informatics Department, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan.
| | - Amjad Al-Fahoum
- Biomedical Systems and Medical Informatics Department, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan
| | - Muhannad Ababneh
- Faculty of Medicine, Interventional Cardiologist, Jordan University of Science and Technology, Irbid, Jordan
| | - Bassam Al-Naami
- Biomedical Engineering Department, Engineering Faculty, Hashemite University, Zarqa, Jordan
| | - Ghadeer Al-Omari
- Biomedical Systems and Medical Informatics Department, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan
| |
Collapse
|
4
|
Martins ML, Coimbra MT, Renna F. Markov-Based Neural Networks for Heart Sound Segmentation: Using Domain Knowledge in a Principled Way. IEEE J Biomed Health Inform 2023; 27:5357-5368. [PMID: 37672365 DOI: 10.1109/jbhi.2023.3312597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
This work considers the problem of segmenting heart sounds into their fundamental components. We unify statistical and data-driven solutions by introducing Markov-based Neural Networks (MNNs), a hybrid end-to-end framework that exploits Markov models as statistical inductive biases for an Artificial Neural Network (ANN) discriminator. We show that an MNN leveraging a simple one-dimensional Convolutional ANN significantly outperforms two recent purely data-driven solutions for this task in two publicly available datasets: PhysioNet 2016 (Sensitivity: 0.947 ±0.02; Positive Predictive Value : 0.937 ±0.025) and the CirCor DigiScope 2022 (Sensitivity: 0.950 ±0.008; Positive Predictive Value: 0.943 ±0.012). We also propose a novel gradient-based unsupervised learning algorithm that effectively makes the MNN adaptive to unseen datum sampled from unknown distributions. We perform a cross dataset analysis and show that an MNN pre-trained in the CirCor DigiScope 2022 can benefit from an average improvement of 3.90% Positive Predictive Value on unseen observations from the PhysioNet 2016 dataset using this method.
Collapse
|
5
|
Silva A, Teixeira R, Fontes-Carvalho R, Coimbra M, Renna F. On the Impact of Synchronous Electrocardiogram Signals for Heart Sounds Segmentation. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-5. [PMID: 38083715 DOI: 10.1109/embc40787.2023.10341149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
In this paper we study the heart sound segmentation problem using Deep Neural Networks. The impact of available electrocardiogram (ECG) signals in addition to phonocardiogram (PCG) signals is evaluated. To incorporate ECG, two different models considered, which are built upon a 1D U-net - an early fusion one that fuses ECG in an early processing stage, and a late fusion one that averages the probabilities obtained by two networks applied independently on PCG and ECG data. Results show that, in contrast with traditional uses of ECG for PCG gating, early fusion of PCG and ECG information can provide more robust heart sound segmentation. As a proof of concept, we use the publicly available PhysioNet dataset. Validation results provide, on average, a sensitivity of 97.2%, 94.5%, and 95.6% and a Positive Predictive Value of 97.5%, 96.2%, and 96.1% for Early-fusion, Late-fusion, and unimodal (PCG only) models, respectively, showing the advantages of combining both signals at early stages to segment heart sounds.Clinical relevance- Cardiac auscultation is the first line of screening for cardiovascular diseases. Its low cost and simplicity are especially suitable for screening large populations in underprivileged countries. The proposed analysis and algorithm show the potential of effectively including electrocardiogram information to improve heart sound segmentation performance, thus enhancing the capacity of extracting useful information from heart sound recordings.
Collapse
|
6
|
Cheema A, Singh M, Kumar M, Setia G. Combined empirical mode decomposition and phase space reconstruction based psychologically stressed and non-stressed state classification from cardiac sound signals. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
|
7
|
Yang C, Dai N, Wang Z, Cai S, Wang J, Hu N. Cardiopulmonary auscultation enhancement with a two-stage noise cancellation approach. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
8
|
Rezaee K, Khosravi MR, Jabari M, Hesari S, Anari MS, Aghaei F. Graph convolutional network‐based deep feature learning for cardiovascular disease recognition from heart sound signals. INT J INTELL SYST 2022. [DOI: 10.1002/int.23041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Khosro Rezaee
- Department of Biomedical Engineering Meybod University Meybod Iran
| | - Mohammad R. Khosravi
- Shandong Provincial University Laboratory for Protected Horticulture Weifang University of Science and Technology Weifang Shandong China
- Department of Computer Engineering Persian Gulf University Bushehr Iran
| | - Mohammad Jabari
- Faculty of Mechanical Engineering University of Tabriz Tabriz Iran
| | - Shabnam Hesari
- Department of Electrical and Computer Engineering Ferdows Branch Islamic Azad University Ferdows Iran
| | - Maryam Saberi Anari
- Department of Computer Engineering Technical and Vocational University (TVU) Tehran Iran
| | - Fahimeh Aghaei
- Department of Electrical and Electronics Engineering Ozyegin University Istanbul Turkey
| |
Collapse
|
9
|
Xu W, Yu K, Ye J, Li H, Chen J, Yin F, Xu J, Zhu J, Li D, Shu Q. Automatic pediatric congenital heart disease classification based on heart sound signal. Artif Intell Med 2022; 126:102257. [DOI: 10.1016/j.artmed.2022.102257] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 12/31/2021] [Accepted: 02/15/2022] [Indexed: 11/02/2022]
|