1
|
Fang Y, Leng H, Wang W, Liu D. Multi-level feature encoding algorithm based on FBPSI for heart sound classification. Sci Rep 2024; 14:29132. [PMID: 39587136 PMCID: PMC11589741 DOI: 10.1038/s41598-024-70230-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 08/14/2024] [Indexed: 11/27/2024] Open
Abstract
Analysis of heart sound signals plays an essential role in preventing and diagnosing cardiac diseases. This study proposes a multi-level feature encoding algorithm based on frequency-balanced power spectral intensity for heart sound signal classification. Firstly, a wavelet threshold function is employed to denoise the heart sound signals. Then, the frequency-balanced power spectral intensity envelope is calculated, and an encoder is utilized to extract multi-level features based on the envelope. Finally, an ensemble bagging tree classifier is selected for classification. The experimental data includes binary classification data from the 2016 PhysioNet/CinC Challenge and ternary classification data from the self-collected hypertrophic cardiomyopathy dataset. Results demonstrate that the proposed algorithm performs well, achieving an average classification accuracy of 98.73% for normal and abnormal heart sounds, and 98.12% for normal and two types of hypertrophic cardiomyopathy heart sounds. The proposed method holds significant reference value for the early diagnosis of heart diseases.
Collapse
Affiliation(s)
- Yu Fang
- School of Electrical and Electronic Information, Xihua University, Chengdu, 610039, Sichuan, China
| | - Hongxia Leng
- School of Electrical and Electronic Information, Xihua University, Chengdu, 610039, Sichuan, China
| | - Weibo Wang
- School of Electrical and Electronic Information, Xihua University, Chengdu, 610039, Sichuan, China
| | - Dongbo Liu
- School of Electrical and Electronic Information, Xihua University, Chengdu, 610039, Sichuan, China.
| |
Collapse
|
2
|
Chung TL, Liu YH, Wu PY, Huang JC, Tsai YC, Wang YC, Pan SP, Hsu YL, Chen SC. Prediction of Arteriovenous Access Dysfunction by Mel Spectrogram-based Deep Learning Model. Int J Med Sci 2024; 21:2252-2260. [PMID: 39310268 PMCID: PMC11413895 DOI: 10.7150/ijms.98421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 08/13/2024] [Indexed: 09/25/2024] Open
Abstract
Background: The early detection of arteriovenous (AV) access dysfunction is crucial for maintaining the patency of vascular access. This study aimed to use deep learning to predict AV access malfunction necessitating further vascular management. Methods: This prospective cohort study enrolled prevalent hemodialysis (HD) patients with an AV fistula or AV graft from a single HD center. Their AV access bruit sounds were recorded weekly using an electronic stethoscope from three different sites (arterial needle site, venous needle site, and the midpoint between the arterial and venous needle sites) before HD sessions. The audio signals were converted to Mel spectrograms using Fourier transformation and utilized to develop deep learning models. Three deep learning models, (1) Convolutional Neural Network (CNN), (2) Convolutional Recurrent Neural Network (CRNN), and (3) Vision Transformers-Gate Recurrent Unit (ViT-GRU), were trained and compared to predict the likelihood of dysfunctional AV access. Results: Total 437 audio recordings were obtained from 84 patients. The CNN model outperformed the other models in the test set, with an F1 score of 0.7037 and area under the receiver operating characteristic curve (AUROC) of 0.7112. The Vit-GRU model had high performance in out-of-fold predictions, with an F1 score of 0.7131 and AUROC of 0.7745, but low generalization ability in the test set, with an F1 score of 0.5225 and AUROC of 0.5977. Conclusions: The CNN model based on Mel spectrograms could predict malfunctioning AV access requiring vascular intervention within 10 days. This approach could serve as a useful screening tool for high-risk AV access.
Collapse
Affiliation(s)
- Tung-Ling Chung
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
- Division of Nephrology, Department of Internal Medicine, Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan
| | - Yi-Hsueh Liu
- Graduate Institute of Clinical Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
- Department of Internal Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Division of Cardiology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Pei-Yu Wu
- Department of Internal Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Division of Nephrology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Faculty of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Jiun-Chi Huang
- Department of Internal Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Division of Nephrology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Faculty of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Yi-Chun Tsai
- Division of Nephrology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Division of General Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Faculty of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Yu-Chen Wang
- Muen Biomedical and Optoelectronics Technologies Inc., Taipei, Taiwan
| | - Shan-Pin Pan
- Muen Biomedical and Optoelectronics Technologies Inc., Taipei, Taiwan
| | - Ya-Ling Hsu
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Szu-Chia Chen
- Department of Internal Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Division of Nephrology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Faculty of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| |
Collapse
|
3
|
Yin C, Zheng Y, Ding X, Shi Y, Qin J, Guo X. Detection of Coronary Artery Disease Based on Clinical Phonocardiogram and Multiscale Attention Convolutional Compression Network. IEEE J Biomed Health Inform 2024; 28:1353-1362. [PMID: 38227404 DOI: 10.1109/jbhi.2024.3354832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2024]
Abstract
Heart sound is an important physiological signal that contains rich pathological information related with coronary stenosis. Thus, some machine learning methods are developed to detect coronary artery disease (CAD) based on phonocardiogram (PCG). However, current methods lack sufficient clinical dataset and fail to achieve efficient feature utilization. Besides, the methods require complex processing steps including empirical feature extraction and classifier design. To achieve efficient CAD detection, we propose the multiscale attention convolutional compression network (MACCN) based on clinical PCG dataset. Firstly, PCG dataset including 102 CAD subjects and 82 non-CAD subjects was established. Then, a multiscale convolution structure was developed to catch comprehensive heart sound features and a channel attention module was developed to enhance key features in multiscale attention convolutional block (MACB). Finally, a separate downsampling block was proposed to reduce feature losses. MACCN combining the blocks can automatically extract features without empirical and manual feature selection. It obtains good classification results with accuracy 93.43%, sensitivity 93.44%, precision 93.48%, and F1 score 93.42%. The study implies that MACCN performs effective PCG feature mining aiming for CAD detection. Further, it integrates feature extraction and classification and provides a simplified PCG processing case.
Collapse
|
4
|
Arjoune Y, Nguyen TN, Doroshow RW, Shekhar R. A Noise-Robust Heart Sound Segmentation Algorithm Based on Shannon Energy. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2024; 12:7747-7761. [PMID: 39398361 PMCID: PMC11469632 DOI: 10.1109/access.2024.3351570] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Heart sound segmentation has been shown to improve the performance of artificial intelligence (AI)-based auscultation decision support systems increasingly viewed as a solution to compensate for eroding auscultatory skills and the associated subjectivity. Various segmentation approaches with demonstrated performance can be utilized for this task, but their robustness can suffer in the presence of noise. A noise-robust heart sound segmentation algorithm was developed and its accuracy was tested using two datasets: the CirCor DigiScope Phonocardiogram dataset and an in-house dataset - a heart murmur library collected at the Children's National Hospital (CNH). On the CirCor dataset, our segmentation algorithm marked the boundaries of the primary heart sounds S1 and S2 with an accuracy of 0.28 ms and 0.29 ms, respectively, and correctly identified the actual positive segments with a sensitivity of 97.44%. The algorithm also executed four times faster than a logistic regression hidden semi-Markov model. On the CNH dataset, the algorithm succeeded in 87.4% cases, achieving a 6% increase in segmentation success rate demonstrated by our original Shannon energy-based algorithm. Accurate heart sound segmentation is critical to supporting and accelerating AI research in cardiovascular diseases. The proposed algorithm increases the robustness of heart sound segmentation to noise and viability for clinical use.
Collapse
Affiliation(s)
- Youness Arjoune
- Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC 20010, USA
| | | | - Robin W Doroshow
- Department of Cardiology, Children's National Hospital, Washington, DC 20010, USA
| | - Raj Shekhar
- Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC 20010, USA
| |
Collapse
|