1
|
Lella KK, Jagadeesh MS, Alphonse PJA. Artificial intelligence-based framework to identify the abnormalities in the COVID-19 disease and other common respiratory diseases from digital stethoscope data using deep CNN. Health Inf Sci Syst 2024; 12:22. [PMID: 38469455 PMCID: PMC10924857 DOI: 10.1007/s13755-024-00283-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 02/21/2024] [Indexed: 03/13/2024] Open
Abstract
The utilization of lung sounds to diagnose lung diseases using respiratory sound features has significantly increased in the past few years. The Digital Stethoscope data has been examined extensively by medical researchers and technical scientists to diagnose the symptoms of respiratory diseases. Artificial intelligence-based approaches are applied in the real universe to distinguish respiratory disease signs from human pulmonary auscultation sounds. The Deep CNN model is implemented with combined multi-feature channels (Modified MFCC, Log Mel, and Soft Mel) to obtain the sound parameters from lung-based Digital Stethoscope data. The model analysis is observed with max-pooling and without max-pool operations using multi-feature channels on respiratory digital stethoscope data. In addition, COVID-19 sound data and enriched data, which are recently acquired data to enhance model performance using a combination of L2 regularization to overcome the risk of overfitting because of less respiratory sound data, are included in the work. The suggested DCNN with Max-Pooling on the improved dataset demonstrates cutting-edge performance employing a multi-feature channels spectrogram. The model has been developed with different convolutional filter sizes (1 × 12 , 1 × 24 , 1 × 36 , 1 × 48 , and 1 × 60 ) that helped to test the proposed neural network. According to the experimental findings, the suggested DCNN architecture with a max-pooling function performs better to identify respiratory disease symptoms than DCNN without max-pooling. In order to demonstrate the model's effectiveness in categorization, it is trained and tested with the DCNN model that extract several modalities of respiratory sound data.
Collapse
Affiliation(s)
- Kranthi Kumar Lella
- School of Computer Science and Engineering, VIT-AP University, Vijayawada, Guntur, Andhra Pradesh 522237 India
| | - M. S. Jagadeesh
- School of Computer Science and Engineering, VIT-AP University, Vijayawada, Guntur, Andhra Pradesh 522237 India
| | - P. J. A. Alphonse
- Department of Computer Applications, NIT Tiruchirappalli, Tiruchirappalli, Guntur, Tamil Nadu 620015 India
| |
Collapse
|
2
|
Chen Z, Liang N, Li H, Zhang H, Li H, Yan L, Hu Z, Chen Y, Zhang Y, Wang Y, Ke D, Shi N. Exploring explainable AI features in the vocal biomarkers of lung disease. Comput Biol Med 2024; 179:108844. [PMID: 38981214 DOI: 10.1016/j.compbiomed.2024.108844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 05/15/2024] [Accepted: 06/04/2024] [Indexed: 07/11/2024]
Abstract
This review delves into the burgeoning field of explainable artificial intelligence (XAI) in the detection and analysis of lung diseases through vocal biomarkers. Lung diseases, often elusive in their early stages, pose a significant public health challenge. Recent advancements in AI have ushered in innovative methods for early detection, yet the black-box nature of many AI models limits their clinical applicability. XAI emerges as a pivotal tool, enhancing transparency and interpretability in AI-driven diagnostics. This review synthesizes current research on the application of XAI in analyzing vocal biomarkers for lung diseases, highlighting how these techniques elucidate the connections between specific vocal features and lung pathology. We critically examine the methodologies employed, the types of lung diseases studied, and the performance of various XAI models. The potential for XAI to aid in early detection, monitor disease progression, and personalize treatment strategies in pulmonary medicine is emphasized. Furthermore, this review identifies current challenges, including data heterogeneity and model generalizability, and proposes future directions for research. By offering a comprehensive analysis of explainable AI features in the context of lung disease detection, this review aims to bridge the gap between advanced computational approaches and clinical practice, paving the way for more transparent, reliable, and effective diagnostic tools.
Collapse
Affiliation(s)
- Zhao Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ning Liang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Haoyuan Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Haili Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Huizhen Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Lijiao Yan
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ziteng Hu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yaxin Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yujing Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yanping Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Dandan Ke
- Special Disease Clinic, Huaishuling Branch of Beijing Fengtai Hospital of Integrated Traditional Chinese and Western Medicine, Beijing, China.
| | - Nannan Shi
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China; Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China.
| |
Collapse
|
3
|
Shen J, Zhang X, Lu Y, Ye P, Zhang P, Yan Y. Novel audio characteristic-dependent feature extraction and data augmentation methods for cough-based respiratory disease classification. Comput Biol Med 2024; 179:108843. [PMID: 39029433 DOI: 10.1016/j.compbiomed.2024.108843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 05/08/2024] [Accepted: 06/04/2024] [Indexed: 07/21/2024]
Abstract
Respiratory diseases are one of the major health problems worldwide. Early diagnosis of the disease types is of vital importance. As one of the main symptoms of many respiratory diseases, cough may contain information about different pathological changes in the respiratory system. Therefore, many researchers have used cough sounds to diagnose different diseases through artificial intelligence in recent years. The acoustic features and data augmentation methods commonly used in speech tasks are used to achieve better performance. Although these methods are applicable, previous studies have not considered the characteristics of cough sound signals. In this paper, we designed a cough-based respiratory disease classification system and proposed audio characteristic-dependent feature extraction and data augmentation methods. Firstly, according to the short durations and rapid transition of different cough stages, we proposed maximum overlapping mel-spectrogram to avoid missing inter-frame information caused by traditional framing methods. Secondly, we applied various data augmentation methods to mitigate the problem of limited labeled data. Based on the frequency energy distributions of different diseased cough audios, we proposed a parameter-independent self-energy-based augmentation method to enhance the differences between different frequency bands. Finally, in the model testing stage, we leveraged test-time augmentation to further improve the classification performance by fusing the test results of the original and multiple augmented audios. The proposed methods were validated on the Coswara dataset through stratified four-fold cross-validation. Compared to the baseline model using mel-spectrogram as input, the proposed methods achieved an average absolute performance improvement of 3.33% and 3.10% in macro Area Under the Receiver Operating Characteristic (macro AUC) and Unweighted Average Recall (UAR), respectively. The visualization results through Gradient-weighted Class Activation Mapping (Grad-CAM) showed the contributions of different features to model decisions.
Collapse
Affiliation(s)
- Jiakun Shen
- Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Xueshuai Zhang
- Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Yu Lu
- Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing, China.
| | - Pengfei Ye
- Children's Hospital Capital Institute of Pediatrics, Beijing, China.
| | - Pengyuan Zhang
- Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Yonghong Yan
- Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
4
|
Mousavi M, Hosseini S. A deep convolutional neural network approach using medical image classification. BMC Med Inform Decis Mak 2024; 24:239. [PMID: 39210320 PMCID: PMC11360845 DOI: 10.1186/s12911-024-02646-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Accepted: 08/22/2024] [Indexed: 09/04/2024] Open
Abstract
The epidemic diseases such as COVID-19 are rapidly spreading all around the world. The diagnosis of epidemic at initial stage is of high importance to provide medical care to and recovery of infected people as well as protecting the uninfected population. In this paper, an automatic COVID-19 detection model using respiratory sound and medical image based on internet of health things (IoHT) is proposed. In this model, primarily to screen those people having suspected Coronavirus disease, the sound of coughing used to detect healthy people and those suffering from COVID-19, which finally obtained an accuracy of 94.999%. This approach not only expedites diagnosis and enhances accuracy but also facilitates swift screening in public places using simple equipment. Then, in the second step, in order to help radiologists to interpret medical images as best as possible, we use three pre-trained convolutional neural network models InceptionResNetV2, InceptionV3 and EfficientNetB4 and two data sets of chest radiology medical images, and CT Scan in a three-class classification. Utilizing transfer learning and pre-existing knowledge in these models leads to notable improvements in disease diagnosis and identification compared to traditional techniques. Finally, the best result obtained for CT-Scan images belonging to InceptionResNetV2 architecture with 99.414% accuracy and for radiology images related to InceptionV3 and EfficientNetB4 architectures with the accuracy is 96.943%. Therefore, the proposed model can help radiology specialists to confirm the initial assessments of the COVID-19 disease.
Collapse
Affiliation(s)
- Mohammad Mousavi
- Department of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Soodeh Hosseini
- Department of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman, Iran.
| |
Collapse
|
5
|
Zhang S, Zhang Y, Liao Y, Pang K, Wan Z, Zhou S. Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024; 21:2004-2023. [PMID: 38454672 DOI: 10.3934/mbe.2024089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
Sound event localization and detection have been applied in various fields. Due to the polyphony and noise interference, it becomes challenging to accurately predict the sound event and their occurrence locations. Aiming at this problem, we propose a Multiple Attention Fusion ResNet, which uses ResNet34 as the base network. Given the situation that the sound duration is not fixed, and there are multiple polyphonic and noise, we introduce the Gated Channel Transform to enhance the residual basic block. This enables the model to capture contextual information, evaluate channel weights, and reduce the interference caused by polyphony and noise. Furthermore, Split Attention is introduced to the model for capturing cross-channel information, which enhances the ability to distinguish the polyphony. Finally, Coordinate Attention is introduced to the model so that the model can focus on both the channel information and spatial location information of sound events. Experiments were conducted on two different datasets, TAU-NIGENS Spatial Sound Events 2020, and TAU-NIGENS Spatial Sound Events 2021. The results demonstrate that the proposed model significantly outperforms state-of-the-art methods under multiple polyphonic and noise-directional interference environments and it achieves competitive performance under a single polyphonic environment.
Collapse
Affiliation(s)
- Shouming Zhang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
| | - Yaling Zhang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
- Institute of Intelligent Manufacturing, Guangdong Academy of Science, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510030, China
| | - Yixiao Liao
- Institute of Intelligent Manufacturing, Guangdong Academy of Science, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510030, China
| | - Kunkun Pang
- Institute of Intelligent Manufacturing, Guangdong Academy of Science, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510030, China
| | - Zhiyong Wan
- Institute of Intelligent Manufacturing, Guangdong Academy of Science, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510030, China
| | - Songbin Zhou
- Institute of Intelligent Manufacturing, Guangdong Academy of Science, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510030, China
| |
Collapse
|
6
|
Huang ML, Huang ZB. An ensemble-acute lymphoblastic leukemia model for acute lymphoblastic leukemia image classification. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024; 21:1959-1978. [PMID: 38454670 DOI: 10.3934/mbe.2024087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
The timely diagnosis of acute lymphoblastic leukemia (ALL) is of paramount importance for enhancing the treatment efficacy and the survival rates of patients. In this study, we seek to introduce an ensemble-ALL model for the image classification of ALL, with the goal of enhancing early diagnostic capabilities and streamlining the diagnostic and treatment processes for medical practitioners. In this study, a publicly available dataset is partitioned into training, validation, and test sets. A diverse set of convolutional neural networks, including InceptionV3, EfficientNetB4, ResNet50, CONV_POOL-CNN, ALL-CNN, Network in Network, and AlexNet, are employed for training. The top-performing four individual models are meticulously chosen and integrated with the squeeze-and-excitation (SE) module. Furthermore, the two most effective SE-embedded models are harmoniously combined to create the proposed ensemble-ALL model. This model leverages the Bayesian optimization algorithm to enhance its performance. The proposed ensemble-ALL model attains remarkable accuracy, precision, recall, F1-score, and kappa scores, registering at 96.26, 96.26, 96.26, 96.25, and 91.36%, respectively. These results surpass the benchmarks set by state-of-the-art studies in the realm of ALL image classification. This model represents a valuable contribution to the field of medical image recognition, particularly in the diagnosis of acute lymphoblastic leukemia, and it offers the potential to enhance the efficiency and accuracy of medical professionals in the diagnostic and treatment processes.
Collapse
Affiliation(s)
- Mei-Ling Huang
- Department of Industrial Engineering & Management, National Chin-Yi University of Technology, Taichung, Taiwan
| | - Zong-Bin Huang
- Department of Industrial Engineering & Management, National Chin-Yi University of Technology, Taichung, Taiwan
| |
Collapse
|
7
|
El-Sherif DM, Ahmed AA, Sharif AF, Elzarif MT, Abouzid M. Greenway of Digital Health Technology During COVID-19 Crisis: Bibliometric Analysis, Challenges, and Future Perspective. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2024; 1458:315-334. [PMID: 39102206 DOI: 10.1007/978-3-031-61943-4_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/06/2024]
Abstract
Digital health has transformed the healthcare landscape by leveraging technology to improve patient outcomes and access to medical services. The COVID-19 pandemic has highlighted the urgent need for digital healthcare solutions that can mitigate the impact of the outbreak while ensuring patient safety. In this chapter, we delve into how digital health technologies such as telemedicine, mobile apps, and wearable devices can provide personalized care, reduce healthcare provider burden, and lower healthcare costs. We also explore the creation of a greenway of digital healthcare that safeguards patient confidentiality, enables efficient communication, and ensures cost-effective payment systems. This chapter showcases the potential of digital health to revolutionize healthcare delivery while ensuring patient well-being and medical staff satisfaction.
Collapse
Affiliation(s)
- Dina M El-Sherif
- Collaborative Innovation Center of Food Safety and Quality Control in Jiangsu Province, School of Food Science and Technology, Jiangnan University, Wuxi, 214122, China.
- National Institute of Oceanography and Fisheries (NIOF), Cairo, Egypt.
| | - Alhassan Ali Ahmed
- Department of Bioinformatics and Computational Biology, Poznan University of Medical Sciences, 60-781, Poznan, Poland
- Doctoral School, Poznan University of Medical Sciences, 60-812, Poznan, Poland
| | - Asmaa Fady Sharif
- Department of Forensic Medicine and Clinical Toxicology, Faculty of Medicine, Tanta University, Tanta, Egypt
- Clinical Medical Sciences Department, College of Medicine, Dar Al-Uloom University, Riyadh, Saudi Arabia
| | | | - Mohamed Abouzid
- Doctoral School, Poznan University of Medical Sciences, 60-812, Poznan, Poland
- Department of Physical Pharmacy and Pharmacokinetics, Faculty of Pharmacy, Poznan University of Medical Sciences, Rokietnicka 3 St., 60-806, Poznan, Poland
| |
Collapse
|
8
|
Im S, Kim T, Min C, Kang S, Roh Y, Kim C, Kim M, Kim SH, Shim K, Koh JS, Han S, Lee J, Kim D, Kang D, Seo S. Real-time counting of wheezing events from lung sounds using deep learning algorithms: Implications for disease prediction and early intervention. PLoS One 2023; 18:e0294447. [PMID: 37983213 PMCID: PMC10659186 DOI: 10.1371/journal.pone.0294447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 10/23/2023] [Indexed: 11/22/2023] Open
Abstract
This pioneering study aims to revolutionize self-symptom management and telemedicine-based remote monitoring through the development of a real-time wheeze counting algorithm. Leveraging a novel approach that includes the detailed labeling of one breathing cycle into three types: break, normal, and wheeze, this study not only identifies abnormal sounds within each breath but also captures comprehensive data on their location, duration, and relationships within entire respiratory cycles, including atypical patterns. This innovative strategy is based on a combination of a one-dimensional convolutional neural network (1D-CNN) and a long short-term memory (LSTM) network model, enabling real-time analysis of respiratory sounds. Notably, it stands out for its capacity to handle continuous data, distinguishing it from conventional lung sound classification algorithms. The study utilizes a substantial dataset consisting of 535 respiration cycles from diverse sources, including the Child Sim Lung Sound Simulator, the EMTprep Open-Source Database, Clinical Patient Records, and the ICBHI 2017 Challenge Database. Achieving a classification accuracy of 90%, the exceptional result metrics encompass the identification of each breath cycle and simultaneous detection of the abnormal sound, enabling the real-time wheeze counting of all respirations. This innovative wheeze counter holds the promise of revolutionizing research on predicting lung diseases based on long-term breathing patterns and offers applicability in clinical and non-clinical settings for on-the-go detection and remote intervention of exacerbated respiratory symptoms.
Collapse
Affiliation(s)
- Sunghoon Im
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Taewi Kim
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | | | - Sanghun Kang
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Yeonwook Roh
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Changhwan Kim
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Minho Kim
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Seung Hyun Kim
- Department of Medical Humanities, Korea University College of Medicine, Seoul, Republic of Korea
| | - KyungMin Shim
- Industry-University Cooperation Foundation, Seogyeong University, Seoul, Republic of Korea
| | - Je-sung Koh
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Seungyong Han
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - JaeWang Lee
- Department of Biomedical Laboratory Science, College of Health Science, Eulji University, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Dohyeong Kim
- University of Texas at Dallas, Richardson, TX, United States of America
| | - Daeshik Kang
- Department of Mechanical Engineering, Ajou University, Suwon-si, Gyeonggi-do, Republic of Korea
| | - SungChul Seo
- Department of Nano-Chemical, Biological and Environmental Engineering, Seogyeong University, Seoul, Republic of Korea
| |
Collapse
|
9
|
Pentakota P, Rudraraju G, Sripada NR, Mamidgi B, Gottipulla C, Jalukuru C, Palreddy SD, Bhoge NKR, Firmal P, Yechuri V, Jain M, Peddireddi VS, Bhimarasetty DM, Sreenivas S, Prasad K KL, Joshi N, Vijayan S, Turaga S, Avasarala V. Screening COVID-19 by Swaasa AI platform using cough sounds: a cross-sectional study. Sci Rep 2023; 13:18284. [PMID: 37880351 PMCID: PMC10600180 DOI: 10.1038/s41598-023-45104-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 10/16/2023] [Indexed: 10/27/2023] Open
Abstract
The Advent of Artificial Intelligence (AI) has led to the use of auditory data for detecting various diseases, including COVID-19. SARS-CoV-2 infection has claimed more than six million lives to date and therefore, needs a robust screening technique to control the disease spread. In the present study we created and validated the Swaasa AI platform, which uses the signature cough sound and symptoms presented by patients to screen and prioritize COVID-19 patients. We collected cough data from 234 COVID-19 suspects to validate our Convolutional Neural Network (CNN) architecture and Feedforward Artificial Neural Network (FFANN) (tabular features) based algorithm. The final output from both models was combined to predict the likelihood of having the disease. During the clinical validation phase, our model showed a 75.54% accuracy rate in detecting the likely presence of COVID-19, with 95.45% sensitivity and 73.46% specificity. We conducted pilot testing on 183 presumptive COVID subjects, of which 58 were truly COVID-19 positive, resulting in a Positive Predictive Value of 70.73%. Due to the high cost and technical expertise required for currently available rapid screening methods, there is a need for a cost-effective and remote monitoring tool that can serve as a preliminary screening method for potential COVID-19 subjects. Therefore, Swaasa would be highly beneficial in detecting the disease and could have a significant impact in reducing its spread.
Collapse
Affiliation(s)
| | | | | | | | | | - Charan Jalukuru
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | | | | | - Priyanka Firmal
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | - Venkat Yechuri
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | - Manmohan Jain
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | | | | | - S Sreenivas
- Andhra Medical College, Visakhapatnam, India
| | | | | | - Shibu Vijayan
- Qure.Ai Technologies, Oberoi Commerz II, Mumbai, India
| | | | - Vardhan Avasarala
- Otolaryngology - Head and Neck Surgery, Northeast Ohio Medical University, Rootstown, USA
| |
Collapse
|
10
|
Celik G. CovidCoughNet: A new method based on convolutional neural networks and deep feature extraction using pitch-shifting data augmentation for covid-19 detection from cough, breath, and voice signals. Comput Biol Med 2023; 163:107153. [PMID: 37321101 PMCID: PMC10249348 DOI: 10.1016/j.compbiomed.2023.107153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 05/25/2023] [Accepted: 06/07/2023] [Indexed: 06/17/2023]
Abstract
This study proposes a new deep learning-based method that demonstrates high performance in detecting Covid-19 disease from cough, breath, and voice signals. This impressive method, named CovidCoughNet, consists of a deep feature extraction network (InceptionFireNet) and a prediction network (DeepConvNet). The InceptionFireNet architecture, based on Inception and Fire modules, was designed to extract important feature maps. The DeepConvNet architecture, which is made up of convolutional neural network blocks, was developed to predict the feature vectors obtained from the InceptionFireNet architecture. The COUGHVID dataset containing cough data and the Coswara dataset containing cough, breath, and voice signals were used as the data sets. The pitch-shifting technique was used to data augmentation the signal data, which significantly contributed to improving performance. Additionally, Chroma features (CF), Root mean square energy (RMSE), Spectral centroid (SC), Spectral bandwidth (SB), Spectral rolloff (SR), Zero crossing rate (ZCR), and Mel frequency cepstral coefficients (MFCC) feature extraction techniques were used to extract important features from voice signals. Experimental studies have shown that using the pitch-shifting technique improved performance by around 3% compared to raw signals. When the proposed model was used with the COUGHVID dataset (Healthy, Covid-19, and Symptomatic), a high performance of 99.19% accuracy, 0.99 precision, 0.98 recall, 0.98 F1-Score, 97.77% specificity, and 98.44% AUC was achieved. Similarly, when the voice data in the Coswara dataset was used, higher performance was achieved compared to the cough and breath studies, with 99.63% accuracy, 100% precision, 0.99 recall, 0.99 F1-Score, 99.24% specificity, and 99.24% AUC. Moreover, when compared with current studies in the literature, the proposed model was observed to exhibit highly successful performance. The codes and details of the experimental studies can be accessed from the relevant Github page: (https://github.com/GaffariCelik/CovidCoughNet).
Collapse
Affiliation(s)
- Gaffari Celik
- Agri Ibrahim Cecen University, Department of Computer Technology, Agri, Turkey.
| |
Collapse
|
11
|
Alvarado E, Grágeda N, Luzanto A, Mahu R, Wuth J, Mendoza L, Stern RM, Yoma NB. Automatic Detection of Dyspnea in Real Human-Robot Interaction Scenarios. SENSORS (BASEL, SWITZERLAND) 2023; 23:7590. [PMID: 37688044 PMCID: PMC10490721 DOI: 10.3390/s23177590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 08/20/2023] [Accepted: 08/25/2023] [Indexed: 09/10/2023]
Abstract
A respiratory distress estimation technique for telephony previously proposed by the authors is adapted and evaluated in real static and dynamic HRI scenarios. The system is evaluated with a telephone dataset re-recorded using the robotic platform designed and implemented for this study. In addition, the original telephone training data are modified using an environmental model that incorporates natural robot-generated and external noise sources and reverberant effects using room impulse responses (RIRs). The results indicate that the average accuracy and AUC are just 0.4% less than those obtained with matched training/testing conditions with simulated data. Quite surprisingly, there is not much difference in accuracy and AUC between static and dynamic HRI conditions. Moreover, the beamforming methods delay-and-sum and MVDR lead to average improvement in accuracy and AUC equal to 8% and 2%, respectively, when applied to training and testing data. Regarding the complementarity of time-dependent and time-independent features, the combination of both types of classifiers provides the best joint accuracy and AUC score.
Collapse
Affiliation(s)
- Eduardo Alvarado
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile; (E.A.); (N.G.); (A.L.); (R.M.); (J.W.)
| | - Nicolás Grágeda
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile; (E.A.); (N.G.); (A.L.); (R.M.); (J.W.)
| | - Alejandro Luzanto
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile; (E.A.); (N.G.); (A.L.); (R.M.); (J.W.)
| | - Rodrigo Mahu
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile; (E.A.); (N.G.); (A.L.); (R.M.); (J.W.)
| | - Jorge Wuth
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile; (E.A.); (N.G.); (A.L.); (R.M.); (J.W.)
| | - Laura Mendoza
- Hospital Clínico Universidad de Chile, Santiago 8380420, Chile;
- Clínica Alemana, Santiago 7630000, Chile
| | - Richard M. Stern
- Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, USA;
| | - Néstor Becerra Yoma
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile; (E.A.); (N.G.); (A.L.); (R.M.); (J.W.)
| |
Collapse
|
12
|
Hussain SS, Degang X, Shah PM, Islam SU, Alam M, Khan IA, Awwad FA, Ismail EAA. Classification of Parkinson's Disease in Patch-Based MRI of Substantia Nigra. Diagnostics (Basel) 2023; 13:2827. [PMID: 37685365 PMCID: PMC10486663 DOI: 10.3390/diagnostics13172827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 08/25/2023] [Accepted: 08/26/2023] [Indexed: 09/10/2023] Open
Abstract
Parkinson's disease (PD) is a chronic and progressive neurological disease that mostly shakes and compromises the motor system of the human brain. Patients with PD can face resting tremors, loss of balance, bradykinesia, and rigidity problems. Complex patterns of PD, i.e., with relevance to other neurological diseases and minor changes in brain structure, make the diagnosis of this disease a challenge and cause inaccuracy of about 25% in the diagnostics. The research community utilizes different machine learning techniques for diagnosis using handcrafted features. This paper proposes a computer-aided diagnostic system using a convolutional neural network (CNN) to diagnose PD. CNN is one of the most suitable models to extract and learn the essential features of a problem. The dataset is obtained from Parkinson's Progression Markers Initiative (PPMI), which provides different datasets (benchmarks), such as T2-weighted MRI for PD and other healthy controls (HC). The mid slices are collected from each MRI. Further, these slices are registered for alignment. Since the PD can be found in substantia nigra (i.e., the midbrain), the midbrain region of the registered T2-weighted MRI slice is selected using the freehand region of interest technique with a 33 × 33 sized window. Several experiments have been carried out to ensure the validity of the CNN. The standard measures, such as accuracy, sensitivity, specificity, and area under the curve, are used to evaluate the proposed system. The evaluation results show that CNN provides better accuracy than machine learning techniques, such as naive Bayes, decision tree, support vector machine, and artificial neural network.
Collapse
Affiliation(s)
| | - Xu Degang
- School of Automation, Central South University, Changsha 410010, China;
| | - Pir Masoom Shah
- Department of Computer Science, Bacha Khan University Charsadda, Charsadda 24540, Pakistan; (P.M.S.); (I.A.K.)
- School of Computer Science and Engineering, Central South University, Changsha 410010, China;
| | - Saif Ul Islam
- Department of Computer Science, Institute of Space Technology, Islamabad 44000, Pakistan;
| | - Mahmood Alam
- School of Computer Science and Engineering, Central South University, Changsha 410010, China;
| | - Izaz Ahmad Khan
- Department of Computer Science, Bacha Khan University Charsadda, Charsadda 24540, Pakistan; (P.M.S.); (I.A.K.)
| | - Fuad A. Awwad
- Department of Quantitative Analysis, College of Business Administration, King Saud University, P.O. Box 71115, Riyadh 11587, Saudi Arabia; (F.A.A.); (E.A.A.I.)
| | - Emad A. A. Ismail
- Department of Quantitative Analysis, College of Business Administration, King Saud University, P.O. Box 71115, Riyadh 11587, Saudi Arabia; (F.A.A.); (E.A.A.I.)
| |
Collapse
|
13
|
Hassan MM, Tahir MH, Ameeq M, Jamal F, Mendy JT, Chesneau C. Risk factors identification of COVID-19 patients with chronic obstructive pulmonary disease: A retrospective study in Punjab-Pakistan. Immun Inflamm Dis 2023; 11:e981. [PMID: 37647450 PMCID: PMC10461420 DOI: 10.1002/iid3.981] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 07/30/2023] [Accepted: 07/31/2023] [Indexed: 09/01/2023] Open
Abstract
BACKGROUND Accessibility to the immense collection of studies on noncommunicable diseases related to coronavirus disease of 2019 (COVID-19) and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is an immediate focus of researchers. However, there is a scarcity of information about chronic obstructed pulmonary disease (COPD), which is associated with a high rate of infection in COVID-19 patients. Moreover, by combining the effects of the SARS-CoV-2 on COPD patients, we may be able to overcome formidable obstacles factors, and diagnosis influencers. MATERIALS AND METHODS A retrospective study of 280 patients was conducted at DHQ Hospital Muzaffargarh in Punjab, Pakistan. Negative binomial regression describes the risk of fixed successive variables. The association is described by the Cox proportional hazard model and the model coefficient is determined through log-likelihood observation. Patients with COPD had their survival and mortality plotted on Kaplan-Meier curves. RESULTS The increased risk of death in COPD patients was due to the effects of variables such as cough, lower respiratory tract infection (LRTI), tuberculosis (TB), and body-aches being 1.369, 0.693, 0.170, and 0.217 times higher at (95% confidence interval [CI]: 0.747-1.992), (95% CI: 0.231-1.156), (95% CI: 0.008-0.332), and (95% CI: -0.07 to 0.440) while it decreased 0.396 in normal condition. CONCLUSION We found that the symptoms of COPD (cough, LRTI, TB, and bodyaches) are statistically significant in patients who were most infected by SARS-CoV-2.
Collapse
Affiliation(s)
| | - M. H. Tahir
- Department of StatisticsThe Islamia University of BahawalpurBahawalpurPunjabPakistan
| | - Muhammad Ameeq
- Department of StatisticsThe Islamia University of BahawalpurBahawalpurPunjabPakistan
| | - Farrukh Jamal
- Department of StatisticsThe Islamia University of BahawalpurBahawalpurPunjabPakistan
| | - John T. Mendy
- Department of Mathematics, School of Arts and ScienceUniversity of The GambiaSerrekundaGambia
| | | |
Collapse
|
14
|
Bhattacharya D, Sharma NK, Dutta D, Chetupalli SR, Mote P, Ganapathy S, Chandrakiran C, Nori S, Suhail KK, Gonuguntla S, Alagesan M. Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection. Sci Data 2023; 10:397. [PMID: 37349364 PMCID: PMC10287715 DOI: 10.1038/s41597-023-02266-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 05/25/2023] [Indexed: 06/24/2023] Open
Abstract
This paper presents the Coswara dataset, a dataset containing diverse set of respiratory sounds and rich meta-data, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds contained nine sound categories associated with variants of breathing, cough and speech. The rich metadata contained demographic information associated with age, gender and geographic location, as well as the health information relating to the symptoms, pre-existing respiratory ailments, comorbidity and SARS-CoV-2 test status. Our study is the first of its kind to manually annotate the audio quality of the entire dataset (amounting to 65 hours) through manual listening. The paper summarizes the data collection procedure, demographic, symptoms and audio data information. A COVID-19 classifier based on bi-directional long short-term (BLSTM) architecture, is trained and evaluated on the different population sub-groups contained in the dataset to understand the bias/fairness of the model. This enabled the analysis of the impact of gender, geographic location, date of recording, and language proficiency on the COVID-19 detection performance.
Collapse
Affiliation(s)
| | - Neeraj Kumar Sharma
- Mehta Family School of Data Science and Artificial Intelligence, Indian Institute of Technology Guwahati, Guwahati, India
| | - Debottam Dutta
- Department of Electrical Engineering, Indian Institute of Science, Bangalore, India
| | | | - Pravin Mote
- Department of Electrical Engineering, Indian Institute of Science, Bangalore, India
| | - Sriram Ganapathy
- Department of Electrical Engineering, Indian Institute of Science, Bangalore, India.
| | | | - Sahiti Nori
- Ramaiah Medical College Hospital, Bangalore, India
| | - K K Suhail
- Ramaiah Medical College Hospital, Bangalore, India
| | | | - Murali Alagesan
- PSG Institute of Medical Sciences and Research, Coimbatore, India
| |
Collapse
|
15
|
Awais M, Bhuva A, Bhuva D, Fatima S, Sadiq T. Optimized DEC: An effective cough detection framework using optimal weighted Features-aided deep Ensemble classifier for COVID-19. Biomed Signal Process Control 2023:105026. [PMID: 37361196 PMCID: PMC10183638 DOI: 10.1016/j.bspc.2023.105026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 04/17/2023] [Accepted: 05/09/2023] [Indexed: 06/28/2023]
Abstract
Since the year 2019, the entire world has been facing the most hazardous and contagious disease as Corona Virus Disease 2019 (COVID-19). Based on the symptoms, the virus can be identified and diagnosed. Amongst, cough is the primary syndrome to detect COVID-19. Existing method requires a long processing time. Early screening and detection is a complex task. To surmount the research drawbacks, a novel ensemble-based deep learning model is designed on heuristic development. The prime intention of the designed work is to detect COVID-19 disease using cough audio signals. At the initial stage, the source signals are fetched and undergo for signal decomposition phase by Empirical Mean Curve Decomposition (EMCD). Consequently, the decomposed signal is called "Mel Frequency Cepstral Coefficients (MFCC), spectral features, and statistical features". Further, all three features are fused and provide the optimal weighted features with the optimal weight value with the help of "Modified Cat and Mouse Based Optimizer (MCMBO)". Lastly, the optimal weighted features are fed as input to the Optimized Deep Ensemble Classifier (ODEC) that is fused together with various classifiers such as "Radial Basis Function (RBF), Long-Short Term Memory (LSTM), and Deep Neural Network (DNN)". In order to attain the best detection results, the parameters in ODEC are optimized by the MCMBO algorithm. Throughout the validation, the designed method attains 96% and 92% concerning accuracy and precision. Thus, result analysis elucidates that the proposed work achieves the desired detective value that aids practitioners to early diagnose COVID-19 ailments.
Collapse
Affiliation(s)
- Muhammad Awais
- Department of Creative Technologies, Air University, Islamabad, Pakistan
| | - Abhishek Bhuva
- Department of Computer Science, University of Massachusetts Boston, United States
| | - Dipen Bhuva
- Department of EECS, Cleveland State University, United States
| | - Saman Fatima
- Department of Medical Education, The University of Lahore, Lahore, Pakistan
| | - Touseef Sadiq
- Department of Information and Communication Technology, University of Agder, Norway
| |
Collapse
|
16
|
Hou L, Duan W, Xuan G, Xiao S, Li Y, Li Y, Zhao J. Intelligent Microsystem for Sound Event Recognition in Edge Computing Using End-to-End Mesh Networking. SENSORS (BASEL, SWITZERLAND) 2023; 23:3630. [PMID: 37050691 PMCID: PMC10099044 DOI: 10.3390/s23073630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 03/22/2023] [Accepted: 03/29/2023] [Indexed: 06/19/2023]
Abstract
Wireless acoustic sensor networks (WASNs) and intelligent microsystems are crucial components of the Internet of Things (IoT) ecosystem. In various IoT applications, small, lightweight, and low-power microsystems are essential to enable autonomous edge computing and networked cooperative work. This study presents an innovative intelligent microsystem with wireless networking capabilities, sound sensing, and sound event recognition. The microsystem is designed with optimized sensing, energy supply, processing, and transceiver modules to achieve small size and low power consumption. Additionally, a low-computational sound event recognition algorithm based on a Convolutional Neural Network has been designed and integrated into the microsystem. Multiple microsystems are connected using low-power Bluetooth Mesh wireless networking technology to form a meshed WASN, which is easily accessible, flexible to expand, and straightforward to manage with smartphones. The microsystem is 7.36 cm3 in size and weighs 8 g without housing. The microsystem can accurately recognize sound events in both trained and untrained data tests, achieving an average accuracy of over 92.50% for alarm sounds above 70 dB and water flow sounds above 55 dB. The microsystems can communicate wirelessly with a direct range of 5 m. It can be applied in the field of home IoT and border security.
Collapse
Affiliation(s)
- Lulu Hou
- School of Instrument Science and Opto-Electronics Engineering, Beijing Information Science and Technology University, Beijing 100192, China
| | - Wenrui Duan
- School of Instrument Science and Opto-Electronics Engineering, Beijing Information Science and Technology University, Beijing 100192, China
| | - Guozhe Xuan
- Department of Precision Instrument, Tsinghua University, Beijing 100084, China
- Key Laboratory of Smart Microsystem (Tsinghua University), Ministry of Education, Beijing 100084, China
- State Key Laboratory of Precision Measurement Technology and Instruments, Beijing 100084, China
- Beijing Laboratory of Biomedical Detection Technology and Instrument, Beijing 100084, China
- Beijing Advanced Innovation Center for Integrated Circuits, Beijing 100084, China
| | - Shanpeng Xiao
- China Mobile Research Institute, Beijing 100053, China
- The IoT Intelligent Microsystem Center, Tsinghua University-China Mobile Joint Research Institute, Beijing 100084, China
| | - Yuan Li
- China Mobile Research Institute, Beijing 100053, China
- The IoT Intelligent Microsystem Center, Tsinghua University-China Mobile Joint Research Institute, Beijing 100084, China
| | - Yizheng Li
- China Mobile Research Institute, Beijing 100053, China
- The IoT Intelligent Microsystem Center, Tsinghua University-China Mobile Joint Research Institute, Beijing 100084, China
| | - Jiahao Zhao
- Department of Precision Instrument, Tsinghua University, Beijing 100084, China
- Key Laboratory of Smart Microsystem (Tsinghua University), Ministry of Education, Beijing 100084, China
- State Key Laboratory of Precision Measurement Technology and Instruments, Beijing 100084, China
- Beijing Laboratory of Biomedical Detection Technology and Instrument, Beijing 100084, China
- Beijing Advanced Innovation Center for Integrated Circuits, Beijing 100084, China
| |
Collapse
|
17
|
Paul SG, Saha A, Biswas AA, Zulfiker MS, Arefin MS, Rahman MM, Reza AW. Combating Covid-19 using machine learning and deep learning: Applications, challenges, and future perspectives. ARRAY 2023; 17:100271. [PMID: 36530931 PMCID: PMC9737520 DOI: 10.1016/j.array.2022.100271] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 12/05/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open
Abstract
COVID-19, a worldwide pandemic that has affected many people and thousands of individuals have died due to COVID-19, during the last two years. Due to the benefits of Artificial Intelligence (AI) in X-ray image interpretation, sound analysis, diagnosis, patient monitoring, and CT image identification, it has been further researched in the area of medical science during the period of COVID-19. This study has assessed the performance and investigated different machine learning (ML), deep learning (DL), and combinations of various ML, DL, and AI approaches that have been employed in recent studies with diverse data formats to combat the problems that have arisen due to the COVID-19 pandemic. Finally, this study shows the comparison among the stand-alone ML and DL-based research works regarding the COVID-19 issues with the combinations of ML, DL, and AI-based research works. After in-depth analysis and comparison, this study responds to the proposed research questions and presents the future research directions in this context. This review work will guide different research groups to develop viable applications based on ML, DL, and AI models, and will also guide healthcare institutes, researchers, and governments by showing them how these techniques can ease the process of tackling the COVID-19.
Collapse
Affiliation(s)
- Showmick Guha Paul
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Arpa Saha
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Al Amin Biswas
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh,Corresponding author
| | - Md. Sabab Zulfiker
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Mohammad Shamsul Arefin
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh,Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chittagong, Bangladesh
| | - Md. Mahfujur Rahman
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Ahmed Wasif Reza
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| |
Collapse
|
18
|
Alvarado E, Grágeda N, Luzanto A, Mahu R, Wuth J, Mendoza L, Yoma NB. Dyspnea Severity Assessment Based on Vocalization Behavior with Deep Learning on the Telephone. SENSORS (BASEL, SWITZERLAND) 2023; 23:2441. [PMID: 36904646 PMCID: PMC10007248 DOI: 10.3390/s23052441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 02/08/2023] [Accepted: 02/17/2023] [Indexed: 06/18/2023]
Abstract
In this paper, a system to assess dyspnea with the mMRC scale, on the phone, via deep learning, is proposed. The method is based on modeling the spontaneous behavior of subjects while pronouncing controlled phonetization. These vocalizations were designed, or chosen, to deal with the stationary noise suppression of cellular handsets, to provoke different rates of exhaled air, and to stimulate different levels of fluency. Time-independent and time-dependent engineered features were proposed and selected, and a k-fold scheme with double validation was adopted to select the models with the greatest potential for generalization. Moreover, score fusion methods were also investigated to optimize the complementarity of the controlled phonetizations and features that were engineered and selected. The results reported here were obtained from 104 participants, where 34 corresponded to healthy individuals and 70 were patients with respiratory conditions. The subjects' vocalizations were recorded with a telephone call (i.e., with an IVR server). The system provided an accuracy of 59% (i.e., estimating the correct mMRC), a root mean square error equal to 0.98, false positive rate of 6%, false negative rate of 11%, and an area under the ROC curve equal to 0.97. Finally, a prototype was developed and implemented, with an ASR-based automatic segmentation scheme, to estimate dyspnea on line.
Collapse
Affiliation(s)
- Eduardo Alvarado
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile
| | - Nicolás Grágeda
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile
| | - Alejandro Luzanto
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile
| | - Rodrigo Mahu
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile
| | - Jorge Wuth
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile
| | - Laura Mendoza
- Clinical Hospital, University of Chile, Santiago 8380420, Chile
| | - Néstor Becerra Yoma
- Speech Processing and Transmission Laboratory, Electrical Engineering Department, University of Chile, Santiago 8370451, Chile
| |
Collapse
|
19
|
Khishe M. An automatic COVID-19 diagnosis from chest X-ray images using a deep trigonometric convolutional neural network. THE IMAGING SCIENCE JOURNAL 2023. [DOI: 10.1080/13682199.2023.2178094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
|
20
|
Feng J, Zhang H, Geng M, Chen H, Jia K, Sun Z, Li Z, Cao X, Pogue BW. X-ray Cherenkov-luminescence tomography reconstruction with a three-component deep learning algorithm: Swin transformer, convolutional neural network, and locality module. JOURNAL OF BIOMEDICAL OPTICS 2023; 28:026004. [PMID: 36818584 PMCID: PMC9932523 DOI: 10.1117/1.jbo.28.2.026004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 01/19/2023] [Indexed: 06/18/2023]
Abstract
SIGNIFICANCE X-ray Cherenkov-luminescence tomography (XCLT) produces fast emission data from megavoltage (MV) x-ray scanning, in which the excitation location of molecules within tissue is reconstructed. However standard filtered backprojection (FBP) algorithms for XCLT sinogram reconstruction can suffer from insufficient data due to dose limitations, so there are limits in the reconstruction quality with some artifacts. We report a deep learning algorithm for XCLT with high image quality and improved quantitative accuracy. AIM To directly reconstruct the distribution of emission quantum yield for x-ray Cherenkov-luminescence tomography, we proposed a three-component deep learning algorithm that includes a Swin transformer, convolution neural network, and locality module model. APPROACH A data-to-image model x-ray Cherenkov-luminescence tomography is developed based on a Swin transformer, which is used to extract pixel-level prior information from the sinogram domain. Meanwhile, a convolutional neural network structure is deployed to transform the extracted pixel information from the sinogram domain to the image domain. Finally, a locality module is designed between the encoder and decoder connection structures for delivering features. Its performance was validated with simulation, physical phantom, and in vivo experiments. RESULTS This approach can better deal with the limits to data than conventional FBP methods. The method was validated with numerical and physical phantom experiments, with results showing that it improved the reconstruction performance mean square error ( > 94.1 % ), peak signal-to-noise ratio ( > 41.7 % ), and Pearson correlation ( > 19 % ) compared with the FBP algorithm. The Swin-CNN also achieved a 32.1% improvement in PSNR over the deep learning method AUTOMAP. CONCLUSIONS This study shows that the three-component deep learning algorithm provides an effective reconstruction method for x-ray Cherenkov-luminescence tomography.
Collapse
Affiliation(s)
- Jinchao Feng
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
- Beijing Laboratory of Advanced Information Networks, Beijing, China
| | - Hu Zhang
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
| | - Mengfan Geng
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
| | - Hanliang Chen
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
| | - Kebin Jia
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
- Beijing Laboratory of Advanced Information Networks, Beijing, China
| | - Zhonghua Sun
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
- Beijing Laboratory of Advanced Information Networks, Beijing, China
| | - Zhe Li
- Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing, China
- Beijing Laboratory of Advanced Information Networks, Beijing, China
| | - Xu Cao
- Xidian University, Engineering Research Center of Molecular and Neuro Imaging of the Ministry of Education and School of Life Science and Technology, Xi’an, China
| | - Brian W. Pogue
- University of Wisconsin-Madison, Department of Medical Physics, Madison, Wisconsin, United States
| |
Collapse
|
21
|
Naz Z, Khan MUG, Saba T, Rehman A, Nobanee H, Bahaj SA. An Explainable AI-Enabled Framework for Interpreting Pulmonary Diseases from Chest Radiographs. Cancers (Basel) 2023; 15:cancers15010314. [PMID: 36612309 PMCID: PMC9818469 DOI: 10.3390/cancers15010314] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/22/2022] [Accepted: 12/23/2022] [Indexed: 01/05/2023] Open
Abstract
Explainable Artificial Intelligence is a key component of artificially intelligent systems that aim to explain the classification results. The classification results explanation is essential for automatic disease diagnosis in healthcare. The human respiration system is badly affected by different chest pulmonary diseases. Automatic classification and explanation can be used to detect these lung diseases. In this paper, we introduced a CNN-based transfer learning-based approach for automatically explaining pulmonary diseases, i.e., edema, tuberculosis, nodules, and pneumonia from chest radiographs. Among these pulmonary diseases, pneumonia, which COVID-19 causes, is deadly; therefore, radiographs of COVID-19 are used for the explanation task. We used the ResNet50 neural network and trained the network on extensive training with the COVID-CT dataset and the COVIDNet dataset. The interpretable model LIME is used for the explanation of classification results. Lime highlights the input image's important features for generating the classification result. We evaluated the explanation using radiologists' highlighted images and identified that our model highlights and explains the same regions. We achieved improved classification results with our fine-tuned model with an accuracy of 93% and 97%, respectively. The analysis of our results indicates that this research not only improves the classification results but also provides an explanation of pulmonary diseases with advanced deep-learning methods. This research would assist radiologists with automatic disease detection and explanations, which are used to make clinical decisions and assist in diagnosing and treating pulmonary diseases in the early stage.
Collapse
Affiliation(s)
- Zubaira Naz
- Department of Computer Science, University of Engineering and Technology Lahore, Lahore 54890, Pakistan
| | - Muhammad Usman Ghani Khan
- Department of Computer Science, University of Engineering and Technology Lahore, Lahore 54890, Pakistan
| | - Tanzila Saba
- Artificial Intelligence & Data Analytics Lab, CCIS, Prince Sultan University, Riyadh 11586, Saudi Arabia
| | - Amjad Rehman
- Artificial Intelligence & Data Analytics Lab, CCIS, Prince Sultan University, Riyadh 11586, Saudi Arabia
- Correspondence: (A.R.); (H.N.)
| | - Haitham Nobanee
- College of Business, Abu Dhabi University, Abu Dhabi 59911, United Arab Emirates
- Oxford Center for Islamic Studies, University of Oxford, Oxford OX3 0EE, UK
- Faculty of Humanities & Social Sciences, University of Liverpool, Liverpool L69 7WZ, UK
- Correspondence: (A.R.); (H.N.)
| | - Saeed Ali Bahaj
- MIS Department, College of Business Administration, Prince Sattam bin Abdulaziz University, Alkharj 11942, Saudi Arabia
| |
Collapse
|
22
|
Deep Survival Analysis With Clinical Variables for COVID-19. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2023; 11:223-231. [PMID: 36950264 PMCID: PMC10027076 DOI: 10.1109/jtehm.2023.3256966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 01/08/2023] [Accepted: 03/08/2023] [Indexed: 03/18/2023]
Abstract
OBJECTIVE Millions of people have been affected by coronavirus disease 2019 (COVID-19), which has caused millions of deaths around the world. Artificial intelligence (AI) plays an increasing role in all areas of patient care, including prognostics. This paper proposes a novel predictive model based on one dimensional convolutional neural networks (1D CNN) to use clinical variables in predicting the survival outcome of COVID-19 patients. METHODS AND PROCEDURES We have considered two scenarios for survival analysis, 1) uni-variate analysis using the Log-rank test and Kaplan-Meier estimator and 2) combining all clinical variables ([Formula: see text]=44) for predicting the short-term from long-term survival. We considered the random forest (RF) model as a baseline model, comparing to our proposed 1D CNN in predicting survival groups. RESULTS Our experiments using the univariate analysis show that nine clinical variables are significantly associated with the survival outcome with corrected p < 0.05. Our approach of 1D CNN shows a significant improvement in performance metrics compared to the RF and the state-of-the-art techniques (i.e., 1D CNN) in predicting the survival group of patients with COVID-19. CONCLUSION Our model has been tested using clinical variables, where the performance is found promising. The 1D CNN model could be a useful tool for detecting the risk of mortality and developing treatment plans in a timely manner. CLINICAL IMPACT The findings indicate that using both Heparin and Exnox for treatment is typically the most useful factor in predicting a patient's chances of survival from COVID-19. Moreover, our predictive model shows that the combination of AI and clinical data can be applied to point-of-care services through fast-learning healthcare systems.
Collapse
|
23
|
Cough Audio Analysis for COVID-19 Diagnosis. SN COMPUTER SCIENCE 2023; 4:125. [PMID: 36589771 PMCID: PMC9791965 DOI: 10.1007/s42979-022-01522-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 11/23/2022] [Indexed: 12/27/2022]
Abstract
Humanity has suffered catastrophically due to the COVID-19 pandemic. One of the most reliable diagnoses of COVID-19 is RT-PCR (reverse-transcription polymer chain reaction) testing. This method, however, has its limitations. It is time consuming and requires scalability. This research work carries out a preliminary prognosis of COVID-19, which is scalable and less time consuming. The research carried out a competitive analysis of four machine-learning models namely, Multilayer Perceptron, Convolutional Neural Networks, Recurrent Neural Networks with Long Short-Term Memory, and VGG-19 with Support Vector Machines. Out of these models, Multilayer Perceptron outperformed with higher specificity of 94.5% and accuracy of 96.8%. The results show that Multilayer Perceptron was able to distinguish between positive and negative COVID-19 coughs by a robust feature embedding technique.
Collapse
|
24
|
Silva G, Batista P, Rodrigues PM. COVID-19 activity screening by a smart-data-driven multi-band voice analysis. J Voice 2022:S0892-1997(22)00360-5. [PMID: 36464573 PMCID: PMC9663738 DOI: 10.1016/j.jvoice.2022.11.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 11/06/2022] [Accepted: 11/07/2022] [Indexed: 11/17/2022]
Abstract
COVID-19 is a disease caused by the new coronavirus SARS-COV-2 which can lead to severe respiratory infections. Since its first detection it caused more than six million worldwide deaths. COVID-19 diagnosis non-invasive and low-cost methods with faster and accurate results are still needed for a fast disease control. In this research, 3 different signal analyses have been applied (per broadband, per sub-bands and per broadband & sub-bands) to Cough, Breathing & Speech signals of Coswara dataset to extract non-linear patterns (Energy, Entropies, Correlation Dimension, Detrended Fluctuation Analysis, Lyapunov Exponent & Fractal Dimensions) for feeding a XGBoost classifier to discriminate COVID-19 activity on its different stages. Classification accuracies ranged between 83.33% and 98.46% have been achieved, surpassing the state-of-art methods in some comparisons. It should be empathized the 98.46% of accuracy reached on pair Healthy Controls vs all COVID-19 stages. The results shows that the method may be adequate for COVID-19 diagnosis screening assistance.
Collapse
Affiliation(s)
- Gabriel Silva
- CBQF-Centro de Biotecnologia e Química Fina-Laboratório Associado, Escola Superior de Biotecnologia, Universidade Católica Portuguesa, Rua de Diogo Botelho, 1327, 4169-005 Porto, Portugal
| | - Patrícia Batista
- CBQF-Centro de Biotecnologia e Química Fina-Laboratório Associado, Escola Superior de Biotecnologia, Universidade Católica Portuguesa, Rua de Diogo Botelho, 1327, 4169-005 Porto, Portugal; HNL/CEDH-Human Neurobehavioural Laboratory/Research Centre for Human Development, Universidade Católica Portuguesa, Rua de Diogo Botelho, 1327, 4169-005 Porto, Portugal
| | - Pedro Miguel Rodrigues
- CBQF-Centro de Biotecnologia e Química Fina-Laboratório Associado, Escola Superior de Biotecnologia, Universidade Católica Portuguesa, Rua de Diogo Botelho, 1327, 4169-005 Porto, Portugal.
| |
Collapse
|
25
|
Dar JA, Srivastava KK, Ahmed Lone S. Design and development of hybrid optimization enabled deep learning model for COVID-19 detection with comparative analysis with DCNN, BIAT-GRU, XGBoost. Comput Biol Med 2022; 150:106123. [PMID: 36228465 PMCID: PMC9527202 DOI: 10.1016/j.compbiomed.2022.106123] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 09/03/2022] [Accepted: 09/17/2022] [Indexed: 11/03/2022]
Abstract
The recent investigation has started for evaluating the human respiratory sounds, like voice recorded, cough, and breathing from hospital confirmed Covid-19 tools, which differs from healthy person's sound. The cough-based detection of Covid-19 also considered with non-respiratory and respiratory sounds data related with all declared situations. Covid-19 is respiratory disease, which is usually produced by Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). However, it is more indispensable to detect the positive cases for reducing further spread of virus, and former treatment of affected patients. With constant rise in the COVID-19 cases, there has been a constant rise in the need of efficient and safe ways to detect an infected individual. With the cases multiplying constantly, the current detecting devices like RT-PCR and fast testing kits have become short in supply. An effectual Covid-19 detection model using devised hybrid Honey Badger Optimization-based Deep Neuro Fuzzy Network (HBO-DNFN) is developed in this paper. Here, the audio signal is considered as input for detecting Covid-19. The gaussian filter is applied to input signal for removing the noises and then feature extraction is performed. The substantial features, like spectral roll-off, spectral bandwidth, Mel frequency cepstral coefficients (MFCC), spectral flatness, zero crossing rate, spectral centroid, mean square energy and spectral contract are extracted for further processing. Finally, DNFN is applied for detecting Covid-19 and the deep leaning model is trained by designed hybrid HBO algorithm. Accordingly, the developed Hybrid HBO method is newly designed by incorporating Honey Badger optimization Algorithm (HBA) and Jaya algorithm. The performance of developed Covid-19 detection model is evaluated using three metrics, like testing accuracy, sensitivity and specificity. The developed Hybrid HBO-based DNFN is outpaced than other existing approaches in terms of testing accuracy, sensitivity and specificity of "0.9176, 0.9218 and 0. 9219". All the test results are validated with the k-fold cross validation method in order to make an assessment of the generalizability of these results. When k-fold value is 9, sensitivity of existing techniques and developed JHBO-based DNFN is 0.8982, 0.8816, 0.8938, and 0.9207. The sensitivity of developed approach is improved by means of gaussian filtering model. The specificity of DCNN is 0.9125, BI-AT-GRU is 0.8926, and XGBoost is 0.9014, while developed JHBO-based DNFN is 0.9219 in k-fold value 9.
Collapse
Affiliation(s)
- Jawad Ahmad Dar
- Department of Computer Science and Engineering, Mansarovar Global University, Madhya Pradesh, India.
| | - Kamal Kr Srivastava
- Department of Computer Science and Engineering, Mansarovar Global University, Madhya Pradesh, India.
| | - Sajaad Ahmed Lone
- Department of Computer Science and Engineering, Islamic University of Science and Technology, Kashmir, India.
| |
Collapse
|
26
|
Suo J, Liu Y, Wu C, Chen M, Huang Q, Liu Y, Yao K, Chen Y, Pan Q, Chang X, Leung AYL, Chan H, Zhang G, Yang Z, Daoud W, Li X, Roy VAL, Shen J, Yu X, Wang J, Li WJ. Wide-Bandwidth Nanocomposite-Sensor Integrated Smart Mask for Tracking Multiphase Respiratory Activities. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2022; 9:e2203565. [PMID: 35999427 PMCID: PMC9631096 DOI: 10.1002/advs.202203565] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/03/2022] [Indexed: 06/15/2023]
Abstract
Wearing masks has been a recommended protective measure due to the risks of coronavirus disease 2019 (COVID-19) even in its coming endemic phase. Therefore, deploying a "smart mask" to monitor human physiological signals is highly beneficial for personal and public health. This work presents a smart mask integrating an ultrathin nanocomposite sponge structure-based soundwave sensor (≈400 µm), which allows the high sensitivity in a wide-bandwidth dynamic pressure range, i.e., capable of detecting various respiratory sounds of breathing, speaking, and coughing. Thirty-one subjects test the smart mask in recording their respiratory activities. Machine/deep learning methods, i.e., support vector machine and convolutional neural networks, are used to recognize these activities, which show average macro-recalls of ≈95% in both individual and generalized models. With rich high-frequency (≈4000 Hz) information recorded, the two-/tri-phase coughs can be mapped while speaking words can be identified, demonstrating that the smart mask can be applicable as a daily wearable Internet of Things (IoT) device for respiratory disease identification, voice interaction tool, etc. in the future. This work bridges the technological gap between ultra-lightweight but high-frequency response sensor material fabrication, signal transduction and processing, and machining/deep learning to demonstrate a wearable device for potential applications in continual health monitoring in daily life.
Collapse
Affiliation(s)
- Jiao Suo
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Yifan Liu
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Cong Wu
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
- Hong Kong Centre for Cerebro‐cardiovascular Health Engineering (COCHE)Hong KongChina
| | - Meng Chen
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Qingyun Huang
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Yiming Liu
- Dept. of Biomedical EngineeringCity University of Hong KongHong KongChina
| | - Kuanming Yao
- Dept. of Biomedical EngineeringCity University of Hong KongHong KongChina
| | - Yangbin Chen
- Dept. of Computer ScienceCity University of Hong KongHong KongChina
| | - Qiqi Pan
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Xiaoyu Chang
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | | | - Ho‐yin Chan
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Guanglie Zhang
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Zhengbao Yang
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Walid Daoud
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
| | - Xinyue Li
- School of Data ScienceCity University of Hong KongHong KongChina
| | | | - Jiangang Shen
- School of Chinese MedicineThe University of Hong KongHong KongChina
| | - Xinge Yu
- Dept. of Biomedical EngineeringCity University of Hong KongHong KongChina
- Hong Kong Centre for Cerebro‐cardiovascular Health Engineering (COCHE)Hong KongChina
| | - Jianping Wang
- Dept. of Computer ScienceCity University of Hong KongHong KongChina
| | - Wen Jung Li
- Dept. of Mechanical EngineeringCity University of Hong KongHong KongChina
- Hong Kong Centre for Cerebro‐cardiovascular Health Engineering (COCHE)Hong KongChina
| |
Collapse
|
27
|
Kumar S, Mallik A. COVID-19 Detection from Chest X-rays Using Trained Output Based Transfer Learning Approach. Neural Process Lett 2022; 55:1-24. [PMID: 36339644 PMCID: PMC9616430 DOI: 10.1007/s11063-022-11060-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/16/2022] [Indexed: 10/31/2022]
Abstract
The recent Coronavirus disease (COVID-19), which started in 2019, has spread across the globe and become a global pandemic. The efficient and effective COVID-19 detection using chest X-rays helps in early detection and curtailing the spread of the disease. In this paper, we propose a novel Trained Output-based Transfer Learning (TOTL) approach for COVID-19 detection from chest X-rays. We start by preprocessing the Chest X-rays of the patients with techniques like denoising, contrasting, segmentation. These processed images are then fed to several pre-trained transfer learning models like InceptionV3, InceptionResNetV2, Xception, MobileNet, ResNet50, ResNet50V2, VGG16, and VGG19. We fine-tune these models on the processed chest X-rays. Then we further train the outputs of these models using a deep neural network architecture to achieve enhanced performance and aggregate the capabilities of each of them. The proposed model has been tested on four recent COVID-19 chest X-rays datasets by computing several popular evaluation metrics. The performance of our model has also been compared with various deep transfer learning models and several contemporary COVID-19 detection methods. The obtained results demonstrate the efficiency and efficacy of our proposed model.
Collapse
Affiliation(s)
- Sanjay Kumar
- Department of Computer Science and Engineering, Delhi Technological University, New Delhi, 110042 India
| | - Abhishek Mallik
- Department of Computer Science and Engineering, Delhi Technological University, New Delhi, 110042 India
| |
Collapse
|
28
|
Xu B, Martín D, Khishe M, Boostani R. COVID-19 diagnosis using chest CT scans and deep convolutional neural networks evolved by IP-based sine-cosine algorithm. Med Biol Eng Comput 2022; 60:2931-2949. [PMID: 35962266 PMCID: PMC9374292 DOI: 10.1007/s11517-022-02637-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 06/15/2022] [Indexed: 11/25/2022]
Abstract
The prevalence of the COVID-19 virus and its variants has influenced all aspects of our life, and therefore, the precise diagnosis of this disease is vital. If a polymerase chain reaction test for a subject is negative, but he/she cannot easily breathe, taking a computed tomography (CT) image from his/her lung is urgently recommended. This study aims to optimize a deep convolution neural network (DCNN) structure to increase the COVID-19 diagnosis accuracy in lung CT images. This paper employs the sine-cosine algorithm (SCA) to optimize the structure of DCNN to take raw CT images and determine their status. Three improvements based on regular SCA are proposed to enhance both the accuracy and speed of the results. First, a new encoding approach is proposed based on the internet protocol (IP) address. Then, an enfeebled layer is proposed to generate a variable-length DCNN. The suggested model is examined over the COVID-CT and SARS-CoV-2 datasets. The proposed method is compared to a standard DCNN and seven variable-length models in terms of five known metrics, including sensitivity, accuracy, specificity, F1-score, precision, and receiver operative curve (ROC) and precision-recall curves. The results demonstrate that the proposed DCNN-IPSCA surpasses other benchmarks, achieving final accuracy of (98.32% and 98.01%), the sensitivity of (97.22% and 96.23%), and specificity of (96.77% and 96.44%) on the SARS-CoV-2 and COVID-CT datasets, respectively. Also, the proposed DCNN-IPSCA performs much better than the standard DCNN, with GPU and CPU training times, which are 387.69 and 63.10 times faster, respectively.
Collapse
Affiliation(s)
- Binfeng Xu
- Guangdong Food and Drug Vocational College, Guangzhou, 510520, Guangdong, China.
| | - Diego Martín
- ETSI Telecomunicación, Universidad Politécnica de Madrid, Av. Complutense 30, 28040, Madrid, Spain
| | - Mohammad Khishe
- Department of Electrical Engineering, Imam Khomeini Marine Science University, Nowshahr, Iran.
| | - Reza Boostani
- CSE & IT Department, Electrical and Computer Engineering Faculty, Shiraz University, Shiraz, Iran
| |
Collapse
|
29
|
Alsaaidah B, Al-Hadidi MR, Al-Nsour H, Masadeh R, AlZubi N. Comprehensive Survey of Machine Learning Systems for COVID-19 Detection. J Imaging 2022; 8:267. [PMID: 36286361 PMCID: PMC9604704 DOI: 10.3390/jimaging8100267] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 09/11/2022] [Accepted: 09/20/2022] [Indexed: 01/14/2023] Open
Abstract
The last two years are considered the most crucial and critical period of the COVID-19 pandemic affecting most life aspects worldwide. This virus spreads quickly within a short period, increasing the fatality rate associated with the virus. From a clinical perspective, several diagnosis methods are carried out for early detection to avoid virus propagation. However, the capabilities of these methods are limited and have various associated challenges. Consequently, many studies have been performed for COVID-19 automated detection without involving manual intervention and allowing an accurate and fast decision. As is the case with other diseases and medical issues, Artificial Intelligence (AI) provides the medical community with potential technical solutions that help doctors and radiologists diagnose based on chest images. In this paper, a comprehensive review of the mentioned AI-based detection solution proposals is conducted. More than 200 papers are reviewed and analyzed, and 145 articles have been extensively examined to specify the proposed AI mechanisms with chest medical images. A comprehensive examination of the associated advantages and shortcomings is illustrated and summarized. Several findings are concluded as a result of a deep analysis of all the previous works using machine learning for COVID-19 detection, segmentation, and classification.
Collapse
Affiliation(s)
- Bayan Alsaaidah
- Department of Computer Science, Prince Abdullah bin Ghazi Faculty of Information Technology and Communications, Al-Balqa Applied University, Salt 19117, Jordan
| | - Moh’d Rasoul Al-Hadidi
- Department of Electrical Engineering, Electrical Power Engineering and Computer Engineering, Faculty of Engineering, Al-Balqa Applied University, Salt 19117, Jordan
| | - Heba Al-Nsour
- Department of Computer Science, Prince Abdullah bin Ghazi Faculty of Information Technology and Communications, Al-Balqa Applied University, Salt 19117, Jordan
| | - Raja Masadeh
- Computer Science Department, The World Islamic Sciences and Education University, Amman 11947, Jordan
| | - Nael AlZubi
- Department of Electrical Engineering, Electrical Power Engineering and Computer Engineering, Faculty of Engineering, Al-Balqa Applied University, Salt 19117, Jordan
| |
Collapse
|
30
|
Ghrabli S, Elgendi M, Menon C. Challenges and Opportunities of Deep Learning for Cough-Based COVID-19 Diagnosis: A Scoping Review. Diagnostics (Basel) 2022; 12:2142. [PMID: 36140543 PMCID: PMC9498071 DOI: 10.3390/diagnostics12092142] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 08/26/2022] [Accepted: 08/31/2022] [Indexed: 11/16/2022] Open
Abstract
In the past two years, medical researchers and data scientists worldwide have focused their efforts on containing the pandemic of coronavirus disease 2019 (COVID-19). Deep learning models have been proven to be capable of efficient medical diagnosis and prognosis in cancer, common lung diseases, and COVID-19. On the other hand, artificial neural networks have demonstrated their potential in pattern recognition and classification in various domains, including healthcare. This literature review aims to report the state of research on developing neural network models to diagnose COVID-19 from cough sounds to create a cost-efficient and accessible testing tool in the fight against the pandemic. A total of 35 papers were included in this review following a screening of the 161 outputs of the literature search. We extracted information from articles on data resources, model structures, and evaluation metrics and then explored the scope of experimental studies and methodologies and analyzed their outcomes and limitations. We found that cough is a biomarker, and its associated information can determine an individual's health status. Convolutional neural networks were predominantly used, suggesting they are particularly suitable for feature extraction and classification. The reported accuracy values ranged from 73.1% to 98.5%. Moreover, the dataset sizes ranged from 16 to over 30,000 cough audio samples. Although deep learning is a promising prospect in identifying COVID-19, we identified a gap in the literature on research conducted over large and diversified data sets.
Collapse
Affiliation(s)
- Syrine Ghrabli
- Biomedical and Mobile Health Technology Lab, ETH Zurich, 8008 Zurich, Switzerland
- Department of Physics, ETH Zurich, 8093 Zurich, Switzerland
| | - Mohamed Elgendi
- Biomedical and Mobile Health Technology Lab, ETH Zurich, 8008 Zurich, Switzerland
| | - Carlo Menon
- Biomedical and Mobile Health Technology Lab, ETH Zurich, 8008 Zurich, Switzerland
| |
Collapse
|
31
|
Kranthi Kumar L, Alphonse PJA. COVID-19: respiratory disease diagnosis with regularized deep convolutional neural network using human respiratory sounds. THE EUROPEAN PHYSICAL JOURNAL. SPECIAL TOPICS 2022; 231:3673-3696. [PMID: 35966369 PMCID: PMC9363874 DOI: 10.1140/epjs/s11734-022-00649-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 07/26/2022] [Indexed: 06/15/2023]
Abstract
Human respiratory sound auscultation (HRSA) parameters have been the real choice for detecting human respiratory diseases in the last few years. It is a challenging task to extract the respiratory sound features from the breath, voice, and cough sounds. The existing methods failed to extract the sound features to diagnose respiratory diseases. We proposed and evaluated a new regularized deep convolutional neural network (RDCNN) architecture to accept COVID-19 sound data and essential sound features. The proposed architecture is trained with the COVID-19 sound data sets and gives a better learning curve than any other state-of-the-art model. We examine the performance of RDCNN with Max-Pooling (Model-1) and without Max-Pooling (Model-2) functions. In this work, we observed that RDCNN model performance with three sound feature extraction methods [Soft-Mel frequency channel, Log-Mel frequency spectrum, and Modified Mel-frequency Cepstral Coefficient (MMFCC) spectrum] for COVID-19 sound data sets (KDD-data, ComParE2021-CCS-CSS-Data, and NeurlPs2021-data). To amplify the models' performance, we applied the augmentation technique along with regularization. We have also carried out this work to estimate the mutation of SARS-CoV-2 in the five waves using prognostic models (fractal-based). The proposed model achieves state-of-the-art performance on the COVID-19 sound data set to identify COVID-19 disease symptoms. The model's learnable parameter gradients have vanished in the intermediate layers while optimizing the prediction error which is addressed with our proposed RDCNN model. Our experiments suggested that 3 × 3 kernel size for regularized deep CNN (without max-pooling) shows 2-3% better classification accuracy compared to RDCNN with max-pooling. The experimental results suggest that this new approach may achieve the finest results on respiratory diseases.
Collapse
Affiliation(s)
- Lella Kranthi Kumar
- Department of Computer Applications, NIT Tiruchirappalli, Tiruchirappalli, Tamil Nadu 620015 India
| | - P. J. A. Alphonse
- Department of Computer Applications, NIT Tiruchirappalli, Tiruchirappalli, Tamil Nadu 620015 India
| |
Collapse
|
32
|
Baghdadi NA, Malki A, Abdelaliem SF, Magdy Balaha H, Badawy M, Elhosseini M. An automated diagnosis and classification of COVID-19 from chest CT images using a transfer learning-based convolutional neural network. Comput Biol Med 2022; 144:105383. [PMID: 35290811 PMCID: PMC8906898 DOI: 10.1016/j.compbiomed.2022.105383] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Revised: 02/15/2022] [Accepted: 03/02/2022] [Indexed: 02/06/2023]
Abstract
Researchers have developed more intelligent, highly responsive, and efficient detection methods owing to the COVID-19 demands for more widespread diagnosis. The work done deals with developing an AI-based framework that can help radiologists and other healthcare professionals diagnose COVID-19 cases at a high level of accuracy. However, in the absence of publicly available CT datasets, the development of such AI tools can prove challenging. Therefore, an algorithm for performing automatic and accurate COVID-19 classification using Convolutional Neural Network (CNN), pre-trained model, and Sparrow search algorithm (SSA) on CT lung images was proposed. The pre-trained CNN models used are SeresNext50, SeresNext101, SeNet154, MobileNet, MobileNetV2, MobileNetV3Small, and MobileNetV3Large. In addition, the SSA will be used to optimize the different CNN and transfer learning(TL) hyperparameters to find the best configuration for the pre-trained model used and enhance its performance. Two datasets are used in the experiments. There are two classes in the first dataset, while three in the second. The authors combined two publicly available COVID-19 datasets as the first dataset, namely the COVID-19 Lung CT Scans and COVID-19 CT Scan Dataset. In total, 14,486 images were included in this study. The authors analyzed the Large COVID-19 CT scan slice dataset in the second dataset, which utilized 17,104 images. Compared to other pre-trained models on both classes datasets, MobileNetV3Large pre-trained is the best model. As far as the three-classes dataset is concerned, a model trained on SeNet154 is the best available. Results show that, when compared to other CNN models like LeNet-5 CNN, COVID faster R–CNN, Light CNN, Fuzzy + CNN, Dynamic CNN, CNN and Optimized CNN, the proposed Framework achieves the best accuracy of 99.74% (two classes) and 98% (three classes).
Collapse
Affiliation(s)
- Nadiah A Baghdadi
- Princess Nourah bint Abdulrahman University, College of Nursing, Riyadh, 11671, Riyadh, P.O. BOX 84428, Saudi Arabia.
| | - Amer Malki
- Taibah University, College of Computer Science and Engineering, Yanbu, 46421, Saudi Arabia
| | - Sally F Abdelaliem
- Princess Nourah bint Abdulrahman University, College of Nursing, Riyadh, 11671, Riyadh, P.O. BOX 84428, Saudi Arabia
| | - Hossam Magdy Balaha
- Mansoura University, Faculty of Engineering, Computers and Control Systems Engineering Department, Mansoura, 46421, Egypt
| | - Mahmoud Badawy
- Mansoura University, Faculty of Engineering, Computers and Control Systems Engineering Department, Mansoura, 46421, Egypt.
| | - Mostafa Elhosseini
- Taibah University, College of Computer Science and Engineering, Yanbu, 46421, Saudi Arabia; Mansoura University, Faculty of Engineering, Computers and Control Systems Engineering Department, Mansoura, 46421, Egypt
| |
Collapse
|
33
|
Hong W, Lu X, Wu L, Pu X. Analysis of factors influencing public attention to masks during the COVID-19 epidemic-Data from Sina Weibo. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:6469-6488. [PMID: 35730267 DOI: 10.3934/mbe.2022304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
As we all know, vaccination still does not protect people from novel coronavirus infections, and wearing masks remains essential. Research on mask attention is helpful to understand the public's cognition and willingness to wear masks, but there are few studies on mask attention in the existing literature. The health belief model used to study disease prevention behaviors is rarely applied to the research on mask attention, and the research on health belief models basically entails the use of a questionnaire survey. This study was purposed to establish a health belief model affecting mask attention to explore the relationship between perceived susceptibility, perceived severity, self-efficacy, perceived impairment, action cues and mask attention. On the basis of the establishment of the hypothesis model, the Baidu index of epidemic and mask attention, the number of likes and comments on Weibo, and the historical weather temperature data were retrieved by using software. Keyword extraction and manual screening were carried out for Weibo comments, and then the independent variables and dependent variables were coded. Finally, through binomial logistic regression analysis, it was concluded that perceived susceptibility, perceived severity and action cues have significant influences on mask attention, and that the accuracy rate for predicting low attention is 93.4%, and the global accuracy is 84.3%. These conclusions can also help suppliers make production decisions.
Collapse
Affiliation(s)
- Wei Hong
- Food Safety Research Base of Jiangsu Province, Jiangnan University, Wuxi 214122, China
- School of Business, Jiangnan University, Wuxi 214122, China
- Institute for Food Safety Risk Management, Jiangnan University, Wuxi 214122, China
| | - Xinhang Lu
- School of Business, Jiangnan University, Wuxi 214122, China
| | - Linhai Wu
- Food Safety Research Base of Jiangsu Province, Jiangnan University, Wuxi 214122, China
- School of Business, Jiangnan University, Wuxi 214122, China
- Institute for Food Safety Risk Management, Jiangnan University, Wuxi 214122, China
| | - Xujin Pu
- Food Safety Research Base of Jiangsu Province, Jiangnan University, Wuxi 214122, China
- School of Business, Jiangnan University, Wuxi 214122, China
- Institute for Food Safety Risk Management, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
34
|
Santosh KC, Rasmussen N, Mamun M, Aryal S. A systematic review on cough sound analysis for Covid-19 diagnosis and screening: is my cough sound COVID-19? PeerJ Comput Sci 2022; 8:e958. [PMID: 35634112 PMCID: PMC9138020 DOI: 10.7717/peerj-cs.958] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 04/04/2022] [Indexed: 06/15/2023]
Abstract
For COVID-19, the need for robust, inexpensive, and accessible screening becomes critical. Even though symptoms present differently, cough is still taken as one of the primary symptoms in severe and non-severe infections alike. For mass screening in resource-constrained regions, artificial intelligence (AI)-guided tools have progressively contributed to detect/screen COVID-19 infections using cough sounds. Therefore, in this article, we review state-of-the-art works in both years 2020 and 2021 by considering AI-guided tools to analyze cough sound for COVID-19 screening primarily based on machine learning algorithms. In our study, we used PubMed central repository and Web of Science with key words: (Cough OR Cough Sounds OR Speech) AND (Machine learning OR Deep learning OR Artificial intelligence) AND (COVID-19 OR Coronavirus). For better meta-analysis, we screened for appropriate dataset (size and source), algorithmic factors (both shallow learning and deep learning models) and corresponding performance scores. Further, in order not to miss up-to-date experimental research-based articles, we also included articles outside of PubMed and Web of Science, but pre-print articles were strictly avoided as they are not peer-reviewed.
Collapse
Affiliation(s)
- KC Santosh
- 2AI: Applied Artificial Intelligence Lab, Computer Science, University of South Dakota, Vermiillion, South Dakota, United States
| | - Nicholas Rasmussen
- 2AI: Applied Artificial Intelligence Lab, Computer Science, University of South Dakota, Vermiillion, South Dakota, United States
| | - Muntasir Mamun
- 2AI: Applied Artificial Intelligence Lab, Computer Science, University of South Dakota, Vermiillion, South Dakota, United States
| | - Sunil Aryal
- School of Information Technology, Deakin University, Victoria, Australia
| |
Collapse
|
35
|
Hamdi S, Oussalah M, Moussaoui A, Saidi M. Attention-based hybrid CNN-LSTM and spectral data augmentation for COVID-19 diagnosis from cough sound. J Intell Inf Syst 2022; 59:367-389. [PMID: 35498369 PMCID: PMC9034264 DOI: 10.1007/s10844-022-00707-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 03/23/2022] [Accepted: 03/27/2022] [Indexed: 12/11/2022]
Abstract
COVID-19 pandemic has fueled the interest in artificial intelligence tools for quick diagnosis to limit virus spreading. Over 60% of people who are infected complain of a dry cough. Cough and other respiratory sounds were used to build diagnosis models in much recent research. We propose in this work, an augmentation pipeline which is applied on the pre-filtered data and uses i) pitch-shifting technique to augment the raw signal and, ii) spectral data augmentation technique SpecAugment to augment the computed mel-spectrograms. A deep learning based architecture that hybridizes convolution neural networks and long-short term memory with an attention mechanism is proposed for building the classification model. The feasibility of the proposed is demonstrated through a set of testing scenarios using the large-scale COUGHVID cough dataset and through a comparison with three baselines models. We have shown that our classification model achieved 91.13% of testing accuracy, 90.93% of sensitivity and an area under the curve of receiver operating characteristic of 91.13%.
Collapse
Affiliation(s)
- Skander Hamdi
- Department of Computer Science, University of Ferhat Abbes Setif, 19000 Setif, Algeria
| | - Mourad Oussalah
- Department of Computer Science and Engineering, University of Oulu, 90570 Oulu, Finland
| | - Abdelouahab Moussaoui
- Department of Computer Science, University of Ferhat Abbes Setif, 19000 Setif, Algeria
| | - Mohamed Saidi
- Department of Computer Science, University of Ferhat Abbes Setif, 19000 Setif, Algeria
| |
Collapse
|
36
|
Nguyen T, Pernkopf F. Lung Sound Classification Using Co-tuning and Stochastic Normalization. IEEE Trans Biomed Eng 2022; 69:2872-2882. [PMID: 35254969 DOI: 10.1109/tbme.2022.3156293] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Computational methods for lung sound analysis are beneficial for computer-aided diagnosis support, storage and monitoring in critical care. In this paper, we use pre-trained ResNet models as backbone architectures for classification of adventitious lung sounds and respiratory diseases. The learned representation of the pre-trained model is transferred by using vanilla fine-tuning, co-tuning, stochastic normalization and the combination of the co-tuning and stochastic normalization techniques. Furthermore, data augmentation in both time domain and time-frequency domain is used to account for the class imbalance of the ICBHI and our multi-channel lung sound dataset. Additionally, we introduce spectrum correction to account for the variations of the recording device properties on the ICBHI dataset. Empirically, our proposed systems mostly outperform all state-of-the-art lung sound classification systems for the adventitious lung sounds and respiratory diseases of both datasets.
Collapse
|
37
|
Husain M, Simpkin A, Gibbons C, Talkar T, Low D, Bonato P, Ghosh SS, Quatieri T, O'Keeffe DT. Artificial Intelligence for Detecting COVID-19 With the Aid of Human Cough, Breathing and Speech Signals: Scoping Review. IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY 2022; 3:235-241. [PMID: 36819937 PMCID: PMC9933914 DOI: 10.1109/ojemb.2022.3143688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 11/30/2021] [Accepted: 12/26/2021] [Indexed: 09/01/2023] Open
Abstract
Goal: Official tests for COVID-19 are time consuming, costly, can produce high false negatives, use up vital chemicals and may violate social distancing laws. Therefore, a fast and reliable additional solution using recordings of cough, breathing and speech data for preliminary screening may help alleviate these issues. Objective: This scoping review explores how Artificial Intelligence (AI) technology aims to detect COVID-19 disease by using cough, breathing and speech recordings, as reported in the literature. Here, we describe and summarize attributes of the identified AI techniques and datasets used for their implementation. Methods: A scoping review was conducted following the guidelines of PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews). Electronic databases (Google Scholar, Science Direct, and IEEE Xplore) were searched between 1st April 2020 and 15th August 2021. Terms were selected based on the target intervention (i.e., AI), the target disease (i.e., COVID-19) and acoustic correlates of the disease (i.e., speech, breathing and cough). A narrative approach was used to summarize the extracted data. Results: 24 studies and 8 Apps out of the 86 retrieved studies met the inclusion criteria. Half of the publications and Apps were from the USA. The most prominent AI architecture used was a convolutional neural network, followed by a recurrent neural network. AI models were mainly trained, tested and run-on websites and personal computers, rather than on phone apps. More than half of the included studies reported area-under-the-curve performance of greater than 0.90 on symptomatic and negative datasets while one study achieved 100% sensitivity in predicting asymptomatic COVID-19 from cough-, breathing- or speech-based acoustic features. Conclusions: The included studies show that AI has the potential to help detect COVID-19 using cough, breathing and speech samples. The proposed methods (with some time and appropriate clinical testing) could prove to be an effective method in detecting various diseases related to respiratory and neurophysiological changes in the human body.
Collapse
Affiliation(s)
- Mouzzam Husain
- Health Innovation Via Engineering (HIVE) Lab, Curam, Lero, School of MedicineLambe Institute for Translational ResearchNational University of Ireland GalwayH91 TK33GalwayIreland
| | - Andrew Simpkin
- School of Mathematics, Statistics and Applied MathematicsNational University of IrelandH91 TK33GalwayIreland
| | - Claire Gibbons
- Health Innovation Via Engineering (HIVE) Lab, Curam, Lero, School of MedicineLambe Institute for Translational ResearchNational University of Ireland GalwayH91 TK33GalwayIreland
| | - Tanya Talkar
- MIT Lincoln LaboratoryLexingtonMA02421USA
- Program in Speech and Hearing Bioscience and TechnologyHarvard Medical SchoolBostonMA02115USA
| | - Daniel Low
- Program in Speech and Hearing Bioscience and TechnologyHarvard Medical SchoolBostonMA02115USA
- MIT McGovern Institute for Brain Research, CambridgeMA02139USA
| | - Paolo Bonato
- Department of Physical Medicine and RehabilitationHarvard Medical School, Spaulding Rehabilitation HospitalBostonMAUSA
| | - Satrajit S. Ghosh
- Program in Speech and Hearing Bioscience and TechnologyHarvard Medical SchoolBostonMA02115USA
- MIT McGovern Institute for Brain Research, CambridgeMA02139USA
| | - Thomas Quatieri
- MIT Lincoln LaboratoryLexingtonMA02421USA
- Program in Speech and Hearing Bioscience and TechnologyHarvard Medical SchoolBostonMA02115USA
| | - Derek T. O'Keeffe
- Health Innovation Via Engineering (HIVE) Lab, Curam, Lero, School of MedicineLambe Institute for Translational ResearchNational University of Ireland GalwayH91 TK33GalwayIreland
- University Hospital Galway, Saolta, Health Services ExecutiveIreland
| |
Collapse
|
38
|
Ponomarchuk A, Burenko I, Malkin E, Nazarov I, Kokh V, Avetisian M, Zhukov L. Project Achoo: A Practical Model and Application for COVID-19 Detection From Recordings of Breath, Voice, and Cough. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2022; 16:175-187. [PMID: 35582703 PMCID: PMC9088778 DOI: 10.1109/jstsp.2022.3142514] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 11/23/2021] [Accepted: 01/06/2022] [Indexed: 05/27/2023]
Abstract
The COVID-19 pandemic created significant interest and demand for infection detection and monitoring solutions. In this paper, we propose a machine learning method to quickly detect COVID-19 using audio recordings made on consumer devices. The approach combines signal processing and noise removal methods with an ensemble of fine-tuned deep learning networks and enables COVID detection on coughs. We have also developed and deployed a mobile application that uses a symptoms checker together with voice, breath, and cough signals to detect COVID-19 infection. The application showed robust performance on both openly sourced datasets and the noisy data collected during beta testing by the end users.
Collapse
|
39
|
Kranthi Kumar L, Alphonse P. COVID-19 disease diagnosis with light-weight CNN using modified MFCC and enhanced GFCC from human respiratory sounds. THE EUROPEAN PHYSICAL JOURNAL. SPECIAL TOPICS 2022; 231:3329-3346. [PMID: 35096278 PMCID: PMC8785156 DOI: 10.1140/epjs/s11734-022-00432-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 12/18/2021] [Indexed: 06/02/2023]
Abstract
In the last 2 years, medical researchers and clinical scientists have paid close attention to the problem of respiratory sound classification to classify COVID-19 disease symptoms. In the physical world, very few AI-based (Artificial Intelligence) techniques are often used to detect COVID-19/SARS-CoV-2 respiratory disease symptoms from the human respiratory system-generated acoustic sounds such as acoustic voice sound, breathing (inhale and exhale) sounds, and cough sound. We propose a light-weight Convolutional Neural Network (CNN) with Modified-Mel-frequency Cepstral Coefficient (M-MFCC) using different depths and kernel sizes to classify COVID-19 and other respiratory sound disease symptoms such as Asthma, Pertussis, and Bronchitis. The proposed network outperforms conventional feature extraction models and existing Deep Learning (DL) models for COVID-19/SARS-CoV-2 classification accuracy in the range of 4-10%. The model's performance is compared with the COVID-19 crowdsourced benchmark dataset and gives a competitive performance. We applied different receptive fields and depths in the proposed model to get different contextual information that should aid in classification. And our experiments suggested 1 × 12 receptive fields and a depth of 5-Layer for the light-weight CNN to extract and identify the features from respiratory sound data. The model is also trained and tested with different modalities of data to showcase its effectiveness in classification.
Collapse
Affiliation(s)
- Lella Kranthi Kumar
- Health Analytics Research Labs, Department of Computer Applications, NIT Tiruchirappalli, Tiruchirappalli, Tamil Nadu 620015 India
| | - P.J.A. Alphonse
- Health Analytics Research Labs, Department of Computer Applications, NIT Tiruchirappalli, Tiruchirappalli, Tamil Nadu 620015 India
| |
Collapse
|
40
|
Malla S, Alphonse PJA. Fake or real news about COVID-19? Pretrained transformer model to detect potential misleading news. THE EUROPEAN PHYSICAL JOURNAL. SPECIAL TOPICS 2022; 231:3347-3356. [PMID: 35039760 PMCID: PMC8756170 DOI: 10.1140/epjs/s11734-022-00436-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/18/2021] [Indexed: 06/14/2023]
Abstract
The World Health Organization declared the novel coronavirus disease 2019 a pandemic on March 11, 2020. Along with the coronavirus pandemic, a new crisis has emerged, characterized by widespread fear and panic caused by a lack of information or, in some cases, outright fake messages. In these circumstances, Twitter is one of the most eminent and trusted social media platforms. Fake tweets, on the other hand, are challenging to detect and differentiate. The primary goal of this paper is to educate society about the importance of accurate information and prevent the spread of fake information. This paper has investigated COVID-19 fake data from various social media platforms such as Twitter, Facebook, and Instagram. The objective of this paper is to categorize given tweets as either fake or real news. The authors have tested various deep learning models on the COVID-19 fake dataset. Finally, the CT-BERT and RoBERTa deep learning models outperformed other deep learning models like BERT, BERTweet, AlBERT, and DistlBERT. The proposed ensemble deep learning architecture outperformed CT-BERT and RoBERTa on the COVID-19 fake news dataset using the multiplicative fusion technique. The proposed model's performance in this technique was determined by the multiplicative product of the final predictive values of CT-BERT and RoBERTa. This technique overcomes the disadvantage of these CT-BERT and RoBERTa models' incorrect predictive nature. The proposed architecture outperforms both well-known ML and DL models, with 98.88% accuracy and a 98.93% F1-score.
Collapse
Affiliation(s)
- SreeJagadeesh Malla
- Department of Computer Applications, National Institute of Technology, Thuvakudi, Tiruchirappalli, Tamil Nadu 620015 India
| | - P. J. A. Alphonse
- Department of Computer Applications, National Institute of Technology, Thuvakudi, Tiruchirappalli, Tamil Nadu 620015 India
| |
Collapse
|
41
|
Chang Y, Jing X, Ren Z, Schuller BW. CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds. Front Digit Health 2022; 3:799067. [PMID: 35047869 PMCID: PMC8761863 DOI: 10.3389/fdgth.2021.799067] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 11/29/2021] [Indexed: 11/13/2022] Open
Abstract
Since the COronaVIrus Disease 2019 (COVID-19) outbreak, developing a digital diagnostic tool to detect COVID-19 from respiratory sounds with computer audition has become an essential topic due to its advantages of being swift, low-cost, and eco-friendly. However, prior studies mainly focused on small-scale COVID-19 datasets. To build a robust model, the large-scale multi-sound FluSense dataset is utilised to help detect COVID-19 from cough sounds in this study. Due to the gap between FluSense and the COVID-19-related datasets consisting of cough only, the transfer learning framework (namely CovNet) is proposed and applied rather than simply augmenting the training data with FluSense. The CovNet contains (i) a parameter transferring strategy and (ii) an embedding incorporation strategy. Specifically, to validate the CovNet's effectiveness, it is used to transfer knowledge from FluSense to COUGHVID, a large-scale cough sound database of COVID-19 negative and COVID-19 positive individuals. The trained model on FluSense and COUGHVID is further applied under the CovNet to another two small-scale cough datasets for COVID-19 detection, the COVID-19 cough sub-challenge (CCS) database in the INTERSPEECH Computational Paralinguistics challengE (ComParE) challenge and the DiCOVA Track-1 database. By training four simple convolutional neural networks (CNNs) in the transfer learning framework, our approach achieves an absolute improvement of 3.57% over the baseline of DiCOVA Track-1 validation of the area under the receiver operating characteristic curve (ROC AUC) and an absolute improvement of 1.73% over the baseline of ComParE CCS test unweighted average recall (UAR).
Collapse
Affiliation(s)
- Yi Chang
- Group on Language, Audio, and Music, Imperial College London, London, United Kingdom
| | - Xin Jing
- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany
| | - Zhao Ren
- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany
- L3S Research Center, Hannover, Germany
| | - Björn W. Schuller
- Group on Language, Audio, and Music, Imperial College London, London, United Kingdom
- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany
| |
Collapse
|
42
|
Rostami M, Oussalah M. A novel explainable COVID-19 diagnosis method by integration of feature selection with random forest. INFORMATICS IN MEDICINE UNLOCKED 2022; 30:100941. [PMID: 35399333 PMCID: PMC8985417 DOI: 10.1016/j.imu.2022.100941] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 04/01/2022] [Accepted: 04/01/2022] [Indexed: 12/12/2022] Open
Abstract
Several Artificial Intelligence-based models have been developed for COVID-19 disease diagnosis. In spite of the promise of artificial intelligence, there are very few models which bridge the gap between traditional human-centered diagnosis and the potential future of machine-centered disease diagnosis. Under the concept of human-computer interaction design, this study proposes a new explainable artificial intelligence method that exploits graph analysis for feature visualization and optimization for the purpose of COVID-19 diagnosis from blood test samples. In this developed model, an explainable decision forest classifier is employed to COVID-19 classification based on routinely available patient blood test data. The approach enables the clinician to use the decision tree and feature visualization to guide the explainability and interpretability of the prediction model. By utilizing this novel feature selection phase, the proposed diagnosis model will not only improve diagnosis accuracy but decrease the execution time as well.
Collapse
Affiliation(s)
- Mehrdad Rostami
- Centre for Machine Vision and Signal Processing, Faculty of Information Technology, University of Oulu, Oulu, Finland
| | - Mourad Oussalah
- Centre for Machine Vision and Signal Processing, Faculty of Information Technology, University of Oulu, Oulu, Finland
- Research Unit of Medical Imaging, Physics, and Technology, Faculty of Medicine, University of Oulu, Finland
| |
Collapse
|
43
|
Mahmood AF, Mahmood SW. Cough/X-ray/CT (CXC) website for testing COVID-19 and auto-informing results. THE REVIEW OF SCIENTIFIC INSTRUMENTS 2022; 93:013705. [PMID: 35104981 DOI: 10.1063/5.0076314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Accepted: 12/30/2021] [Indexed: 06/14/2023]
Abstract
Despite the development of vaccines and the emergence of various treatments for COVID-19, the number of confirmed cases of the coronavirus disease (COVID-19) is increasing worldwide, and it is unlikely that the disease will ever disappear completely. Having a non-contact remote testing system can improve the workload of health-care centers and contribute to reducing the infection by recommending early self-isolation for those who suffer from a cough. In the proposed system, patients can upload an audio cough recording via mobile phones through the suggested Cough/X-ray/CT website and then receive the diagnosis within seconds on the same phone. Moreover, in the case of infection, the health center and the community are informed in addition to automatically calling the mobile phones of the injured cases. The higher proposed accuracy with deep cough training was achieved on the ResNet152v2 model after converting the cough signal into an image using the Mel-spectrogram, where the accuracy was 99.95%, the sensitivity was 100%, and the specificity was 99%.
Collapse
Affiliation(s)
| | - Saja Waleed Mahmood
- Computer Engineering, College of Engineering, University of Mosul, Mosul, Iraq
| |
Collapse
|
44
|
Boosting COVID-19 Image Classification Using MobileNetV3 and Aquila Optimizer Algorithm. ENTROPY 2021; 23:e23111383. [PMID: 34828081 PMCID: PMC8624090 DOI: 10.3390/e23111383] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 10/12/2021] [Accepted: 10/13/2021] [Indexed: 12/23/2022]
Abstract
Currently, the world is still facing a COVID-19 (coronavirus disease 2019) classified as a highly infectious disease due to its rapid spreading. The shortage of X-ray machines may lead to critical situations and delay the diagnosis results, increasing the number of deaths. Therefore, the exploitation of deep learning (DL) and optimization algorithms can be advantageous in early diagnosis and COVID-19 detection. In this paper, we propose a framework for COVID-19 images classification using hybridization of DL and swarm-based algorithms. The MobileNetV3 is used as a backbone feature extraction to learn and extract relevant image representations as a DL model. As a swarm-based algorithm, the Aquila Optimizer (Aqu) is used as a feature selector to reduce the dimensionality of the image representations and improve the classification accuracy using only the most essential selected features. To validate the proposed framework, two datasets with X-ray and CT COVID-19 images are used. The obtained results from the experiments show a good performance of the proposed framework in terms of classification accuracy and dimensionality reduction during the feature extraction and selection phases. The Aqu feature selection algorithm achieves accuracy better than other methods in terms of performance metrics.
Collapse
|