1
|
Yao C, Cheng D, Yang W, Guo Y, Zhou T. Characterization and clinical outcomes of outpatients with subacute or chronic post COVID-19 cough: a real-world study. PeerJ 2024; 12:e18705. [PMID: 39677958 PMCID: PMC11645979 DOI: 10.7717/peerj.18705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Accepted: 11/22/2024] [Indexed: 12/17/2024] Open
Abstract
Background Limited research exists on the features and management of post-COVID cough. This real-world study investigates outpatients with subacute or chronic post-COVID cough, aiming to delineate characteristics and regimen responses within the population. Method We enrolled eligible patients from our outpatient unit between August 2023 and February 2024. Comprehensive clinical data, prescriptions, and patient-reported cough severity were collected during the primary visit and subsequent follow-ups. Result A total of 141 patients, aged: 42 ± 14 years old, were included, with 70% being female. The median cough duration was 8 weeks (interquartile range 4-12 weeks). Sixty percent presented with a dry cough, while the rest had coughs with phlegm. Over half reported abnormal laryngeal sensations (54%). Twenty-one percent coughed during the day, while 32% coughed constantly, and 48% experienced nocturnal episodes. Compound methoxyphenamine capsules were the most prescribed, but our study found ICS/LABA to be the most effective, followed by compound methoxyphenamine capsules, montelukast, and Chinese patent drugs. Conclusion Females exhibit a higher prevalence of post-COVID cough, and our study recommends ICS/LABA as the preferred treatment. These findings warrant validation through larger, prospectively designed studies.
Collapse
Affiliation(s)
- Chun Yao
- Department of Respiratory and Critical Care Medicine, the Second Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
| | - Dongliang Cheng
- Department of Respiratory and Critical Care Medicine, the Second Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
| | - Wenhong Yang
- Department of Respiratory and Critical Care Medicine, the Second Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
| | - Yun Guo
- Department of Respiratory and Critical Care Medicine, the Second Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
| | - Tong Zhou
- Department of Respiratory and Critical Care Medicine, the Second Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
| |
Collapse
|
2
|
Hossein Motlagh N, Zuniga A, Thi Nguyen N, Flores H, Wang J, Tarkoma S, Prosperi M, Helal S, Nurmi P. Population Digital Health: Continuous Health Monitoring and Profiling at Scale. Online J Public Health Inform 2024; 16:e60261. [PMID: 39565687 PMCID: PMC11601140 DOI: 10.2196/60261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 07/24/2024] [Accepted: 09/10/2024] [Indexed: 09/22/2024] Open
Abstract
Unlabelled This paper introduces population digital health (PDH)-the use of digital health information sourced from health internet of things (IoT) and wearable devices for population health modeling-as an emerging research domain that offers an integrated approach for continuous monitoring and profiling of diseases and health conditions at multiple spatial resolutions. PDH combines health data sourced from health IoT devices, machine learning, and ubiquitous computing or networking infrastructure to increase the scale, coverage, equity, and cost-effectiveness of population health. This contrasts with the traditional population health approach, which relies on data from structured clinical records (eg, electronic health records) or health surveys. We present the overall PDH approach and highlight its key research challenges, provide solutions to key research challenges, and demonstrate the potential of PDH through three case studies that address (1) data inadequacy, (2) inaccuracy of the health IoT devices' sensor measurements, and (3) the spatiotemporal sparsity in the available digital health information. Finally, we discuss the conditions, prerequisites, and barriers for adopting PDH drawing on from real-world examples from different geographic regions.
Collapse
Affiliation(s)
- Naser Hossein Motlagh
- Department of Computer Science, University of Helsinki, PL 64 (Gustaf Hällströmin katu 2), Helsinki, 00014, Finland, 358 451707064
| | - Agustin Zuniga
- Department of Computer Science, University of Helsinki, PL 64 (Gustaf Hällströmin katu 2), Helsinki, 00014, Finland, 358 451707064
| | - Ngoc Thi Nguyen
- Department of Computer Science, University of Helsinki, PL 64 (Gustaf Hällströmin katu 2), Helsinki, 00014, Finland, 358 451707064
| | - Huber Flores
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Jiangtao Wang
- School of Computing, Engineering and Digital Technologies, Teessidde University, Middlesbrough, UK
| | - Sasu Tarkoma
- Department of Computer Science, University of Helsinki, PL 64 (Gustaf Hällströmin katu 2), Helsinki, 00014, Finland, 358 451707064
| | - Mattia Prosperi
- Department of Epidemiology, University of Florida, Gainesville, FL, United States
| | - Sumi Helal
- Department of Computer Science and Engineering, University of Bologna, Bologna, Italy
| | - Petteri Nurmi
- Department of Computer Science, University of Helsinki, PL 64 (Gustaf Hällströmin katu 2), Helsinki, 00014, Finland, 358 451707064
| |
Collapse
|
3
|
Ren Z, Chang Y, Bartl-Pokorny KD, Pokorny FB, Schuller BW. The Acoustic Dissection of Cough: Diving Into Machine Listening-based COVID-19 Analysis and Detection. J Voice 2024; 38:1264-1277. [PMID: 35835648 PMCID: PMC9197794 DOI: 10.1016/j.jvoice.2022.06.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/25/2022] [Accepted: 06/09/2022] [Indexed: 12/05/2022]
Abstract
OBJECTIVES The coronavirus disease 2019 (COVID-19) has caused a crisis worldwide. Amounts of efforts have been made to prevent and control COVID-19's transmission, from early screenings to vaccinations and treatments. Recently, due to the spring up of many automatic disease recognition applications based on machine listening techniques, it would be fast and cheap to detect COVID-19 from recordings of cough, a key symptom of COVID-19. To date, knowledge of the acoustic characteristics of COVID-19 cough sounds is limited but would be essential for structuring effective and robust machine learning models. The present study aims to explore acoustic features for distinguishing COVID-19 positive individuals from COVID-19 negative ones based on their cough sounds. METHODS By applying conventional inferential statistics, we analyze the acoustic correlates of COVID-19 cough sounds based on the ComParE feature set, i.e., a standardized set of 6,373 acoustic higher-level features. Furthermore, we train automatic COVID-19 detection models with machine learning methods and explore the latent features by evaluating the contribution of all features to the COVID-19 status predictions. RESULTS The experimental results demonstrate that a set of acoustic parameters of cough sounds, e.g., statistical functionals of the root mean square energy and Mel-frequency cepstral coefficients, bear essential acoustic information in terms of effect sizes for the differentiation between COVID-19 positive and COVID-19 negative cough samples. Our general automatic COVID-19 detection model performs significantly above chance level, i.e., at an unweighted average recall (UAR) of 0.632, on a data set consisting of 1,411 cough samples (COVID-19 positive/negative: 210/1,201). CONCLUSIONS Based on the acoustic correlates analysis on the ComParE feature set and the feature analysis in the effective COVID-19 detection approach, we find that several acoustic features that show higher effects in conventional group difference testing are also higher weighted in the machine learning models.
Collapse
Affiliation(s)
- Zhao Ren
- EIHW - Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany; L3S Research Center, Hannover, Germany.
| | - Yi Chang
- GLAM - Group on Language, Audio, & Music, Imperial College London, London, United Kingdom
| | - Katrin D Bartl-Pokorny
- EIHW - Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany; Division of Phoniatrics, Medical University of Graz, Graz, Austria; Division of Physiology, Medical University of Graz, Graz, Austria.
| | - Florian B Pokorny
- EIHW - Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany; Division of Phoniatrics, Medical University of Graz, Graz, Austria; Division of Physiology, Medical University of Graz, Graz, Austria
| | - Björn W Schuller
- EIHW - Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany; GLAM - Group on Language, Audio, & Music, Imperial College London, London, United Kingdom
| |
Collapse
|
4
|
Carreiro-Martins P, Paixão P, Caires I, Matias P, Gamboa H, Soares F, Gomez P, Sousa J, Neuparth N. Acoustic and Clinical Data Analysis of Vocal Recordings: Pandemic Insights and Lessons. Diagnostics (Basel) 2024; 14:2273. [PMID: 39451596 PMCID: PMC11507201 DOI: 10.3390/diagnostics14202273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2024] [Revised: 10/03/2024] [Accepted: 10/09/2024] [Indexed: 10/26/2024] Open
Abstract
Background/Objectives: The interest in processing human speech and other human-generated audio signals as a diagnostic tool has increased due to the COVID-19 pandemic. The project OSCAR (vOice Screening of CoronA viRus) aimed to develop an algorithm to screen for COVID-19 using a dataset of Portuguese participants with voice recordings and clinical data. Methods: This cross-sectional study aimed to characterise the pattern of sounds produced by the vocal apparatus in patients with SARS-CoV-2 infection documented by a positive RT-PCR test, and to develop and validate a screening algorithm. In Phase II, the algorithm developed in Phase I was tested in a real-world setting. Results: In Phase I, after filtering, the training group consisted of 166 subjects who were effectively available to train the classification model (34.3% SARS-CoV-2 positive/65.7% SARS-CoV-2 negative). Phase II enrolled 58 participants (69.0% SARS-CoV-2 positive/31.0% SARS-CoV-2 negative). The final model achieved a sensitivity of 85%, a specificity of 88.9%, and an F1-score of 84.7%, suggesting voice screening algorithms as an attractive strategy for COVID-19 diagnosis. Conclusions: Our findings highlight the potential of a voice-based detection strategy as an alternative method for respiratory tract screening.
Collapse
Affiliation(s)
- Pedro Carreiro-Martins
- Comprehensive Health Research Center (CHRC), LA-REAL, NOVA Medical School, Campo Mártires da Pátria, 130, 1169-056 Lisboa, Portugal (N.N.)
- Serviço de Imunoalergologia, Hospital de Dona Estefânia, ULS São José, Rua Jacinta Marto, 1169-045 Lisbon, Portugal
| | - Paulo Paixão
- Comprehensive Health Research Center (CHRC), LA-REAL, NOVA Medical School, Campo Mártires da Pátria, 130, 1169-056 Lisboa, Portugal (N.N.)
| | - Iolanda Caires
- Comprehensive Health Research Center (CHRC), LA-REAL, NOVA Medical School, Campo Mártires da Pátria, 130, 1169-056 Lisboa, Portugal (N.N.)
| | - Pedro Matias
- Fraunhofer Portugal AICOS, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal (F.S.)
| | - Hugo Gamboa
- Fraunhofer Portugal AICOS, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal (F.S.)
- Laboratory for Instrumentation, Biomedical Engineering and Radiation Physics (LIBPhys), Faculdade de Ciências e Tecnologia, NOVA University of Lisbon, Caparica, 2820-001 Lisbon, Portugal
| | - Filipe Soares
- Fraunhofer Portugal AICOS, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal (F.S.)
| | - Pedro Gomez
- NeuSpeLab, CTB, Universidad Politécnica de Madrid, Campus de Montegancedo, s/n, 28223 Madrid, Spain;
| | - Joana Sousa
- NOS Inovação, Rua Actor António Silva, 9–6° Piso, Campo Grande, 1600-404 Lisboa, Portugal
| | - Nuno Neuparth
- Comprehensive Health Research Center (CHRC), LA-REAL, NOVA Medical School, Campo Mártires da Pátria, 130, 1169-056 Lisboa, Portugal (N.N.)
- Serviço de Imunoalergologia, Hospital de Dona Estefânia, ULS São José, Rua Jacinta Marto, 1169-045 Lisbon, Portugal
| |
Collapse
|
5
|
Truong T, Lenga M, Serrurier A, Mohammadi S. Fused Audio Instance and Representation for Respiratory Disease Detection. SENSORS (BASEL, SWITZERLAND) 2024; 24:6176. [PMID: 39409216 PMCID: PMC11479208 DOI: 10.3390/s24196176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Revised: 09/17/2024] [Accepted: 09/20/2024] [Indexed: 10/20/2024]
Abstract
Audio-based classification techniques for body sounds have long been studied to aid in the diagnosis of respiratory diseases. While most research is centered on the use of coughs as the main acoustic biomarker, other body sounds also have the potential to detect respiratory diseases. Recent studies on the coronavirus disease 2019 (COVID-19) have suggested that breath and speech sounds, in addition to cough, correlate with the disease. Our study proposes fused audio instance and representation (FAIR) as a method for respiratory disease detection. FAIR relies on constructing a joint feature vector from various body sounds represented in waveform and spectrogram form. We conduct experiments on the use case of COVID-19 detection by combining waveform and spectrogram representation of body sounds. Our findings show that the use of self-attention to combine extracted features from cough, breath, and speech sounds leads to the best performance with an area under the receiver operating characteristic curve (AUC) score of 0.8658, a sensitivity of 0.8057, and a specificity of 0.7958. Compared to models trained solely on spectrograms or waveforms, the use of both representations results in an improved AUC score, demonstrating that combining spectrogram and waveform representation helps to enrich the extracted features and outperforms the models that use only one representation. While this study focuses on COVID-19, FAIR's flexibility allows it to combine various multi-modal and multi-instance features in many other diagnostic applications, potentially leading to more accurate diagnoses across a wider range of diseases.
Collapse
Affiliation(s)
- Tuan Truong
- Bayer AG, 13353 Berlin, Germany; (M.L.); (S.M.)
| | | | - Antoine Serrurier
- Clinic for Phoniatrics, Pedaudiology and Communication Disorders, University Hospital of RWTH Aachen, 52074 Aachen, Germany;
| | | |
Collapse
|
6
|
Austin GI, Pe’er I, Korem T. Distributional bias compromises leave-one-out cross-validation. ARXIV 2024:arXiv:2406.01652v1. [PMID: 38883233 PMCID: PMC11177965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2024]
Abstract
Cross-validation is a common method for estimating the predictive performance of machine learning models. In a data-scarce regime, where one typically wishes to maximize the number of instances used for training the model, an approach called 'leave-one-out cross-validation' is often used. In this design, a separate model is built for predicting each data instance after training on all other instances. Since this results in a single test data point available per model trained, predictions are aggregated across the entire dataset to calculate common rank-based performance metrics such as the area under the receiver operating characteristic or precision-recall curves. In this work, we demonstrate that this approach creates a negative correlation between the average label of each training fold and the label of its corresponding test instance, a phenomenon that we term distributional bias. As machine learning models tend to regress to the mean of their training data, this distributional bias tends to negatively impact performance evaluation and hyperparameter optimization. We show that this effect generalizes to leave-P-out cross-validation and persists across a wide range of modeling and evaluation approaches, and that it can lead to a bias against stronger regularization. To address this, we propose a generalizable rebalanced cross-validation approach that corrects for distributional bias. We demonstrate that our approach improves cross-validation performance evaluation in synthetic simulations and in several published leave-one-out analyses.
Collapse
Affiliation(s)
- George I. Austin
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
- Program for Mathematical Genomics, Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Itsik Pe’er
- Program for Mathematical Genomics, Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Department of Computer Science, Columbia University, New York, NY, USA
| | - Tal Korem
- Program for Mathematical Genomics, Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| |
Collapse
|
7
|
Xu W, Bao X, Lou X, Liu X, Chen Y, Zhao X, Zhang C, Pan C, Liu W, Liu F. Feature fusion method for pulmonary tuberculosis patient detection based on cough sound. PLoS One 2024; 19:e0302651. [PMID: 38743758 PMCID: PMC11093322 DOI: 10.1371/journal.pone.0302651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 04/08/2024] [Indexed: 05/16/2024] Open
Abstract
Since the COVID-19, cough sounds have been widely used for screening purposes. Intelligent analysis techniques have proven to be effective in detecting respiratory diseases. In 2021, there were up to 10 million TB-infected patients worldwide, with an annual growth rate of 4.5%. Most of the patients were from economically underdeveloped regions and countries. The PPD test, a common screening method in the community, has a sensitivity of as low as 77%. Although IGRA and Xpert MTB/RIF offer high specificity and sensitivity, their cost makes them less accessible. In this study, we proposed a feature fusion model-based cough sound classification method for primary TB screening in communities. Data were collected from hospitals using smart phones, including 230 cough sounds from 70 patients with TB and 226 cough sounds from 74 healthy subjects. We employed Bi-LSTM and Bi-GRU recurrent neural networks to analyze five traditional feature sets including the Mel frequency cepstrum coefficient (MFCC), zero-crossing rate (ZCR), short-time energy, root mean square, and chroma_cens. The incorporation of features extracted from the speech spectrogram by 2D convolution training into the Bi-LSTM model enhanced the classification results. With traditional futures, the best TB patient detection result was achieved with the Bi-LSTM model, with 93.99% accuracy, 93.93% specificity, and 92.39% sensitivity. When combined with a speech spectrogram, the classification results showed 96.33% accuracy, 94.99% specificity, and 98.13% sensitivity. Our findings underscore that traditional features and deep features have good complementarity when fused using Bi LSTM modelling, which outperforms existing PPD detection methods in terms of both efficiency and accuracy.
Collapse
Affiliation(s)
- Wenlong Xu
- College of Information Engineering, China Jiliang University, Hangzhou, Zhejiang, China
| | - Xiaofan Bao
- College of Information Engineering, China Jiliang University, Hangzhou, Zhejiang, China
| | - Xiaomin Lou
- Hangzhou Red Cross Hospital, Hangzhou, Zhejiang, China
| | - Xiaofang Liu
- College of Information Engineering, China Jiliang University, Hangzhou, Zhejiang, China
| | - Yuanyuan Chen
- Hangzhou Red Cross Hospital, Hangzhou, Zhejiang, China
| | | | - Chenlu Zhang
- Hangzhou Red Cross Hospital, Hangzhou, Zhejiang, China
| | - Chen Pan
- College of Information Engineering, China Jiliang University, Hangzhou, Zhejiang, China
| | - Wenlong Liu
- College of Information Engineering, China Jiliang University, Hangzhou, Zhejiang, China
| | - Feng Liu
- School of Information Technology and Electrical Engineering, University of Queensland, Brisbane, Queensland, Australia
| |
Collapse
|
8
|
Isangula KG, Haule RJ. Leveraging AI and Machine Learning to Develop and Evaluate a Contextualized User-Friendly Cough Audio Classifier for Detecting Respiratory Diseases: Protocol for a Diagnostic Study in Rural Tanzania. JMIR Res Protoc 2024; 13:e54388. [PMID: 38652526 PMCID: PMC11077412 DOI: 10.2196/54388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 02/14/2024] [Accepted: 02/21/2024] [Indexed: 04/25/2024] Open
Abstract
BACKGROUND Respiratory diseases, including active tuberculosis (TB), asthma, and chronic obstructive pulmonary disease (COPD), constitute substantial global health challenges, necessitating timely and accurate diagnosis for effective treatment and management. OBJECTIVE This research seeks to develop and evaluate a noninvasive user-friendly artificial intelligence (AI)-powered cough audio classifier for detecting these respiratory conditions in rural Tanzania. METHODS This is a nonexperimental cross-sectional research with the primary objective of collection and analysis of cough sounds from patients with active TB, asthma, and COPD in outpatient clinics to generate and evaluate a noninvasive cough audio classifier. Specialized cough sound recording devices, designed to be nonintrusive and user-friendly, will facilitate the collection of diverse cough sound samples from patients attending outpatient clinics in 20 health care facilities in the Shinyanga region. The collected cough sound data will undergo rigorous analysis, using advanced AI signal processing and machine learning techniques. By comparing acoustic features and patterns associated with TB, asthma, and COPD, a robust algorithm capable of automated disease discrimination will be generated facilitating the development of a smartphone-based cough sound classifier. The classifier will be evaluated against the calculated reference standards including clinical assessments, sputum smear, GeneXpert, chest x-ray, culture and sensitivity, spirometry and peak expiratory flow, and sensitivity and predictive values. RESULTS This research represents a vital step toward enhancing the diagnostic capabilities available in outpatient clinics, with the potential to revolutionize the field of respiratory disease diagnosis. Findings from the 4 phases of the study will be presented as descriptions supported by relevant images, tables, and figures. The anticipated outcome of this research is the creation of a reliable, noninvasive diagnostic cough classifier that empowers health care professionals and patients themselves to identify and differentiate these respiratory diseases based on cough sound patterns. CONCLUSIONS Cough sound classifiers use advanced technology for early detection and management of respiratory conditions, offering a less invasive and more efficient alternative to traditional diagnostics. This technology promises to ease public health burdens, improve patient outcomes, and enhance health care access in under-resourced areas, potentially transforming respiratory disease management globally. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) PRR1-10.2196/54388.
Collapse
Affiliation(s)
- Kahabi Ganka Isangula
- School of Nursing and Midwifery, Aga Khan University, Dar Es Salaam, United Republic of Tanzania
| | - Rogers John Haule
- School of Nursing and Midwifery, Aga Khan University, Dar Es Salaam, United Republic of Tanzania
| |
Collapse
|
9
|
Anibal JT, Landa AJ, Hang NTT, Song MJ, Peltekian AK, Shin A, Huth HB, Hazen LA, Christou AS, Rivera J, Morhard RA, Bagci U, Li M, Bensoussan Y, Clifton DA, Wood BJ. Omicron detection with large language models and YouTube audio data. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2022.09.13.22279673. [PMID: 36172131 PMCID: PMC9516853 DOI: 10.1101/2022.09.13.22279673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Publicly available audio data presents a unique opportunity for the development of digital health technologies with large language models (LLMs). In this study, YouTube was mined to collect audio data from individuals with self-declared positive COVID-19 tests as well as those with other upper respiratory infections (URI) and healthy subjects discussing a diverse range of topics. The resulting dataset was transcribed with the Whisper model and used to assess the capacity of LLMs for detecting self-reported COVID-19 cases and performing variant classification. Following prompt optimization, LLMs achieved accuracies of 0.89, 0.97, respectively, in the tasks of identifying self-reported COVID-19 cases and other respiratory illnesses. The model also obtained a mean accuracy of 0.77 at identifying the variant of self-reported COVID-19 cases using only symptoms and other health-related factors described in the YouTube videos. In comparison with past studies, which used scripted, standardized voice samples to capture biomarkers, this study focused on extracting meaningful information from public online audio data. This work introduced novel design paradigms for pandemic management tools, showing the potential of audio data in clinical and public health applications.
Collapse
Affiliation(s)
- James T Anibal
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
- Computational Health Informatics Lab, Oxford Institute of Biomedical Engineering, University of Oxford, Old Road Campus Research Building, Headington, Oxford OX3 7DQ, United Kingdom
| | - Adam J Landa
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Nguyen T T Hang
- Oxford University Clinical Research Unit, Centre for Tropical Medicine, Ho Chi Minh City, Vietnam
| | - Miranda J Song
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Alec K Peltekian
- Department of Computer Science, McCormick School of Engineering, Northwestern University, Mudd Hall, 2233 Tech Drive, Third Floor, Evanston, IL, 60208 USA
| | - Ashley Shin
- National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD, 20894
| | - Hannah B Huth
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Lindsey A Hazen
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Anna S Christou
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Jocelyne Rivera
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Robert A Morhard
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Ulas Bagci
- Feinberg School of Medicine, Northwestern University, 420 E Superior St, Chicago, IL 60611 USA
| | - Ming Li
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| | - Yael Bensoussan
- Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - David A Clifton
- Computational Health Informatics Lab, Oxford Institute of Biomedical Engineering, University of Oxford, Old Road Campus Research Building, Headington, Oxford OX3 7DQ, United Kingdom
| | - Bradford J Wood
- Center for Interventional Oncology, Radiology and Imaging Sciences, NIH Clinical Center, National Cancer Institute, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, 10 Center Dr, Building 10, Room 1C341, MSC 1182, Bethesda, MD 20892-1182 USA
| |
Collapse
|
10
|
Malik H, Anees T. Multi-modal deep learning methods for classification of chest diseases using different medical imaging and cough sounds. PLoS One 2024; 19:e0296352. [PMID: 38470893 DOI: 10.1371/journal.pone.0296352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 12/11/2023] [Indexed: 03/14/2024] Open
Abstract
Chest disease refers to a wide range of conditions affecting the lungs, such as COVID-19, lung cancer (LC), consolidation lung (COL), and many more. When diagnosing chest disorders medical professionals may be thrown off by the overlapping symptoms (such as fever, cough, sore throat, etc.). Additionally, researchers and medical professionals make use of chest X-rays (CXR), cough sounds, and computed tomography (CT) scans to diagnose chest disorders. The present study aims to classify the nine different conditions of chest disorders, including COVID-19, LC, COL, atelectasis (ATE), tuberculosis (TB), pneumothorax (PNEUTH), edema (EDE), pneumonia (PNEU). Thus, we suggested four novel convolutional neural network (CNN) models that train distinct image-level representations for nine different chest disease classifications by extracting features from images. Furthermore, the proposed CNN employed several new approaches such as a max-pooling layer, batch normalization layers (BANL), dropout, rank-based average pooling (RBAP), and multiple-way data generation (MWDG). The scalogram method is utilized to transform the sounds of coughing into a visual representation. Before beginning to train the model that has been developed, the SMOTE approach is used to calibrate the CXR and CT scans as well as the cough sound images (CSI) of nine different chest disorders. The CXR, CT scan, and CSI used for training and evaluating the proposed model come from 24 publicly available benchmark chest illness datasets. The classification performance of the proposed model is compared with that of seven baseline models, namely Vgg-19, ResNet-101, ResNet-50, DenseNet-121, EfficientNetB0, DenseNet-201, and Inception-V3, in addition to state-of-the-art (SOTA) classifiers. The effectiveness of the proposed model is further demonstrated by the results of the ablation experiments. The proposed model was successful in achieving an accuracy of 99.01%, making it superior to both the baseline models and the SOTA classifiers. As a result, the proposed approach is capable of offering significant support to radiologists and other medical professionals.
Collapse
Affiliation(s)
- Hassaan Malik
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Tayyaba Anees
- Department of Software Engineering, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| |
Collapse
|
11
|
Aytekin I, Dalmaz O, Gonc K, Ankishan H, Saritas EU, Bagci U, Celik H, Cukur T. COVID-19 Detection From Respiratory Sounds With Hierarchical Spectrogram Transformers. IEEE J Biomed Health Inform 2024; 28:1273-1284. [PMID: 38051612 PMCID: PMC11658170 DOI: 10.1109/jbhi.2023.3339700] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
Monitoring of prevalent airborne diseases such as COVID-19 characteristically involves respiratory assessments. While auscultation is a mainstream method for preliminary screening of disease symptoms, its utility is hampered by the need for dedicated hospital visits. Remote monitoring based on recordings of respiratory sounds on portable devices is a promising alternative, which can assist in early assessment of COVID-19 that primarily affects the lower respiratory tract. In this study, we introduce a novel deep learning approach to distinguish patients with COVID-19 from healthy controls given audio recordings of cough or breathing sounds. The proposed approach leverages a novel hierarchical spectrogram transformer (HST) on spectrogram representations of respiratory sounds. HST embodies self-attention mechanisms over local windows in spectrograms, and window size is progressively grown over model stages to capture local to global context. HST is compared against state-of-the-art conventional and deep-learning baselines. Demonstrations on crowd-sourced multi-national datasets indicate that HST outperforms competing methods, achieving over 90% area under the receiver operating characteristic curve (AUC) in detecting COVID-19 cases.
Collapse
|
12
|
Kapetanidis P, Kalioras F, Tsakonas C, Tzamalis P, Kontogiannis G, Karamanidou T, Stavropoulos TG, Nikoletseas S. Respiratory Diseases Diagnosis Using Audio Analysis and Artificial Intelligence: A Systematic Review. SENSORS (BASEL, SWITZERLAND) 2024; 24:1173. [PMID: 38400330 PMCID: PMC10893010 DOI: 10.3390/s24041173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 02/03/2024] [Accepted: 02/04/2024] [Indexed: 02/25/2024]
Abstract
Respiratory diseases represent a significant global burden, necessitating efficient diagnostic methods for timely intervention. Digital biomarkers based on audio, acoustics, and sound from the upper and lower respiratory system, as well as the voice, have emerged as valuable indicators of respiratory functionality. Recent advancements in machine learning (ML) algorithms offer promising avenues for the identification and diagnosis of respiratory diseases through the analysis and processing of such audio-based biomarkers. An ever-increasing number of studies employ ML techniques to extract meaningful information from audio biomarkers. Beyond disease identification, these studies explore diverse aspects such as the recognition of cough sounds amidst environmental noise, the analysis of respiratory sounds to detect respiratory symptoms like wheezes and crackles, as well as the analysis of the voice/speech for the evaluation of human voice abnormalities. To provide a more in-depth analysis, this review examines 75 relevant audio analysis studies across three distinct areas of concern based on respiratory diseases' symptoms: (a) cough detection, (b) lower respiratory symptoms identification, and (c) diagnostics from the voice and speech. Furthermore, publicly available datasets commonly utilized in this domain are presented. It is observed that research trends are influenced by the pandemic, with a surge in studies on COVID-19 diagnosis, mobile data acquisition, and remote diagnosis systems.
Collapse
Affiliation(s)
- Panagiotis Kapetanidis
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece (C.T.); (G.K.); (S.N.)
| | - Fotios Kalioras
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece (C.T.); (G.K.); (S.N.)
| | - Constantinos Tsakonas
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece (C.T.); (G.K.); (S.N.)
| | - Pantelis Tzamalis
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece (C.T.); (G.K.); (S.N.)
| | - George Kontogiannis
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece (C.T.); (G.K.); (S.N.)
| | - Theodora Karamanidou
- Pfizer Center for Digital Innovation, 55535 Thessaloniki, Greece; (T.K.); (T.G.S.)
| | | | - Sotiris Nikoletseas
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece (C.T.); (G.K.); (S.N.)
| |
Collapse
|
13
|
Hoffer O, Brzezinski RY, Ganim A, Shalom P, Ovadia-Blechman Z, Ben-Baruch L, Lewis N, Peled R, Shimon C, Naftali-Shani N, Katz E, Zimmer Y, Rabin N. Smartphone-based detection of COVID-19 and associated pneumonia using thermal imaging and a transfer learning algorithm. JOURNAL OF BIOPHOTONICS 2024:e202300486. [PMID: 38253344 DOI: 10.1002/jbio.202300486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 12/28/2023] [Accepted: 12/31/2023] [Indexed: 01/24/2024]
Abstract
COVID-19-related pneumonia is typically diagnosed using chest x-ray or computed tomography images. However, these techniques can only be used in hospitals. In contrast, thermal cameras are portable, inexpensive devices that can be connected to smartphones. Thus, they can be used to detect and monitor medical conditions outside hospitals. Herein, a smartphone-based application using thermal images of a human back was developed for COVID-19 detection. Image analysis using a deep learning algorithm revealed a sensitivity and specificity of 88.7% and 92.3%, respectively. The findings support the future use of noninvasive thermal imaging in primary screening for COVID-19 and associated pneumonia.
Collapse
Affiliation(s)
- Oshrit Hoffer
- School of Electrical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Rafael Y Brzezinski
- Neufeld Cardiac Research Institute, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Tamman Cardiovascular Research Institute, Leviev Heart Center, Sheba Medical Center Tel Hashomer, Ramat Gan, Israel
- Internal Medicine "C" and "E", Tel Aviv Medical Center, Tel Aviv, Israel
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Adam Ganim
- School of Electrical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Perry Shalom
- School of Software Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Zehava Ovadia-Blechman
- School of Medical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Lital Ben-Baruch
- School of Electrical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Nir Lewis
- Neufeld Cardiac Research Institute, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Tamman Cardiovascular Research Institute, Leviev Heart Center, Sheba Medical Center Tel Hashomer, Ramat Gan, Israel
| | - Racheli Peled
- Neufeld Cardiac Research Institute, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Tamman Cardiovascular Research Institute, Leviev Heart Center, Sheba Medical Center Tel Hashomer, Ramat Gan, Israel
| | - Carmi Shimon
- School of Electrical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Nili Naftali-Shani
- Neufeld Cardiac Research Institute, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Tamman Cardiovascular Research Institute, Leviev Heart Center, Sheba Medical Center Tel Hashomer, Ramat Gan, Israel
| | - Eyal Katz
- School of Electrical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Yair Zimmer
- School of Medical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
| | - Neta Rabin
- Department of Industrial Engineering, Tel-Aviv University, Tel Aviv, Israel
| |
Collapse
|
14
|
Saeed T, Ijaz A, Sadiq I, Qureshi HN, Rizwan A, Imran A. An AI-Enabled Bias-Free Respiratory Disease Diagnosis Model Using Cough Audio. Bioengineering (Basel) 2024; 11:55. [PMID: 38247932 PMCID: PMC10813025 DOI: 10.3390/bioengineering11010055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 12/25/2023] [Accepted: 01/03/2024] [Indexed: 01/23/2024] Open
Abstract
Cough-based diagnosis for respiratory diseases (RDs) using artificial intelligence (AI) has attracted considerable attention, yet many existing studies overlook confounding variables in their predictive models. These variables can distort the relationship between cough recordings (input data) and RD status (output variable), leading to biased associations and unrealistic model performance. To address this gap, we propose the Bias-Free Network (RBF-Net), an end-to-end solution that effectively mitigates the impact of confounders in the training data distribution. RBF-Net ensures accurate and unbiased RD diagnosis features, emphasizing its relevance by incorporating a COVID-19 dataset in this study. This approach aims to enhance the reliability of AI-based RD diagnosis models by navigating the challenges posed by confounding variables. A hybrid of a Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks is proposed for the feature encoder module of RBF-Net. An additional bias predictor is incorporated in the classification scheme to formulate a conditional Generative Adversarial Network (c-GAN) that helps in decorrelating the impact of confounding variables from RD prediction. The merit of RBF-Net is demonstrated by comparing classification performance with a State-of-The-Art (SoTA) Deep Learning (DL) model (CNN-LSTM) after training on different unbalanced COVID-19 data sets, created by using a large-scale proprietary cough data set. RBF-Net proved its robustness against extremely biased training scenarios by achieving test set accuracies of 84.1%, 84.6%, and 80.5% for the following confounding variables-gender, age, and smoking status, respectively. RBF-Net outperforms the CNN-LSTM model test set accuracies by 5.5%, 7.7%, and 8.2%, respectively.
Collapse
Affiliation(s)
- Tabish Saeed
- AI4Networks Research Center, Department of Electrical & Computer Engineering, University of Oklahoma, Tulsa, OK 74135, USA; (H.N.Q.); (A.I.)
| | - Aneeqa Ijaz
- AI4Networks Research Center, Department of Electrical & Computer Engineering, University of Oklahoma, Tulsa, OK 74135, USA; (H.N.Q.); (A.I.)
| | - Ismail Sadiq
- James Watt School of Engineering, University of Glasgow, Glasgow G12 8QQ, UK;
| | - Haneya Naeem Qureshi
- AI4Networks Research Center, Department of Electrical & Computer Engineering, University of Oklahoma, Tulsa, OK 74135, USA; (H.N.Q.); (A.I.)
| | - Ali Rizwan
- AI4lyf, Bahria Town Lahore, Lahore 54000, Pakistan;
| | - Ali Imran
- AI4Networks Research Center, Department of Electrical & Computer Engineering, University of Oklahoma, Tulsa, OK 74135, USA; (H.N.Q.); (A.I.)
- James Watt School of Engineering, University of Glasgow, Glasgow G12 8QQ, UK;
| |
Collapse
|
15
|
Sharma M, Nduba V, Njagi LN, Murithi W, Mwongera Z, Hawn TR, Patel SN, Horne DJ. TBscreen: A passive cough classifier for tuberculosis screening with a controlled dataset. SCIENCE ADVANCES 2024; 10:eadi0282. [PMID: 38170773 PMCID: PMC10776005 DOI: 10.1126/sciadv.adi0282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 12/01/2023] [Indexed: 01/05/2024]
Abstract
Recent respiratory disease screening studies suggest promising performance of cough classifiers, but potential biases in model training and dataset quality preclude robust conclusions. To examine tuberculosis (TB) cough diagnostic features, we enrolled subjects with pulmonary TB (N = 149) and controls with other respiratory illnesses (N = 46) in Nairobi. We collected a dataset with 33,000 passive coughs and 1600 forced coughs in a controlled setting with similar demographics. We trained a ResNet18-based cough classifier using images of passive cough scalogram as input and obtained a fivefold cross-validation sensitivity of 0.70 (±0.11 SD). The smartphone-based model had better performance in subjects with higher bacterial load {receiver operating characteristic-area under the curve (ROC-AUC): 0.87 [95% confidence interval (CI): 0.87 to 0.88], P < 0.001} or lung cavities [ROC-AUC: 0.89 (95% CI: 0.88 to 0.89), P < 0.001]. Overall, our data suggest that passive cough features distinguish TB from non-TB subjects and are associated with bacterial burden and disease severity.
Collapse
Affiliation(s)
- Manuja Sharma
- Department of Electrical and Computer Engineering, University of Washington, 185 E Stevens Way NE, Seattle, WA 98195, USA
| | - Videlis Nduba
- Centre for Respiratory Diseases Research, Kenya Medical Research Institute, Mbagathi Rd, Nairobi 610101, Kenya
| | - Lilian N. Njagi
- Centre for Respiratory Diseases Research, Kenya Medical Research Institute, Mbagathi Rd, Nairobi 610101, Kenya
| | - Wilfred Murithi
- Centre for Respiratory Diseases Research, Kenya Medical Research Institute, Mbagathi Rd, Nairobi 610101, Kenya
| | - Zipporah Mwongera
- Centre for Respiratory Diseases Research, Kenya Medical Research Institute, Mbagathi Rd, Nairobi 610101, Kenya
| | - Thomas R. Hawn
- Department of Medicine, University of Washington, 1959 NE Pacific Street, Seattle, WA 98195, USA
| | - Shwetak N. Patel
- Department of Electrical and Computer Engineering, University of Washington, 185 E Stevens Way NE, Seattle, WA 98195, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, 185 E Stevens Way NE, Seattle, WA 98195, USA
| | - David J. Horne
- Department of Medicine, University of Washington, 1959 NE Pacific Street, Seattle, WA 98195, USA
| |
Collapse
|
16
|
Singh K, Kaur N, Prabhu A. Combating COVID-19 Crisis using Artificial Intelligence (AI) Based Approach: Systematic Review. Curr Top Med Chem 2024; 24:737-753. [PMID: 38318824 DOI: 10.2174/0115680266282179240124072121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 12/19/2023] [Accepted: 12/27/2023] [Indexed: 02/07/2024]
Abstract
BACKGROUND SARS-CoV-2, the unique coronavirus that causes COVID-19, has wreaked damage around the globe, with victims displaying a wide range of difficulties that have encouraged medical professionals to look for innovative technical solutions and therapeutic approaches. Artificial intelligence-based methods have contributed a significant part in tackling complicated issues, and some institutions have been quick to embrace and tailor these solutions in response to the COVID-19 pandemic's obstacles. Here, in this review article, we have covered a few DL techniques for COVID-19 detection and diagnosis, as well as ML techniques for COVID-19 identification, severity classification, vaccine and drug development, mortality rate prediction, contact tracing, risk assessment, and public distancing. This review illustrates the overall impact of AI/ML tools on tackling and managing the outbreak. PURPOSE The focus of this research was to undertake a thorough evaluation of the literature on the part of Artificial Intelligence (AI) as a complete and efficient solution in the battle against the COVID-19 epidemic in the domains of detection and diagnostics of disease, mortality prediction and vaccine as well as drug development. METHODS A comprehensive exploration of PubMed, Web of Science, and Science Direct was conducted using PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analysis) regulations to find all possibly suitable papers conducted and made publicly available between December 1, 2019, and August 2023. COVID-19, along with AI-specific words, was used to create the query syntax. RESULTS During the period covered by the search strategy, 961 articles were published and released online. Out of these, a total of 135 papers were chosen for additional investigation. Mortality rate prediction, early detection and diagnosis, vaccine as well as drug development, and lastly, incorporation of AI for supervising and controlling the COVID-19 pandemic were the four main topics focused entirely on AI applications used to tackle the COVID-19 crisis. Out of 135, 60 research papers focused on the detection and diagnosis of the COVID-19 pandemic. Next, 19 of the 135 studies applied a machine-learning approach for mortality rate prediction. Another 22 research publications emphasized the vaccine as well as drug development. Finally, the remaining studies were concentrated on controlling the COVID-19 pandemic by applying AI AI-based approach to it. CONCLUSION We compiled papers from the available COVID-19 literature that used AI-based methodologies to impart insights into various COVID-19 topics in this comprehensive study. Our results suggest crucial characteristics, data types, and COVID-19 tools that can aid in medical and translational research facilitation.
Collapse
Affiliation(s)
- Kavya Singh
- Department of Biotechnology, Banasthali University, Banasthali Vidyapith, Banasthali, 304022, Rajasthan, India
| | - Navjeet Kaur
- Department of Chemistry & Division of Research and Development, Lovely Professional University, Phagwara, 144411, Punjab, India
| | - Ashish Prabhu
- Biotechnology Department, NIT Warangal, Warangal, 506004, Telangana, India
| |
Collapse
|
17
|
Efromson J, Ferrero G, Bègue A, Doman TJJ, Dugo C, Barker A, Saliu V, Reamey P, Kim K, Harfouche M, Yoder JA. Automated, high-throughput quantification of EGFP-expressing neutrophils in zebrafish by machine learning and a highly-parallelized microscope. PLoS One 2023; 18:e0295711. [PMID: 38060605 PMCID: PMC10703246 DOI: 10.1371/journal.pone.0295711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 11/26/2023] [Indexed: 12/18/2023] Open
Abstract
Normal development of the immune system is essential for overall health and disease resistance. Bony fish, such as the zebrafish (Danio rerio), possess all the major immune cell lineages as mammals and can be employed to model human host response to immune challenge. Zebrafish neutrophils, for example, are present in the transparent larvae as early as 48 hours post fertilization and have been examined in numerous infection and immunotoxicology reports. One significant advantage of the zebrafish model is the ability to affordably generate high numbers of individual larvae that can be arrayed in multi-well plates for high throughput genetic and chemical exposure screens. However, traditional workflows for imaging individual larvae have been limited to low-throughput studies using traditional microscopes and manual analyses. Using a newly developed, parallelized microscope, the Multi-Camera Array Microscope (MCAM™), we have optimized a rapid, high-resolution algorithmic method to count fluorescently labeled cells in zebrafish larvae in vivo. Using transgenic zebrafish larvae, in which neutrophils express EGFP, we captured 18 gigapixels of images across a full 96-well plate, in 75 seconds, and processed the resulting datastream, counting individual fluorescent neutrophils in all individual larvae in 5 minutes. This automation is facilitated by a machine learning segmentation algorithm that defines the most in-focus view of each larva in each well after which pixel intensity thresholding and blob detection are employed to locate and count fluorescent cells. We validated this method by comparing algorithmic neutrophil counts to manual counts in larvae subjected to changes in neutrophil numbers, demonstrating the utility of this approach for high-throughput genetic and chemical screens where a change in neutrophil number is an endpoint metric. Using the MCAM™ we have been able to, within minutes, acquire both enough data to create an automated algorithm and execute a biological experiment with statistical significance. Finally, we present this open-source software package which allows the user to train and evaluate a custom machine learning segmentation model and use it to localize zebrafish and analyze cell counts within the segmented region of interest. This software can be modified as needed for studies involving other zebrafish cell lineages using different transgenic reporter lines and can also be adapted for studies using other amenable model species.
Collapse
Affiliation(s)
- John Efromson
- Ramona Optics Inc., Durham, NC, United States of America
| | - Giuliano Ferrero
- Department of Molecular Biological Sciences, North Carolina State University, Raleigh, NC, United States of America
| | - Aurélien Bègue
- Ramona Optics Inc., Durham, NC, United States of America
| | | | - Clay Dugo
- Ramona Optics Inc., Durham, NC, United States of America
| | - Andi Barker
- Department of Molecular Biological Sciences, North Carolina State University, Raleigh, NC, United States of America
| | - Veton Saliu
- Ramona Optics Inc., Durham, NC, United States of America
| | - Paul Reamey
- Ramona Optics Inc., Durham, NC, United States of America
| | - Kanghyun Kim
- Department of Biomedical Engineering, Duke University, Durham, NC, United States of America
| | - Mark Harfouche
- Ramona Optics Inc., Durham, NC, United States of America
| | - Jeffrey A. Yoder
- Department of Molecular Biological Sciences, North Carolina State University, Raleigh, NC, United States of America
| |
Collapse
|
18
|
Janssen S, Upton CM, De Jager VR, Faraj A, Pahar M, Miranda IDS, Diacon AH, Simonsson USH, Niesler TR. Cough as Noninvasive Biomarker for Monitoring Tuberculosis Treatment: A Proof-of-Concept Study. Ann Am Thorac Soc 2023; 20:1822-1825. [PMID: 37751498 DOI: 10.1513/annalsats.202305-456rl] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 09/26/2023] [Indexed: 09/28/2023] Open
|
19
|
Dang T, Spathis D, Ghosh A, Mascolo C. Human-centred artificial intelligence for mobile health sensing: challenges and opportunities. ROYAL SOCIETY OPEN SCIENCE 2023; 10:230806. [PMID: 38026044 PMCID: PMC10646451 DOI: 10.1098/rsos.230806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/23/2023] [Indexed: 12/01/2023]
Abstract
Advances in wearable sensing and mobile computing have enabled the collection of health and well-being data outside of traditional laboratory and hospital settings, paving the way for a new era of mobile health. Meanwhile, artificial intelligence (AI) has made significant strides in various domains, demonstrating its potential to revolutionize healthcare. Devices can now diagnose diseases, predict heart irregularities and unlock the full potential of human cognition. However, the application of machine learning (ML) to mobile health sensing poses unique challenges due to noisy sensor measurements, high-dimensional data, sparse and irregular time series, heterogeneity in data, privacy concerns and resource constraints. Despite the recognition of the value of mobile sensing, leveraging these datasets has lagged behind other areas of ML. Furthermore, obtaining quality annotations and ground truth for such data is often expensive or impractical. While recent large-scale longitudinal studies have shown promise in leveraging wearable sensor data for health monitoring and prediction, they also introduce new challenges for data modelling. This paper explores the challenges and opportunities of human-centred AI for mobile health, focusing on key sensing modalities such as audio, location and activity tracking. We discuss the limitations of current approaches and propose potential solutions.
Collapse
Affiliation(s)
- Ting Dang
- University of Cambridge, Cambridge, UK
- Nokia Bell Labs, Cambridge, UK
| | - Dimitris Spathis
- University of Cambridge, Cambridge, UK
- Nokia Bell Labs, Cambridge, UK
| | - Abhirup Ghosh
- University of Cambridge, Cambridge, UK
- University of Birmingham, Birmingham, UK
| | | |
Collapse
|
20
|
Pentakota P, Rudraraju G, Sripada NR, Mamidgi B, Gottipulla C, Jalukuru C, Palreddy SD, Bhoge NKR, Firmal P, Yechuri V, Jain M, Peddireddi VS, Bhimarasetty DM, Sreenivas S, Prasad K KL, Joshi N, Vijayan S, Turaga S, Avasarala V. Screening COVID-19 by Swaasa AI platform using cough sounds: a cross-sectional study. Sci Rep 2023; 13:18284. [PMID: 37880351 PMCID: PMC10600180 DOI: 10.1038/s41598-023-45104-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 10/16/2023] [Indexed: 10/27/2023] Open
Abstract
The Advent of Artificial Intelligence (AI) has led to the use of auditory data for detecting various diseases, including COVID-19. SARS-CoV-2 infection has claimed more than six million lives to date and therefore, needs a robust screening technique to control the disease spread. In the present study we created and validated the Swaasa AI platform, which uses the signature cough sound and symptoms presented by patients to screen and prioritize COVID-19 patients. We collected cough data from 234 COVID-19 suspects to validate our Convolutional Neural Network (CNN) architecture and Feedforward Artificial Neural Network (FFANN) (tabular features) based algorithm. The final output from both models was combined to predict the likelihood of having the disease. During the clinical validation phase, our model showed a 75.54% accuracy rate in detecting the likely presence of COVID-19, with 95.45% sensitivity and 73.46% specificity. We conducted pilot testing on 183 presumptive COVID subjects, of which 58 were truly COVID-19 positive, resulting in a Positive Predictive Value of 70.73%. Due to the high cost and technical expertise required for currently available rapid screening methods, there is a need for a cost-effective and remote monitoring tool that can serve as a preliminary screening method for potential COVID-19 subjects. Therefore, Swaasa would be highly beneficial in detecting the disease and could have a significant impact in reducing its spread.
Collapse
Affiliation(s)
| | | | | | | | | | - Charan Jalukuru
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | | | | | - Priyanka Firmal
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | - Venkat Yechuri
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | - Manmohan Jain
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | | | | | - S Sreenivas
- Andhra Medical College, Visakhapatnam, India
| | | | | | - Shibu Vijayan
- Qure.Ai Technologies, Oberoi Commerz II, Mumbai, India
| | | | - Vardhan Avasarala
- Otolaryngology - Head and Neck Surgery, Northeast Ohio Medical University, Rootstown, USA
| |
Collapse
|
21
|
Alghushairy O, Ali F, Alghamdi W, Khalid M, Alsini R, Asiry O. Machine learning-based model for accurate identification of druggable proteins using light extreme gradient boosting. J Biomol Struct Dyn 2023; 42:12330-12341. [PMID: 37850427 DOI: 10.1080/07391102.2023.2269280] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/04/2023] [Indexed: 10/19/2023]
Abstract
The identification of druggable proteins (DPs) is significant for the development of new drugs, personalized medicine, understanding of disease mechanisms, drug repurposing, and economic benefits. By identifying new druggable targets, researchers can develop new therapies for a range of diseases, leading to better patient outcomes. Identification of DPs by machine learning strategies is more efficient and cost-effective than conventional methods. In this study, a computational predictor, namely Drug-LXGB, is introduced to enhance the identification of DPs. Features are discovered by composition, transition, and distribution (CTD), composition of K-spaced amino acid pair (CKSAAP), pseudo-position-specific scoring matrix (PsePSSM), and a novel descriptor, called multi-block pseudo amino acid composition (MB-PseAAC). The dimensions of CTD, CKSAAP, PsePSSM, and MB-PseAAC are integrated and utilized the sequential forward selection as feature selection algorithm. The best characteristics are provided by random forest, extreme gradient boosting, and light eXtreme gradient boosting (LXGB). The predictive analysis of these learning methods is measured via 10-fold cross-validation. The LXGB-based model secures the highest results than other existing predictors. Our novel protocol will perform an active role in designing novel drugs and would be fruitful to explore the potential target. This study will help better to capture a more universal view of a potential target.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Omar Alghushairy
- Department of Information Systems and Technology, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
| | - Farman Ali
- Department of Software Engineering, Sarhad University of Science and Information Technology Peshawar Mardan Campus, Peshawar, Pakistan
| | - Wajdi Alghamdi
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Majdi Khalid
- Department of Computer Science, College of Computers and Information Systems, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Raed Alsini
- Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Othman Asiry
- Department of Information Technology, College of Computing and Information Technology at Khulais, University of Jeddah, Jeddah, Saudi Arabia
| |
Collapse
|
22
|
Malik H, Anees T, Al-Shamaylehs AS, Alharthi SZ, Khalil W, Akhunzada A. Deep Learning-Based Classification of Chest Diseases Using X-rays, CT Scans, and Cough Sound Images. Diagnostics (Basel) 2023; 13:2772. [PMID: 37685310 PMCID: PMC10486427 DOI: 10.3390/diagnostics13172772] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/14/2023] [Accepted: 08/21/2023] [Indexed: 09/10/2023] Open
Abstract
Chest disease refers to a variety of lung disorders, including lung cancer (LC), COVID-19, pneumonia (PNEU), tuberculosis (TB), and numerous other respiratory disorders. The symptoms (i.e., fever, cough, sore throat, etc.) of these chest diseases are similar, which might mislead radiologists and health experts when classifying chest diseases. Chest X-rays (CXR), cough sounds, and computed tomography (CT) scans are utilized by researchers and doctors to identify chest diseases such as LC, COVID-19, PNEU, and TB. The objective of the work is to identify nine different types of chest diseases, including COVID-19, edema (EDE), LC, PNEU, pneumothorax (PNEUTH), normal, atelectasis (ATE), and consolidation lung (COL). Therefore, we designed a novel deep learning (DL)-based chest disease detection network (DCDD_Net) that uses a CXR, CT scans, and cough sound images for the identification of nine different types of chest diseases. The scalogram method is used to convert the cough sounds into an image. Before training the proposed DCDD_Net model, the borderline (BL) SMOTE is applied to balance the CXR, CT scans, and cough sound images of nine chest diseases. The proposed DCDD_Net model is trained and evaluated on 20 publicly available benchmark chest disease datasets of CXR, CT scan, and cough sound images. The classification performance of the DCDD_Net is compared with four baseline models, i.e., InceptionResNet-V2, EfficientNet-B0, DenseNet-201, and Xception, as well as state-of-the-art (SOTA) classifiers. The DCDD_Net achieved an accuracy of 96.67%, a precision of 96.82%, a recall of 95.76%, an F1-score of 95.61%, and an area under the curve (AUC) of 99.43%. The results reveal that DCDD_Net outperformed the other four baseline models in terms of many performance evaluation metrics. Thus, the proposed DCDD_Net model can provide significant assistance to radiologists and medical experts. Additionally, the proposed model was also shown to be resilient by statistical evaluations of the datasets using McNemar and ANOVA tests.
Collapse
Affiliation(s)
- Hassaan Malik
- School of Systems and Technology, University of Management and Technology, Lahore 54770, Pakistan; (H.M.); (T.A.)
| | - Tayyaba Anees
- School of Systems and Technology, University of Management and Technology, Lahore 54770, Pakistan; (H.M.); (T.A.)
| | - Ahmad Sami Al-Shamaylehs
- Department of Networks and Cybersecurity, Faculty of Information Technology, Al-Ahliyya Amman University, Amman 19328, Jordan;
| | - Salman Z. Alharthi
- Department of Information System, College of Computers and Information Systems, Al-Lith Campus, Umm AL-Qura University, P.O. Box 7745, AL-Lith 21955, Saudi Arabia
| | - Wajeeha Khalil
- Department of Computer Science and Information Technology, University of Engineering and Technology Peshawar, Peshawar 25000, Pakistan;
| | - Adnan Akhunzada
- College of Computing & IT, University of Doha for Science and Technology, Doha P.O. Box 24449, Qatar;
| |
Collapse
|
23
|
Efromson J, Ferrero G, Bègue A, Doman TJJ, Dugo C, Barker A, Saliu V, Reamey P, Kim K, Harfouche M, Yoder JA. Automated, high-throughput quantification of EGFP-expressing neutrophils in zebrafish by machine learning and a highly-parallelized microscope. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.16.553550. [PMID: 37645798 PMCID: PMC10462042 DOI: 10.1101/2023.08.16.553550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Normal development of the immune system is essential for overall health and disease resistance. Bony fish, such as the zebrafish (Danio rerio), possess all the major immune cell lineages as mammals and can be employed to model human host response to immune challenge. Zebrafish neutrophils, for example, are present in the transparent larvae as early as 48 hours post fertilization and have been examined in numerous infection and immunotoxicology reports. One significant advantage of the zebrafish model is the ability to affordably generate high numbers of individual larvae that can be arrayed in multi-well plates for high throughput genetic and chemical exposure screens. However, traditional workflows for imaging individual larvae have been limited to low-throughput studies using traditional microscopes and manual analyses. Using a newly developed, parallelized microscope, the Multi-Camera Array Microscope (MCAM™), we have optimized a rapid, high-resolution algorithmic method to count fluorescently labeled cells in zebrafish larvae in vivo. Using transgenic zebrafish larvae, in which neutrophils express EGFP, we captured 18 gigapixels of images across a full 96-well plate, in 75 seconds, and processed the resulting datastream, counting individual fluorescent neutrophils in all individual larvae in 5 minutes. This automation is facilitated by a machine learning segmentation algorithm that defines the most in-focus view of each larva in each well after which pixel intensity thresholding and blob detection are employed to locate and count fluorescent cells. We validated this method by comparing algorithmic neutrophil counts to manual counts in larvae subjected to changes in neutrophil numbers, demonstrating the utility of this approach for high-throughput genetic and chemical screens where a change in neutrophil number is an endpoint metric. Using the MCAM™ we have been able to, within minutes, acquire both enough data to create an automated algorithm and execute a biological experiment with statistical significance. Finally, we present this open-source software package which allows the user to train and evaluate a custom machine learning segmentation model and use it to localize zebrafish and analyze cell counts within the segmented region of interest. This software can be modified as needed for studies involving other zebrafish cell lineages using different transgenic reporter lines and can also be adapted for studies using other amenable model species.
Collapse
Affiliation(s)
| | - Giuliano Ferrero
- Department of Molecular Biological Sciences, North Carolina State University, Raleigh, NC
| | | | | | | | - Andi Barker
- Department of Molecular Biological Sciences, North Carolina State University, Raleigh, NC
| | | | | | - Kanghyun Kim
- Department of Biomedical Engineering, Duke University, Durham, NC
| | | | - Jeffrey A. Yoder
- Department of Molecular Biological Sciences, North Carolina State University, Raleigh, NC
| |
Collapse
|
24
|
Ko K, Lee B, Kim D, Hong J, Ko H. Open Set Bioacoustic Signal Classification based on Class Anchor Clustering with Closed Set Unknown Bioacoustic Signals. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38082607 DOI: 10.1109/embc40787.2023.10340108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Recently, deep learning-driven studies have been introduced for bioacoustic signal classification. Most of them, however, have the limitation that the input of the classifier needs to match with a trained label which is known as closed set recognition (CSR). To this end, the classifier trained by CSR would not cover a real stream task since the input of the classifier has so many variations. To combat real-world tasks, open set recognition (OSR) has been developed. In OSR, randomly collected inputs are fed to the classifier and the classifier predicts target classes and Unknown class. However, this OSR has been spotlighted in the studies of computer vision and speech domains while the domain of bioacoustic signal is less developed. Especially, to our best knowledge, OSR for animal sound classification has not been studied. This paper proposes a novel method for open set bioacoustic signal classification based on Class Anchored Clustering (CAC) loss with closed set unknown bioacoustic signals. To use the closed set unknown signals for training, a total of n +1 classes are used by adding one additional Unknown class to n target classes, and n +1 cross-entropy loss is added to the CAC loss. To evaluate the proposed method, we build an animal sound dataset that includes 101 species of sounds and compare its performance with baseline methods. In the experiments, our proposed method shows higher performance than other baseline methods in the area under the receiver operating curve for detecting target class and unknown class, the classification accuracy of open set signals, and classification accuracy for target classes. As a result, the closed set class samples are well classified while the open set unknown class can be also recognized with high accuracy at the same time.
Collapse
|
25
|
Tan T, Wang J, Xu C, Tan Z. An Optimized Federated Learning Approach with the Data-Sharing Function to the Analysis of Cardiothoracic Time-Series Signals. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-6. [PMID: 38082948 DOI: 10.1109/embc40787.2023.10340752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Machine/deep learning has been widely used for big data analysis in the field of healthcare, but it is still a question to ensure both computation efficiency and data security/confidentiality for the protection of private information. Referring to the data-sharing function of the federated learning (FedL) model, we propose an optimized data-sharing FedL (DSFedL) framework via a data-sharing hub by evaluating an accuracy-privacy loss function. When applied to the derived non-identically and independently distributed (nonIID) datasets simulated from three open-source cardiothoracic databases (i.e., ICBHI, Coswara COVID-19, MIT-BIH Arrhythmia), our optimized DSFedL works efficiently and the results show an optimal outcome of both the accuracy/efficiency and data security/confidentiality management.Clinical Relevance-This provides a proof of concept for using DSFedL in clinical applications, particularly in those settings that require data confidentiality control.
Collapse
|
26
|
Zhou L, Zhao C, Liu N, Yao X, Cheng Z. Improved LSTM-based deep learning model for COVID-19 prediction using optimized approach. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 2023; 122:106157. [PMID: 36968247 PMCID: PMC10017389 DOI: 10.1016/j.engappai.2023.106157] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 03/08/2023] [Accepted: 03/13/2023] [Indexed: 05/25/2023]
Abstract
Individuals in any country are badly impacted both economically and physically whenever an epidemic of infectious illnesses breaks out. A novel coronavirus strain was responsible for the outbreak of the coronavirus sickness in 2019. Corona Virus Disease 2019 (COVID-19) is the name that the World Health Organization (WHO) officially gave to the pneumonia that was caused by the novel coronavirus on February 11, 2020. The use of models that are informed by machine learning is currently a major focus of study in the field of improved forecasting. By displaying annual trends, forecasting models can be of use in performing impact assessments of potential outcomes. In this paper, proposed forecast models consisting of time series models such as long short-term memory (LSTM), bidirectional long short-term memory (Bi-LSTM), generalized regression unit (GRU), and dense-LSTM have been evaluated for time series prediction of confirmed cases, deaths, and recoveries in 12 major countries that have been affected by COVID-19. Tensorflow1.0 was used for programming. Indices known as mean absolute error (MAE), root means square error (RMSE), Median Absolute Error (MEDAE) and r2 score are utilized in the process of evaluating the performance of models. We presented various ways to time-series forecasting by making use of LSTM models (LSTM, BiLSTM), and we compared these proposed methods to other machine learning models to evaluate the performance of the models. Our study suggests that LSTM based models are among the most advanced models to forecast time series data.
Collapse
Affiliation(s)
- Luyu Zhou
- Department of Pharmacy, College of Biology, Hunan University, Changsha, Hunan 410082, China
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao 266021, China
| | - Chun Zhao
- Department of Pharmacy, College of Biology, Hunan University, Changsha, Hunan 410082, China
| | - Ning Liu
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao 266021, China
| | - Xingduo Yao
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao 266021, China
| | - Zewei Cheng
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao 266021, China
| |
Collapse
|
27
|
Awais M, Bhuva A, Bhuva D, Fatima S, Sadiq T. Optimized DEC: An effective cough detection framework using optimal weighted Features-aided deep Ensemble classifier for COVID-19. Biomed Signal Process Control 2023:105026. [PMID: 37361196 PMCID: PMC10183638 DOI: 10.1016/j.bspc.2023.105026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 04/17/2023] [Accepted: 05/09/2023] [Indexed: 06/28/2023]
Abstract
Since the year 2019, the entire world has been facing the most hazardous and contagious disease as Corona Virus Disease 2019 (COVID-19). Based on the symptoms, the virus can be identified and diagnosed. Amongst, cough is the primary syndrome to detect COVID-19. Existing method requires a long processing time. Early screening and detection is a complex task. To surmount the research drawbacks, a novel ensemble-based deep learning model is designed on heuristic development. The prime intention of the designed work is to detect COVID-19 disease using cough audio signals. At the initial stage, the source signals are fetched and undergo for signal decomposition phase by Empirical Mean Curve Decomposition (EMCD). Consequently, the decomposed signal is called "Mel Frequency Cepstral Coefficients (MFCC), spectral features, and statistical features". Further, all three features are fused and provide the optimal weighted features with the optimal weight value with the help of "Modified Cat and Mouse Based Optimizer (MCMBO)". Lastly, the optimal weighted features are fed as input to the Optimized Deep Ensemble Classifier (ODEC) that is fused together with various classifiers such as "Radial Basis Function (RBF), Long-Short Term Memory (LSTM), and Deep Neural Network (DNN)". In order to attain the best detection results, the parameters in ODEC are optimized by the MCMBO algorithm. Throughout the validation, the designed method attains 96% and 92% concerning accuracy and precision. Thus, result analysis elucidates that the proposed work achieves the desired detective value that aids practitioners to early diagnose COVID-19 ailments.
Collapse
Affiliation(s)
- Muhammad Awais
- Department of Creative Technologies, Air University, Islamabad, Pakistan
| | - Abhishek Bhuva
- Department of Computer Science, University of Massachusetts Boston, United States
| | - Dipen Bhuva
- Department of EECS, Cleveland State University, United States
| | - Saman Fatima
- Department of Medical Education, The University of Lahore, Lahore, Pakistan
| | - Touseef Sadiq
- Department of Information and Communication Technology, University of Agder, Norway
| |
Collapse
|
28
|
Kaliappan J, Bagepalli AR, Almal S, Mishra R, Hu YC, Srinivasan K. Impact of Cross-Validation on Machine Learning Models for Early Detection of Intrauterine Fetal Demise. Diagnostics (Basel) 2023; 13:diagnostics13101692. [PMID: 37238178 DOI: 10.3390/diagnostics13101692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 05/04/2023] [Accepted: 05/08/2023] [Indexed: 05/28/2023] Open
Abstract
Intrauterine fetal demise in women during pregnancy is a major contributing factor in prenatal mortality and is a major global issue in developing and underdeveloped countries. When an unborn fetus passes away in the womb during the 20th week of pregnancy or later, early detection of the fetus can help reduce the chances of intrauterine fetal demise. Machine learning models such as Decision Trees, Random Forest, SVM Classifier, KNN, Gaussian Naïve Bayes, Adaboost, Gradient Boosting, Voting Classifier, and Neural Networks are trained to determine whether the fetal health is Normal, Suspect, or Pathological. This work uses 22 features related to fetal heart rate obtained from the Cardiotocogram (CTG) clinical procedure for 2126 patients. Our paper focuses on applying various cross-validation techniques, namely, K-Fold, Hold-Out, Leave-One-Out, Leave-P-Out, Monte Carlo, Stratified K-fold, and Repeated K-fold, on the above ML algorithms to enhance them and determine the best performing algorithm. We conducted exploratory data analysis to obtain detailed inferences on the features. Gradient Boosting and Voting Classifier achieved 99% accuracy after applying cross-validation techniques. The dataset used has the dimension of 2126 × 22, and the label is multiclass classified as Normal, Suspect, and Pathological condition. Apart from incorporating cross-validation strategies on several machine learning algorithms, the research paper focuses on Blackbox evaluation, which is an Interpretable Machine Learning Technique used to understand the underlying working mechanism of each model and the means by which it picks features to train and predict values.
Collapse
Affiliation(s)
- Jayakumar Kaliappan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Apoorva Reddy Bagepalli
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Shubh Almal
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Rishabh Mishra
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Yuh-Chung Hu
- Department of Mechanical and Electromechanical Engineering, National ILan University, Yilan 26047, Taiwan
| | - Kathiravan Srinivasan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India
| |
Collapse
|
29
|
Sangle SB, Gaikwad CJ. Accumulated bispectral image-based respiratory sound signal classification using deep learning. SIGNAL, IMAGE AND VIDEO PROCESSING 2023; 17:1-8. [PMID: 37362234 PMCID: PMC10161180 DOI: 10.1007/s11760-023-02589-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 04/06/2023] [Accepted: 04/07/2023] [Indexed: 06/28/2023]
Abstract
The COVID-19 virus is increasingly crucial to human health since new variants appear frequently. Detection of COVID-19 through respiratory sound has been an important area of research. This study analyzes respiratory sounds using novel accumulated bi-spectral features. The principal domain bispectrum is used for computing accumulated bispectrum. The resulting magnitude bispectrum is used in forming the bispectral image. In this work, a convolutional neural network (CNN) and ResNet-50 algorithms are designed to classify respiratory sounds as either COVID-19 or healthy. The performance of the proposed method is compared with the state-of-the-art methods. The proposed CNN-based method achieves the highest accuracy of 87.68% for shallow breath sounds, and ResNet-50 achieves the highest accuracy of 87.62% for deep breath sounds. Similarly, proposed methods gives the improved performance for other respiratory sounds.
Collapse
Affiliation(s)
- Sandeep B. Sangle
- Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, D.Y. Patil Deemed to be University, Nerul, Navi Mumbai, Maharashtra 400706 India
| | - Chandrakant J. Gaikwad
- Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, D.Y. Patil Deemed to be University, Nerul, Navi Mumbai, Maharashtra 400706 India
| |
Collapse
|
30
|
Siddiqui SA, Ahmad A, Fatima N. IoT-based disease prediction using machine learning. COMPUTERS & ELECTRICAL ENGINEERING : AN INTERNATIONAL JOURNAL 2023; 108:108675. [PMID: 36987496 PMCID: PMC10036218 DOI: 10.1016/j.compeleceng.2023.108675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 03/09/2023] [Accepted: 03/10/2023] [Indexed: 06/19/2023]
Abstract
COVID-19 disrupted lives and livelihoods and affected various sectors of the economy. One such domain was the already overburdened healthcare sector, which faced fresh challenges as the number of patients rose exponentially and became difficult to deal with. In such a scenario, telemedicine, teleconsultation, and virtual consultation became increasingly common to comply with social distancing norms. To overcome this pressing need of increasing 'remote' consultations in the 'post-COVID' era, the Internet of Things (IoT) has the potential to play a pivotal role, and this present paper attempts to develop a novel system that implements the most efficient machine learning (ML) algorithm and takes input from the patients such as symptoms, audio recordings, available medical reports, and other histories of illnesses to accurately and holistically predict the disease that the patients are suffering from. A few of the symptoms, such as fever and low blood oxygen, can also be measured via sensors using Arduino and ESP8266. It then provides for the appropriate diagnosis and treatment of the disease based on its constantly updated database, which can be developed as an application-based or website-based platform.
Collapse
Affiliation(s)
- Salman Ahmad Siddiqui
- Department of Electronics and Communication Engineering, Jamia Millia Islamia, New Delhi, India
| | - Anwar Ahmad
- Department of Electronics and Communication Engineering, Jamia Millia Islamia, New Delhi, India
| | - Neda Fatima
- Department of Electronics and Communication Engineering, Jamia Millia Islamia, New Delhi, India
| |
Collapse
|
31
|
Ali Y, Khan HU. A Survey on harnessing the Applications of Mobile Computing in Healthcare during the COVID-19 Pandemic: Challenges and Solutions. COMPUTER NETWORKS 2023; 224:109605. [PMID: 36776582 PMCID: PMC9894776 DOI: 10.1016/j.comnet.2023.109605] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 11/17/2022] [Accepted: 01/31/2023] [Indexed: 06/18/2023]
Abstract
The COVID-19 pandemic ravaged almost every walk of life but it triggered many challenges for the healthcare system, globally. Different cutting-edge technologies such as Internet of things (IoT), machine learning, Virtual Reality (VR), Big data, Blockchain etc. have been adopted to cope with this menace. In this regard, various surveys have been conducted to highlight the importance of these technologies. However, among these technologies, the role of mobile computing is of paramount importance which is not found in the existing literature. Hence, this survey in mainly targeted to highlight the significant role of mobile computing in alleviating the impacts of COVID-19 in healthcare sector. The major applications of mobile computing such as software-based solutions, hardware-based solutions and wireless communication-based support for diagnosis, prevention, self-symptom reporting, contact tracing, social distancing, telemedicine and treatment related to coronavirus are discussed in detailed and comprehensive fashion. A state-of-the-art work is presented to identify the challenges along with possible solutions in adoption of mobile computing with respect to COVID-19 pandemic. Hopefully, this research will help the researchers, policymakers and healthcare professionals to understand the current research gaps and future research directions in this domain. To the best level of our knowledge, this is the first survey of its type to address the COVID-19 pandemic by exploring the holistic contribution of mobile computing technologies in healthcare area.
Collapse
Affiliation(s)
- Yasir Ali
- Higher Education Department, Khyber Pakhtunkhwa, Government Degree College Kotha Swabi, KP, Pakistan
- Higher Education Department, Shahzeb Shaheed Government Degree College Razzar, Swabi, KP, Pakistan
| | - Habib Ullah Khan
- Accounting and Information, College of Business and Economics, Qatar University, Doha Qatar
| |
Collapse
|
32
|
Nayak SS, Darji AD, Shah PK. Machine learning approach for detecting Covid-19 from speech signal using Mel frequency magnitude coefficient. SIGNAL, IMAGE AND VIDEO PROCESSING 2023; 17:1-8. [PMID: 37362229 PMCID: PMC10039689 DOI: 10.1007/s11760-023-02537-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 02/09/2023] [Accepted: 02/19/2023] [Indexed: 06/28/2023]
Abstract
The Covid-19 pandemic is one of the most significant global health concerns that have emerged in this decade. Intelligent healthcare technology and techniques based on speech signal and artificial intelligence make it feasible to provide a faster and more efficient timely detection of Covid-19. The main objective of our study is to design speech signal-based noninvasive, low-cost, remote diagnosis of Covid-19. In this study, we have developed system to detect Covid-19 from speech signal using Mel frequency magnitude coefficients (MFMC) and machine learning techniques. In order to capture higher-order spectral features, the spectrum is divided into a larger number of subbands with narrower bandwidths as MFMC, which leads to better frequency resolution and less overall noise. As a consequence of an improvement in frequency resolution as well as a decrease in the quantity of noise that is included with the extraction of MFMC, the higher-order MFMCs are able to identify Covid-19 from speech signals with an increased level of accuracy. The procedures for machine learning are often less complicated than those for deep learning, and they may commonly be carried out on regular computers. However, deep learning systems need extensive computing power and data storage. Twelve, twenty-four, thirty, and forty spectral coefficients are obtained using MFMC in our study, and from these coefficients, performance is accessed using machine learning classifiers, such as random forests and K-nearest neighbor (KNN); however, KNN has performed better than the other model with having AUC score of 0.80.
Collapse
Affiliation(s)
| | - Anand D. Darji
- Sardar Vallabhbhai National Institute of Technology, Surat, Gujarat India
| | - Prashant K. Shah
- Sardar Vallabhbhai National Institute of Technology, Surat, Gujarat India
| |
Collapse
|
33
|
Gürsoy E, Kaya Y. An overview of deep learning techniques for COVID-19 detection: methods, challenges, and future works. MULTIMEDIA SYSTEMS 2023; 29:1603-1627. [PMID: 37261262 PMCID: PMC10039775 DOI: 10.1007/s00530-023-01083-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 03/20/2023] [Indexed: 06/02/2023]
Abstract
The World Health Organization (WHO) declared a pandemic in response to the coronavirus COVID-19 in 2020, which resulted in numerous deaths worldwide. Although the disease appears to have lost its impact, millions of people have been affected by this virus, and new infections still occur. Identifying COVID-19 requires a reverse transcription-polymerase chain reaction test (RT-PCR) or analysis of medical data. Due to the high cost and time required to scan and analyze medical data, researchers are focusing on using automated computer-aided methods. This review examines the applications of deep learning (DL) and machine learning (ML) in detecting COVID-19 using medical data such as CT scans, X-rays, cough sounds, MRIs, ultrasound, and clinical markers. First, the data preprocessing, the features used, and the current COVID-19 detection methods are divided into two subsections, and the studies are discussed. Second, the reported publicly available datasets, their characteristics, and the potential comparison materials mentioned in the literature are presented. Third, a comprehensive comparison is made by contrasting the similar and different aspects of the studies. Finally, the results, gaps, and limitations are summarized to stimulate the improvement of COVID-19 detection methods, and the study concludes by listing some future research directions for COVID-19 classification.
Collapse
Affiliation(s)
- Ercan Gürsoy
- Department of Computer Engineering, Adana Alparslan Turkes Science and Technology University, 01250 Adana, Turkey
| | - Yasin Kaya
- Department of Computer Engineering, Adana Alparslan Turkes Science and Technology University, 01250 Adana, Turkey
| |
Collapse
|
34
|
Yellapu GD, Rudraraju G, Sripada NR, Mamidgi B, Jalukuru C, Firmal P, Yechuri V, Varanasi S, Peddireddi VS, Bhimarasetty DM, Kanisetti S, Joshi N, Mohapatra P, Pamarthi K. Development and clinical validation of Swaasa AI platform for screening and prioritization of pulmonary TB. Sci Rep 2023; 13:4740. [PMID: 36959347 PMCID: PMC10034902 DOI: 10.1038/s41598-023-31772-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 03/16/2023] [Indexed: 03/25/2023] Open
Abstract
Acoustic signal analysis has been employed in various medical devices. However, studies involving cough sound analysis to screen the potential pulmonary tuberculosis (PTB) suspects are very few. The main objective of this cross-sectional validation study was to develop and validate the Swaasa AI platform to screen and prioritize at risk patients for PTB based on the signature cough sound as well as symptomatic information provided by the subjects. The voluntary cough sound data was collected at Andhra Medical College-India. An Algorithm based on multimodal convolutional neural network architecture and feedforward artificial neural network (tabular features) was built and validated on a total of 567 subjects, comprising 278 positive and 289 negative PTB cases. The output from these two models was combined to detect the likely presence (positive cases) of PTB. In the clinical validation phase, the AI-model was found to be 86.82% accurate in detecting the likely presence of PTB with 90.36% sensitivity and 84.67% specificity. The pilot testing of model was conducted at a peripheral health care centre, RHC Simhachalam-India on 65 presumptive PTB cases. Out of which, 15 subjects truly turned out to be PTB positive with a positive predictive value of 75%. The validation results obtained from the model are quite encouraging. This platform has the potential to fulfil the unmet need of a cost-effective PTB screening method. It works remotely, presents instantaneous results, and does not require a highly trained operator. Therefore, it could be implemented in various inaccessible, resource-poor parts of the world.
Collapse
Affiliation(s)
| | | | | | | | - Charan Jalukuru
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | - Priyanka Firmal
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | - Venkat Yechuri
- Salcit Technologies, Jayabheri Silicon Towers, Hyderabad, India
| | | | | | | | | | - Niranjan Joshi
- Centre for Cellular and Molecular Platforms, Bengaluru, India
| | - Prasant Mohapatra
- Department of Computer Science, University of California, Davis, USA
| | | |
Collapse
|
35
|
Manzella F, Pagliarini G, Sciavicco G, Stan IE. The voice of COVID-19: Breath and cough recording classification with temporal decision trees and random forests. Artif Intell Med 2023; 137:102486. [PMID: 36868683 PMCID: PMC9904537 DOI: 10.1016/j.artmed.2022.102486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 02/05/2023]
Abstract
Symbolic learning is the logic-based approach to machine learning, and its mission is to provide algorithms and methodologies to extract logical information from data and express it in an interpretable way. Interval temporal logic has been recently proposed as a suitable tool for symbolic learning, specifically via the design of an interval temporal logic decision tree extraction algorithm. In order to improve their performances, interval temporal decision trees can be embedded into interval temporal random forests, mimicking the corresponding schema at the propositional level. In this article we consider a dataset of cough and breath sample recordings of volunteer subjects, labeled with their COVID-19 status, originally collected by the University of Cambridge. By interpreting such recordings as multivariate time series, we study the problem of their automated classification using interval temporal decision trees and forests. While this problem has been approached with the same dataset as well as with other datasets, in all cases, non-symbolic learning methods (usually, deep learning-based) have been applied to solve it; in this article we apply a symbolic approach, and show that it does not only outperform the state-of-the-art obtained with the same dataset, but its results are also superior to those of most non-symbolic techniques applied on other datasets. As an added bonus, thanks to the symbolic nature of our approach, we are also able to extract explicit knowledge to help physicians characterize typical COVID-positive cough and breath.
Collapse
Affiliation(s)
- F Manzella
- Department of Mathematics and Computer Science, University of Ferrara, Italy.
| | - G Pagliarini
- Department of Mathematics and Computer Science, University of Ferrara, Italy.
| | - G Sciavicco
- Department of Mathematics and Computer Science, University of Ferrara, Italy.
| | - I E Stan
- Department of Mathematics and Computer Science, University of Ferrara, Italy.
| |
Collapse
|
36
|
Nguyen LH, Pham NT, Do VH, Nguyen LT, Nguyen TT, Nguyen H, Nguyen ND, Nguyen TT, Nguyen SD, Bhatti A, Lim CP. Fruit-CoV: An efficient vision-based framework for speedy detection and diagnosis of SARS-CoV-2 infections through recorded cough sounds. EXPERT SYSTEMS WITH APPLICATIONS 2023; 213:119212. [PMID: 36407848 PMCID: PMC9639421 DOI: 10.1016/j.eswa.2022.119212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 10/20/2022] [Accepted: 11/01/2022] [Indexed: 06/16/2023]
Abstract
COVID-19 is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This deadly virus has spread worldwide, leading to a global pandemic since March 2020. A recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths globally. Therefore, developing an efficient self-testing service for SARS-CoV-2 at home is vital. In this study, a two-stage vision-based framework, namely Fruit-CoV, is introduced for detecting SARS-CoV-2 infections through recorded cough sounds. Specifically, audio signals are converted into Log-Mel spectrograms, and the EfficientNet-V2 network is used to extract their visual features in the first stage. In the second stage, 14 convolutional layers extracted from the large-scale Pretrained Audio Neural Networks for audio pattern recognition (PANNs) and the Wavegram-Log-Mel-CNN are employed to aggregate feature representations of the Log-Mel spectrograms and the waveform. Finally, the combined features are used to train a binary classifier. In this study, a dataset provided by the AICovidVN 115M Challenge is employed for evaluation. It includes 7,371 recorded cough sounds collected throughout Vietnam, India, and Switzerland. Experimental results indicate that the proposed model achieves an Area Under the Receiver Operating Characteristic Curve (AUC) score of 92.8% and ranks first on the final leaderboard of the AICovidVN 115M Challenge. Our code is publicly available.
Collapse
Affiliation(s)
- Long H Nguyen
- Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Nhat Truong Pham
- Division of Computational Mechatronics, Institute for Computational Science, Ton Duc Thang University, Ho Chi Minh City, Vietnam
- Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | | | - Liu Tai Nguyen
- Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Thanh Tin Nguyen
- Human Computer Interaction Lab, Sejong University, Seoul, South Korea
| | - Hai Nguyen
- Khoury College of Computer Sciences, Northeastern University, Boston, USA
| | - Ngoc Duy Nguyen
- Institute for Intelligent Systems Research and Innovation, Deakin University, Victoria, Australia
| | - Thanh Thi Nguyen
- School of Information Technology, Deakin University, Victoria, Australia
| | - Sy Dzung Nguyen
- Laboratory for Computational Mechatronics, Institute for Computational Science and Artificial Intelligence, Van Lang University, Ho Chi Minh City, Vietnam
- Faculty of Mechanical-Electrical and Computer Engineering, Van Lang University, Ho Chi Minh City, Vietnam
| | - Asim Bhatti
- Institute for Intelligent Systems Research and Innovation, Deakin University, Victoria, Australia
| | - Chee Peng Lim
- Institute for Intelligent Systems Research and Innovation, Deakin University, Victoria, Australia
| |
Collapse
|
37
|
Davidson C, Caguana OA, Lozano-García M, Arita Guevara M, Estrada-Petrocelli L, Ferrer-Lluis I, Castillo-Escario Y, Ausín P, Gea J, Jané R. Differences in acoustic features of cough by pneumonia severity in patients with COVID-19: a cross-sectional study. ERJ Open Res 2023; 9:00247-2022. [PMID: 37131524 PMCID: PMC9922471 DOI: 10.1183/23120541.00247-2022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 01/07/2023] [Indexed: 02/05/2023] Open
Abstract
BackgroundAcute respiratory syndrome due to coronavirus 2 (SARS-CoV-2) is characterised by heterogeneous levels of disease severity. It is not necessarily apparent whether a patient will develop a severe disease or not. This cross-sectional study explores whether acoustic properties of the cough sound of patients with coronavirus disease (COVID-19), the illness caused by SARS-CoV-2, correlate with their disease and pneumonia severity, with the aim of identifying patients with a severe disease.MethodsVoluntary cough sounds were recorded using a smartphone in 70 COVID-19 patients within the first 24 h of their hospital arrival, between April 2020 and May 2021. Based on gas exchange abnormalities, patients were classified as mild, moderate, or severe. Time- and frequency-based variables were obtained from each cough effort and analysed using a linear mixed-effects modelling approach.ResultsRecords from 62 patients (37% female) were eligible for inclusion in the analysis, with mild, moderate, and severe groups consisting of 31, 14 and 17 patients respectively. 5 of the parameters examined were found to be significantly different in the cough of patients at different disease levels of severity, with a further 2 parameters found to be affected differently by the disease severity in men and women.ConclusionsWe suggest that all these differences reflect the progressive pathophysiological alterations occurring in the respiratory system of COVID-19 patients, and potentially would provide an easy and cost-effective way to initially stratify patients, identifying those with more severe disease, and thereby most effectively allocate healthcare resources.
Collapse
|
38
|
Li Y, Liu S. The Threat of Adversarial Attack on a COVID-19 CT Image-Based Deep Learning System. Bioengineering (Basel) 2023; 10:bioengineering10020194. [PMID: 36829688 PMCID: PMC9952300 DOI: 10.3390/bioengineering10020194] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 01/15/2023] [Accepted: 01/31/2023] [Indexed: 02/05/2023] Open
Abstract
The coronavirus disease 2019 (COVID-19) rapidly spread around the world, and resulted in a global pandemic. Applying artificial intelligence to COVID-19 research can produce very exciting results. However, most research has focused on applying AI techniques in the study of COVID-19, but has ignored the security and reliability of AI systems. In this paper, we explore adversarial attacks on a deep learning system based on COVID-19 CT images with the aim of helping to address this problem. Firstly, we built a deep learning system that could identify COVID-19 CT images and non-COVID-19 CT images with an average accuracy of 76.27%. Secondly, we attacked the pretrained model with an adversarial attack algorithm, i.e., FGSM, to cause the COVID-19 deep learning system to misclassify the CT images, and the classification accuracy of non-COVID-19 CT images dropped from 80% to 0%. Finally, in response to this attack, we proposed how a more secure and reliable deep learning model based on COVID-19 medical images could be built. This research is based on a COVID-19 CT image recognition system, which studies the security of a COVID-19 CT image-based deep learning system. We hope to draw more researchers' attention to the security and reliability of medical deep learning systems.
Collapse
|
39
|
Jeyananthan P. Role of different types of RNA molecules in the severity prediction of SARS-CoV-2 patients. Pathol Res Pract 2023; 242:154311. [PMID: 36657221 PMCID: PMC9840815 DOI: 10.1016/j.prp.2023.154311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 01/11/2023] [Accepted: 01/14/2023] [Indexed: 01/16/2023]
Abstract
SARS-CoV-2 pandemic is the current threat of the world with enormous number of deceases. As most of the countries have constraints on resources, particularly for intensive care and oxygen, severity prediction with high accuracy is crucial. This prediction will help the medical society in the selection of patients with the need for these constrained resources. Literature shows that using clinical data in this study is the common trend and molecular data is rarely utilized in this prediction. As molecular data carry more disease related information, in this study, three different types of RNA molecules ( lncRNA, miRNA and mRNA) of SARS-COV-2 patients are used to predict the severity stage and treatment stage of those patients. Using seven different machine learning algorithms along with several feature selection techniques shows that in both phenotypes, feature importance selected features provides the best accuracy along with random forest classifier. Further to this, it shows that in the severity stage prediction miRNA and lncRNA give the best performance, and lncRNA data gives the best in treatment stage prediction. As most of the studies related to molecular data uses mRNA data, this is an interesting finding.
Collapse
|
40
|
Zhang J, Wu J, Qiu Y, Song A, Li W, Li X, Liu Y. Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review. Comput Biol Med 2023; 153:106517. [PMID: 36623438 PMCID: PMC9814440 DOI: 10.1016/j.compbiomed.2022.106517] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 12/23/2022] [Accepted: 12/31/2022] [Indexed: 01/07/2023]
Abstract
The growing and aging of the world population have driven the shortage of medical resources in recent years, especially during the COVID-19 pandemic. Fortunately, the rapid development of robotics and artificial intelligence technologies help to adapt to the challenges in the healthcare field. Among them, intelligent speech technology (IST) has served doctors and patients to improve the efficiency of medical behavior and alleviate the medical burden. However, problems like noise interference in complex medical scenarios and pronunciation differences between patients and healthy people hamper the broad application of IST in hospitals. In recent years, technologies such as machine learning have developed rapidly in intelligent speech recognition, which is expected to solve these problems. This paper first introduces IST's procedure and system architecture and analyzes its application in medical scenarios. Secondly, we review existing IST applications in smart hospitals in detail, including electronic medical documentation, disease diagnosis and evaluation, and human-medical equipment interaction. In addition, we elaborate on an application case of IST in the early recognition, diagnosis, rehabilitation training, evaluation, and daily care of stroke patients. Finally, we discuss IST's limitations, challenges, and future directions in the medical field. Furthermore, we propose a novel medical voice analysis system architecture that employs active hardware, active software, and human-computer interaction to realize intelligent and evolvable speech recognition. This comprehensive review and the proposed architecture offer directions for future studies on IST and its applications in smart hospitals.
Collapse
Affiliation(s)
- Jun Zhang
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China,Corresponding author
| | - Jingyue Wu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Yiyi Qiu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Aiguo Song
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Weifeng Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Xin Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Yecheng Liu
- Emergency Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
| |
Collapse
|
41
|
Peng P, Jiang K, You M, Xie J, Zhou H, Xu W, Lu J, Li X, Xu Y. Design of an Efficient CNN-based Cough Detection System on Lightweight FPGA. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2023; PP:116-128. [PMID: 37018680 DOI: 10.1109/tbcas.2023.3236976] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Precisely and automatically detecting the cough sound is of vital clinical importance. Nevertheless, due to privacy protection considerations, transmitting the raw audio data to the cloud is not permitted, and therefore there is a great demand for an efficient, accurate, and low-cost solution at the edge device. To address this challenge, we propose a semi-custom software-hardware co-design methodology to help build the cough detection system. Specifically, we first design a scalable and compact convolutional neural network (CNN) structure that generates many network instances. Second, we develop a dedicated hardware accelerator to perform the inference computation efficiently, and then we find the optimal network instance by applying network design space exploration. Finally, we compile the optimal network and let it run on the hardware accelerator. The experimental results demonstrate that our model achieves 88.8% classification accuracy, 91.2% sensitivity, 86.5% specificity, and 86.5% precision, while the computation complexity is only 1.09M multiply-accumulation (MAC). Additionally, when implemented on a lightweight field programmable gate array (FPGA), the complete cough detection system only occupies 7.9K lookup tables (LUTs), 12.9K flip-flops (FFs), and 41 digital signal processing (DSP) slices, providing 8.3 GOP/s actual inference throughput and total power dissipation of 0.93 W. This framework meets the needs of partial application and can be easily extended or integrated into other healthcare applications.
Collapse
|
42
|
Isgut M, Gloster L, Choi K, Venugopalan J, Wang MD. Systematic Review of Advanced AI Methods for Improving Healthcare Data Quality in Post COVID-19 Era. IEEE Rev Biomed Eng 2023; 16:53-69. [PMID: 36269930 DOI: 10.1109/rbme.2022.3216531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
At the beginning of the COVID-19 pandemic, there was significant hype about the potential impact of artificial intelligence (AI) tools in combatting COVID-19 on diagnosis, prognosis, or surveillance. However, AI tools have not yet been widely successful. One of the key reason is the COVID-19 pandemic has demanded faster real-time development of AI-driven clinical and health support tools, including rapid data collection, algorithm development, validation, and deployment. However, there was not enough time for proper data quality control. Learning from the hard lessons in COVID-19, we summarize the important health data quality challenges during COVID-19 pandemic such as lack of data standardization, missing data, tabulation errors, and noise and artifact. Then we conduct a systematic investigation of computational methods that address these issues, including emerging novel advanced AI data quality control methods that achieve better data quality outcomes and, in some cases, simplify or automate the data cleaning process. We hope this article can assist healthcare community to improve health data quality going forward with novel AI development.
Collapse
|
43
|
Hasan MM, Islam MU, Sadeq MJ, Fung WK, Uddin J. Review on the Evaluation and Development of Artificial Intelligence for COVID-19 Containment. SENSORS (BASEL, SWITZERLAND) 2023; 23:527. [PMID: 36617124 PMCID: PMC9824505 DOI: 10.3390/s23010527] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 12/23/2022] [Accepted: 12/29/2022] [Indexed: 06/17/2023]
Abstract
Artificial intelligence has significantly enhanced the research paradigm and spectrum with a substantiated promise of continuous applicability in the real world domain. Artificial intelligence, the driving force of the current technological revolution, has been used in many frontiers, including education, security, gaming, finance, robotics, autonomous systems, entertainment, and most importantly the healthcare sector. With the rise of the COVID-19 pandemic, several prediction and detection methods using artificial intelligence have been employed to understand, forecast, handle, and curtail the ensuing threats. In this study, the most recent related publications, methodologies and medical reports were investigated with the purpose of studying artificial intelligence's role in the pandemic. This study presents a comprehensive review of artificial intelligence with specific attention to machine learning, deep learning, image processing, object detection, image segmentation, and few-shot learning studies that were utilized in several tasks related to COVID-19. In particular, genetic analysis, medical image analysis, clinical data analysis, sound analysis, biomedical data classification, socio-demographic data analysis, anomaly detection, health monitoring, personal protective equipment (PPE) observation, social control, and COVID-19 patients' mortality risk approaches were used in this study to forecast the threatening factors of COVID-19. This study demonstrates that artificial-intelligence-based algorithms integrated into Internet of Things wearable devices were quite effective and efficient in COVID-19 detection and forecasting insights which were actionable through wide usage. The results produced by the study prove that artificial intelligence is a promising arena of research that can be applied for disease prognosis, disease forecasting, drug discovery, and to the development of the healthcare sector on a global scale. We prove that artificial intelligence indeed played a significantly important role in helping to fight against COVID-19, and the insightful knowledge provided here could be extremely beneficial for practitioners and research experts in the healthcare domain to implement the artificial-intelligence-based systems in curbing the next pandemic or healthcare disaster.
Collapse
Affiliation(s)
- Md. Mahadi Hasan
- Department of Computer Science and Engineering, Asian University of Bangladesh, Ashulia 1349, Bangladesh
| | - Muhammad Usama Islam
- School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette, LA 70504, USA
| | - Muhammad Jafar Sadeq
- Department of Computer Science and Engineering, Asian University of Bangladesh, Ashulia 1349, Bangladesh
| | - Wai-Keung Fung
- Department of Applied Computing and Engineering, Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff CF5 2YB, UK
| | - Jasim Uddin
- Department of Applied Computing and Engineering, Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff CF5 2YB, UK
| |
Collapse
|
44
|
Jeyananthan P. SARS-CoV-2 Diagnosis Using Transcriptome Data: A Machine Learning Approach. SN COMPUTER SCIENCE 2023; 4:218. [PMID: 36844504 PMCID: PMC9936926 DOI: 10.1007/s42979-023-01703-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 01/24/2023] [Indexed: 05/02/2023]
Abstract
SARS-CoV-2 pandemic is the big issue of the whole world right now. The health community is struggling to rescue the public and countries from this spread, which revives time to time with different waves. Even the vaccination seems to be not prevents this spread. Accurate identification of infected people on time is essential these days to control the spread. So far, Polymerase chain reaction (PCR) and rapid antigen tests are widely used in this identification, accepting their own drawbacks. False negative cases are the menaces in this scenario. To avoid these problems, this study uses machine learning techniques to build a classification model with higher accuracy to filter the COVID-19 cases from the non-COVID individuals. Transcriptome data of the SARS-CoV-2 patients along with the control are used in this stratification using three different feature selection algorithms and seven classification models. Differently expressed genes also studied between these two groups of people and used in this classification. Results shows that mutual information (or DEGs) along with naïve Bayes (or SVM) gives the best accuracy (0.98 ± 0.04) among these methods. Supplementary Information The online version contains supplementary material available at 10.1007/s42979-023-01703-6.
Collapse
|
45
|
Kuluozturk M, Kobat MA, Barua PD, Dogan S, Tuncer T, Tan RS, Ciaccio EJ, Acharya UR. DKPNet41: Directed knight pattern network-based cough sound classification model for automatic disease diagnosis. Med Eng Phys 2022; 110:103870. [PMID: 35989223 PMCID: PMC9356574 DOI: 10.1016/j.medengphy.2022.103870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 08/03/2022] [Accepted: 08/05/2022] [Indexed: 01/18/2023]
Abstract
PROBLEM Cough-based disease detection is a hot research topic for machine learning, and much research has been published on the automatic detection of Covid-19. However, these studies are useful for the diagnosis of different diseases. AIM In this work, we collected a new and large (n=642 subjects) cough sound dataset comprising four diagnostic categories: 'Covid-19', 'heart failure', 'acute asthma', and 'healthy', and used it to train, validate, and test a novel model designed for automatic detection. METHOD The model consists of four main components: novel feature generation based on a specifically directed knight pattern (DKP), signal decomposition using four pooling methods, feature selection using iterative neighborhood analysis (INCA), and classification using the k-nearest neighbor (kNN) classifier with ten-fold cross-validation. Multilevel multiple pooling decomposition combined with DKP yielded 41 feature vectors (40 extracted plus one original cough sound). From these, the ten best feature vectors were selected. Based on each vector's misclassification rate, redundant feature vectors were eliminated and then merged. The merged vector's most informative features automatically selected using INCA were input to a standard kNN classifier. RESULTS The model, called DKPNet41, attained a high accuracy of 99.39% for cough sound-based multiclass classification of the four categories. CONCLUSIONS The results obtained in the study showed that the DKPNet41 model automatically and efficiently classifies cough sounds for disease diagnosis.
Collapse
Affiliation(s)
- Mutlu Kuluozturk
- Department of Pulmonology, Firat University Hospital, Elazig, Turkey
| | - Mehmet Ali Kobat
- Department of Cardiology, Firat University Hospital, Elazig, Turkey
| | - Prabal Datta Barua
- School of Management & Enterprise, University of Southern Queensland, Australia; Faculty of Engineering and Information Technology, University of Technology Sydney, Australia
| | - Sengul Dogan
- Department of Digital Forensics Engineering, College of Technology, Firat University, Elazig, Turkey.
| | - Turker Tuncer
- Department of Digital Forensics Engineering, College of Technology, Firat University, Elazig, Turkey
| | - Ru-San Tan
- Department of Cardiology, National Heart Centre Singapore, Singapore; Duke-NUS Medical School, Singapore
| | - Edward J Ciaccio
- Department of Medicine, Columbia University Irving Medical Center, USA
| | - U Rajendra Acharya
- Ngee Ann Polytechnic, Department of Electronics and Computer Engineering, 599489, Singapore; Department of Biomedical Engineering, School of Science and Technology, SUSS University, Singapore; Department of Biomedical Informatics and Medical Engineering, Asia University, Taichung, Taiwan
| |
Collapse
|
46
|
Zhang M, Sykes DL, Brindle K, Sadofsky LR, Morice AH. Chronic cough-the limitation and advances in assessment techniques. J Thorac Dis 2022; 14:5097-5119. [PMID: 36647459 PMCID: PMC9840016 DOI: 10.21037/jtd-22-874] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 11/04/2022] [Indexed: 11/23/2022]
Abstract
Accurate and consistent assessments of cough are essential to advance the understanding of the mechanisms of cough and individualised the management of patients. Considerable progress has been made in this work. Here we reviewed the currently available tools for subjectively and objectively measuring both cough sensitivity and severity. We also provided some opinions on the new techniques and future directions. The simple and practical Visual Analogue Scale (VAS), the Leicester Cough Questionnaire (LCQ), and the Cough Specific Quality of Life Questionnaire (CQLQ) are the most widely used self-reported questionnaires for evaluating and quantifying cough severity. The Hull Airway Reflux Questionnaire (HARQ) is a tool to elucidate the constellation of symptoms underlying the diagnosis of chronic cough. Chemical excitation tests are widely used to explore the pathophysiological mechanisms of the cough reflex, such as capsaicin, citric acid and adenosine triphosphate (ATP) challenge test. Cough frequency is an ideal primary endpoint for clinical research, but the application of cough counters has been limited in clinical practice by the high cost and reliance on aural validation. The ongoing development of cough detection technology for smartphone apps and wearable devices will hopefully simplify cough counting, thus transitioning it from niche research to a widely available clinical application.
Collapse
Affiliation(s)
- Mengru Zhang
- Centre for Clinical Science, Respiratory Medicine, Hull York Medical School, University of Hull, Castle Hill Hospital, Cottingham, East Yorkshire, UK;,Department of Pulmonary and Critical Care Medicine, Tongji Hospital, Tongji University School of Medicine, Shanghai, China
| | - Dominic L. Sykes
- Centre for Clinical Science, Respiratory Medicine, Hull York Medical School, University of Hull, Castle Hill Hospital, Cottingham, East Yorkshire, UK
| | - Kayleigh Brindle
- Centre for Clinical Science, Respiratory Medicine, Hull York Medical School, University of Hull, Castle Hill Hospital, Cottingham, East Yorkshire, UK
| | - Laura R. Sadofsky
- Centre for Clinical Science, Respiratory Medicine, Hull York Medical School, University of Hull, Castle Hill Hospital, Cottingham, East Yorkshire, UK
| | - Alyn H. Morice
- Centre for Clinical Science, Respiratory Medicine, Hull York Medical School, University of Hull, Castle Hill Hospital, Cottingham, East Yorkshire, UK
| |
Collapse
|
47
|
Xia T, Han J, Mascolo C. Exploring machine learning for audio-based respiratory condition screening: A concise review of databases, methods, and open issues. Exp Biol Med (Maywood) 2022; 247:2053-2061. [PMID: 35974706 PMCID: PMC9791302 DOI: 10.1177/15353702221115428] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Auscultation plays an important role in the clinic, and the research community has been exploring machine learning (ML) to enable remote and automatic auscultation for respiratory condition screening via sounds. To give the big picture of what is going on in this field, in this narrative review, we describe publicly available audio databases that can be used for experiments, illustrate the developed ML methods proposed to date, and flag some under-considered issues which still need attention. Compared to existing surveys on the topic, we cover the latest literature, especially those audio-based COVID-19 detection studies which have gained extensive attention in the last two years. This work can help to facilitate the application of artificial intelligence in the respiratory auscultation field.
Collapse
|
48
|
A classification framework for identifying bronchitis and pneumonia in children based on a small-scale cough sounds dataset. PLoS One 2022; 17:e0275479. [PMID: 36301797 PMCID: PMC9612535 DOI: 10.1371/journal.pone.0275479] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 09/18/2022] [Indexed: 11/07/2022] Open
Abstract
Bronchitis and pneumonia are the common respiratory diseases, of which pneumonia is the leading cause of mortality in pediatric patients worldwide and impose intense pressure on health care systems. This study aims to classify bronchitis and pneumonia in children by analyzing cough sounds. We propose a Classification Framework based on Cough Sounds (CFCS) to identify bronchitis and pneumonia in children. Our dataset includes cough sounds from 173 outpatients at the West China Second University Hospital, Sichuan University, Chengdu, China. We adopt aggregation operation to obtain patients’ disease features because some cough chunks carry the disease information while others do not. In the stage of classification in our framework, we adopt Support Vector Machine (SVM) to classify the diseases due to the small scale of our dataset. Furthermore, we apply data augmentation to our dataset to enlarge the number of samples and then adopt Long Short-Term Memory Network (LSTM) to classify. After 45 random tests on RAW dataset, SVM achieves the best classification accuracy of 86.04% and standard deviation of 4.7%. The precision of bronchitis and pneumonia is 93.75% and 87.5%, and their recall is 88.24% and 93.33%. The AUC of SVM and LSTM classification models on the dataset with pitch-shifting data augmentation reach 0.92 and 0.93, respectively. Extensive experimental results show that CFCS can effectively classify children into bronchitis and pneumonia.
Collapse
|
49
|
Aleixandre JG, Elgendi M, Menon C. The Use of Audio Signals for Detecting COVID-19: A Systematic Review. SENSORS (BASEL, SWITZERLAND) 2022; 22:8114. [PMID: 36365811 PMCID: PMC9653621 DOI: 10.3390/s22218114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/17/2022] [Accepted: 10/20/2022] [Indexed: 06/16/2023]
Abstract
A systematic review on the topic of automatic detection of COVID-19 using audio signals was performed. A total of 48 papers were obtained after screening 659 records identified in the PubMed, IEEE Xplore, Embase, and Google Scholar databases. The reviewed studies employ a mixture of open-access and self-collected datasets. Because COVID-19 has only recently been investigated, there is a limited amount of available data. Most of the data are crowdsourced, which motivated a detailed study of the various pre-processing techniques used by the reviewed studies. Although 13 of the 48 identified papers show promising results, several have been performed with small-scale datasets (<200). Among those papers, convolutional neural networks and support vector machine algorithms were the best-performing methods. The analysis of the extracted features showed that Mel-frequency cepstral coefficients and zero-crossing rate continue to be the most popular choices. Less common alternatives, such as non-linear features, have also been proven to be effective. The reported values for sensitivity range from 65.0% to 99.8% and those for accuracy from 59.0% to 99.8%.
Collapse
Affiliation(s)
- José Gómez Aleixandre
- Biomedical and Mobile Health Technology Lab, ETH Zurich, 8008 Zurich, Switzerland
- Department of Physics, ETH Zurich, 8093 Zurich, Switzerland
| | - Mohamed Elgendi
- Biomedical and Mobile Health Technology Lab, ETH Zurich, 8008 Zurich, Switzerland
| | - Carlo Menon
- Biomedical and Mobile Health Technology Lab, ETH Zurich, 8008 Zurich, Switzerland
| |
Collapse
|
50
|
Alsaaidah B, Al-Hadidi MR, Al-Nsour H, Masadeh R, AlZubi N. Comprehensive Survey of Machine Learning Systems for COVID-19 Detection. J Imaging 2022; 8:267. [PMID: 36286361 PMCID: PMC9604704 DOI: 10.3390/jimaging8100267] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 09/11/2022] [Accepted: 09/20/2022] [Indexed: 01/14/2023] Open
Abstract
The last two years are considered the most crucial and critical period of the COVID-19 pandemic affecting most life aspects worldwide. This virus spreads quickly within a short period, increasing the fatality rate associated with the virus. From a clinical perspective, several diagnosis methods are carried out for early detection to avoid virus propagation. However, the capabilities of these methods are limited and have various associated challenges. Consequently, many studies have been performed for COVID-19 automated detection without involving manual intervention and allowing an accurate and fast decision. As is the case with other diseases and medical issues, Artificial Intelligence (AI) provides the medical community with potential technical solutions that help doctors and radiologists diagnose based on chest images. In this paper, a comprehensive review of the mentioned AI-based detection solution proposals is conducted. More than 200 papers are reviewed and analyzed, and 145 articles have been extensively examined to specify the proposed AI mechanisms with chest medical images. A comprehensive examination of the associated advantages and shortcomings is illustrated and summarized. Several findings are concluded as a result of a deep analysis of all the previous works using machine learning for COVID-19 detection, segmentation, and classification.
Collapse
Affiliation(s)
- Bayan Alsaaidah
- Department of Computer Science, Prince Abdullah bin Ghazi Faculty of Information Technology and Communications, Al-Balqa Applied University, Salt 19117, Jordan
| | - Moh’d Rasoul Al-Hadidi
- Department of Electrical Engineering, Electrical Power Engineering and Computer Engineering, Faculty of Engineering, Al-Balqa Applied University, Salt 19117, Jordan
| | - Heba Al-Nsour
- Department of Computer Science, Prince Abdullah bin Ghazi Faculty of Information Technology and Communications, Al-Balqa Applied University, Salt 19117, Jordan
| | - Raja Masadeh
- Computer Science Department, The World Islamic Sciences and Education University, Amman 11947, Jordan
| | - Nael AlZubi
- Department of Electrical Engineering, Electrical Power Engineering and Computer Engineering, Faculty of Engineering, Al-Balqa Applied University, Salt 19117, Jordan
| |
Collapse
|