1
Shin B, Lee SH, Kwon K, Lee YJ, Crispe N, Ahn SY, Shelly S, Sundholm N, Tkaczuk A, Yeo MK, Choo HJ, Yeo WH. Automatic Clinical Assessment of Swallowing Behavior and Diagnosis of Silent Aspiration Using Wireless Multimodal Wearable Electronics. Adv Sci (Weinh) 2024; 11:e2404211. [PMID: 38981027 PMCID: PMC11425633 DOI: 10.1002/advs.202404211] [Received: 04/20/2024] [Revised: 06/21/2024] [Indexed: 07/11/2024]
Abstract
Dysphagia is common in conditions such as stroke, Parkinson's disease, and head and neck cancer, and can lead to pneumonia, choking, malnutrition, and dehydration. Currently, the diagnostic gold standard is radiologic imaging, the videofluoroscopic swallow study (VFSS); however, it is expensive and requires specialized facilities and trained personnel. Although several devices attempt to address these limitations, none offer the clinical-grade quality and accuracy of the VFSS. Here, this study reports a wireless multimodal wearable system with machine learning for automatic, accurate clinical assessment of swallowing behavior and diagnosis of silent aspiration in dysphagia patients. The device includes a kirigami-structured electrode that suppresses changes in skin contact impedance caused by movements and a microphone with a gel layer that effectively blocks external noise, allowing high-quality electromyograms and swallowing sounds to be measured. The deep learning algorithm classifies swallowing patterns while diagnosing silent aspiration, with an accuracy of 89.47%. A demonstration with post-stroke patients shows the system's ability to measure multiple physiological signals in real time for detecting swallowing disorders, validated by comparison with the VFSS. The multimodal electronics point to a promising future for dysphagia healthcare and rehabilitation therapy, providing an accurate, non-invasive alternative for monitoring swallowing and aspiration events.
Affiliation(s)
- Beomjune Shin
  - George W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - Wearable Intelligent Systems and Healthcare Center (WISH Center), Institute for Matter and Systems, Georgia Institute of Technology, Atlanta, GA, 30332, USA
- Sung Hoon Lee
  - Wearable Intelligent Systems and Healthcare Center (WISH Center), Institute for Matter and Systems, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
- Kangkyu Kwon
  - Wearable Intelligent Systems and Healthcare Center (WISH Center), Institute for Matter and Systems, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
- Yoon Jae Lee
  - Wearable Intelligent Systems and Healthcare Center (WISH Center), Institute for Matter and Systems, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
- Nikita Crispe
  - Wearable Intelligent Systems and Healthcare Center (WISH Center), Institute for Matter and Systems, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University School of Medicine, Atlanta, GA, 30332, USA
- So-Young Ahn
  - Department of Rehabilitation Medicine, Chungnam National University School of Medicine, Daejeon, 35015, Republic of Korea
- Sandeep Shelly
  - Department of Otolaryngology-Head and Neck Surgery, School of Medicine, Emory University, Atlanta, GA, 30322, USA
- Nathaniel Sundholm
  - Department of Otolaryngology-Head and Neck Surgery, School of Medicine, Emory University, Atlanta, GA, 30322, USA
- Andrew Tkaczuk
  - Department of Otolaryngology-Head and Neck Surgery, School of Medicine, Emory University, Atlanta, GA, 30322, USA
- Min-Kyung Yeo
  - Department of Pathology, Chungnam National University School of Medicine, Daejeon, 35015, Republic of Korea
- Hyojung J Choo
  - Department of Cell Biology, School of Medicine, Emory University, Atlanta, GA, 30322, USA
- Woon-Hong Yeo
  - George W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - Wearable Intelligent Systems and Healthcare Center (WISH Center), Institute for Matter and Systems, Georgia Institute of Technology, Atlanta, GA, 30332, USA
  - Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University School of Medicine, Atlanta, GA, 30332, USA
  - Parker H. Petit Institute for Bioengineering and Biosciences, Institute for Robotics and Intelligent Machines, Georgia Institute of Technology, Atlanta, GA, 30332, USA
2
Heo S, Uhm KE, Yuk D, Kwon BM, Yoo B, Kim J, Lee J. Deep learning approach for dysphagia detection by syllable-based speech analysis with daily conversations. Sci Rep 2024; 14:20270. [PMID: 39217249 PMCID: PMC11365951 DOI: 10.1038/s41598-024-70774-z] [Received: 03/22/2024] [Accepted: 08/21/2024] [Indexed: 09/04/2024]
Abstract
Dysphagia, a disorder affecting the ability to swallow, has a high prevalence among older adults and can lead to serious health complications; early detection is therefore important. This study evaluated the effectiveness of a newly developed deep learning model that analyzes syllable-segmented data for diagnosing dysphagia, an aspect not addressed in prior studies. Audio data of daily conversations were collected from 16 patients with dysphagia and 24 controls. The presence of dysphagia was determined by videofluoroscopic swallowing study. The data were segmented into syllables using a speech-to-text model and analyzed with a convolutional neural network to perform binary classification between the dysphagia patients and the control group. The proposed model was assessed in two aspects. First, with syllable-segmented analysis, it demonstrated a diagnostic accuracy of 0.794 for dysphagia, a sensitivity of 0.901, a specificity of 0.687, a positive predictive value of 0.742, and a negative predictive value of 0.874. Second, at the individual level, it achieved an overall accuracy of 0.900 and an area under the curve of 0.953. This research highlights the potential of the deep learning model as an early, non-invasive, and simple method for detecting dysphagia in everyday environments.
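The clip-level metrics reported above (sensitivity, specificity, positive and negative predictive values) all follow directly from a binary confusion matrix. A minimal sketch of how such metrics are derived; the counts below are illustrative, not the study's data:

```python
def binary_metrics(tp, fp, tn, fn):
    """Derive standard screening metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)          # true positive rate (recall)
    specificity = tn / (tn + fp)          # true negative rate
    ppv = tp / (tp + fp)                  # positive predictive value
    npv = tn / (tn + fn)                  # negative predictive value
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return {"sensitivity": sensitivity, "specificity": specificity,
            "ppv": ppv, "npv": npv, "accuracy": accuracy}

# Illustrative counts only (not taken from the study)
m = binary_metrics(tp=90, fp=30, tn=70, fn=10)
print(m["sensitivity"])  # 0.9
```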
Affiliation(s)
- Seokhyeon Heo
  - Department of Rehabilitation Medicine, Konkuk University Medical Center, 120-1 Neungdong-ro, Gwangjin-gu, Seoul, 05030, Republic of Korea
- Kyeong Eun Uhm
  - Department of Rehabilitation Medicine, Konkuk University Medical Center, 120-1 Neungdong-ro, Gwangjin-gu, Seoul, 05030, Republic of Korea
- Doyoung Yuk
  - Department of Rehabilitation Medicine, Konkuk University Medical Center, 120-1 Neungdong-ro, Gwangjin-gu, Seoul, 05030, Republic of Korea
- Bo Mi Kwon
  - Department of Rehabilitation Medicine, Konkuk University Medical Center, 120-1 Neungdong-ro, Gwangjin-gu, Seoul, 05030, Republic of Korea
- Byounghyun Yoo
  - Center for Artificial Intelligence, Korea Institute of Science and Technology, 5 Hwarangro14-gil, Seongbuk-gu, Seoul, 02792, Republic of Korea
- Jisoo Kim
  - Faculty of Software Major in Artificial Intelligence, Jeju National University, 102 Jejudaehak-ro, Jeju-si, 63243, Republic of Korea
- Jongmin Lee
  - Department of Rehabilitation Medicine, Konkuk University Medical Center, 120-1 Neungdong-ro, Gwangjin-gu, Seoul, 05030, Republic of Korea
3
Kim JM, Kim MS, Choi SY, Lee K, Ryu JS. A deep learning approach to dysphagia-aspiration detecting algorithm through pre- and post-swallowing voice changes. Front Bioeng Biotechnol 2024; 12:1433087. [PMID: 39157445 PMCID: PMC11327512 DOI: 10.3389/fbioe.2024.1433087] [Received: 05/15/2024] [Accepted: 07/16/2024] [Indexed: 08/20/2024]
Abstract
Introduction: This study aimed to identify differences in voice characteristics and changes between patients with dysphagia-aspiration and healthy individuals using a deep learning model, with a focus on the under-researched area of pre- and post-swallowing voice changes in patients with dysphagia. We hypothesized that these variations may be due to weakened muscles and blocked airways in patients with dysphagia.
Methods: A prospective cohort study was conducted on 198 participants aged >40 years at Seoul National University Bundang Hospital from October 2021 to February 2023. Pre- and post-swallowing voice data were converted to 64-kbps mp3 format, and all voice data were trimmed to a length of 2 s. The data were divided for 10-fold cross-validation and stored in HDF5 format with anonymized IDs and labels for the normal and aspiration groups. During preprocessing, the data were converted to Mel spectrograms, and the EfficientAT model was modified using the final layer of MobileNetV3 to effectively detect voice changes and analyze pre- and post-swallowing voices. This enabled the model to probabilistically categorize new patient voices as normal or aspirated.
Results: Area under the receiver operating characteristic curve (AUC) values were analyzed across sexes under different configurations. The average AUC values for males ranged from 0.8117 to 0.8319, with the best performance achieved at a learning rate of 3.00e-5 and a batch size of 16. The average AUC values for females improved from 0.6975 to 0.7331, with the best performance observed at a learning rate of 5.00e-5 and a batch size of 32. As there were fewer female participants, a combined model was developed to maintain the sex balance. In the combined model, the average AUC values ranged from 0.7746 to 0.7997, and optimal performance was achieved at a learning rate of 3.00e-5 and a batch size of 16.
Conclusion: This study evaluated a voice analysis-based program to detect pre- and post-swallowing changes in patients with dysphagia, potentially aiding real-time monitoring. Such a system can give healthcare professionals daily insight into patients' conditions, allowing personalized interventions.
Clinical Trial Registration: ClinicalTrials.gov, identifier NCT05149976.
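The preprocessing described above trims every recording to 2 s before Mel-spectrogram conversion, so all model inputs share one shape. A minimal sketch of that length normalization; the 16 kHz sampling rate is our assumption, and in practice a library such as librosa or torchaudio would handle both the audio I/O and the Mel conversion:

```python
SAMPLE_RATE = 16_000    # assumed sampling rate; the paper does not state one
TARGET_SECONDS = 2.0    # clips were trimmed to 2 s before spectrogram conversion

def fix_length(samples, sr=SAMPLE_RATE, seconds=TARGET_SECONDS):
    """Trim or zero-pad a mono sample list to exactly `seconds` of audio,
    so every clip yields a Mel spectrogram of identical shape."""
    target = int(sr * seconds)
    if len(samples) >= target:
        return samples[:target]                        # trim the tail
    return samples + [0.0] * (target - len(samples))   # zero-pad the tail

long_clip = fix_length([0.0] * 40_000)   # 2.5 s at 16 kHz -> trimmed
short_clip = fix_length([0.0] * 8_000)   # 0.5 s -> zero-padded
print(len(long_clip), len(short_clip))   # 32000 32000
```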
Affiliation(s)
- Jung-Min Kim
  - Department of Health Science and Technology, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, Republic of Korea
  - Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
- Min-Seop Kim
  - Department of Multimedia Engineering, Dongguk University, Seoul, Republic of Korea
- Sun-Young Choi
  - Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
- Kyogu Lee
  - Music and Audio Research Group, Department of Intelligence and Information, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, Republic of Korea
- Ju Seok Ryu
  - Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
  - Seoul National University College of Medicine, Seoul, Republic of Korea
4
Kim JM, Kim MS, Choi SY, Ryu JS. Prediction of dysphagia aspiration through machine learning-based analysis of patients' postprandial voices. J Neuroeng Rehabil 2024; 21:43. [PMID: 38555417 PMCID: PMC10981344 DOI: 10.1186/s12984-024-01329-6] [Received: 08/25/2023] [Accepted: 02/26/2024] [Indexed: 04/02/2024]
Abstract
BACKGROUND: Conventional diagnostic methods for dysphagia have limitations such as long wait times, radiation risks, and restricted evaluation. Voice-based diagnostic and monitoring technologies are therefore required to overcome these limitations. Based on our hypothesis regarding the impact of weakened muscle strength and the presence of aspiration on vocal characteristics, this single-center, prospective study aimed to develop a machine-learning algorithm for predicting dysphagia status (normal or aspiration) by analyzing postprandial voice, with intake limited to 3 cc.
METHODS: Conducted from September 2021 to February 2023 at Seoul National University Bundang Hospital, this prospective cohort study included 198 participants aged 40 or older: 128 without suspected dysphagia and 70 with dysphagia-aspiration. Voice data from participants were collected and used to develop dysphagia prediction models using a multi-layer perceptron (MLP) with MobileNetV3. Male-only, female-only, and combined models were constructed using 10-fold cross-validation. Through the inference process, we established a model capable of probabilistically categorizing a new patient's voice as either normal or indicating possible aspiration.
RESULTS: The pre-trained models (mn40_as and mn30_as) outperformed the non-pre-trained models (mn4.0 and mn3.0). Overall, the best-performing model, mn30_as (pre-trained), demonstrated the following average AUC across 10 folds: combined model 0.8361 (95% CI 0.7667-0.9056; max 0.9541), male model 0.8010 (95% CI 0.6589-0.9432; max 1.000), and female model 0.7572 (95% CI 0.6578-0.8567; max 0.9779). For the female model, however, a slightly higher result was observed with mn4.0, which scored 0.7679 (95% CI 0.6426-0.8931; max 0.9722). The other models (pre-trained: mn40_as; non-pre-trained: mn4.0 and mn3.0) also achieved performance above 0.7 in most cases, and the highest fold-level performance for most models was approximately 0.9. The 'mn' in model names refers to MobileNet, and the following number indicates the 'width_mult' parameter.
CONCLUSIONS: In this study, we used Mel-spectrogram analysis and a MobileNetV3 model to predict dysphagia aspiration. Our research highlights the potential of voice analysis in dysphagia screening, diagnosis, and monitoring, aiming for non-invasive, safer, and more effective interventions.
TRIAL REGISTRATION: This study was approved by the IRB (No. B-2109-707-303) and registered on clinicaltrials.gov (ID: NCT05149976).
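Each model summary above is a mean AUC over 10 cross-validation folds with a 95% confidence interval. One common way to compute such an interval is the normal approximation, mean plus or minus 1.96 times the standard error across folds; a minimal sketch under that assumption (the fold values are illustrative, and the paper may have computed its intervals differently):

```python
import statistics

def mean_auc_ci(fold_aucs, z=1.96):
    """Mean cross-validation AUC with a normal-approximation 95% CI."""
    mean = statistics.mean(fold_aucs)
    se = statistics.stdev(fold_aucs) / len(fold_aucs) ** 0.5  # standard error
    return mean, (mean - z * se, mean + z * se)

# Illustrative fold-level AUCs, not the study's values
folds = [0.80, 0.85, 0.78, 0.90, 0.82, 0.88, 0.79, 0.86, 0.84, 0.81]
mean, (lo, hi) = mean_auc_ci(folds)
print(round(mean, 3), round(lo, 3), round(hi, 3))
```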
Affiliation(s)
- Jung-Min Kim
  - Department of Health Science and Technology, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, South Korea
  - Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seongnam, South Korea
- Min-Seop Kim
  - Department of Multimedia Engineering, Dongguk University, Seoul, South Korea
- Sun-Young Choi
  - Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seongnam, South Korea
- Ju Seok Ryu
  - Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seongnam, South Korea
  - Seoul National University College of Medicine, 82 Gumi-Ro 173 Beon-Gil, Bundang-Gu, Seongnam-Si, Gyeonggi-Do, 13620, South Korea
5
Saab R, Balachandar A, Mahdi H, Nashnoush E, Perri LX, Waldron AL, Sadeghian A, Rubenfeld G, Crowley M, Boulos MI, Murray BJ, Khosravani H. Machine-learning assisted swallowing assessment: a deep learning-based quality improvement tool to screen for post-stroke dysphagia. Front Neurosci 2023; 17:1302132. [PMID: 38130696 PMCID: PMC10734030 DOI: 10.3389/fnins.2023.1302132] [Received: 09/26/2023] [Accepted: 11/07/2023] [Indexed: 12/23/2023]
Abstract
Introduction: Post-stroke dysphagia is common and associated with significant morbidity and mortality, making bedside screening clinically important. Using voice as a biomarker coupled with deep learning has the potential to improve patient access to screening and mitigate the subjectivity of detecting voice change, a component of several validated screening protocols.
Methods: In this single-center study, we developed a proof-of-concept model for automated dysphagia screening and evaluated its performance on training and testing cohorts. Patients admitted to a comprehensive stroke center who were primary English speakers and could follow commands without significant aphasia participated on a rolling basis. The primary outcome was classification as a pass or fail equivalent, using a dysphagia screening test as the label. Voice data were recorded from patients who spoke a standardized set of vowels, words, and sentences from the National Institutes of Health Stroke Scale. Seventy patients were recruited and 68 were included in the analysis, with 40 and 28 in the training and testing cohorts, respectively. Speech was segmented into 1,579 audio clips, from which 6,655 Mel-spectrogram images were computed and used as inputs for deep-learning models (DenseNet and ConvNext, separately and together). Clip-level and participant-level swallowing status predictions were obtained through a voting method.
Results: The models demonstrated clip-level dysphagia screening sensitivity of 71% and specificity of 77% (F1 = 0.73, AUC = 0.80 [95% CI: 0.78-0.82]). At the participant level, sensitivity and specificity were 89% and 79%, respectively (F1 = 0.81, AUC = 0.91 [95% CI: 0.77-1.05]).
Discussion: This study is the first to demonstrate the feasibility of applying deep learning to classify vocalizations to detect post-stroke dysphagia. Our findings suggest potential for enhancing dysphagia screening in clinical settings. Code is available at https://github.com/UofTNeurology/masa-open-source.
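Participant-level predictions in this design are aggregated from clip-level predictions by a voting method. A minimal majority-vote sketch; the function name and the tie-breaking rule are our assumptions, not taken from the paper or its repository:

```python
from collections import Counter

def participant_prediction(clip_labels, tie_break="fail"):
    """Aggregate per-clip screening labels ('pass'/'fail') into one
    participant-level call by simple majority vote. Ties default to the
    clinically conservative label."""
    counts = Counter(clip_labels)
    if counts["pass"] == counts["fail"]:
        return tie_break
    return counts.most_common(1)[0][0]

print(participant_prediction(["pass", "fail", "pass", "pass"]))  # pass
print(participant_prediction(["pass", "fail"]))                  # fail (tie)
```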
Affiliation(s)
- Rami Saab
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
- Arjun Balachandar
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
- Hamza Mahdi
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
- Eptehal Nashnoush
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
- Lucas X. Perri
  - Goodfellow-Waldron Initiative in Stroke Innovation and Recovery, Division of Neurology, Neurology Quality and Innovation Lab, University of Toronto, Toronto, ON, Canada
- Ashley L. Waldron
  - Goodfellow-Waldron Initiative in Stroke Innovation and Recovery, Division of Neurology, Neurology Quality and Innovation Lab, University of Toronto, Toronto, ON, Canada
- Alireza Sadeghian
  - Department of Computer Science, Faculty of Science, Toronto Metropolitan University, Toronto, ON, Canada
- Gordon Rubenfeld
  - Institute of Medical Science, University of Toronto, Toronto, ON, Canada
  - Interdepartmental Division of Critical Care, Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Mark Crowley
  - Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, ON, Canada
- Mark I. Boulos
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
  - Institute of Medical Science, University of Toronto, Toronto, ON, Canada
- Brian J. Murray
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
- Houman Khosravani
  - Hurvitz Brain Sciences Program, Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, ON, Canada
  - Goodfellow-Waldron Initiative in Stroke Innovation and Recovery, Division of Neurology, Neurology Quality and Innovation Lab, University of Toronto, Toronto, ON, Canada
6
Lai DKH, Cheng ESW, Lim HJ, So BPH, Lam WK, Cheung DSK, Wong DWC, Cheung JCW. Computer-aided screening of aspiration risks in dysphagia with wearable technology: a systematic review and meta-analysis on test accuracy. Front Bioeng Biotechnol 2023; 11:1205009. [PMID: 37441197 PMCID: PMC10334490 DOI: 10.3389/fbioe.2023.1205009] [Received: 04/13/2023] [Accepted: 06/20/2023] [Indexed: 07/15/2023]
Abstract
Aspiration caused by dysphagia is a prevalent problem with serious health consequences, including death. Traditional diagnostic instruments can induce pain, discomfort, nausea, and radiation exposure. The emergence of wearable technology with computer-aided screening might facilitate continuous or frequent assessments to prompt early and effective management. The objectives of this review are to summarize these systems for identifying aspiration risks in dysphagic individuals and to examine their accuracy. Two authors independently searched electronic databases, including CINAHL, Embase, IEEE Xplore® Digital Library, PubMed, Scopus, and Web of Science (PROSPERO reference number: CRD42023408960). The risk of bias and applicability were assessed using QUADAS-2. Nine articles applied accelerometers and/or acoustic devices to identify aspiration risks in patients with neurodegenerative problems (e.g., dementia, Alzheimer's disease) and neurogenic problems (e.g., stroke, brain injury), in addition to some children with congenital abnormalities, using videofluoroscopic swallowing study (VFSS) or fiberoptic endoscopic evaluation of swallowing (FEES) as the reference standard. All studies employed a traditional machine learning approach with a feature extraction process; the support vector machine (SVM) was the most commonly used model. A meta-analysis was conducted to evaluate the classification accuracy and identify risky swallows. Nevertheless, we decided not to draw conclusions from the meta-analysis findings (pooled diagnostic odds ratio: 21.5, 95% CI 2.7-173.6) because the studies had unique methodological characteristics and major differences in their sets of parameters/thresholds, in addition to substantial heterogeneity and variation, with sensitivity ranging from 21.7% to 90.0% between studies. Small sample sizes could be a critical problem in existing studies (median = 34.5, range 18-449), especially for machine learning models. Only two of the nine studies had an optimized model with sensitivity over 90%. Sample sizes need to be enlarged for better generalizability, and signal processing, segmentation, feature extraction, classifiers, and their combinations need to be optimized to improve assessment performance. Systematic Review Registration: https://www.crd.york.ac.uk/prospero/, identifier CRD42023408960.
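The pooled diagnostic odds ratio quoted above summarizes 2x2 test-accuracy tables. For a single study, the DOR and a Woolf (log-normal) 95% confidence interval can be computed as sketched below; the counts are illustrative, and a pooled meta-analytic estimate would additionally weight studies (e.g., by inverse variance):

```python
import math

def diagnostic_odds_ratio(tp, fp, fn, tn, z=1.96):
    """Diagnostic odds ratio with a Woolf (log-normal) 95% CI from a
    single study's 2x2 table of true/false positives and negatives."""
    dor = (tp * tn) / (fp * fn)
    se_log = math.sqrt(1/tp + 1/fp + 1/fn + 1/tn)  # SE of ln(DOR)
    lo = math.exp(math.log(dor) - z * se_log)
    hi = math.exp(math.log(dor) + z * se_log)
    return dor, lo, hi

# Illustrative counts only (not from any of the nine reviewed studies)
dor, lo, hi = diagnostic_odds_ratio(tp=18, fp=4, fn=3, tn=20)
print(round(dor, 1))  # 30.0
```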
Affiliation(s)
- Derek Ka-Hei Lai
  - Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China
- Ethan Shiu-Wang Cheng
  - Department of Electronic and Information Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China
- Hyo-Jung Lim
  - Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China
- Bryan Pak-Hei So
  - Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China
- Wing-Kai Lam
  - Sports Information and External Affairs Centre, Hong Kong Sports Institute Ltd, Hong Kong, China
- Daphne Sze Ki Cheung
  - School of Nursing, The Hong Kong Polytechnic University, Hong Kong, China
  - Research Institute of Smart Ageing, The Hong Kong Polytechnic University, Hong Kong, China
- Duo Wai-Chi Wong
  - Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China
- James Chung-Wai Cheung
  - Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong, China
  - Research Institute of Smart Ageing, The Hong Kong Polytechnic University, Hong Kong, China
7
Hwang H, Lee S, Park HY, Lim HY, Park KH, Park GY, Im S. Investigating the Impact of Voice Impairment on Quality of Life in Stroke Patients: The Voice Handicap Index (VHI) Questionnaire Study. Brain Neurorehabil 2023; 16:e10. [PMID: 37033000 PMCID: PMC10079476 DOI: 10.12786/bn.2023.16.e10] [Received: 02/27/2023] [Revised: 03/23/2023] [Accepted: 03/24/2023] [Indexed: 04/05/2023]
Abstract
The Voice Handicap Index (VHI) is a patient-centered evaluation tool specifically designed for assessing voice-related quality of life. Although the VHI has been extensively used in patients with voice disorders, its applicability in stroke patients has not been fully established. This prospective cross-sectional study aimed to investigate the feasibility of using the VHI questionnaire to identify stroke patients with voice problems. The study included a cohort of acute to subacute first-ever stroke patients (n = 48), with or without voice problems, as well as other non-stroke patients (n = 31), who agreed to complete the VHI questionnaire. Stroke patients with self-reported voice problems demonstrated significantly higher VHI scores and poorer quality-of-life scores compared with the control groups. These patients also had lower Mini-Mental State Examination (MMSE), Modified Barthel Index (MBI), and EuroQoL-5D-5L (EQ-5D-5L) scores. Spearman correlation analysis revealed an inverse association between VHI scores and the EQ-5D-5L (rho = -0.77, p < 0.001), the Korean Mann Assessment of Swallowing Ability (rho = -0.51, p < 0.001), and other functional parameters, including the National Institutes of Health Stroke Scale, MMSE, and MBI scores. Multiple regression analysis indicated that the VHI score was the largest contributing factor to EQ scores. This is the first study to demonstrate that stroke patients with voice problems may experience reduced quality of life, even after controlling for other confounding factors such as dysphagia or neurological deficits. Future studies are needed to determine whether addressing these issues by implementing the VHI can improve patients' quality of life.
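The associations above are Spearman rank correlations, i.e., the Pearson correlation of the rank-transformed data. A minimal pure-Python sketch of that computation (in practice one would use scipy.stats.spearmanr):

```python
def _ranks(values):
    """Average ranks (1-based), handling ties by averaging tied positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1            # average of tied positions, 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation of the two rank vectors."""
    rx, ry = _ranks(x), _ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5

print(spearman_rho([1, 2, 3, 4], [10, 8, 6, 4]))  # -1.0
```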
Affiliation(s)
- Hyemi Hwang
  - Department of Rehabilitation Medicine, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Bucheon, Korea
- Soohoan Lee
  - Department of Rehabilitation Medicine, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Bucheon, Korea
- Hae-Yeon Park
  - Department of Rehabilitation Medicine, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Bucheon, Korea
- Hee Young Lim
  - Department of Rehabilitation Medicine, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Kyung Hyun Park
  - Department of Rehabilitation Medicine, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Geun-Young Park
  - Department of Rehabilitation Medicine, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Bucheon, Korea
- Sun Im
  - Department of Rehabilitation Medicine, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Bucheon, Korea
8
Qiao J, Jiang YT, Dai Y, Gong YB, Dai M, Liu YX, Dou ZL. Research on a real-time dynamic monitoring method for silent aspiration after stroke based on semisupervised deep learning: a protocol study. Digit Health 2023; 9:20552076231183548. [PMID: 37434729 PMCID: PMC10331777 DOI: 10.1177/20552076231183548] [Received: 11/03/2022] [Accepted: 06/05/2023] [Indexed: 07/13/2023]
Abstract
Objective: This study aims to establish a real-time dynamic monitoring system for silent aspiration (SA) to provide evidence for the early diagnosis of and precise intervention for SA after stroke.
Methods: Multisource signals, including sound, nasal airflow, electromyographic, pressure, and acceleration signals, will be obtained by multisource sensors during swallowing events. The extracted signals will be labeled according to videofluoroscopic swallowing studies (VFSSs) and entered into a dedicated dataset. A real-time dynamic monitoring model for SA will then be built and trained based on semisupervised deep learning. Model optimization will be performed based on the mapping relationship between multisource signals and insula-centered cerebral cortex-brainstem functional connectivity through resting-state functional magnetic resonance imaging. Finally, a real-time dynamic monitoring system for SA will be established, whose sensitivity and specificity will be improved through clinical application.
Results: Multisource signals will be stably extracted by multisource sensors. Data from a total of 3200 swallows will be obtained from patients with SA, including 1200 swallows labeled as nonaspiration from VFSSs and 2000 unlabeled swallows. A significant difference in the multisource signals is expected between the SA and nonaspiration groups. The features of labeled and pseudolabeled multisource signals will be extracted through semisupervised deep learning to establish a dynamic monitoring model for SA. Moreover, strong correlations are expected between the Granger causality analysis (GCA) value (from the left middle frontal gyrus to the right anterior insula) and the laryngeal rise time (LRT). Finally, a dynamic monitoring system will be established based on the former model, by which SA can be identified precisely.
Conclusion: The study will establish a real-time dynamic monitoring system for SA with high sensitivity, specificity, accuracy, and F1 score.
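The protocol relies on semisupervised learning over 1200 labeled and 2000 unlabeled swallows. One common scheme for exploiting unlabeled data is confidence-threshold pseudo-labeling; the sketch below illustrates that idea only (the threshold, labels, and function are our assumptions; the protocol does not specify its exact algorithm):

```python
def pseudo_label(unlabeled_probs, threshold=0.95):
    """Assign pseudo-labels to unlabeled swallows whose predicted
    aspiration probability is confidently high or low; the rest stay
    unlabeled for the next training round."""
    labeled, remaining = [], []
    for i, p in enumerate(unlabeled_probs):
        if p >= threshold:
            labeled.append((i, 1))        # pseudo-label: silent aspiration
        elif p <= 1 - threshold:
            labeled.append((i, 0))        # pseudo-label: nonaspiration
        else:
            remaining.append(i)           # still too uncertain to label
    return labeled, remaining

labeled, remaining = pseudo_label([0.99, 0.50, 0.02, 0.80])
print(labeled)    # [(0, 1), (2, 0)]
print(remaining)  # [1, 3]
```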
Affiliation(s)
- Jia Qiao
  - Department of Rehabilitation Medicine, The Third Affiliated Hospital of Sun Yat-Sen University
- Yuan-tong Jiang
  - School of Software Engineering, South China University of Technology
- Yong Dai
  - Clinical Medical College of Acupuncture-Moxibustion and Rehabilitation, Guangzhou University of Chinese Medicine
- Yan-bin Gong
  - Department of Computer Science and Engineering, The Hong Kong University of Science and Technology
- Meng Dai
  - Department of Rehabilitation Medicine, The Third Affiliated Hospital of Sun Yat-Sen University
- Yan-xia Liu
  - School of Software Engineering, South China University of Technology
- Zu-lin Dou
  - Department of Rehabilitation Medicine, The Third Affiliated Hospital of Sun Yat-Sen University