Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wang J, Kothalkar PV, Kim M, Bandini A, Cao B, Yunusova Y, Campbell TF, Heitzman D, Green JR. Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples. Int J Speech Lang Pathol 2018;20:669-679. [PMID: 30409057 PMCID: PMC6506394 DOI: 10.1080/17549507.2018.1508499] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]

For:	Wang J, Kothalkar PV, Kim M, Bandini A, Cao B, Yunusova Y, Campbell TF, Heitzman D, Green JR. Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples. Int J Speech Lang Pathol 2018;20:669-679. [PMID: 30409057 PMCID: PMC6506394 DOI: 10.1080/17549507.2018.1508499] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]

Number

Cited by Other Article(s)

Bouvier L, McKinlay S, Truong J, Genge A, Dupré N, Dionne A, Kalra S, Yunusova Y. Speech timing and monosyllabic diadochokinesis measures in the assessment of amyotrophic lateral sclerosis in Canadian French. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024;26:267-277. [PMID: 37272348 PMCID: PMC10696137 DOI: 10.1080/17549507.2023.2214706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Abstract

PURPOSE

The primary objective of this study was to determine if speech and pause measures obtained using a passage reading task and timing measures from a monosyllabic diadochokinesis (DDK) task differ across speakers of Canadian French diagnosed with amyotrophic lateral sclerosis (ALS) presenting with and without bulbar symptoms, and healthy controls. The secondary objective was to determine if these measures can reflect the severity of bulbar symptoms.

METHOD

A total of 29 Canadian French speakers with ALS (classified as bulbar symptomatic [n = 14] or pre-symptomatic [n = 15]) and 17 age-matched healthy controls completed a passage reading task and a monosyllabic DDK task (/pa/ and /ta/), for up to three follow-up visits. Measures of speaking rate, total duration, speech duration, and pause events were extracted from the passage reading recordings using a semi-automated speech and pause analysis procedure. Manual analysis of DDK recordings provided measures of DDK rate and variability.

RESULT

Group comparisons revealed significant differences (p = < .05) between the symptomatic group and the pre-symptomatic and control groups for all passage measures and DDK rates. Only the DDK rate in /ta/ differentiated the pre-symptomatic and control groups. Repeated measures correlations revealed moderate correlations (rrm = > 0.40; p = < 0.05) between passage measures of total duration, speaking rate, speech duration, and number of pauses, and ALSFRS-R total and bulbar scores, as well as between DDK rate and ALSFRS-R total score.

CONCLUSION

Speech and pause measures in passage and timing measures in monosyllabic DDK tasks might be suitable for monitoring bulbar functional symptoms in French speakers with ALS, but more work is required to identify which measures are sensitive to the earliest stages of the disease.

Collapse

Rowe HP, Stipancic KL, Campbell TF, Yunusova Y, Green JR. The association between longitudinal declines in speech sound accuracy and speech intelligibility in speakers with amyotrophic lateral sclerosis. CLINICAL LINGUISTICS & PHONETICS 2024;38:227-248. [PMID: 37122073 PMCID: PMC10613582 DOI: 10.1080/02699206.2023.2202297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 04/01/2023] [Accepted: 04/03/2023] [Indexed: 05/27/2023]

Teplansky KJ, Wisler A, Goffman L, Wang J. The Impact of Stimulus Length in Tongue and Lip Movement Pattern Stability in Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-13. [PMID: 37988653 DOI: 10.1044/2023_jslhr-23-00079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]

Abstract

PURPOSE

This study aimed to investigate the effect of stimulus signal length on tongue and lip motion pattern stability in speakers diagnosed with amyotrophic lateral sclerosis (ALS) compared to healthy controls.

METHOD

Electromagnetic articulography was used to derive articulatory motion patterns from individuals with mild (n = 27) and severe (n = 16) ALS and healthy controls (n = 25). The spatiotemporal index (STI) was used as a measure of articulatory stability. Two experiments were conducted to evaluate signal length effects on the STI: (a) the effect of the number of syllables on STI values and (b) increasing lengths of subcomponents of a single phrase. Two-way mixed analyses of variance were conducted to assess the effects of syllable length and group on the STI for the tongue tip (TT), tongue back (TB), and lower lip (LL).

RESULTS

Experiment 1 showed a significant main effect of syllable length (TT, p < .001; TB, p < .001; and LL, p < .001) and group (TT, p = .037; TB, p = .007; and LL, p = .017). TB and LL stability was generally higher with speech stimuli that included a greater number of syllables. Articulatory variability was significantly higher in speakers diagnosed with ALS compared to healthy controls. Experiment 2 showed a significant main effect of length (TT, p < .001; TB, p = .015; and LL, p < .001), providing additional support that STI values tend to be greater when calculated on longer speech signals.

CONCLUSIONS

Articulatory stability is influenced by the length of speech signals and manifests similarly in both healthy speakers and persons with ALS. TT stability may be significantly impacted by phonemic content due to greater movement flexibility. Compared to healthy controls, there was an increase in articulatory variability in those with ALS, which likely reflects deviations in speech motor control.

SUPPLEMENTAL MATERIAL

https://doi.org/10.23641/asha.24463924.

Collapse

Migliorelli L, Berardini D, Cela K, Coccia M, Villani L, Frontoni E, Moccia S. A store-and-forward cloud-based telemonitoring system for automatic assessing dysarthria evolution in neurological diseases from video-recording analysis. Comput Biol Med 2023;163:107194. [PMID: 37421736 DOI: 10.1016/j.compbiomed.2023.107194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 06/06/2023] [Accepted: 06/19/2023] [Indexed: 07/10/2023]

Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J Med Internet Res 2023;25:e46105. [PMID: 37467031 PMCID: PMC10398366 DOI: 10.2196/46105] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/26/2023] [Accepted: 05/23/2023] [Indexed: 07/20/2023] Open

Abstract

BACKGROUND

Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems.

OBJECTIVE

This study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest.

METHODS

This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group.

RESULTS

In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network-based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models.

CONCLUSIONS

This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research.

Collapse

Visibelli A, Roncaglia B, Spiga O, Santucci A. The Impact of Artificial Intelligence in the Odyssey of Rare Diseases. Biomedicines 2023;11:887. [PMID: 36979866 PMCID: PMC10045927 DOI: 10.3390/biomedicines11030887] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 02/28/2023] [Accepted: 03/08/2023] [Indexed: 03/16/2023] Open

Simmatis LER, Robin J, Pommée T, McKinlay S, Sran R, Taati N, Truong J, Koyani B, Yunusova Y. Validation of automated pipeline for the assessment of a motor speech disorder in amyotrophic lateral sclerosis (ALS). Digit Health 2023;9:20552076231219102. [PMID: 38144173 PMCID: PMC10748679 DOI: 10.1177/20552076231219102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 11/20/2023] [Indexed: 12/26/2023] Open

Abstract

Background and objective

Amyotrophic lateral sclerosis (ALS) frequently causes speech impairments, which can be valuable early indicators of decline. Automated acoustic assessment of speech in ALS is attractive, and there is a pressing need to validate such tools in line with best practices, including analytical and clinical validation. We hypothesized that data analysis using a novel speech assessment pipeline would correspond strongly to analyses performed using lab-standard practices and that acoustic features from the novel pipeline would correspond to clinical outcomes of interest in ALS.

Methods

We analyzed data from three standard speech assessment tasks (i.e., vowel phonation, passage reading, and diadochokinesis) in 122 ALS patients. Data were analyzed automatically using a pipeline developed by Winterlight Labs, which yielded 53 acoustic features. First, for analytical validation, data were analyzed using a lab-standard analysis pipeline for comparison. This was followed by univariate analysis (Spearman correlations between individual features in Winterlight and in-lab datasets) and multivariate analysis (sparse canonical correlation analysis (SCCA)). Subsequently, clinical validation was performed. This included univariate analysis (Spearman correlation between automated acoustic features and clinical measures) and multivariate analysis (interpretable autoencoder-based dimensionality reduction).

Results

Analytical validity was demonstrated by substantial univariate correlations (Spearman's ρ > 0.70) between corresponding pairs of features from automated and lab-based datasets, as well as interpretable SCCA feature groups. Clinical validity was supported by strong univariate correlations between automated features and clinical measures (Spearman's ρ > 0.70), as well as associations between multivariate outputs and clinical measures.

Conclusion

This novel, automated speech assessment feature set demonstrates substantial promise as a valid tool for analyzing impaired speech in ALS patients and for the further development of these technologies.

Collapse

Guarin DL, Taati B, Abrahao A, Zinman L, Yunusova Y. Video-Based Facial Movement Analysis in the Assessment of Bulbar Amyotrophic Lateral Sclerosis: Clinical Validation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:4667-4678. [PMID: 36367528 PMCID: PMC9940890 DOI: 10.1044/2022_jslhr-22-00072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/31/2022] [Accepted: 08/12/2022] [Indexed: 06/03/2023]

Thomas A, Teplansky KJ, Wisler A, Heitzman D, Austin S, Wang J. Voice Onset Time in Early- and Late-Stage Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:2586-2593. [PMID: 35858258 PMCID: PMC9907452 DOI: 10.1044/2022_jslhr-21-00632] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 02/24/2022] [Accepted: 04/11/2022] [Indexed: 05/26/2023]

Teplansky KJ, Wisler A, Green JR, Campbell T, Heitzman D, Austin SG, Wang J. Tongue and Lip Acceleration as a Measure of Speech Decline in Amyotrophic Lateral Sclerosis. Folia Phoniatr Logop 2022;75:23-34. [PMID: 35760064 PMCID: PMC9792632 DOI: 10.1159/000525514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 06/02/2022] [Indexed: 01/22/2023] Open

Abstract

PURPOSE

The goal of this study was to examine the efficacy of acceleration-based articulatory measures in characterizing the decline in speech motor control due to amyotrophic lateral sclerosis (ALS).

METHOD

Electromagnetic articulography was used to record tongue and lip movements during the production of 20 phrases. Data were collected from 50 individuals diagnosed with ALS. Articulatory kinematic variability was measured using the spatiotemporal index of both instantaneous acceleration and speed signals. Linear regression models were used to analyze the relationship between variability measures and intelligible speaking rate (a clinical measure of disease progression). A machine learning algorithm (support vector regression, SVR) was used to assess whether acceleration or speed features (e.g., mean, median, maximum) showed better performance at predicting speech severity in patients with ALS.

RESULTS

As intelligible speaking rate declined, the variability of acceleration of tongue and lip movement patterns significantly increased (p < 0.001). The variability of speed and vertical displacement did not significantly predict speech performance measures. Additionally, based on R2 and root mean square error (RMSE) values, the SVR model was able to predict speech severity more accurately from acceleration features (R2 = 0.601, RMSE = 38.453) and displacement features (R2 = 0.218, RMSE = 52.700) than from speed features (R2 = 0.554, RMSE = 40.772).

CONCLUSION

Results from these models highlight differences in speech motor control in participants with ALS. The variability in acceleration of tongue and lip movements increases as speech performance declines, potentially reflecting physiological deviations due to the progression of ALS. Our findings suggest that acceleration is a more sensitive indicator of speech deterioration due to ALS than displacement and speed and may contribute to improved algorithm designs for monitoring disease progression from speech signals.

Collapse

Lehner K, Ziegler W. Indicators of Communication Limitation in Dysarthria and Their Relation to Auditory-Perceptual Speech Symptoms: Construct Validity of the KommPaS Web App. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:22-42. [PMID: 34890213 DOI: 10.1044/2021_jslhr-21-00215] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Abstract

PURPOSE

Despite extensive research into communication-related parameters in dysarthria, such as intelligibility, naturalness, and perceived listener effort, the existing evidence has not been translated into a clinically applicable, comprehensive, and valid diagnostic tool so far. This study addresses Communication-Related Parameters in Speech Disorders (KommPaS), a new web-based diagnostic instrument for measuring indices of communication limitation in individuals with dysarthria through online crowdsourcing. More specifically, it answers questions about the construct validity of KommPaS. In the first part, the interrelationship of the KommPaS variables intelligibility, naturalness, perceived listener effort, and speech rate were explored in order to draw a comprehensive picture of a patient's limitations and avoid the collection of redundant information. Second, the influences of motor speech symptoms on the KommPaS variables were studied in order to delineate the structural relationships between two complementary diagnostic perspectives.

METHOD

One hundred persons with dysarthria of different etiologies and varying degrees of severity were examined with KommPaS to obtain layperson-based data on communication-level parameters, and with the Bogenhausen Dysarthria Scale (BoDyS) to obtain expert-based, function-level data on dysarthria symptoms. The internal structure of the KommPaS variables and their dependence on the BoDyS variables were analyzed using structural equation modeling.

RESULTS

Despite a high multicollinearity, all KommPaS variables were shown to provide complementary diagnostic information and their mutual interconnections were delineated in a path graph model. Regarding the influence of the BoDyS scales on the KommPaS variables, separate linear regression models revealed plausible predictor sets. A complete path model of KommPaS and BoDyS variables was developed to map the complex interplay between variables at the functional and the communication levels of dysarthria assessment.

CONCLUSION

In validating a new clinical tool for the diagnostics of communication limitations in dysarthria, this study is the first to draw a comprehensive picture of how auditory-perceptual characteristics of dysarthria interact at the levels of expert-based functional and layperson-based communicative assessments.

Collapse

Stipancic KL, Palmer KM, Rowe HP, Yunusova Y, Berry JD, Green JR. "You Say Severe, I Say Mild": Toward an Empirical Classification of Dysarthria Severity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021;64:4718-4735. [PMID: 34762814 PMCID: PMC9150682 DOI: 10.1044/2021_jslhr-21-00197] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 07/07/2021] [Accepted: 08/12/2021] [Indexed: 05/19/2023]

Abstract

PURPOSE

The main purpose of this study was to create an empirical classification system for speech severity in patients with dysarthria secondary to amyotrophic lateral sclerosis (ALS) by exploring the reliability and validity of speech-language pathologists' (SLPs') ratings of dysarthric speech.

METHOD

Ten SLPs listened to speech samples from 52 speakers with ALS and 20 healthy control speakers. SLPs were asked to rate the speech severity of the speakers using five response options: normal, mild, moderate, severe, and profound. Four severity-surrogate measures were also calculated: SLPs transcribed the speech samples for the calculation of speech intelligibility and rated the effort it took to understand the speakers on a visual analog scale. In addition, speaking rate and intelligible speaking rate were calculated for each speaker. Intrarater and interrater reliability were calculated for each measure. We explored the validity of clinician-based severity ratings by comparing them to the severity-surrogate measures. Receiver operating characteristic (ROC) curves were conducted to create optimal cutoff points for defining dysarthria severity categories.

RESULTS

Intrarater and interrater reliability for the clinician-based severity ratings were excellent and were comparable to reliability for the severity-surrogate measures explored. Clinician severity ratings were strongly associated with all severity-surrogate measures, suggesting strong construct validity. We also provided a range of values for each severity-surrogate measure within each severity category based on the cutoff points obtained from the ROC analyses.

CONCLUSIONS

Clinician severity ratings of dysarthric speech are reliable and valid. We discuss the underlying challenges that arise when selecting a stratification measure and offer recommendations for a classification scheme when stratifying patients and research participants into speech severity categories.

Collapse

Woisard V, Balaguer M, Fredouille C, Farinas J, Ghio A, Lalain M, Puech M, Astesano C, Pinquier J, Lepage B. Construction of an automatic score for the evaluation of speech disorders among patients treated for a cancer of the oral cavity or the oropharynx: The Carcinologic Speech Severity Index. Head Neck 2021;44:71-88. [PMID: 34729847 DOI: 10.1002/hed.26903] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 08/15/2021] [Accepted: 10/05/2021] [Indexed: 01/26/2023] Open

Abstract

BACKGROUND

Speech disorders impact quality of life for patients treated with oral cavity and oropharynx cancers. However, there is a lack of uniform and applicable methods for measuring the impact on speech production after treatment in this tumor location.

OBJECTIVE

The objective of this work is to (1) model an automatic severity index of speech applicable in clinical practice, that is equivalent or superior to a severity score obtained by human listeners, via several acoustics parameters extracted (a) directly from speech signal and (b) resulting from speech processing and (2) derive an automatic speech intelligibility classification (i.e., mild, moderate, severe) to predict speech disability and handicap by combining the listener comprehension score with self-reported quality of life related to speech.

METHODS

Eighty-seven patients treated for cancer of the oral cavity or the oropharynx and 35 controls performed different tasks of speech production and completed questionnaires on speech-related quality of life. The audio recordings were then evaluated by human perception and automatic speech processing. Then, a score was developed through a classic logistic regression model allowing description of the severity of patients' speech disorders.

RESULTS

Among the group of parameters subject to extraction from automatic processing of the speech signal, six were retained, producing a correlation at 0.87 with the perceptual reference score, 0.77 with the comprehension score, and 0.5 with speech-related quality of life. The parameters that contributed the most are based on automatic speech recognition systems. These are mainly the automatic average normalized likelihood score on a text reading task and the score of cumulative rankings on pseudowords. The reduced automatic YC2SI is modeled in this way: Y_C2SIp = 11.48726 + (1.52926 × _{Xaveraged normalized likelihood reading} ) + (-1.94e-06 × _{Xscore of cumulative ranks pseudowords} ).

CONCLUSION

Automatic processing of speech makes it possible to arrive at valid, reliable, and reproducible parameters able to serve as references in the framework of follow-up of patients treated for cancer of the oral cavity or the oropharynx.

Collapse

Wisler A, Teplansky K, Heitzman D, Wang J. The Effects of Symptom Onset Location on Automatic Amyotrophic Lateral Sclerosis Detection Using the Correlation Structure of Articulatory Movements. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021;64:2276-2286. [PMID: 33647219 PMCID: PMC8740667 DOI: 10.1044/2020_jslhr-20-00288] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 09/22/2020] [Accepted: 11/19/2020] [Indexed: 06/12/2023]

Fernandes F, Barbalho I, Barros D, Valentim R, Teixeira C, Henriques J, Gil P, Dourado Júnior M. Biomedical signals and machine learning in amyotrophic lateral sclerosis: a systematic review. Biomed Eng Online 2021;20:61. [PMID: 34130692 PMCID: PMC8207575 DOI: 10.1186/s12938-021-00896-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 06/09/2021] [Indexed: 12/11/2022] Open

Tena A, Claria F, Solsona F, Meister E, Povedano M. Detection of Bulbar Involvement in Patients With Amyotrophic Lateral Sclerosis by Machine Learning Voice Analysis: Diagnostic Decision Support Development Study. JMIR Med Inform 2021;9:e21331. [PMID: 33688838 PMCID: PMC7991994 DOI: 10.2196/21331] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 10/26/2020] [Accepted: 01/17/2021] [Indexed: 11/13/2022] Open

Abstract

Background

Bulbar involvement is a term used in amyotrophic lateral sclerosis (ALS) that refers to motor neuron impairment in the corticobulbar area of the brainstem, which produces a dysfunction of speech and swallowing. One of the earliest symptoms of bulbar involvement is voice deterioration characterized by grossly defective articulation; extremely slow, laborious speech; marked hypernasality; and severe harshness. Bulbar involvement requires well-timed and carefully coordinated interventions. Therefore, early detection is crucial to improving the quality of life and lengthening the life expectancy of patients with ALS who present with this dysfunction. Recent research efforts have focused on voice analysis to capture bulbar involvement.

Objective

The main objective of this paper was (1) to design a methodology for diagnosing bulbar involvement efficiently through the acoustic parameters of uttered vowels in Spanish, and (2) to demonstrate that the performance of the automated diagnosis of bulbar involvement is superior to human diagnosis.

Methods

The study focused on the extraction of features from the phonatory subsystem—jitter, shimmer, harmonics-to-noise ratio, and pitch—from the utterance of the five Spanish vowels. Then, we used various supervised classification algorithms, preceded by principal component analysis of the features obtained.

Results

To date, support vector machines have performed better (accuracy 95.8%) than the models analyzed in the related work. We also show how the model can improve human diagnosis, which can often misdiagnose bulbar involvement.

Conclusions

The results obtained are very encouraging and demonstrate the efficiency and applicability of the automated model presented in this paper. It may be an appropriate tool to help in the diagnosis of ALS by multidisciplinary clinical teams, in particular to improve the diagnosis of bulbar involvement.

Collapse

Goyal NA, Berry JD, Windebank A, Staff NP, Maragakis NJ, van den Berg LH, Genge A, Miller R, Baloh RH, Kern R, Gothelf Y, Lebovits C, Cudkowicz M. Addressing heterogeneity in amyotrophic lateral sclerosis CLINICAL TRIALS. Muscle Nerve 2020;62:156-166. [PMID: 31899540 PMCID: PMC7496557 DOI: 10.1002/mus.26801] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 12/30/2019] [Accepted: 12/31/2019] [Indexed: 12/12/2022]

Wisler AA, Fletcher AR, McAuliffe MJ. Predicting Montreal Cognitive Assessment Scores From Measures of Speech and Language. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020;63:1752-1761. [PMID: 32459131 DOI: 10.1044/2020_jslhr-19-00183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Generative Adversarial Network-Based Neural Audio Caption Model for Oral Evaluation. ELECTRONICS 2020. [DOI: 10.3390/electronics9030424] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract Oral evaluation is one of the most critical processes in children’s language learning. Traditionally, the Scoring Rubric is widely used in oral evaluation for providing a ranking score by assessing word accuracy, phoneme accuracy, fluency, and accent position of a tester. In recent years, by the emerging demands of the market, oral evaluation requires not only providing a single score from pronunciation but also in-depth, meaning comments based on content, context, logic, and understanding. However, the Scoring Rubric requires massive human work (oral evaluation experts) to provide such deep meaning comments. It is considered uneconomical and inefficient in the current market. Therefore, this paper proposes an automated expert comment generation approach for oral evaluation. The approach first extracts the oral features from the children’s audio as well as the text features from the corresponding expert comments. Then, a Gated Recurrent Unit (GRU) is applied to encode the oral features into the model. Afterwards, a Long Short-Term Memory (LSTM) model is applied to train the mappings between oral features and text features and generate expert comments for the new coming oral audio. Finally, a Generative Adversarial Network (GAN) is combined to improve the quality of the generated comments. It generates pseudo-comments to train the discriminator to recognize the human-like comments. The proposed approach is evaluated in a real-world audio dataset (children oral audio) collected by our collaborative company. The proposed approach is also integrated into a commercial application to generate expert comments for children’s oral evaluation. The experimental results and the lessons learned from real-world applications show that the proposed approach is effective for providing meaningful comments for oral evaluation. Collapse

Barnett C, Green JR, Marzouqah R, Stipancic KL, Berry JD, Korngut L, Genge A, Shoesmith C, Briemberg H, Abrahao A, Kalra S, Zinman L, Yunusova Y. Reliability and validity of speech & pause measures during passage reading in ALS. Amyotroph Lateral Scler Frontotemporal Degener 2020;21:42-50. [PMID: 32138555 PMCID: PMC7080316 DOI: 10.1080/21678421.2019.1697888] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 11/11/2019] [Indexed: 10/25/2022]

Affiliation(s)

Carolina Barnett Division of Neurology, Department of Medicine, University of Toronto and University Health Network, Toronto, Canada Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
Jordan R Green Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA, USA Speech and Hearing Biosciences and Technology Program, Harvard University, Cambridge, MA, USA
Reeman Marzouqah Department of Speech-Language Pathology, University of Toronto, Toronto, Canada
Kaila L Stipancic Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA, USA
James D Berry Harvard Medical School, Department of Neurology, Massachusetts General Hospital (MGH), Boston, Massachusetts, USA,
Lawrence Korngut Department of Clinical Neurosciences, Hotchkiss Brain Institute, University of Calgary, Calgary, Canada
Angela Genge Montreal Neurological Institute, Neurosurgery, McGill University, Montreal, Canada
Christen Shoesmith Department of Clinical Neurological Sciences, University of Western Ontario, London, Canada
Hannah Briemberg Division of Neurology, University of British Columbia, Vancouver, Canada
Agessandro Abrahao Department of Medicine, Division of Neurology, Sunnybrook Health Sciences Centre, Toronto, Canada Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada
Sanjay Kalra Neuroscience and Mental Health Institute, University of Alberta, Edmonton, Canada Division of Neurology, University of Alberta, Edmonton, Canada
Lorne Zinman Department of Medicine, Division of Neurology, Sunnybrook Health Sciences Centre, Toronto, Canada Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada L.C. Campbell Cognitive Neurology Research Unit, Sunnybrook Research Institute, University of Toronto, Toronto, Canada, and
Yana Yunusova Department of Speech-Language Pathology, University of Toronto, Toronto, Canada Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada Toronto Rehabilitation Institute, University Health Network, Toronto, Canada

Collapse

Chiaramonte R, Bonfiglio M. Acoustic analysis of voice in bulbar amyotrophic lateral sclerosis: a systematic review and meta-analysis of studies. LOGOP PHONIATR VOCO 2019;45:151-163. [DOI: 10.1080/14015439.2019.1687748] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]