1
|
Bouvier L, McKinlay S, Truong J, Genge A, Dupré N, Dionne A, Kalra S, Yunusova Y. Speech timing and monosyllabic diadochokinesis measures in the assessment of amyotrophic lateral sclerosis in Canadian French. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024; 26:267-277. [PMID: 37272348 PMCID: PMC10696137 DOI: 10.1080/17549507.2023.2214706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
PURPOSE The primary objective of this study was to determine if speech and pause measures obtained using a passage reading task and timing measures from a monosyllabic diadochokinesis (DDK) task differ across speakers of Canadian French diagnosed with amyotrophic lateral sclerosis (ALS) presenting with and without bulbar symptoms, and healthy controls. The secondary objective was to determine if these measures can reflect the severity of bulbar symptoms. METHOD A total of 29 Canadian French speakers with ALS (classified as bulbar symptomatic [n = 14] or pre-symptomatic [n = 15]) and 17 age-matched healthy controls completed a passage reading task and a monosyllabic DDK task (/pa/ and /ta/), for up to three follow-up visits. Measures of speaking rate, total duration, speech duration, and pause events were extracted from the passage reading recordings using a semi-automated speech and pause analysis procedure. Manual analysis of DDK recordings provided measures of DDK rate and variability. RESULT Group comparisons revealed significant differences (p = < .05) between the symptomatic group and the pre-symptomatic and control groups for all passage measures and DDK rates. Only the DDK rate in /ta/ differentiated the pre-symptomatic and control groups. Repeated measures correlations revealed moderate correlations (rrm = > 0.40; p = < 0.05) between passage measures of total duration, speaking rate, speech duration, and number of pauses, and ALSFRS-R total and bulbar scores, as well as between DDK rate and ALSFRS-R total score. CONCLUSION Speech and pause measures in passage and timing measures in monosyllabic DDK tasks might be suitable for monitoring bulbar functional symptoms in French speakers with ALS, but more work is required to identify which measures are sensitive to the earliest stages of the disease.
Collapse
Affiliation(s)
- Liziane Bouvier
- School of Communication Sciences and Disorders, McGill University, Montreal, Canada
| | - Scotia McKinlay
- Department of Speech-Language Pathology, University of Toronto, Toronto, Canada
| | - Justin Truong
- Department of Speech-Language Pathology, University of Toronto, Toronto, Canada
| | - Angela Genge
- Montreal Neurological Institute-Hospital – The Neuro, Montréal, Canada
| | - Nicolas Dupré
- Neurosciences axis, CHU de Québec-Université Laval, Quebec City, QC, Canada
- Department of Medicine, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
| | - Annie Dionne
- Neurosciences axis, CHU de Québec-Université Laval, Quebec City, QC, Canada
- Department of Medicine, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
| | - Sanjay Kalra
- Neuroscience and Mental Health Institute, University of Alberta, Edmonton, Canada
- Division of Neurology, University of Alberta, Edmonton, Canada
| | - Yana Yunusova
- Department of Speech-Language Pathology, University of Toronto, Toronto, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada
- University Health Network—Toronto Rehabilitation Institute, Toronto, Canada
| |
Collapse
|
2
|
Rowe HP, Stipancic KL, Campbell TF, Yunusova Y, Green JR. The association between longitudinal declines in speech sound accuracy and speech intelligibility in speakers with amyotrophic lateral sclerosis. CLINICAL LINGUISTICS & PHONETICS 2024; 38:227-248. [PMID: 37122073 PMCID: PMC10613582 DOI: 10.1080/02699206.2023.2202297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 04/01/2023] [Accepted: 04/03/2023] [Indexed: 05/27/2023]
Abstract
The purpose of this study was to examine how neurodegeneration secondary to amyotrophic lateral sclerosis (ALS) impacts speech sound accuracy over time and how speech sound accuracy, in turn, is related to speech intelligibility. Twenty-one participants with ALS read the Bamboo Passage over multiple data collection sessions across several months. Phonemic and orthographic transcriptions were completed for all speech samples. The percentage of phonemes accurately produced was calculated across each phoneme, sound class (i.e. consonants versus vowels), and distinctive feature (i.e. features involved in Manner of Articulation, Place of Articulation, Laryngeal Voicing, Tongue Height, and Tongue Advancement). Intelligibility was determined by calculating the percentage of words correctly transcribed orthographically by naive listeners. Linear mixed effects models were conducted to assess the decline of each distinctive feature over time and its impact on intelligibility. The results demonstrated that overall phonemic production accuracy had a nonlinear relationship with speech intelligibility and that a subset of features (i.e. those dependent on precise lingual and labial constriction and/or extensive lingual and labial movement) were more important for intelligibility and were more impacted over time than other features. Furthermore, findings revealed that consonants were more strongly associated with intelligibility than vowels, but consonants did not significantly differ from vowels in their decline over time. These findings have the potential to (1) strengthen mechanistic understanding of the physiological constraints imposed by neuronal degeneration on speech production and (2) inform the timing and selection of treatment and assessment targets for individuals with ALS.
Collapse
Affiliation(s)
- Hannah P Rowe
- Department of Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts, USA
| | - Kaila L Stipancic
- Department of Communicative Disorders and Sciences, The State University of New York, Buffalo, New York, USA
| | - Thomas F Campbell
- Callier Center for Communication Disorders, University of Texas, Dallas, Texas, USA
| | - Yana Yunusova
- Department of Speech-Language Pathology and Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada
- Hurvitz Brain Sciences Research Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
- KITE Research Center, Toronto Rehabilitation Institute, Toronto, Ontario, Canada
| | - Jordan R Green
- Department of Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts, USA
| |
Collapse
|
3
|
Teplansky KJ, Wisler A, Goffman L, Wang J. The Impact of Stimulus Length in Tongue and Lip Movement Pattern Stability in Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-13. [PMID: 37988653 DOI: 10.1044/2023_jslhr-23-00079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
PURPOSE This study aimed to investigate the effect of stimulus signal length on tongue and lip motion pattern stability in speakers diagnosed with amyotrophic lateral sclerosis (ALS) compared to healthy controls. METHOD Electromagnetic articulography was used to derive articulatory motion patterns from individuals with mild (n = 27) and severe (n = 16) ALS and healthy controls (n = 25). The spatiotemporal index (STI) was used as a measure of articulatory stability. Two experiments were conducted to evaluate signal length effects on the STI: (a) the effect of the number of syllables on STI values and (b) increasing lengths of subcomponents of a single phrase. Two-way mixed analyses of variance were conducted to assess the effects of syllable length and group on the STI for the tongue tip (TT), tongue back (TB), and lower lip (LL). RESULTS Experiment 1 showed a significant main effect of syllable length (TT, p < .001; TB, p < .001; and LL, p < .001) and group (TT, p = .037; TB, p = .007; and LL, p = .017). TB and LL stability was generally higher with speech stimuli that included a greater number of syllables. Articulatory variability was significantly higher in speakers diagnosed with ALS compared to healthy controls. Experiment 2 showed a significant main effect of length (TT, p < .001; TB, p = .015; and LL, p < .001), providing additional support that STI values tend to be greater when calculated on longer speech signals. CONCLUSIONS Articulatory stability is influenced by the length of speech signals and manifests similarly in both healthy speakers and persons with ALS. TT stability may be significantly impacted by phonemic content due to greater movement flexibility. Compared to healthy controls, there was an increase in articulatory variability in those with ALS, which likely reflects deviations in speech motor control. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.24463924.
Collapse
Affiliation(s)
- Kristin J Teplansky
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
| | - Alan Wisler
- Department of Mathematics and Statistics, Utah State University, Logan
| | - Lisa Goffman
- Callier Center for Communication Disorders, The University of Texas at Dallas, Richardson
| | - Jun Wang
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
- Department of Neurology, The University of Texas at Austin
| |
Collapse
|
4
|
Migliorelli L, Berardini D, Cela K, Coccia M, Villani L, Frontoni E, Moccia S. A store-and-forward cloud-based telemonitoring system for automatic assessing dysarthria evolution in neurological diseases from video-recording analysis. Comput Biol Med 2023; 163:107194. [PMID: 37421736 DOI: 10.1016/j.compbiomed.2023.107194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 06/06/2023] [Accepted: 06/19/2023] [Indexed: 07/10/2023]
Abstract
BACKGROUND AND OBJECTIVES Patients suffering from neurological diseases may develop dysarthria, a motor speech disorder affecting the execution of speech. Close and quantitative monitoring of dysarthria evolution is crucial for enabling clinicians to promptly implement patients' management strategies and maximizing effectiveness and efficiency of communication functions in term of restoring, compensating or adjusting. In the clinical assessment of orofacial structures and functions, at rest condition or during speech and non-speech movements, a qualitative evaluation is usually performed, throughout visual observation. METHODS To overcome limitations posed by qualitative assessments, this work presents a store-and-forward self-service telemonitoring system that integrates, within its cloud architecture, a convolutional neural network (CNN) for analyzing video recordings acquired by individuals with dysarthria. This architecture - called facial landmark Mask RCNN - aims at locating facial landmarks as a prior for assessing the orofacial functions related to speech and examining dysarthria evolution in neurological diseases. RESULTS When tested on the Toronto NeuroFace dataset, a publicly available annotated dataset of video recordings from patients with amyotrophic lateral sclerosis (ALS) and stroke, the proposed CNN achieved a normalized mean error equal to 1.79 on localizing the facial landmarks. We also tested our system in a real-life scenario on 11 bulbar-onset ALS subjects, obtaining promising outcomes in terms of facial landmark position estimation. DISCUSSION AND CONCLUSIONS This preliminary study represents a relevant step towards the use of remote tools to support clinicians in monitoring the evolution of dysarthria.
Collapse
Affiliation(s)
- Lucia Migliorelli
- Department of Information Engineering, Univeristà Politecnica Delle Marche, Via Brecce Bianche 12, Ancona, 60121, Italy; AIDAPT S.r.l, Via Brecce Bianche 12, Ancona, 60121, Italy.
| | - Daniele Berardini
- Department of Information Engineering, Univeristà Politecnica Delle Marche, Via Brecce Bianche 12, Ancona, 60121, Italy.
| | - Kevin Cela
- Department of Information Engineering, Univeristà Politecnica Delle Marche, Via Brecce Bianche 12, Ancona, 60121, Italy; AIDAPT S.r.l, Via Brecce Bianche 12, Ancona, 60121, Italy.
| | - Michela Coccia
- Centro Clinico NeuroMuscular Omnicentre (NeMO), Fondazione Serena Onlus, Via Conca 71, Torrette (Ancona), 60126, Italy.
| | - Laura Villani
- Department of Neuroscience, Neurorehabilitation Clinic, Azienda Ospedaliero-Universitaria delle Marche, Via Conca 71, Torrette (Ancona), 60126, Italy.
| | - Emanuele Frontoni
- AIDAPT S.r.l, Via Brecce Bianche 12, Ancona, 60121, Italy; Department of Political Sciences, Communication, and International Relations, Università Degli Studi di Macerata, Via Giovanni Mario Crescimbeni 30, Macerata, 62100, Italy; NeMO Lab, Piazza dell'Ospedale Maggiore, Milano, 20162, Italy.
| | - Sara Moccia
- The BioRobotics Institute, Department of Excellence in Robotics and AI, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, Pisa, 56127, Italy.
| |
Collapse
|
5
|
Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J Med Internet Res 2023; 25:e46105. [PMID: 37467031 PMCID: PMC10398366 DOI: 10.2196/46105] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/26/2023] [Accepted: 05/23/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems. OBJECTIVE This study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest. METHODS This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group. RESULTS In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network-based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models. CONCLUSIONS This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research.
Collapse
Affiliation(s)
- Alper Idrisoglu
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Ana Luiza Dallora
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Peter Anderberg
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
- School of Health Sciences, University of Skövde, Skövde, Sweden
| | | |
Collapse
|
6
|
Visibelli A, Roncaglia B, Spiga O, Santucci A. The Impact of Artificial Intelligence in the Odyssey of Rare Diseases. Biomedicines 2023; 11:887. [PMID: 36979866 PMCID: PMC10045927 DOI: 10.3390/biomedicines11030887] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 02/28/2023] [Accepted: 03/08/2023] [Indexed: 03/16/2023] Open
Abstract
Emerging machine learning (ML) technologies have the potential to significantly improve the research and treatment of rare diseases, which constitute a vast set of diseases that affect a small proportion of the total population. Artificial Intelligence (AI) algorithms can help to quickly identify patterns and associations that would be difficult or impossible for human analysts to detect. Predictive modeling techniques, such as deep learning, have been used to forecast the progression of rare diseases, enabling the development of more targeted treatments. Moreover, AI has also shown promise in the field of drug development for rare diseases with the identification of subpopulations of patients who may be most likely to respond to a particular drug. This review aims to highlight the achievements of AI algorithms in the study of rare diseases in the past decade and advise researchers on which methods have proven to be most effective. The review will focus on specific rare diseases, as defined by a prevalence rate that does not exceed 1-9/100,000 on Orphanet, and will examine which AI methods have been most successful in their study. We believe this review can guide clinicians and researchers in the successful application of ML in rare diseases.
Collapse
Affiliation(s)
- Anna Visibelli
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, 53100 Siena, Italy
| | - Bianca Roncaglia
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, 53100 Siena, Italy
| | - Ottavia Spiga
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, 53100 Siena, Italy
- Competence Center ARTES 4.0, 53100 Siena, Italy
- SienabioACTIVE—SbA, 53100 Siena, Italy
| | - Annalisa Santucci
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, 53100 Siena, Italy
- Competence Center ARTES 4.0, 53100 Siena, Italy
- SienabioACTIVE—SbA, 53100 Siena, Italy
| |
Collapse
|
7
|
Simmatis LER, Robin J, Pommée T, McKinlay S, Sran R, Taati N, Truong J, Koyani B, Yunusova Y. Validation of automated pipeline for the assessment of a motor speech disorder in amyotrophic lateral sclerosis (ALS). Digit Health 2023; 9:20552076231219102. [PMID: 38144173 PMCID: PMC10748679 DOI: 10.1177/20552076231219102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 11/20/2023] [Indexed: 12/26/2023] Open
Abstract
Background and objective Amyotrophic lateral sclerosis (ALS) frequently causes speech impairments, which can be valuable early indicators of decline. Automated acoustic assessment of speech in ALS is attractive, and there is a pressing need to validate such tools in line with best practices, including analytical and clinical validation. We hypothesized that data analysis using a novel speech assessment pipeline would correspond strongly to analyses performed using lab-standard practices and that acoustic features from the novel pipeline would correspond to clinical outcomes of interest in ALS. Methods We analyzed data from three standard speech assessment tasks (i.e., vowel phonation, passage reading, and diadochokinesis) in 122 ALS patients. Data were analyzed automatically using a pipeline developed by Winterlight Labs, which yielded 53 acoustic features. First, for analytical validation, data were analyzed using a lab-standard analysis pipeline for comparison. This was followed by univariate analysis (Spearman correlations between individual features in Winterlight and in-lab datasets) and multivariate analysis (sparse canonical correlation analysis (SCCA)). Subsequently, clinical validation was performed. This included univariate analysis (Spearman correlation between automated acoustic features and clinical measures) and multivariate analysis (interpretable autoencoder-based dimensionality reduction). Results Analytical validity was demonstrated by substantial univariate correlations (Spearman's ρ > 0.70) between corresponding pairs of features from automated and lab-based datasets, as well as interpretable SCCA feature groups. Clinical validity was supported by strong univariate correlations between automated features and clinical measures (Spearman's ρ > 0.70), as well as associations between multivariate outputs and clinical measures. Conclusion This novel, automated speech assessment feature set demonstrates substantial promise as a valid tool for analyzing impaired speech in ALS patients and for the further development of these technologies.
Collapse
Affiliation(s)
- Leif ER Simmatis
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- KITE-Toronto Rehabilitation Institute, University Health Network, Toronto, ON, Canada
| | | | - Timothy Pommée
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Sunnybrook Research Institute, Sunnybrook Health Sciences Centre, Toronto, ON, Canada
| | - Scotia McKinlay
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Rupinder Sran
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Niyousha Taati
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Justin Truong
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | | | - Yana Yunusova
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- KITE-Toronto Rehabilitation Institute, University Health Network, Toronto, ON, Canada
- Sunnybrook Research Institute, Sunnybrook Health Sciences Centre, Toronto, ON, Canada
| |
Collapse
|
8
|
Guarin DL, Taati B, Abrahao A, Zinman L, Yunusova Y. Video-Based Facial Movement Analysis in the Assessment of Bulbar Amyotrophic Lateral Sclerosis: Clinical Validation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:4667-4678. [PMID: 36367528 PMCID: PMC9940890 DOI: 10.1044/2022_jslhr-22-00072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/31/2022] [Accepted: 08/12/2022] [Indexed: 06/03/2023]
Abstract
PURPOSE Facial movement analysis during facial gestures and speech provides clinically useful information for assessing bulbar amyotrophic lateral sclerosis (ALS). However, current kinematic methods have limited clinical application due to the equipment costs. Recent advancements in consumer-grade hardware and machine/deep learning made it possible to estimate facial movements from videos. This study aimed to establish the clinical validity of a video-based facial analysis for disease staging classification and estimation of clinical scores. METHOD Fifteen individuals with ALS and 11 controls participated in this study. Participants with ALS were stratified into early and late bulbar ALS groups based on their speaking rate. Participants were recorded with a three-dimensional (3D) camera (color + depth) while repeating a simple sentence 10 times. The lips and jaw movements were estimated, and features related to sentence duration and facial movements were used to train a machine learning model for multiclass classification and to predict the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (ALSFRS-R) bulbar subscore and speaking rate. RESULTS The classification model successfully separated healthy controls, the early ALS group, and the late ALS group with an overall accuracy of 96.1%. Video-based features demonstrated a high ability to estimate the speaking rate (adjusted R 2 = .82) and a moderate ability to predict the ALSFRS-R bulbar subscore (adjusted R 2 = .55). CONCLUSIONS The proposed approach based on a 3D camera and machine learning algorithms represents an easy-to-use and inexpensive system that can be included as part of a clinical assessment of bulbar ALS to integrate facial movement analysis with other clinical data seamlessly.
Collapse
Affiliation(s)
- Diego L. Guarin
- Department of Applied Physiology and Kinesiology, University of Florida, Gainesville
| | - Babak Taati
- KITE–Toronto Rehabilitation Institute, University Health Network, Ontario, Canada
- Department of Computer Science, University of Toronto, Ontario, Canada
- Institute of Biomedical Engineering, University of Toronto, Ontario, Canada
| | - Agessandro Abrahao
- Hurvitz Brain Sciences Research Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
- Harquail Centre for Neuromodulation, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
- Division of Neurology, Department of Medicine, University of Toronto, Ontario, Canada
| | - Lorne Zinman
- Hurvitz Brain Sciences Research Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
- Division of Neurology, Department of Medicine, University of Toronto, Ontario, Canada
- L.C. Campbell Cognitive Neurology Research Unit, Cognitive Neurology, Sunnybrook Research Institute, University of Toronto, Ontario, Canada
| | - Yana Yunusova
- KITE–Toronto Rehabilitation Institute, University Health Network, Ontario, Canada
- Hurvitz Brain Sciences Research Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
- Department of Speech-Language Pathology and Rehabilitation Sciences Institute, University of Toronto, Ontario, Canada
| |
Collapse
|
9
|
Thomas A, Teplansky KJ, Wisler A, Heitzman D, Austin S, Wang J. Voice Onset Time in Early- and Late-Stage Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2586-2593. [PMID: 35858258 PMCID: PMC9907452 DOI: 10.1044/2022_jslhr-21-00632] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 02/24/2022] [Accepted: 04/11/2022] [Indexed: 05/26/2023]
Abstract
PURPOSE Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease that affects bulbar functions including speech and voice. Voice onset time (VOT) was examined in speakers with ALS in early and late stages to explore the coordination of the articulatory and phonatory systems during speech production. METHOD VOT was measured in nonword /bap/ produced by speakers with early-stage ALS (n = 11), late-stage ALS (n = 6), and healthy controls (n = 13), and compared with speech performance decline (a marker of disease progression) in ALS. RESULTS Overall comparison of the VOT values among the three groups showed a significant difference, F(2,27) = 11.71, p < .01. Speakers in late-stage ALS displayed longer voicing lead (negative VOT) than both healthy speakers and speakers in early-stage ALS. VOT was also significantly negatively correlated with speech performance (i.e., Intelligible Speaking Rate), r(15) = .74, p < .01. CONCLUSIONS Speakers with more severe ALS showed greater occurrence of voicing lead and longer voicing lead. Findings show voicing precedes articulatory onset with disease progression in the production of bilabial stops, which suggests that the relative timing of coordination between the supralaryngeal structures and the phonatory system is affected in the late stage of ALS.
Collapse
Affiliation(s)
- Anusha Thomas
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
| | - Kristin J. Teplansky
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
| | - Alan Wisler
- Department of Mathematics and Statistics, Utah State University, Logan
| | | | - Sara Austin
- Department of Neurology, Dell Medical School, The University of Texas at Austin
| | - Jun Wang
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
- Department of Neurology, Dell Medical School, The University of Texas at Austin
| |
Collapse
|
10
|
Teplansky KJ, Wisler A, Green JR, Campbell T, Heitzman D, Austin SG, Wang J. Tongue and Lip Acceleration as a Measure of Speech Decline in Amyotrophic Lateral Sclerosis. Folia Phoniatr Logop 2022; 75:23-34. [PMID: 35760064 PMCID: PMC9792632 DOI: 10.1159/000525514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 06/02/2022] [Indexed: 01/22/2023] Open
Abstract
PURPOSE The goal of this study was to examine the efficacy of acceleration-based articulatory measures in characterizing the decline in speech motor control due to amyotrophic lateral sclerosis (ALS). METHOD Electromagnetic articulography was used to record tongue and lip movements during the production of 20 phrases. Data were collected from 50 individuals diagnosed with ALS. Articulatory kinematic variability was measured using the spatiotemporal index of both instantaneous acceleration and speed signals. Linear regression models were used to analyze the relationship between variability measures and intelligible speaking rate (a clinical measure of disease progression). A machine learning algorithm (support vector regression, SVR) was used to assess whether acceleration or speed features (e.g., mean, median, maximum) showed better performance at predicting speech severity in patients with ALS. RESULTS As intelligible speaking rate declined, the variability of acceleration of tongue and lip movement patterns significantly increased (p < 0.001). The variability of speed and vertical displacement did not significantly predict speech performance measures. Additionally, based on R2 and root mean square error (RMSE) values, the SVR model was able to predict speech severity more accurately from acceleration features (R2 = 0.601, RMSE = 38.453) and displacement features (R2 = 0.218, RMSE = 52.700) than from speed features (R2 = 0.554, RMSE = 40.772). CONCLUSION Results from these models highlight differences in speech motor control in participants with ALS. The variability in acceleration of tongue and lip movements increases as speech performance declines, potentially reflecting physiological deviations due to the progression of ALS. Our findings suggest that acceleration is a more sensitive indicator of speech deterioration due to ALS than displacement and speed and may contribute to improved algorithm designs for monitoring disease progression from speech signals.
Collapse
Affiliation(s)
- Kristin J Teplansky
- Speech, Language, and Hearing Sciences, University of Texas, Austin, Texas, USA,
| | - Alan Wisler
- Mathematics and Statistics, Utah State University, Logan, Utah, USA
| | - Jordan R Green
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, Massachusetts, USA
- Speech and Hearing Bioscience and Technology Program, Harvard University, Boston, Massachusetts, USA
| | - Thomas Campbell
- Callier Center for Communication Disorders, University of Texas at Dallas, Dallas, Texas, USA
| | | | - Sara G Austin
- Neurology, Dell Medical School, University of Texas, Austin, Texas, USA
| | - Jun Wang
- Speech, Language, and Hearing Sciences, University of Texas, Austin, Texas, USA
- Neurology, Dell Medical School, University of Texas, Austin, Texas, USA
| |
Collapse
|
11
|
Lehner K, Ziegler W. Indicators of Communication Limitation in Dysarthria and Their Relation to Auditory-Perceptual Speech Symptoms: Construct Validity of the KommPaS Web App. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:22-42. [PMID: 34890213 DOI: 10.1044/2021_jslhr-21-00215] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
PURPOSE Despite extensive research into communication-related parameters in dysarthria, such as intelligibility, naturalness, and perceived listener effort, the existing evidence has not been translated into a clinically applicable, comprehensive, and valid diagnostic tool so far. This study addresses Communication-Related Parameters in Speech Disorders (KommPaS), a new web-based diagnostic instrument for measuring indices of communication limitation in individuals with dysarthria through online crowdsourcing. More specifically, it answers questions about the construct validity of KommPaS. In the first part, the interrelationship of the KommPaS variables intelligibility, naturalness, perceived listener effort, and speech rate were explored in order to draw a comprehensive picture of a patient's limitations and avoid the collection of redundant information. Second, the influences of motor speech symptoms on the KommPaS variables were studied in order to delineate the structural relationships between two complementary diagnostic perspectives. METHOD One hundred persons with dysarthria of different etiologies and varying degrees of severity were examined with KommPaS to obtain layperson-based data on communication-level parameters, and with the Bogenhausen Dysarthria Scale (BoDyS) to obtain expert-based, function-level data on dysarthria symptoms. The internal structure of the KommPaS variables and their dependence on the BoDyS variables were analyzed using structural equation modeling. RESULTS Despite a high multicollinearity, all KommPaS variables were shown to provide complementary diagnostic information and their mutual interconnections were delineated in a path graph model. Regarding the influence of the BoDyS scales on the KommPaS variables, separate linear regression models revealed plausible predictor sets. A complete path model of KommPaS and BoDyS variables was developed to map the complex interplay between variables at the functional and the communication levels of dysarthria assessment. CONCLUSION In validating a new clinical tool for the diagnostics of communication limitations in dysarthria, this study is the first to draw a comprehensive picture of how auditory-perceptual characteristics of dysarthria interact at the levels of expert-based functional and layperson-based communicative assessments.
Collapse
Affiliation(s)
- Katharina Lehner
- Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-University Munich, Germany
| | - Wolfram Ziegler
- Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-University Munich, Germany
| |
Collapse
|
12
|
Stipancic KL, Palmer KM, Rowe HP, Yunusova Y, Berry JD, Green JR. "You Say Severe, I Say Mild": Toward an Empirical Classification of Dysarthria Severity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:4718-4735. [PMID: 34762814 PMCID: PMC9150682 DOI: 10.1044/2021_jslhr-21-00197] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 07/07/2021] [Accepted: 08/12/2021] [Indexed: 05/19/2023]
Abstract
PURPOSE The main purpose of this study was to create an empirical classification system for speech severity in patients with dysarthria secondary to amyotrophic lateral sclerosis (ALS) by exploring the reliability and validity of speech-language pathologists' (SLPs') ratings of dysarthric speech. METHOD Ten SLPs listened to speech samples from 52 speakers with ALS and 20 healthy control speakers. SLPs were asked to rate the speech severity of the speakers using five response options: normal, mild, moderate, severe, and profound. Four severity-surrogate measures were also calculated: SLPs transcribed the speech samples for the calculation of speech intelligibility and rated the effort it took to understand the speakers on a visual analog scale. In addition, speaking rate and intelligible speaking rate were calculated for each speaker. Intrarater and interrater reliability were calculated for each measure. We explored the validity of clinician-based severity ratings by comparing them to the severity-surrogate measures. Receiver operating characteristic (ROC) curves were conducted to create optimal cutoff points for defining dysarthria severity categories. RESULTS Intrarater and interrater reliability for the clinician-based severity ratings were excellent and were comparable to reliability for the severity-surrogate measures explored. Clinician severity ratings were strongly associated with all severity-surrogate measures, suggesting strong construct validity. We also provided a range of values for each severity-surrogate measure within each severity category based on the cutoff points obtained from the ROC analyses. CONCLUSIONS Clinician severity ratings of dysarthric speech are reliable and valid. We discuss the underlying challenges that arise when selecting a stratification measure and offer recommendations for a classification scheme when stratifying patients and research participants into speech severity categories.
Collapse
Affiliation(s)
- Kaila L. Stipancic
- Department of Communicative Disorders and Sciences, University at Buffalo, NY
| | - Kira M. Palmer
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
| | - Hannah P. Rowe
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
| | - Yana Yunusova
- Department of Speech-Language Pathology, University of Toronto, Ontario, Canada
| | - James D. Berry
- Sean M. Healey and AMG Center for ALS, Massachusetts General Hospital, Boston
| | - Jordan R. Green
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
13
|
Woisard V, Balaguer M, Fredouille C, Farinas J, Ghio A, Lalain M, Puech M, Astesano C, Pinquier J, Lepage B. Construction of an automatic score for the evaluation of speech disorders among patients treated for a cancer of the oral cavity or the oropharynx: The Carcinologic Speech Severity Index. Head Neck 2021; 44:71-88. [PMID: 34729847 DOI: 10.1002/hed.26903] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 08/15/2021] [Accepted: 10/05/2021] [Indexed: 01/26/2023] Open
Abstract
BACKGROUND Speech disorders impact quality of life for patients treated with oral cavity and oropharynx cancers. However, there is a lack of uniform and applicable methods for measuring the impact on speech production after treatment in this tumor location. OBJECTIVE The objective of this work is to (1) model an automatic severity index of speech applicable in clinical practice, that is equivalent or superior to a severity score obtained by human listeners, via several acoustics parameters extracted (a) directly from speech signal and (b) resulting from speech processing and (2) derive an automatic speech intelligibility classification (i.e., mild, moderate, severe) to predict speech disability and handicap by combining the listener comprehension score with self-reported quality of life related to speech. METHODS Eighty-seven patients treated for cancer of the oral cavity or the oropharynx and 35 controls performed different tasks of speech production and completed questionnaires on speech-related quality of life. The audio recordings were then evaluated by human perception and automatic speech processing. Then, a score was developed through a classic logistic regression model allowing description of the severity of patients' speech disorders. RESULTS Among the group of parameters subject to extraction from automatic processing of the speech signal, six were retained, producing a correlation at 0.87 with the perceptual reference score, 0.77 with the comprehension score, and 0.5 with speech-related quality of life. The parameters that contributed the most are based on automatic speech recognition systems. These are mainly the automatic average normalized likelihood score on a text reading task and the score of cumulative rankings on pseudowords. The reduced automatic YC2SI is modeled in this way: YC2SIp = 11.48726 + (1.52926 × Xaveraged normalized likelihood reading ) + (-1.94e-06 × Xscore of cumulative ranks pseudowords ). CONCLUSION Automatic processing of speech makes it possible to arrive at valid, reliable, and reproducible parameters able to serve as references in the framework of follow-up of patients treated for cancer of the oral cavity or the oropharynx.
Collapse
Affiliation(s)
- Virginie Woisard
- ENT Department, University Hospital of Toulouse, Toulouse, France.,Oncorehabilation Unit, University Institute of Cancer of Toulouse Oncopole, Toulouse, France.,Laboratoire Octogone-Lordat, Jean Jaures University Toulouse II, Toulouse, France
| | - Mathieu Balaguer
- ENT Department, University Hospital of Toulouse, Toulouse, France.,Institut de Recherche en Informatique de Toulouse, CNRS, Paul Sabatier University Toulouse III, Toulouse, France
| | - Corinne Fredouille
- Laboratoire d'Informatique d'Avignon, Avignon University, Avignon, France
| | - Jérôme Farinas
- Oncorehabilation Unit, University Institute of Cancer of Toulouse Oncopole, Toulouse, France
| | - Alain Ghio
- Laboratoire Parole et Langage, Aix-Marseille University, Marseille, France
| | - Muriel Lalain
- Laboratoire Parole et Langage, Aix-Marseille University, Marseille, France
| | - Michèle Puech
- ENT Department, University Hospital of Toulouse, Toulouse, France.,Oncorehabilation Unit, University Institute of Cancer of Toulouse Oncopole, Toulouse, France
| | - Corine Astesano
- Laboratoire Octogone-Lordat, Jean Jaures University Toulouse II, Toulouse, France
| | - Julien Pinquier
- Oncorehabilation Unit, University Institute of Cancer of Toulouse Oncopole, Toulouse, France
| | - Benoît Lepage
- ENT Department, University Hospital of Toulouse, Toulouse, France.,USMR, Université Paul Sabatier Toulouse III, Toulouse, France
| |
Collapse
|
14
|
Wisler A, Teplansky K, Heitzman D, Wang J. The Effects of Symptom Onset Location on Automatic Amyotrophic Lateral Sclerosis Detection Using the Correlation Structure of Articulatory Movements. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2276-2286. [PMID: 33647219 PMCID: PMC8740667 DOI: 10.1044/2020_jslhr-20-00288] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 09/22/2020] [Accepted: 11/19/2020] [Indexed: 06/12/2023]
Abstract
Purpose Kinematic measurements of speech have demonstrated some success in automatic detection of early symptoms of amyotrophic lateral sclerosis (ALS). In this study, we examined how the region of symptom onset (bulbar vs. spinal) affects the ability of data-driven models to detect ALS. Method We used a correlation structure of articulatory movements combined with a machine learning model (i.e., artificial neural network) to detect differences between people with ALS and healthy controls. The performance of this system was evaluated separately for participants with bulbar onset and spinal onset to examine how region of onset affects classification performance. We then performed a regression analysis to examine how different severity measures and region of onset affects model performance. Results The proposed model was significantly more accurate in classifying the bulbar-onset participants, achieving an area under the curve of 0.809 relative to the 0.674 achieved for spinal-onset participants. The regression analysis, however, found that differences in classifier performance across participants were better explained by their speech performance (intelligible speaking rate), and no significant differences were observed based on region of onset when intelligible speaking rate was accounted for. Conclusions Although we found a significant difference in the model's ability to detect ALS depending on the region of onset, this disparity can be primarily explained by observable differences in speech motor symptoms. Thus, when the severity of speech symptoms (e.g., intelligible speaking rate) was accounted for, symptom onset location did not affect the proposed computational model's ability to detect ALS.
Collapse
Affiliation(s)
- Alan Wisler
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
| | - Kristin Teplansky
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
| | | | - Jun Wang
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
- Department of Neurology, Dell Medical School, The University of Texas at Austin
| |
Collapse
|
15
|
Fernandes F, Barbalho I, Barros D, Valentim R, Teixeira C, Henriques J, Gil P, Dourado Júnior M. Biomedical signals and machine learning in amyotrophic lateral sclerosis: a systematic review. Biomed Eng Online 2021; 20:61. [PMID: 34130692 PMCID: PMC8207575 DOI: 10.1186/s12938-021-00896-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 06/09/2021] [Indexed: 12/11/2022] Open
Abstract
INTRODUCTION The use of machine learning (ML) techniques in healthcare encompasses an emerging concept that envisages vast contributions to the tackling of rare diseases. In this scenario, amyotrophic lateral sclerosis (ALS) involves complexities that are yet not demystified. In ALS, the biomedical signals present themselves as potential biomarkers that, when used in tandem with smart algorithms, can be useful to applications within the context of the disease. METHODS This Systematic Literature Review (SLR) consists of searching for and investigating primary studies that use ML techniques and biomedical signals related to ALS. Following the definition and execution of the SLR protocol, 18 articles met the inclusion, exclusion, and quality assessment criteria, and answered the SLR research questions. DISCUSSIONS Based on the results, we identified three classes of ML applications combined with biomedical signals in the context of ALS: diagnosis (72.22%), communication (22.22%), and survival prediction (5.56%). CONCLUSIONS Distinct algorithmic models and biomedical signals have been reported and present promising approaches, regardless of their classes. In summary, this SLR provides an overview of the primary studies analyzed as well as directions for the construction and evolution of technology-based research within the scope of ALS.
Collapse
Affiliation(s)
- Felipe Fernandes
- Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, RN Brazil
| | - Ingridy Barbalho
- Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, RN Brazil
| | - Daniele Barros
- Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, RN Brazil
| | - Ricardo Valentim
- Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, RN Brazil
| | - César Teixeira
- Department of Informatics Engineering, Univ Coimbra, CISUC-Center for Informatics and Systems of the University of Coimbra, Coimbra, Portugal
| | - Jorge Henriques
- Department of Informatics Engineering, Univ Coimbra, CISUC-Center for Informatics and Systems of the University of Coimbra, Coimbra, Portugal
| | - Paulo Gil
- Department of Informatics Engineering, Univ Coimbra, CISUC-Center for Informatics and Systems of the University of Coimbra, Coimbra, Portugal
| | - Mário Dourado Júnior
- Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, RN Brazil
| |
Collapse
|
16
|
Tena A, Claria F, Solsona F, Meister E, Povedano M. Detection of Bulbar Involvement in Patients With Amyotrophic Lateral Sclerosis by Machine Learning Voice Analysis: Diagnostic Decision Support Development Study. JMIR Med Inform 2021; 9:e21331. [PMID: 33688838 PMCID: PMC7991994 DOI: 10.2196/21331] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 10/26/2020] [Accepted: 01/17/2021] [Indexed: 11/13/2022] Open
Abstract
Background Bulbar involvement is a term used in amyotrophic lateral sclerosis (ALS) that refers to motor neuron impairment in the corticobulbar area of the brainstem, which produces a dysfunction of speech and swallowing. One of the earliest symptoms of bulbar involvement is voice deterioration characterized by grossly defective articulation; extremely slow, laborious speech; marked hypernasality; and severe harshness. Bulbar involvement requires well-timed and carefully coordinated interventions. Therefore, early detection is crucial to improving the quality of life and lengthening the life expectancy of patients with ALS who present with this dysfunction. Recent research efforts have focused on voice analysis to capture bulbar involvement. Objective The main objective of this paper was (1) to design a methodology for diagnosing bulbar involvement efficiently through the acoustic parameters of uttered vowels in Spanish, and (2) to demonstrate that the performance of the automated diagnosis of bulbar involvement is superior to human diagnosis. Methods The study focused on the extraction of features from the phonatory subsystem—jitter, shimmer, harmonics-to-noise ratio, and pitch—from the utterance of the five Spanish vowels. Then, we used various supervised classification algorithms, preceded by principal component analysis of the features obtained. Results To date, support vector machines have performed better (accuracy 95.8%) than the models analyzed in the related work. We also show how the model can improve human diagnosis, which can often misdiagnose bulbar involvement. Conclusions The results obtained are very encouraging and demonstrate the efficiency and applicability of the automated model presented in this paper. It may be an appropriate tool to help in the diagnosis of ALS by multidisciplinary clinical teams, in particular to improve the diagnosis of bulbar involvement.
Collapse
Affiliation(s)
- Alberto Tena
- Information and Communication Technologies Group, International Centre for Numerical Methods in Engineering, Barcelona, Spain
| | - Francec Claria
- Department of Computer Science, Universitat de Lleida, Lleida, Spain
| | - Francesc Solsona
- Department of Computer Science, Universitat de Lleida, Lleida, Spain
| | - Einar Meister
- Institute of Cybernetics, Tallinn University of Technology, Tallinn, Estonia
| | - Monica Povedano
- Motoneuron Functional Unit, Hospital Universitari de Bellvitge, Barcelona, Spain
| |
Collapse
|
17
|
Goyal NA, Berry JD, Windebank A, Staff NP, Maragakis NJ, van den Berg LH, Genge A, Miller R, Baloh RH, Kern R, Gothelf Y, Lebovits C, Cudkowicz M. Addressing heterogeneity in amyotrophic lateral sclerosis CLINICAL TRIALS. Muscle Nerve 2020; 62:156-166. [PMID: 31899540 PMCID: PMC7496557 DOI: 10.1002/mus.26801] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 12/30/2019] [Accepted: 12/31/2019] [Indexed: 12/12/2022]
Abstract
Amyotrophic lateral sclerosis (ALS) is a debilitating neurodegenerative disorder with complex biology and significant clinical heterogeneity. Many preclinical and early phase ALS clinical trials have yielded promising results that could not be replicated in larger phase 3 confirmatory trials. One reason for the lack of reproducibility may be ALS biological and clinical heterogeneity. Therefore, in this review, we explore sources of ALS heterogeneity that may reduce statistical power to evaluate efficacy in ALS trials. We also review efforts to manage clinical heterogeneity, including use of validated disease outcome measures, predictive biomarkers of disease progression, and individual clinical risk stratification. We propose that personalized prognostic models with use of predictive biomarkers may identify patients with ALS for whom a specific therapeutic strategy may be expected to be more successful. Finally, the rapid application of emerging clinical and biomarker strategies may reduce heterogeneity, increase trial efficiency, and, in turn, accelerate ALS drug development.
Collapse
Affiliation(s)
| | - James D. Berry
- Healey Center at Massachusetts General HospitalBostonMassachusetts
| | | | | | | | | | - Angela Genge
- Montreal Neurological Institute and HospitalMontreal, QuebecCanada
| | - Robert Miller
- California Pacific Medical CenterSan FranciscoCalifornia
| | - Robert H. Baloh
- Robert H. Baloh, Cedars‐Sinai Medical CenterCaliforniaLos Angeles
| | - Ralph Kern
- Brainstorm Cell TherapeuticsNew YorkNew York
| | | | | | - Merit Cudkowicz
- Healey Center at Massachusetts General HospitalBostonMassachusetts
| |
Collapse
|
18
|
Wisler AA, Fletcher AR, McAuliffe MJ. Predicting Montreal Cognitive Assessment Scores From Measures of Speech and Language. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1752-1761. [PMID: 32459131 DOI: 10.1044/2020_jslhr-19-00183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose This study examined the relationship between measurements derived from spontaneous speech and participants' scores on the Montreal Cognitive Assessment. Method Participants (N = 521) aged between 64 and 97 years completed the cognitive assessment and were prompted to describe an early childhood memory. A range of acoustic and linguistic measures was extracted from the resulting speech sample. A least absolute shrinkage and selection operator approach was used to model the relationship between acoustic, lexical, and demographic information and participants' scores on the cognitive assessment. Results Using the covariance test statistic, four important variables were identified, which, together, explained 16.52% of the variance in participants' cognitive scores. Conclusions The degree to which cognition can be accurately predicted through spontaneously produced speech samples is limited. Statistically significant relationships were found between specific measurements of lexical variation, participants' speaking rate, and their scores on the Montreal Cognitive Assessment.
Collapse
Affiliation(s)
- Alan A Wisler
- New Zealand Institute of Language, Brain and Behaviour, Christchurch, New Zealand
| | - Annalise R Fletcher
- Department of Audiology and Speech-Language Pathology, University of North Texas, Denton
| | - Megan J McAuliffe
- Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand
| |
Collapse
|
19
|
Generative Adversarial Network-Based Neural Audio Caption Model for Oral Evaluation. ELECTRONICS 2020. [DOI: 10.3390/electronics9030424] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Oral evaluation is one of the most critical processes in children’s language learning. Traditionally, the Scoring Rubric is widely used in oral evaluation for providing a ranking score by assessing word accuracy, phoneme accuracy, fluency, and accent position of a tester. In recent years, by the emerging demands of the market, oral evaluation requires not only providing a single score from pronunciation but also in-depth, meaning comments based on content, context, logic, and understanding. However, the Scoring Rubric requires massive human work (oral evaluation experts) to provide such deep meaning comments. It is considered uneconomical and inefficient in the current market. Therefore, this paper proposes an automated expert comment generation approach for oral evaluation. The approach first extracts the oral features from the children’s audio as well as the text features from the corresponding expert comments. Then, a Gated Recurrent Unit (GRU) is applied to encode the oral features into the model. Afterwards, a Long Short-Term Memory (LSTM) model is applied to train the mappings between oral features and text features and generate expert comments for the new coming oral audio. Finally, a Generative Adversarial Network (GAN) is combined to improve the quality of the generated comments. It generates pseudo-comments to train the discriminator to recognize the human-like comments. The proposed approach is evaluated in a real-world audio dataset (children oral audio) collected by our collaborative company. The proposed approach is also integrated into a commercial application to generate expert comments for children’s oral evaluation. The experimental results and the lessons learned from real-world applications show that the proposed approach is effective for providing meaningful comments for oral evaluation.
Collapse
|
20
|
Barnett C, Green JR, Marzouqah R, Stipancic KL, Berry JD, Korngut L, Genge A, Shoesmith C, Briemberg H, Abrahao A, Kalra S, Zinman L, Yunusova Y. Reliability and validity of speech & pause measures during passage reading in ALS. Amyotroph Lateral Scler Frontotemporal Degener 2020; 21:42-50. [PMID: 32138555 PMCID: PMC7080316 DOI: 10.1080/21678421.2019.1697888] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 11/11/2019] [Indexed: 10/25/2022]
Abstract
Objective: The use of speech measures is becoming a common practice in the assessment of bulbar disease progression in amyotrophic lateral sclerosis (ALS). This study aimed to establish psychometric properties (e.g. reliability, validity, sensitivity, specificity) of speech and pause timing measures during a standardized passage. Methods: A large number of passage recordings (ALS N = 775; Neurotypical controls N = 323) was analyzed using a semi-automatic method (Speech and Pause Analysis, SPA). Results: The results revealed acceptable reliability of the speech and pause measures across repeated recording by the control participants. Strong construct validity was established via significant group differences between patients and controls and correlation statistics with clinical measures of overall ALS and bulbar disease severity. Speaking rate, pause events, and mean pause duration were able to detect ALS participants at the presymptomatic stage of bulbar disease with a good discrimination ability (AUC 0.81). Conclusions: Based on the current psychometric evaluation, performing passage recording and speech and pause timing analysis was deemed useful for detecting early and progressive changes associated with bulbar ALS.
Collapse
Affiliation(s)
- Carolina Barnett
- Division of Neurology, Department of Medicine, University of Toronto and University Health Network, Toronto, Canada
- Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
| | - Jordan R Green
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA, USA
- Speech and Hearing Biosciences and Technology Program, Harvard University, Cambridge, MA, USA
| | - Reeman Marzouqah
- Department of Speech-Language Pathology, University of Toronto, Toronto, Canada
| | - Kaila L Stipancic
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA, USA
| | - James D Berry
- Harvard Medical School, Department of Neurology, Massachusetts General Hospital (MGH), Boston, Massachusetts, USA,
| | - Lawrence Korngut
- Department of Clinical Neurosciences, Hotchkiss Brain Institute, University of Calgary, Calgary, Canada
| | - Angela Genge
- Montreal Neurological Institute, Neurosurgery, McGill University, Montreal, Canada
| | - Christen Shoesmith
- Department of Clinical Neurological Sciences, University of Western Ontario, London, Canada
| | - Hannah Briemberg
- Division of Neurology, University of British Columbia, Vancouver, Canada
| | - Agessandro Abrahao
- Department of Medicine, Division of Neurology, Sunnybrook Health Sciences Centre, Toronto, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada
| | - Sanjay Kalra
- Neuroscience and Mental Health Institute, University of Alberta, Edmonton, Canada
- Division of Neurology, University of Alberta, Edmonton, Canada
| | - Lorne Zinman
- Department of Medicine, Division of Neurology, Sunnybrook Health Sciences Centre, Toronto, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada
- L.C. Campbell Cognitive Neurology Research Unit, Sunnybrook Research Institute, University of Toronto, Toronto, Canada, and
| | - Yana Yunusova
- Department of Speech-Language Pathology, University of Toronto, Toronto, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Canada
- Toronto Rehabilitation Institute, University Health Network, Toronto, Canada
| |
Collapse
|
21
|
Chiaramonte R, Bonfiglio M. Acoustic analysis of voice in bulbar amyotrophic lateral sclerosis: a systematic review and meta-analysis of studies. LOGOP PHONIATR VOCO 2019; 45:151-163. [DOI: 10.1080/14015439.2019.1687748] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Rita Chiaramonte
- Department of Physical Medicine and Rehabilitation, University of Catania, Catania, Italy
| | - Marco Bonfiglio
- Department for Health Activities, ASP Siracusa, Siracusa, Italy
| |
Collapse
|