1
Rong P, Heidrick L, Pattee GL. A multimodal approach to automated hierarchical assessment of bulbar involvement in amyotrophic lateral sclerosis. Front Neurol 2024; 15:1396002. [PMID: 38836001 PMCID: PMC11148322 DOI: 10.3389/fneur.2024.1396002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/05/2024] [Accepted: 05/01/2024] [Indexed: 06/06/2024] [Open access]
Abstract
Introduction: As a hallmark feature of amyotrophic lateral sclerosis (ALS), bulbar involvement leads to progressive declines in speech and swallowing functions, significantly impacting social, emotional, and physical health, and quality of life. Standard clinical tools for bulbar assessment focus primarily on clinical symptoms and functional outcomes. However, ALS is known to have a long, clinically silent prodromal stage characterized by complex subclinical changes at various levels of the bulbar motor system. These changes accumulate over time and eventually culminate in clinical symptoms and functional declines. Detection of these subclinical changes is critical, both for mechanistic understanding of bulbar neuromuscular pathology and for optimal clinical management of bulbar dysfunction in ALS. To this end, we developed a novel multimodal measurement tool based on two readily available, noninvasive clinical instruments, facial surface electromyography (sEMG) and acoustic techniques, to hierarchically assess seven constructs of bulbar/speech motor control at the neuromuscular and acoustic levels. These constructs (prosody, pause, functional connectivity, amplitude, rhythm, complexity, and regularity) are both mechanically and clinically relevant to bulbar involvement.

Methods: Using a custom-developed, fully automated data analytic algorithm, a variety of features were extracted from the sEMG and acoustic recordings of a speech task performed by 13 individuals with ALS and 10 neurologically healthy controls. These features were then factorized into 10 composite outcome measures using confirmatory factor analysis. Statistical and machine learning techniques were applied to these composite outcome measures to evaluate their reliability (internal consistency), validity (concurrent and construct), and efficacy for early detection and progress monitoring of bulbar involvement in ALS.

Results: The composite outcome measures were demonstrated to (1) be internally consistent and structurally valid in measuring the targeted constructs; (2) hold concurrent validity with the existing clinical and functional criteria for bulbar assessment; and (3) outperform the outcome measures obtained from each constituent modality in differentiating individuals with ALS from healthy controls. Moreover, the composite outcome measures combined demonstrated high efficacy for detecting subclinical changes in the targeted constructs, both during the prodromal stage and during the transition from prodromal to symptomatic stages.

Discussion: The findings provided compelling initial evidence for the utility of the multimodal measurement tool for improving early detection and progress monitoring of bulbar involvement in ALS, with important implications for facilitating timely access to and delivery of optimal clinical care for bulbar dysfunction.
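The abstract reports that the composite measures were internally consistent but does not name the statistic used. As a minimal sketch, assuming Cronbach's alpha (a common internal-consistency coefficient, not confirmed as the authors' choice), the calculation could look like the following. The function name and the feature scores are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of item columns.

    Each column holds one score per participant; all columns must have
    the same length. This is an illustrative helper, not the study's code.
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    # Total score per participant across all items
    totals = [sum(scores) for scores in zip(*items)]
    # Sum of the individual item variances (sample variance)
    item_var_sum = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_var_sum / variance(totals))


# Hypothetical scores for three features across five participants
features = [
    [2.0, 3.0, 4.0, 5.0, 6.0],
    [2.5, 3.5, 4.0, 5.5, 6.5],
    [1.5, 3.0, 4.5, 5.0, 6.0],
]
alpha = cronbach_alpha(features)
print(f"Cronbach's alpha = {alpha:.3f}")
```

Values close to 1 indicate that the features grouped under one composite measure vary together across participants.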
Affiliation(s)
- Panying Rong
  - Department of Speech-Language-Hearing: Sciences and Disorders, University of Kansas, Lawrence, KS, United States
- Lindsey Heidrick
  - Department of Hearing and Speech, University of Kansas Medical Center, Kansas City, KS, United States
- Gary L Pattee
  - Neurology Associate P.C., Lincoln, NE, United States

2
Han X, Bai Z, Mogushi K, Hase T, Takeuchi K, Iida Y, Sumita YI, Wakabayashi N. Machine Learning Prediction of Tongue Pressure in Elderly Patients with Head and Neck Tumor: A Cross-Sectional Study. J Clin Med 2024; 13:2363. [PMID: 38673635 PMCID: PMC11051183 DOI: 10.3390/jcm13082363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2024] [Revised: 04/05/2024] [Accepted: 04/07/2024] [Indexed: 04/28/2024] [Open access]
Abstract
Background: This investigation sought to cross-validate the predictors of tongue pressure recovery in elderly patients after treatment for head and neck tumors, using machine learning techniques.

Methods: By employing logistic regression, support vector regression, random forest, and extreme gradient boosting, the study analyzed an array of variables including patient demographics, surgery types, dental health status, and age, drawn from comprehensive medical records and direct tongue pressure assessments.

Results: Among the models, logistic regression emerged as the most effective, demonstrating an accuracy of 0.630 [95% confidence interval (CI): 0.370-0.778], F1 score of 0.688 [95% CI: 0.435-0.853], precision of 0.611 [95% CI: 0.313-0.801], recall of 0.786 [95% CI: 0.413-0.938], and an area under the receiver operating characteristic curve of 0.626 [95% CI: 0.409-0.806]. This model highlighted glossectomy (p = 0.039), the presence of functional teeth (p = 0.043), and the patient's age (p = 0.044) as significant factors influencing tongue pressure, with the threshold for statistical significance set at p < 0.05.

Conclusions: The analysis underscored the role of glossectomy, the presence of functional natural teeth, and age as determinants of tongue pressure in logistic regression, with the presence of natural teeth and a tumor site located in the tongue consistently emerging as the key predictors across all computational models employed in this study.
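The reported F1 score can be cross-checked against the reported precision and recall, since F1 is the harmonic mean of the two. The short sketch below does that check; the helper name `f1_score` is an illustrative choice and not part of the study's pipeline, and only the point estimates from the abstract are used.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Point estimates reported in the abstract for logistic regression
reported_precision = 0.611
reported_recall = 0.786

f1 = f1_score(reported_precision, reported_recall)
print(round(f1, 3))
```

The result rounds to the reported value of 0.688, so the three point estimates are mutually consistent. The confidence intervals cannot be reproduced this way because they depend on the underlying predictions.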
Affiliation(s)
- Xuewei Han
  - Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan
- Ziyi Bai
  - Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan
- Kaoru Mogushi
  - Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan
- Takeshi Hase
  - Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan
  - Faculty of Pharmacy, Keio University, Tokyo 1088345, Japan
  - Center for Mathematical Modelling and Data Science, Osaka University, Osaka 5608531, Japan
  - The Systems Biology Institute, Tokyo 1410022, Japan
- Katsuyuki Takeuchi
  - Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan
- Yoritsugu Iida
  - Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan
- Yuka I. Sumita
  - Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan
  - Department of Partial and Complete Denture, The Nippon Dental University School of Life Dentistry, Tokyo 1028159, Japan
- Noriyuki Wakabayashi
  - Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan

3
Rong P, Rasmussen L. A Fine-Grained Temporal Analysis of Multimodal Oral Diadochokinetic Performance to Assess Speech Impairment in Amyotrophic Lateral Sclerosis. American Journal of Speech-Language Pathology 2024; 33:307-332. [PMID: 38064644 DOI: 10.1044/2023_ajslp-23-00177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/05/2024]
Abstract
PURPOSE: This study used a semiautomated fine-grained temporal analysis to extract features of temporal oral diadochokinetic (DDK) performance across multiple modalities and tasks, from neurologically healthy individuals and individuals with impairment secondary to amyotrophic lateral sclerosis (ALS). The aims were to (a) delineate temporal oral DDK deficits relating to the neuromotor pathology of ALS and (b) identify the optimal task-feature combinations to detect speech impairment in ALS.

METHOD: Mandibular myoelectric, kinematic, and acoustic data were acquired from 13 individuals with ALS and 10 healthy controls producing three alternating motion rate tasks and one sequential motion rate task. Twenty-seven features were extracted from the multimodal data, characterizing three temporal constructs: duration/rate, variability, and coordination. The disease impacts on these features were assessed across tasks, and the task eliciting the greatest disease-related change was identified for each feature. Such "optimal" task-feature combinations were fed into logistic regression to differentiate individuals with ALS from healthy controls.

RESULTS: Temporal deficits in ALS were characterized by (a) increased duration and variability and reduced coordination of jaw muscle activities, (b) increased duration and variability and altered temporal symmetry of the jaw velocity profile, (c) increased muscle-burst-to-peak-velocity duration, and (d) increased motion-to-voice onset duration. These temporal features were differentially affected across tasks. The optimal task-feature combinations, which were further clustered into three composite factors reflecting temporal variability, coarser-grained duration, and finer-grained duration, differentiated ALS from controls with an F1 score of 0.86 (precision = 1.00, recall = 0.75).

CONCLUSIONS: Temporal oral DDK deficits are likely attributable to a hierarchy of interrelated neurophysiological and biomechanical factors associated with the neuromotor pathology of ALS. These deficits, as assessed crossmodally, provide previously unavailable insights into the multifaceted timing impairment of oromotor performance in ALS. The optimal task-feature combinations targeting these deficits show promise as quantitative markers for (early) detection of speech impairment in ALS.
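As a quick consistency check on the reported classification result, the F1 score follows from the stated precision and recall through the standard harmonic-mean definition. The symbols $P$ and $R$ are introduced here for illustration.

```latex
F_1 = \frac{2PR}{P + R}
    = \frac{2 \times 1.00 \times 0.75}{1.00 + 0.75}
    = \frac{1.50}{1.75}
    \approx 0.857 \approx 0.86
```

A precision of 1.00 means no healthy control was labeled as ALS, while a recall of 0.75 means a quarter of the individuals with ALS were missed by the classifier.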
Affiliation(s)
- Panying Rong
  - Department of Speech-Language-Hearing: Sciences & Disorders, The University of Kansas, Lawrence
- Lily Rasmussen
  - Department of Speech-Language-Hearing: Sciences & Disorders, The University of Kansas, Lawrence

4
Rong P, Benson J. Intergenerational choral singing to improve communication outcomes in Parkinson's disease: Development of a theoretical framework and an integrated measurement tool. International Journal of Speech-Language Pathology 2023; 25:722-745. [PMID: 36106430 DOI: 10.1080/17549507.2022.2110281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023]
Abstract
Purpose: This study presented an initial step towards developing the evidence base for intergenerational choral singing as a communication-focussed rehabilitative approach for Parkinson's disease (PD).

Method: A theoretical framework was established to conceptualise the rehabilitative effect of intergenerational choral singing on four domains of communication impairments (motor drive, timing mechanism, sensorimotor integration, and higher-level cognitive and affective functions), as well as activity/participation and quality of life. A computer-assisted multidimensional acoustic analysis was developed to objectively assess the targeted domains of communication impairments. Voice Handicap Index and the World Health Organization's Quality of Life assessment-abbreviated version were used to obtain patient-reported outcomes at the activity/participation and quality of life levels. As a proof of concept, a single subject with PD was recruited to participate in 9 weekly 1-h intergenerational choir rehearsals. The subject was assessed before, 1 week post, and 8 weeks post-choir.

Result: Notable trends of improvement were observed in multiple domains of communication impairments at 1 week post-choir. Some improvements were maintained at 8 weeks post-choir. Patient-reported outcomes exhibited limited pre-post changes.

Conclusion: This study provided the theoretical groundwork and an empirical measurement tool for future validation of intergenerational choral singing as a novel rehabilitation for PD.
Affiliation(s)
- Panying Rong
  - Department of Speech-Language-Hearing: Sciences & Disorders, University of Kansas, Lawrence, KS, USA and

5
van Brenk F, Lowit A, Tjaden K. Effects of Speaking Rate on Variability of Second Formant Frequency Transitions in Dysarthria. Folia Phoniatr Logop 2023; 76:295-308. [PMID: 37769645 PMCID: PMC10972778 DOI: 10.1159/000534337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2023] [Accepted: 09/26/2023] [Indexed: 10/03/2023] [Open access]
Abstract
INTRODUCTION: This study examined the utility of multiple second formant (F2) slope metrics to capture differences in speech production for individuals with dysarthria and healthy controls as a function of speaking rate. In addition, the utility of F2 slope metrics for predicting severity of intelligibility impairment in dysarthria was examined.

METHODS: Twenty-three speakers with Parkinson's disease and mild to moderate hypokinetic dysarthria (HD), 9 speakers with various neurological diseases and mild to severe ataxic or ataxic-spastic dysarthria (AD), and 26 age-matched healthy control speakers (CON) participated in a sentence repetition task. Sentences were produced at habitual, fast, and slow speaking rates. A variety of metrics were derived from the rising F2 transition portion of the diphthong /ai/. To obtain measures of intelligibility for the two clinical speaker groups, 15 undergraduate SLP students participated in a transcription experiment.

RESULTS: Significantly shallower slopes were found for the speakers with HD compared to control speakers. Steeper F2 slopes were associated with increased speaking rate for all groups. Higher variability in F2 slope metrics was found for the speakers with AD compared to the two other speaker groups. For both clinical speaker groups, there was a negative association between intelligibility and F2 slope variability metrics, indicating that lower variability in speech production was associated with higher intelligibility.

DISCUSSION: F2 slope metrics were sensitive to dysarthria presence, dysarthria type, and speaking rate. The current study provided evidence that F2 slope variability measures add value beyond F2 slope averaged measures for predicting severity of intelligibility impairment in dysarthria.
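The abstract does not list the specific F2 slope metrics, so the following is only a minimal sketch of the general idea. It assumes slope is defined as the F2 change over the transition divided by its duration, and that variability is summarized by the coefficient of variation across repeated tokens. The token values and function name are hypothetical and do not come from the study.

```python
from statistics import mean, stdev


def f2_slope(f2_onset_hz, f2_offset_hz, duration_ms):
    """F2 transition slope in Hz/ms for one token (illustrative definition)."""
    if duration_ms <= 0:
        raise ValueError("duration must be positive")
    return (f2_offset_hz - f2_onset_hz) / duration_ms


# Hypothetical rising F2 transitions: (onset Hz, offset Hz, duration ms)
tokens = [
    (1200.0, 1900.0, 100.0),
    (1250.0, 1850.0, 120.0),
    (1180.0, 1980.0, 100.0),
]

slopes = [f2_slope(*token) for token in tokens]
mean_slope = mean(slopes)                # averaged slope measure
slope_cv = stdev(slopes) / mean_slope    # variability measure (coefficient of variation)

print(f"mean slope = {mean_slope:.2f} Hz/ms, CV = {slope_cv:.2f}")
```

Separating the averaged measure from the variability measure mirrors the distinction the abstract draws: two speakers can share the same mean slope yet differ in how consistently they produce it.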
Affiliation(s)
- Frits van Brenk
  - Department of Communicative Disorders and Sciences, University at Buffalo, NY, USA
- Anja Lowit
  - School of Psychological Sciences and Health, Strathclyde University, Scotland
- Kris Tjaden
  - Department of Communicative Disorders and Sciences, University at Buffalo, NY, USA

6
Rong P, Heidrick L. Functional Role of Temporal Patterning of Articulation in Speech Production: A Novel Perspective Toward Global Timing-Based Motor Speech Assessment and Rehabilitation. Journal of Speech, Language, and Hearing Research: JSLHR 2022; 65:4577-4607. [PMID: 36399794 DOI: 10.1044/2022_jslhr-22-00089] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 06/16/2023]
Abstract
PURPOSE: This study aimed to (a) relate temporal patterning of articulation to functional speech outcomes in neurologically healthy and impaired speakers, (b) identify changes in temporal patterning of articulation in neurologically impaired speakers, and (c) evaluate how these changes can be modulated by speaking rate manipulation.

METHOD: Thirteen individuals with amyotrophic lateral sclerosis (ALS) and 10 neurologically healthy controls read a sentence 3 times, first at their habitual rate and then at a voluntarily slowed rate. Temporal patterning of articulation was assessed by 24 features characterizing the modulation patterns within (intra) and between (inter) four articulators (tongue tip, tongue body, lower lip, and jaw) at three linguistically relevant, hierarchically nested timescales corresponding to stress, syllable, and onset-rime/phoneme. For Aim 1, the features for the habitual rate condition were factorized and correlated with two functional speech outcomes: speech intelligibility and intelligible speaking rate. For Aims 2 and 3, the features were compared between groups and rate conditions, respectively.

RESULTS: For Aim 1, the modulation features combined were moderately to strongly correlated with intelligibility (R² = .51-.53) and intelligible speaking rate (R² = .63-.73). For Aim 2, intra-articulator modulation was impaired in ALS, manifested by moderate-to-large decreases in modulation depth at all timescales and cross-timescale phase synchronization. Interarticulator modulation was relatively unaffected. For Aim 3, voluntary rate reduction improved several intra-articulator modulation features identified as being susceptible to the disease effect in individuals with ALS.

CONCLUSIONS: Disrupted temporal patterning of articulation, presumably reflecting impaired articulatory entrainment to linguistic rhythms, may contribute to functional speech declines in ALS. These impairments tend to be improved through voluntary rate reduction, possibly by reshaping the temporal template of motor plans to better accommodate the disease-related neuromechanical constraints in the articulatory system. These findings shed light on a novel perspective toward global timing-based motor speech assessment and rehabilitation.
Affiliation(s)
- Panying Rong
  - Department of Speech-Language-Hearing: Sciences & Disorders, The University of Kansas, Lawrence
- Lindsey Heidrick
  - Department of Hearing and Speech, The University of Kansas Medical Center, Kansas City

7
Rong P, Hansen O, Heidrick L. Relationship between rate-elicited changes in muscular-kinematic control strategies and acoustic performance in individuals with ALS - A multimodal investigation. Journal of Communication Disorders 2022; 99:106253. [PMID: 36007484 DOI: 10.1016/j.jcomdis.2022.106253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 06/15/2023]
Abstract
INTRODUCTION: As a key control variable, duration has long been suspected to mediate the organization of speech motor control strategies, which has management implications for neuromotor speech disorders. This study aimed to experimentally delineate the role of duration in organizing speech motor control in neurologically healthy and impaired speakers using a voluntary speaking rate manipulation paradigm.

METHODS: Thirteen individuals with amyotrophic lateral sclerosis (ALS) and 10 healthy controls performed a sentence reading task three times, first at their habitual rate, then at a slower rate. A multimodal approach combining surface electromyography, kinematic, and acoustic technologies was used to record jaw muscle activities, jaw kinematics, and speech acoustics. Six muscular-kinematic features were extracted and factor-analyzed to characterize the organization of the mandibular control hierarchy. Five acoustic features were extracted, measuring the spectrotemporal properties of the diphthong /ɑɪ/ and the plosives /t/ and /k/.

RESULTS: The muscular-kinematic features converged into two interpretable latent factors, reflecting the level and cohesiveness/flexibility of mandibular control, respectively. Voluntary rate reduction led to a trend toward (1) finer, less cohesive, and more flexible mandibular control, and (2) increased range and decreased transition slope of the diphthong formants, across neurologically healthy and impaired groups. Differential correlations were found between the rate-elicited changes in mandibular control and acoustic performance for neurologically healthy and impaired speakers.

CONCLUSIONS: The results provided empirical evidence for the long-suspected but previously unsubstantiated role of duration in (re)organizing speech motor control strategies. The rate-elicited reorganization of muscular-kinematic control contributed to the acoustic performance of healthy speakers, in ways consistent with theoretical predictions. Such contributions were less consistent in impaired speakers, implying the complex nature of speaking rate reduction in ALS, possibly reflecting an interplay of disease-related constraints and volitional duration control. This information may help to stratify and identify candidates for rate manipulation therapy.
Affiliation(s)
- Panying Rong
  - Department of Speech-Language-Hearing: Sciences & Disorders, University of Kansas, Lawrence KS, USA
- Olivia Hansen
  - Department of Speech-Language-Hearing: Sciences & Disorders, University of Kansas, Lawrence KS, USA
  - Department of Hearing & Speech, University of Kansas Medical Center, Kansas City, KS, USA
- Lindsey Heidrick
  - Department of Hearing & Speech, University of Kansas Medical Center, Kansas City, KS, USA

8
Rong P, Pattee GL. A multidimensional facial surface EMG analysis for objective assessment of bulbar involvement in amyotrophic lateral sclerosis. Clin Neurophysiol 2022; 135:74-84. [DOI: 10.1016/j.clinph.2021.11.074] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 08/09/2021] [Revised: 11/01/2021] [Accepted: 11/07/2021] [Indexed: 11/03/2022]