1
|
Meyer T, Favaro A, Oh ES, Butala A, Motley C, Irazoqui P, Dehak N, Moro-Velázquez L. Deep Stroop: Integrating eye tracking and speech processing to characterize people with neurodegenerative disorders while performing neuropsychological tests. Comput Biol Med 2025; 184:109398. [PMID: 39616880 DOI: 10.1016/j.compbiomed.2024.109398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 10/09/2024] [Accepted: 11/07/2024] [Indexed: 12/22/2024]
Abstract
Neurodegenerative diseases (NDs) can be difficult to precisely characterize and monitor as they present complex and overlapping signs despite affecting different neural circuits. Neuropsychological tests are important tools for assessing signs, but only considering patient-generated output can limit insight. Here, we present an improvement to the neuropsychological test evaluation paradigm by deeply characterizing patient interaction and behavior during tests based on multiple perspectives alongside typically evaluated output by performing multi-modal analysis of eye movement and speech data. Using the well-known Stroop Test, we compare behaviors of healthy controls to patients with Alzheimer's Disease (AD), Mild Cognitive Impairment, Parkinson's Disease (PD), and secondary Parkinsonism. We maximize accessibility and reproducibility by automatically extracting metrics, including eye motor behavior, speech patterns, and multimodal interplay, with almost no human input required. We find many metrics including increased horizontal saccade distances sensitive to all NDs, delayed task initiation in AD, response error patterns and blinking patterns that differ between AD and PD. Our metrics show both significantly different distributions between disease groups and simultaneous correlation with the MoCA and MDS-UPDRS-III clinical rating scales. Our findings show the utility of incorporating several perspectives into one output representation, as our metric breadth creates unique sign profiles that quantify and visualize a patient's dysfunction. These metrics provide much better sign characterization between diseases and correlation with disease severity than traditional Stroop measures. This methodology offers the potential to expand its application to other traditional neuropsychological tests, shifting the paradigm in diagnostic precision for NDs and advancing patient care.
Collapse
Affiliation(s)
- Trevor Meyer
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA.
| | - Anna Favaro
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
| | - Esther S Oh
- Division of Geriatric Medicine and Gerontology, Department of Medicine, The Johns Hopkins School of Medicine, Laurel, MD, USA
| | - Ankur Butala
- Department of Neurology, The Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Psychiatry and Behavioral Sciences, The Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Chelsie Motley
- Department of Neurology, The Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Pedro Irazoqui
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, The Johns Hopkins University, Baltimore, MD, USA
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
| | - Laureano Moro-Velázquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
2
|
Lim WS, Chiu SI, Peng PL, Jang JSR, Lee SH, Lin CH, Kim HJ. A cross-language speech model for detection of Parkinson's disease. J Neural Transm (Vienna) 2024:10.1007/s00702-024-02874-z. [PMID: 39739129 DOI: 10.1007/s00702-024-02874-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Accepted: 12/17/2024] [Indexed: 01/02/2025]
Abstract
Speech change is a biometric marker for Parkinson's disease (PD). However, evaluating speech variability across diverse languages is challenging. We aimed to develop a cross-language algorithm differentiating between PD patients and healthy controls using a Taiwanese and Korean speech data set. We recruited 299 healthy controls and 347 patients with PD from Taiwan and Korea. Participants with PD underwent smartphone-based speech recordings during the "on" phase. Each Korean participant performed various speech texts, while the Taiwanese participant read a standardized, fixed-length article. Korean short-speech (≦15 syllables) and long-speech (> 15 syllables) recordings were combined with the Taiwanese speech dataset. The merged dataset was split into a training set (controls vs. early-stage PD) and a validation set (controls vs. advanced-stage PD) to evaluate the model's effectiveness in differentiating PD patients from controls across languages based on speech length. Numerous acoustic and linguistic speech features were extracted and combined with machine learning algorithms to distinguish PD patients from controls. The area under the receiver operating characteristic (AUROC) curve was calculated to assess diagnostic performance. Random forest and AdaBoost classifiers showed an AUROC 0.82 for distinguishing patients with early-stage PD from controls. In the validation cohort, the random forest algorithm maintained this value (0.90) for discriminating advanced-stage PD patients. The model showed superior performance in the combined language cohort (AUROC 0.90) than either the Korean (AUROC 0.87) or Taiwanese (AUROC 0.88) cohorts individually. However, with another merged speech data set of short-speech recordings < 25 characters, the diagnostic performance to identify early-stage PD patients from controls dropped to 0.72 and showed a further limited ability to discriminate advanced-stage patients. Leveraging multifaceted speech features, including both acoustic and linguistic characteristics, could aid in distinguishing PD patients from healthy individuals, even across different languages.
Collapse
Affiliation(s)
- Wee Shin Lim
- Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
| | - Shu-I Chiu
- Department of Computer Science, National Chengchi University, Taipei, Taiwan
| | - Pei-Ling Peng
- Department of Neurology, College of Medicine, National Taiwan University Hospital, National Taiwan University, Taipei, 100, Taiwan
| | - Jyh-Shing Roger Jang
- Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
| | - Sol-Hee Lee
- Department of Neurology, Seoul National University Hospital and Seoul National University College of Medicine, Seoul, Korea
| | - Chin-Hsien Lin
- Department of Neurology, College of Medicine, National Taiwan University Hospital, National Taiwan University, Taipei, 100, Taiwan.
- Colleague of Medicine, National Taiwan University, Taipei, Taiwan.
- Department of Biomedical Engineering, National Taiwan University, Taipei, Taiwan.
- Institute of Molecular Medicine, College of Medicine, National Taiwan University, Taipei, Taiwan.
| | - Han-Joon Kim
- Department of Neurology, Seoul National University Hospital and Seoul National University College of Medicine, Seoul, Korea.
| |
Collapse
|
3
|
Favaro A, Butala A, Thebaud T, Villalba J, Dehak N, Moro-Velázquez L. Unveiling early signs of Parkinson's disease via a longitudinal analysis of celebrity speech recordings. NPJ Parkinsons Dis 2024; 10:207. [PMID: 39465276 PMCID: PMC11514279 DOI: 10.1038/s41531-024-00817-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Accepted: 10/08/2024] [Indexed: 10/29/2024] Open
Abstract
Numerous studies proposed methods to detect Parkinson's disease (PD) via speech analysis. However, existing corpora often lack prodromal recordings, have small sample sizes, and lack longitudinal data. Speech samples from celebrities who publicly disclosed their PD diagnosis provide longitudinal data, allowing the creation of a new corpus, ParkCeleb. We collected videos from 40 subjects with PD and 40 controls and analyzed evolving speech features from 10 years before to 20 years after diagnosis. Our longitudinal analysis, focused on 15 subjects with PD and 15 controls, revealed features like pitch variability, pause duration, speech rate, and syllable duration, indicating PD progression. Early dysarthria patterns were detectable in the prodromal phase, with the best classifiers achieving AUCs of 0.72 and 0.75 for data collected ten and five years before diagnosis, respectively, and 0.93 post-diagnosis. This study highlights the potential for early detection methods, aiding treatment response identification and screening in clinical trials.
Collapse
Affiliation(s)
- Anna Favaro
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA.
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA.
| | - Ankur Butala
- Department of Neurology, Johns Hopkins Medicine, Baltimore, MD, USA
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins Medicine, Baltimore, MD, USA
| | - Thomas Thebaud
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| | - Jesús Villalba
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| | - Laureano Moro-Velázquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
4
|
Pinto S, Cardoso R, Atkinson-Clement C, Guimarães I, Sadat J, Santos H, Mercier C, Carvalho J, Cuartero MC, Oliveira P, Welby P, Frota S, Cavazzini E, Vigário M, Letanneux A, Cruz M, Brulefert C, Desmoulins M, Martins IP, Rothe-Neves R, Viallet F, Ferreira JJ. Do Acoustic Characteristics of Dysarthria in People With Parkinson's Disease Differ Across Languages? JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:2822-2841. [PMID: 38754039 DOI: 10.1044/2024_jslhr-23-00525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
PURPOSE Cross-language studies suggest more similarities than differences in how dysarthria affects the speech of people with Parkinson's disease (PwPD) who speak different languages. In this study, we aimed to identify the relative contribution of acoustic variables to distinguish PwPD from controls who spoke varieties of two Romance languages, French and Portuguese. METHOD This bi-national, cross-sectional, and case-controlled study included 129 PwPD and 124 healthy controls who spoke French or Portuguese. All participants underwent the same clinical examinations, voice/speech recordings, and self-assessment questionnaires. PwPD were evaluated off and on optimal medication. Inferential analyses included Disease (controls vs. PwPD) and Language (French vs. Portuguese) as factors, and random decision forest algorithms identified relevant acoustic variables able to distinguish participants: (a) by language (French vs. Portuguese) and (b) by clinical status (PwPD on and off medication vs. controls). RESULTS French-speaking and Portuguese-speaking individuals were distinguished from each other with over 90% accuracy by five acoustic variables (the mean fundamental frequency and the shimmer of the sustained vowel /a/ production, the oral diadochokinesis performance index, the relative sound level pressure and the relative sound pressure level standard deviation of the text reading). A distinct set of parameters discriminated between controls and PwPD: for men, maximum phonation time and the oral diadochokinesis speech proportion were the most significant variables; for women, variables calculated from the oral diadochokinesis were the most discriminative. CONCLUSIONS Acoustic variables related to phonation and voice quality distinguished between speakers of the two languages. Variables related to pneumophonic coordination and articulation rate were the more effective in distinguishing PwPD from controls. Thus, our research findings support that respiration and diadochokinesis tasks appear to be the most appropriate to pinpoint signs of dysarthria, which are largely homogeneous and language-universal. In contrast, identifying language-specific variables with the speech tasks and acoustic variables studied was less conclusive.
Collapse
Affiliation(s)
- Serge Pinto
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
| | - Rita Cardoso
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
- Instituto de Medicina Molecular, Faculdade de Medicina, University of Lisbon, Portugal
| | - Cyril Atkinson-Clement
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
- Precision Imaging Beacon, School of Medicine, University of Nottingham, United Kingdom
| | - Isabel Guimarães
- Instituto de Medicina Molecular, Faculdade de Medicina, University of Lisbon, Portugal
- Speech Therapy Department, Alcoitão Health School of Sciences, Alcabideche, Portugal
| | - Jasmin Sadat
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
| | - Helena Santos
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
| | - Céline Mercier
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
- Neurology Department, Centre Hospitalier Intercommunal du Pays d'Aix, Aix-en-Provence, France
| | - Joana Carvalho
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
| | | | | | - Pauline Welby
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
| | - Sónia Frota
- Center of Linguistics, School of Arts and Humanities, University of Lisbon, Portugal
| | | | - Marina Vigário
- Center of Linguistics, School of Arts and Humanities, University of Lisbon, Portugal
| | - Alban Letanneux
- ESPE Université Paris-Est Créteil, Laboratoire CHArt-UPEC (EA 4004), Bonneuil-sur-Marne, France
| | - Marisa Cruz
- Center of Linguistics, School of Arts and Humanities, University of Lisbon, Portugal
| | | | | | - Isabel Pavão Martins
- Language Research Laboratory, Department of Neurology, University of Lisbon, Portugal
| | - Rui Rothe-Neves
- Laboratório de Fonética, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - François Viallet
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
- Neurology Department, Centre Hospitalier Intercommunal du Pays d'Aix, Aix-en-Provence, France
| | - Joaquim J Ferreira
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
- Instituto de Medicina Molecular, Faculdade de Medicina, University of Lisbon, Portugal
| |
Collapse
|
5
|
Levy ES, Moya-Galé G. Revisiting Dysarthria Treatment Across Languages: The Hybrid Approach. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:2893-2902. [PMID: 38056466 DOI: 10.1044/2023_jslhr-23-00629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
PURPOSE Ten years after Miller and Lowit's (2014) groundbreaking book providing a cross-linguistic perspective on motor speech disorders, we ask where we are regarding dysarthria treatment across languages in two specific populations: adults with Parkinson's disease (PD) and children with cerebral palsy (CP). METHOD In this commentary, we consider preliminary evidence for both language-independent and language-specific approaches to treatment and propose a hybrid approach to speech treatment across languages, centered on the individual with dysarthria who speaks any given language. CONCLUSIONS Treatment research on individuals with dysarthria secondary to PD and CP is advancing, but several areas remain to be explored. Next steps are suggested for addressing the paucity and complexity of cross-linguistic speech treatment research.
Collapse
|
6
|
Harris C, Tang Y, Birnbaum E, Cherian C, Mendhe D, Chen MH. Digital Neuropsychology beyond Computerized Cognitive Assessment: Applications of Novel Digital Technologies. Arch Clin Neuropsychol 2024; 39:290-304. [PMID: 38520381 PMCID: PMC11485276 DOI: 10.1093/arclin/acae016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 02/16/2024] [Indexed: 03/25/2024] Open
Abstract
Compared with other health disciplines, there is a stagnation in technological innovation in the field of clinical neuropsychology. Traditional paper-and-pencil tests have a number of shortcomings, such as low-frequency data collection and limitations in ecological validity. While computerized cognitive assessment may help overcome some of these issues, current computerized paradigms do not address the majority of these limitations. In this paper, we review recent literature on the applications of novel digital health approaches, including ecological momentary assessment, smartphone-based assessment and sensors, wearable devices, passive driving sensors, smart homes, voice biomarkers, and electronic health record mining, in neurological populations. We describe how each digital tool may be applied to neurologic care and overcome limitations of traditional neuropsychological assessment. Ethical considerations, limitations of current research, as well as our proposed future of neuropsychological practice are also discussed.
Collapse
Affiliation(s)
- Che Harris
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| | - Yingfei Tang
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| | - Eliana Birnbaum
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Christine Cherian
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Dinesh Mendhe
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Michelle H Chen
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| |
Collapse
|
7
|
Kaser AN, Lacritz LH, Winiarski HR, Gabirondo P, Schaffert J, Coca AJ, Jiménez-Raboso J, Rojo T, Zaldua C, Honorato I, Gallego D, Nieves ER, Rosenstein LD, Cullum CM. A novel speech analysis algorithm to detect cognitive impairment in a Spanish population. Front Neurol 2024; 15:1342907. [PMID: 38638311 PMCID: PMC11024431 DOI: 10.3389/fneur.2024.1342907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/26/2024] [Indexed: 04/20/2024] Open
Abstract
Objective Early detection of cognitive impairment in the elderly is crucial for diagnosis and appropriate care. Brief, cost-effective cognitive screening instruments are needed to help identify individuals who require further evaluation. This study presents preliminary data on a new screening technology using automated voice recording analysis software in a Spanish population. Method Data were collected from 174 Spanish-speaking individuals clinically diagnosed as cognitively normal (CN, n = 87) or impaired (mild cognitive impairment [MCI], n = 63; all-cause dementia, n = 24). Participants were recorded performing four common language tasks (Animal fluency, alternating fluency [sports and fruits], phonemic "F" fluency, and Cookie Theft Description). Recordings were processed via text-transcription and digital-signal processing techniques to capture neuropsychological variables and audio characteristics. A training sample of 122 subjects with similar demographics across groups was used to develop an algorithm to detect cognitive impairment. Speech and task features were used to develop five independent machine learning (ML) models to compute scores between 0 and 1, and a final algorithm was constructed using repeated cross-validation. A socio-demographically balanced subset of 52 participants was used to test the algorithm. Analysis of covariance (ANCOVA), covarying for demographic characteristics, was used to predict logistically-transformed algorithm scores. Results Mean logit algorithm scores were significantly different across groups in the testing sample (p < 0.01). Comparisons of CN with impaired (MCI + dementia) and MCI groups using the final algorithm resulted in an AUC of 0.93/0.90, with overall accuracy of 88.4%/87.5%, sensitivity of 87.5/83.3, and specificity of 89.2/89.2, respectively. Conclusion Findings provide initial support for the utility of this automated speech analysis algorithm as a screening tool for cognitive impairment in Spanish speakers. Additional study is needed to validate this technology in larger and more diverse clinical populations.
Collapse
Affiliation(s)
- Alyssa N. Kaser
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Laura H. Lacritz
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Holly R. Winiarski
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | | | - Jeff Schaffert
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Alberto J. Coca
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
- Cambridge Mathematics of Information in Healthcare Hub, University of Cambridge, Cambridge, United Kingdom
| | | | - Tomas Rojo
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
| | - Carla Zaldua
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
| | | | | | - Emmanuel Rosario Nieves
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Parkland Health and Hospital System Behavioral Health Clinic, Dallas, TX, United States
| | - Leslie D. Rosenstein
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Parkland Health and Hospital System Behavioral Health Clinic, Dallas, TX, United States
| | - C. Munro Cullum
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurological Surgery, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| |
Collapse
|
8
|
García AM, Johann F, Echegoyen R, Calcaterra C, Riera P, Belloli L, Carrillo F. Toolkit to Examine Lifelike Language (TELL): An app to capture speech and language markers of neurodegeneration. Behav Res Methods 2024; 56:2886-2900. [PMID: 37759106 PMCID: PMC11200269 DOI: 10.3758/s13428-023-02240-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/11/2023] [Indexed: 09/29/2023]
Abstract
Automated speech and language analysis (ASLA) is a promising approach for capturing early markers of neurodegenerative diseases. However, its potential remains underexploited in research and translational settings, partly due to the lack of a unified tool for data collection, encryption, processing, download, and visualization. Here we introduce the Toolkit to Examine Lifelike Language (TELL) v.1.0.0, a web-based app designed to bridge such a gap. First, we outline general aspects of its development. Second, we list the steps to access and use the app. Third, we specify its data collection protocol, including a linguistic profile survey and 11 audio recording tasks. Fourth, we describe the outputs the app generates for researchers (downloadable files) and for clinicians (real-time metrics). Fifth, we survey published findings obtained through its tasks and metrics. Sixth, we refer to TELL's current limitations and prospects for expansion. Overall, with its current and planned features, TELL aims to facilitate ASLA for research and clinical aims in the neurodegeneration arena. A demo version can be accessed here: https://demo.sci.tellapp.org/ .
Collapse
Affiliation(s)
- Adolfo M García
- Global Brain Health Institute, University of California, 505 Parnassus Ave, San Francisco, CA, 94143, USA.
- Cognitive Neuroscience Center, Universidad de San Andrés, Buenos Aires, Argentina.
- Departamento de Lingüística y Literatura, Facultad de Humanidades, Universidad de Santiago de Chile, Santiago, Chile.
- TELL Toolkit SA, Beethovenstraat, Netherlands.
| | - Fernando Johann
- Cognitive Neuroscience Center, Universidad de San Andrés, Buenos Aires, Argentina
- TELL Toolkit SA, Beethovenstraat, Netherlands
| | - Raúl Echegoyen
- Cognitive Neuroscience Center, Universidad de San Andrés, Buenos Aires, Argentina
- TELL Toolkit SA, Beethovenstraat, Netherlands
| | - Cecilia Calcaterra
- Cognitive Neuroscience Center, Universidad de San Andrés, Buenos Aires, Argentina
- TELL Toolkit SA, Beethovenstraat, Netherlands
| | - Pablo Riera
- Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Laouen Belloli
- Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Facundo Carrillo
- Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
9
|
Li D, Butala AA, Moro-Velazquez L, Meyer T, Oh ES, Motley C, Villalba J, Dehak N. Automating the analysis of eye movement for different neurodegenerative disorders. Comput Biol Med 2024; 170:107951. [PMID: 38219646 DOI: 10.1016/j.compbiomed.2024.107951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 12/08/2023] [Accepted: 01/01/2024] [Indexed: 01/16/2024]
Abstract
The clinical observation and assessment of extra-ocular movements is common practice in assessing neurodegenerative disorders but remains observer-dependent. In the present study, we propose an algorithm that can automatically identify saccades, fixation, smooth pursuit, and blinks using a non-invasive eye tracker. Subsequently, response-to-stimuli-derived interpretable features were elicited that objectively and quantitatively assess patient behaviors. The cohort analysis encompasses persons with mild cognitive impairment (MCI), Alzheimer's disease (AD), Parkinson's disease (PD), Parkinson's disease mimics (PDM), and controls (CTRL). Overall, results suggested that the AD/MCI and PD groups had significantly different saccade and pursuit characteristics compared to CTRL when the target moved faster or covered a larger visual angle during smooth pursuit. These two groups also displayed more omitted antisaccades and longer average antisaccade latency than CTRL. When reading a text passage silently, people with AD/MCI had more fixations. During visual exploration, people with PD demonstrated a more variable saccade duration than other groups. In the prosaccade task, the PD group showed a significantly smaller average hypometria gain and accuracy, with the most statistical significance and highest AUC scores of features studied. The minimum saccade gain was a PD-specific feature different from CTRL and PDM. These features, as oculographic biomarkers, can be potentially leveraged in distinguishing different types of NDs, yielding more objective and precise protocols to diagnose and monitor disease progression.
Collapse
Affiliation(s)
- Deming Li
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, USA.
| | - Ankur A Butala
- Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, 21205, MD, USA
| | - Laureano Moro-Velazquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, USA
| | - Trevor Meyer
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, USA
| | - Esther S Oh
- Division of Geriatric Medicine and Gerontology, Johns Hopkins University School of Medicine, Baltimore, 21205, MD, USA
| | - Chelsey Motley
- Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, 21205, MD, USA
| | - Jesús Villalba
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, USA
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, USA
| |
Collapse
|
10
|
Rios-Urrego CD, Rusz J, Orozco-Arroyave JR. Automatic speech-based assessment to discriminate Parkinson's disease from essential tremor with a cross-language approach. NPJ Digit Med 2024; 7:37. [PMID: 38368458 PMCID: PMC10874421 DOI: 10.1038/s41746-024-01027-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 02/05/2024] [Indexed: 02/19/2024] Open
Abstract
Parkinson's disease (PD) and essential tremor (ET) are prevalent movement disorders that mainly affect elderly people, presenting diagnostic challenges due to shared clinical features. While both disorders exhibit distinct speech patterns-hypokinetic dysarthria in PD and hyperkinetic dysarthria in ET-the efficacy of speech assessment for differentiation remains unexplored. Developing technology for automatic discrimination could enable early diagnosis and continuous monitoring. However, the lack of data for investigating speech behavior in these patients has inhibited the development of a framework for diagnostic support. In addition, phonetic variability across languages poses practical challenges in establishing a universal speech assessment system. Therefore, it is necessary to develop models robust to the phonetic variability present in different languages worldwide. We propose a method based on Gaussian mixture models to assess domain adaptation from models trained in German and Spanish to classify PD and ET patients in Czech. We modeled three different speech dimensions: articulation, phonation, and prosody and evaluated the models' performance in both bi-class and tri-class classification scenarios (with the addition of healthy controls). Our results show that a fusion of the three speech dimensions achieved optimal results in binary classification, with accuracies up to 81.4 and 86.2% for monologue and /pa-ta-ka/ tasks, respectively. In tri-class scenarios, incorporating healthy speech signals resulted in accuracies of 63.3 and 71.6% for monologue and /pa-ta-ka/ tasks, respectively. Our findings suggest that automated speech analysis, combined with machine learning is robust, accurate, and can be adapted to different languages to distinguish between PD and ET patients.
Collapse
Affiliation(s)
| | - Jan Rusz
- Department of Circuit Theory, Czech Technical University in Prague, Prague, Czech Republic.
| | - Juan Rafael Orozco-Arroyave
- GITA Lab, Faculty of Engineering, University of Antioquia, Medellín, Colombia.
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
| |
Collapse
|
11
|
Favaro A, Tsai YT, Butala A, Thebaud T, Villalba J, Dehak N, Moro-Velázquez L. Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput Biol Med 2023; 166:107559. [PMID: 37852107 DOI: 10.1016/j.compbiomed.2023.107559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 10/07/2023] [Accepted: 10/09/2023] [Indexed: 10/20/2023]
Abstract
Speech-based approaches for assessing Parkinson's Disease (PD) often rely on feature extraction for automatic classification or detection. While many studies prioritize accuracy by using non-interpretable embeddings from Deep Neural Networks, this work aims to explore the predictive capabilities and language robustness of both feature types in a systematic fashion. As interpretable features, prosodic, linguistic, and cognitive descriptors were adopted, while x-vectors, Wav2Vec 2.0, HuBERT, and TRILLsson representations were used as non-interpretable features. Mono-lingual, multi-lingual, and cross-lingual machine learning experiments were conducted leveraging six data sets comprising speech recordings from various languages: American English, Castilian Spanish, Colombian Spanish, Italian, German, and Czech. For interpretable feature-based models, the mean of the best F1-scores obtained from each language was 81% in mono-lingual, 81% in multi-lingual, and 71% in cross-lingual experiments. For non-interpretable feature-based models, instead, they were 85% in mono-lingual, 88% in multi-lingual, and 79% in cross-lingual experiments. Firstly, models based on non-interpretable features outperformed interpretable ones, especially in cross-lingual experiments. Specifically, TRILLsson provided the most stable and accurate results across tasks and data sets. Conversely, the two types of features adopted showed some level of language robustness in multi-lingual and cross-lingual experiments. Overall, these results suggest that interpretable feature-based models can be used by clinicians to evaluate the deterioration of the speech of patients with PD, while non-interpretable feature-based models can be leveraged to achieve higher detection accuracy.
Collapse
Affiliation(s)
- Anna Favaro
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America.
| | - Yi-Ting Tsai
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Ankur Butala
- Department of Neurology, The Johns Hopkins University, Baltimore, 21218, MD, United States of America; Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Thomas Thebaud
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Jesús Villalba
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Laureano Moro-Velázquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| |
Collapse
|