1
Li R, Huang G, Wang X, Lawler K, Goldberg LR, Roccati E, St George RJ, Aiyede M, King AE, Bindoff AD, Vickers JC, Bai Q, Alty J. Smartphone automated motor and speech analysis for early detection of Alzheimer's disease and Parkinson's disease: Validation of TapTalk across 20 different devices. Alzheimer's & Dementia (Amsterdam, Netherlands) 2024; 16:e70025. [PMID: 39445342 PMCID: PMC11496774 DOI: 10.1002/dad2.70025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/24/2024] [Revised: 09/17/2024] [Accepted: 09/23/2024] [Indexed: 10/25/2024]
Abstract
INTRODUCTION Smartphones are proving useful in assessing movement and speech function in Alzheimer's disease and other neurodegenerative conditions. Valid outcomes across different smartphones are needed before population-level tests are deployed. This study introduces the TapTalk protocol, a novel app designed to capture hand and speech function and validate it in smartphones against gold-standard measures. METHODS Twenty different smartphones collected video data from motor tests and audio data from speech tests. Features were extracted using Google Mediapipe (movement) and Python audio analysis packages (speech). Electromagnetic sensors (60 Hz) and a microphone acquired simultaneous movement and voice data, respectively. RESULTS TapTalk video and audio outcomes were comparable to gold-standard data: 90.3% of video, and 98.3% of audio, data recorded tapping/speech frequencies within ±1 Hz of the gold-standard measures. DISCUSSION Validation of TapTalk across a range of devices is an important step in the development of smartphone-based telemedicine and was achieved in this study. Highlights TapTalk evaluates hand motor and speech functions across a wide range of smartphones. Data showed 90.3% motor and 98.3% speech accuracy within ±1 Hz of gold standards. Validation advances smartphone-based telemedicine for neurodegenerative diseases.
Affiliation(s)
- Renjie Li
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- School of ICT, University of Tasmania, Hobart, Tasmania, Australia
- Guan Huang
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Xinyi Wang
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Katherine Lawler
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- School of Allied Health, Human Services and Sport, La Trobe University, Melbourne, Victoria, Australia
- Lynette R. Goldberg
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Eddy Roccati
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Mimieveshiofuo Aiyede
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Anna E. King
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Aidan D. Bindoff
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- James C. Vickers
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- Quan Bai
- School of ICT, University of Tasmania, Hobart, Tasmania, Australia
- Jane Alty
- Wicking Dementia Research and Education Centre, University of Tasmania, Hobart, Tasmania, Australia
- School of Medicine, University of Tasmania, Hobart, Tasmania, Australia
- Neurology Department, Royal Hobart Hospital, Hobart, Tasmania, Australia
2
Zhang H, Dai X, Ma W, Ding H, Zhang Y. Investigating Perception to Production Transfer in Children With Cochlear Implants: A High Variability Phonetic Training Study. Journal of Speech, Language, and Hearing Research: JSLHR 2024; 67:1206-1228. [PMID: 38466170 DOI: 10.1044/2023_jslhr-23-00573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/12/2024]
Abstract
PURPOSE This study builds upon an established effective training method to investigate the advantages of high variability phonetic identification training for enhancing lexical tone perception and production in Mandarin-speaking pediatric cochlear implant (CI) recipients, who typically face ongoing challenges in these areas. METHOD Thirty-two Mandarin-speaking children with CIs were quasirandomly assigned into the training group (TG) and the control group (CG). The 16 TG participants received five sessions of high variability phonetic training (HVPT) within a period of 3 weeks. The CG participants did not receive the training. Perception and production of Mandarin tones were administered before (pretest) and immediately after (posttest) the completion of HVPT via lexical tone recognition task and picture naming task. Both groups participated in the identical pretest and posttest with the same time frame between the two test sessions. RESULTS TG showed significant improvement from pretest to posttest in identifying Mandarin tones for both trained and untrained speech stimuli. Moreover, perceptual learning of HVPT significantly facilitated trainees' production of T1 and T2 as rated by a cohort of 10 Mandarin-speaking adults with normal hearing, which was corroborated by acoustic analyses revealing improved fundamental frequency (F0) median for T1 and T2 production and enlarged F0 movement for T2 production. In contrast, TG children's production of T3 and T4 showed nonsignificant changes across two test sessions. Meanwhile, CG did not exhibit significant changes in either perception or production. CONCLUSIONS The results suggest a limited and inconsistent transfer of perceptual learning to lexical tone production in children with CIs, which challenges the notion of a robust transfer and highlights the complexity of the interaction between perceptual training and production outcomes. 
Further research on individual differences with a longitudinal design is needed to optimize the training protocol or tailor interventions to better meet the diverse needs of learners.
Affiliation(s)
- Hao Zhang
- Center for Clinical Neurolinguistics, School of Foreign Languages and Literature, Shandong University, Jinan, China
- Xuequn Dai
- Center for Clinical Neurolinguistics, School of Foreign Languages and Literature, Shandong University, Jinan, China
- Wen Ma
- Center for Clinical Neurolinguistics, School of Foreign Languages and Literature, Shandong University, Jinan, China
- Hongwei Ding
- Speech-Language-Hearing Center, School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
- Yang Zhang
- Department of Speech-Language-Hearing Sciences and Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis
3
Novotny M, Cmejla R, Tykalova T. Automated prediction of children's age from voice acoustics. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/24/2022]
4
Ngo QC, Motin MA, Pah ND, Drotár P, Kempster P, Kumar D. Computerized analysis of speech and voice for Parkinson's disease: A systematic review. Computer Methods and Programs in Biomedicine 2022; 226:107133. [PMID: 36183641 DOI: 10.1016/j.cmpb.2022.107133] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 04/23/2022] [Revised: 09/13/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Speech impairment is an early symptom of Parkinson's disease (PD). This study has summarized the literature related to speech and voice in detecting PD and assessing its severity. METHODS A systematic review of the literature from 2010 to 2021 to investigate analysis methods and signal features. The keywords "Automatic analysis" in conjunction with "PD speech" or "PD voice" were used, and the PubMed and ScienceDirect databases were searched. A total of 838 papers were found on the first run, of which 189 were selected. One hundred and forty-seven were found to be suitable for the review. The different datasets, recording protocols, signal analysis methods and features that were reported are listed. Values of the features that separate PD patients from healthy controls were tabulated. Finally, the barriers that limit the wide use of computerized speech analysis are discussed. RESULTS Speech and voice may be valuable markers for PD. However, large differences between the datasets make it difficult to compare different studies. In addition, speech analytic methods that are not informed by physiological understanding may alienate clinicians. CONCLUSIONS The potential usefulness of speech and voice for the detection and assessment of PD is confirmed by evidence from the classification and correlation results.
Affiliation(s)
- Mohammod Abdul Motin
- Biosignals Lab, RMIT University, Melbourne, Australia; Department of Electrical & Electronic Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
- Nemuel Daniel Pah
- Biosignals Lab, RMIT University, Melbourne, Australia; Universitas Surabaya, Indonesia
- Peter Drotár
- Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001, Kosice, Slovakia
- Peter Kempster
- Neurosciences Department, Monash Health, Clayton, VIC, Australia; Department of Medicine, School of Clinical Sciences, Monash University, Clayton, VIC, Australia
- Dinesh Kumar
- Biosignals Lab, RMIT University, Melbourne, Australia
5
Kouba T, Illner V, Rusz J. Study protocol for using a smartphone application to investigate speech biomarkers of Parkinson's disease and other synucleinopathies: SMARTSPEECH. BMJ Open 2022; 12:e059871. [PMID: 35772829 PMCID: PMC9247696 DOI: 10.1136/bmjopen-2021-059871] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Indexed: 11/29/2022] Open
Abstract
INTRODUCTION Early identification of Parkinson's disease (PD) in its prodromal stage has fundamental implications for the future development of neuroprotective therapies. However, no sufficiently accurate biomarkers of prodromal PD are currently available to facilitate early identification. The vocal assessment of patients with isolated rapid eye movement sleep behaviour disorder (iRBD) and PD appears to have intriguing potential as a diagnostic and progressive biomarker of PD and related synucleinopathies. METHODS AND ANALYSIS Speech patterns in the spontaneous speech of iRBD, early PD and control participants' voice calls will be collected from data acquired via a developed smartphone application over a period of 2 years. A significant increase in several aspects of PD-related speech disorders is expected, and is anticipated to reflect the underlying neurodegeneration processes. ETHICS AND DISSEMINATION The study has been approved by the Ethics Committee of the General University Hospital in Prague, Czech Republic and all the participants will provide written, informed consent prior to their inclusion in the research. The application satisfies the General Data Protection Regulation law requirements of the European Union. The study findings will be published in peer-reviewed journals and presented at international scientific conferences.
Affiliation(s)
- Tomáš Kouba
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
- Vojtěch Illner
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
- Jan Rusz
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
6
Šimek M, Rusz J. Validation of cepstral peak prominence in assessing early voice changes of Parkinson's disease: Effect of speaking task and ambient noise. The Journal of the Acoustical Society of America 2021; 150:4522. [PMID: 34972306 DOI: 10.1121/10.0009063] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 06/15/2021] [Accepted: 12/03/2021] [Indexed: 06/14/2023]
Abstract
Although the cepstral peak prominence (CPP) and its variant, the cepstral peak prominence smooth (CPPS), are considered to be robust acoustic measures for the evaluation of dysphonia, whether they are sensitive to capture early voice changes in Parkinson's disease (PD) has not yet been explored. This study aimed to investigate the voice changes via the CPP measures in the idiopathic rapid eye movement sleep behavior disorder (iRBD), a special case of prodromal neurodegeneration, and recently diagnosed and advanced-stage Parkinson's disease (AS-PD) patients using different speaking tasks across noise-free and noisy environments. The sustained vowel phonation, reading of passages, and monologues of 60 early stage untreated PD, 30 advanced-stage Parkinson's disease, 60 iRBD, and 60 healthy control (HC) participants were evaluated. Significant differences were found between the PD groups and controls in sustained phonation via the CPP (p < 0.05) and CPPS (p < 0.01) and the monologue via the CPP (p < 0.01), although neither the CPP nor CPPS measures were sufficiently sensitive to capture the possible prodromal dysphonia in the iRBD. The quality of the CPP and CPPS measures was influenced substantially by the addition of ambient noise. It was anticipated that the CPP measures might serve as a promising digital biomarker in assessing the dysphonia from the early stages of PD.
Affiliation(s)
- Michal Šimek
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
- Jan Rusz
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
7
Rusz J, Tykalová T, Novotný M, Růžička E, Dušek P. Distinct patterns of speech disorder in early-onset and late-onset de-novo Parkinson's disease. NPJ Parkinsons Dis 2021; 7:98. [PMID: 34764299 PMCID: PMC8585880 DOI: 10.1038/s41531-021-00243-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 05/24/2021] [Accepted: 10/21/2021] [Indexed: 11/28/2022] Open
Abstract
Substantial variability and severity of dysarthric patterns across Parkinson's disease (PD) patients may reflect distinct phenotypic differences. We aimed to compare patterns of speech disorder in early-onset PD (EOPD) and late-onset PD (LOPD) in drug-naive patients at early stages of disease. Speech samples were acquired from a total of 96 participants, including two subgroups of 24 de-novo PD patients and two subgroups of 24 age- and sex-matched young and old healthy controls. The EOPD group included patients with age at onset below 51 (mean 42.6, standard deviation 6.1) years and LOPD group patients with age at onset above 69 (mean 73.9, standard deviation 3.0) years. Quantitative acoustic vocal assessment of 10 unique speech dimensions related to respiration, phonation, articulation, prosody, and speech timing was performed. Despite similar perceptual dysarthria severity in both PD subgroups, EOPD showed weaker inspirations (p = 0.03), while LOPD was characterized by decreased voice quality (p = 0.02) and imprecise consonant articulation (p = 0.03). In addition, age-independent occurrence of monopitch (p < 0.001), monoloudness (p = 0.008), and articulatory decay (p = 0.04) was observed in both PD subgroups. The worsening of consonant articulation was correlated with the severity of axial gait symptoms (r = 0.38, p = 0.008). Speech abnormalities in EOPD and LOPD share common features but also show phenotype-specific characteristics, likely reflecting the influence of aging on the process of neurodegeneration. The distinct pattern of imprecise consonant articulation can be interpreted as an axial motor symptom of PD.
Affiliation(s)
- Jan Rusz
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic.
- Department of Neurology and Centre of Clinical Neuroscience, First Faculty of Medicine, Charles University, Prague, Czech Republic.
- Tereza Tykalová
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
- Michal Novotný
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
- Evžen Růžička
- Department of Neurology and Centre of Clinical Neuroscience, First Faculty of Medicine, Charles University, Prague, Czech Republic
- Petr Dušek
- Department of Neurology and Centre of Clinical Neuroscience, First Faculty of Medicine, Charles University, Prague, Czech Republic
8
Madruga M, Campos-Roca Y, Pérez CJ. Impact of noise on the performance of automatic systems for vocal fold lesions detection. Biocybern Biomed Eng 2021. [DOI: 10.1016/j.bbe.2021.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/20/2022]
9
Krýže P, Tykalová T, Růžička E, Rusz J. Effect of reading passage length on quantitative acoustic speech assessment in Czech-speaking individuals with Parkinson's disease treated with subthalamic nucleus deep brain stimulation. The Journal of the Acoustical Society of America 2021; 149:3366. [PMID: 34241103 DOI: 10.1121/10.0005050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/11/2021] [Accepted: 04/29/2021] [Indexed: 06/13/2023]
Abstract
Little is known about the minimum sample length required for the stable acoustic assessment of speech in Parkinson's disease (PD). This study aimed to investigate the effect of the duration of the reading passage on the determination of reliable acoustic patterns in individuals with PD treated with subthalamic nucleus deep brain stimulation. A phonetically balanced reading text of 313 words was collected from 32 Czech persons with PD, and 32 age- and sex-matched healthy controls. The reading passage was segmented to produce ten sub-texts of increasing length ranging from a one- to a ten-segment-long sub-text. An error rate analysis was used to estimate the required stabilization value by evaluating the differences between the sub-texts and the entire text across seven hypokinetic dysarthria features. The minimum length of a reading passage equal to 128 words was found to be necessary for acoustic assessment, with similar lengths being required for the controls (120 words) and the two PD subgroups, including Parkinsonian individuals with a mild (126 words) and moderate (128 words) dysarthria severity. The current study provides important guidelines for the necessary sample length for future expert instrumental dysarthria assessments and assists in decreasing the time required for clinical speech evaluations.
Affiliation(s)
- Petr Krýže
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University, Prague, Czech Republic
- Tereza Tykalová
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University, Prague, Czech Republic
- Evžen Růžička
- Department of Neurology and Centre of Clinical Neuroscience, First Faculty of Medicine, Charles University, Prague, Czech Republic
- Jan Rusz
- Department of Circuit Theory, Faculty of Electrical Engineering, Czech Technical University, Prague, Czech Republic
10
Ilesanmi AE, Idowu OP, Chaumrattanakul U, Makhanov SS. Multiscale hybrid algorithm for pre-processing of ultrasound images. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2020.102396] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 11/26/2022]
11
Robin J, Harrison JE, Kaufman LD, Rudzicz F, Simpson W, Yancheva M. Evaluation of Speech-Based Digital Biomarkers: Review and Recommendations. Digit Biomark 2020; 4:99-108. [PMID: 33251474 DOI: 10.1159/000510820] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Received: 06/12/2020] [Accepted: 08/11/2020] [Indexed: 12/23/2022] Open
Abstract
Speech represents a promising novel biomarker by providing a window into brain health, as shown by its disruption in various neurological and psychiatric diseases. As with many novel digital biomarkers, however, rigorous evaluation is currently lacking and is required for these measures to be used effectively and safely. This paper outlines and provides examples from the literature of evaluation steps for speech-based digital biomarkers, based on the recent V3 framework (Goldsack et al., 2020). The V3 framework describes 3 components of evaluation for digital biomarkers: verification, analytical validation, and clinical validation. Verification includes assessing the quality of speech recordings and comparing the effects of hardware and recording conditions on the integrity of the recordings. Analytical validation includes checking the accuracy and reliability of data processing and computed measures, including understanding test-retest reliability, demographic variability, and comparing measures to reference standards. Clinical validity involves verifying the correspondence of a measure to clinical outcomes which can include diagnosis, disease progression, or response to treatment. For each of these sections, we provide recommendations for the types of evaluation necessary for speech-based biomarkers and review published examples. The examples in this paper focus on speech-based biomarkers, but they can be used as a template for digital biomarker development more generally.
Affiliation(s)
- John E Harrison
- Metis Cognition Ltd., Park House, Kilmington Common, Warminster, United Kingdom; Alzheimer Center, AUmc, Amsterdam, The Netherlands; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom
- Frank Rudzicz
- Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, Ontario, Canada; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- William Simpson
- Winterlight Labs, Toronto, Ontario, Canada; Department of Psychiatry and Behavioural Neuroscience, McMaster University, Hamilton, Ontario, Canada