1
|
Favaro A, Butala A, Thebaud T, Villalba J, Dehak N, Moro-Velázquez L. Unveiling early signs of Parkinson's disease via a longitudinal analysis of celebrity speech recordings. NPJ Parkinsons Dis 2024; 10:207. [PMID: 39465276 PMCID: PMC11514279 DOI: 10.1038/s41531-024-00817-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Accepted: 10/08/2024] [Indexed: 10/29/2024] Open
Abstract
Numerous studies proposed methods to detect Parkinson's disease (PD) via speech analysis. However, existing corpora often lack prodromal recordings, have small sample sizes, and lack longitudinal data. Speech samples from celebrities who publicly disclosed their PD diagnosis provide longitudinal data, allowing the creation of a new corpus, ParkCeleb. We collected videos from 40 subjects with PD and 40 controls and analyzed evolving speech features from 10 years before to 20 years after diagnosis. Our longitudinal analysis, focused on 15 subjects with PD and 15 controls, revealed features like pitch variability, pause duration, speech rate, and syllable duration, indicating PD progression. Early dysarthria patterns were detectable in the prodromal phase, with the best classifiers achieving AUCs of 0.72 and 0.75 for data collected ten and five years before diagnosis, respectively, and 0.93 post-diagnosis. This study highlights the potential for early detection methods, aiding treatment response identification and screening in clinical trials.
Collapse
Affiliation(s)
- Anna Favaro
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA.
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA.
| | - Ankur Butala
- Department of Neurology, Johns Hopkins Medicine, Baltimore, MD, USA
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins Medicine, Baltimore, MD, USA
| | - Thomas Thebaud
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| | - Jesús Villalba
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| | - Laureano Moro-Velázquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
2
|
Pinto S, Cardoso R, Atkinson-Clement C, Guimarães I, Sadat J, Santos H, Mercier C, Carvalho J, Cuartero MC, Oliveira P, Welby P, Frota S, Cavazzini E, Vigário M, Letanneux A, Cruz M, Brulefert C, Desmoulins M, Martins IP, Rothe-Neves R, Viallet F, Ferreira JJ. Do Acoustic Characteristics of Dysarthria in People With Parkinson's Disease Differ Across Languages? JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:2822-2841. [PMID: 38754039 DOI: 10.1044/2024_jslhr-23-00525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
PURPOSE Cross-language studies suggest more similarities than differences in how dysarthria affects the speech of people with Parkinson's disease (PwPD) who speak different languages. In this study, we aimed to identify the relative contribution of acoustic variables to distinguish PwPD from controls who spoke varieties of two Romance languages, French and Portuguese. METHOD This bi-national, cross-sectional, and case-controlled study included 129 PwPD and 124 healthy controls who spoke French or Portuguese. All participants underwent the same clinical examinations, voice/speech recordings, and self-assessment questionnaires. PwPD were evaluated off and on optimal medication. Inferential analyses included Disease (controls vs. PwPD) and Language (French vs. Portuguese) as factors, and random decision forest algorithms identified relevant acoustic variables able to distinguish participants: (a) by language (French vs. Portuguese) and (b) by clinical status (PwPD on and off medication vs. controls). RESULTS French-speaking and Portuguese-speaking individuals were distinguished from each other with over 90% accuracy by five acoustic variables (the mean fundamental frequency and the shimmer of the sustained vowel /a/ production, the oral diadochokinesis performance index, the relative sound level pressure and the relative sound pressure level standard deviation of the text reading). A distinct set of parameters discriminated between controls and PwPD: for men, maximum phonation time and the oral diadochokinesis speech proportion were the most significant variables; for women, variables calculated from the oral diadochokinesis were the most discriminative. CONCLUSIONS Acoustic variables related to phonation and voice quality distinguished between speakers of the two languages. Variables related to pneumophonic coordination and articulation rate were the more effective in distinguishing PwPD from controls. Thus, our research findings support that respiration and diadochokinesis tasks appear to be the most appropriate to pinpoint signs of dysarthria, which are largely homogeneous and language-universal. In contrast, identifying language-specific variables with the speech tasks and acoustic variables studied was less conclusive.
Collapse
Affiliation(s)
- Serge Pinto
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
| | - Rita Cardoso
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
- Instituto de Medicina Molecular, Faculdade de Medicina, University of Lisbon, Portugal
| | - Cyril Atkinson-Clement
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
- Precision Imaging Beacon, School of Medicine, University of Nottingham, United Kingdom
| | - Isabel Guimarães
- Instituto de Medicina Molecular, Faculdade de Medicina, University of Lisbon, Portugal
- Speech Therapy Department, Alcoitão Health School of Sciences, Alcabideche, Portugal
| | - Jasmin Sadat
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
| | - Helena Santos
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
| | - Céline Mercier
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
- Neurology Department, Centre Hospitalier Intercommunal du Pays d'Aix, Aix-en-Provence, France
| | - Joana Carvalho
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
| | | | | | - Pauline Welby
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
| | - Sónia Frota
- Center of Linguistics, School of Arts and Humanities, University of Lisbon, Portugal
| | | | - Marina Vigário
- Center of Linguistics, School of Arts and Humanities, University of Lisbon, Portugal
| | - Alban Letanneux
- ESPE Université Paris-Est Créteil, Laboratoire CHArt-UPEC (EA 4004), Bonneuil-sur-Marne, France
| | - Marisa Cruz
- Center of Linguistics, School of Arts and Humanities, University of Lisbon, Portugal
| | | | | | - Isabel Pavão Martins
- Language Research Laboratory, Department of Neurology, University of Lisbon, Portugal
| | - Rui Rothe-Neves
- Laboratório de Fonética, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - François Viallet
- Aix-Marseille Univ, CNRS, LPL, Aix-en-Provence, France
- Neurology Department, Centre Hospitalier Intercommunal du Pays d'Aix, Aix-en-Provence, France
| | - Joaquim J Ferreira
- CNS - Campus Neurológico Sénior, Torres Vedras, Portugal
- Instituto de Medicina Molecular, Faculdade de Medicina, University of Lisbon, Portugal
| |
Collapse
|
3
|
Convey RB, Laukkanen AM, Ylinen S, Penttilä N. Analysis of Voice Changes in Early-Stage Parkinson's Disease with AVQI and ABI: A Follow-up Study. J Voice 2024:S0892-1997(24)00160-7. [PMID: 38897855 DOI: 10.1016/j.jvoice.2024.05.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 05/17/2024] [Accepted: 05/18/2024] [Indexed: 06/21/2024]
Abstract
OBJECTIVES The purpose of this pilot study was to examine voice quality changes in individuals with early-stage Parkinson's disease (PD) utilizing the Acoustic Voice Quality Index (AVQI) and Acoustic Breathiness Index (ABI) over approximately a 1-year period. STUDY DESIGN Follow-up study. METHODS Baseline and follow-up data were gathered from the PDSTUlong speech corpus. The data for both time points included: speaker background information, sustained vowels, reading samples, and measures of PD severity (Hoehn and Yahr scores and Unified Parkinson's Disease Rating Scale III scores [UPDRS-III]). All speakers (N = 12) were native Finnish speakers. AVQIv03.01 and ABI analysis were completed in VOXplot v2.0.1. Changes in AVQI and ABI scores between baseline and follow-up were examined via causal analysis. Further, AVQI and ABI were analyzed in relation to measures of PD severity. RESULTS Baseline mean AVQI score was 1.79 (range 0.14-4.83, SD=1.60), whereas follow-up mean AVQI score was 2.25 (range 0.55-4.53, SD=1.36). Baseline mean ABI score, in turn, was 2.92 (range 1-27 - 5.31, SD=1.57), whereas follow-up mean ABI score was 3.42 (range 1.40-5.40, SD=1.38). A significant difference was found between baseline and follow-up measures for both AVQI (Z = -2.002, P = 0.045) and ABI (Z = -2.197, P = 0.028). A significant difference in smoothed cepstral peak prominence (Z = -2.118, P = 0.034) and harmonics-to-noise ratio (Z = -1.961, P = 0.050) was also found between the two measurement periods. Change in AVQI and ABI were not correlated with the change in measures of PD severity. CONCLUSION Over approximately 1-year, a statistical change was observed in AVQI and ABI scores, even in such a small dataset. The specific qualities of breathiness and hoarseness showed the most significant progression. Changes in voice quality were more prominent in ABI analysis.
Collapse
Affiliation(s)
- Rachel B Convey
- Faculty of Social Sciences, Tampere University, Tampere, Finland.
| | | | - Sari Ylinen
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Nelly Penttilä
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| |
Collapse
|
4
|
Srinivasan S, Ramadass P, Mathivanan SK, Panneer Selvam K, Shivahare BD, Shah MA. Detection of Parkinson disease using multiclass machine learning approach. Sci Rep 2024; 14:13813. [PMID: 38877028 PMCID: PMC11178918 DOI: 10.1038/s41598-024-64004-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 06/04/2024] [Indexed: 06/16/2024] Open
Abstract
Parkinson's Disease (PD) is a prevalent neurological condition characterized by motor and cognitive impairments, typically manifesting around the age of 50 and presenting symptoms such as gait difficulties and speech impairments. Although a cure remains elusive, symptom management through medication is possible. Timely detection is pivotal for effective disease management. In this study, we leverage Machine Learning (ML) and Deep Learning (DL) techniques, specifically K-Nearest Neighbor (KNN) and Feed-forward Neural Network (FNN) models, to differentiate between individuals with PD and healthy individuals based on voice signal characteristics. Our dataset, sourced from the University of California at Irvine (UCI), comprises 195 voice recordings collected from 31 patients. To optimize model performance, we employ various strategies including Synthetic Minority Over-sampling Technique (SMOTE) for addressing class imbalance, Feature Selection to identify the most relevant features, and hyperparameter tuning using RandomizedSearchCV. Our experimentation reveals that the FNN and KSVM models, trained on an 80-20 split of the dataset for training and testing respectively, yield the most promising results. The FNN model achieves an impressive overall accuracy of 99.11%, with 98.78% recall, 99.96% precision, and a 99.23% f1-score. Similarly, the KSVM model demonstrates strong performance with an overall accuracy of 95.89%, recall of 96.88%, precision of 98.71%, and an f1-score of 97.62%. Overall, our study showcases the efficacy of ML and DL techniques in accurately identifying PD from voice signals, underscoring the potential for these approaches to contribute significantly to early diagnosis and intervention strategies for Parkinson's Disease.
Collapse
Affiliation(s)
- Saravanan Srinivasan
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, 600062, India
| | - Parthasarathy Ramadass
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, 600062, India
| | | | - Karthikeyan Panneer Selvam
- Department of Computer Applications, School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, Tamil Nadu, 632014, India
| | - Basu Dev Shivahare
- School of Computer Science and Engineering, Galgotias University, Greater Noida, 203201, India
| | - Mohd Asif Shah
- Department of Economics, Kabridahar University, Po Box 250, Kebri Dehar, Ethiopia.
- Division of Research and Development, Lovely Professional University, Phagwara, Punjab, 144001, India.
| |
Collapse
|
5
|
Manes JL, Bullock L, Meier AM, Turner RS, Richardson RM, Guenther FH. A neurocomputational view of the effects of Parkinson's disease on speech production. Front Hum Neurosci 2024; 18:1383714. [PMID: 38812472 PMCID: PMC11133703 DOI: 10.3389/fnhum.2024.1383714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 04/23/2024] [Indexed: 05/31/2024] Open
Abstract
The purpose of this article is to review the scientific literature concerning speech in Parkinson's disease (PD) with reference to the DIVA/GODIVA neurocomputational modeling framework. Within this theoretical view, the basal ganglia (BG) contribute to several different aspects of speech motor learning and execution. First, the BG are posited to play a role in the initiation and scaling of speech movements. Within the DIVA/GODIVA framework, initiation and scaling are carried out by initiation map nodes in the supplementary motor area acting in concert with the BG. Reduced support of the initiation map from the BG in PD would result in reduced movement intensity as well as susceptibility to early termination of movement. A second proposed role concerns the learning of common speech sequences, such as phoneme sequences comprising words; this view receives support from the animal literature as well as studies identifying speech sequence learning deficits in PD. Third, the BG may play a role in the temporary buffering and sequencing of longer speech utterances such as phrases during conversational speech. Although the literature does not support a critical role for the BG in representing sequence order (since incorrectly ordered speech is not characteristic of PD), the BG are posited to contribute to the scaling of individual movements in the sequence, including increasing movement intensity for emphatic stress on key words. Therapeutic interventions for PD have inconsistent effects on speech. In contrast to dopaminergic treatments, which typically either leave speech unchanged or lead to minor improvements, deep brain stimulation (DBS) can degrade speech in some cases and improve it in others. However, cases of degradation may be due to unintended stimulation of efferent motor projections to the speech articulators. Findings of spared speech after bilateral pallidotomy appear to indicate that any role played by the BG in adult speech must be supplementary rather than mandatory, with the sequential order of well-learned sequences apparently represented elsewhere (e.g., in cortico-cortical projections).
Collapse
Affiliation(s)
- Jordan L. Manes
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- Department of Communicative Disorders and Sciences, University at Buffalo, Buffalo, NY, United States
| | - Latané Bullock
- Program in Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, United States
- Department of Neurosurgery, Massachusetts General Hospital, Boston, MA, United States
- Harvard Medical School, Boston, MA, United States
| | - Andrew M. Meier
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
| | - Robert S. Turner
- Department of Neurobiology, University of Pittsburgh, Pittsburgh, PA, United States
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, United States
| | - R. Mark Richardson
- Department of Neurosurgery, Massachusetts General Hospital, Boston, MA, United States
- Harvard Medical School, Boston, MA, United States
| | - Frank H. Guenther
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- Department of Biomedical Engineering, Boston University, Boston, MA, United States
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA, United States
| |
Collapse
|
6
|
Abumalloh RA, Nilashi M, Samad S, Ahmadi H, Alghamdi A, Alrizq M, Alyami S. Parkinson's disease diagnosis using deep learning: A bibliometric analysis and literature review. Ageing Res Rev 2024; 96:102285. [PMID: 38554785 DOI: 10.1016/j.arr.2024.102285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 03/20/2024] [Accepted: 03/24/2024] [Indexed: 04/02/2024]
Abstract
Parkinson's Disease (PD) is a progressive neurodegenerative illness triggered by decreased dopamine secretion. Deep Learning (DL) has gained substantial attention in PD diagnosis research, with an increase in the number of published papers in this discipline. PD detection using DL has presented more promising outcomes as compared with common machine learning approaches. This article aims to conduct a bibliometric analysis and a literature review focusing on the prominent developments taking place in this area. To achieve the target of the study, we retrieved and analyzed the available research papers in the Scopus database. Following that, we conducted a bibliometric analysis to inspect the structure of keywords, authors, and countries in the surveyed studies by providing visual representations of the bibliometric data using VOSviewer software. The study also provides an in-depth review of the literature focusing on different indicators of PD, deployed approaches, and performance metrics. The outcomes indicate the firm development of PD diagnosis using DL approaches over time and a large diversity of studies worldwide. Additionally, the literature review presented a research gap in DL approaches related to incremental learning, particularly in relation to big data analysis.
Collapse
Affiliation(s)
- Rabab Ali Abumalloh
- Department of Computer Science and Engineering, Qatar University, Doha 2713, Qatar
| | - Mehrbakhsh Nilashi
- Institute of Research and Development, Duy Tan University, Da Nang, Vietnam; School of Computer Science, Duy Tan University, Da Nang, Vietnam; UCSI Graduate Business School, UCSI University, No. 1 Jalan Menara Gading, UCSI Heights, Cheras, Kuala Lumpur 56000, Malaysia; Centre for Global Sustainability Studies (CGSS), Universiti Sains Malaysia, Penang 11800, Malaysia.
| | - Sarminah Samad
- Faculty of Business, UNITAR International University, Tierra Crest, Jalan SS6/3, Petaling Jaya, Selangor 47301, Malaysia
| | - Hossein Ahmadi
- Centre for Health Technology, Faculty of Health, University of Plymouth, Plymouth PL4 8AA, UK
| | - Abdullah Alghamdi
- Information Systems Dept., College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia; AI Lab, Scientific and Engineering Research Center (SERC), Najran University, Najran, Saudi Arabia
| | - Mesfer Alrizq
- Information Systems Dept., College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia; AI Lab, Scientific and Engineering Research Center (SERC), Najran University, Najran, Saudi Arabia
| | - Sultan Alyami
- AI Lab, Scientific and Engineering Research Center (SERC), Najran University, Najran, Saudi Arabia; Computer Science Dept., College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| |
Collapse
|
7
|
Pettorino M, Maffia M. Analyzing changes in parkinsonian speech over time: a diachronic experimental phonetics study. Front Aging Neurosci 2024; 16:1334198. [PMID: 38533425 PMCID: PMC10963411 DOI: 10.3389/fnagi.2024.1334198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 02/26/2024] [Indexed: 03/28/2024] Open
Abstract
In this contribution the use of web resources for the longitudinal study of speech rhythm of a 'well-known' person diagnosed with Parkinson's disease, the American actor Alan Alda, is proposed. A corpus of 20 speech samples produced in the period between 1979 and 2021 was collected from the web. A rhythmical analysis was conducted, based on two parameters: the percentage of vocalic portion on the total duration of the utterance (%V) and the VtoV, the mean duration of the interval between two consecutive vowel onset points. The results of this study confirm an early alteration of rhythm in parkinsonian speech, with an abnormal increase of %V, already occurring some years before the clinical diagnosis. The observation of speech rhythm variation can therefore be considered as the basis for the realization of a sustainable and non-invasive procedure in support to early diagnosis of Parkinson's disease.
Collapse
|
8
|
Maetzler W, Mirelman A, Pilotto A, Bhidayasiri R. Identifying Subtle Motor Deficits Before Parkinson's Disease is Diagnosed: What to Look for? JOURNAL OF PARKINSON'S DISEASE 2024; 14:S287-S296. [PMID: 38363620 PMCID: PMC11492040 DOI: 10.3233/jpd-230350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 01/11/2024] [Indexed: 02/17/2024]
Abstract
Motor deficits typical of Parkinson's disease (PD), such as gait and balance disturbances, tremor, reduced arm swing and finger movement, and voice and breathing changes, are believed to manifest several years prior to clinical diagnosis. Here we describe the evidence for the presence and progression of motor deficits in this pre-diagnostic phase in order to provide suggestions for the design of future observational studies for an effective, quantitatively oriented investigation. On the one hand, these future studies must detect these motor deficits in as large (potentially, population-based) cohorts as possible with high sensitivity and specificity. On the other hand, they must describe the progression of these motor deficits in the pre-diagnostic phase as accurately as possible, to support the testing of the effect of pharmacological and non-pharmacological interventions. Digital technologies and artificial intelligence can substantially accelerate this process.
Collapse
Affiliation(s)
- Walter Maetzler
- Department of Neurology, University Hospital Schleswig-Holstein and Kiel University, Kiel, Germany
| | - Anat Mirelman
- Laboratory for Early Markers of Neurodegeneration, Neurological Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sagol School of Neuroscience and Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Andrea Pilotto
- Neurology Unit, Department of Clinical and Experimental Sciences, University of Brescia, Brescia, Italy
- Laboratory of Digital Neurology and Biosensors, University of Brescia, Brescia, Italy
- Neurology Unit, Department of Continuity of Care and Frailty, ASST Spedali Civili Brescia Hospital, Brescia, Italy
| | - Roongroj Bhidayasiri
- Chulalongkorn Centre of Excellence for Parkinson’s Disease & Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
- The Academy of Science, The Royal Society of Thailand, Bangkok, Thailand
| |
Collapse
|
9
|
Favaro A, Tsai YT, Butala A, Thebaud T, Villalba J, Dehak N, Moro-Velázquez L. Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput Biol Med 2023; 166:107559. [PMID: 37852107 DOI: 10.1016/j.compbiomed.2023.107559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 10/07/2023] [Accepted: 10/09/2023] [Indexed: 10/20/2023]
Abstract
Speech-based approaches for assessing Parkinson's Disease (PD) often rely on feature extraction for automatic classification or detection. While many studies prioritize accuracy by using non-interpretable embeddings from Deep Neural Networks, this work aims to explore the predictive capabilities and language robustness of both feature types in a systematic fashion. As interpretable features, prosodic, linguistic, and cognitive descriptors were adopted, while x-vectors, Wav2Vec 2.0, HuBERT, and TRILLsson representations were used as non-interpretable features. Mono-lingual, multi-lingual, and cross-lingual machine learning experiments were conducted leveraging six data sets comprising speech recordings from various languages: American English, Castilian Spanish, Colombian Spanish, Italian, German, and Czech. For interpretable feature-based models, the mean of the best F1-scores obtained from each language was 81% in mono-lingual, 81% in multi-lingual, and 71% in cross-lingual experiments. For non-interpretable feature-based models, instead, they were 85% in mono-lingual, 88% in multi-lingual, and 79% in cross-lingual experiments. Firstly, models based on non-interpretable features outperformed interpretable ones, especially in cross-lingual experiments. Specifically, TRILLsson provided the most stable and accurate results across tasks and data sets. Conversely, the two types of features adopted showed some level of language robustness in multi-lingual and cross-lingual experiments. Overall, these results suggest that interpretable feature-based models can be used by clinicians to evaluate the deterioration of the speech of patients with PD, while non-interpretable feature-based models can be leveraged to achieve higher detection accuracy.
Collapse
Affiliation(s)
- Anna Favaro
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America.
| | - Yi-Ting Tsai
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Ankur Butala
- Department of Neurology, The Johns Hopkins University, Baltimore, 21218, MD, United States of America; Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Thomas Thebaud
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Jesús Villalba
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Laureano Moro-Velázquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| |
Collapse
|
10
|
Sara JDS, Orbelo D, Maor E, Lerman LO, Lerman A. Guess What We Can Hear-Novel Voice Biomarkers for the Remote Detection of Disease. Mayo Clin Proc 2023; 98:1353-1375. [PMID: 37661144 PMCID: PMC10043966 DOI: 10.1016/j.mayocp.2023.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 02/08/2023] [Accepted: 03/16/2023] [Indexed: 03/30/2023]
Abstract
The advancement of digital biomarkers and the provision of remote health care greatly progressed during the coronavirus disease 2019 global pandemic. Combining voice/speech data with artificial intelligence and machine-based learning offers a novel solution to the growing demand for telemedicine. Voice biomarkers, obtained from the extraction of characteristic acoustic and linguistic features, are associated with a variety of diseases and even coronavirus disease 2019. In the current review, we (1) describe the basis on which digital voice biomarkers could facilitate "telemedicine," (2) discuss potential mechanisms that may explain the association between voice biomarkers and disease, (3) offer a novel classification system to conceptualize voice biomarkers depending on different methods for recording and analyzing voice/speech samples, (4) outline evidence revealing an association between voice biomarkers and a number of disease states, and (5) describe the process of developing a voice biomarker from recording, storing voice samples, and extracting acoustic and linguistic features relevant to training and testing deep and machine-based learning algorithms to detect disease. We further explore several important future considerations in this area of research, including the necessity for clinical trials and the importance of safeguarding data and individual privacy. To this end, we searched PubMed and Google Scholar to identify studies evaluating the relationship between voice/speech features and biomarkers and various diseases. Search terms included digital biomarker, telemedicine, voice features, voice biomarker, speech features, speech biomarkers, acoustics, linguistics, cardiovascular disease, neurologic disease, psychiatric disease, and infectious disease. The search was limited to studies published in English in peer-reviewed journals between 1980 and the present. To identify potential studies not captured by our database search strategy, we also searched studies listed in the bibliography of relevant publications and reviews.
Collapse
Affiliation(s)
| | - Diana Orbelo
- Division of Otolaryngology, Mayo Clinic College of Medicine and Science, Rochester, MN; Chaim Sheba Medical Center, Tel HaShomer, Israel
| | - Elad Maor
- Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Lilach O Lerman
- Division of Nephrology and Hypertension, Mayo Clinic Rochester, MN
| | - Amir Lerman
- Department of Cardiovascular Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN.
| |
Collapse
|
11
|
Friedman L, Lauber M, Behroozmand R, Fogerty D, Kunecki D, Berry-Kravis E, Klusek J. Atypical vocal quality in women with the FMR1 premutation: an indicator of impaired sensorimotor control. Exp Brain Res 2023; 241:1975-1987. [PMID: 37347418 PMCID: PMC10863608 DOI: 10.1007/s00221-023-06653-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 06/13/2023] [Indexed: 06/23/2023]
Abstract
Women with the FMR1 premutation are susceptible to motor involvement related to atypical cerebellar function, including risk for developing fragile X tremor ataxia syndrome. Vocal quality analyses are sensitive to subtle differences in motor skills but have not yet been applied to the FMR1 premutation. This study examined whether women with the FMR1 premutation demonstrate differences in vocal quality, and whether such differences relate to FMR1 genetic, executive, motor, or health features of the FMR1 premutation. Participants included 35 women with the FMR1 premutation and 45 age-matched women without the FMR1 premutation who served as a comparison group. Three sustained /a/ vowels were analyzed for pitch (mean F0), variability of pitch (standard deviation of F0), and overall vocal quality (jitter, shimmer, and harmonics-to-noise ratio). Executive, motor, and health indices were obtained from direct and self-report measures and genetic samples were analyzed for FMR1 CGG repeat length and activation ratio. Women with the FMR1 premutation had a lower pitch, larger pitch variability, and poorer vocal quality than the comparison group. Working memory was related to harmonics-to-noise ratio and shimmer in women with the FMR1 premutation. Vocal quality abnormalities differentiated women with the FMR1 premutation from the comparison group and were evident even in the absence of other clinically evident motor deficits. This study supports vocal quality analyses as a tool that may prove useful in the detection of early signs of motor involvement in this population.
Collapse
Affiliation(s)
- Laura Friedman
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Meagan Lauber
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Roozbeh Behroozmand
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Daniel Fogerty
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, USA
| | - Dariusz Kunecki
- Department of Pediatrics, Rush University Medical Center, Chicago, USA
| | | | - Jessica Klusek
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA.
| |
Collapse
|
12
|
Romero Arias T, Redondo Cortés I, Pérez Del Olmo A. Biomechanical Parameters of Voice in Parkinson's Disease Patients. Folia Phoniatr Logop 2023; 76:91-101. [PMID: 37499642 DOI: 10.1159/000533289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 07/24/2023] [Indexed: 07/29/2023] Open
Abstract
INTRODUCTION Previous research on voice in Parkinson's disease (PD) has consistently demonstrated alterations in acoustic parameters, including fundamental frequency (F0), maximum phonation time, Shimmer, and Jitter. However, investigations into acoustic parameter alterations in individuals with PD are limited. METHODS We conducted an experimental study involving 20 PD patients (six women and fourteen men). Subjective measures of voice (VHI-30 scale and GRBAS) and objective measures using the OnlineLAB App tool for analyzing biomechanical correlates of voice were recorded. The app analyzed a total of 22 biomechanical parameters of voice. RESULTS The results of subjective measures were consistent with findings from previous studies. However, the results of objective measures did not align with studies that employed acoustic measures. CONCLUSIONS The biomechanical analysis revealed alterations in various parameters according to gender. These findings open up a new avenue of research in voice analysis for patients with PD, whether through acoustic or biomechanical analysis, aiming to determine whether the observed changes in these patients' voices are attributable to age or disease progression. This line of investigation will help elucidate the relative contribution of these factors to vocal alterations in PD patients and provide a more comprehensive understanding of the underlying mechanisms.
Collapse
Affiliation(s)
- Tatiana Romero Arias
- Faculty of Health Sciences, Speech Therapy Section, Pontifical University of Salamanca, Salamanca, Spain
| | - Inés Redondo Cortés
- Faculty of Health Sciences, Speech Therapy Section, Pontifical University of Salamanca, Salamanca, Spain
| | - Adrián Pérez Del Olmo
- Faculty of Health Sciences, Speech Therapy Section, Pontifical University of Salamanca, Salamanca, Spain
| |
Collapse
|
13
|
Bóna J. Speech rate and fluency in young-onset Parkinson's disease: A longitudinal case study from early to post brain surgery stage. CLINICAL LINGUISTICS & PHONETICS 2023; 37:385-397. [PMID: 36314241 DOI: 10.1080/02699206.2022.2138784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 10/11/2022] [Accepted: 10/13/2022] [Indexed: 05/20/2023]
Abstract
The aim of this study is to analyse the speech rate, pausing and fluency of a patient with young-onset Parkinson's Disease in different stages of the disease. Speech samples of the patient were recorded in the early stages of the disease until after the brain surgery. The recordings were compared to the speech of healthy control speakers. Speech rate, articulation rate, pausing and the frequency of disfluencies were analysed. Results show that all parameters are influenced by the severity of the disease, but articulation rate is the most affected.
Collapse
Affiliation(s)
- Judit Bóna
- Department of Applied Linguistics and Phonetics, ELTE Eötvös Loránd University, Budapest, Hungary
| |
Collapse
|
14
|
Campi M, Peters GW, Toczydlowska D. Ataxic speech disorders and Parkinson's disease diagnostics via stochastic embedding of empirical mode decomposition. PLoS One 2023; 18:e0284667. [PMID: 37099544 PMCID: PMC10132693 DOI: 10.1371/journal.pone.0284667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 04/05/2023] [Indexed: 04/27/2023] Open
Abstract
Medical diagnostic methods that utilise modalities of patient symptoms such as speech are increasingly being used for initial diagnostic purposes and monitoring disease state progression. Speech disorders are particularly prevalent in neurological degenerative diseases such as Parkinson's disease, the focus of the study undertaken in this work. We will demonstrate state-of-the-art statistical time-series methods that combine elements of statistical time series modelling and signal processing with modern machine learning methods based on Gaussian process models to develop methods to accurately detect a core symptom of speech disorder in individuals who have Parkinson's disease. We will show that the proposed methods out-perform standard best practices of speech diagnostics in detecting ataxic speech disorders, and we will focus the study, particularly on a detailed analysis of a well regarded Parkinson's data speech study publicly available making all our results reproducible. The methodology developed is based on a specialised technique not widely adopted in medical statistics that found great success in other domains such as signal processing, seismology, speech analysis and ecology. In this work, we will present this method from a statistical perspective and generalise it to a stochastic model, which will be used to design a test for speech disorders when applied to speech time series signals. As such, this work is making contributions both of a practical and statistical methodological nature.
Collapse
Affiliation(s)
- Marta Campi
- CERIAH, Institut de L’Audition, Institut Pasteur, Paris, France
| | - Gareth W. Peters
- Department of Statistics & Applied Probability, University of California, Santa Barbara (UCSB), Santa Barbara, California, United States of America
| | - Dorota Toczydlowska
- School of Mathematics and Physical Science, University of Technology Sydney, Sydney, Australia
| |
Collapse
|
15
|
Biswas SK, Nath Boruah A, Saha R, Raj RS, Chakraborty M, Bordoloi M. Early detection of Parkinson disease using stacking ensemble method. Comput Methods Biomech Biomed Engin 2023; 26:527-539. [PMID: 35587795 DOI: 10.1080/10255842.2022.2072683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Parkinson's disease (PD) is a common progressive neurodegenerative disorder that occurs due to corrosion of the substantianigra, located in the thalamic region of the human brain, and is responsible for the transmission of neural signals throughout the human body using brain chemical, termed as "dopamine." Diagnosis of PD is difficult, as it is often affected by the characteristics of the medical data of the patients, which include the presence of various indicators, imbalance cases of patients' data records, similar cases of healthy/affected persons, etc. Hence, sometimes the process of diagnosis may also be affected by human error. To overcome this problem some intelligent models have been proposed; however, most of them are single classifier-based models and due to this these models cannot handle noisy and imbalanced data properly and thus sometimes overfit the model. To reduce bias and variance, and to avoid overfitting of a single classifier-based model, this paper proposes an ensemble-based PD diagnosis model, named Ensembled Expert System for Diagnosis of Parkinson's Disease (EESDPD) with relevant features and a simple stacking ensemble technique. The proposed EESDPD aggregates diverse assumptions for making the prediction. The performance of the proposed EESDPD is compared with the performances of logistic regression, SVM, Naïve Bayes, Random Forest, XGBoost, simple Decision Tree, B-TDS-PD and B-TESM-PD in terms of classification accuracy, precision, recall and F1-score measures.
Collapse
Affiliation(s)
- Saroj Kumar Biswas
- Computer Science and Engineering Department, National Institute of Technology, Silchar, India
| | - Arpita Nath Boruah
- Computer Science and Engineering Department, National Institute of Technology, Silchar, India
| | - Rajib Saha
- Computer Science and Engineering Department, National Institute of Technology, Silchar, India
| | - Ravi Shankar Raj
- Computer Science and Engineering Department, National Institute of Technology, Silchar, India
| | - Manomita Chakraborty
- School of Computer Science and Engineering, VIT-AP University, Amaravathi, India
| | - Monali Bordoloi
- School of Computer Science and Engineering, VIT-AP University, Amaravathi, India
| |
Collapse
|
16
|
Alshammri R, Alharbi G, Alharbi E, Almubark I. Machine learning approaches to identify Parkinson's disease using voice signal features. Front Artif Intell 2023; 6:1084001. [PMID: 37056913 PMCID: PMC10086231 DOI: 10.3389/frai.2023.1084001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Accepted: 02/17/2023] [Indexed: 03/30/2023] Open
Abstract
Parkinson's Disease (PD) is the second most common age-related neurological disorder that leads to a range of motor and cognitive symptoms. A PD diagnosis is difficult since its symptoms are quite similar to those of other disorders, such as normal aging and essential tremor. When people reach 50, visible symptoms such as difficulties walking and communicating begin to emerge. Even though there is no cure for PD, certain medications can relieve some of the symptoms. Patients can maintain their lifestyles by controlling the complications caused by the disease. At this point, it is essential to detect this disease and prevent it from progressing. The diagnosis of the disease has been the subject of much research. In our project, we aim to detect PD using different types of Machine Learning (ML), and Deep Learning (DL) models such as Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), K-Nearest Neighbor (KNN), and Multi-Layer Perceptron (MLP) to differentiate between healthy and PD patients by voice signal features. The dataset taken from the University of California at Irvine (UCI) machine learning repository consisted of 195 voice recordings of examinations carried out on 31 patients. Moreover, our models were trained using different techniques such as Synthetic Minority Over-sampling Technique (SMOTE), Feature Selection, and hyperparameter tuning (GridSearchCV) to enhance their performance. At the end, we found that MLP and SVM with a ratio of 70:30 train/test split using GridSearchCV with SMOTE gave the best results for our project. MLP performed with an overall accuracy of 98.31%, an overall recall of 98%, an overall precision of 100%, and f1-score of 99%. In addition, SVM performed with an overall accuracy of 95%, an overall recall of 96%, an overall precision of 98%, and f1-score of 97%. The experimental results of this research imply that the proposed method can be used to reliably predict PD and can be easily incorporated into healthcare for diagnosis purposes.
Collapse
|
17
|
Khaskhoussy R, Ayed YB. Improving Parkinson’s disease recognition through voice analysis using deep learning. Pattern Recognit Lett 2023. [DOI: 10.1016/j.patrec.2023.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2023]
|
18
|
Ma A, Desai N, Lau KK, Palaniswami M, O'Brien TJ, Palaniswami P, Thyagarajan D. Automated measurement of inter-arytenoid distance on 4D laryngeal CT: A validation study. PLoS One 2023; 18:e0279927. [PMID: 36652423 PMCID: PMC9847963 DOI: 10.1371/journal.pone.0279927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 12/16/2022] [Indexed: 01/19/2023] Open
Abstract
Changes to the voice are prevalent and occur early in Parkinson's disease. Correlates of these voice changes on four-dimensional laryngeal computed-tomography imaging, such as the inter-arytenoid distance, are promising biomarkers of the disease's presence and severity. However, manual measurement of the inter-arytenoid distance is a laborious process, limiting its feasibility in large-scale research and clinical settings. Automated methods of measurement provide a solution. Here, we present a machine-learning module which determines the inter-arytenoid distance in an automated manner. We obtained automated inter-arytenoid distance readings on imaging from participants with Parkinson's disease as well as healthy controls, and then validated these against manually derived estimates. On a modified Bland-Altman analysis, we found a mean bias of 1.52 mm (95% limits of agreement -1.7 to 4.7 mm) between the automated and manual techniques, which improves to a mean bias of 0.52 mm (95% limits of agreement -1.9 to 2.9 mm) when variability due to differences in slice selection between the automated and manual methods are removed. Our results demonstrate that estimates of the inter-arytenoid distance with our automated machine-learning module are accurate, and represents a promising tool to be utilized in future work studying the laryngeal changes in Parkinson's disease.
Collapse
Affiliation(s)
- Andrew Ma
- Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria, Australia
- Department of Neurology, Alfred Health, Melbourne, Victoria, Australia
- Department of Neurology, Monash Health, Melbourne, Victoria, Australia
| | - Nandakishor Desai
- Department of Electrical and Electronic Engineering, The University of Melbourne, Melbourne, Victoria, Australia
| | - Kenneth K Lau
- School of Clinical Sciences, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Victoria, Australia
- Monash Health Imaging, Monash Health, Melbourne, Victoria, Australia
| | - Marimuthu Palaniswami
- Department of Electrical and Electronic Engineering, The University of Melbourne, Melbourne, Victoria, Australia
| | - Terence J O'Brien
- Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria, Australia
- Department of Neurology, Alfred Health, Melbourne, Victoria, Australia
| | - Paari Palaniswami
- School of Computing and Information Systems, Faculty of Engineering and Information Technology, The University of Melbourne, Melbourne, Victoria, Australia
| | - Dominic Thyagarajan
- Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria, Australia
- Department of Neurology, Alfred Health, Melbourne, Victoria, Australia
- Department of Neurology, Monash Health, Melbourne, Victoria, Australia
| |
Collapse
|
19
|
A type-2 neuro-fuzzy system with a novel learning method for Parkinson’s disease diagnosis. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04276-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
20
|
Wang Q, Fu Y, Shao B, Chang L, Ren K, Chen Z, Ling Y. Early detection of Parkinson’s disease from multiple signal speech: Based on Mandarin language dataset. Front Aging Neurosci 2022; 14:1036588. [DOI: 10.3389/fnagi.2022.1036588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Accepted: 10/20/2022] [Indexed: 11/11/2022] Open
Abstract
Parkinson’s disease (PD) is a neurodegenerative disorder that negatively affects millions of people. Early detection is of vital importance. As recent researches showed dysarthria level provides good indicators to the computer-assisted diagnosis and remote monitoring of patients at the early stages. It is the goal of this study to develop an automatic detection method based on newest collected Chinese dataset. Unlike English, no agreement was reached on the main features indicating language disorders due to vocal organ dysfunction. Thus, one of our approaches is to classify the speech phonation and articulation with a machine learning-based feature selection model. Based on a relatively big sample, three feature selection algorithms (LASSO, mRMR, Relief-F) were tested to select the vocal features extracted from speech signals collected in a controlled setting, followed by four classifiers (Naïve Bayes, K-Nearest Neighbor, Logistic Regression and Stochastic Gradient Descent) to detect the disorder. The proposed approach shows an accuracy of 75.76%, sensitivity of 82.44%, specificity of 73.15% and precision of 76.57%, indicating the feasibility and promising future for an automatic and unobtrusive detection on Chinese PD. The comparison among the three selection algorithms reveals that LASSO selector has the best performance regardless types of vocal features. The best detection accuracy is obtained by SGD classifier, while the best resulting sensitivity is obtained by LR classifier. More interestingly, articulation features are more representative and indicative than phonation features among all the selection and classifying algorithms. The most prominent articulation features are F1, F2, DDF1, DDF2, BBE and MFCC.
Collapse
|
21
|
On the Relationship between Speech Intelligibility and Fluency Indicators among English-Speaking Individuals with Parkinson’s Diseases. Behav Neurol 2022; 2022:1224680. [PMID: 36225387 PMCID: PMC9550446 DOI: 10.1155/2022/1224680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 08/23/2022] [Accepted: 09/09/2022] [Indexed: 12/03/2022] Open
Abstract
The purpose of the study is to investigate how much of variance in Parkinson's Disease (PD) individuals' speech intelligibility could be predicted by seven speech fluency indicators (i.e., repetition, omission, distortion, correction, unfilled pauses, filled pauses, and speaking rate). Speech data were retrieved from a database containing a reading task produced by a group of 16 English-speaking individuals with PD (Jaeger, Trivedi & Stadtchnitzer, 2019). The results from a multiple regression indicated that an addition of 54% of variance in the speech intelligibility scores among individuals with PD could be accounted for after the speakers' PD severity level measured based on Hoehn and Yahr's (1967) disease stage was included as a covariate. In addition, omission and correction were the two fluency indicators that contributed to the general intelligibility score in a statistically significant way. Specifically, for every one-unit gain in the number of correction and omission, speech intelligibility scores would decline by 0.687 and 0.131 point (out of a 7-point scale), respectively. The current study hence supported Magee, Copland, and Vogel's (2019) view that the language production abilities and quantified dysarthria measures among individuals with PD should be explored together. Additionally, the clinical implications based on the current findings were discussed.
Collapse
|
22
|
Terriza M, Navarro J, Retuerta I, Alfageme N, San-Segundo R, Kontaxakis G, Garcia-Martin E, Marijuan PC, Panetsos F. Use of Laughter for the Detection of Parkinson's Disease: Feasibility Study for Clinical Decision Support Systems, Based on Speech Recognition and Automatic Classification Techniques. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:10884. [PMID: 36078600 PMCID: PMC9518165 DOI: 10.3390/ijerph191710884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 08/25/2022] [Accepted: 08/27/2022] [Indexed: 06/15/2023]
Abstract
Parkinson's disease (PD) is an incurable neurodegenerative disorder which affects over 10 million people worldwide. Early detection and correct evaluation of the disease is critical for appropriate medication and to slow the advance of the symptoms. In this scenario, it is critical to develop clinical decision support systems contributing to an early, efficient, and reliable diagnosis of this illness. In this paper we present a feasibility study for a clinical decision support system for the diagnosis of PD based on the acoustic characteristics of laughter. Our decision support system is based on laugh analysis with speech recognition methods and automatic classification techniques. We evaluated different cepstral coefficients to identify laugh characteristics of healthy and ill subjects combined with machine learning classification models. The decision support system reached 83% accuracy rate with an AUC value of 0.86 for PD-healthy laughs classification in a database of 20,000 samples randomly generated from a pool of 120 laughs from healthy and PD subjects. Laughter could be employed for the efficient and reliable detection of PD; such a detection system can be achieved using speech recognition and automatic classification techniques; a clinical decision support system can be built using the above techniques. Significance: PD clinical decision support systems for the early detection of the disease will help to improve the efficiency of available and upcoming therapeutic treatments which, in turn, would improve life conditions of the affected people and would decrease costs and efforts in public and private healthcare systems.
Collapse
Affiliation(s)
- Miguel Terriza
- Neuro-Computing & Neuro-Robotics Research Group, Complutense University of Madrid, 28040 Madrid, Spain
- Innovation Group, Institute for Health Research San Carlos Clinical Hospital (IdISSC), 28040 Madrid, Spain
| | - Jorge Navarro
- Department of Economic Structure, CASETEM Research Group, Faculty of Economy, University of Zaragoza, 50009 Zaragoza, Spain
| | - Irene Retuerta
- Independent Researchers, Affiliated to Bioinformation and Systems Biology Group, Aragon Health Sciences Institute (IACS-IIS Aragon), 50009 Zaragoza, Spain
| | - Nuria Alfageme
- Neuro-Computing & Neuro-Robotics Research Group, Complutense University of Madrid, 28040 Madrid, Spain
- Innovation Group, Institute for Health Research San Carlos Clinical Hospital (IdISSC), 28040 Madrid, Spain
| | - Ruben San-Segundo
- Speech Technology Group, Information Processing and Telecommunications Center, 28040 Madrid, Spain
| | - George Kontaxakis
- Biomedical Image Technologies Group, Information Processing and Telecommunications Center, Universidad Politécnica de Madrid, 28040 Madrid, Spain
| | - Elena Garcia-Martin
- Department of Ophthalmology, Miguel Servet University Hospital, 50009 Zaragoza, Spain
- Miguel Servet Ophthalmology Research Group (GIMSO), Aragon Health Research Institute (IIS Aragón), University of Zaragoza, 50009 Zaragoza, Spain
| | - Pedro C. Marijuan
- Independent Researchers, Affiliated to Bioinformation and Systems Biology Group, Aragon Health Sciences Institute (IACS-IIS Aragon), 50009 Zaragoza, Spain
| | - Fivos Panetsos
- Neuro-Computing & Neuro-Robotics Research Group, Complutense University of Madrid, 28040 Madrid, Spain
- Innovation Group, Institute for Health Research San Carlos Clinical Hospital (IdISSC), 28040 Madrid, Spain
| |
Collapse
|
23
|
Khaskhoussy R, Ayed YB. Speech processing for early Parkinson’s disease diagnosis: machine learning and deep learning-based approach. SOCIAL NETWORK ANALYSIS AND MINING 2022. [DOI: 10.1007/s13278-022-00905-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
24
|
Auditory and somatosensory feedback mechanisms of laryngeal and articulatory speech motor control. Exp Brain Res 2022; 240:2155-2173. [PMID: 35736994 DOI: 10.1007/s00221-022-06395-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 06/02/2022] [Indexed: 11/04/2022]
Abstract
PURPOSE Speech production is a complex motor task involving multiple subsystems. The relationships between these subsystems need to be comprehensively investigated to understand the underlying mechanisms of speech production. The goal of this paper is to examine the differential contributions of 1) auditory and somatosensory feedback control mechanisms, and 2) laryngeal and articulatory speech production subsystems on speech motor control at an individual speaker level using altered auditory and somatosensory feedback paradigms. METHODS Twenty young adults completed speaking tasks in which sudden and unpredictable auditory and physical perturbations were applied to the laryngeal and articulatory speech production subsystems. Auditory perturbations were applied to laryngeal or articulatory acoustic features of speech. Physical perturbations were applied to the larynx and the jaw. Pearson-product moment correlation coefficients were calculated between 1) auditory and somatosensory reflexive responses to investigate relationships between auditory and somatosensory feedback control mechanisms, and 2) laryngeal and articulatory reflexive responses as well as acuity measures to investigate the relationship between auditory-motor features of laryngeal and articulatory subsystems. RESULTS No statistically significant correlations were found concerning the relationships between auditory and somatosensory feedback. No statistically significant correlations were found between auditory-motor features in the laryngeal and articulatory control subsystems. CONCLUSION Results suggest that the laryngeal and articulatory speech production subsystems operate with differential auditory and somatosensory feedback control mechanisms. The outcomes suggest that current models of speech motor control should consider decoupling laryngeal and articulatory domains to better model speech motor control processes.
Collapse
|
25
|
Random Forest Algorithm Based on Speech for Early Identification of Parkinson's Disease. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:3287068. [PMID: 35586090 PMCID: PMC9110120 DOI: 10.1155/2022/3287068] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 02/04/2022] [Indexed: 11/17/2022]
Abstract
To investigate the effectiveness of identifying patients with Parkinson's disease (PD) from speech signals, various acoustic parameters including prosodic and segmental features are extracted from speech and then the random forest classification (RF) algorithm based on these acoustic parameters is applied to diagnose early-stage PD patients. To validate the proposed method of RF algorithm in early-stage PD identification, this study compares the accuracy rate of RF with that of neurologists' judgments based on auditory test outcomes, and the results clearly show the superiority of the proposed method over its rival. Random forest algorithm based on speech can improve the accuracy of patients' identification, which provides an efficient auxiliary method in the early diagnosis of PD patients.
Collapse
|
26
|
Medina CA, Vargas E, Munger SJ, Miller JE. Vocal changes in a zebra finch model of Parkinson's disease characterized by alpha-synuclein overexpression in the song-dedicated anterior forebrain pathway. PLoS One 2022; 17:e0265604. [PMID: 35507553 PMCID: PMC9067653 DOI: 10.1371/journal.pone.0265604] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 03/06/2022] [Indexed: 11/18/2022] Open
Abstract
Deterioration in the quality of a person's voice and speech is an early marker of Parkinson's disease (PD). In humans, the neural circuit that supports vocal motor control consists of a cortico-basal ganglia-thalamo-cortico loop. The basal ganglia regions, striatum and globus pallidus, in this loop play a role in modulating the acoustic features of vocal behavior such as loudness, pitch, and articulatory rate. In PD, this area is implicated in pathogenesis. In animal models of PD, the accumulation of toxic aggregates containing the neuronal protein alpha-synuclein (αsyn) in the midbrain and striatum result in limb and vocal motor impairments. It has been challenging to study vocal impairments given the lack of well-defined cortico-basal ganglia circuitry for vocalization in rodent models. Furthermore, whether deterioration of voice quality early in PD is a direct result of αsyn-induced neuropathology is not yet known. Here, we take advantage of the well-characterized vocal circuits of the adult male zebra finch songbird to experimentally target a song-dedicated pathway, the anterior forebrain pathway, using an adeno-associated virus expressing the human wild-type αsyn gene, SNCA. We found that overexpression of αsyn in this pathway coincides with higher levels of insoluble, monomeric αsyn compared to control finches. Impairments in song production were also detected along with shorter and poorer quality syllables, which are the most basic unit of song. These vocal changes are similar to the vocal abnormalities observed in individuals with PD.
Collapse
Affiliation(s)
- Cesar A. Medina
- Graduate Interdisciplinary Program in Neuroscience, University of Arizona, Tucson, Arizona, United State of America
- Department of Neuroscience, University of Arizona, Tucson, Arizona, United States of America
| | - Eddie Vargas
- Department of Neuroscience, University of Arizona, Tucson, Arizona, United States of America
| | - Stephanie J. Munger
- Department of Neuroscience, University of Arizona, Tucson, Arizona, United States of America
| | - Julie E. Miller
- Graduate Interdisciplinary Program in Neuroscience, University of Arizona, Tucson, Arizona, United State of America
- Department of Neuroscience, University of Arizona, Tucson, Arizona, United States of America
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona, United States of America
- Department of Neurology, University of Arizona, Tucson, Arizona, United States of America
- BIO5 Institute, University of Arizona, Tucson, Arizona, United States of America
| |
Collapse
|
27
|
Darling-White M, Anspach Z, Huber JE. Longitudinal Effects of Parkinson's Disease on Speech Breathing During an Extemporaneous Connected Speech Task. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:1402-1415. [PMID: 35302868 PMCID: PMC9499370 DOI: 10.1044/2022_jslhr-21-00485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 11/17/2021] [Accepted: 01/03/2022] [Indexed: 06/14/2023]
Abstract
PURPOSE A critical component to the development of any type of intervention to improve speech production in individuals with Parkinson's disease (PD) is a complete understanding of the speech impairments present at each stage of the disease and how these impairments change with disease progression. The purpose of this longitudinal study was to examine the impact of disease on speech production and speech breathing during an extemporaneous speech task in individuals with PD over the course of approximately 3.5 years. METHOD Eight individuals with PD and eight age- and sex-matched control participants produced an extemporaneous connected speech task on two occasions (Time 1 and Time 2) an average of 3 years 7 months apart. Dependent variables included sound pressure level; utterance length; speech rate; lung volume initiation, termination, and excursion; and percent vital capacity per syllable. RESULTS From Time 1 to Time 2, individuals with PD demonstrated decreased utterance length and lung volume initiation, termination, and excursion and increased speech rate. Control participants demonstrated decreased utterance length and lung volume termination and increased lung volume excursion and percent vital capacity per syllable from Time 1 to Time 2. CONCLUSIONS Changes in speech production and speech breathing variables experienced by individuals with PD over the course of several years are related to their disease process and not typical aging. Changes to speech breathing highlight the need to provide intervention focused on increasing efficient respiratory patterning for speech production.
Collapse
Affiliation(s)
- Meghan Darling-White
- Department of Speech, Language, and Hearing Sciences, The University of Arizona, Tucson
| | - Zeina Anspach
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN
| | - Jessica E. Huber
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN
| |
Collapse
|
28
|
Quan C, Ren K, Luo Z, Chen Z, Ling Y. End-to-end deep learning approach for Parkinson’s disease detection from speech signals. Biocybern Biomed Eng 2022. [DOI: 10.1016/j.bbe.2022.04.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
29
|
Suppa A, Costantini G, Asci F, Di Leo P, Al-Wardat MS, Di Lazzaro G, Scalise S, Pisani A, Saggio G. Voice in Parkinson's Disease: A Machine Learning Study. Front Neurol 2022; 13:831428. [PMID: 35242101 PMCID: PMC8886162 DOI: 10.3389/fneur.2022.831428] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 01/10/2022] [Indexed: 12/13/2022] Open
Abstract
Introduction Parkinson's disease (PD) is characterized by specific voice disorders collectively termed hypokinetic dysarthria. We here investigated voice changes by using machine learning algorithms, in a large cohort of patients with PD in different stages of the disease, OFF and ON therapy. Methods We investigated 115 patients affected by PD (mean age: 68.2 ± 9.2 years) and 108 age-matched healthy subjects (mean age: 60.2 ± 11.0 years). The PD cohort included 57 early-stage patients (Hoehn &Yahr ≤ 2) who never took L-Dopa for their disease at the time of the study, and 58 mid-advanced-stage patients (Hoehn &Yahr >2) who were chronically-treated with L-Dopa. We clinically evaluated voices using specific subitems of the Unified Parkinson's Disease Rating Scale and the Voice Handicap Index. Voice samples recorded through a high-definition audio recorder underwent machine learning analysis based on the support vector machine classifier. We also calculated the receiver operating characteristic curves to examine the diagnostic accuracy of the analysis and assessed possible clinical-instrumental correlations. Results Voice is abnormal in early-stage PD and as the disease progresses, voice increasingly degradres as demonstrated by high accuracy in the discrimination between healthy subjects and PD patients in the early-stage and mid-advanced-stage. Also, L-dopa therapy improves but not restore voice in PD as shown by high accuracy in the comparison between patients OFF and ON therapy. Finally, for the first time we achieved significant clinical-instrumental correlations by using a new score (LR value) calculated by machine learning. Conclusion Voice is abnormal in early-stage PD, progressively degrades in mid-advanced-stage and can be improved but not restored by L-Dopa. Lastly, machine learning allows tracking disease severity and quantifying the symptomatic effect of L-Dopa on voice parameters with previously unreported high accuracy, thus representing a potential new biomarker of PD.
Collapse
Affiliation(s)
- Antonio Suppa
- Department of Human Neurosciences, Sapienza University of Rome, Rome, Italy.,IRCCS Neuromed Institute, Pozzilli, Italy
| | - Giovanni Costantini
- Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
| | | | - Pietro Di Leo
- Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
| | | | - Giulia Di Lazzaro
- Neurology Unit, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Simona Scalise
- Department of System Medicine UOSD Parkinson, University of Rome Tor Vergata, Rome, Italy
| | - Antonio Pisani
- Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy.,IRCCS Mondino Foundation, Pavia, Italy
| | - Giovanni Saggio
- Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
| |
Collapse
|
30
|
Giannakopoulou KM, Roussaki I, Demestichas K. Internet of Things Technologies and Machine Learning Methods for Parkinson's Disease Diagnosis, Monitoring and Management: A Systematic Review. SENSORS (BASEL, SWITZERLAND) 2022; 22:1799. [PMID: 35270944 PMCID: PMC8915040 DOI: 10.3390/s22051799] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 02/17/2022] [Accepted: 02/21/2022] [Indexed: 12/15/2022]
Abstract
Parkinson's disease is a chronic neurodegenerative disease that affects a large portion of the population, especially the elderly. It manifests with motor, cognitive and other types of symptoms, decreasing significantly the patients' quality of life. The recent advances in the Internet of Things and Artificial Intelligence fields, including the subdomains of machine learning and deep learning, can support Parkinson's disease patients, their caregivers and clinicians at every stage of the disease, maximizing the treatment effectiveness and minimizing the respective healthcare costs at the same time. In this review, the considered studies propose machine learning models, trained on data acquired via smart devices, wearable or non-wearable sensors and other Internet of Things technologies, to provide predictions or estimations regarding Parkinson's disease aspects. Seven hundred and seventy studies have been retrieved from three dominant academic literature databases. Finally, one hundred and twelve of them have been selected in a systematic way and have been considered in the state-of-the-art systematic review presented in this paper. These studies propose various methods, applied on various sensory data to address different Parkinson's disease-related problems. The most widely deployed sensors, the most commonly addressed problems and the best performing algorithms are highlighted. Finally, some challenges are summarized along with some future considerations and opportunities that arise.
Collapse
Affiliation(s)
- Konstantina-Maria Giannakopoulou
- School of Electrical and Computer Engineering, National Technical University of Athens, 15773 Athens, Greece; (K.-M.G.); (K.D.)
- Institute of Communication and Computer Systems, 10682 Athens, Greece
| | - Ioanna Roussaki
- School of Electrical and Computer Engineering, National Technical University of Athens, 15773 Athens, Greece; (K.-M.G.); (K.D.)
- Institute of Communication and Computer Systems, 10682 Athens, Greece
| | - Konstantinos Demestichas
- School of Electrical and Computer Engineering, National Technical University of Athens, 15773 Athens, Greece; (K.-M.G.); (K.D.)
- Institute of Communication and Computer Systems, 10682 Athens, Greece
| |
Collapse
|
31
|
Things to Consider When Automatically Detecting Parkinson’s Disease Using the Phonation of Sustained Vowels: Analysis of Methodological Issues. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12030991] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Diagnosing Parkinson’s Disease (PD) necessitates monitoring symptom progression. Unfortunately, diagnostic confirmation often occurs years after disease onset. A more sensitive and objective approach is paramount to the expedient diagnosis and treatment of persons with PD (PwPDs). Recent studies have shown that we can train accurate models to detect signs of PD from audio recordings of confirmed PwPDs. However, disparities exist between studies and may be caused, in part, by differences in employed corpora or methodologies. Our hypothesis is that unaccounted covariates in methodology, experimental design, and data preparation resulted in overly optimistic results in studies of PD automatic detection employing sustained vowels. These issues include record-wise fold creation rather than subject-wise; an imbalance of age between the PwPD and control classes; using too small of a corpus compared to the sizes of feature vectors; performing cross-validation without including development data; and the absence of cross-corpora testing to confirm results. In this paper, we evaluate the influence of these methodological issues in the automatic detection of PD employing sustained vowels. We perform several experiments isolating each issue to measure its influence employing three different corpora. Moreover, we analyze if the perceived dysphonia of the speakers could be causing differences in results between the corpora. Results suggest that each independent methodological issue analyzed has an effect on classification accuracy. Consequently, we recommend a list of methodological steps to be considered in future experiments to avoid overoptimistic or misleading results.
Collapse
|
32
|
Islam R, Abdel-Raheem E, Tarique M. A study of using cough sounds and deep neural networks for the early detection of Covid-19. BIOMEDICAL ENGINEERING ADVANCES 2022; 3:100025. [PMID: 35013733 PMCID: PMC8732907 DOI: 10.1016/j.bea.2022.100025] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Revised: 12/15/2021] [Accepted: 01/04/2022] [Indexed: 11/30/2022] Open
Abstract
The current clinical diagnosis of COVID-19 requires person-to-person contact, needs variable time to produce results, and is expensive. It is even inaccessible to the general population in some developing countries due to insufficient healthcare facilities. Hence, a low-cost, quick, and easily accessible solution for COVID-19 diagnosis is vital. This paper presents a study that involves developing an algorithm for automated and noninvasive diagnosis of COVID-19 using cough sound samples and a deep neural network. The cough sounds provide essential information about the behavior of glottis under different respiratory pathological conditions. Hence, the characteristics of cough sounds can identify respiratory diseases like COVID-19. The proposed algorithm consists of three main steps (a) extraction of acoustic features from the cough sound samples, (b) formation of a feature vector, and (c) classification of the cough sound samples using a deep neural network. The output from the proposed system provides a COVID-19 likelihood diagnosis. In this work, we consider three acoustic feature vectors, namely (a) time-domain, (b) frequency-domain, and (c) mixed-domain (i.e., a combination of features in both time-domain and frequency-domain). The performance of the proposed algorithm is evaluated using cough sound samples collected from healthy and COVID-19 patients. The results show that the proposed algorithm automatically detects COVID-19 cough sound samples with an overall accuracy of 89.2%, 97.5%, and 93.8% using time-domain, frequency-domain, and mixed-domain feature vectors, respectively. The proposed algorithm, coupled with its high accuracy, demonstrates that it can be used for quick identification or early screening of COVID-19. We also compare our results with that of some state-of-the-art works.
Collapse
Affiliation(s)
- Rumana Islam
- Department of Electrical and Computer Engineering, University of Windsor, ON N9B 3P4, Canada
| | - Esam Abdel-Raheem
- Department of Electrical and Computer Engineering, University of Windsor, ON N9B 3P4, Canada
| | - Mohammed Tarique
- Department of Electrical Engineering, University of Science and Technology of Fujairah, P.O. Box 2202, UAE
| |
Collapse
|
33
|
A Simple and Effective Approach Based on a Multi-Level Feature Selection for Automated Parkinson's Disease Detection. J Pers Med 2022; 12:jpm12010055. [PMID: 35055370 PMCID: PMC8781034 DOI: 10.3390/jpm12010055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 12/27/2021] [Accepted: 12/30/2021] [Indexed: 12/07/2022] Open
Abstract
Parkinson’s disease (PD), which is a slowly progressing neurodegenerative disorder, negatively affects people’s daily lives. Early diagnosis is of great importance to minimize the effects of PD. One of the most important symptoms in the early diagnosis of PD disease is the monotony and distortion of speech. Artificial intelligence-based approaches can help specialists and physicians to automatically detect these disorders. In this study, a new and powerful approach based on multi-level feature selection was proposed to detect PD from features containing voice recordings of already-diagnosed cases. At the first level, feature selection was performed with the Chi-square and L1-Norm SVM algorithms (CLS). Then, the features that were extracted from these algorithms were combined to increase the representation power of the samples. At the last level, those samples that were highly distinctive from the combined feature set were selected with feature importance weights using the ReliefF algorithm. In the classification stage, popular classifiers such as KNN, SVM, and DT were used for machine learning, and the best performance was achieved with the KNN classifier. Moreover, the hyperparameters of the KNN classifier were selected with the Bayesian optimization algorithm, and the performance of the proposed approach was further improved. The proposed approach was evaluated using a 10-fold cross-validation technique on a dataset containing PD and normal classes, and a classification accuracy of 95.4% was achieved.
Collapse
|
34
|
Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem. MEDICINA (KAUNAS, LITHUANIA) 2021; 57:medicina57111217. [PMID: 34833435 PMCID: PMC8619928 DOI: 10.3390/medicina57111217] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2021] [Revised: 10/29/2021] [Accepted: 11/05/2021] [Indexed: 11/16/2022]
Abstract
Background and Objectives: Recently, many studies have focused on the early detection of Parkinson's disease (PD). This disease belongs to a group of neurological problems that immediately affect brain cells and influence the movement, hearing, and various cognitive functions. Medical data sets are often not equally distributed in their classes and this gives a bias in the classification of patients. We performed a Hybrid feature selection framework that can deal with imbalanced datasets like PD. Use the SOMTE algorithm to deal with unbalanced datasets. Removing the contradiction from the features in the dataset and decrease the processing time by using Recursive Feature Elimination (RFE), and Principle Component Analysis (PCA). Materials and Methods: PD acoustic datasets and the characteristics of control subjects were used to construct classification models such as Bagging, K-nearest neighbour (KNN), multilayer perceptron, and the support vector machine (SVM). In the prepressing stage, the synthetic minority over-sampling technique (SMOTE) with two-feature selection RFE and PCA were used. The PD dataset comprises a large difference between the numbers of the infected and uninfected patients, which causes the classification bias problem. Therefore, SMOTE was used to resolve this problem. Results: For model evaluation, the train-test split technique was used for the experiment. All the models were Grid-search tuned, the evaluation results of the SVM model showed the highest accuracy of 98.2%, and the KNN model exhibited the highest specificity of 99%. Conclusions: the proposed method is compared with the current modern methods of detecting Parkinson's disease and other methods for medical diseases, it was noted that our developed system could treat data bias and reach a high prediction of PD and this can be beneficial for health organizations to properly prioritize assets.
Collapse
|
35
|
Ma A, Lau KK, Thyagarajan D. Radiological correlates of vocal fold bowing as markers of Parkinson's disease progression: A cross-sectional study utilizing dynamic laryngeal CT. PLoS One 2021; 16:e0258786. [PMID: 34653231 PMCID: PMC8519464 DOI: 10.1371/journal.pone.0258786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 10/05/2021] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE To determine whether arytenoid cartilage position and dynamics change with advancing duration and severity (as graded by MDS-UPDRS part III scores) in Parkinson's disease, in a cross-sectional study design, we performed laryngeal four-dimensional computed tomography (4D-CT) in people with Parkinson's disease and controls. METHODS 31 people with Parkinson's disease covering a range of disease duration and severity and 19 controls underwent laryngeal 4D-CT whilst repeatedly vocalizing. We measured on each CT volume the glottic area (GA), inter-arytenoid distance (IAD), IAD-Area index (IAI) and arytenoid cartilage velocity ([Formula: see text]). RESULTS People with Parkinson's disease had reductions in the mean/effective minimum IAD when compared to controls, while mean/effective minimum GA and mean/effective maximum IAI were increased. Arytenoid cartilage velocities showed no difference. On Spearman correlation analyses, advancing disease duration and severity of PD showed moderately strong and significant correlations with increasing mean/effective minimum GA, increasing mean/effective maximum IAI and decreasing effective minimum IAD. Linear mixed models which considered the effects of intra and inter-individual variation showed that both disease duration (b = -0.011, SEb = 0.053, 95% CI [-0.022, 0], t(27) = -2.10, p = 0.045) and severity (b = -0.069, SEb = 0.032, 95% CI [-0.14,-0.0039], t(27) = -2.17, p = 0.039) were significant predictors for IAD, and also for transformed values of the GA and IAI. CONCLUSIONS There are progressive alterations in phonatory posturing as Parkinson's disease advances. The increases in GA despite reductions in IAD are concordant with prior observations of vocal fold bowing. Our study provides a basis for using laryngeal 4D-CT to assess disease progression in Parkinson's disease.
Collapse
Affiliation(s)
- Andrew Ma
- Department of Neurosciences, Central Clinical School, Monash University, Melbourne, Australia
- Department of Neurology, Monash Health, Melbourne, Australia
- Department of Neurology, Alfred Health, Melbourne, Australia
| | - Kenneth K. Lau
- School of Clinical Sciences, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
- Monash Health Imaging, Monash Health, Melbourne, Australia
| | - Dominic Thyagarajan
- Department of Neurosciences, Central Clinical School, Monash University, Melbourne, Australia
- Department of Neurology, Monash Health, Melbourne, Australia
- Department of Neurology, Alfred Health, Melbourne, Australia
- * E-mail:
| |
Collapse
|
36
|
Laganas C, Iakovakis D, Hadjidimitriou S, Charisis V, Dias SB, Bostantzopoulou S, Katsarou Z, Klingelhoefer L, Reichmann H, Trivedi D, Chaudhuri KR, Hadjileontiadis LJ. Parkinson's Disease Detection Based on Running Speech Data From Phone Calls. IEEE Trans Biomed Eng 2021; 69:1573-1584. [PMID: 34596531 DOI: 10.1109/tbme.2021.3116935] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
OBJECTIVE Parkinson's Disease (PD) is a progressive neurodegenerative disorder, manifesting with subtle early signs, which often hinder timely and early diagnosis and treatment. The development of accessible, technology-based methods for longitudinal PD symptoms tracking in daily living offers the potential for transforming the disease assessment and accelerating PD diagnosis. METHODS A privacy-aware method for classifying PD patients and healthy controls (HC), on the grounds of speech impairment present in PD, is proposed here. Voice features from running speech signals were extracted from recordings passively captured over voice phone calls. Features are fed in a language-aware training of multiple- and single-instance learning classifiers, along with demographic variables, exploiting a multilingual cohort of 498 subjects (392/106 self-reported HC/PD patients) to classify PD. RESULTS By means of leave-one-subject-out cross-validation, the best-performing models yielded 0.69/0.68/0.63/0.83 area under the Receiver Operating Characteristic curve (AUC) for the binary classification of PD patient vs. HC in sub-cohorts of English/Greek/German/Portuguese-speaking subjects, respectively. Out-of-sample testing of the best performing models was conducted in an additional dataset, generated by 63 clinically-assessed subjects (24/39 HC/early PD patients). Testing has resulted in 0.84/0.93/0.83 AUC for the English/Greek/German-speaking sub-cohorts, respectively. Comparative analysis with other approaches for language-aware PD detection justified the efficiency of the proposed one, considering the ecological validity of the acquired voice data. CONCLUSIONS The present work demonstrates increased robustness in PD detection using voice data captured in-the-wild. SIGNIFICANCE A high-frequency, privacy-aware and unobtrusive PD screening tool is introduced for the first time, based on analysis of voice samples captured during routine phone calls.
Collapse
|
37
|
Silbergleit AK, Schultz L, Hamilton K, LeWitt PA, Sidiropoulos C. Self-Perception of Voice and Swallowing Handicap in Parkinson's Disease. JOURNAL OF PARKINSONS DISEASE 2021; 11:2027-2034. [PMID: 34366369 DOI: 10.3233/jpd-212621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Hypokinetic dysarthria and dysphagia are known features of Parkinson's disease; however, self-perception of their handicapping effects on emotional, physical, and functional aspects of quality of life over disease duration is less understood. OBJECTIVE 1) Based upon patient self-perception, to determine the relationship of the handicapping effects of dysphagia and dysphonia with time since diagnosis in individuals with Parkinson's disease; 2)To determine if there is a relationship between voice and swallowing handicap throughout the course of Parkinson's disease. METHOD 277 subjects completed the Dysphagia Handicap Index and the Voice Handicap Index. Subjects were divided into three groups based on disease duration: 0-4 years, 5-9 years, and 10 + years. RESULTS Subjects in the longer duration group identified significantly greater perceptions of voice and swallowing handicap compared to the shorter duration groups. There was a significant positive correlation between the DHI and VHI. CONCLUSION Self-perception of swallowing and voice handicap in Parkinson's disease are associated with later stages of disease and progress in a linear fashion. Self-perception of voice and swallowing handicap parallel each other throughout disease progression in Parkinson's disease. Individuals may be able to compensate for changes in voice and swallowing early while sensory perceptual feedback is intact. Results support early targeted questioning of patient self-perception of voice and swallowing handicap as identification of one problem indicates awareness of the other, thus creating an opportunity for early treatment and maintenance of swallowing and communication quality of life for as long as possible.
Collapse
Affiliation(s)
- Alice K Silbergleit
- Department of Neurology, Division of Speech-Language Sciences and Disorders, Henry Ford Health System, Detroit, MI, USA.,Department of Neurology, Wayne State University School of Medicine, Detroit, MI, USA
| | - Lonni Schultz
- Department of Public Health Sciences, Henry Ford Health System, Detroit, MI, USA
| | - Kendra Hamilton
- Department of Neurology, Henry Ford Health System, Detroit, MI, USA
| | - Peter A LeWitt
- Department of Neurology, Wayne State University School of Medicine, Detroit, MI, USA.,Department of Neurology, Henry Ford Health System, Detroit, MI, USA
| | | |
Collapse
|
38
|
Simonet C, Noyce A. Mild parkinsonian signs: the interface between aging and Parkinson’s disease. ADVANCES IN CLINICAL NEUROSCIENCE & REHABILITATION 2021. [DOI: 10.47795/khgp5988] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Mild Parkinsonian Signs (MPS) describe a spectrum that exists between the expected motor decline of normal aging and a more serious motor deterioration resulting from Parkinson’s disease (PD) and neurodegeneration. Although MPS are a feature of the prodromal stage of PD, their formal definition is unclear and still relies somewhat on conventional clinical criteria for PD. This review will summarise the early motor features of PD and methods of assessment, from conventional clinical scales to advances in quantitative measures. Finally, the boundaries of motor decline as part of normal aging and pathological neurodegeneration will be discussed.
Collapse
|
39
|
Objective vowel sound characteristics and their relationship with motor dysfunction in Asian Parkinson's disease patients. J Neurol Sci 2021; 426:117487. [PMID: 34004464 DOI: 10.1016/j.jns.2021.117487] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 05/05/2021] [Accepted: 05/10/2021] [Indexed: 11/23/2022]
Abstract
BACKGROUND Speech impairments are very common in patients with Parkinson's disease (PD). However, knowledge of their objective characteristics and relationship to other motor symptoms amongst Asian PD patients is limited. OBJECTIVES To identify objective vowel sound characteristics in Thai PD patients and correlate with disease severity, as determined by UPDRS and various sub-scores. METHOD We evaluated 100 Thai PD patients, with a mean age of 66.56 years (±7.52) and HY of 2.7 (±1.08), and 101 age-matched controls. Phonatory evaluation, comprising of 15 objective parameters, was conducted using the Multi-Dimensional Voice Programme with a sustained /a/ phonation. RESULTS PD patients exhibited significantly higher values of all dimensions of the phonatory parameters evaluated compared to controls (All, p < 0.001) except for duration of sustained phonation, which was significantly shorter in PD patients. When early- and advanced-stage patients were compared, significantly different parameters were limited to frequency perturbation parameters (Jitt, p = 0.01; RAP, p = 0.013; PPQ, p = 0.01; sPPQ, p = 0.001; vF0, p = 0.011), and NHR (p = 0.028). Several significant and moderate correlations were observed between both STD and frequency perturbation parameters and UPDRS-III, bradykinesia sub-score, and gait and postural instability sub-score. Both vF0, and STD significantly correlated with UPDRS-III and sub-scores in advanced stage patients. CONCLUSION Our study provides objective evidence of phonatory dysfunction in Asian PD patients with certain characteristics correlated with advanced stage or different motor dysfunction. Sustained vowel phonation is a promising digital outcome for global phenotyping a large number of PD patients.
Collapse
|
40
|
Yücelbaş C. A new approach: information gain algorithm-based k-nearest neighbors hybrid diagnostic system for Parkinson's disease. Phys Eng Sci Med 2021; 44:511-524. [PMID: 33852120 DOI: 10.1007/s13246-021-01001-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/09/2021] [Indexed: 11/28/2022]
Abstract
Parkinson's disease (PD) is a slow and insidiously progressive neurological brain disorder. The development of expert systems capable of automatically and highly accurately diagnosing early stages of PD based on speech signals would provide an important contribution to the health sector. For this purpose, the Information Gain Algorithm-based K-Nearest Neighbors (IGKNN) model was developed. This approach was applied to the feature data sets formed using the Tunable Q-factor Wavelet Transform (TQWT) method. First, 12 sub-feature data sets forming the TQWT feature group were analyzed separately after which the one with the best performance was selected, and the IGKNN model was applied to this sub-feature data set. Finally, it was observed that the performance results provided with the IGKNN system for this sub-feature data set were better than those for the complete set of data. According to the results, values of receiver operating characteristic and precision-recall curves exceeded 0.95, and a classification accuracy of almost 98% was obtained with the 22 features selected from this sub-group. In addition, the kappa coefficient was 0.933 and showed a perfect agreement between actual and predicted values. The performance of the IGKNN system was also compared with results from other studies in the literature in which the same data were used, and the approach proposed in this study far outperformed any approaches reported in the literature. Also, as in this IGKNN approach, an expert system that can diagnose PD and achieve maximum performance with fewer features from the audio signals has not been previously encountered.
Collapse
Affiliation(s)
- Cüneyt Yücelbaş
- Electrical-Electronics Engineering Department, Hakkari University, 30000, Hakkari, Turkey.
| |
Collapse
|
41
|
Tabashum T, Zaffer A, Yousefzai R, Colletta K, Jost MB, Park Y, Chawla J, Gaynes B, Albert MV, Xiao T. Detection of Parkinson's Disease Through Automated Pupil Tracking of the Post-illumination Pupillary Response. Front Med (Lausanne) 2021; 8:645293. [PMID: 33842509 PMCID: PMC8026862 DOI: 10.3389/fmed.2021.645293] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 03/01/2021] [Indexed: 12/12/2022] Open
Abstract
Parkinson's disease (PD) is one of the most common neurodegenerative disorders, but it is often diagnosed after the majority of dopaminergic cells are already damaged. It is critical to develop biomarkers to identify the disease as early as possible for early intervention. PD patients appear to have an altered pupillary response consistent with an abnormality in photoreceptive retinal ganglion cells. Tracking the pupil size manually is a tedious process and offline automated systems can be prone to errors that may require intervention; for this reason in this work we describe a system for pupil size estimation with a user interface to allow rapid adjustment of parameters and extraction of pupil parameters of interest for the present study. We implemented a user-friendly system designed for clinicians to automate the process of tracking the pupil diameter to measure the post-illumination pupillary response (PIPR), permit manual corrections when needed, and continue automation after correction. Tracking was automated using a Kalman filter estimating the pupil center and diameter over time. The resulting system was tested on a PD classification task in which PD subjects are known to have similar responses for two wavelengths of light. The pupillary response is measured in the contralateral eye to two different light stimuli (470 and 610 nm) for 19 PD and 10 control subjects. The measured Net PIPR indicating different responsiveness to the wavelengths was 0.13 mm for PD subjects and 0.61 mm for control subjects, demonstrating a highly significant difference (p < 0.001). Net PIPR has the potential to be a biomarker for PD, suggesting further study to determine clinical validity.
Collapse
Affiliation(s)
- Thasina Tabashum
- Department of Computer Science and Engineering, University of North Texas, Denton, TX, United States
| | - Adnaan Zaffer
- Edward Hines Jr. VA Medical Center, Hines, IL, United States
| | - Raman Yousefzai
- Edward Hines Jr. VA Medical Center, Hines, IL, United States
| | - Kalea Colletta
- Edward Hines Jr. VA Medical Center, Hines, IL, United States
| | - Mary Beth Jost
- Edward Hines Jr. VA Medical Center, Hines, IL, United States
| | - Youngsook Park
- Edward Hines Jr. VA Medical Center, Hines, IL, United States
| | | | - Bruce Gaynes
- Edward Hines Jr. VA Medical Center, Hines, IL, United States.,Department of Ophthalmology, Loyola University Chicago Stritch School of Medicine, Maywood, IL, United States
| | - Mark V Albert
- Department of Computer Science and Engineering, University of North Texas, Denton, TX, United States.,Department of Biomedical Engineering, University of North Texas, Denton, TX, United States
| | - Ting Xiao
- Department of Computer Science and Engineering, University of North Texas, Denton, TX, United States
| |
Collapse
|
42
|
Gómez A, Gómez P, Palacios D, Rodellar V, Nieto V, Álvarez A, Tsanas A. A Neuromotor to Acoustical Jaw-Tongue Projection Model With Application in Parkinson's Disease Hypokinetic Dysarthria. Front Hum Neurosci 2021; 15:622825. [PMID: 33790751 PMCID: PMC8005556 DOI: 10.3389/fnhum.2021.622825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 02/17/2021] [Indexed: 11/13/2022] Open
Abstract
Aim The present work proposes the study of the neuromotor activity of the masseter-jaw-tongue articulation during diadochokinetic exercising to establish functional statistical relationships between surface Electromyography (sEMG), 3D Accelerometry (3DAcc), and acoustic features extracted from the speech signal, with the aim of characterizing Hypokinetic Dysarthria (HD). A database of multi-trait signals of recordings from an age-matched control and PD participants are used in the experimental study. Hypothesis: The main assumption is that information between sEMG and 3D acceleration, and acoustic features may be quantified using linear regression methods. Methods Recordings from a cohort of eight age-matched control participants (4 males, 4 females) and eight PD participants (4 males, 4 females) were collected during the utterance of a diadochokinetic exercise (the fast repetition of diphthong [aI]). The dynamic and acoustic absolute kinematic velocities produced during the exercises were estimated by acoustic filter inversion and numerical integration and differentiation of the speech signal. The amplitude distributions of the absolute kinematic and acoustic velocities (AKV and AFV) are estimated to allow comparisons in terms of Mutual Information. Results The regression results show the relationships between sEMG and dynamic and acoustic estimates. The projection methodology may help in understanding the basic neuromotor muscle activity regarding neurodegenerative speech in remote monitoring neuromotor and neurocognitive diseases using speech as the vehicular tool, and in the study of other speech-related disorders. The study also showed strong and significant cross-correlations between articulation kinematics, both for the control and the PD cohorts. The absolute kinematic variables presents an observable difference for the PD participants compared to the control group. Conclusion Kinematic distributions derived from acoustic analysis may be useful biomarkers toward characterizing HD in neuromotor disorders providing new insights into PD.
Collapse
Affiliation(s)
- Andrés Gómez
- Old Medical School, Medical School, Usher Institute, University of Edinburgh, Edinburgh, United Kingdom.,NeuSpeLab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| | - Pedro Gómez
- NeuSpeLab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| | - Daniel Palacios
- NeuSpeLab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain.,Escuela Técnica Superior de Ingeniería Informática-Universidad Rey Juan Carlos, Móstoles, Spain
| | - Victoria Rodellar
- NeuSpeLab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| | - Víctor Nieto
- NeuSpeLab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| | - Agustín Álvarez
- NeuSpeLab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| | - Athanasios Tsanas
- Old Medical School, Medical School, Usher Institute, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
43
|
Chiu Y, Neel A, Loux T. Acoustic characteristics in relation to intelligibility reduction in noise for speakers with Parkinson's disease. CLINICAL LINGUISTICS & PHONETICS 2021; 35:222-236. [PMID: 32539544 DOI: 10.1080/02699206.2020.1777585] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Revised: 05/30/2020] [Accepted: 05/30/2020] [Indexed: 06/11/2023]
Abstract
Decreased speech intelligibility in noisy environments is frequently observed in speakers with Parkinson's disease (PD). This study investigated which acoustic characteristics across the speech subsystems contributed to poor intelligibility in noise for speakers with PD. Speech samples were obtained from 13 speakers with PD and five healthy controls reading 56 sentences. Intelligibility analysis was conducted in quiet and noisy listening conditions. Seventy-two young listeners transcribed the recorded sentences in quiet and another 72 listeners transcribed in noise. The acoustic characteristics of the speakers with PD who experienced large intelligibility reduction from quiet to noise were compared to those with smaller intelligibility reduction in noise and healthy controls. The acoustic measures in the study included second formant transitions, cepstral and spectral measures of voice (cepstral peak prominence and low/high spectral ratio), pitch variation, and articulation rate to represent speech components across speech subsystems of articulation, phonation, and prosody. The results show that speakers with PD who had larger intelligibility reduction in noise exhibited decreased second formant transition, limited cepstral and spectral variations, and faster articulation rate. These findings suggest that the adverse effect of noise on speech intelligibility in PD is related to speech changes in the articulatory and phonatory systems.
Collapse
Affiliation(s)
- Yi Chiu
- Department of Communication Sciences and Disorders, Saint Louis University , Saint Louis, MO, USA
| | - Amy Neel
- Department of Speech and Hearing Sciences, University of New Mexico , Albuquerque, NM, USA
| | - Travis Loux
- Department of Epidemiology and Biostatistics, Saint Louis University , Saint Louis, MO, USA
| |
Collapse
|
44
|
Age-related and noise-induced hearing loss alters grasshopper mouse (Onychomys) vocalizations. Hear Res 2021; 404:108210. [PMID: 33713993 DOI: 10.1016/j.heares.2021.108210] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 01/27/2021] [Accepted: 02/10/2021] [Indexed: 11/23/2022]
Abstract
Age-related and noise-induced hearing loss disorders are among the most common pathologies affecting Americans across their lifespans. Loss of auditory feedback due to hearing disorders is correlated with changes in voice and speech-motor control in humans. Although rodents are increasingly used to model human age- and noise-induced hearing loss, few studies have assessed vocal changes after acoustic trauma. Northern grasshopper mice (Onychomys leucogaster) represent a candidate model because their hearing sensitivity is matched to the frequencies of long-distance vocalizations that are produced using vocal fold vibrations similar to human speech. In this study, we quantified changes in auditory brainstem responses (ABRs) and vocalizations related to aging and noise-induced acoustic trauma. Mice showed a progressive decrease in hearing sensitivity across 4-32 kHz, with males losing hearing more rapidly than females. In addition, noise-exposed mice had a 61.55 dB SPL decrease in ABR sensitivity following a noise exposure, with some individuals exhibiting a 21.25 dB recovery 300-330 days after noise exposure. We also found that older grasshopper mice produced calls with lower fundamental frequency. Sex differences were measured in duration of calls with females producing longer calls with age. Our findings indicate that grasshopper mice experience age- and noise- induced hearing loss and concomitant changes in vocal output, making them a promising model for hearing and communication disorders.
Collapse
|
45
|
Zhang H, Song C, Rathore AS, Huang MC, Zhang Y, Xu W. mHealth Technologies Towards Parkinson's Disease Detection and Monitoring in Daily Life: A Comprehensive Review. IEEE Rev Biomed Eng 2021; 14:71-81. [PMID: 32365035 DOI: 10.1109/rbme.2020.2991813] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Parkinson's disease (PD) can gradually affect people's lives thus attracting tremendous attention. Early PD detection and treatment can help control the disease progress, relief from the symptoms and improve the patients' life quality. However, the current practice of PD diagnosis is conducted in a clinical setup and administrated by a PD specialist due to the early signs of PD are not noticeable in daily life. According to the report of CDC/NIH, the diagnosed time of PD ranges from 2-10 years after onset. Therefore, a more accessible PD diagnosis approach is urgently demanded. In recent years, mobile health (for short mHealth) technology has been intensively investigated for preventive medicine, particularly in chronic disease management. Notably, many types of research have explored the possibility of using mobile and wearable personal devices to detect the symptom of PD and shown promising results. It provides opportunities for transforming early PD detection from clinical to daily life. This survey paper attempts to conduct a comprehensive review of mHealth technologies for PD detection from 2000 to 2019, and compares their pros and cons in practical applications and provides insights to close the performance gap between state-of-the-art clinical approaches and mHealth technologies.
Collapse
|
46
|
Tsanas A, Little MA, Ramig LO. Remote Assessment of Parkinson's Disease Symptom Severity Using the Simulated Cellular Mobile Telephone Network. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2021; 9:11024-11036. [PMID: 33495722 PMCID: PMC7821632 DOI: 10.1109/access.2021.3050524] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Accepted: 12/25/2020] [Indexed: 06/12/2023]
Abstract
Telemonitoring of Parkinson's Disease (PD) has attracted considerable research interest because of its potential to make a lasting, positive impact on the life of patients and their carers. Purpose-built devices have been developed that record various signals which can be associated with average PD symptom severity, as quantified on standard clinical metrics such as the Unified Parkinson's Disease Rating Scale (UPDRS). Speech signals are particularly promising in this regard, because they can be easily recorded without the use of expensive, dedicated hardware. Previous studies have demonstrated replication of UPDRS to within less than 2 points of a clinical raters' assessment of symptom severity, using high-quality speech signals collected using dedicated telemonitoring hardware. Here, we investigate the potential of using the standard voice-over-GSM (2G) or UMTS (3G) cellular mobile telephone networks for PD telemonitoring, networks that, together, have greater than 5 billion subscribers worldwide. We test the robustness of this approach using a simulated noisy mobile communication network over which speech signals are transmitted, and approximately 6000 recordings from 42 PD subjects. We show that UPDRS can be estimated to within less than 3.5 points difference from the clinical raters' assessment, which is clinically useful given that the inter-rater variability for UPDRS can be as high as 4-5 UPDRS points. This provides compelling evidence that the existing voice telephone network has potential towards facilitating inexpensive, mass-scale PD symptom telemonitoring applications.
Collapse
Affiliation(s)
- Athanasios Tsanas
- Edinburgh Medical SchoolUsher Institute, The University of EdinburghEdinburghEH16 4UXU.K.
| | - Max A. Little
- School of Computer ScienceUniversity of BirminghamBirminghamB15 2TTU.K.
| | - Lorraine O. Ramig
- Department of Speech, Language, and Hearing ScienceUniversity of Colorado BoulderBoulderCO80309USA
- National Center for Voice and SpeechDenverCO80014USA
| |
Collapse
|
47
|
Machine Learning Methods with Decision Forests for Parkinson’s Detection. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11020581] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Biomedical engineers prefer decision forests over traditional decision trees to design state-of-the-art Parkinson’s Detection Systems (PDS) on massive acoustic signal data. However, the challenges that the researchers are facing with decision forests is identifying the minimum number of decision trees required to achieve maximum detection accuracy with the lowest error rate. This article examines two recent decision forest algorithms Systematically Developed Forest (SysFor), and Decision Forest by Penalizing Attributes (ForestPA) along with the popular Random Forest to design three distinct Parkinson’s detection schemes with optimum number of decision trees. The proposed approach undertakes minimum number of decision trees to achieve maximum detection accuracy. The training and testing samples and the density of trees in the forest are kept dynamic and incremental to achieve the decision forests with maximum capability for detecting Parkinson’s Disease (PD). The incremental tree densities with dynamic training and testing of decision forests proved to be a better approach for detection of PD. The proposed approaches are examined along with other state-of-the-art classifiers including the modern deep learning techniques to observe the detection capability. The article also provides a guideline to generate ideal training and testing split of two modern acoustic datasets of Parkinson’s and control subjects donated by the Department of Neurology in Cerrahpaşa, Istanbul and Departamento de Matemáticas, Universidad de Extremadura, Cáceres, Spain. Among the three proposed detection schemes the Forest by Penalizing Attributes (ForestPA) proved to be a promising Parkinson’s disease detector with a little number of decision trees in the forest to score the highest detection accuracy of 94.12% to 95.00%.
Collapse
|
48
|
Lin H, Karjadi C, Ang TFA, Prajakta J, McManus C, Alhanai TW, Glass J, Au R. Identification of digital voice biomarkers for cognitive health. EXPLORATION OF MEDICINE 2020; 1:406-417. [PMID: 33665648 PMCID: PMC7929495 DOI: 10.37349/emed.2020.00028] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/04/2020] [Indexed: 01/03/2023] Open
Abstract
AIM Human voice contains rich information. Few longitudinal studies have been conducted to investigate the potential of voice to monitor cognitive health. The objective of this study is to identify voice biomarkers that are predictive of future dementia. METHODS Participants were recruited from the Framingham Heart Study. The vocal responses to neuropsychological tests were recorded, which were then diarized to identify participant voice segments. Acoustic features were extracted with the OpenSMILE toolkit (v2.1). The association of each acoustic feature with incident dementia was assessed by Cox proportional hazards models. RESULTS Our study included 6, 528 voice recordings from 4, 849 participants (mean age 63 ± 15 years old, 54.6% women). The majority of participants (71.2%) had one voice recording, 23.9% had two voice recordings, and the remaining participants (4.9%) had three or more voice recordings. Although all asymptomatic at the time of examination, participants who developed dementia tended to have shorter segments than those who were dementia free (P < 0.001). Additionally, 14 acoustic features were significantly associated with dementia after adjusting for multiple testing (P < 0.05/48 = 1 × 10-3). The most significant acoustic feature was jitterDDP_sma_de (P = 7.9 × 10-7), which represents the differential frame-to-frame Jitter. A voice based linear classifier was also built that was capable of predicting incident dementia with area under curve of 0.812. CONCLUSIONS Multiple acoustic and linguistic features are identified that are associated with incident dementia among asymptomatic participants, which could be used to build better prediction models for passive cognitive health monitoring.
Collapse
Affiliation(s)
- Honghuang Lin
- Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
| | - Cody Karjadi
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Ting F. A. Ang
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA 02118, USA
- Slone Epidemiology Center, Boston University School of Medicine, Boston, MA 02118, USA
| | - Joshi Prajakta
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Chelsea McManus
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Tuka W. Alhanai
- Department of Electrical and Computer Engineering, New York University Abu Dhabi, Abu Dhabi, UAE
| | - James Glass
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Rhoda Au
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA 02118, USA
- Slone Epidemiology Center, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Neurology, Boston University School of Medicine, Boston, MA 02118, USA
| |
Collapse
|
49
|
Darling-White M, Huber JE. The Impact of Parkinson's Disease on Breath Pauses and Their Relationship to Speech Impairment: A Longitudinal Study. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2020; 29:1910-1922. [PMID: 32693630 PMCID: PMC8740572 DOI: 10.1044/2020_ajslp-20-00003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Revised: 04/26/2020] [Accepted: 05/11/2020] [Indexed: 05/12/2023]
Abstract
Purpose The purposes of this longitudinal study were to (a) examine the impact of Parkinson's disease (PD) progression on breath pause patterns and speech and linguistic errors and (b) determine the extent to which breath pauses and speech and linguistic errors contribute to speech impairment. Method Eight individuals with PD and eight age- and sex-matched control participants produced a reading passage on two occasions (Time 1 and Time 2) 3 years and 7 months apart on average. Two speech-language pathologists rated the severity of speech impairment for all participants at each time. Dependent variables included the location of each breath pause relative to syntax and punctuation as well as the number of disfluencies and mazes. Results At Time 1, there were no significant differences between the groups regarding breath pause patterns. At Time 2, individuals with PD produced significantly fewer breath pauses at major syntactic boundaries and periods as well as significantly more breath pauses at locations with no punctuation than control participants. Individuals with PD produced a significantly greater number of disfluencies than control participants at both time points. There were no significant differences between the groups in the number of mazes produced at either time point. Together, the number of mazes and the percentage of breath pauses at locations with no punctuation explained 50% of the variance associated with the ratings of severity of speech impairment. Conclusion These results highlight the importance of targeting both respiratory physiological and cognitive-linguistic systems in order to improve speech production in individuals with PD.
Collapse
Affiliation(s)
- Meghan Darling-White
- Department of Speech, Language, and Hearing Sciences, The University of Arizona, Tucson
| | - Jessica E. Huber
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN
| |
Collapse
|
50
|
Siena FL, Vernon M, Watts P, Byrom B, Crundall D, Breedon P. Proof-of-Concept Study: a Mobile Application to Derive Clinical Outcome Measures from Expression and Speech for Mental Health Status Evaluation. J Med Syst 2020; 44:209. [PMID: 33175234 PMCID: PMC7658062 DOI: 10.1007/s10916-020-01671-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Accepted: 09/17/2020] [Indexed: 11/30/2022]
Abstract
This proof-of-concept study aimed to assess the ability of a mobile application and cloud analytics software solution to extract facial expression information from participant selfie videos. This is one component of a solution aimed at extracting possible health outcome measures based on expression, voice acoustics and speech sentiment from video diary data provided by patients. Forty healthy volunteers viewed 21 validated images from the International Affective Picture System database through a mobile app which simultaneously captured video footage of their face using the selfie camera. Images were intended to be associated with the following emotional responses: anger, disgust, sadness, contempt, fear, surprise and happiness. Both valence and arousal scores estimated from the video footage associated with each image were adequate predictors of the IAPS image scores (p < 0.001 and p = 0.04 respectively). 12.2% of images were categorised as containing a positive expression response in line with the target expression; with happiness and sadness responses providing the greatest frequency of responders: 41.0% and 21.4% respectively. 71.2% of images were associated with no change in expression. This proof-of-concept study provides early encouraging findings that changes in facial expression can be detected when they exist. Combined with voice acoustical measures and speech sentiment analysis, this may lead to novel measures of health status in patients using a video diary in indications including depression, schizophrenia, autism spectrum disorder and PTSD amongst other conditions.
Collapse
Affiliation(s)
| | | | - Paul Watts
- Nottingham Trent University, Nottingham, UK
| | | | | | | |
Collapse
|