1
|
Kaser AN, Lacritz LH, Winiarski HR, Gabirondo P, Schaffert J, Coca AJ, Jiménez-Raboso J, Rojo T, Zaldua C, Honorato I, Gallego D, Nieves ER, Rosenstein LD, Cullum CM. A novel speech analysis algorithm to detect cognitive impairment in a Spanish population. Front Neurol 2024; 15:1342907. [PMID: 38638311 PMCID: PMC11024431 DOI: 10.3389/fneur.2024.1342907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/26/2024] [Indexed: 04/20/2024] Open
Abstract
Objective Early detection of cognitive impairment in the elderly is crucial for diagnosis and appropriate care. Brief, cost-effective cognitive screening instruments are needed to help identify individuals who require further evaluation. This study presents preliminary data on a new screening technology using automated voice recording analysis software in a Spanish population. Method Data were collected from 174 Spanish-speaking individuals clinically diagnosed as cognitively normal (CN, n = 87) or impaired (mild cognitive impairment [MCI], n = 63; all-cause dementia, n = 24). Participants were recorded performing four common language tasks (Animal fluency, alternating fluency [sports and fruits], phonemic "F" fluency, and Cookie Theft Description). Recordings were processed via text-transcription and digital-signal processing techniques to capture neuropsychological variables and audio characteristics. A training sample of 122 subjects with similar demographics across groups was used to develop an algorithm to detect cognitive impairment. Speech and task features were used to develop five independent machine learning (ML) models to compute scores between 0 and 1, and a final algorithm was constructed using repeated cross-validation. A socio-demographically balanced subset of 52 participants was used to test the algorithm. Analysis of covariance (ANCOVA), covarying for demographic characteristics, was used to predict logistically-transformed algorithm scores. Results Mean logit algorithm scores were significantly different across groups in the testing sample (p < 0.01). Comparisons of CN with impaired (MCI + dementia) and MCI groups using the final algorithm resulted in an AUC of 0.93/0.90, with overall accuracy of 88.4%/87.5%, sensitivity of 87.5/83.3, and specificity of 89.2/89.2, respectively. Conclusion Findings provide initial support for the utility of this automated speech analysis algorithm as a screening tool for cognitive impairment in Spanish speakers. Additional study is needed to validate this technology in larger and more diverse clinical populations.
Collapse
Affiliation(s)
- Alyssa N. Kaser
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Laura H. Lacritz
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Holly R. Winiarski
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | | | - Jeff Schaffert
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Alberto J. Coca
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
- Cambridge Mathematics of Information in Healthcare Hub, University of Cambridge, Cambridge, United Kingdom
| | | | - Tomas Rojo
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
| | - Carla Zaldua
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
| | | | | | - Emmanuel Rosario Nieves
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Parkland Health and Hospital System Behavioral Health Clinic, Dallas, TX, United States
| | - Leslie D. Rosenstein
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Parkland Health and Hospital System Behavioral Health Clinic, Dallas, TX, United States
| | - C. Munro Cullum
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurological Surgery, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| |
Collapse
|
2
|
Martínez-Nicolás I, Martínez-Sánchez F, Ivanova O, Meilán JJG. Reading and lexical-semantic retrieval tasks outperforms single task speech analysis in the screening of mild cognitive impairment and Alzheimer's disease. Sci Rep 2023; 13:9728. [PMID: 37322073 PMCID: PMC10272227 DOI: 10.1038/s41598-023-36804-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/12/2023] [Indexed: 06/17/2023] Open
Abstract
Age-related cognitive impairment have increased dramatically in recent years, which has risen the interes in developing screening tools for mild cognitive impairment and Alzheimer's disease. Speech analysis allows to exploit the behavioral consequences of cognitive deficits on the patient's vocal performance so that it is possible to identify pathologies affecting speech production such as dementia. Previous studies have further shown that the speech task used determines how the speech parameters are altered. We aim to combine the impairments in several speech production tasks in order to improve the accuracy of screening through speech analysis. The sample consists of 72 participants divided into three equal groups of healthy older adults, people with mild cognitive impairment, or Alzheimer's disease, matched by age and education. A complete neuropsychological assessment and two voice recordings were performed. The tasks required the participants to read a text, and complete a sentence with semantic information. A stepwise linear discriminant analysis was performed to select speech parameters with discriminative power. The discriminative functions obtained an accuracy of 83.3% in simultaneous classifications of several levels of cognitive impairment. It would therefore be a promising screening tool for dementia.
Collapse
Affiliation(s)
| | | | - Olga Ivanova
- Faculty of Philology, University of Salamanca, 37008, Salamanca, Spain
| | - Juan J G Meilán
- Faculty of Psychology, University of Salamanca, 37008, Salamanca, Spain
- Institute of Neuroscience of Castilla y León, 37007, Salamanca, Spain
| |
Collapse
|
3
|
Kálmán J, Devanand DP, Gosztolya G, Balogh R, Imre N, Tóth L, Hoffmann I, Kovács I, Vincze V, Pákáski M. Temporal speech parameters detect mild cognitive impairment in different languages: validation and comparison of the Speech-GAP Test® in English and Hungarian. Curr Alzheimer Res 2022; 19:373-386. [PMID: 35440309 DOI: 10.2174/1567205019666220418155130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 02/08/2022] [Accepted: 02/17/2022] [Indexed: 11/22/2022]
Abstract
BACKGROUND The development of automatic speech recognition (ASR) technology allows the analysis of temporal (time-based) speech parameters characteristic of mild cognitive impairment (MCI). However, no information has been available on whether the analysis of spontaneous speech can be used with the same efficiency in different language environments. OBJECTIVE The main goal of this international pilot study is to address the question whether the Speech-Gap Test® (S-GAP Test®), previously tested in the Hungarian language, is appropriate for and applicable to the recognition of MCI in other languages such as English. METHOD After an initial screening of 88 individuals, English-speaking (n = 33) and Hungarian-speaking (n = 33) participants were classified as having MCI or as healthy controls (HC) based on Petersen's criteria. Speech of each participant was recorded via a spontaneous speech task. 15 temporal parameters were determined and calculated by means of ASR. RESULTS Seven temporal parameters in the English-speaking sample and 5 in the Hungarian-speaking sample showed significant differences between the MCI and the HC group. Receiver operating characteristics (ROC) analysis clearly distinguished the English-speaking MCI cases from the HC group based on speech tempo and articulation tempo with 100% sensitivity, and on three more temporal parameters with high sensitivity (85.7%). In the Hungarian-speaking sample, the ROC analysis showed similar sensitivity rates (92.3%). CONCLUSION The results of this study in different native-speaking populations suggest that changes in acoustic parameters detected by the S-GAP Test® might be present across different languages.
Collapse
Affiliation(s)
- János Kálmán
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Davangere P Devanand
- Columbia University Medical Center, New York, NY.,New York State Psychiatric Institute, New York, NY
| | - Gábor Gosztolya
- MTA-SZTE Research Group on Artificial Intelligence, Faculty of Science and Informatics, University of Szeged, Szeged
| | - Réka Balogh
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Nóra Imre
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - László Tóth
- Faculty of Science and Informatics, University of Szeged, Szeged
| | - Ildikó Hoffmann
- Faculty of Humanities and Social Sciences, University of Szeged, Szeged.,Hungarian Research Centre for Linguistics, Eötvös Loránd Research Network, Budapest
| | - Ildikó Kovács
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Veronika Vincze
- MTA-SZTE Research Group on Artificial Intelligence, Faculty of Science and Informatics, University of Szeged, Szeged
| | - Magdolna Pákáski
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| |
Collapse
|
4
|
Speech- and Language-Based Classification of Alzheimer's Disease: A Systematic Review. BIOENGINEERING (BASEL, SWITZERLAND) 2022; 9:bioengineering9010027. [PMID: 35049736 PMCID: PMC8772820 DOI: 10.3390/bioengineering9010027] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 11/25/2021] [Accepted: 11/30/2021] [Indexed: 12/04/2022]
Abstract
Background: Alzheimer’s disease (AD) has paramount importance due to its rising prevalence, the impact on the patient and society, and the related healthcare costs. However, current diagnostic techniques are not designed for frequent mass screening, delaying therapeutic intervention and worsening prognoses. To be able to detect AD at an early stage, ideally at a pre-clinical stage, speech analysis emerges as a simple low-cost non-invasive procedure. Objectives: In this work it is our objective to do a systematic review about speech-based detection and classification of Alzheimer’s Disease with the purpose of identifying the most effective algorithms and best practices. Methods: A systematic literature search was performed from Jan 2015 up to May 2020 using ScienceDirect, PubMed and DBLP. Articles were screened by title, abstract and full text as needed. A manual complementary search among the references of the included papers was also performed. Inclusion criteria and search strategies were defined a priori. Results: We were able: to identify the main resources that can support the development of decision support systems for AD, to list speech features that are correlated with the linguistic and acoustic footprint of the disease, to recognize the data models that can provide robust results and to observe the performance indicators that were reported. Discussion: A computational system with the adequate elements combination, based on the identified best-practices, can point to a whole new diagnostic approach, leading to better insights about AD symptoms and its disease patterns, creating conditions to promote a longer life span as well as an improvement in patient quality of life. The clinically relevant results that were identified can be used to establish a reference system and help to define research guidelines for future developments.
Collapse
|
5
|
Martínez-Nicolás I, Llorente TE, Martínez-Sánchez F, Meilán JJG. Ten Years of Research on Automatic Voice and Speech Analysis of People With Alzheimer's Disease and Mild Cognitive Impairment: A Systematic Review Article. Front Psychol 2021; 12:620251. [PMID: 33833713 PMCID: PMC8021952 DOI: 10.3389/fpsyg.2021.620251] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 02/15/2021] [Indexed: 11/25/2022] Open
Abstract
Background: The field of voice and speech analysis has become increasingly popular over the last 10 years, and articles on its use in detecting neurodegenerative diseases have proliferated. Many studies have identified characteristic speech features that can be used to draw an accurate distinction between healthy aging among older people and those with mild cognitive impairment and Alzheimer's disease. Speech analysis has been singled out as a cost-effective and reliable method for detecting the presence of both conditions. In this research, a systematic review was conducted to determine these features and their diagnostic accuracy. Methods: Peer-reviewed literature was located across multiple databases, involving studies that apply new procedures of automatic speech analysis to collect behavioral evidence of linguistic impairments along with their diagnostic accuracy on Alzheimer's disease and mild cognitive impairment. The risk of bias was assessed by using JBI and QUADAS-2 checklists. Results: Thirty-five papers met the inclusion criteria; of these, 11 were descriptive studies that either identified voice features or explored their cognitive correlates, and the rest were diagnostic studies. Overall, the studies were of good quality and presented solid evidence of the usefulness of this technique. The distinctive acoustic and rhythmic features found are gathered. Most studies record a diagnostic accuracy over 88% for Alzheimer's and 80% for mild cognitive impairment. Conclusion: Automatic speech analysis is a promising tool for diagnosing mild cognitive impairment and Alzheimer's disease. The reported features seem to be indicators of the cognitive changes in older people. The specific features and the cognitive changes involved could be the subject of further research.
Collapse
Affiliation(s)
- Israel Martínez-Nicolás
- Faculty of Psychology, University of Salamanca, Salamanca, Spain.,Institute of Neuroscience of Castilla y León, University of Salamanca, Salamanca, Spain
| | - Thide E Llorente
- Faculty of Psychology, University of Salamanca, Salamanca, Spain.,Institute of Neuroscience of Castilla y León, University of Salamanca, Salamanca, Spain
| | | | - Juan José G Meilán
- Faculty of Psychology, University of Salamanca, Salamanca, Spain.,Institute of Neuroscience of Castilla y León, University of Salamanca, Salamanca, Spain
| |
Collapse
|
6
|
Roshanzamir A, Aghajan H, Soleymani Baghshah M. Transformer-based deep neural network language models for Alzheimer's disease risk assessment from targeted speech. BMC Med Inform Decis Mak 2021; 21:92. [PMID: 33750385 PMCID: PMC7971114 DOI: 10.1186/s12911-021-01456-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 02/23/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We developed transformer-based deep learning models based on natural language processing for early risk assessment of Alzheimer's disease from the picture description test. METHODS The lack of large datasets poses the most important limitation for using complex models that do not require feature engineering. Transformer-based pre-trained deep language models have recently made a large leap in NLP research and application. These models are pre-trained on available large datasets to understand natural language texts appropriately, and are shown to subsequently perform well on classification tasks with small training sets. The overall classification model is a simple classifier on top of the pre-trained deep language model. RESULTS The models are evaluated on picture description test transcripts of the Pitt corpus, which contains data of 170 AD patients with 257 interviews and 99 healthy controls with 243 interviews. The large bidirectional encoder representations from transformers (BERTLarge) embedding with logistic regression classifier achieves classification accuracy of 88.08%, which improves the state-of-the-art by 2.48%. CONCLUSIONS Using pre-trained language models can improve AD prediction. This not only solves the problem of lack of sufficiently large datasets, but also reduces the need for expert-defined features.
Collapse
Affiliation(s)
- Alireza Roshanzamir
- Department of Computer Engineering, Sharif University of Technology, Azadi Avenue, Tehran, Iran
| | - Hamid Aghajan
- Department of Electrical Engineering, Sharif University of Technology, Azadi Avenue, Tehran, Iran
| | | |
Collapse
|
7
|
de la Fuente Garcia S, Ritchie CW, Luz S. Artificial Intelligence, Speech, and Language Processing Approaches to Monitoring Alzheimer's Disease: A Systematic Review. J Alzheimers Dis 2020; 78:1547-1574. [PMID: 33185605 PMCID: PMC7836050 DOI: 10.3233/jad-200888] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
BACKGROUND Language is a valuable source of clinical information in Alzheimer's disease, as it declines concurrently with neurodegeneration. Consequently, speech and language data have been extensively studied in connection with its diagnosis. OBJECTIVE Firstly, to summarize the existing findings on the use of artificial intelligence, speech, and language processing to predict cognitive decline in the context of Alzheimer's disease. Secondly, to detail current research procedures, highlight their limitations, and suggest strategies to address them. METHODS Systematic review of original research between 2000 and 2019, registered in PROSPERO (reference CRD42018116606). An interdisciplinary search covered six databases on engineering (ACM and IEEE), psychology (PsycINFO), medicine (PubMed and Embase), and Web of Science. Bibliographies of relevant papers were screened until December 2019. RESULTS From 3,654 search results, 51 articles were selected against the eligibility criteria. Four tables summarize their findings: study details (aim, population, interventions, comparisons, methods, and outcomes), data details (size, type, modalities, annotation, balance, availability, and language of study), methodology (pre-processing, feature generation, machine learning, evaluation, and results), and clinical applicability (research implications, clinical potential, risk of bias, and strengths/limitations). CONCLUSION Promising results are reported across nearly all 51 studies, but very few have been implemented in clinical research or practice. The main limitations of the field are poor standardization, limited comparability of results, and a degree of disconnect between study aims and clinical applications. Active attempts to close these gaps will support translation of future research into clinical practice.
Collapse
Affiliation(s)
| | - Craig W. Ritchie
- Centre for Clinical Brain Sciences, The University of Edinburgh, Scotland, UK
| | - Saturnino Luz
- Usher Institute, Edinburgh Medical School, The University of Edinburgh, Scotland, UK
| |
Collapse
|
8
|
Hernández-Domínguez L, Ratté S, Sierra-Martínez G, Roche-Bergua A. Computer-based evaluation of Alzheimer's disease and mild cognitive impairment patients during a picture description task. ALZHEIMER'S & DEMENTIA: DIAGNOSIS, ASSESSMENT & DISEASE MONITORING 2018; 10:260-268. [PMID: 29780871 PMCID: PMC5956933 DOI: 10.1016/j.dadm.2018.02.004] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Introduction We present a methodology to automatically evaluate the performance of patients during picture description tasks. Methods Transcriptions and audio recordings of the Cookie Theft picture description task were used. With 25 healthy elderly control (HC) samples and an information coverage measure, we automatically generated a population-specific referent. We then assessed 517 transcriptions (257 Alzheimer's disease [AD], 217 HC, and 43 mild cognitively impaired samples) according to their informativeness and pertinence against this referent. We extracted linguistic and phonetic metrics which previous literature correlated to early-stage AD. We trained two learners to distinguish HCs from cognitively impaired individuals. Results Our measures significantly (P < .001) correlated with the severity of the cognitive impairment and the Mini–Mental State Examination score. The classification sensitivity was 81% (area under the curve of receiver operating characteristics = 0.79) and 85% (area under the curve of receiver operating characteristics = 0.76) between HCs and AD and between HCs and AD and mild cognitively impaired, respectively. Discussion An automated assessment of a picture description task could assist clinicians in the detection of early signs of cognitive impairment and AD.
Collapse
Affiliation(s)
- Laura Hernández-Domínguez
- École de technologie supérieure, Université du Québec, Montreal, Quebec, Canada
- Corresponding author. Tel.: +1-514-431-1557.
| | - Sylvie Ratté
- École de technologie supérieure, Université du Québec, Montreal, Quebec, Canada
| | | | - Andrés Roche-Bergua
- Psychogeriatric Unit, Hospital Psiquiátrico Fray Bernardino Álvarez, Mexico City, Mexico
| |
Collapse
|