1
|
Takeshige-Amano H, Oyama G, Ogawa M, Fusegi K, Kambe T, Shiina K, Ueno SI, Okuzumi A, Hatano T, Motoi Y, Kawakami I, Ando M, Nakayama S, Ishida Y, Maei S, Lu X, Kobayashi T, Wooden R, Ota S, Morito K, Ito Y, Nakajima Y, Yoritaka A, Kato T, Hattori N. Digital detection of Alzheimer's disease using smiles and conversations with a chatbot. Sci Rep 2024; 14:26309. [PMID: 39487204 PMCID: PMC11530557 DOI: 10.1038/s41598-024-77220-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Accepted: 10/21/2024] [Indexed: 11/04/2024] Open
Abstract
In super-aged societies, dementia has become a critical issue, underscoring the urgent need for tools to assess cognitive status effectively in various sectors, including financial and business settings. Facial and speech features have been tried as cost-effective biomarkers of dementia including Alzheimer's disease (AD). We aimed to establish an easy, automatic, and extensive screening tool for AD using a chatbot and artificial intelligence. Smile images and visual and auditory data of natural conversations with a chatbot from 99 healthy controls (HCs) and 93 individuals with AD or mild cognitive impairment due to AD (PwA) were analyzed using machine learning. A subset of 8 facial and 21 sound features successfully distinguished PwA from HCs, with a high area under the receiver operating characteristic curve of 0.94 ± 0.05. Another subset of 8 facial and 20 sound features predicted the cognitive test scores, with a mean absolute error as low as 5.78 ± 0.08. These results were superior to those obtained from face or auditory data alone or from conventional image depiction tasks. Thus, by combining spontaneous sound and facial data obtained through conversations with a chatbot, the proposed model can be put to practical use in real-life scenarios.
Collapse
Affiliation(s)
- Haruka Takeshige-Amano
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Genko Oyama
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan.
| | - Mayuko Ogawa
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Keiko Fusegi
- Department of Neurology, Faculty of Medicine, Juntendo University Koshigaya Hospital, Saitama, Japan
| | - Taiki Kambe
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Kenta Shiina
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Shin-Ichi Ueno
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Ayami Okuzumi
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Taku Hatano
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Yumiko Motoi
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| | - Ito Kawakami
- Department of Psychiatry, Faculty of Medicine, Juntendo University, Tokyo, Japan
| | - Maya Ando
- Department of Neurology, Faculty of Medicine, Juntendo University Koshigaya Hospital, Saitama, Japan
| | - Sachiko Nakayama
- Department of Neurology, Faculty of Medicine, Juntendo University Koshigaya Hospital, Saitama, Japan
| | | | - Shun Maei
- IBM Consulting, IBM Japan, Ltd., Tokyo, Japan
| | - Xiangxun Lu
- IBM Consulting, IBM Japan, Ltd., Tokyo, Japan
| | | | - Rina Wooden
- IBM Consulting, IBM Japan, Ltd., Tokyo, Japan
| | - Susumu Ota
- IBM Consulting, IBM Japan, Ltd., Tokyo, Japan
| | | | | | | | - Asako Yoritaka
- Department of Neurology, Faculty of Medicine, Juntendo University Koshigaya Hospital, Saitama, Japan
| | - Tadafumi Kato
- Department of Psychiatry, Faculty of Medicine, Juntendo University, Tokyo, Japan
| | - Nobutaka Hattori
- Department of Neurology, Faculty of Medicine, Juntendo University, 2-1-1 Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan.
- Neurodegenerative Disorders Collaborative Laboratory, RIKEN Center for Brain Science, Saitama, Japan.
| |
Collapse
|
2
|
Chou CJ, Chang CT, Chang YN, Lee CY, Chuang YF, Chiu YL, Liang WL, Fan YM, Liu YC. Screening for early Alzheimer's disease: enhancing diagnosis with linguistic features and biomarkers. Front Aging Neurosci 2024; 16:1451326. [PMID: 39376506 PMCID: PMC11456453 DOI: 10.3389/fnagi.2024.1451326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Accepted: 09/11/2024] [Indexed: 10/09/2024] Open
Abstract
Introduction Research has shown that speech analysis demonstrates sensitivity in detecting early Alzheimer's disease (AD), but the relation between linguistic features and cognitive tests or biomarkers remains unclear. This study aimed to investigate how linguistic features help identify cognitive impairments in patients in the early stages of AD. Method This study analyzed connected speech from 80 participants and categorized the participants into early-AD and normal control (NC) groups. The participants underwent amyloid-β positron emission tomography scans, brain magnetic resonance imaging, and comprehensive neuropsychological testing. Participants' speech data from a picture description task were examined. A total of 15 linguistic features were analyzed to classify groups and predict cognitive performance. Results We found notable linguistic differences between the early-AD and NC groups in lexical diversity, syntactic complexity, and language disfluency. Using machine learning classifiers (SVM, KNN, and RF), we achieved up to 88% accuracy in distinguishing early-AD patients from normal controls, with mean length of utterance (MLU) and long pauses ratio (LPR) serving as core linguistic indicators. Moreover, the integration of linguistic indicators with biomarkers significantly improved predictive accuracy for AD. Regression analysis also highlighted crucial linguistic features, such as MLU, LPR, Type-to-Token ratio (TTR), and passive construction ratio (PCR), which were sensitive to changes in cognitive function. Conclusion Findings support the efficacy of linguistic analysis as a screening tool for the early detection of AD and the assessment of subtle cognitive decline. Integrating linguistic features with biomarkers significantly improved diagnostic accuracy.
Collapse
Affiliation(s)
- Chia-Ju Chou
- Department of Neurology, Cardinal Tien Hospital, Taipei, Taiwan
| | - Chih-Ting Chang
- Department of Speech-Language Pathology and Audiology, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan
| | - Ya-Ning Chang
- Miin Wu School of Computing, National Cheng Kung University, Tainan, Taiwan
| | | | - Yi-Fang Chuang
- Institute of Public Health, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- International Health Program, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- Health Innovation Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yen-Ling Chiu
- Department of Medical Research, Far Eastern Memorial Hospital, Taipei, Taiwan
- Graduate Program in Biomedical Informatics and Graduate Institute of Medicine, Yuan Ze University, Taoyuan, Taiwan
- Graduate Institute of Clinical Medicine, National Taiwan University, Taipei, Taiwan
| | - Wan-Lin Liang
- Department of Neurology, Cardinal Tien Hospital, Taipei, Taiwan
| | - Yu-Ming Fan
- School of Medicine, Fu Jen Catholic University, Taipei, Taiwan
- Department of Nuclear Medicine, Cardinal Tien Hospital, Taipei, Taiwan
| | - Yi-Chien Liu
- Department of Neurology, Cardinal Tien Hospital, Taipei, Taiwan
- School of Medicine, Fu Jen Catholic University, Taipei, Taiwan
| |
Collapse
|
3
|
Ding H, Lister A, Karjadi C, Au R, Lin H, Bischoff B, Hwang PH. Detection of Mild Cognitive Impairment From Non-Semantic, Acoustic Voice Features: The Framingham Heart Study. JMIR Aging 2024; 7:e55126. [PMID: 39173144 PMCID: PMC11377909 DOI: 10.2196/55126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 05/06/2024] [Accepted: 07/15/2024] [Indexed: 08/24/2024] Open
Abstract
BACKGROUND With the aging global population and the rising burden of Alzheimer disease and related dementias (ADRDs), there is a growing focus on identifying mild cognitive impairment (MCI) to enable timely interventions that could potentially slow down the onset of clinical dementia. The production of speech by an individual is a cognitively complex task that engages various cognitive domains. The ease of audio data collection highlights the potential cost-effectiveness and noninvasive nature of using human speech as a tool for cognitive assessment. OBJECTIVE This study aimed to construct a machine learning pipeline that incorporates speaker diarization, feature extraction, feature selection, and classification to identify a set of acoustic features derived from voice recordings that exhibit strong MCI detection capability. METHODS The study included 100 MCI cases and 100 cognitively normal controls matched for age, sex, and education from the Framingham Heart Study. Participants' spoken responses on neuropsychological tests were recorded, and the recorded audio was processed to identify segments of each participant's voice from recordings that included voices of both testers and participants. A comprehensive set of 6385 acoustic features was then extracted from these voice segments using OpenSMILE and Praat software. Subsequently, a random forest model was constructed to classify cognitive status using the features that exhibited significant differences between the MCI and cognitively normal groups. The MCI detection performance of various audio lengths was further examined. RESULTS An optimal subset of 29 features was identified that resulted in an area under the receiver operating characteristic curve of 0.87, with a 95% CI of 0.81-0.94. The most important acoustic feature for MCI classification was the number of filled pauses (importance score=0.09, P=3.10E-08). There was no substantial difference in the performance of the model trained on the acoustic features derived from different lengths of voice recordings. CONCLUSIONS This study showcases the potential of monitoring changes to nonsemantic and acoustic features of speech as a way of early ADRD detection and motivates future opportunities for using human speech as a measure of brain health.
Collapse
Affiliation(s)
- Huitong Ding
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
- The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
| | - Adrian Lister
- Headwaters Innovation, Inc., Inver Grove Heights, MN, United States
| | - Cody Karjadi
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
- The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
| | - Rhoda Au
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
- The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, United States
- Slone Epidemiology Center and Departments of Neurology and Medicine, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
| | - Honghuang Lin
- Department of Medicine, University of Massachusetts Chan Medical School, Worcester, MA, United States
| | - Brian Bischoff
- Headwaters Innovation, Inc., Inver Grove Heights, MN, United States
| | - Phillip H Hwang
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
- The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, United States
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, United States
| |
Collapse
|
4
|
He R, Al-Tamimi J, Sánchez-Benavides G, Montaña-Valverde G, Domingo Gispert J, Grau-Rivera O, Suárez-Calvet M, Minguillon C, Fauria K, Navarro A, Hinzen W. Atypical cortical hierarchy in Aβ-positive older adults and its reflection in spontaneous speech. Brain Res 2024; 1830:148806. [PMID: 38365129 DOI: 10.1016/j.brainres.2024.148806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 02/07/2024] [Indexed: 02/18/2024]
Abstract
Abnormal deposition of Aβ amyloid is an early neuropathological marker of Alzheimer's disease (AD), arising long ahead of clinical symptoms. Non-invasive measures of associated early neurofunctional changes, together with easily accessible behavioral readouts of these changes, could be of great clinical benefit. We pursued this aim by investigating large-scale cortical gradients of functional connectivity with functional MRI, which capture the hierarchical integration of cortical functions, together with acoustic-prosodic features from spontaneous speech, in cognitively unimpaired older adults with and without Aβ positivity (total N = 188). We predicted distortions of the cortical hierarchy associated with prosodic changes in the Aβ + group. Results confirmed substantially altered cortical hierarchies and less variability in these in the Aβ + group, together with an increase in quantitative prosodic measures, which correlated with gradient variability as well as digit span test scores. Overall, these findings confirm that long before the clinical stage and objective cognitive impairment, increased risk of cognitive decline as indexed by Aβ accumulation is marked by neurofunctional changes in the cortical hierarchy, which are related to automatically extractable speech patterns and alterations in working memory functions.
Collapse
Affiliation(s)
- Rui He
- Department of Translation & Language Sciences, Universitat Pompeu Fabra, 08018 Barcelona, Spain.
| | - Jalal Al-Tamimi
- Université Paris Cité, Laboratoire de Linguistique Formelle (LLF), CNRS, 75013 Paris, France
| | - Gonzalo Sánchez-Benavides
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Neurosciences Department, IMIM (Hospital del Mar Medical Research Institute), 08003 Barcelona, Spain; Centro de Investigación Biomédica en Red de Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 28029 Madrid, Spain
| | | | - Juan Domingo Gispert
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Neurosciences Department, IMIM (Hospital del Mar Medical Research Institute), 08003 Barcelona, Spain; Department of Medicine and Life Sciences, Universitat Pompeu Fabra, 08003 Barcelona, Spain; Centro de Investigación Biomédica en Red de Bioingeniería, Biomateriales y Nanomedicina (CIBERBBN), Instituto de Salud Carlos III, Madrid, Spain
| | - Oriol Grau-Rivera
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Neurosciences Department, IMIM (Hospital del Mar Medical Research Institute), 08003 Barcelona, Spain; Centro de Investigación Biomédica en Red de Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 28029 Madrid, Spain; Servei de Neurologia, Hospital del Mar, 08003 Barcelona, Spain
| | - Marc Suárez-Calvet
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Neurosciences Department, IMIM (Hospital del Mar Medical Research Institute), 08003 Barcelona, Spain; Centro de Investigación Biomédica en Red de Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 28029 Madrid, Spain; Servei de Neurologia, Hospital del Mar, 08003 Barcelona, Spain
| | - Carolina Minguillon
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Neurosciences Department, IMIM (Hospital del Mar Medical Research Institute), 08003 Barcelona, Spain; Centro de Investigación Biomédica en Red de Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Karine Fauria
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Centro de Investigación Biomédica en Red de Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Arcadi Navarro
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, 08005 Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain; Department of Medicine and Life Sciences, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, 08003 Barcelona, Spain; CRG, Centre for Genomic Regulation, Barcelona Institute of Science and Technology (BIST), 08003 Barcelona, Spain
| | - Wolfram Hinzen
- Department of Translation & Language Sciences, Universitat Pompeu Fabra, 08018 Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain
| |
Collapse
|
5
|
Ambrosini E, Giangregorio C, Lomurno E, Moccia S, Milis M, Loizou C, Azzolino D, Cesari M, Cid Gala M, Galán de Isla C, Gomez-Raja J, Borghese NA, Matteucci M, Ferrante S. Automatic Spontaneous Speech Analysis for the Detection of Cognitive Functional Decline in Older Adults: Multilanguage Cross-Sectional Study. JMIR Aging 2024; 7:e50537. [PMID: 38386279 DOI: 10.2196/50537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 12/18/2023] [Accepted: 02/12/2024] [Indexed: 02/23/2024] Open
Abstract
BACKGROUND The rise in life expectancy is associated with an increase in long-term and gradual cognitive decline. Treatment effectiveness is enhanced at the early stage of the disease. Therefore, there is a need to find low-cost and ecological solutions for mass screening of community-dwelling older adults. OBJECTIVE This work aims to exploit automatic analysis of free speech to identify signs of cognitive function decline. METHODS A sample of 266 participants older than 65 years were recruited in Italy and Spain and were divided into 3 groups according to their Mini-Mental Status Examination (MMSE) scores. People were asked to tell a story and describe a picture, and voice recordings were used to extract high-level features on different time scales automatically. Based on these features, machine learning algorithms were trained to solve binary and multiclass classification problems by using both mono- and cross-lingual approaches. The algorithms were enriched using Shapley Additive Explanations for model explainability. RESULTS In the Italian data set, healthy participants (MMSE score≥27) were automatically discriminated from participants with mildly impaired cognitive function (20≤MMSE score≤26) and from those with moderate to severe impairment of cognitive function (11≤MMSE score≤19) with accuracy of 80% and 86%, respectively. Slightly lower performance was achieved in the Spanish and multilanguage data sets. CONCLUSIONS This work proposes a transparent and unobtrusive assessment method, which might be included in a mobile app for large-scale monitoring of cognitive functionality in older adults. Voice is confirmed to be an important biomarker of cognitive decline due to its noninvasive and easily accessible nature.
Collapse
Affiliation(s)
- Emilia Ambrosini
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
| | - Chiara Giangregorio
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
| | - Eugenio Lomurno
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
| | - Sara Moccia
- BioRobotics Institute and Department of Excellence in Robotics and AI, Scuola Superiore Sant'Anna, Pisa, Italy
| | | | - Christos Loizou
- Department of Electrical Engineering, Computer Engineering and Informatics, Cyprus University of Technology, Limassol, Cyprus
| | - Domenico Azzolino
- Geriatric Unit, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico Ca' Granda Ospedale Maggiore Policlinico, Milano, Italy
| | - Matteo Cesari
- Ageing and Health Unit, Department of Maternal, Newborn, Child, Adolescent Health and Ageing, World Health Organization, Geneva, Switzerland
| | - Manuel Cid Gala
- Consejería de Sanidad y Servicios Sociales, Junta de Extremadura, Merida, Spain
| | | | - Jonathan Gomez-Raja
- Consejería de Sanidad y Servicios Sociales, Junta de Extremadura, Merida, Spain
| | | | - Matteo Matteucci
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
| | - Simona Ferrante
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milano, Italy
- Laboratory of E-Health Technologies and Artificial Intelligence Research in Neurology, Joint Research Platform, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico Istituto Neurologico Carlo Besta, Milano, Italy
| |
Collapse
|
6
|
Kaser AN, Lacritz LH, Winiarski HR, Gabirondo P, Schaffert J, Coca AJ, Jiménez-Raboso J, Rojo T, Zaldua C, Honorato I, Gallego D, Nieves ER, Rosenstein LD, Cullum CM. A novel speech analysis algorithm to detect cognitive impairment in a Spanish population. Front Neurol 2024; 15:1342907. [PMID: 38638311 PMCID: PMC11024431 DOI: 10.3389/fneur.2024.1342907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/26/2024] [Indexed: 04/20/2024] Open
Abstract
Objective Early detection of cognitive impairment in the elderly is crucial for diagnosis and appropriate care. Brief, cost-effective cognitive screening instruments are needed to help identify individuals who require further evaluation. This study presents preliminary data on a new screening technology using automated voice recording analysis software in a Spanish population. Method Data were collected from 174 Spanish-speaking individuals clinically diagnosed as cognitively normal (CN, n = 87) or impaired (mild cognitive impairment [MCI], n = 63; all-cause dementia, n = 24). Participants were recorded performing four common language tasks (Animal fluency, alternating fluency [sports and fruits], phonemic "F" fluency, and Cookie Theft Description). Recordings were processed via text-transcription and digital-signal processing techniques to capture neuropsychological variables and audio characteristics. A training sample of 122 subjects with similar demographics across groups was used to develop an algorithm to detect cognitive impairment. Speech and task features were used to develop five independent machine learning (ML) models to compute scores between 0 and 1, and a final algorithm was constructed using repeated cross-validation. A socio-demographically balanced subset of 52 participants was used to test the algorithm. Analysis of covariance (ANCOVA), covarying for demographic characteristics, was used to predict logistically-transformed algorithm scores. Results Mean logit algorithm scores were significantly different across groups in the testing sample (p < 0.01). Comparisons of CN with impaired (MCI + dementia) and MCI groups using the final algorithm resulted in an AUC of 0.93/0.90, with overall accuracy of 88.4%/87.5%, sensitivity of 87.5/83.3, and specificity of 89.2/89.2, respectively. Conclusion Findings provide initial support for the utility of this automated speech analysis algorithm as a screening tool for cognitive impairment in Spanish speakers. Additional study is needed to validate this technology in larger and more diverse clinical populations.
Collapse
Affiliation(s)
- Alyssa N. Kaser
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Laura H. Lacritz
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Holly R. Winiarski
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | | | - Jeff Schaffert
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Alberto J. Coca
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
- Cambridge Mathematics of Information in Healthcare Hub, University of Cambridge, Cambridge, United Kingdom
| | | | - Tomas Rojo
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
| | - Carla Zaldua
- AcceXible Impacto, Sociedad Limitada, Bilbao, Spain
| | | | | | - Emmanuel Rosario Nieves
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Parkland Health and Hospital System Behavioral Health Clinic, Dallas, TX, United States
| | - Leslie D. Rosenstein
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Parkland Health and Hospital System Behavioral Health Clinic, Dallas, TX, United States
| | - C. Munro Cullum
- Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Department of Neurological Surgery, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| |
Collapse
|
7
|
Gkoumas D, Wang B, Tsakalidis A, Wolters M, Purver M, Zubiaga A, Liakata M. A longitudinal multi-modal dataset for dementia monitoring and diagnosis. LANG RESOUR EVAL 2024; 58:883-902. [PMID: 39323983 PMCID: PMC11420249 DOI: 10.1007/s10579-023-09718-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2023] [Indexed: 09/27/2024]
Abstract
Dementia affects cognitive functions of adults, including memory, language, and behaviour. Standard diagnostic biomarkers such as MRI are costly, whilst neuropsychological tests suffer from sensitivity issues in detecting dementia onset. The analysis of speech and language has emerged as a promising and non-intrusive technology to diagnose and monitor dementia. Currently, most work in this direction ignores the multi-modal nature of human communication and interactive aspects of everyday conversational interaction. Moreover, most studies ignore changes in cognitive status over time due to the lack of consistent longitudinal data. Here we introduce a novel fine-grained longitudinal multi-modal corpus collected in a natural setting from healthy controls and people with dementia over two phases, each spanning 28 sessions. The corpus consists of spoken conversations, a subset of which are transcribed, as well as typed and written thoughts and associated extra-linguistic information such as pen strokes and keystrokes. We present the data collection process and describe the corpus in detail. Furthermore, we establish baselines for capturing longitudinal changes in language across different modalities for two cohorts, healthy controls and people with dementia, outlining future research directions enabled by the corpus.
Collapse
Affiliation(s)
- Dimitris Gkoumas
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
| | - Bo Wang
- Center for Precision Psychiatry, Massachusetts General Hospital, Boston, USA
| | - Adam Tsakalidis
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
- The Alan Turing Institute, London, UK
| | - Maria Wolters
- The Alan Turing Institute, London, UK
- School of informatics, University of Edinburgh, Edinburgh, Scotland
| | - Matthew Purver
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
- The Alan Turing Institute, London, UK
- Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia
| | - Arkaitz Zubiaga
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
| | - Maria Liakata
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
- The Alan Turing Institute, London, UK
| |
Collapse
|
8
|
Zhang F, Liu B, Wu P, Johnson C, Bouharati M, Zhang Y, He P, Jiang JJ. Nonlinear Dynamic Analysis and Perturbation Measurement Used for Discriminating Pathological Voices and Their Correlations With Perceptual Evaluation. J Voice 2024:S0892-1997(23)00320-X. [PMID: 38458820 DOI: 10.1016/j.jvoice.2023.10.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Revised: 10/06/2023] [Accepted: 10/06/2023] [Indexed: 03/10/2024]
Abstract
OBJECTIVE 1. To investigate the discriminatory and diagnostic power of nonlinear dynamic analysis measures concerning voices from normal, benign, and malignant voice disorders. 2. To study the correlations of nonlinear dynamic analysis measures with perceptual ratings to evaluate the reliability of the objective acoustic analysis in predicting severity of voice. METHOD The perturbation analysis metrics used were Jitter%, Shimmer%, and signal-to-noise ratio. The nonlinear dynamic analysis metrics used were spectrum convergence ratio (SCR), nonlinear energy difference ratio (NEDR), and rate of divergence (ROD). Subjects were enrolled and divided into three groups based on laryngeal pathology: normal, benign, and malignant. Vowel sound and reading samples were recorded. Perceptual evaluation was applied to these voice samples to investigate correlations between metrics and auditory perception. RESULT Each metric was capable of discriminating laryngeal pathology, except for SCR in the case of distinguishing between benign and malignant pathologies. Perturbation analysis parameters had a moderate ability to differentiate between normal and benign pathologies, but were unable to characterize malignant pathologies for certain diseases, such as Reinke's edema. All metrics significantly correlated with perceptual G scores. Nonlinear dynamic analysis was superior when applied to cases of severe dysphonia, where linear metrics such as Jitter% and Shimmer% tended to lose utility. NEDR and ROD were successful at differentiating between the different pathologies, whereas SCR could not discriminate between the benign and malignant groups. CONCLUSION Perturbation and nonlinear dynamic analyses are comparable in their discriminating power with respect to normal and benign voices, and normal and malignant voices. The nonlinear dynamic analysis metrics NEDR and ROD may be superior in clinical settings with respect to discriminating voice pathology ranging from mild pathological voice to severe dysphonia, and with respect to discriminating benign and malignant voice. SCR was found unable to discriminate pathological voices.
Collapse
Affiliation(s)
- Fan Zhang
- Department of Otolaryngology, EENT Hospital, Fudan University, Shanghai, China
| | - Boquan Liu
- School of Humanities, Shanghai Jiaotong University, Shanghai, China
| | - Pinhong Wu
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin - Madison, Madison, Wisconsin
| | - Colbey Johnson
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin - Madison, Madison, Wisconsin
| | - Melissa Bouharati
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin - Madison, Madison, Wisconsin
| | - Yi Zhang
- Department of Otolaryngology, EENT Hospital, Fudan University, Shanghai, China
| | - Peijie He
- Department of Otolaryngology, EENT Hospital, Fudan University, Shanghai, China.
| | - Jack J Jiang
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin - Madison, Madison, Wisconsin
| |
Collapse
|
9
|
Cerasa A. Fractals in Neuropsychology and Cognitive Neuroscience. ADVANCES IN NEUROBIOLOGY 2024; 36:761-778. [PMID: 38468062 DOI: 10.1007/978-3-031-47606-8_38] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
The fractal dimension of cognition refers to the idea that the cognitive processes of the human brain exhibit fractal properties. This means that certain patterns of cognitive activity, such as visual perception, memory, language, or problem-solving, can be described using the mathematical concept of fractal dimension.The idea that cognition is fractal has been proposed by some researchers as a way to understand the complex, self-similar nature of the human brain. However, it's a relatively new idea and is still under investigation, so it's not yet clear to what extent cognitive processes exhibit fractal properties or what implications this might have for our understanding of the brain and clinical practice. Indeed, the mission of the "fractal neuroscience" field is to define the characteristics of fractality in human cognition in order to differently characterize the emergence of brain disorders.
Collapse
Affiliation(s)
- Antonio Cerasa
- Institute for Biomedical Research and Innovation, National Research Council, IRIB-CNR, Messina, Italy
- S. Anna Institute, Crotone, Italy
- Pharmacotechnology Documentation and Transfer Unit, Preclinical and Translational Pharmacology, Department of Pharmacy, Health Science and Nutrition, University of Calabria, Arcavacata, Italy
| |
Collapse
|
10
|
García AM, de Leon J, Tee BL, Blasi DE, Gorno-Tempini ML. Speech and language markers of neurodegeneration: a call for global equity. Brain 2023; 146:4870-4879. [PMID: 37497623 PMCID: PMC10690018 DOI: 10.1093/brain/awad253] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 06/29/2023] [Accepted: 07/15/2023] [Indexed: 07/28/2023] Open
Abstract
In the field of neurodegeneration, speech and language assessments are useful for diagnosing aphasic syndromes and for characterizing other disorders. As a complement to classic tests, scalable and low-cost digital tools can capture relevant anomalies automatically, potentially supporting the quest for globally equitable markers of brain health. However, this promise remains unfulfilled due to limited linguistic diversity in scientific works and clinical instruments. Here we argue for cross-linguistic research as a core strategy to counter this problem. First, we survey the contributions of linguistic assessments in the study of primary progressive aphasia and the three most prevalent neurodegenerative disorders worldwide-Alzheimer's disease, Parkinson's disease, and behavioural variant frontotemporal dementia. Second, we address two forms of linguistic unfairness in the literature: the neglect of most of the world's 7000 languages and the preponderance of English-speaking cohorts. Third, we review studies showing that linguistic dysfunctions in a given disorder may vary depending on the patient's language and that English speakers offer a suboptimal benchmark for other language groups. Finally, we highlight different approaches, tools and initiatives for cross-linguistic research, identifying core challenges for their deployment. Overall, we seek to inspire timely actions to counter a looming source of inequity in behavioural neurology.
Collapse
Affiliation(s)
- Adolfo M García
- Global Brain Health Institute, University of California, San Francisco, CA 94143, USA
- Cognitive Neuroscience Center, Universidad de San Andrés, Buenos Aires B1644BID, Argentina
- Departamento de Lingüística y Literatura, Facultad de Humanidades, Universidad de Santiago de Chile, Santiago 9160000, Chile
- Latin American Brain Health (BrainLat) Institute, Universidad Adolfo Ibáñez, Avenida Diagonal Las Torres 2640 (7941169), Santiago, Peñalolén, Región Metropolitana, Chile
| | - Jessica de Leon
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, CA 94143, USA
| | - Boon Lead Tee
- Global Brain Health Institute, University of California, San Francisco, CA 94143, USA
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, CA 94143, USA
| | - Damián E Blasi
- Data Science Initiative, Harvard University, Cambridge, MA 02138, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Department of Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, Jena 07745, Germany
| | - Maria Luisa Gorno-Tempini
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, CA 94143, USA
| |
Collapse
|
11
|
Kim SY, Park J, Choi H, Loeser M, Ryu H, Seo K. Digital Marker for Early Screening of Mild Cognitive Impairment Through Hand and Eye Movement Analysis in Virtual Reality Using Machine Learning: First Validation Study. J Med Internet Res 2023; 25:e48093. [PMID: 37862101 PMCID: PMC10625097 DOI: 10.2196/48093] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 08/07/2023] [Accepted: 09/22/2023] [Indexed: 10/21/2023] Open
Abstract
BACKGROUND With the global rise in Alzheimer disease (AD), early screening for mild cognitive impairment (MCI), which is a preclinical stage of AD, is of paramount importance. Although biomarkers such as cerebrospinal fluid amyloid level and magnetic resonance imaging have been studied, they have limitations, such as high cost and invasiveness. Digital markers to assess cognitive impairment by analyzing behavioral data collected from digital devices in daily life can be a new alternative. In this context, we developed a "virtual kiosk test" for early screening of MCI by analyzing behavioral data collected when using a kiosk in a virtual environment. OBJECTIVE We aimed to investigate key behavioral features collected from a virtual kiosk test that could distinguish patients with MCI from healthy controls with high statistical significance. Also, we focused on developing a machine learning model capable of early screening of MCI based on these behavioral features. METHODS A total of 51 participants comprising 20 healthy controls and 31 patients with MCI were recruited by 2 neurologists from a university hospital. The participants performed a virtual kiosk test-developed by our group-where we recorded various behavioral data such as hand and eye movements. Based on these time series data, we computed the following 4 behavioral features: hand movement speed, proportion of fixation duration, time to completion, and the number of errors. To compare these behavioral features between healthy controls and patients with MCI, independent-samples 2-tailed t tests were used. Additionally, we used these behavioral features to train and validate a machine learning model for early screening of patients with MCI from healthy controls. RESULTS In the virtual kiosk test, all 4 behavioral features showed statistically significant differences between patients with MCI and healthy controls. Compared with healthy controls, patients with MCI had slower hand movement speed (t49=3.45; P=.004), lower proportion of fixation duration (t49=2.69; P=.04), longer time to completion (t49=-3.44; P=.004), and a greater number of errors (t49=-3.77; P=.001). All 4 features were then used to train a support vector machine to distinguish between healthy controls and patients with MCI. Our machine learning model achieved 93.3% accuracy, 100% sensitivity, 83.3% specificity, 90% precision, and 94.7% F1-score. CONCLUSIONS Our research preliminarily suggests that analyzing hand and eye movements in the virtual kiosk test holds potential as a digital marker for early screening of MCI. In contrast to conventional biomarkers, this digital marker in virtual reality is advantageous as it can collect ecologically valid data at an affordable cost and in a short period (5-15 minutes), making it a suitable means for early screening of MCI. We call for further studies to confirm the reliability and validity of this approach.
Collapse
Affiliation(s)
- Se Young Kim
- Department of Applied Artificial Intelligence, Seoul National University of Science and Technology, Seoul, Republic of Korea
| | - Jinseok Park
- Department of Neurology, College of Medicine, Hanyang University, Seoul, Republic of Korea
| | - Hojin Choi
- Department of Neurology, College of Medicine, Hanyang University, Seoul, Republic of Korea
| | - Martin Loeser
- Department of Computer Science, Electrical Engineering and Mechatronics, ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Hokyoung Ryu
- Graduate School of Technology and Innovation Management, Hanyang University, Seoul, Republic of Korea
| | - Kyoungwon Seo
- Department of Applied Artificial Intelligence, Seoul National University of Science and Technology, Seoul, Republic of Korea
| |
Collapse
|
12
|
Mandal PK, Mahto RV. Deep Multi-Branch CNN Architecture for Early Alzheimer's Detection from Brain MRIs. SENSORS (BASEL, SWITZERLAND) 2023; 23:8192. [PMID: 37837027 PMCID: PMC10574860 DOI: 10.3390/s23198192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Revised: 09/25/2023] [Accepted: 09/26/2023] [Indexed: 10/15/2023]
Abstract
Alzheimer's disease (AD) is a neurodegenerative disease that can cause dementia and result in a severe reduction in brain function, inhibiting simple tasks, especially if no preventative care is taken. Over 1 in 9 Americans suffer from AD-induced dementia, and unpaid care for people with AD-related dementia is valued at USD 271.6 billion. Hence, various approaches have been developed for early AD diagnosis to prevent its further progression. In this paper, we first review other approaches that could be used for the early detection of AD. We then give an overview of our dataset and propose a deep convolutional neural network (CNN) architecture consisting of 7,866,819 parameters. This model comprises three different convolutional branches, each having a different length. Each branch is comprised of different kernel sizes. This model can predict whether a patient is non-demented, mild-demented, or moderately demented with a 99.05% three-class accuracy. In summary, the deep CNN model demonstrated exceptional accuracy in the early diagnosis of AD, offering a significant advancement in the field and the potential to improve patient care.
Collapse
Affiliation(s)
- Paul K. Mandal
- Department of Computer Science, University of Texas, Austin, TX 78712, USA
| | - Rakeshkumar V. Mahto
- Department of Electrical and Computer Engineering, California State University, Fullerton, CA 92831, USA;
| |
Collapse
|
13
|
García-Gutiérrez F, Marquié M, Muñoz N, Alegret M, Cano A, de Rojas I, García-González P, Olivé C, Puerta R, Orellana A, Montrreal L, Pytel V, Ricciardi M, Zaldua C, Gabirondo P, Hinzen W, Lleonart N, García-Sánchez A, Tárraga L, Ruiz A, Boada M, Valero S. Harnessing acoustic speech parameters to decipher amyloid status in individuals with mild cognitive impairment. Front Neurosci 2023; 17:1221401. [PMID: 37746151 PMCID: PMC10512723 DOI: 10.3389/fnins.2023.1221401] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 08/08/2023] [Indexed: 09/26/2023] Open
Abstract
Alzheimer's disease (AD) is a neurodegenerative condition characterized by a gradual decline in cognitive functions. Currently, there are no effective treatments for AD, underscoring the importance of identifying individuals in the preclinical stages of mild cognitive impairment (MCI) to enable early interventions. Among the neuropathological events associated with the onset of the disease is the accumulation of amyloid protein in the brain, which correlates with decreased levels of Aβ42 peptide in the cerebrospinal fluid (CSF). Consequently, the development of non-invasive, low-cost, and easy-to-administer proxies for detecting Aβ42 positivity in CSF becomes particularly valuable. A promising approach to achieve this is spontaneous speech analysis, which combined with machine learning (ML) techniques, has proven highly useful in AD. In this study, we examined the relationship between amyloid status in CSF and acoustic features derived from the description of the Cookie Theft picture in MCI patients from a memory clinic. The cohort consisted of fifty-two patients with MCI (mean age 73 years, 65% female, and 57% positive amyloid status). Eighty-eight acoustic parameters were extracted from voice recordings using the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS), and several ML models were used to classify the amyloid status. Furthermore, interpretability techniques were employed to examine the influence of input variables on the determination of amyloid-positive status. The best model, based on acoustic variables, achieved an accuracy of 75% with an area under the curve (AUC) of 0.79 in the prediction of amyloid status evaluated by bootstrapping and Leave-One-Out Cross Validation (LOOCV), outperforming conventional neuropsychological tests (AUC = 0.66). Our results showed that the automated analysis of voice recordings derived from spontaneous speech tests offers valuable insights into AD biomarkers during the preclinical stages. These findings introduce novel possibilities for the use of digital biomarkers to identify subjects at high risk of developing AD.
Collapse
Affiliation(s)
| | - Marta Marquié
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Nathalia Muñoz
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Montserrat Alegret
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Amanda Cano
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Itziar de Rojas
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Pablo García-González
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Clàudia Olivé
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Raquel Puerta
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Adelina Orellana
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Laura Montrreal
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Vanesa Pytel
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Mario Ricciardi
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | | | | | - Wolfram Hinzen
- Department of Translation and Language Sciences, Universitat Pompeu Fabra, Barcelona, Spain
- Institut Català de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Núria Lleonart
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Ainhoa García-Sánchez
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
| | - Lluís Tárraga
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Agustín Ruiz
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Mercè Boada
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| | - Sergi Valero
- Ace Alzheimer Center Barcelona, Universitat Internacional de Catalunya, Barcelona, Spain
- Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
14
|
Friedman L, Lauber M, Behroozmand R, Fogerty D, Kunecki D, Berry-Kravis E, Klusek J. Atypical vocal quality in women with the FMR1 premutation: an indicator of impaired sensorimotor control. Exp Brain Res 2023; 241:1975-1987. [PMID: 37347418 PMCID: PMC10863608 DOI: 10.1007/s00221-023-06653-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 06/13/2023] [Indexed: 06/23/2023]
Abstract
Women with the FMR1 premutation are susceptible to motor involvement related to atypical cerebellar function, including risk for developing fragile X tremor ataxia syndrome. Vocal quality analyses are sensitive to subtle differences in motor skills but have not yet been applied to the FMR1 premutation. This study examined whether women with the FMR1 premutation demonstrate differences in vocal quality, and whether such differences relate to FMR1 genetic, executive, motor, or health features of the FMR1 premutation. Participants included 35 women with the FMR1 premutation and 45 age-matched women without the FMR1 premutation who served as a comparison group. Three sustained /a/ vowels were analyzed for pitch (mean F0), variability of pitch (standard deviation of F0), and overall vocal quality (jitter, shimmer, and harmonics-to-noise ratio). Executive, motor, and health indices were obtained from direct and self-report measures and genetic samples were analyzed for FMR1 CGG repeat length and activation ratio. Women with the FMR1 premutation had a lower pitch, larger pitch variability, and poorer vocal quality than the comparison group. Working memory was related to harmonics-to-noise ratio and shimmer in women with the FMR1 premutation. Vocal quality abnormalities differentiated women with the FMR1 premutation from the comparison group and were evident even in the absence of other clinically evident motor deficits. This study supports vocal quality analyses as a tool that may prove useful in the detection of early signs of motor involvement in this population.
Collapse
Affiliation(s)
- Laura Friedman
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Meagan Lauber
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Roozbeh Behroozmand
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Daniel Fogerty
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, USA
| | - Dariusz Kunecki
- Department of Pediatrics, Rush University Medical Center, Chicago, USA
| | | | - Jessica Klusek
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA.
| |
Collapse
|
15
|
Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J Med Internet Res 2023; 25:e46105. [PMID: 37467031 PMCID: PMC10398366 DOI: 10.2196/46105] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/26/2023] [Accepted: 05/23/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems. OBJECTIVE This study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest. METHODS This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group. RESULTS In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network-based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models. CONCLUSIONS This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research.
Collapse
Affiliation(s)
- Alper Idrisoglu
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Ana Luiza Dallora
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Peter Anderberg
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
- School of Health Sciences, University of Skövde, Skövde, Sweden
| | | |
Collapse
|
16
|
Bae M, Seo MG, Ko H, Ham H, Kim KY, Lee JY. The efficacy of memory load on speech-based detection of Alzheimer's disease. Front Aging Neurosci 2023; 15:1186786. [PMID: 37333455 PMCID: PMC10272350 DOI: 10.3389/fnagi.2023.1186786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 05/16/2023] [Indexed: 06/20/2023] Open
Abstract
Introduction The study aims to test whether an increase in memory load could improve the efficacy in detection of Alzheimer's disease and prediction of the Mini-Mental State Examination (MMSE) score. Methods Speech from 45 mild-to-moderate Alzheimer's disease patients and 44 healthy older adults were collected using three speech tasks with varying memory loads. We investigated and compared speech characteristics of Alzheimer's disease across speech tasks to examine the effect of memory load on speech characteristics. Finally, we built Alzheimer's disease classification models and MMSE prediction models to assess the diagnostic value of speech tasks. Results The speech characteristics of Alzheimer's disease in pitch, loudness, and speech rate were observed and the high-memory-load task intensified such characteristics. The high-memory-load task outperformed in AD classification with an accuracy of 81.4% and MMSE prediction with a mean absolute error of 4.62. Discussion The high-memory-load recall task is an effective method for speech-based Alzheimer's disease detection.
Collapse
Affiliation(s)
- Minju Bae
- Interdisciplinary Program in Cognitive Science, Seoul National University, Seoul, Republic of Korea
- Department of Psychiatry, SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Myo-Gyeong Seo
- Department of Psychiatry, SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Hyunwoong Ko
- Interdisciplinary Program in Cognitive Science, Seoul National University, Seoul, Republic of Korea
- Samsung Medical Center, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, Republic of Korea
| | - Hyunsun Ham
- Interdisciplinary Program in Cognitive Science, Seoul National University, Seoul, Republic of Korea
| | - Keun You Kim
- Department of Psychiatry, SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Jun-Young Lee
- Interdisciplinary Program in Cognitive Science, Seoul National University, Seoul, Republic of Korea
- Department of Psychiatry, SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul, Republic of Korea
| |
Collapse
|
17
|
Liu J, Fu F, Li L, Yu J, Zhong D, Zhu S, Zhou Y, Liu B, Li J. Efficient Pause Extraction and Encode Strategy for Alzheimer's Disease Detection Using Only Acoustic Features from Spontaneous Speech. Brain Sci 2023; 13:477. [PMID: 36979287 PMCID: PMC10046767 DOI: 10.3390/brainsci13030477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 03/06/2023] [Accepted: 03/10/2023] [Indexed: 03/14/2023] Open
Abstract
Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer's Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants' spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection.
Collapse
Affiliation(s)
- Jiamin Liu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Fan Fu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Liang Li
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Junxiao Yu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Dacheng Zhong
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Songsheng Zhu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Yuxuan Zhou
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Bin Liu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Jianqing Li
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing 211166, China
| |
Collapse
|
18
|
Yasin S, Othmani A, Raza I, Hussain SA. Machine learning based approaches for clinical and non-clinical depression recognition and depression relapse prediction using audiovisual and EEG modalities: A comprehensive review. Comput Biol Med 2023; 159:106741. [PMID: 37105109 DOI: 10.1016/j.compbiomed.2023.106741] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 02/25/2023] [Accepted: 03/02/2023] [Indexed: 03/07/2023]
Abstract
Mental disorders are rapidly increasing each year and have become a major challenge affecting the social and financial well-being of individuals. There is a need for phenotypic characterization of psychiatric disorders with biomarkers to provide a rich signature for Major Depressive Disorder, improving the understanding of the pathophysiological mechanisms underlying these mental disorders. This comprehensive review focuses on depression and relapse detection modalities such as self-questionnaires, audiovisuals, and EEG, highlighting noteworthy publications in the last ten years. The article concentrates on the literature that adopts machine learning by audiovisual and EEG signals. It also outlines preprocessing, feature extraction, and public datasets for depression detection. The review concludes with recommendations that will help improve the reliability of developed models and the determinism of computational intelligence-based systems in psychiatry. To the best of our knowledge, this survey is the first comprehensive review on depression and relapse prediction by self-questionnaires, audiovisual, and EEG-based approaches. The findings of this review will serve as a useful and structured starting point for researchers studying clinical and non-clinical depression recognition and relapse through machine learning-based approaches.
Collapse
Affiliation(s)
- Sana Yasin
- Department of Computer Science, COMSATS University Islamabad, Lahore Campus Lahore, Pakistan; Department of Computer Science, University of Okara, Okara, Pakistan.
| | - Alice Othmani
- Université Paris-Est Créteil (UPEC), LISSI, Vitry sur Seine, 94400, France.
| | - Imran Raza
- Department of Computer Science, COMSATS University Islamabad, Lahore Campus Lahore, Pakistan.
| | - Syed Asad Hussain
- Department of Computer Science, COMSATS University Islamabad, Lahore Campus Lahore, Pakistan.
| |
Collapse
|
19
|
Yang Q, Li X, Ding X, Xu F, Ling Z. Deep learning-based speech analysis for Alzheimer's disease detection: a literature review. Alzheimers Res Ther 2022; 14:186. [PMID: 36517837 PMCID: PMC9749308 DOI: 10.1186/s13195-022-01131-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 11/23/2022] [Indexed: 12/23/2022]
Abstract
BACKGROUND Alzheimer's disease has become one of the most common neurodegenerative diseases worldwide, which seriously affects the health of the elderly. Early detection and intervention are the most effective prevention methods currently. Compared with traditional detection methods such as traditional scale tests, electroencephalograms, and magnetic resonance imaging, speech analysis is more convenient for automatic large-scale Alzheimer's disease detection and has attracted extensive attention from researchers. In particular, deep learning-based speech analysis and language processing techniques for Alzheimer's disease detection have been studied and achieved impressive results. METHODS To integrate the latest research progresses, hundreds of relevant papers from ACM, DBLP, IEEE, PubMed, Scopus, Web of Science electronic databases, and other sources were retrieved. We used these keywords for paper search: (Alzheimer OR dementia OR cognitive impairment) AND (speech OR voice OR audio) AND (deep learning OR neural network). CONCLUSIONS Fifty-two papers were finally retained after screening. We reviewed and presented the speech databases, deep learning methods, and model performances of these studies. In the end, we pointed out the mainstreams and limitations in the current studies and provided a direction for future research.
Collapse
Affiliation(s)
- Qin Yang
- iFlytek Research, iFlytek Co.Ltd, Hefei, China
| | - Xin Li
- NELSLIP, University of Science and Technology of China, Hefei, China.
- iFlytek Research, iFlytek Co.Ltd, Hefei, China.
| | - Xinyun Ding
- iFlytek Research, iFlytek Co.Ltd, Hefei, China
| | - Feiyang Xu
- iFlytek Research, iFlytek Co.Ltd, Hefei, China
| | - Zhenhua Ling
- NELSLIP, University of Science and Technology of China, Hefei, China
| |
Collapse
|
20
|
Nasiri M, Moayedfar S, Purmohammad M, Ghasisin L. Investigating sentence processing and working memory in patients with mild Alzheimer and elderly people. PLoS One 2022; 17:e0266552. [PMID: 36318545 PMCID: PMC9624401 DOI: 10.1371/journal.pone.0266552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 03/22/2022] [Indexed: 11/18/2022] Open
Abstract
INTRODUCTION Linguistic disorders are one of the common problems in Alzheimer's disease, which in recent years has been considered as one of the key parameters in the diagnosis of Alzheimer (AD). Given that changes in sentence processing and working memory and the relationship between these two activities may be a diagnostic parameter in the early and preclinical stages of AD, the present study examines the comprehension and production of sentences and working memory in AD patients and healthy aged people. METHODS Twenty-five people with mild Alzheimer's and 25 healthy elderly people participated in the study. In this study, we used the digit span to evaluate working memory. Syntactic priming and sentence completion tasks in canonical and non-canonical conditions were used for evaluating sentence production. We administered sentence picture matching and cross-modal naming tasks to assess sentence comprehension. RESULTS The results of the present study revealed that healthy elderly people and patients with mild Alzheimer's disease have a significant difference in comprehension of relative clause sentences (P <0.05). There was no significant difference between the two groups in comprehension of simple active, simple active with noun phrase and passive sentences (P> 0.05). They had a significant difference in auditory and visual reaction time (P <0.05). Also there was a significant difference between the two groups in syntactic priming and sentence completion tasks. However, in non-canonical condition of sentence completion, the difference between the two groups was not significant (P> 0.05). CONCLUSION The results of the present study showed that the mean scores related to comprehension, production and working memory in people with mild Alzheimer's were lower than healthy aged people, which indicate sentence processing problems at this level of the disease. People with Alzheimer have difficulty comprehending and producing complex syntactic structures and have poorer performance in tasks that required more memory demands. It seems that the processing problems of these people are due to both working memory and language problems, which are not separate from each other and both are involved in.
Collapse
Affiliation(s)
- Maryam Nasiri
- Student Research Committee, School of Rehabilitation, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Saeideh Moayedfar
- Department of speech therapy, School of Rehabilitation Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Mehdi Purmohammad
- Institute for Cognitive and Brain Sciences, Shahid Beheshti University, Tehran, Iran
| | - Leila Ghasisin
- Communication Disorders Research Center, Isfahan University of Medical Sciences, Isfahan, Iran
- * E-mail: ,
| |
Collapse
|
21
|
Ngo QC, Motin MA, Pah ND, Drotár P, Kempster P, Kumar D. Computerized analysis of speech and voice for Parkinson's disease: A systematic review. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107133. [PMID: 36183641 DOI: 10.1016/j.cmpb.2022.107133] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 09/13/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Speech impairment is an early symptom of Parkinson's disease (PD). This study has summarized the literature related to speech and voice in detecting PD and assessing its severity. METHODS A systematic review of the literature from 2010 to 2021 to investigate analysis methods and signal features. The keywords "Automatic analysis" in conjunction with "PD speech" or "PD voice" were used, and the PubMed and ScienceDirect databases were searched. A total of 838 papers were found on the first run, of which 189 were selected. One hundred and forty-seven were found to be suitable for the review. The different datasets, recording protocols, signal analysis methods and features that were reported are listed. Values of the features that separate PD patients from healthy controls were tabulated. Finally, the barriers that limit the wide use of computerized speech analysis are discussed. RESULTS Speech and voice may be valuable markers for PD. However, large differences between the datasets make it difficult to compare different studies. In addition, speech analytic methods that are not informed by physiological understanding may alienate clinicians. CONCLUSIONS The potential usefulness of speech and voice for the detection and assessment of PD is confirmed by evidence from the classification and correlation results.
Collapse
Affiliation(s)
| | - Mohammod Abdul Motin
- Biosignals Lab, RMIT University, Melbourne, Australia; Department of Electrical & Electronic Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Nemuel Daniel Pah
- Biosignals Lab, RMIT University, Melbourne, Australia; Universitas Surabaya, Indonesia
| | - Peter Drotár
- Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001, Kosice, Slovakia
| | - Peter Kempster
- Neurosciences Department, Monash Health, Clayton, VIC, Australia; Department of Medicine, School of Clinical Sciences, Monash University, Clayton, VIC, Australia
| | - Dinesh Kumar
- Biosignals Lab, RMIT University, Melbourne, Australia.
| |
Collapse
|
22
|
Sun X, Sun X, Wang Q, Wang X, Feng L, Yang Y, Jing Y, Yang C, Zhang S. Biosensors toward behavior detection in diagnosis of alzheimer’s disease. Front Bioeng Biotechnol 2022; 10:1031833. [PMID: 36338126 PMCID: PMC9626796 DOI: 10.3389/fbioe.2022.1031833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 10/03/2022] [Indexed: 11/30/2022] Open
Abstract
In recent years, a huge number of individuals all over the world, elderly people, in particular, have been suffering from Alzheimer’s disease (AD), which has had a significant negative impact on their quality of life. To intervene early in the progression of the disease, accurate, convenient, and low-cost detection technologies are gaining increased attention. As a result of their multiple merits in the detection and assessment of AD, biosensors are being frequently utilized in this field. Behavioral detection is a prospective way to diagnose AD at an early stage, which is a more objective and quantitative approach than conventional neuropsychological scales. Furthermore, it provides a safer and more comfortable environment than those invasive methods (such as blood and cerebrospinal fluid tests) and is more economical than neuroimaging tests. Behavior detection is gaining increasing attention in AD diagnosis. In this review, cutting-edge biosensor-based devices for AD diagnosis together with their measurement parameters and diagnostic effectiveness have been discussed in four application subtopics: body movement behavior detection, eye movement behavior detection, speech behavior detection, and multi-behavior detection. Finally, the characteristics of behavior detection sensors in various application scenarios are summarized and the prospects of their application in AD diagnostics are presented as well.
Collapse
Affiliation(s)
- Xiaotong Sun
- Ningbo Innovation Center, School of Mechanical Engineering, Zhejiang University, Ningbo, China
- Faculty of Science and Engineering, University of Nottingham Ningbo, Ningbo, China
| | - Xu Sun
- Faculty of Science and Engineering, University of Nottingham Ningbo, Ningbo, China
- Nottingham Ningbo China Beacons of Excellence Research and Innovation Institute, University of Nottingham Ningbo, Ningbo, China
- *Correspondence: Sheng Zhang, ; Xu Sun,
| | - Qingfeng Wang
- Nottingham University Business School China, University of Nottingham Ningbo China, Ningbo, Zhejiang, China
| | - Xiang Wang
- Ningbo Innovation Center, School of Mechanical Engineering, Zhejiang University, Ningbo, China
- Faculty of Science and Engineering, University of Nottingham Ningbo, Ningbo, China
| | - Luying Feng
- Ningbo Innovation Center, School of Mechanical Engineering, Zhejiang University, Ningbo, China
| | - Yifan Yang
- Ningbo Innovation Center, School of Mechanical Engineering, Zhejiang University, Ningbo, China
- Faculty of Science and Engineering, University of Nottingham Ningbo, Ningbo, China
| | - Ying Jing
- Business School, NingboTech University, Ningbo, China
| | - Canjun Yang
- Ningbo Innovation Center, School of Mechanical Engineering, Zhejiang University, Ningbo, China
| | - Sheng Zhang
- Ningbo Innovation Center, School of Mechanical Engineering, Zhejiang University, Ningbo, China
- Faculty of Science and Engineering, University of Nottingham Ningbo, Ningbo, China
- *Correspondence: Sheng Zhang, ; Xu Sun,
| |
Collapse
|
23
|
Herath HMDPM, Weraniyagoda WASA, Rajapaksha RTM, Wijesekara PADSN, Sudheera KLK, Chong PHJ. Automatic Assessment of Aphasic Speech Sensed by Audio Sensors for Classification into Aphasia Severity Levels to Recommend Speech Therapies. SENSORS (BASEL, SWITZERLAND) 2022; 22:6966. [PMID: 36146316 PMCID: PMC9501827 DOI: 10.3390/s22186966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/01/2022] [Accepted: 09/08/2022] [Indexed: 06/16/2023]
Abstract
Aphasia is a type of speech disorder that can cause speech defects in a person. Identifying the severity level of the aphasia patient is critical for the rehabilitation process. In this research, we identify ten aphasia severity levels motivated by specific speech therapies based on the presence or absence of identified characteristics in aphasic speech in order to give more specific treatment to the patient. In the aphasia severity level classification process, we experiment on different speech feature extraction techniques, lengths of input audio samples, and machine learning classifiers toward classification performance. Aphasic speech is required to be sensed by an audio sensor and then recorded and divided into audio frames and passed through an audio feature extractor before feeding into the machine learning classifier. According to the results, the mel frequency cepstral coefficient (MFCC) is the most suitable audio feature extraction method for the aphasic speech level classification process, as it outperformed the classification performance of all mel-spectrogram, chroma, and zero crossing rates by a large margin. Furthermore, the classification performance is higher when 20 s audio samples are used compared with 10 s chunks, even though the performance gap is narrow. Finally, the deep neural network approach resulted in the best classification performance, which was slightly better than both K-nearest neighbor (KNN) and random forest classifiers, and it was significantly better than decision tree algorithms. Therefore, the study shows that aphasia level classification can be completed with accuracy, precision, recall, and F1-score values of 0.99 using MFCC for 20 s audio samples using the deep neural network approach in order to recommend corresponding speech therapy for the identified level. A web application was developed for English-speaking aphasia patients to self-diagnose the severity level and engage in speech therapies.
Collapse
Affiliation(s)
| | | | | | | | | | - Peter Han Joo Chong
- Department of Electrical and Electronic Engineering, Auckland University of Technology, Auckland 1010, New Zealand
| |
Collapse
|
24
|
AI-Atroshi C, Rene Beulah J, Singamaneni KK, Pretty Diana Cyril C, Neelakandan S, Velmurugan S. Automated speech based evaluation of mild cognitive impairment and Alzheimer’s disease detection using with deep belief network model. INTERNATIONAL JOURNAL OF HEALTHCARE MANAGEMENT 2022. [DOI: 10.1080/20479700.2022.2097764] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- Chiai AI-Atroshi
- Department of Educational Counseling, College of Basic Education, University of Duhok, Dahuk, Iraq
| | - J. Rene Beulah
- Department of Computing Technologies, Faculty of Engineering and Technology, College of Engineering and Technology, SRM Institute of Science and Technology, Chennai, India
| | | | - C. Pretty Diana Cyril
- Department of Computing Technologies, Faculty of Engineering and Technology, College of Engineering and Technology, SRM Institute of Science and Technology, Chennai, India
| | - S. Neelakandan
- Department of CSE, R.M.K Engineering College, Chennai, India
| | - S. Velmurugan
- Department of CSE, Vel Tech Multi Tech Dr.Rangarajan Dr.Sakunthala Engineering College, Chennai, India
| |
Collapse
|
25
|
Lofgren M, Hinzen W. Breaking the flow of thought: Increase of empty pauses in the connected speech of people with mild and moderate Alzheimer's disease. JOURNAL OF COMMUNICATION DISORDERS 2022; 97:106214. [PMID: 35397387 DOI: 10.1016/j.jcomdis.2022.106214] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/03/2022] [Accepted: 03/15/2022] [Indexed: 06/14/2023]
Abstract
INTRODUCTION The profile of spontaneous speech in Alzheimer's disease (AD) includes increased pausing as a window into cognitive decline. We here aimed to further characterize the pausing profile of AD by linking pauses to the syntactic positions in which they appear and disease progression. METHODS Speech was obtained through a picture description task, thus minimizing demands on episodic memory (EM), from a group of mild (N = 21) and moderate AD (N = 19), and healthy elderly controls (N = 40). Pauses were sub-indexed according to whether they occurred within-clauses, clause-initially, or utterance-initially, and whether they preceded nouns, verbs, or adjectives/adverbs, when occurring within-clauses. Additionally, relations to verbal fluency (VF) measures at the single-word level were explored. RESULTS Pause rate but not duration distinguished controls from both AD groups, while fillers did not distinguish any groups. The analysis by syntactic position revealed a highly differentiated picture, with largest effect sizes of significant group differences seen in the utterance-initial pause rate. The two AD groups patterned differently when compared to controls, while none of the measures differentiated the AD groups. Specifically, moderate but not mild AD differed from controls in clause-initial pauses, while mild but not moderate AD differed from controls in within-clause positions. At the within-clause level, the effect dividing controls from mild-AD was specifically driven by pauses ahead of nouns. A significant negative correlation emerged between pausing rate in spontaneous speech and VF measures in the mild-AD group only. CONCLUSIONS Increased empty (non-filled) pauses in AD are not confined to pauses in within-clause positions, which are most directly related to problems in the retrieval of words. Even in early disease stages, where these within-clause pause effects are seen, they are confined to nouns, revealing a grammatically specific problem possibly related to the referencing of objects. At all disease stages, pauses increase in utterance-sized units of structure, indicating progressive problems in the creative configuration of complete thoughts.
Collapse
Affiliation(s)
- Mary Lofgren
- Dept. Translation & Language Sciences, Universitat Pompeu Fabra, Carrer Roc Boronat, 138, Barcelona 08018, Spain.
| | - Wolfram Hinzen
- Dept. Translation & Language Sciences, Universitat Pompeu Fabra, Carrer Roc Boronat, 138, Barcelona 08018, Spain; Intitut Català de Recerca i Estudis Avançats (ICREA), Barcelona, Spain, Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
| |
Collapse
|
26
|
Kálmán J, Devanand DP, Gosztolya G, Balogh R, Imre N, Tóth L, Hoffmann I, Kovács I, Vincze V, Pákáski M. Temporal speech parameters detect mild cognitive impairment in different languages: validation and comparison of the Speech-GAP Test® in English and Hungarian. Curr Alzheimer Res 2022; 19:373-386. [PMID: 35440309 DOI: 10.2174/1567205019666220418155130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 02/08/2022] [Accepted: 02/17/2022] [Indexed: 11/22/2022]
Abstract
BACKGROUND The development of automatic speech recognition (ASR) technology allows the analysis of temporal (time-based) speech parameters characteristic of mild cognitive impairment (MCI). However, no information has been available on whether the analysis of spontaneous speech can be used with the same efficiency in different language environments. OBJECTIVE The main goal of this international pilot study is to address the question whether the Speech-Gap Test® (S-GAP Test®), previously tested in the Hungarian language, is appropriate for and applicable to the recognition of MCI in other languages such as English. METHOD After an initial screening of 88 individuals, English-speaking (n = 33) and Hungarian-speaking (n = 33) participants were classified as having MCI or as healthy controls (HC) based on Petersen's criteria. Speech of each participant was recorded via a spontaneous speech task. 15 temporal parameters were determined and calculated by means of ASR. RESULTS Seven temporal parameters in the English-speaking sample and 5 in the Hungarian-speaking sample showed significant differences between the MCI and the HC group. Receiver operating characteristics (ROC) analysis clearly distinguished the English-speaking MCI cases from the HC group based on speech tempo and articulation tempo with 100% sensitivity, and on three more temporal parameters with high sensitivity (85.7%). In the Hungarian-speaking sample, the ROC analysis showed similar sensitivity rates (92.3%). CONCLUSION The results of this study in different native-speaking populations suggest that changes in acoustic parameters detected by the S-GAP Test® might be present across different languages.
Collapse
Affiliation(s)
- János Kálmán
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Davangere P Devanand
- Columbia University Medical Center, New York, NY.,New York State Psychiatric Institute, New York, NY
| | - Gábor Gosztolya
- MTA-SZTE Research Group on Artificial Intelligence, Faculty of Science and Informatics, University of Szeged, Szeged
| | - Réka Balogh
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Nóra Imre
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - László Tóth
- Faculty of Science and Informatics, University of Szeged, Szeged
| | - Ildikó Hoffmann
- Faculty of Humanities and Social Sciences, University of Szeged, Szeged.,Hungarian Research Centre for Linguistics, Eötvös Loránd Research Network, Budapest
| | - Ildikó Kovács
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Veronika Vincze
- MTA-SZTE Research Group on Artificial Intelligence, Faculty of Science and Informatics, University of Szeged, Szeged
| | - Magdolna Pákáski
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| |
Collapse
|
27
|
Imre N, Balogh R, Gosztolya G, Tóth L, Hoffmann I, Várkonyi T, Lengyel C, Pákáski M, Kálmán J. Temporal Speech Parameters Indicate Early Cognitive Decline in Elderly Patients With Type 2 Diabetes Mellitus. Alzheimer Dis Assoc Disord 2022; 36:148-155. [PMID: 35293378 PMCID: PMC9132238 DOI: 10.1097/wad.0000000000000492] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 12/28/2021] [Indexed: 12/02/2022]
Abstract
INTRODUCTION The earliest signs of cognitive decline include deficits in temporal (time-based) speech characteristics. Type 2 diabetes mellitus (T2DM) patients are more prone to mild cognitive impairment (MCI). The aim of this study was to compare the temporal speech characteristics of elderly (above 50 y) T2DM patients with age-matched nondiabetic subjects. MATERIALS AND METHODS A total of 160 individuals were screened, 100 of whom were eligible (T2DM: n=51; nondiabetic: n=49). Participants were classified either as having healthy cognition (HC) or showing signs of MCI. Speech recordings were collected through a phone call. Based on automatic speech recognition, 15 temporal parameters were calculated. RESULTS The HC with T2DM group showed significantly shorter utterance length, higher duration rate of silent pause and total pause, and higher average duration of silent pause and total pause compared with the HC without T2DM group. Regarding the MCI participants, parameters were similar between the T2DM and the nondiabetic subgroups. CONCLUSIONS Temporal speech characteristics of T2DM patients showed early signs of altered cognitive functioning, whereas neuropsychological tests did not detect deterioration. This method is useful for identifying the T2DM patients most at risk for manifest MCI, and could serve as a remote cognitive screening tool.
Collapse
Affiliation(s)
| | | | - Gábor Gosztolya
- MTA-SZTE Research Group on Artificial Intelligence, University of Szeged, Szeged
| | - László Tóth
- MTA-SZTE Research Group on Artificial Intelligence, University of Szeged, Szeged
| | - Ildikó Hoffmann
- Hungarian Linguistics
- Hungarian Research Centre for Linguistics, Eötvös Loránd Research Network, Budapest, Hungary
| | | | | | | | | |
Collapse
|
28
|
Tang F, Chen J, Dodge HH, Zhou J. The Joint Effects of Acoustic and Linguistic Markers for Early Identification of Mild Cognitive Impairment. Front Digit Health 2022; 3:702772. [PMID: 35224534 PMCID: PMC8878676 DOI: 10.3389/fdgth.2021.702772] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 12/27/2021] [Indexed: 12/14/2022] Open
Abstract
In recent years, behavioral markers such as spoken language and lexical preferences have been studied in the early detection of mild cognitive impairment (MCI) using conversations. While the combination of linguistic and acoustic signals have been shown to be effective in detecting MCI, they have generally been restricted to structured conversations in which the interviewee responds to fixed prompts. In this study, we show that linguistic and acoustic features can be combined synergistically to identify MCI in semi-structured conversations. Using conversational data from an on-going clinical trial (Clinicaltrials.gov: NCT02871921), we find that the combination of linguistic and acoustic features on semi-structured conversations achieves a mean AUC of 82.7, significantly (p < 0.01) out-performing linguistic-only (74.9 mean AUC) or acoustic-only (65.0 mean AUC) detections on hold-out data. Additionally, features (linguistic, acoustic and combination) obtained from semi-structured conversations outperform their counterparts obtained from structured weekly conversations in identifying MCI. Some linguistic categories are significantly better at predicting MCI status (e.g., death, home) than others.
Collapse
Affiliation(s)
- Fengyi Tang
- Department of Computer Science of Engineering, Michigan State University, East Lansing, MI, United States
| | - Jun Chen
- Department of Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| | - Hiroko H. Dodge
- Department of Neurology, Layton Aging and Alzheimer's Disease Center, Oregon Health & Science University, Portland, OR, United States
| | - Jiayu Zhou
- Department of Computer Science of Engineering, Michigan State University, East Lansing, MI, United States
| |
Collapse
|
29
|
Assogna M, Sprugnoli G, Press D, Dickerson B, Macone J, Bonnì S, Borghi I, Connor A, Hoffman M, Grover N, Wong B, Shen C, Martorana A, O'Reilly M, Ruffini G, El Fakhri G, Koch G, Santarnecchi E. Gamma-induction in frontotemporal dementia (GIFTeD) randomized placebo-controlled trial: Rationale, noninvasive brain stimulation protocol, and study design. ALZHEIMER'S & DEMENTIA (NEW YORK, N. Y.) 2022; 7:e12219. [PMID: 35141396 PMCID: PMC8813035 DOI: 10.1002/trc2.12219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 08/02/2021] [Accepted: 09/20/2021] [Indexed: 11/30/2022]
Abstract
INTRODUCTION Frontotemporal dementia (FTD) is a neurodegenerative disorder for which there is no effective pharmacological treatment. Recently, interneuron activity responsible for fast oscillatory brain activity has been found to be impaired in a mouse model of FTD with consequent cognitive and behavioral alterations. In this study, we aim to investigate the safety, tolerability, and efficacy of a novel promising therapeutic intervention for FTD based on 40 Hz transcranial alternating current stimulation (tACS), a form of non-invasive brain stimulation thought to engage neural activity in a frequency-specific manner and thus suited to restore altered brain oscillatory patterns. METHODS This is a multi-site, randomized, double-blind, placebo-controlled trial on 50 patients with a diagnosis of behavioral variant FTD (bvFTD). Participants will be randomized to undergo either 30 days of 1-hour daily tACS or Sham (placebo) tACS. The outcomes will be assessed at baseline, right after the intervention and at a 3- to 6-months follow-up. The primary outcome measures are represented by the safety and feasibility of tACS administration, which will be assessed considering the nature, frequency, and severity of adverse events as well as attrition rate, respectively. To assess secondary outcomes, participants will undergo extensive neuropsychological and behavioral assessments and fluorodeoxyglucose (FDG)-positron emission tomography (PET) scans to evaluate changes in brain metabolism, functional and structural magnetic resonance imaging (MRI), resting and evoked electroencephalography, as well as blood biomarkers to measure changes in neurodegenerative and neuroinflammatory markers. RESULTS The trial started in October 2020 and will end in October 2023. Study protocols have been approved by the local institutional review board (IRB) at each data-collection site. DISCUSSION This study will evaluate the safety and tolerability of 40 Hz tACS in bvFTD patients and its efficacy on gamma oscillatory activity, cognitive function, and brain glucose hypometabolism.
Collapse
Affiliation(s)
- Martina Assogna
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
- Non‐Invasive Brain Stimulation UnitDepartment of Behavioural and Clinical NeurologySanta Lucia Foundation IRCCSRomeItaly
| | - Giulia Sprugnoli
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
- Radiology UnitDepartment of Medicine and SurgeryUniversity of ParmaParmaItaly
| | - Daniel Press
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
| | - Brad Dickerson
- Frontotemporal Disorders Unit and Alzheimer's Disease Research CenterDepartments of Psychiatry and NeurologyMassachusetts General HospitalBostonMassachusettsUSA
| | - Joanna Macone
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
| | - Sonia Bonnì
- Non‐Invasive Brain Stimulation UnitDepartment of Behavioural and Clinical NeurologySanta Lucia Foundation IRCCSRomeItaly
| | - Ilaria Borghi
- Non‐Invasive Brain Stimulation UnitDepartment of Behavioural and Clinical NeurologySanta Lucia Foundation IRCCSRomeItaly
| | - Ann Connor
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
| | - Megan Hoffman
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
| | - Nainika Grover
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
| | - Bonnie Wong
- Frontotemporal Disorders Unit and Alzheimer's Disease Research CenterDepartments of Psychiatry and NeurologyMassachusetts General HospitalBostonMassachusettsUSA
| | - Changyu Shen
- Richard and Susan Smith Center for Outcomes Research in CardiologyDivision of CardiologyBeth Israel Deaconess Medical and Harvard Medical SchoolBostonMassachusettsUSA
| | | | - Molly O'Reilly
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
| | | | - Georges El Fakhri
- Gordon Center for Medical ImagingDepartment of RadiologyMassachusetts General HospitalHarvard Medical SchoolBostonMassachusettsUSA
| | - Giacomo Koch
- Non‐Invasive Brain Stimulation UnitDepartment of Behavioural and Clinical NeurologySanta Lucia Foundation IRCCSRomeItaly
| | - Emiliano Santarnecchi
- Berenson‐Allen Center for Noninvasive Brain StimulationBeth Israel Deaconess Medical CenterHarvard Medical SchoolBostonMassachusettsUSA
- Gordon Center for Medical ImagingDepartment of RadiologyMassachusetts General HospitalHarvard Medical SchoolBostonMassachusettsUSA
| |
Collapse
|
30
|
Reizabal L, Garcia I, Sansinenea E, Sarrionandia A, Fernández E, Lopez-de-Ipina K. Psychological Effects of Secondary Prisonization on Older Parents. TRENDS IN PSYCHOLOGY 2021. [DOI: 10.1007/s43076-021-00127-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Abstract
Secondary prisonization refers to the impact of the incarceration of a relative on the members of their family. This study aimed to analyze the psychological effects of secondary prisonization on older parents. Specifically, levels of depression, anxiety, stress, and well-being (emotional, psychological, and social) were analyzed by means of quantitative and automatic speech analysis methods in a sample of over 65-year-old parents of Basque prisoners incarcerated in remote prisons. The statistical analysis of data and the automatic spontaneous speech analysis showed that secondary prisonization has a negative impact on older parents’ levels of depression, anxiety, stress, and well-being. These results lead us to conclude that remote imprisonment of adult children has negative psychological effects on older parents.
Collapse
|
31
|
Munthuli A, Vongsurakrai S, Anansiripinyo T, Ellermann V, Sroykhumpa K, Onsuwan C, Chutichetpong P, Hemrungrojn S, Kosawat K, Tantibundhit C. Thammasat-NECTEC-Chula's Thai Language and Cognition Assessment (TLCA): The Thai Alzheimer's and Mild Cognitive Impairment Screening Test. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:690-694. [PMID: 34891386 DOI: 10.1109/embc46164.2021.9630779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Thammasat-NECTEC-Chula's Thai Language and Cognition Assessment (TLCA) is a cognitive paper-based test consisting of 21 tasks that cover 3 domains: memory, language, and other cognitive abilities. The TLCA follows some aspects of the existing tests (Thai Addenbrooke's Cognitive Examination-Revised (Thai-ACE-R) and the Thai Montreal Cognitive Assessment Test (Thai-MoCA)) and many parts were reconstructed to be more adapted to the Thai culture. Data obtained from the test will be able to precisely distinguish between patients with Mild Cognitive Impairment (MCI), Alzheimer's Disease (AD), and Normal healthy Controls (NC). The TLCA was tested on 90 participants (32 on the paper-based version and 58 on the computerized version) using a scoring procedure and speech features from verbal responses with machine learning classification. The scoring results showed significant difference between non-AD (NC + MCI) vs AD participants in 3 domains and could differentiate between NC and MCI, while machine classification could classify in three settings: NC vs non-NC (MCI + AD), AD vs non-AD and NC vs MCI vs AD. These promising results suggest that TLCA could be further verified and used as an efficient assessment in MCI and AD screening for Thais.Clinical relevance- The speech feature analysis of TLCA showed promising result for screening MCI and AD for Thais.
Collapse
|
32
|
Sangchocanonta S, Vongsurakrai S, Sroykhumpa K, Ellermann V, Munthuli A, Anansiripinyo T, Onsuwan C, Hemrungrojn S, Kosawat K, Tantibundhit C. Development of Thai Picture Description Task for Alzheimer's Screening using Part-of-Speech Tagging. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:2104-2109. [PMID: 34891704 DOI: 10.1109/embc46164.2021.9629861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Alzheimer's Disease (AD) and Mild Cognitive Impairment (MCI) are among the most common health conditions in elderly patients. Currently, methods to diagnose AD and MCI are lengthy, costly and require specialized staff to operate. A picture description task was developed to speed up the diagnosis. It was designed to be suitable and relatable to the Thai culture. In this paper, we will be presenting two picture description tasks named Thais-at-Home and Thai Temple Fair. The developed picture set was presented to 90 participants (30 normals, 30 MCI patients, and 30 AD patients). Then, the recording in the form of spontaneous speech is converted to text. A Part-of-Speech (PoS) tagger is used to categorize words into 7 types (noun, pronoun, adjective, verb, conjunction, preposition, and interjection) according to the Office of the Royal Society of Thailand. Six machine learning algorithms were applied to train with the PoS patterns and their performances were compared. Results showed that the PoS can be used to classify patients (MCI and AD) and healthy controls using multilayer perceptron with 90.00% sensitivity, 80.00% specificity, and 86.67% accuracy. Moreover, the findings showed that healthy controls used more conjunctions and verbs but fewer pronouns than the patients.Clinical relevance- The picture description tasks using part-of-speech (PoS) to showed promising results in screening Alzheimer's patients.
Collapse
|
33
|
Vincze V, Szatlóczki G, Tóth L, Gosztolya G, Pákáski M, Hoffmann I, Kálmán J. Telltale silence: temporal speech parameters discriminate between prodromal dementia and mild Alzheimer's disease. CLINICAL LINGUISTICS & PHONETICS 2021; 35:727-742. [PMID: 32993390 DOI: 10.1080/02699206.2020.1827043] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 09/11/2020] [Accepted: 09/18/2020] [Indexed: 06/11/2023]
Abstract
This study presents a novel approach for the early detection of mild cognitive impairment (MCI) and mild Alzheimer's disease (mAD) in the elderly. Participants were 25 elderly controls (C), 25 clinically diagnosed MCI and 25 mAD patients, included after a clinical diagnosis validated by CT or MRI and cognitive tests. Our linguistic protocol involved three connected speech tasks that stimulate different memory systems, which were recorded, then analyzed linguistically by using the PRAAT software. The temporal speech-related parameters successfully differentiate MCI from mAD and C, such as speech rate, number and length of pauses, the rate of pause and signal. Parameters pauses/duration and silent pauses/duration linearly decreased among the groups, in other words, the percentage of pauses in the total duration of speech continuously grows as dementia progresses. Thus, the proposed approach may be an effective tool for screening MCI and mAD.
Collapse
Affiliation(s)
- Veronika Vincze
- MTA-SZTE Research Group on Artifical Intelligence, Szeged, Hungary
| | | | - László Tóth
- Institute of Informatics, University of Szeged, Szeged, Hungary
| | - Gábor Gosztolya
- MTA-SZTE Research Group on Artifical Intelligence, Szeged, Hungary
| | | | - Ildikó Hoffmann
- Department of Linguistics, University of Szeged, Szeged, Hungary
- Research Institute for Linguistics, Hungarian Academy of Sciences, Budapest, Hungary
| | - János Kálmán
- Department of Psychiatry, University of Szeged, Szeged, Hungary
| |
Collapse
|
34
|
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study. COMPUT SPEECH LANG 2021. [DOI: 10.1016/j.csl.2021.101195] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
35
|
Nasreen S, Rohanian M, Hough J, Purver M. Alzheimer’s Dementia Recognition From Spontaneous Speech Using Disfluency and Interactional Features. FRONTIERS IN COMPUTER SCIENCE 2021. [DOI: 10.3389/fcomp.2021.640669] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Alzheimer’s disease (AD) is a progressive, neurodegenerative disorder mainly characterized by memory loss with deficits in other cognitive domains, including language, visuospatial abilities, and changes in behavior. Detecting diagnostic biomarkers that are noninvasive and cost-effective is of great value not only for clinical assessments and diagnostics but also for research purposes. Several previous studies have investigated AD diagnosis via the acoustic, lexical, syntactic, and semantic aspects of speech and language. Other studies include approaches from conversation analysis that look at more interactional aspects, showing that disfluencies such as fillers and repairs, and purely nonverbal features such as inter-speaker silence, can be key features of AD conversations. These kinds of features, if useful for diagnosis, may have many advantages: They are simple to extract and relatively language-, topic-, and task-independent. This study aims to quantify the role and contribution of these features of interaction structure in predicting whether a dialogue participant has AD. We used a subset of the Carolinas Conversation Collection dataset of patients with AD at moderate stage within the age range 60–89 and similar-aged non-AD patients with other health conditions. Our feature analysis comprised two sets: disfluency features, including indicators such as self-repairs and fillers, and interactional features, including overlaps, turn-taking behavior, and distributions of different types of silence both within patient speech and between patient and interviewer speech. Statistical analysis showed significant differences between AD and non-AD groups for several disfluency features (edit terms, verbatim repeats, and substitutions) and interactional features (lapses, gaps, attributable silences, turn switches per minute, standardized phonation time, and turn length). For the classification of AD patient conversations vs. non-AD patient conversations, we achieved 83% accuracy with disfluency features, 83% accuracy with interactional features, and an overall accuracy of 90% when combining both feature sets using support vector machine classifiers. The discriminative power of these features, perhaps combined with more conventional linguistic features, therefore shows potential for integration into noninvasive clinical assessments for AD at advanced stages.
Collapse
|
36
|
Verma RK, Pandey M, Chawla P, Choudhury H, Mayuren J, Bhattamisra SK, Gorain B, Raja MAG, Amjad MW, Obaidur Rahman S. An insight into the role of Artificial Intelligence in the early diagnosis of Alzheimer's disease. CNS & NEUROLOGICAL DISORDERS-DRUG TARGETS 2021; 21:901-912. [PMID: 33982657 DOI: 10.2174/1871527320666210512014505] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 01/12/2021] [Accepted: 02/17/2021] [Indexed: 11/22/2022]
Abstract
BACKGROUND The complication of Alzheimer's disease (AD) has made the development of its therapeutic a challenging task. Even after decades of research, we have achieved no more than a few years of symptomatic relief. The inability to diagnose the disease early is the foremost hurdle behind its treatment. Several studies have aimed to identify potential biomarkers that can be detected in body fluids (CSF, blood, urine, etc) or assessed by neuroimaging (i.e., PET and MRI). However, the clinical implementation of these biomarkers is incomplete as they cannot be validated. METHOD To overcome the limitation, the use of artificial intelligence along with technical tools has been extensively investigated for AD diagnosis. For developing a promising artificial intelligence strategy that can diagnose AD early, it is critical to supervise neuropsychological outcomes and imaging-based readouts with a proper clinical review. CONCLUSION Profound knowledge, a large data pool, and detailed investigations are required for the successful implementation of this tool. This review will enlighten various aspects of early diagnosis of AD using artificial intelligence.
Collapse
Affiliation(s)
- Rohit Kumar Verma
- International Medical University Department of Pharmacy Practice, School of Pharmacy, Malaysia
| | - Manisha Pandey
- Department of Pharmaceutical Technology, School of Pharmacy, International Medical University-Bukit Jalil 57000, Kuala Lumpur, Malaysia School of Pharmacy,, Malaysia
| | - Pooja Chawla
- ISF College of Pharmacy, Moga Pharmaceutical Chemistry, India
| | - Hira Choudhury
- International Medical University Pharmaceutical Technology, Malaysia
| | - Jayashree Mayuren
- School of Pharmacy, International Medical University Department of Pharmaceutical Technology,, Malaysia
| | | | - Bapi Gorain
- Lincoln University College Faculty of Pharmacy, Malaysia
| | | | | | - Syed Obaidur Rahman
- Department of Pharmaceutical Medicine, School of Pharmaceutical Education and Research, Jamia Humdard, New Delhi India Pharmacology, India
| |
Collapse
|
37
|
Tăuţan AM, Ionescu B, Santarnecchi E. Artificial intelligence in neurodegenerative diseases: A review of available tools with a focus on machine learning techniques. Artif Intell Med 2021; 117:102081. [PMID: 34127244 DOI: 10.1016/j.artmed.2021.102081] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 02/21/2021] [Accepted: 04/26/2021] [Indexed: 10/21/2022]
Abstract
Neurodegenerative diseases have shown an increasing incidence in the older population in recent years. A significant amount of research has been conducted to characterize these diseases. Computational methods, and particularly machine learning techniques, are now very useful tools in helping and improving the diagnosis as well as the disease monitoring process. In this paper, we provide an in-depth review on existing computational approaches used in the whole neurodegenerative spectrum, namely for Alzheimer's, Parkinson's, and Huntington's Diseases, Amyotrophic Lateral Sclerosis, and Multiple System Atrophy. We propose a taxonomy of the specific clinical features, and of the existing computational methods. We provide a detailed analysis of the various modalities and decision systems employed for each disease. We identify and present the sleep disorders which are present in various diseases and which represent an important asset for onset detection. We overview the existing data set resources and evaluation metrics. Finally, we identify current remaining open challenges and discuss future perspectives.
Collapse
Affiliation(s)
- Alexandra-Maria Tăuţan
- University "Politehnica" of Bucharest, Splaiul Independenţei 313, 060042 Bucharest, Romania.
| | - Bogdan Ionescu
- University "Politehnica" of Bucharest, Splaiul Independenţei 313, 060042 Bucharest, Romania.
| | - Emiliano Santarnecchi
- Berenson-Allen Center for Noninvasive Brain Stimulation, Harvard Medical School, 330 Brookline Avenue, Boston, United States.
| |
Collapse
|
38
|
Martínez-Nicolás I, Llorente TE, Martínez-Sánchez F, Meilán JJG. Ten Years of Research on Automatic Voice and Speech Analysis of People With Alzheimer's Disease and Mild Cognitive Impairment: A Systematic Review Article. Front Psychol 2021; 12:620251. [PMID: 33833713 PMCID: PMC8021952 DOI: 10.3389/fpsyg.2021.620251] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 02/15/2021] [Indexed: 11/25/2022] Open
Abstract
Background: The field of voice and speech analysis has become increasingly popular over the last 10 years, and articles on its use in detecting neurodegenerative diseases have proliferated. Many studies have identified characteristic speech features that can be used to draw an accurate distinction between healthy aging among older people and those with mild cognitive impairment and Alzheimer's disease. Speech analysis has been singled out as a cost-effective and reliable method for detecting the presence of both conditions. In this research, a systematic review was conducted to determine these features and their diagnostic accuracy. Methods: Peer-reviewed literature was located across multiple databases, involving studies that apply new procedures of automatic speech analysis to collect behavioral evidence of linguistic impairments along with their diagnostic accuracy on Alzheimer's disease and mild cognitive impairment. The risk of bias was assessed by using JBI and QUADAS-2 checklists. Results: Thirty-five papers met the inclusion criteria; of these, 11 were descriptive studies that either identified voice features or explored their cognitive correlates, and the rest were diagnostic studies. Overall, the studies were of good quality and presented solid evidence of the usefulness of this technique. The distinctive acoustic and rhythmic features found are gathered. Most studies record a diagnostic accuracy over 88% for Alzheimer's and 80% for mild cognitive impairment. Conclusion: Automatic speech analysis is a promising tool for diagnosing mild cognitive impairment and Alzheimer's disease. The reported features seem to be indicators of the cognitive changes in older people. The specific features and the cognitive changes involved could be the subject of further research.
Collapse
Affiliation(s)
- Israel Martínez-Nicolás
- Faculty of Psychology, University of Salamanca, Salamanca, Spain.,Institute of Neuroscience of Castilla y León, University of Salamanca, Salamanca, Spain
| | - Thide E Llorente
- Faculty of Psychology, University of Salamanca, Salamanca, Spain.,Institute of Neuroscience of Castilla y León, University of Salamanca, Salamanca, Spain
| | | | - Juan José G Meilán
- Faculty of Psychology, University of Salamanca, Salamanca, Spain.,Institute of Neuroscience of Castilla y León, University of Salamanca, Salamanca, Spain
| |
Collapse
|
39
|
Coillot M. [Use of new technologies in the diagnosis of neurodegenerative pathologies]. SOINS. GERONTOLOGIE 2021; 26:15-19. [PMID: 33894908 DOI: 10.1016/j.sger.2021.01.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
These last ten years, new technologies are more and more used in therapeutic and rehabilitation programms for patients with dementia, and used for the diagnosis of theses diseases, from the signal treatment. A review of litterature shows this growing interest among the scientific communauty for these new technologies.
Collapse
Affiliation(s)
- Mickaël Coillot
- c/o Soins gérontologie, Elsevier Masson, 65 rue Camille-Desmoulins, 92442 Issy-les-Moulineaux cedex, France.
| |
Collapse
|
40
|
Fares N, Sherratt RS, Elhajj IH. Directing and Orienting ICT Healthcare Solutions to Address the Needs of the Aging Population. Healthcare (Basel) 2021; 9:147. [PMID: 33540510 PMCID: PMC7912863 DOI: 10.3390/healthcare9020147] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 01/25/2021] [Accepted: 01/26/2021] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND With an aging population, it is essential to maintain good health and autonomy for as long as possible. Instead of hospitalisation or institutionalisation, older people with chronic conditions can be assisted in their own home with numerous "smart" devices that support them in their activities of daily living, manage their medical conditions, and prevent fall incidents. Information and Communication Technology (ICT) solutions facilitate the monitoring and management of older people's health to improve quality of life and physical activity with a decline in caregivers' burden. METHOD The aim of this paper was to conduct a systematic literature review to analyse the state of the art of ICT solutions for older people with chronic conditions, and the impact of these solutions on their quality of life from a biomedical perspective. RESULTS By analysing the literature on the available ICT proposals, it is shown that different approaches have been deployed by noticing that the more cross-interventions are merged then the better the results are, but there is still no evidence of the effects of ICT solutions on older people's health outcomes. Furthermore, there are still unresolved ethical and legal issues. CONCLUSION While there has been much research and development in healthcare ICT solutions for the aging population, ICT solutions still need significant development in order to be user-oriented, affordable, and to manage chronic conditions in the aging wider population.
Collapse
Affiliation(s)
- Nada Fares
- Department of Biomedical Engineering, School of Biological Sciences, University of Reading, Berkshire RG6 6AY, UK;
| | - R. Simon Sherratt
- Department of Biomedical Engineering, School of Biological Sciences, University of Reading, Berkshire RG6 6AY, UK;
| | - Imad H. Elhajj
- Department of Electrical and Computer Engineering, American University of Beirut, Beirut 1107 2020, Lebanon;
| |
Collapse
|
41
|
Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia. COMPUT SPEECH LANG 2021. [DOI: 10.1016/j.csl.2020.101113] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
42
|
|
43
|
Petti U, Baker S, Korhonen A. A systematic literature review of automatic Alzheimer's disease detection from speech and language. J Am Med Inform Assoc 2020; 27:1784-1797. [PMID: 32929494 PMCID: PMC7671617 DOI: 10.1093/jamia/ocaa174] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 05/14/2020] [Accepted: 07/14/2020] [Indexed: 01/21/2023] Open
Abstract
OBJECTIVE In recent years numerous studies have achieved promising results in Alzheimer's Disease (AD) detection using automatic language processing. We systematically review these articles to understand the effectiveness of this approach, identify any issues and report the main findings that can guide further research. MATERIALS AND METHODS We searched PubMed, Ovid, and Web of Science for articles published in English between 2013 and 2019. We performed a systematic literature review to answer 5 key questions: (1) What were the characteristics of participant groups? (2) What language data were collected? (3) What features of speech and language were the most informative? (4) What methods were used to classify between groups? (5) What classification performance was achieved? RESULTS AND DISCUSSION We identified 33 eligible studies and 5 main findings: participants' demographic variables (especially age ) were often unbalanced between AD and control group; spontaneous speech data were collected most often; informative language features were related to word retrieval and semantic, syntactic, and acoustic impairment; neural nets, support vector machines, and decision trees performed well in AD detection, and support vector machines and decision trees performed well in decline detection; and average classification accuracy was 89% in AD and 82% in mild cognitive impairment detection versus healthy control groups. CONCLUSION The systematic literature review supported the argument that language and speech could successfully be used to detect dementia automatically. Future studies should aim for larger and more balanced datasets, combine data collection methods and the type of information analyzed, focus on the early stages of the disease, and report performance using standardized metrics.
Collapse
Affiliation(s)
- Ulla Petti
- Department of Theoretical and Applied Linguistics, University of Cambridge, Language Technology Lab, Cambridge, UK
| | - Simon Baker
- Department of Theoretical and Applied Linguistics, University of Cambridge, Language Technology Lab, Cambridge, UK
| | - Anna Korhonen
- Department of Theoretical and Applied Linguistics, University of Cambridge, Language Technology Lab, Cambridge, UK
| |
Collapse
|
44
|
Radjenovic S, Voracek M, Adler G. [Validity Study of the Cookie Theft Picture Test - Early Detection of Dementia Based on Linguistic Abnormalities]. PSYCHIATRISCHE PRAXIS 2020; 48:149-155. [PMID: 32869219 DOI: 10.1055/a-1207-1255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
OBJECTIVE Previous studies have provided inconsistent results regarding discriminatory language variables between subjects with dementia and healthy controls. In this study, using the Cookie Theft Picture Test (CTP), selected language variables are tested for predicting actual diagnoses. METHODS 24 healthy subjects and 24 subjects with mild dementia were included in the present study. RESULTS All language variables except repetitions, word finding difficulties and paraphasias showed significant differences between the groups. The variables pause length and clues increase significantly the likelihood of AD, while the variable sentence length decreases it. CONCLUSION Due to the small sample size and insufficient standardization, the study can only be interpreted to a limited extent. Nevertheless, the results indicate that the CTP appears to be suitable for practical use.
Collapse
Affiliation(s)
- Sonja Radjenovic
- Institut für Psychologische Grundlagenforschung und Forschungsmethoden, Fakultät für Psychologie, Universität Wien, Österreich
| | - Martin Voracek
- Institut für Psychologische Grundlagenforschung und Forschungsmethoden, Fakultät für Psychologie, Universität Wien, Österreich
| | - Georg Adler
- Institut für Studien zur Psychischen Gesundheit (ISPG), Mannheim, Deutschland
| |
Collapse
|
45
|
Clarke N, Foltz P, Garrard P. How to do things with (thousands of) words: Computational approaches to discourse analysis in Alzheimer's disease. Cortex 2020; 129:446-463. [PMID: 32622173 DOI: 10.1016/j.cortex.2020.05.001] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 01/30/2020] [Accepted: 05/07/2020] [Indexed: 12/28/2022]
Abstract
Natural Language Processing (NLP) is an ever-growing field of computational science that aims to model natural human language. Combined with advances in machine learning, which learns patterns in data, it offers practical capabilities including automated language analysis. These approaches have garnered interest from clinical researchers seeking to understand the breakdown of language due to pathological changes in the brain, offering fast, replicable and objective methods. The study of Alzheimer's disease (AD), and preclinical Mild Cognitive Impairment (MCI), suggests that changes in discourse (connected speech or writing) may be key to early detection of disease. There is currently no disease-modifying treatment for AD, the leading cause of dementia in people over the age of 65, but detection of those at risk of developing the disease could help with the identification and testing of medications which can take effect before the underlying pathology has irreversibly spread. We outline important components of natural language, as well as NLP tools and approaches with which they can be extracted, analysed and used for disease identification and risk prediction. We review literature using these tools to model discourse across the spectrum of AD, including the contribution of machine learning approaches and Automatic Speech Recognition (ASR). We conclude that NLP and machine learning techniques are starting to greatly enhance research in the field, with measurable and quantifiable language components showing promise for early detection of disease, but there remain research and practical challenges for clinical implementation of these approaches. Challenges discussed include the availability of large and diverse datasets, ethics of data collection and sharing, diagnostic specificity and clinical acceptability.
Collapse
Affiliation(s)
- Natasha Clarke
- Neurosciences Research Centre, Molecular & Clinical Sciences Research Institute, St George's, University of London, Cranmer Terrace, London, UK.
| | - Peter Foltz
- Institute of Cognitive Science, University of Colorado, Boulder, USA.
| | - Peter Garrard
- Neurosciences Research Centre, Molecular & Clinical Sciences Research Institute, St George's, University of London, Cranmer Terrace, London, UK.
| |
Collapse
|
46
|
Smart Environments and Social Robots for Age-Friendly Integrated Care Services. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17113801. [PMID: 32471108 PMCID: PMC7312538 DOI: 10.3390/ijerph17113801] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 05/25/2020] [Accepted: 05/26/2020] [Indexed: 12/13/2022]
Abstract
The world is facing major societal challenges because of an aging population that is putting increasing pressure on the sustainability of care. While demand for care and social services is steadily increasing, the supply is constrained by the decreasing workforce. The development of smart, physical, social and age-friendly environments is identified by World Health Organization (WHO) as a key intervention point for enabling older adults, enabling them to remain as much possible in their residences, delay institutionalization, and ultimately, improve quality of life. In this study, we survey smart environments, machine learning and robot assistive technologies that can offer support for the independent living of older adults and provide age-friendly care services. We describe two examples of integrated care services that are using assistive technologies in innovative ways to assess and deliver of timely interventions for polypharmacy management and for social and cognitive activity support in older adults. We describe the architectural views of these services, focusing on details about technology usage, end-user interaction flows and data models that are developed or enhanced to achieve the envisioned objective of healthier, safer, more independent and socially connected older people.
Collapse
|
47
|
Meilán JJG, Martínez-Sánchez F, Martínez-Nicolás I, Llorente TE, Carro J. Changes in the Rhythm of Speech Difference between People with Nondegenerative Mild Cognitive Impairment and with Preclinical Dementia. Behav Neurol 2020; 2020:4683573. [PMID: 32351632 PMCID: PMC7178534 DOI: 10.1155/2020/4683573] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 03/25/2020] [Accepted: 03/26/2020] [Indexed: 11/17/2022] Open
Abstract
This study explores several speech parameters related to mild cognitive impairment, as well as those that might be flagging the presence of an underlying neurodegenerative process. Speech is an excellent biomarker because it is not invasive and, what is more, its analysis is rapid and economical. Our aim has been to ascertain whether the typical speech patterns of people with Alzheimer's disease are also present during the disorder's preclinical stages. To do so, we shall be using a task that involves reading out aloud. This is followed by an analysis of the recordings, looking for the possible parameters differentiating between those older people with MCI and a high probability of developing dementia and those with MCI that will not do so. We found that the disease's most differentiating parameters prior to its onset involve changes in speech duration and an alteration in rhythm rate and intensity. These parameters seem to be related to the first difficulties in lexical access among older people with AD.
Collapse
Affiliation(s)
- Juan J. G. Meilán
- Faculty of Psychology, University of Salamanca, Salamanca, Spain
- Institute of Neurosciences of Castile and Leon, Salamanca., Spain
| | | | - Israel Martínez-Nicolás
- Faculty of Psychology, University of Salamanca, Salamanca, Spain
- Institute of Neurosciences of Castile and Leon, Salamanca., Spain
| | - Thide E. Llorente
- Faculty of Psychology, University of Salamanca, Salamanca, Spain
- Institute of Neurosciences of Castile and Leon, Salamanca., Spain
| | - Juan Carro
- Faculty of Psychology, University of Salamanca, Salamanca, Spain
- Institute of Neurosciences of Castile and Leon, Salamanca., Spain
| |
Collapse
|
48
|
An Interactional Profile to Assist the Differential Diagnosis of Neurodegenerative and Functional Memory Disorders. Alzheimer Dis Assoc Disord 2019; 32:197-206. [PMID: 29319602 DOI: 10.1097/wad.0000000000000231] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVE Specialist services for dementia are seeing an increasing number of patients. We investigated whether interactional and linguistic features in the communication behavior of patients with memory problems could help distinguish between those with problems secondary to neurological disorders (ND) and those with functional memory disorder (FMD). METHODS In part 1 of this study, a diagnostic scoring aid (DSA) was developed encouraging linguists to provide quantitative ratings for 14 interactional features. An optimal cut-off differentiating ND and FMD was established by applying the DSA to 30 initial patient-doctor memory clinic encounters. In part 2, the DSA was tested prospectively in 10 additional cases analyzed independently by 2 conversation analysts blinded to medical information. RESULTS In part 1, the median score of the DSA was +5 in ND and -5 in FMD (P<0.001). The optimal numeric DSA cut-off (+1) identified patients with ND with a sensitivity of 86.7% and a specificity of 100%. In part 2, DSA scores of rater 1 correctly predicted 10/10 and those of rater 2 predicted 9/10 diagnoses. CONCLUSIONS This study indicates that interactional and linguistic features can help distinguish between patients developing dementia and those with FMD and could aid the stratification of patients with memory problems.
Collapse
|
49
|
Martínez-Sánchez F, Meilán JJG, Carro J, Ivanova O. A Prototype for the Voice Analysis Diagnosis of Alzheimer's Disease. J Alzheimers Dis 2019; 64:473-481. [PMID: 29914025 DOI: 10.3233/jad-180037] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Speech variations enable us to map the performance of cognitive processes of syntactic, semantic, phonological, and articulatory planning and execution. Speaking is one of the first functions to be affected by neurodegenerative complaints such as Alzheimer's disease (AD), which makes the speech a highly promising biomarker for detecting the illness before the first preclinical symptoms appear. OBJECTIVE This paper has sought to develop and validate a technological prototype that adopts an automated approach to speech analysis among older people. METHODS It uses a mathematical algorithm based on certain discriminatory variables to estimate the probability of developing AD. RESULTS AND CONCLUSION This device may be used at a preclinical stage by non-expert health professionals to determine the likelihood of the onset of AD.
Collapse
Affiliation(s)
| | | | - Juan Carro
- Psychology Faculty, University of Salamanca, Salamanca, Spain
| | - Olga Ivanova
- Philology Faculty, University of Extremadura, Cáceres, Spain
| |
Collapse
|
50
|
Al-Hameed S, Benaissa M, Christensen H, Mirheidari B, Blackburn D, Reuber M. A new diagnostic approach for the identification of patients with neurodegenerative cognitive complaints. PLoS One 2019; 14:e0217388. [PMID: 31125389 PMCID: PMC6534304 DOI: 10.1371/journal.pone.0217388] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Accepted: 05/11/2019] [Indexed: 11/18/2022] Open
Abstract
Neurodegenerative diseases causing dementia are known to affect a person's speech and language. Part of the expert assessment in memory clinics therefore routinely focuses on detecting such features. The current outpatient procedures examining patients' verbal and interactional abilities mainly focus on verbal recall, word fluency, and comprehension. By capturing neurodegeneration-associated characteristics in a person's voice, the incorporation of novel methods based on the automatic analysis of speech signals may give us more information about a person's ability to interact which could contribute to the diagnostic process. In this proof-of-principle study, we demonstrate that purely acoustic features, extracted from recordings of patients' answers to a neurologist's questions in a specialist memory clinic can support the initial distinction between patients presenting with cognitive concerns attributable to progressive neurodegenerative disorders (ND) or Functional Memory Disorder (FMD, i.e., subjective memory concerns unassociated with objective cognitive deficits or a risk of progression). The study involved 15 FMD and 15 ND patients where a total of 51 acoustic features were extracted from the recordings. Feature selection was used to identify the most discriminating features which were then used to train five different machine learning classifiers to differentiate between the FMD/ND classes, achieving a mean classification accuracy of 96.2%. The discriminative power of purely acoustic approaches could be integrated into diagnostic pathways for patients presenting with memory concerns and are computationally less demanding than methods focusing on linguistic elements of speech and language that require automatic speech recognition and understanding.
Collapse
Affiliation(s)
- Sabah Al-Hameed
- Dept of Electronic and Electrical Engineering, University of Sheffield, Sheffield, United Kingdom
| | - Mohammed Benaissa
- Dept of Electronic and Electrical Engineering, University of Sheffield, Sheffield, United Kingdom
| | - Heidi Christensen
- Dept of Computer Science, University of Sheffield, Sheffield, United Kingdom
- Centre for Assistive Technology and Connected Healthcare, University of Sheffield, Sheffield, United Kingdom
| | - Bahman Mirheidari
- Dept of Computer Science, University of Sheffield, Sheffield, United Kingdom
| | - Daniel Blackburn
- Sheffield Institute for Translational Neuroscience (SITraN), University of Sheffield, Sheffield, United Kingdom
| | - Markus Reuber
- Academic Neurology Unit, University of Sheffield, Royal Hallamshire Hospital, Sheffield, United Kingdom
| |
Collapse
|