1
|
Igarashi T, Iijima K, Nitta K, Chen Y. Detailed Analysis of Responses from Older Adults through Natural Speech: Comparison of Questions by AI Agents and Humans. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2024; 21:1170. [PMID: 39338053 PMCID: PMC11431598 DOI: 10.3390/ijerph21091170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2024] [Revised: 08/08/2024] [Accepted: 08/13/2024] [Indexed: 09/30/2024]
Abstract
In recent years, an increasing number of studies have begun to use conversational data in spontaneous speech to estimate cognitive function in older people. The providers of spontaneous speech with older people used to be physicians and licensed psychologists, but it is now possible to have conversations with fully automatic AI agents. However, it has not yet been clarified what differences exist in conversational communication with older people when the examiner is either a human or an AI agent. In this study, elderly people living in the community and attending a silver human resource center and a day service center were the subjects. Dialogues were conducted using generic interview items for estimating cognitive function through daily conversation, which were developed through research on estimation methods for cognitive function. From the data obtained from the dialogues, we compared the effects of human-AI interaction on the number of utterances, speaking time, and silence time. This study was conducted at a facility in Japan and included 32 subjects (12 males and 20 females). The results showed significant differences between human and AI dialogue in the number of utterances and silent time. This study suggests the effectiveness of AI in communication with older people and explores the possibility of using AI in social welfare.
Collapse
Affiliation(s)
- Toshiharu Igarashi
- Simulation of Complex Systems Lab, Department of Human and Engineered Environmental Studies, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo 277-8563, Japan
- AI-UX Design Research Institution, Advanced Institute of Industrial Technology, 10-40 Higashi-Oi 1-Chome, Shinagawa, Tokyo 140-0011, Japan
| | - Katsuya Iijima
- Institute of Gerontology (IOG), The University of Tokyo, Tokyo 113-8656, Japan
- Institute for Future Initiatives (IFI), The University of Tokyo, Tokyo 113-0033, Japan
| | - Kunio Nitta
- Tsukushikai Medical Corporation, Tokyo 186-0005, Japan
| | - Yu Chen
- Simulation of Complex Systems Lab, Department of Human and Engineered Environmental Studies, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo 277-8563, Japan
| |
Collapse
|
2
|
Willemse IHJ, Schootemeijer S, van den Bergh R, Dawes H, Nonnekes JH, van de Warrenburg BPC. Smartphone applications for Movement Disorders: Towards collaboration and re-use. Parkinsonism Relat Disord 2024; 120:105988. [PMID: 38184466 DOI: 10.1016/j.parkreldis.2023.105988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 12/20/2023] [Accepted: 12/31/2023] [Indexed: 01/08/2024]
Abstract
BACKGROUND Numerous smartphone and tablet applications (apps) are available to monitor movement disorders, but an overview of their purpose and stage of development is missing. OBJECTIVES To systematically review published literature and classify smartphone and tablet apps with objective measurement capabilities for the diagnosis, monitoring, assessment, or treatment of movement disorders. METHODS We systematically searched for publications covering smartphone or tablet apps to monitor movement disorders until November 22nd, 2023. We reviewed the target population, measured domains, purpose, and technology readiness level (TRL) of the proposed app and checked their availability in common app stores. RESULTS We identified 113 apps. Most apps were developed for Parkinson's disease specifically (n = 82; 73%) or for movement disorders in general (n = 17; 15%). Apps were either designed to momentarily assess symptoms (n = 65; 58%), support treatment (n = 22; 19%), aid in diagnosis (n = 16; 14%), or passively track symptoms (n = 11; 10%). Commonly assessed domains across movement disorders included fine motor skills (n = 34; 30%), gait (n = 36; 32%), and tremor (n = 32; 28%) for the motor domain and cognition (n = 16; 14%) for the non-motor domain. Twenty-six (23%) apps were proof-of-concepts (TRL 1-3), while most apps were tested in a controlled setting (TRL 4-6; n = 63; 56%). Twenty-four apps were tested in their target setting (TRL 7-9) of which 10 were accessible in common app stores or as Android Package. CONCLUSIONS The development of apps strongly gravitates towards Parkinson's disease and a selection of motor symptoms. Collaboration, re-use and further development of existing apps is encouraged to avoid reinventions of the wheel.
Collapse
Affiliation(s)
- Ilse H J Willemse
- Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Neurology, Center of Expertise for Parkinson & Movement Disorders, Nijmegen, the Netherlands.
| | - Sabine Schootemeijer
- Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Neurology, Center of Expertise for Parkinson & Movement Disorders, Nijmegen, the Netherlands
| | - Robin van den Bergh
- Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Neurology, Center of Expertise for Parkinson & Movement Disorders, Nijmegen, the Netherlands
| | - Helen Dawes
- NIHR Exeter BRC, Medical School, Faculty of Health and Life Sciences, University of Exeter, UK
| | - Jorik H Nonnekes
- Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Rehabilitation, Center of Expertise for Parkinson & Movement Disorders, Nijmegen, the Netherlands; Department of Rehabilitation, Sint Maartenskliniek, Nijmegen, the Netherlands
| | - Bart P C van de Warrenburg
- Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Neurology, Center of Expertise for Parkinson & Movement Disorders, Nijmegen, the Netherlands
| |
Collapse
|
3
|
Gagliardi G. Natural language processing techniques for studying language in pathological ageing: A scoping review. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2024; 59:110-122. [PMID: 36960885 DOI: 10.1111/1460-6984.12870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Accepted: 02/27/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND In the past few years there has been a growing interest in the employment of verbal productions as digital biomarkers, namely objective, quantifiable behavioural data that can be collected and measured by means of digital devices, allowing for a low-cost pathology detection, classification and monitoring. Numerous research papers have been published on the automatic detection of subtle verbal alteration, starting from written texts, raw speech recordings and transcripts, and such linguistic analysis has been singled out as a cost-effective method for diagnosing dementia and other medical conditions common among elderly patients (e.g., cognitive dysfunctions associated with metabolic disorders, dysarthria). AIMS To provide a critical appraisal and synthesis of evidence concerning the application of natural language processing (NLP) techniques for clinical purposes in the geriatric population. In particular, we discuss the state of the art on studying language in healthy and pathological ageing, focusing on the latest research efforts to build non-intrusive language-based tools for the early identification of cognitive frailty due to dementia. We also discuss some challenges and open problems raised by this approach. METHODS & PROCEDURES We performed a scoping review to examine emerging evidence about this novel domain. Potentially relevant studies published up to November 2021 were identified from the databases of MEDLINE, Cochrane and Web of Science. We also browsed the proceedings of leading international conferences (e.g., ACL, COLING, Interspeech, LREC) from 2017 to 2021, and checked the reference lists of relevant studies and reviews. MAIN CONTRIBUTION The paper provides an introductory, but complete, overview of the application of NLP techniques for studying language disruption due to dementia. We also suggest that this technique can be fruitfully applied to other medical conditions (e.g., cognitive dysfunctions associated with dysarthria, cerebrovascular disease and mood disorders). CONCLUSIONS & IMPLICATIONS Despite several critical points need to be addressed by the scientific community, a growing body of empirical evidence shows that NLP techniques can represent a promising tool for studying language changes in pathological aging, with a high potential to lead a significant shift in clinical practice. WHAT THIS PAPER ADDS What is already known on this subject Speech and languages abilities change due to non-pathological neurocognitive ageing and neurodegenerative processes. These subtle verbal modifications can be measured through NLP techniques and used as biomarkers for screening/diagnostic purposes in the geriatric population (i.e., digital linguistic biomarkers-DLBs). What this paper adds to existing knowledge The review shows that DLBs can represent a promising clinical tool, with a high potential to spark a major shift to dementia assessment in the elderly. Some challenges and open problems are also discussed. What are the potential or actual clinical implications of this work? This methodological review represents a starting point for clinicians approaching the DLB research field for studying language in healthy and pathological ageing. It summarizes the state of the art and future research directions of this novel approach.
Collapse
Affiliation(s)
- Gloria Gagliardi
- Department of Classical Philology and Italian Studies, University of Bologna, Bologna, Italy
| |
Collapse
|
4
|
Parsapoor M. AI-based assessments of speech and language impairments in dementia. Alzheimers Dement 2023; 19:4675-4687. [PMID: 37578167 DOI: 10.1002/alz.13395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 06/03/2023] [Accepted: 06/05/2023] [Indexed: 08/15/2023]
Abstract
Recent advancements in the artificial intelligence (AI) domain have revolutionized the early detection of cognitive impairments associated with dementia. This has motivated clinicians to use AI-powered dementia detection systems, particularly systems developed based on individuals' and patients' speech and language, for a quick and accurate identification of patients with dementia. This paper reviews articles about developing assessment tools using machine learning and deep learning algorithms trained by vocal and textual datasets.
Collapse
Affiliation(s)
- Mahboobeh Parsapoor
- Centre de Recherche Informatique de Montréal: CRIM, Montreal, Quebec, Canada
| |
Collapse
|
5
|
Martínez-Nicolás I, Martínez-Sánchez F, Ivanova O, Meilán JJG. Reading and lexical-semantic retrieval tasks outperforms single task speech analysis in the screening of mild cognitive impairment and Alzheimer's disease. Sci Rep 2023; 13:9728. [PMID: 37322073 PMCID: PMC10272227 DOI: 10.1038/s41598-023-36804-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/12/2023] [Indexed: 06/17/2023] Open
Abstract
Age-related cognitive impairment have increased dramatically in recent years, which has risen the interes in developing screening tools for mild cognitive impairment and Alzheimer's disease. Speech analysis allows to exploit the behavioral consequences of cognitive deficits on the patient's vocal performance so that it is possible to identify pathologies affecting speech production such as dementia. Previous studies have further shown that the speech task used determines how the speech parameters are altered. We aim to combine the impairments in several speech production tasks in order to improve the accuracy of screening through speech analysis. The sample consists of 72 participants divided into three equal groups of healthy older adults, people with mild cognitive impairment, or Alzheimer's disease, matched by age and education. A complete neuropsychological assessment and two voice recordings were performed. The tasks required the participants to read a text, and complete a sentence with semantic information. A stepwise linear discriminant analysis was performed to select speech parameters with discriminative power. The discriminative functions obtained an accuracy of 83.3% in simultaneous classifications of several levels of cognitive impairment. It would therefore be a promising screening tool for dementia.
Collapse
Affiliation(s)
| | | | - Olga Ivanova
- Faculty of Philology, University of Salamanca, 37008, Salamanca, Spain
| | - Juan J G Meilán
- Faculty of Psychology, University of Salamanca, 37008, Salamanca, Spain
- Institute of Neuroscience of Castilla y León, 37007, Salamanca, Spain
| |
Collapse
|
6
|
Momota Y, Liang K, Horigome T, Kitazawa M, Eguchi Y, Takamiya A, Goto A, Mimura M, Kishimoto T. Language patterns in Japanese patients with Alzheimer disease: A machine learning approach. Psychiatry Clin Neurosci 2023; 77:273-281. [PMID: 36579663 PMCID: PMC11488616 DOI: 10.1111/pcn.13526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 11/09/2022] [Accepted: 12/22/2022] [Indexed: 12/30/2022]
Abstract
AIM The authors applied natural language processing and machine learning to explore the disease-related language patterns that warrant objective measures for assessing language ability in Japanese patients with Alzheimer disease (AD), while most previous studies have used large publicly available data sets in Euro-American languages. METHODS The authors obtained 276 speech samples from 42 patients with AD and 52 healthy controls, aged 50 years or older. A natural language processing library for Python was used, spaCy, with an add-on library, GiNZA, which is a Japanese parser based on Universal Dependencies designed to facilitate multilingual parser development. The authors used eXtreme Gradient Boosting for our classification algorithm. Each unit of part-of-speech and dependency was tagged and counted to create features such as tag-frequency and tag-to-tag transition-frequency. Each feature's importance was computed during the 100-fold repeated random subsampling validation and averaged. RESULTS The model resulted in an accuracy of 0.84 (SD = 0.06), and an area under the curve of 0.90 (SD = 0.03). Among the features that were important for such predictions, seven of the top 10 features were related to part-of-speech, while the remaining three were related to dependency. A box plot analysis demonstrated that the appearance rates of content words-related features were lower among the patients, whereas those with stagnation-related features were higher. CONCLUSION The current study demonstrated a promising level of accuracy for predicting AD and found the language patterns corresponding to the type of lexical-semantic decline known as 'empty speech', which is regarded as a characteristic of AD.
Collapse
Affiliation(s)
- Yuki Momota
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
| | - Kuo‐ching Liang
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
| | - Toshiro Horigome
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
| | - Momoko Kitazawa
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
| | - Yoko Eguchi
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
- Benesse Institute for Research on Continuing Care, Benesse Style Care Co., Ltd.TokyoJapan
| | - Akihiro Takamiya
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
- Neuropsychiatry, Department of NeurosciencesLeuven Brain InstituteKU LeuvenBelgium
| | | | - Masaru Mimura
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
| | - Taishiro Kishimoto
- Department of NeuropsychiatryKeio University School of MedicineTokyoJapan
- Psychiatry DepartmentDonald and Barbara Zucker School of MedicineNew YorkNew YorkUSA
| |
Collapse
|
7
|
Yamada Y, Shinkawa K, Nemoto M, Nemoto K, Arai T. A mobile application using automatic speech analysis for classifying Alzheimer's disease and mild cognitive impairment. COMPUT SPEECH LANG 2023. [DOI: 10.1016/j.csl.2023.101514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
|
8
|
Petti U, Baker S, Korhonen A, Robin J. The Generalizability of Longitudinal Changes in Speech Before Alzheimer's Disease Diagnosis. J Alzheimers Dis 2023; 92:547-564. [PMID: 36776053 DOI: 10.3233/jad-220847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
BACKGROUND Language impairment in Alzheimer's disease (AD) has been widely studied but due to limited data availability, relatively few studies have focused on the longitudinal change in language in the individuals who later develop AD. Significant differences in speech have previously been found by comparing the press conference transcripts of President Bush and President Reagan, who was later diagnosed with AD. OBJECTIVE In the current study, we explored whether the patterns previously established in the single AD-healthy control (HC) participant pair apply to a larger group of individuals who later receive AD diagnosis. METHODS We replicated previous methods on two larger corpora of longitudinal spontaneous speech samples of public figures, consisting of 10 and 9 AD-HC participant pairs. As we failed to find generalizable patterns of language change using previous methodology, we proposed alternative methods for data analysis, investigating the benefits of using different language features and their change with age, and compiling the single features into aggregate scores. RESULTS The single features that showed the strongest results were moving average type:token ratio (MATTR) and pronoun-related features. The aggregate scores performed better than the single features, with lexical diversity capturing a similar change in two-thirds of the participants. CONCLUSION Capturing universal patterns of language change prior to AD can be challenging, but the decline in lexical diversity and changes in MATTR and pronoun-related features act as promising measures that reflect the cognitive changes in many participants.
Collapse
Affiliation(s)
- Ulla Petti
- University of Cambridge, Language Technology Lab, Cambridge, UK
| | - Simon Baker
- University of Cambridge, Language Technology Lab, Cambridge, UK
| | - Anna Korhonen
- University of Cambridge, Language Technology Lab, Cambridge, UK
| | | |
Collapse
|
9
|
Mavragani A, Kimura D, Kosugi A, Shinkawa K, Takase T, Kobayashi M, Yamada Y, Nemoto M, Watanabe R, Ota M, Higashi S, Nemoto K, Arai T, Nishimura M. Screening of Mild Cognitive Impairment Through Conversations With Humanoid Robots: Exploratory Pilot Study. JMIR Form Res 2023; 7:e42792. [PMID: 36637896 PMCID: PMC9883738 DOI: 10.2196/42792] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 11/23/2022] [Accepted: 12/01/2022] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND The rising number of patients with dementia has become a serious social problem worldwide. To help detect dementia at an early stage, many studies have been conducted to detect signs of cognitive decline by prosodic and acoustic features. However, many of these methods are not suitable for everyday use as they focus on cognitive function or conversational speech during the examinations. In contrast, conversational humanoid robots are expected to be used in the care of older people to help reduce the work of care and monitoring through interaction. OBJECTIVE This study focuses on early detection of mild cognitive impairment (MCI) through conversations between patients and humanoid robots without a specific examination, such as neuropsychological examination. METHODS This was an exploratory study involving patients with MCI and cognitively normal (CN) older people. We collected the conversation data during neuropsychological examination (Mini-Mental State Examination [MMSE]) and everyday conversation between a humanoid robot and 94 participants (n=47, 50%, patients with MCI and n=47, 50%, CN older people). We extracted 17 types of prosodic and acoustic features, such as the duration of response time and jitter, from these conversations. We conducted a statistical significance test for each feature to clarify the speech features that are useful when classifying people into CN people and patients with MCI. Furthermore, we conducted an automatic classification experiment using a support vector machine (SVM) to verify whether it is possible to automatically classify these 2 groups by the features identified in the statistical significance test. RESULTS We obtained significant differences in 5 (29%) of 17 types of features obtained from the MMSE conversational speech. The duration of response time, the duration of silent periods, and the proportion of silent periods showed a significant difference (P<.001) and met the reference value r=0.1 (small) of the effect size. Additionally, filler periods (P<.01) and the proportion of fillers (P=.02) showed a significant difference; however, these did not meet the reference value of the effect size. In contrast, we obtained significant differences in 16 (94%) of 17 types of features obtained from the everyday conversations with the humanoid robot. The duration of response time, the duration of speech periods, jitter (local, relative average perturbation [rap], 5-point period perturbation quotient [ppq5], difference of difference of periods [ddp]), shimmer (local, amplitude perturbation quotient [apq]3, apq5, apq11, average absolute differences between the amplitudes of consecutive periods [dda]), and F0cov (coefficient of variation of the fundamental frequency) showed a significant difference (P<.001). In addition, the duration of response time, the duration of silent periods, the filler period, and the proportion of fillers showed significant differences (P<.05). However, only jitter (local) met the reference value r=0.1 (small) of the effect size. In the automatic classification experiment for the classification of participants into CN and MCI groups, the results showed 66.0% accuracy in the MMSE conversational speech and 68.1% accuracy in everyday conversations with the humanoid robot. CONCLUSIONS This study shows the possibility of early and simple screening for patients with MCI using prosodic and acoustic features from everyday conversations with a humanoid robot with the same level of accuracy as the MMSE.
Collapse
Affiliation(s)
| | | | | | | | - Toshiro Takase
- Healthcare and Life Science, IBM Consulting, IBM Japan, Ltd, Tokyo, Japan
| | | | | | - Miyuki Nemoto
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Ryohei Watanabe
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Miho Ota
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Shinji Higashi
- Department of Psychiatry, Ibaraki Medical Center, Tokyo Medical University, Ibaraki, Japan
| | - Kiyotaka Nemoto
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Tetsuaki Arai
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Masafumi Nishimura
- Department of Informatics, Graduate School of Intergraded Science and Technology, Shizuoka University, Hamamatsu, Japan
| |
Collapse
|
10
|
Yamada Y, Shinkawa K, Nemoto M, Ota M, Nemoto K, Arai T. Speech and language characteristics differentiate Alzheimer's disease and dementia with Lewy bodies. ALZHEIMER'S & DEMENTIA (AMSTERDAM, NETHERLANDS) 2022; 14:e12364. [PMID: 36320609 PMCID: PMC9614050 DOI: 10.1002/dad2.12364] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 09/11/2022] [Indexed: 11/04/2022]
Abstract
Introduction Early differential diagnosis of Alzheimer's disease (AD) and dementia with Lewy bodies (DLB) is important, but it remains challenging. Different profiles of speech and language impairments between AD and DLB have been suggested, but direct comparisons have not been investigated. Methods We collected speech responses from 121 older adults comprising AD, DLB, and cognitively normal (CN) groups and investigated their acoustic, prosodic, and linguistic features. Results The AD group showed larger differences from the CN group than the DLB group in linguistic features, while the DLB group showed larger differences in prosodic and acoustic features. Machine-learning classifiers using these speech features achieved 87.0% accuracy for AD versus CN, 93.2% for DLB versus CN, and 87.4% for AD versus DLB. Discussion Our findings indicate the discriminative differences in speech features in AD and DLB and the feasibility of using these features in combination as a screening tool for identifying/differentiating AD and DLB.
Collapse
Affiliation(s)
| | | | - Miyuki Nemoto
- Department of PsychiatryDivision of Clinical MedicineFaculty of MedicineUniversity of TsukubaTsukubaIbarakiJapan
| | - Miho Ota
- Department of PsychiatryDivision of Clinical MedicineFaculty of MedicineUniversity of TsukubaTsukubaIbarakiJapan
| | - Kiyotaka Nemoto
- Department of PsychiatryDivision of Clinical MedicineFaculty of MedicineUniversity of TsukubaTsukubaIbarakiJapan
| | - Tetsuaki Arai
- Department of PsychiatryDivision of Clinical MedicineFaculty of MedicineUniversity of TsukubaTsukubaIbarakiJapan
| |
Collapse
|
11
|
Yamada Y, Shinkawa K, Kobayashi M, Badal VD, Glorioso D, Lee EE, Daly R, Nebeker C, Twamley EW, Depp C, Nemoto M, Nemoto K, Kim HC, Arai T, Jeste DV. Automated Analysis of Drawing Process to Estimate Global Cognition in Older Adults: Preliminary International Validation on the US and Japan Data Sets. JMIR Form Res 2022; 6:e37014. [PMID: 35511253 PMCID: PMC9121219 DOI: 10.2196/37014] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 03/25/2022] [Accepted: 04/05/2022] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND With the aging of populations worldwide, early detection of cognitive impairments has become a research and clinical priority, particularly to enable preventive intervention for dementia. Automated analysis of the drawing process has been studied as a promising means for lightweight, self-administered cognitive assessment. However, this approach has not been sufficiently tested for its applicability across populations. OBJECTIVE The aim of this study was to evaluate the applicability of automated analysis of the drawing process for estimating global cognition in community-dwelling older adults across populations in different nations. METHODS We collected drawing data with a digital tablet, along with Montreal Cognitive Assessment (MoCA) scores for assessment of global cognition, from 92 community-dwelling older adults in the United States and Japan. We automatically extracted 6 drawing features that characterize the drawing process in terms of the drawing speed, pauses between drawings, pen pressure, and pen inclinations. We then investigated the association between the drawing features and MoCA scores through correlation and machine learning-based regression analyses. RESULTS We found that, with low MoCA scores, there tended to be higher variability in the drawing speed, a higher pause:drawing duration ratio, and lower variability in the pen's horizontal inclination in both the US and Japan data sets. A machine learning model that used drawing features to estimate MoCA scores demonstrated its capability to generalize from the US dataset to the Japan dataset (R2=0.35; permutation test, P<.001). CONCLUSIONS This study presents initial empirical evidence of the capability of automated analysis of the drawing process as an estimator of global cognition that is applicable across populations. Our results suggest that such automated analysis may enable the development of a practical tool for international use in self-administered, automated cognitive assessment.
Collapse
Affiliation(s)
| | | | | | - Varsha D Badal
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
| | - Danielle Glorioso
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
| | - Ellen E Lee
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
- VA San Diego Healthcare System, San Diego, CA, United States
| | - Rebecca Daly
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
| | - Camille Nebeker
- Department of Family Medicine and Public Health, University of California San Diego, La Jolla, CA, United States
| | - Elizabeth W Twamley
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
- VA San Diego Healthcare System, San Diego, CA, United States
| | - Colin Depp
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
| | - Miyuki Nemoto
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Kiyotaka Nemoto
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Ho-Cheol Kim
- AI and Cognitive Software, IBM Almaden Research Center, San Jose, CA, United States
| | - Tetsuaki Arai
- Department of Psychiatry, Division of Clinical Medicine, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Dilip V Jeste
- Department of Psychiatry, University of California San Diego, La Jolla, CA, United States
- Sam and Rose Stein Institute for Research on Aging, University of California San Diego, La Jolla, CA, United States
- Department of Neurosciences, University of California San Diego, La Jolla, CA, United States
| |
Collapse
|
12
|
Kálmán J, Devanand DP, Gosztolya G, Balogh R, Imre N, Tóth L, Hoffmann I, Kovács I, Vincze V, Pákáski M. Temporal speech parameters detect mild cognitive impairment in different languages: validation and comparison of the Speech-GAP Test® in English and Hungarian. Curr Alzheimer Res 2022; 19:373-386. [PMID: 35440309 DOI: 10.2174/1567205019666220418155130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 02/08/2022] [Accepted: 02/17/2022] [Indexed: 11/22/2022]
Abstract
BACKGROUND The development of automatic speech recognition (ASR) technology allows the analysis of temporal (time-based) speech parameters characteristic of mild cognitive impairment (MCI). However, no information has been available on whether the analysis of spontaneous speech can be used with the same efficiency in different language environments. OBJECTIVE The main goal of this international pilot study is to address the question whether the Speech-Gap Test® (S-GAP Test®), previously tested in the Hungarian language, is appropriate for and applicable to the recognition of MCI in other languages such as English. METHOD After an initial screening of 88 individuals, English-speaking (n = 33) and Hungarian-speaking (n = 33) participants were classified as having MCI or as healthy controls (HC) based on Petersen's criteria. Speech of each participant was recorded via a spontaneous speech task. 15 temporal parameters were determined and calculated by means of ASR. RESULTS Seven temporal parameters in the English-speaking sample and 5 in the Hungarian-speaking sample showed significant differences between the MCI and the HC group. Receiver operating characteristics (ROC) analysis clearly distinguished the English-speaking MCI cases from the HC group based on speech tempo and articulation tempo with 100% sensitivity, and on three more temporal parameters with high sensitivity (85.7%). In the Hungarian-speaking sample, the ROC analysis showed similar sensitivity rates (92.3%). CONCLUSION The results of this study in different native-speaking populations suggest that changes in acoustic parameters detected by the S-GAP Test® might be present across different languages.
Collapse
Affiliation(s)
- János Kálmán
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Davangere P Devanand
- Columbia University Medical Center, New York, NY.,New York State Psychiatric Institute, New York, NY
| | - Gábor Gosztolya
- MTA-SZTE Research Group on Artificial Intelligence, Faculty of Science and Informatics, University of Szeged, Szeged
| | - Réka Balogh
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Nóra Imre
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - László Tóth
- Faculty of Science and Informatics, University of Szeged, Szeged
| | - Ildikó Hoffmann
- Faculty of Humanities and Social Sciences, University of Szeged, Szeged.,Hungarian Research Centre for Linguistics, Eötvös Loránd Research Network, Budapest
| | - Ildikó Kovács
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| | - Veronika Vincze
- MTA-SZTE Research Group on Artificial Intelligence, Faculty of Science and Informatics, University of Szeged, Szeged
| | - Magdolna Pákáski
- Albert Szent-Györgyi Medical School, University of Szeged, Szeged
| |
Collapse
|
13
|
Yamada Y, Shinkawa K, Nemoto M, Arai T. Automatic Assessment of Loneliness in Older Adults Using Speech Analysis on Responses to Daily Life Questions. Front Psychiatry 2021; 12:712251. [PMID: 34966297 PMCID: PMC8710612 DOI: 10.3389/fpsyt.2021.712251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 11/19/2021] [Indexed: 11/13/2022] Open
Abstract
Loneliness is a perceived state of social and emotional isolation that has been associated with a wide range of adverse health effects in older adults. Automatically assessing loneliness by passively monitoring daily behaviors could potentially contribute to early detection and intervention for mitigating loneliness. Speech data has been successfully used for inferring changes in emotional states and mental health conditions, but its association with loneliness in older adults remains unexplored. In this study, we developed a tablet-based application and collected speech responses of 57 older adults to daily life questions regarding, for example, one's feelings and future travel plans. From audio data of these speech responses, we automatically extracted speech features characterizing acoustic, prosodic, and linguistic aspects, and investigated their associations with self-rated scores of the UCLA Loneliness Scale. Consequently, we found that with increasing loneliness scores, speech responses tended to have less inflections, longer pauses, reduced second formant frequencies, reduced variances of the speech spectrum, more filler words, and fewer positive words. The cross-validation results showed that regression and binary-classification models using speech features could estimate loneliness scores with an R 2 of 0.57 and detect individuals with high loneliness scores with 95.6% accuracy, respectively. Our study provides the first empirical results suggesting the possibility of using speech data that can be collected in everyday life for the automatic assessments of loneliness in older adults, which could help develop monitoring technologies for early detection and intervention for mitigating loneliness.
Collapse
Affiliation(s)
| | | | - Miyuki Nemoto
- Dementia Medical Center, University of Tsukuba Hospital, Tsukuba, Japan
| | - Tetsuaki Arai
- Division of Clinical Medicine, Department of Psychiatry, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
| |
Collapse
|
14
|
Yamada Y, Shinkawa K, Kobayashi M, Caggiano V, Nemoto M, Nemoto K, Arai T. Combining Multimodal Behavioral Data of Gait, Speech, and Drawing for Classification of Alzheimer's Disease and Mild Cognitive Impairment. J Alzheimers Dis 2021; 84:315-327. [PMID: 34542076 PMCID: PMC8609704 DOI: 10.3233/jad-210684] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/16/2021] [Indexed: 11/30/2022]
Abstract
BACKGROUND Gait, speech, and drawing behaviors have been shown to be sensitive to the diagnosis of Alzheimer's disease (AD) and mild cognitive impairment (MCI). However, previous studies focused on only analyzing individual behavioral modalities, although these studies suggested that each of these modalities may capture different profiles of cognitive impairments associated with AD. OBJECTIVE We aimed to investigate if combining behavioral data of gait, speech, and drawing can improve classification performance compared with the use of individual modality and if each of these behavioral data can be associated with different cognitive and clinical measures for the diagnosis of AD and MCI. METHODS Behavioral data of gait, speech, and drawing were acquired from 118 AD, MCI, and cognitively normal (CN) participants. RESULTS Combining all three behavioral modalities achieved 93.0% accuracy for classifying AD, MCI, and CN, and only 81.9% when using the best individual behavioral modality. Each of these behavioral modalities was statistically significantly associated with different cognitive and clinical measures for diagnosing AD and MCI. CONCLUSION Our findings indicate that these behaviors provide different and complementary information about cognitive impairments such that classification of AD and MCI is superior to using either in isolation.
Collapse
Affiliation(s)
| | | | | | - Vittorio Caggiano
- Healthcare and Life Sciences, IBM Research, Yorktown Heights, NY, USA
| | - Miyuki Nemoto
- Department of Psychiatry, University of Tsukuba Hospital, Tsukuba, Ibaraki, Japan
| | - Kiyotaka Nemoto
- Department of Psychiatry, Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Tetsuaki Arai
- Department of Psychiatry, Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, Japan
| |
Collapse
|