1
|
Lee J, Kim N, Ha JW, Kang K, Park E, Yoon J, Park KS. Exploring Voice Acoustic Features Associated with Cognitive Status in Korean Speakers: A Preliminary Machine Learning Study. Diagnostics (Basel) 2024; 14:2837. [PMID: 39767198 PMCID: PMC11675567 DOI: 10.3390/diagnostics14242837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Revised: 12/02/2024] [Accepted: 12/12/2024] [Indexed: 01/11/2025] Open
Abstract
Objective: To develop a non-invasive cognitive impairment detection system using speech data analysis, addressing the growing global dementia crisis and enabling accessible early screening through daily health monitoring. Methods: Speech data from 223 Korean patients were collected across eight tasks. Patients were classified based on Korean Mini-Mental State Examination scores. Four machine learning models were tested for three binary classification tasks. Voice acoustic features were extracted and analyzed. Results: The Deep Neural Network model performed best in two classification tasks, with Precision-Recall Area Under the Curve scores of 0.737 for severe vs. no impairment and 0.726 for mild vs. no impairment, while Random Forest achieved 0.715 for severe + mild vs. no impairment. Several acoustic features emerged as potentially important indicators, with DDA shimmer from the /i/ task and stdevF0 from the /puh-tuh-kuh/ task showing consistent patterns across classification tasks. Conclusions: This preliminary study suggests that certain acoustic features may be associated with cognitive status, though demographic factors significantly influence these relationships. Further research with demographically matched populations is needed to validate these findings.
Collapse
Affiliation(s)
- Jiho Lee
- Neopons Inc., Daegu 41260, Republic of Korea; (J.L.); (N.K.)
| | - Nayeon Kim
- Neopons Inc., Daegu 41260, Republic of Korea; (J.L.); (N.K.)
| | - Ji-Wan Ha
- Department of Speech-Language Pathology, Daegu University, Gyeongsan 38453, Republic of Korea;
| | - Kyunghun Kang
- Department of Neurology, School of Medicine, Kyungpook National University, Daegu 41404, Republic of Korea;
| | - Eunhee Park
- Department of Rehabilitation Medicine, School of Medicine, Kyungpook National University, Daegu 41404, Republic of Korea;
| | - Janghyeok Yoon
- Department of Industrial Engineering, Konkuk University, Seoul 05029, Republic of Korea
| | - Ki-Su Park
- Department of Neurosurgery, School of Medicine, Kyungpook National University, Daegu 41404, Republic of Korea
| |
Collapse
|
2
|
Lin YC, Yan HT, Lin CH, Chang HH. Identifying and Estimating Frailty Phenotypes by Vocal Biomarkers: Cross-Sectional Study. J Med Internet Res 2024; 26:e58466. [PMID: 39515817 PMCID: PMC11584546 DOI: 10.2196/58466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 06/04/2024] [Accepted: 10/07/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND Researchers have developed a variety of indices to assess frailty. Recent research indicates that the human voice reflects frailty status. Frailty phenotypes are seldom discussed in the literature on the aging voice. OBJECTIVE This study aims to examine potential phenotypes of frail older adults and determine their correlation with vocal biomarkers. METHODS Participants aged ≥60 years who visited the geriatric outpatient clinic of a teaching hospital in central Taiwan between 2020 and 2021 were recruited. We identified 4 frailty phenotypes: energy-based frailty, sarcopenia-based frailty, hybrid-based frailty-energy, and hybrid-based frailty-sarcopenia. Participants were asked to pronounce a sustained vowel "/a/" for approximately 1 second. The speech signals were digitized and analyzed. Four voice parameters-the average number of zero crossings (A1), variations in local peaks and valleys (A2), variations in first and second formant frequencies (A3), and spectral energy ratio (A4)-were used for analyzing changes in voice. Logistic regression was used to elucidate the prediction model. RESULTS Among 277 older adults, an increase in A1 values was associated with a lower likelihood of energy-based frailty (odds ratio [OR] 0.81, 95% CI 0.68-0.96), whereas an increase in A2 values resulted in a higher likelihood of sarcopenia-based frailty (OR 1.34, 95% CI 1.18-1.52). Respondents with larger A3 and A4 values had a higher likelihood of hybrid-based frailty-sarcopenia (OR 1.03, 95% CI 1.002-1.06) and hybrid-based frailty-energy (OR 1.43, 95% CI 1.02-2.01), respectively. CONCLUSIONS Vocal biomarkers might be potentially useful in estimating frailty phenotypes. Clinicians can use 2 crucial acoustic parameters, namely A1 and A2, to diagnose a frailty phenotype that is associated with insufficient energy or reduced muscle function. The assessment of A3 and A4 involves a complex frailty phenotype.
Collapse
Affiliation(s)
- Yu-Chun Lin
- Graduate Institute of Integrated Medicine, College of Chinese Medicine, China Medical University, Taichung, Taiwan
- Department of Chinese Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Huang-Ting Yan
- Institute of Political Science, Academia Sinica, Taipei, Taiwan
| | - Chih-Hsueh Lin
- School of Medicine, College of Medicine, China Medical University, Taichung, Taiwan
- Department of Family Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Hen-Hong Chang
- Graduate Institute of Integrated Medicine, College of Chinese Medicine, China Medical University, Taichung, Taiwan
- Department of Chinese Medicine, China Medical University Hospital, Taichung, Taiwan
- Chinese Medicine Research Centre, China Medical University, Taichung, Taiwan
| |
Collapse
|
3
|
Mahon E, Lachman ME. Voice biomarkers in middle and later adulthood as predictors of cognitive changes. Front Psychol 2024; 15:1422376. [PMID: 39492818 PMCID: PMC11527629 DOI: 10.3389/fpsyg.2024.1422376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 09/16/2024] [Indexed: 11/05/2024] Open
Abstract
Background Prosody voice measures, especially jitter and shimmer, have been associated with cognitive impairment and hold potential as early indicators of risk for cognitive decline. Prior research suggests that voice measures assessed concurrently with longitudinal cognitive outcomes are associated with 10-year cognitive declines in middle-age and older adults from Midlife in the U.S. (MIDUS) study. Results Using a subsample from the same study, we expanded previous research to examine voice measures that were (1) collected 8 years before cognitive outcomes, (2) derived from narrative speech in logical memory tests instead of word list recall tests, and (3) independent of the cognitive outcomes. Multilevel analyses controlled for covariates of age, sex, education, neurological conditions, depressive symptoms, and chronic conditions. The results indicated that higher jitter and lower shimmer predicted greater 10-year declines in episodic memory and working memory. Conclusion These findings extend previous research by highlighting prosody voice measures assessed 8 years earlier as predictors of subsequent cognitive declines over a decade.
Collapse
Affiliation(s)
- Elizabeth Mahon
- Psychology Department of Psychology, Brandeis University, Waltham, MA, United States
| | | |
Collapse
|
4
|
Chen Z, Liang N, Li H, Zhang H, Li H, Yan L, Hu Z, Chen Y, Zhang Y, Wang Y, Ke D, Shi N. Exploring explainable AI features in the vocal biomarkers of lung disease. Comput Biol Med 2024; 179:108844. [PMID: 38981214 DOI: 10.1016/j.compbiomed.2024.108844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 05/15/2024] [Accepted: 06/04/2024] [Indexed: 07/11/2024]
Abstract
This review delves into the burgeoning field of explainable artificial intelligence (XAI) in the detection and analysis of lung diseases through vocal biomarkers. Lung diseases, often elusive in their early stages, pose a significant public health challenge. Recent advancements in AI have ushered in innovative methods for early detection, yet the black-box nature of many AI models limits their clinical applicability. XAI emerges as a pivotal tool, enhancing transparency and interpretability in AI-driven diagnostics. This review synthesizes current research on the application of XAI in analyzing vocal biomarkers for lung diseases, highlighting how these techniques elucidate the connections between specific vocal features and lung pathology. We critically examine the methodologies employed, the types of lung diseases studied, and the performance of various XAI models. The potential for XAI to aid in early detection, monitor disease progression, and personalize treatment strategies in pulmonary medicine is emphasized. Furthermore, this review identifies current challenges, including data heterogeneity and model generalizability, and proposes future directions for research. By offering a comprehensive analysis of explainable AI features in the context of lung disease detection, this review aims to bridge the gap between advanced computational approaches and clinical practice, paving the way for more transparent, reliable, and effective diagnostic tools.
Collapse
Affiliation(s)
- Zhao Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ning Liang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Haoyuan Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Haili Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Huizhen Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Lijiao Yan
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ziteng Hu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yaxin Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yujing Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yanping Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Dandan Ke
- Special Disease Clinic, Huaishuling Branch of Beijing Fengtai Hospital of Integrated Traditional Chinese and Western Medicine, Beijing, China.
| | - Nannan Shi
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China; Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China.
| |
Collapse
|
5
|
Amini S, Hao B, Yang J, Karjadi C, Kolachalama VB, Au R, Paschalidis IC. Prediction of Alzheimer's disease progression within 6 years using speech: A novel approach leveraging language models. Alzheimers Dement 2024; 20:5262-5270. [PMID: 38924662 PMCID: PMC11350035 DOI: 10.1002/alz.13886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 03/01/2024] [Accepted: 04/19/2024] [Indexed: 06/28/2024]
Abstract
INTRODUCTION Identification of individuals with mild cognitive impairment (MCI) who are at risk of developing Alzheimer's disease (AD) is crucial for early intervention and selection of clinical trials. METHODS We applied natural language processing techniques along with machine learning methods to develop a method for automated prediction of progression to AD within 6 years using speech. The study design was evaluated on the neuropsychological test interviews of n = 166 participants from the Framingham Heart Study, comprising 90 progressive MCI and 76 stable MCI cases. RESULTS Our best models, which used features generated from speech data, as well as age, sex, and education level, achieved an accuracy of 78.5% and a sensitivity of 81.1% to predict MCI-to-AD progression within 6 years. DISCUSSION The proposed method offers a fully automated procedure, providing an opportunity to develop an inexpensive, broadly accessible, and easy-to-administer screening tool for MCI-to-AD progression prediction, facilitating development of remote assessment. HIGHLIGHTS Voice recordings from neuropsychological exams coupled with basic demographics can lead to strong predictive models of progression to dementia from mild cognitive impairment. The study leveraged AI methods for speech recognition and processed the resulting text using language models. The developed AI-powered pipeline can lead to fully automated assessment that could enable remote and cost-effective screening and prognosis for Alzehimer's disease.
Collapse
Affiliation(s)
- Samad Amini
- Department of Electrical & Computer EngineeringDivision of Systems Engineeringand Department of Biomedical EngineeringBoston UniversityBostonMassachusettsUSA
| | - Boran Hao
- Department of Electrical & Computer EngineeringDivision of Systems Engineeringand Department of Biomedical EngineeringBoston UniversityBostonMassachusettsUSA
| | - Jingmei Yang
- Department of Electrical & Computer EngineeringDivision of Systems Engineeringand Department of Biomedical EngineeringBoston UniversityBostonMassachusettsUSA
| | - Cody Karjadi
- Framingham Heart StudyBoston UniversityFraminghamMassachusettsUSA
| | - Vijaya B. Kolachalama
- Department of MedicineBoston University School of MedicineBostonMassachusettsUSA
- Faculty of Computing & Data SciencesBoston UniversityBostonMassachusettsUSA
- Department of Computer ScienceBoston UniversityBostonMassachusettsUSA
| | - Rhoda Au
- Framingham Heart StudyBoston UniversityFraminghamMassachusettsUSA
- Departments of Anatomy & Neurobiology, Neurology, and EpidemiologyBoston University School of Medicine and School of Public HealthBostonMassachusettsUSA
| | - Ioannis C. Paschalidis
- Department of Electrical & Computer EngineeringDivision of Systems Engineeringand Department of Biomedical EngineeringBoston UniversityBostonMassachusettsUSA
- Faculty of Computing & Data SciencesBoston UniversityBostonMassachusettsUSA
| |
Collapse
|
6
|
Harris C, Tang Y, Birnbaum E, Cherian C, Mendhe D, Chen MH. Digital Neuropsychology beyond Computerized Cognitive Assessment: Applications of Novel Digital Technologies. Arch Clin Neuropsychol 2024; 39:290-304. [PMID: 38520381 PMCID: PMC11485276 DOI: 10.1093/arclin/acae016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 02/16/2024] [Indexed: 03/25/2024] Open
Abstract
Compared with other health disciplines, there is a stagnation in technological innovation in the field of clinical neuropsychology. Traditional paper-and-pencil tests have a number of shortcomings, such as low-frequency data collection and limitations in ecological validity. While computerized cognitive assessment may help overcome some of these issues, current computerized paradigms do not address the majority of these limitations. In this paper, we review recent literature on the applications of novel digital health approaches, including ecological momentary assessment, smartphone-based assessment and sensors, wearable devices, passive driving sensors, smart homes, voice biomarkers, and electronic health record mining, in neurological populations. We describe how each digital tool may be applied to neurologic care and overcome limitations of traditional neuropsychological assessment. Ethical considerations, limitations of current research, as well as our proposed future of neuropsychological practice are also discussed.
Collapse
Affiliation(s)
- Che Harris
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| | - Yingfei Tang
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| | - Eliana Birnbaum
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Christine Cherian
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Dinesh Mendhe
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Michelle H Chen
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| |
Collapse
|
7
|
Rosen-Lang Y, Zoubi S, Cialic R, Orenstein T. Using voice biomarkers for frailty classification. GeroScience 2024; 46:1175-1179. [PMID: 37480417 PMCID: PMC10828289 DOI: 10.1007/s11357-023-00872-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Accepted: 07/11/2023] [Indexed: 07/24/2023] Open
Abstract
Clinicians use the patient's voice intuitively to evaluate general health and frailty. Voice is an emerging health indicator but has been scarcely studied in the context of frailty. This study explored voice parameters as possible predictors of frailty in older adults. Fifty-three participants over 70 years old were recruited from rehabilitation wards at a tertiary medical center. Participants' frailty was assessed using Rockwood frailty index and they were classified as most-frail (n = 33, 68%) or less-frail (n = 20, 32%). Participants were recorded counting from 1 to 10 and backwards using a smartphone recording application. The following voice biomarkers were derived: peak and average volume, peak/average volume ratio, pauses' total length, and pause length standard deviation. The most-frail group had a higher peak volume/average volume ratio (p = 0.03) and greater variance in lengths of pauses between speech segments (p = 0.002). These parameters indicate greater speech irregularity in the most-frail, compared to the less-frail. The most-frail group also had a longer total duration of pauses (p = 0.02). No statistically significant difference was found in peak and average volume (p = 0.75 and 0.39). Most-frail participants' speech had different characteristics, compared to participants in the less-frail group. This is a first step to developing an AI-based frailty assessment tool that can assist in identifying our most vulnerable patients.
Collapse
Affiliation(s)
- Yael Rosen-Lang
- Joseph Sagol Neuroscience Center, Sheba Medical Center, Ramat-Gan, Israel
| | - Saad Zoubi
- Geriatric Division, Tel-Aviv Sourasky Medical Center, Tel-Aviv, Israel
| | - Ron Cialic
- Geriatric Division, Tel-Aviv Sourasky Medical Center, Tel-Aviv, Israel
| | - Tal Orenstein
- Geriatric Division, Tel-Aviv Sourasky Medical Center, Tel-Aviv, Israel.
| |
Collapse
|
8
|
Banks R, Higgins C, Greene BR, Jannati A, Gomes‐Osman J, Tobyne S, Bates D, Pascual‐Leone A. Clinical classification of memory and cognitive impairment with multimodal digital biomarkers. ALZHEIMER'S & DEMENTIA (AMSTERDAM, NETHERLANDS) 2024; 16:e12557. [PMID: 38406610 PMCID: PMC10884988 DOI: 10.1002/dad2.12557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 01/24/2024] [Accepted: 01/24/2024] [Indexed: 02/27/2024]
Abstract
INTRODUCTION Early detection of Alzheimer's disease and cognitive impairment is critical to improving the healthcare trajectories of aging adults, enabling early intervention and potential prevention of decline. METHODS To evaluate multi-modal feature sets for assessing memory and cognitive impairment, feature selection and subsequent logistic regressions were used to identify the most salient features in classifying Rey Auditory Verbal Learning Test-determined memory impairment. RESULTS Multimodal models incorporating graphomotor, memory, and speech and voice features provided the stronger classification performance (area under the curve = 0.83; sensitivity = 0.81, specificity = 0.80). Multimodal models were superior to all other single modality and demographics models. DISCUSSION The current research contributes to the prevailing multimodal profile of those with cognitive impairment, suggesting that it is associated with slower speech with a particular effect on the duration, frequency, and percentage of pauses compared to normal healthy speech.
Collapse
Affiliation(s)
- Russell Banks
- Department of Communicative Sciences & DisordersCollege of Arts & SciencesMichigan State UniversityEast LansingMichiganUSA
| | | | | | - Ali Jannati
- Department of NeurologyHarvard Medical SchoolBostonMassachusettsUSA
| | - Joyce Gomes‐Osman
- Department of NeurologyUniversity of Miami Miller School of MedicineMiamiFloridaUSA
| | | | | | - Alvaro Pascual‐Leone
- Linus HealthBostonMassachusettsUSA
- Department of NeurologyHarvard Medical SchoolBostonMassachusettsUSA
- Hinda and Arthur Marcus Institute for Aging Research and Deanna and Sidney Wolk Center for Memory HealthHebrew SeniorLifeBostonMassachusettsUSA
| |
Collapse
|
9
|
Ding H, Wang B, Hamel AP, Karjadi C, Ang TFA, Au R, Lin H. Exploring cognitive progression subtypes in the Framingham Heart Study. ALZHEIMER'S & DEMENTIA (AMSTERDAM, NETHERLANDS) 2024; 16:e12574. [PMID: 38515438 PMCID: PMC10955221 DOI: 10.1002/dad2.12574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 02/23/2024] [Accepted: 02/27/2024] [Indexed: 03/23/2024]
Abstract
INTRODUCTION Alzheimer's disease (AD) is a heterogeneous disorder characterized by complex underlying neuropathology that is not fully understood. This study aimed to identify cognitive progression subtypes and examine their correlation with clinical outcomes. METHODS Participants of this study were recruited from the Framingham Heart Study. The Subtype and Stage Inference (SuStaIn) method was used to identify cognitive progression subtypes based on eight cognitive domains. RESULTS Three cognitive progression subtypes were identified, including verbal learning (Subtype 1), abstract reasoning (Subtype 2), and visual memory (Subtype 3). These subtypes represent different domains of cognitive decline during the progression of AD. Significant differences in age of onset among the different subtypes were also observed. A higher SuStaIn stage was significantly associated with increased mortality risk. DISCUSSION This study provides a characterization of AD heterogeneity in cognitive progression, emphasizing the importance of developing personalized approaches for risk stratification and intervention. Highlights We used the Subtype and Stage Inference (SuStaIn) method to identify three cognitive progression subtypes.Different subtypes have significant variations in age of onset.Higher stages of progression are associated with increased mortality risk.
Collapse
Affiliation(s)
- Huitong Ding
- Department of Anatomy and NeurobiologyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- The Framingham Heart StudyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
| | - Biqi Wang
- Department of MedicineUniversity of Massachusetts Chan Medical SchoolWorcesterMassachusettsUSA
| | - Alexander P. Hamel
- Department of MedicineUniversity of Massachusetts Chan Medical SchoolWorcesterMassachusettsUSA
| | - Cody Karjadi
- Department of Anatomy and NeurobiologyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- The Framingham Heart StudyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
| | - Ting F. A. Ang
- Department of Anatomy and NeurobiologyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- The Framingham Heart StudyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- Slone Epidemiology CenterBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
| | - Rhoda Au
- Department of Anatomy and NeurobiologyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- The Framingham Heart StudyBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- Slone Epidemiology CenterBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
- Department of EpidemiologyBoston University School of Public HealthBostonMassachusettsUSA
- Departments of Neurology and MedicineBoston University Chobanian & Avedisian School of MedicineBostonMassachusettsUSA
| | - Honghuang Lin
- Department of MedicineUniversity of Massachusetts Chan Medical SchoolWorcesterMassachusettsUSA
| |
Collapse
|
10
|
Benítez-Burraco A, Ivanova O. Language in healthy and pathological ageing: Methodological milestones and challenges. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2024; 59:4-12. [PMID: 38149881 DOI: 10.1111/1460-6984.13003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Affiliation(s)
- Antonio Benítez-Burraco
- Department of Spanish, Linguistics and Theory of Literature (Linguistics), Faculty of Philology, University of Seville, Sevilla, Spain
| | - Olga Ivanova
- Spanish Language Department, Faculty of Philology, University of Salamanca/Institute of Neuroscience of Castilla y León (INCYL), Salamanca, Spain
| |
Collapse
|
11
|
Bélisle-Pipon JC, Powell M, English R, Malo MF, Ravitsky V, Bensoussan Y. Stakeholder perspectives on ethical and trustworthy voice AI in health care. Digit Health 2024; 10:20552076241260407. [PMID: 39055787 PMCID: PMC11271113 DOI: 10.1177/20552076241260407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Accepted: 05/21/2024] [Indexed: 07/27/2024] Open
Abstract
Objective Voice as a health biomarker using artificial intelligence (AI) is gaining momentum in research. The noninvasiveness of voice data collection through accessible technology (such as smartphones, telehealth, and ambient recordings) or within clinical contexts means voice AI may help address health disparities and promote the inclusion of marginalized communities. However, the development of AI-ready voice datasets free from bias and discrimination is a complex task. The objective of this study is to better understand the perspectives of engaged and interested stakeholders regarding ethical and trustworthy voice AI, to inform both further ethical inquiry and technology innovation. Methods A questionnaire was administered to voice AI experts, clinicians, scholars, patients, trainees, and policy-makers who participated at the 2023 Voice AI Symposium organized by the Bridge2AI-Voice AI Consortium. The survey used a mix of Likert scale, ranking and open-ended questions. A total of 27 stakeholders participated in the study. Results The main results of the study are the identification of priorities in terms of ethical issues, an initial definition of ethically sourced data for voice AI, insights into the use of synthetic voice data, and proposals for acting on the trustworthiness of voice AI. The study shows a diversity of perspectives and adds nuance to the planning and development of ethical and trustworthy voice AI. Conclusions This study represents the first stakeholder survey related to voice as a biomarker of health published to date. This study sheds light on the critical importance of ethics and trustworthiness in the development of voice AI technologies for health applications.
Collapse
Affiliation(s)
| | - Maria Powell
- Vanderbilt University Medical Center, Department of Otolaryngology-Head & Neck Surgery, Nashville, TN, Canada
| | - Renee English
- Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | | | - Vardit Ravitsky
- Hastings Center, Garrison, NY, USA
- Department of Global Health and Social Medicine, Harvard University, Cambridge, MA, USA
| | | | - Yael Bensoussan
- Department of Otolaryngology-Head & Neck Surgery, University of South Florida, Tampa, FL, USA
| |
Collapse
|
12
|
Parlak MM, Saylam G, Babademez MA, Munis ÖB, Tokgöz SA. Voice analysis results in individuals with Alzheimer's disease: How do age and cognitive status affect voice parameters? Brain Behav 2023; 13:e3271. [PMID: 37794703 PMCID: PMC10636380 DOI: 10.1002/brb3.3271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 09/17/2023] [Accepted: 09/23/2023] [Indexed: 10/06/2023] Open
Abstract
BACKGROUND Reports of acoustic changes in the voice in individuals with Alzheimer's disease (AD) and the relationship of acoustic changes with age and cognitive status are still limited. OBJECTIVE This study aims to determine the changes in voice analysis results in AD, as well as the effects of age and cognitive status on voice parameters. METHODS The study included 47 (AD: 30; healthy: 17) women with a mean age of 76.13 years. The acoustic voice parameters mean fundamental frequency (F0), relative average perturbation (RAP), jitter percent (Jitt), shimmer percent (Shim), and noise-to-harmonic ratio were detected. The mini-mental state examination (MMSE) was utilized. RESULTS F0, Shim, Jitt, and RAP values were found to be statistically significantly higher in individuals with AD compared to healthy individuals. There was a significant negative correlation between MMSE and F0, Jitt, RAP and Shim, and the MMSE score had a significant negative effect on F0, Jitt, and RAP (p < .05). CONCLUSION Cognitive status was discovered to significantly impact the voice, with fundamental frequency and frequency and amplitude perturbations increasing as cognitive level decreases. In order to contribute to the therapy process for voice disorders, cognitive functions can be focused on in addition to voice therapy.
Collapse
Affiliation(s)
- Mümüne Merve Parlak
- Department of Speech and Language Therapy, Faculty of Health SciencesAnkara Yıldırım Beyazıt UniversityAnkaraTurkey
| | - Güleser Saylam
- Department of OtolaryngologyEtlik City HospitalAnkaraTurkey
| | | | | | | |
Collapse
|
13
|
Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J Med Internet Res 2023; 25:e46105. [PMID: 37467031 PMCID: PMC10398366 DOI: 10.2196/46105] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/26/2023] [Accepted: 05/23/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems. OBJECTIVE This study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest. METHODS This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group. RESULTS In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network-based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models. CONCLUSIONS This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research.
Collapse
Affiliation(s)
- Alper Idrisoglu
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Ana Luiza Dallora
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Peter Anderberg
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
- School of Health Sciences, University of Skövde, Skövde, Sweden
| | | |
Collapse
|
14
|
Amini S, Hao B, Zhang L, Song M, Gupta A, Karjadi C, Kolachalama VB, Au R, Paschalidis IC. Automated detection of mild cognitive impairment and dementia from voice recordings: A natural language processing approach. Alzheimers Dement 2023; 19:946-955. [PMID: 35796399 PMCID: PMC10148688 DOI: 10.1002/alz.12721] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 03/20/2022] [Accepted: 05/18/2022] [Indexed: 11/06/2022]
Abstract
INTRODUCTION Automated computational assessment of neuropsychological tests would enable widespread, cost-effective screening for dementia. METHODS A novel natural language processing approach is developed and validated to identify different stages of dementia based on automated transcription of digital voice recordings of subjects' neuropsychological tests conducted by the Framingham Heart Study (n = 1084). Transcribed sentences from the test were encoded into quantitative data and several models were trained and tested using these data and the participants' demographic characteristics. RESULTS Average area under the curve (AUC) on the held-out test data reached 92.6%, 88.0%, and 74.4% for differentiating Normal cognition from Dementia, Normal or Mild Cognitive Impairment (MCI) from Dementia, and Normal from MCI, respectively. DISCUSSION The proposed approach offers a fully automated identification of MCI and dementia based on a recorded neuropsychological test, providing an opportunity to develop a remote screening tool that could be adapted easily to any language.
Collapse
Affiliation(s)
- Samad Amini
- Department of Electrical & Computer Engineering, Division of Systems Engineering, and Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Boran Hao
- Department of Electrical & Computer Engineering, Division of Systems Engineering, and Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Lifu Zhang
- Department of Electrical & Computer Engineering, Division of Systems Engineering, and Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Mengting Song
- Department of Electrical & Computer Engineering, Division of Systems Engineering, and Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Aman Gupta
- Department of Electrical & Computer Engineering, Division of Systems Engineering, and Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Cody Karjadi
- Framingham Heart Study, Boston University, Boston, Massachusetts, USA
| | - Vijaya B. Kolachalama
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts, USA
- Faculty of Computing & Data Sciences, Boston University, Boston, Massachusetts, USA
- Department of Computer Science, Boston University, Boston, Massachusetts, USA
| | - Rhoda Au
- Framingham Heart Study, Boston University, Boston, Massachusetts, USA
- Departments of Anatomy & Neurobiology, Neurology, and Epidemiology, Boston University School of Medicine and School of Public Health, Boston, Massachusetts, USA
| | - Ioannis Ch. Paschalidis
- Department of Electrical & Computer Engineering, Division of Systems Engineering, and Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
- Faculty of Computing & Data Sciences, Boston University, Boston, Massachusetts, USA
| |
Collapse
|
15
|
Ding H, Mandapati A, Hamel AP, Karjadi C, Ang TFA, Xia W, Au R, Lin H. Multimodal Machine Learning for 10-Year Dementia Risk Prediction: The Framingham Heart Study. J Alzheimers Dis 2023; 96:277-286. [PMID: 37742648 DOI: 10.3233/jad-230496] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
BACKGROUND Early prediction of dementia risk is crucial for effective interventions. Given the known etiologic heterogeneity, machine learning methods leveraging multimodal data, such as clinical manifestations, neuroimaging biomarkers, and well-documented risk factors, could predict dementia more accurately than single modal data. OBJECTIVE This study aims to develop machine learning models that capitalize on neuropsychological (NP) tests, magnetic resonance imaging (MRI) measures, and clinical risk factors for 10-year dementia prediction. METHODS This study included participants from the Framingham Heart Study, and various data modalities such as NP tests, MRI measures, and demographic variables were collected. CatBoost was used with Optuna hyperparameter optimization to create prediction models for 10-year dementia risk using different combinations of data modalities. The contribution of each modality and feature for the prediction task was also quantified using Shapley values. RESULTS This study included 1,031 participants with normal cognitive status at baseline (age 75±5 years, 55.3% women), of whom 205 were diagnosed with dementia during the 10-year follow-up. The model built on three modalities demonstrated the best dementia prediction performance (AUC 0.90±0.01) compared to single modality models (AUC range: 0.82-0.84). MRI measures contributed most to dementia prediction (mean absolute Shapley value: 3.19), suggesting the necessity of multimodal inputs. CONCLUSION This study shows that a multimodal machine learning framework had a superior performance for 10-year dementia risk prediction. The model can be used to increase vigilance for cognitive deterioration and select high-risk individuals for early intervention and risk management.
Collapse
Affiliation(s)
- Huitong Ding
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA, USA
| | - Amiya Mandapati
- Department of Religious Studies, Brown University, Providence, RI, USA
- The Warren Alpert Medical School, Brown University, Providence, RI, USA
| | - Alexander P Hamel
- Department of Medicine, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Cody Karjadi
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA, USA
| | - Ting F A Ang
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
- Slone Epidemiology Center, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
| | - Weiming Xia
- Geriatric Research Education and Clinical Center, VA Bedford Healthcare System, Bedford, MA, USA
- Department of Pharmacology and Experimental Therapeutics, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- Department of Biological Science, Kennedy College of Sciences, University of Massachusetts Lowell, Lowell, MA, USA
| | - Rhoda Au
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
- Slone Epidemiology Center, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- Department of Neurology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
| | - Honghuang Lin
- Department of Medicine, University of Massachusetts Chan Medical School, Worcester, MA, USA
| |
Collapse
|
16
|
Agbavor F, Liang H. Artificial Intelligence-Enabled End-To-End Detection and Assessment of Alzheimer's Disease Using Voice. Brain Sci 2022; 13:28. [PMID: 36672010 PMCID: PMC9856143 DOI: 10.3390/brainsci13010028] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/13/2022] [Accepted: 12/20/2022] [Indexed: 12/25/2022] Open
Abstract
There is currently no simple, widely available screening method for Alzheimer's disease (AD), partly because the diagnosis of AD is complex and typically involves expensive and sometimes invasive tests not commonly available outside highly specialized clinical settings. Here, we developed an artificial intelligence (AI)-powered end-to-end system to detect AD and predict its severity directly from voice recordings. At the core of our system is the pre-trained data2vec model, the first high-performance self-supervised algorithm that works for speech, vision, and text. Our model was internally evaluated on the ADReSSo (Alzheimer's Dementia Recognition through Spontaneous Speech only) dataset containing voice recordings of subjects describing the Cookie Theft picture, and externally validated on a test dataset from DementiaBank. The AI model can detect AD with average area under the curve (AUC) of 0.846 and 0.835 on held-out and external test set, respectively. The model was well-calibrated (Hosmer-Lemeshow goodness-of-fit p-value = 0.9616). Moreover, the model can reliably predict the subject's cognitive testing score solely based on raw voice recordings. Our study demonstrates the feasibility of using the AI-powered end-to-end model for early AD diagnosis and severity prediction directly based on voice, showing its potential for screening Alzheimer's disease in a community setting.
Collapse
Affiliation(s)
| | - Hualou Liang
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA 19104, USA
| |
Collapse
|
17
|
Ding H, Mandapati A, Karjadi C, Ang TFA, Lu S, Miao X, Glass J, Au R, Lin H. Association Between Acoustic Features and Neuropsychological Test Performance in the Framingham Heart Study: Observational Study. J Med Internet Res 2022; 24:e42886. [PMID: 36548029 PMCID: PMC9816957 DOI: 10.2196/42886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 11/29/2022] [Accepted: 12/03/2022] [Indexed: 12/07/2022] Open
Abstract
BACKGROUND Human voice has increasingly been recognized as an effective indicator for the detection of cognitive disorders. However, the association of acoustic features with specific cognitive functions and mild cognitive impairment (MCI) has yet to be evaluated in a large community-based population. OBJECTIVE This study aimed to investigate the association between acoustic features and neuropsychological (NP) tests across multiple cognitive domains and evaluate the added predictive power of acoustic composite scores for the classification of MCI. METHODS This study included participants without dementia from the Framingham Heart Study, a large community-based cohort with longitudinal surveillance for incident dementia. For each participant, 65 low-level acoustic descriptors were derived from voice recordings of NP test administration. The associations between individual acoustic descriptors and 18 NP tests were assessed with linear mixed-effect models adjusted for age, sex, and education. Acoustic composite scores were then built by combining acoustic features significantly associated with NP tests. The added prediction power of acoustic composite scores for prevalent and incident MCI was also evaluated. RESULTS The study included 7874 voice recordings from 4950 participants (age: mean 62, SD 14 years; 4336/7874, 55.07% women), of whom 453 were diagnosed with MCI. In all, 8 NP tests were associated with more than 15 acoustic features after adjusting for multiple testing. Additionally, 4 of the acoustic composite scores were significantly associated with prevalent MCI and 7 were associated with incident MCI. The acoustic composite scores can increase the area under the curve of the baseline model for MCI prediction from 0.712 to 0.755. CONCLUSIONS Multiple acoustic features are significantly associated with NP test performance and MCI, which can potentially be used as digital biomarkers for early cognitive impairment monitoring.
Collapse
Affiliation(s)
- Huitong Ding
- Department of Anatomy and Neurobiology, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
| | - Amiya Mandapati
- Department of Religious Studies, Brown University, Providence, RI, United States
- The Warren Alpert Medical School, Brown University, Providence, RI, United States
| | - Cody Karjadi
- Department of Anatomy and Neurobiology, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
- The Framingham Heart Study, Chobanian & Avedisian School of Medicine, Boston University, Framingham, MA, United States
| | - Ting Fang Alvin Ang
- Department of Anatomy and Neurobiology, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
- The Framingham Heart Study, Chobanian & Avedisian School of Medicine, Boston University, Framingham, MA, United States
- Slone Epidemiology Center, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
- Department of Epidemiology, School of Public Health, Boston University, Boston, MA, United States
| | - Sophia Lu
- Slone Epidemiology Center, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
| | - Xiao Miao
- Innovation Research Institute of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - James Glass
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States
| | - Rhoda Au
- Department of Anatomy and Neurobiology, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
- The Framingham Heart Study, Chobanian & Avedisian School of Medicine, Boston University, Framingham, MA, United States
- Slone Epidemiology Center, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
- Department of Epidemiology, School of Public Health, Boston University, Boston, MA, United States
- Department of Neurology, Chobanian & Avedisian School of Medicine, Boston University, Boston, MA, United States
| | - Honghuang Lin
- Department of Medicine, University of Massachusetts Chan Medical School, Worcester, MA, United States
| |
Collapse
|
18
|
Mahon E, Lachman ME. Voice biomarkers as indicators of cognitive changes in middle and later adulthood. Neurobiol Aging 2022; 119:22-35. [PMID: 35964541 PMCID: PMC9487188 DOI: 10.1016/j.neurobiolaging.2022.06.010] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 06/20/2022] [Accepted: 06/26/2022] [Indexed: 11/20/2022]
Abstract
Voice prosody measures have been linked with Alzheimer's disease (AD), but it is unclear whether they are associated with normal cognitive aging. We assessed relationships between voice measures and 10-year cognitive changes in the MIDUS national sample of middle-aged and older adults ages 42-92, with a mean age of 64.09 (standard deviation = 11.23) at the second wave. Seven cognitive tests were assessed in 2003-2004 (Wave 2) and 2013-2014 (Wave 3). Voice measures were collected at Wave 3 (N = 2585) from audio recordings of the cognitive interviews. Analyses controlled for age, education, depressive symptoms, and health. As predicted, higher jitter was associated with greater declines in episodic memory, verbal fluency, and attention switching. Lower pulse was related to greater decline in episodic memory, and fewer voice breaks were related to greater declines in episodic memory and verbal fluency, although the direction of these effects was contrary to hypotheses. Findings suggest that voice biomarkers may offer a promising approach for early detection of risk factors for cognitive impairment or AD.
Collapse
Affiliation(s)
- Elizabeth Mahon
- Brandeis University, Department of Psychology, Waltham, MA, USA.
| | | |
Collapse
|
19
|
Tröger J, Baykara E, Zhao J, ter Huurne D, Possemis N, Mallick E, Schäfer S, Schwed L, Mina M, Linz N, Ramakers I, Ritchie C. Validation of the Remote Automated ki:e Speech Biomarker for Cognition in Mild Cognitive Impairment: Verification and Validation following DiME V3 Framework. Digit Biomark 2022; 6:107-116. [PMID: 36466952 PMCID: PMC9710455 DOI: 10.1159/000526471] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 08/06/2022] [Indexed: 08/01/2023] Open
Abstract
INTRODUCTION Progressive cognitive decline is the cardinal behavioral symptom in most dementia-causing diseases such as Alzheimer's disease. While most well-established measures for cognition might not fit tomorrow's decentralized remote clinical trials, digital cognitive assessments will gain importance. We present the evaluation of a novel digital speech biomarker for cognition (SB-C) following the Digital Medicine Society's V3 framework: verification, analytical validation, and clinical validation. METHODS Evaluation was done in two independent clinical samples: the Dutch DeepSpA (N = 69 subjective cognitive impairment [SCI], N = 52 mild cognitive impairment [MCI], and N = 13 dementia) and the Scottish SPeAk datasets (N = 25, healthy controls). For validation, two anchor scores were used: the Mini-Mental State Examination (MMSE) and the Clinical Dementia Rating (CDR) scale. RESULTS Verification: The SB-C could be reliably extracted for both languages using an automatic speech processing pipeline. Analytical Validation: In both languages, the SB-C was strongly correlated with MMSE scores. Clinical Validation: The SB-C significantly differed between clinical groups (including MCI and dementia), was strongly correlated with the CDR, and could track the clinically meaningful decline. CONCLUSION Our results suggest that the ki:e SB-C is an objective, scalable, and reliable indicator of cognitive decline, fit for purpose as a remote assessment in clinical early dementia trials.
Collapse
Affiliation(s)
| | | | | | - Daphne ter Huurne
- Alzheimer Center Limburg, School for Mental Health and Neuroscience, Maastricht University, Maastricht, The Netherlands
| | - Nina Possemis
- Alzheimer Center Limburg, School for Mental Health and Neuroscience, Maastricht University, Maastricht, The Netherlands
| | | | | | | | | | | | - Inez Ramakers
- Alzheimer Center Limburg, School for Mental Health and Neuroscience, Maastricht University, Maastricht, The Netherlands
| | - Craig Ritchie
- Edinburgh Dementia Prevention, Centre for Clinical Brain Sciences, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
20
|
Libon DJ, Swenson R, Lamar M, Price CC, Baliga G, Pascual-Leone A, Au R, Cosentino S, Andersen SL. The Boston Process Approach and Digital Neuropsychological Assessment: Past Research and Future Directions. J Alzheimers Dis 2022; 87:1419-1432. [PMID: 35466941 DOI: 10.3233/jad-220096] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Neuropsychological assessment using the Boston Process Approach (BPA) suggests that an analysis of the strategy or the process by which tasks and neuropsychological tests are completed, and the errors made during test completion convey much information regarding underlying brain and cognition and are as important as overall summary scores. Research over the last several decades employing an analysis of process and errors has been able to dissociate between dementia patients diagnosed with Alzheimer's disease, vascular dementia associated with MRI-determined white matter alterations, and Parkinson's disease; and between mild cognitive impairment subtypes. Nonetheless, BPA methods can be labor intensive to deploy. However, the recent availability of digital platforms for neuropsychological test administration and scoring now enables reliable, rapid, and objective data collection. Further, digital technology can quantify highly nuanced data previously unobtainable to define neurocognitive constructs with high accuracy. In this paper, a brief review of the BPA is provided. Studies that demonstrate how digital technology translates BPA into specific neurocognitive constructs using the Clock Drawing Test, Backward Digit Span Test, and a Digital Pointing Span Test are described. Implications for using data driven artificial intelligence-supported analytic approaches enabling the creation of more sensitive and specific detection/diagnostic algorithms for putative neurodegenerative illness are also discussed.
Collapse
Affiliation(s)
- David J Libon
- New Jersey Institute for Successful Aging, Rowan University, School of Osteopathic Medicine, NJ, USA
| | - Rod Swenson
- Department of Psychiatry and Behavioral Science, University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND, USA
| | - Melissa Lamar
- Rush Alzheimer's Disease Center and the Department of Psychiatry and Behavioral Sciences, Rush University Medical Center, Chicago, IL, USA
| | - Catherine C Price
- Department of Clinical and Health Psychology, University of Florida, Gainesville, FL, USA
| | - Ganesh Baliga
- Department of Computer Science, Rowan University, Glassboro, NJ, USA
| | - Alvaro Pascual-Leone
- Hinda and Arthur Marcus Institute for Aging Research and Deanna and Sidney Wolk Center for Memory Health, Hebrew Senior Life, Boston, MA, USA.,Department of Neurology, Harvard Medical School, Boston, MA, USA.,Guttmann Brain Health Institute, Barcelona, Spain
| | - Rhoda Au
- Departments of Anatomy & Neurobiology and Neurology; Framingham Heart Study, Slone Epidemiology Center and Alzheimer's Disease Research Center, Boston University School of Medicine, Boston, MA, USA.,Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
| | - Stephanie Cosentino
- Department of Neurology, Taub Institute and Sergievsky Center, Cognitive Neuroscience Division, Columbia University Medical Center, New York, NY, USA
| | - Stacy L Andersen
- Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| |
Collapse
|
21
|
Kaye J, Aisen P, Amariglio R, Au R, Ballard C, Carrillo M, Fillit H, Iwatsubo T, Jimenez-Maggiora G, Lovestone S, Natanegara F, Papp K, Soto ME, Weiner M, Vellas B. Using Digital Tools to Advance Alzheimer's Drug Trials During a Pandemic: The EU/US CTAD Task Force. JPAD-JOURNAL OF PREVENTION OF ALZHEIMERS DISEASE 2021; 8:513-519. [PMID: 34585227 PMCID: PMC8244451 DOI: 10.14283/jpad.2021.36] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The 2020 COVID-19 pandemic has disrupted Alzheimer’s disease (AD) clinical studies worldwide. Digital technologies may help minimize disruptions by enabling remote assessment of subtle cognitive and functional changes over the course of the disease. The EU/US Clinical Trials in Alzheimer’s Disease (CTAD) Task Force met virtually in November 2020 to explore the opportunities and challenges associated with the use of digital technologies in AD clinical research. While recognizing the potential of digital tools to accelerate clinical trials, improve the engagement of diverse populations, capture clinically meaningful data, and lower costs, questions remain regarding the stability, validity, generalizability, and reproducibility of digital data. Substantial concerns also exist regarding regulatory acceptance and privacy. Nonetheless, the Task Force supported further exploration of digital technologies through collaboration and data sharing, noting the need for standardization of digital readouts. They also concluded that while it may be premature to employ remote assessments for trials of novel experimental medications, remote studies of non-invasive, multi-domain approaches may be feasible at this time.
Collapse
Affiliation(s)
- J Kaye
- Jeffrey Kaye, Layton Aging and Alzheimer's Disease Center, School of Medicine, Oregon Health and Science University, Portland, OR, USA,
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Zhang L, Ngo A, Thomas JA, Burkhardt HA, Parsey CM, Au R, Ghomi RH. Neuropsychological test validation of speech markers of cognitive impairment in the Framingham Cognitive Aging Cohort. EXPLORATION OF MEDICINE 2021; 2:232-252. [PMID: 34746927 PMCID: PMC8570561 DOI: 10.37349/emed.2021.00044] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 04/12/2021] [Indexed: 12/15/2022] Open
Abstract
AIM Although clinicians primarily diagnose dementia based on a combination of metrics such as medical history and formal neuropsychological tests, recent work using linguistic analysis of narrative speech to identify dementia has shown promising results. We aim to build upon research by Thomas JA & Burkardt HA et al. (J Alzheimers Dis. 2020;76:905-22) and Alhanai et al. (arXiv:1710.07551v1. 2020) on the Framingham Heart Study (FHS) Cognitive Aging Cohort by 1) demonstrating the predictive capability of linguistic analysis in differentiating cognitively normal from cognitively impaired participants and 2) comparing the performance of the original linguistic features with the performance of expanded features. METHODS Data were derived from a subset of the FHS Cognitive Aging Cohort. We analyzed a sub-selection of 98 participants, which provided 127 unique audio files and clinical observations (n = 127, female = 47%, cognitively impaired = 43%). We built on previous work which extracted original linguistic features from transcribed audio files by extracting expanded features. We used both feature sets to train logistic regression classifiers to distinguish cognitively normal from cognitively impaired participants and compared the predictive power of the original and expanded linguistic feature sets, and participants' Mini-Mental State Examination (MMSE) scores. RESULTS Based on the area under the receiver-operator characteristic curve (AUC) of the models, both the original (AUC = 0.882) and expanded (AUC = 0.883) feature sets outperformed MMSE (AUC = 0.870) in classifying cognitively impaired and cognitively normal participants. Although the original and expanded feature sets had similar AUC, the expanded feature set showed better positive and negative predictive value [expanded: positive predictive value (PPV) = 0.738, negative predictive value (NPV) = 0.889; original: PPV = 0.701, NPV = 0.869]. CONCLUSIONS Linguistic analysis has been shown to be a potentially powerful tool for clinical use in classifying cognitive impairment. This study expands the work of several others, but further studies into the plausibility of speech analysis in clinical use are vital to ensure the validity of speech analysis for clinical classification of cognitive impairment.
Collapse
Affiliation(s)
- Larry Zhang
- Department of Intelligent Systems Engineering, Indiana University Bloomington, Bloomington, Indiana 47408, United States
- Department of Informatics, Indiana University Bloomington, Bloomington, Indiana 47408, United States
| | - Anthony Ngo
- Department of Statistics, University of Washington, Seattle, Washington 98195-0005, United States
| | - Jason A. Thomas
- Department of Biomedical Informatics and Medical Education, University of Washington Seattle Campus, Seattle, Washington 98195-0005, United States
| | - Hannah A. Burkhardt
- Department of Biomedical Informatics and Medical Education, University of Washington Seattle Campus, Seattle, Washington 98195-0005, United States
| | - Carolyn M. Parsey
- Department of Neurology, University of Washington, Seattle, Washington 98195-0005, United States
| | - Rhoda Au
- Department of Anatomy and Neurobiology, Neurology, and Epidemiology, Boston University Schools of Medicine and Public Health, Boston, Massachusetts 02118, United States
| | - Reza Hosseini Ghomi
- Department of Neurology, University of Washington, Seattle, Washington 98195-0005, United States
| |
Collapse
|