1
|
Riad R, Denais M, de Gennes M, Lesage A, Oustric V, Cao XN, Mouchabac S, Bourla A. Automated speech analysis for risk detection of depression, anxiety, insomnia, and fatigue: Algorithm Development and Validation Study. J Med Internet Res 2024. [PMID: 39324329 DOI: 10.2196/58572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2024] Open
Abstract
BACKGROUND While speech analysis holds promise for mental health assessment, research often focuses on single symptoms, despite symptom co-occurrences and interactions. In addition, predictive models in mental health do not properly assess speech-based systems' limitations, such as uncertainty, or fairness for a safe clinical deployment. OBJECTIVE We investigated the predictive potential of mobile-collected speech data for detecting and estimating depression, anxiety, fatigue, and insomnia, focusing on other factors than mere accuracy, in the general population. METHODS We included n=865 healthy adults and recorded their answers regarding their perceived mental and sleep states. We asked how they felt and if they had slept well lately. Clinically validated questionnaires measuring depression, anxiety, insomnia, and fatigue severity were also used. We developed a novel speech and machine learning pipeline involving voice activity detection, feature extraction, and model training. We automatically analyzed participants' speech with a fully ML automatic pipeline to capture speech variability. Then, we modelled speech with pretrained deep learning models that were pre-trained on a large open free database and we selected the best one on the validation set. Based on the best speech modelling approach, we evaluated clinical threshold detection, individual score prediction, model uncertainty estimation, and performance fairness across demographics (age, sex, education). We employed a train-validation-test split for all evaluations: to develop our models, select the best ones and assess the generalizability of held-out data. RESULTS The best model was WhisperM with a max pooling, and oversampling method. Our methods achieved good detection performance for all symptoms, depression (PHQ-9 AUC= 0.76F1=0.49, BDI AUC=0.78, F1=0,65), anxiety (GAD-7 F1=0.50, AUC=0.77) insomnia (AIS AUC=0.73, F1=0.62), and fatigue (MFI Total Score F1=0.88, AUC=0.68). These strengths were maintained for depression detection with BDI and Fatigue for abstention rates for uncertain cases (Risk-Coverage AUCs < 0.4). Individual symptom scores were predicted with good accuracy (Correlations were all significant, with Pearson strengths between 0.31 and 0.49). Fairness analysis revealed that models were consistent for sex (average Disparity Ratio (DR) = 0.86), to a lesser extent for education level (average Disparity Ratio (DR) = 0.47) and worse for age groups (average Disparity Ratio (DR) = 0.33). CONCLUSIONS This study demonstrates the potential of speech-based systems for multifaceted mental health assessment in the general population, not only for detecting clinical thresholds but also for estimating their severity. Addressing fairness and incorporating uncertainty estimation with selective classification are key contributions that can enhance the clinical utility and responsible implementation of such systems. This approach offers promise for more accurate and nuanced mental health assessments, benefiting both patients and clinicians. CLINICALTRIAL
Collapse
Affiliation(s)
| | | | | | | | | | | | - Stéphane Mouchabac
- Department of Psychiatry, Saint-Antoine Hospital, Sorbonne University, AP-HP, Paris, FR
- Infrastructure for Clinical Research in Neurosciences (iCRIN), Paris BrainInstitute, Paris, FR
| | - Alexis Bourla
- Department of Psychiatry, Saint-Antoine Hospital, Sorbonne University, AP-HP, Paris, FR
- Infrastructure for Clinical Research in Neurosciences (iCRIN), Paris BrainInstitute, Paris, FR
- Clariane, Medical Strategy and Innovation Department, Paris, FR
- NeuroStim Psychiatry Practice, Paris, FR
| |
Collapse
|
2
|
Larsen E, Murton O, Song X, Joachim D, Watts D, Kapczinski F, Venesky L, Hurowitz G. Validating the efficacy and value proposition of mental fitness vocal biomarkers in a psychiatric population: prospective cohort study. Front Psychiatry 2024; 15:1342835. [PMID: 38505797 PMCID: PMC10948552 DOI: 10.3389/fpsyt.2024.1342835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/14/2024] [Indexed: 03/21/2024] Open
Abstract
Background The utility of vocal biomarkers for mental health assessment has gained increasing attention. This study aims to further this line of research by introducing a novel vocal scoring system designed to provide mental fitness tracking insights to users in real-world settings. Methods A prospective cohort study with 104 outpatient psychiatric participants was conducted to validate the "Mental Fitness Vocal Biomarker" (MFVB) score. The MFVB score was derived from eight vocal features, selected based on literature review. Participants' mental health symptom severity was assessed using the M3 Checklist, which serves as a transdiagnostic tool for measuring depression, anxiety, post-traumatic stress disorder, and bipolar symptoms. Results The MFVB demonstrated an ability to stratify individuals by their risk of elevated mental health symptom severity. Continuous observation enhanced the MFVB's efficacy, with risk ratios improving from 1.53 (1.09-2.14, p=0.0138) for single 30-second voice samples to 2.00 (1.21-3.30, p=0.0068) for data aggregated over two weeks. A higher risk ratio of 8.50 (2.31-31.25, p=0.0013) was observed in participants who used the MFVB 5-6 times per week, underscoring the utility of frequent and continuous observation. Participant feedback confirmed the user-friendliness of the application and its perceived benefits. Conclusions The MFVB is a promising tool for objective mental health tracking in real-world conditions, with potential to be a cost-effective, scalable, and privacy-preserving adjunct to traditional psychiatric assessments. User feedback suggests that vocal biomarkers can offer personalized insights and support clinical therapy and other beneficial activities that are associated with improved mental health risks and outcomes.
Collapse
Affiliation(s)
| | | | | | | | - Devon Watts
- Neuroscience Graduate Program, Department of Health Sciences, McMaster University, Hamilton, ON, Canada
- St. Joseph’s Healthcare Hamilton, Hamilton, ON, Canada
| | - Flavio Kapczinski
- Neuroscience Graduate Program, Department of Health Sciences, McMaster University, Hamilton, ON, Canada
- Department of Psychiatry, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | | | | |
Collapse
|
3
|
McIntyre RS, Greenleaf W, Bulaj G, Taylor ST, Mitsi G, Saliu D, Czysz A, Silvesti G, Garcia M, Jain R. Digital health technologies and major depressive disorder. CNS Spectr 2023; 28:662-673. [PMID: 37042341 DOI: 10.1017/s1092852923002225] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
There is an urgent need to improve the clinical management of major depressive disorder (MDD), which has become increasingly prevalent over the past two decades. Several gaps and challenges in the awareness, detection, treatment, and monitoring of MDD remain to be addressed. Digital health technologies have demonstrated utility in relation to various health conditions, including MDD. Factors related to the COVID-19 pandemic have accelerated the development of telemedicine, mobile medical apps, and virtual reality apps and have continued to introduce new possibilities across mental health care. Growing access to and acceptance of digital health technologies present opportunities to expand the scope of care and to close gaps in the management of MDD. Digital health technology is rapidly evolving the options for nonclinical support and clinical care for patients with MDD. Iterative efforts to validate and optimize such digital health technologies, including digital therapeutics and digital biomarkers, continue to improve access to and quality of personalized detection, treatment, and monitoring of MDD. The aim of this review is to highlight the existing gaps and challenges in depression management and discuss the current and future landscape of digital health technology as it applies to the challenges faced by patients with MDD and their healthcare providers.
Collapse
Affiliation(s)
- Roger S McIntyre
- Department of Psychiatry and Pharmacology, University of Toronto, Toronto, ON, Canada
| | - Walter Greenleaf
- Virtual Human Interaction Lab, Stanford University, San Francisco, CA, USA
| | - Grzegorz Bulaj
- Department of Medicinal Chemistry, College of Pharmacy, University of Utah, Salt Lake City, UT, USA
| | - Steven T Taylor
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA
- Department of Psychiatry, Massachusetts General Hospital, McLean Hospital, Boston, MA, USA
| | | | | | - Andy Czysz
- Sage Therapeutics, Inc., Cambridge, MA, USA
| | | | | | - Rakesh Jain
- Department of Psychiatry, Texas Tech University School of Medicine, Lubbock, TX, USA
| |
Collapse
|
4
|
Adler-Milstein J, Aggarwal N, Ahmed M, Castner J, Evans BJ, Gonzalez AA, James CA, Lin S, Mandl KD, Matheny ME, Sendak MP, Shachar C, Williams A. Meeting the Moment: Addressing Barriers and Facilitating Clinical Adoption of Artificial Intelligence in Medical Diagnosis. NAM Perspect 2022; 2022:202209c. [PMID: 36713769 PMCID: PMC9875857 DOI: 10.31478/202209c] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
5
|
Lin Y, Liyanage BN, Sun Y, Lu T, Zhu Z, Liao Y, Wang Q, Shi C, Yue W. A deep learning-based model for detecting depression in senior population. Front Psychiatry 2022; 13:1016676. [PMID: 36419976 PMCID: PMC9677587 DOI: 10.3389/fpsyt.2022.1016676] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 10/17/2022] [Indexed: 11/09/2022] Open
Abstract
OBJECTIVES With the attention paid to the early diagnosis of depression, this study tries to use the biological information of speech, combined with deep learning to build a rapid binary-classification model of depression in the elderly who use Mandarin and test its effectiveness. METHODS Demographic information and acoustic data of 56 Mandarin-speaking older adults with major depressive disorder (MDD), diagnosed with the Mini-International Neuropsychiatric Interview (MINI) and the fifth edition of Diagnostic and Statistical Manual of Mental Disorders (DSM-5), and 47 controls was collected. Acoustic data were recorded using different smart phones and analyzed by deep learning model which is developed and tested on independent validation set. The accuracy of the model is shown by the ROC curve. RESULTS The quality of the collected speech affected the accuracy of the model. The initial sensitivity and specificity of the model were respectively 82.14% [95%CI, (70.16-90.00)] and 80.85% [95%CI, (67.64-89.58)]. CONCLUSION This study provides a new method for rapid identification and diagnosis of depression utilizing deep learning technology. Vocal biomarkers extracted from raw speech signals have high potential for the early diagnosis of depression in older adults.
Collapse
Affiliation(s)
- Yunhan Lin
- Institute of Mental Health, Peking University Sixth Hospital, Beijing, China.,Research Unit of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences (2018RU006), Beijing, China.,National Clinical Research Center for Mental Disorders and NHC Key Laboratory of Mental Health and (Peking University Sixth Hospital), Beijing, China
| | | | - Yutao Sun
- The Fifth Hospital of Tangshan City, Tangshan, China
| | - Tianlan Lu
- Institute of Mental Health, Peking University Sixth Hospital, Beijing, China.,Research Unit of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences (2018RU006), Beijing, China.,National Clinical Research Center for Mental Disorders and NHC Key Laboratory of Mental Health and (Peking University Sixth Hospital), Beijing, China
| | | | - Yundan Liao
- Institute of Mental Health, Peking University Sixth Hospital, Beijing, China.,Research Unit of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences (2018RU006), Beijing, China.,National Clinical Research Center for Mental Disorders and NHC Key Laboratory of Mental Health and (Peking University Sixth Hospital), Beijing, China
| | | | - Chuan Shi
- Institute of Mental Health, Peking University Sixth Hospital, Beijing, China.,Research Unit of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences (2018RU006), Beijing, China.,National Clinical Research Center for Mental Disorders and NHC Key Laboratory of Mental Health and (Peking University Sixth Hospital), Beijing, China
| | - Weihua Yue
- Institute of Mental Health, Peking University Sixth Hospital, Beijing, China.,Research Unit of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences (2018RU006), Beijing, China.,National Clinical Research Center for Mental Disorders and NHC Key Laboratory of Mental Health and (Peking University Sixth Hospital), Beijing, China.,PKU-IDG/McGovern Institute for Brain Research, Peking University, Beijing, China.,Chinese Institute for Brain Research, Beijing, China
| |
Collapse
|