Reference Citation Analysis

For: Cohen AS, Elvevåg B. Automated computerized analysis of speech in psychiatric disorders. Curr Opin Psychiatry 2014;27:203-9. [PMID: 24613984 DOI: 10.1097/YCO.0000000000000056] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Cited by Other Article(s)

Nwosu OI, Naunheim MR. Artificial Intelligence in Laryngology, Broncho-Esophagology, and Sleep Surgery. Otolaryngol Clin North Am 2024;57:821-829. [PMID: 38719714 DOI: 10.1016/j.otc.2024.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]

Siegel JS, Cohen AS, Szabo ST, Tomioka S, Opler M, Kirkpatrick B, Hopkins S. Enrichment using speech latencies improves treatment effect size in a clinical trial of bipolar depression. Psychiatry Res 2024;340:116105. [PMID: 39151277 DOI: 10.1016/j.psychres.2024.116105] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 07/23/2024] [Accepted: 07/24/2024] [Indexed: 08/19/2024]

Cohen AS, Rodriguez Z, Opler M, Kirkpatrick B, Milanovic S, Piacentino D, Szabo ST, Tomioka S, Ogirala A, Koblan KS, Siegel JS, Hopkins S. Evaluating speech latencies during structured psychiatric interviews as an automated objective measure of psychomotor slowing. Psychiatry Res 2024;340:116104. [PMID: 39137558 DOI: 10.1016/j.psychres.2024.116104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 07/23/2024] [Accepted: 07/24/2024] [Indexed: 08/15/2024]

Olah J, Wong WLE, Chaudhry AURR, Mena O, Tang SX. Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.09.03.24313020. [PMID: 39281747 PMCID: PMC11398428 DOI: 10.1101/2024.09.03.24313020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]

Abstract

Background

Psychosis poses substantial social and healthcare burdens. The analysis of speech is a promising approach for the diagnosis and monitoring of psychosis, capturing symptoms like thought disorder and flattened affect. Recent advancements in Natural Language Processing (NLP) methodologies enable the automated extraction of informative speech features, which has been leveraged for early psychosis detection and assessment of symptomology. However, critical gaps persist, including the absence of standardized sample collection protocols, small sample sizes, and a lack of multi-illness classification, limiting clinical applicability. Our study aimed to (1) identify an optimal assessment approach for the online and remote collection of speech in the context of assessing the psychosis spectrum, and (2) evaluate whether a fully automated, speech-based machine learning (ML) pipeline can discriminate among different conditions on the schizophrenia-bipolar spectrum (SSD-BD-SPE), help-seeking comparison subjects (MDD), and healthy controls (HC) at varying layers of analysis and diagnostic complexity.

Methods

We adopted online data collection methods to collect 20 minutes of speech and demographic information from individuals. Participants were categorized as "healthy" help-seekers (HC), having a schizophrenia-spectrum disorder (SSD), bipolar disorder (BD), major depressive disorder (MDD), or being on the psychosis spectrum with sub-clinical psychotic experiences (SPE). SPE status was determined based on self-reported clinical diagnosis and responses to the PHQ-8 and PQ-16 screening questionnaires, while other diagnoses were determined based on self-report from participants. Linguistic and paralinguistic features were extracted and ensemble learning algorithms (e.g., XGBoost) were used to train models. A 70%-30% train-test split and 30-fold cross-validation were used to validate the model performance.

Results

The final analysis sample included 1140 individuals and 22,650 minutes of speech. Using 5 minutes of speech, our model could discriminate between HC and those with a serious mental illness (SSD or BD) with 86% accuracy (AUC = 0.91, Recall = 0.7, Precision = 0.98). Furthermore, our model could discern among HC, SPE, BD and SSD groups with 86% accuracy (F1 macro = 0.855, Recall Macro = 0.86, Precision Macro = 0.86). Finally, in a 5-class discrimination task including individuals with MDD, our model had 76% accuracy (F1 macro = 0.757, Recall Macro = 0.758, Precision Macro = 0.766).

Conclusion

Our ML pipeline demonstrated disorder-specific learning, achieving excellent or good accuracy across several classification tasks. We demonstrated that the screening of mental disorders is possible via a fully automated, remote speech assessment pipeline. We tested our model on a relatively high number of conditions (5 classes) compared with the literature and in a stratified sample of the psychosis spectrum, including HC, SPE, SSD and BD (4 classes). We tested our model on a large sample (N = 1150) and demonstrated best-in-class accuracy with remotely collected speech data in the psychosis spectrum; however, further clinical validation is needed to test the reliability of model performance.
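The macro-averaged metrics quoted in the Results above (F1 macro, Recall Macro, Precision Macro) weight every class equally, regardless of how many participants each class has. The following is a minimal sketch of how such metrics can be computed for a multi-class task. The function name `macro_metrics` and the toy labels are hypothetical illustrations, not the study's data or code.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Return (accuracy, macro precision, macro recall, macro F1).

    Hypothetical helper for illustration; it is not taken from the study.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    precisions, recalls, f1s = [], [], []
    for c in labels:
        prec = tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0
        rec = tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(f1)

    n = len(labels)
    accuracy = sum(tp.values()) / len(y_true)
    # Macro averaging: unweighted mean of the per-class scores.
    return accuracy, sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


# Toy labels (made up) using three of the group abbreviations from the abstract.
y_true = ["HC", "HC", "SSD", "SSD", "BD", "BD"]
y_pred = ["HC", "SSD", "SSD", "SSD", "BD", "HC"]
accuracy, macro_p, macro_r, macro_f1 = macro_metrics(y_true, y_pred)
```

Because the per-class scores are averaged without weighting, a small class that is classified poorly lowers the macro scores as much as a large one would, which is why macro F1 can differ from overall accuracy.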

Luo Q, Di Y, Zhu T. Predictive modeling of neuroticism in depressed and non-depressed cohorts using voice features. J Affect Disord 2024;352:395-402. [PMID: 38342318 DOI: 10.1016/j.jad.2024.02.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/30/2024] [Accepted: 02/07/2024] [Indexed: 02/13/2024]

Olah J, Spencer T, Cummins N, Diederen K. Automated analysis of speech as a marker of sub-clinical psychotic experiences. Front Psychiatry 2024;14:1265880. [PMID: 38361830 PMCID: PMC10867252 DOI: 10.3389/fpsyt.2023.1265880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 12/22/2023] [Indexed: 02/17/2024] Open

Aziz D, Dávid S. Multitask and Transfer Learning Approach for Joint Classification and Severity Estimation of Dysphonia. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2023;12:233-244. [PMID: 38196819 PMCID: PMC10776101 DOI: 10.1109/jtehm.2023.3340345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/30/2023] [Accepted: 12/04/2023] [Indexed: 01/11/2024]

Abstract

OBJECTIVE

Beyond being the primary communication medium, speech carries valuable information about a speaker's health, emotions, and identity. Various conditions can affect the vocal organs, leading to speech difficulties. Extensive research has been conducted by voice clinicians and academia in speech analysis. Previous approaches primarily focused on one particular task, such as differentiating between normal and dysphonic speech, classifying different voice disorders, or estimating the severity of voice disorders.

METHODS AND PROCEDURES

This study proposes an approach that combines transfer learning and multitask learning (MTL) to simultaneously perform dysphonia classification and severity estimation. Both tasks use a shared representation; the network is learned from these shared features. We employed five computer vision models and changed their architecture to support multitask learning. Additionally, we conducted binary 'healthy vs. dysphonia' and multiclass 'healthy vs. organic and functional dysphonia' classification using multitask learning, with the speaker's sex as an auxiliary task.

RESULTS

The proposed method achieved improved performance across all classification metrics compared to single-task learning (STL), which only performs classification or severity estimation. Specifically, the model achieved F1 scores of 93% and 90% in MTL and STL, respectively. Moreover, we observed considerable improvements in both classification tasks by evaluating beta values associated with the weight assigned to the sex-predicting auxiliary task. MTL achieved an accuracy of 77% compared to the STL score of 73.2%. However, the performance of severity estimation in MTL was comparable to STL.

CONCLUSION

Our goal is to improve how voice pathologists and clinicians understand patients' conditions, make it easier to track their progress, and enhance the monitoring of vocal quality and treatment procedures. Clinical and Translational Impact Statement: By integrating both classification and severity estimation of dysphonia using multitask learning, we aim to enable clinicians to gain a better understanding of the patient's situation, effectively monitor their progress and voice quality.

Gomez-Zaragoza L, Marin-Morales J, Vargas EP, Giglioli IAC, Raya MA. An Online Attachment Style Recognition System Based on Voice and Machine Learning. IEEE J Biomed Health Inform 2023;27:5576-5587. [PMID: 37566508 DOI: 10.1109/jbhi.2023.3304369] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/13/2023]

Sprotte Y. Computerized text and voice analysis of patients with chronic schizophrenia in art therapy. Sci Rep 2023;13:16062. [PMID: 37749186 PMCID: PMC10520069 DOI: 10.1038/s41598-023-43069-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/19/2023] [Indexed: 09/27/2023] Open

Abstract

This explorative study of patients with chronic schizophrenia aimed to clarify whether group art therapy followed by a therapist-guided picture review could influence patients' communication behaviour. Data on voice and speech characteristics were obtained via objective technological instruments, and these characteristics were selected as indicators of communication behaviour. Seven patients were recruited to participate in weekly group art therapy over a period of 6 months. Three days after each group meeting, they talked about their last picture during a standardized interview that was digitally recorded. The audio recordings were evaluated using validated computer-assisted procedures, the transcribed texts were evaluated using the German version of the LIWC2015 program, and the voice recordings were evaluated using the audio analysis software VocEmoApI. The dual methodological approach was intended to form an internal control of the study results. An exploratory factor analysis of the complete sets of output parameters was carried out with the expectation of obtaining typical speech and voice characteristics that map barriers to communication in patients with schizophrenia. The parameters of both methods were thus processed into five factors each, i.e., into a quantitative digitized classification of the texts and voices. The factor scores were subjected to a linear regression analysis to capture possible process-related changes. Most patients continued to participate in the study. This resulted in high-quality datasets for statistical analysis. To answer the study question, two results were summarized: First, the text analysis factor called Presence proved to be a potential surrogate parameter for positive language development. Second, quantitative changes in vocal emotional factors were detected, demonstrating differentiated activation patterns of emotions. These results can be interpreted as an expression of a cathartic healing process. The methods presented in this study make a potentially significant contribution to quantitative research into the effectiveness and mode of action of art therapy.

Granrud OE, Rodriguez Z, Cowan T, Masucci MD, Cohen AS. Alogia and pressured speech do not fall on a continuum of speech production using objective speech technologies. Schizophr Res 2023;259:121-126. [PMID: 35864001 DOI: 10.1016/j.schres.2022.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 07/02/2022] [Accepted: 07/04/2022] [Indexed: 10/17/2022]

Olah J, Diederen K, Gibbs-Dean T, Kempton MJ, Dobson R, Spencer T, Cummins N. Online speech assessment of the psychotic spectrum: Exploring the relationship between overlapping acoustic markers of schizotypy, depression and anxiety. Schizophr Res 2023;259:11-19. [PMID: 37080802 DOI: 10.1016/j.schres.2023.03.044] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 03/22/2023] [Accepted: 03/23/2023] [Indexed: 04/22/2023]

Abstract

BACKGROUND

Remote assessment of acoustic alterations in speech holds promise to increase scalability and validity in research across the psychosis spectrum. A feasible first step in establishing a procedure for online assessments is to assess acoustic alterations in psychometric schizotypy. However, to date, the complex relationship between alterations in speech related to schizotypy and those related to comorbid conditions such as symptoms of depression and anxiety has not been investigated. This study tested (1) whether depression, generalized anxiety and high psychometric schizotypy have similar voice characteristics, (2) which acoustic markers of online collected speech are the strongest predictors of psychometric schizotypy, and (3) whether including generalized anxiety and depression symptoms in the model can improve the prediction of schizotypy.

METHODS

We collected cross-sectional, online-recorded speech data from 441 participants, assessing demographics, symptoms of depression, generalized anxiety and psychometric schizotypy.

RESULTS

Speech samples collected online could predict psychometric schizotypy, depression, and anxiety symptoms with weak to moderate predictive power, and with moderate and good predictive power when basic demographic variables were added to the models. Most influential features of these models largely overlapped. The predictive power of speech marker-based models of schizotypy significantly improved after including symptom scores of depression and generalized anxiety in the models (from R² = 0.296 to R² = 0.436).

CONCLUSIONS

Acoustic features of online collected speech are predictive of psychometric schizotypy as well as generalized anxiety and depression symptoms. The acoustic characteristics of schizotypy, depression and anxiety symptoms significantly overlap. Speech models that are designed to predict schizotypy or symptoms of the schizophrenia spectrum might therefore benefit from controlling for symptoms of depression and anxiety.

Tan EJ, Neill E, Kleiner JL, Rossell SL. Depressive symptoms are specifically related to speech pauses in schizophrenia spectrum disorders. Psychiatry Res 2023;321:115079. [PMID: 36716551 DOI: 10.1016/j.psychres.2023.115079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 01/03/2023] [Accepted: 01/25/2023] [Indexed: 01/28/2023]

Castro Martínez JC, Santamaría-García H. Understanding mental health through computers: An introduction to computational psychiatry. Front Psychiatry 2023;14:1092471. [PMID: 36824671 PMCID: PMC9941647 DOI: 10.3389/fpsyt.2023.1092471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 01/16/2023] [Indexed: 02/10/2023] Open

Daniel DG, Cohen AS, Velligan D, Harvey PD, Alphs L, Davidson M, Potter W, Kott A, Schooler N, Brodie CR, Moore RC, Lindenmeyer P, Marder SR. Remote Assessment of Negative Symptoms of Schizophrenia. SCHIZOPHRENIA BULLETIN OPEN 2023;4:sgad001. [PMID: 39145343 PMCID: PMC11207840 DOI: 10.1093/schizbullopen/sgad001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/16/2024]

Gumus M, DeSouza DD, Xu M, Fidalgo C, Simpson W, Robin J. Evaluating the utility of daily speech assessments for monitoring depression symptoms. Digit Health 2023;9:20552076231180523. [PMID: 37426590 PMCID: PMC10328009 DOI: 10.1177/20552076231180523] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 05/19/2023] [Indexed: 07/11/2023] Open

Bambini V, Frau F, Bischetti L, Cuoco F, Bechi M, Buonocore M, Agostoni G, Ferri I, Sapienza J, Martini F, Spangaro M, Bigai G, Cocchi F, Cavallaro R, Bosia M. Deconstructing heterogeneity in schizophrenia through language: a semi-automated linguistic analysis and data-driven clustering approach. SCHIZOPHRENIA (HEIDELBERG, GERMANY) 2022;8:102. [PMID: 36446789 PMCID: PMC9708845 DOI: 10.1038/s41537-022-00306-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 10/24/2022] [Indexed: 06/16/2023]

Affiliation(s)

Valentina Bambini: Department of Humanities and Life Sciences, University School for Advanced Studies IUSS, Pavia, Italy
Federico Frau: Department of Humanities and Life Sciences, University School for Advanced Studies IUSS, Pavia, Italy
Luca Bischetti: Department of Humanities and Life Sciences, University School for Advanced Studies IUSS, Pavia, Italy
Federica Cuoco: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Margherita Bechi: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Mariachiara Buonocore: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Giulia Agostoni: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy; School of Medicine, Vita-Salute San Raffaele University, Milan, Italy
Ilaria Ferri: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Jacopo Sapienza: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy; School of Medicine, Vita-Salute San Raffaele University, Milan, Italy
Francesca Martini: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Marco Spangaro: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Giorgia Bigai: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy; School of Medicine, Vita-Salute San Raffaele University, Milan, Italy
Federica Cocchi: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy
Roberto Cavallaro: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy; School of Medicine, Vita-Salute San Raffaele University, Milan, Italy
Marta Bosia: Department of Clinical Neurosciences, IRCCS San Raffaele Scientific Institute, Milan, Italy; School of Medicine, Vita-Salute San Raffaele University, Milan, Italy

Cohen AS, Rodriguez Z, Warren KK, Cowan T, Masucci MD, Edvard Granrud O, Holmlund TB, Chandler C, Foltz PW, Strauss GP. Natural Language Processing and Psychosis: On the Need for Comprehensive Psychometric Evaluation. Schizophr Bull 2022;48:939-948. [PMID: 35738008 PMCID: PMC9434462 DOI: 10.1093/schbul/sbac051] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Abstract

BACKGROUND AND HYPOTHESIS

Despite decades of "proof of concept" findings supporting the use of Natural Language Processing (NLP) in psychosis research, clinical implementation has been slow. One obstacle reflects the lack of comprehensive psychometric evaluation of these measures. There is overwhelming evidence that criterion and content validity can be achieved for many purposes, particularly using machine learning procedures. However, there has been very little evaluation of test-retest reliability, divergent validity (sufficient to address concerns of a "generalized deficit"), and potential biases from demographics and other individual differences.

STUDY DESIGN

This article highlights these concerns in development of an NLP measure for tracking clinically rated paranoia from video "selfies" recorded from smartphone devices. Patients with schizophrenia or bipolar disorder were recruited and tracked over a week-long epoch. A small NLP-based feature set from 499 language samples was modeled on clinically rated paranoia using regularized regression.

STUDY RESULTS

While test-retest reliability was high, criterion and convergent/divergent validity were only achieved when considering moderating variables, notably whether a patient was away from home, around strangers, or alone at the time of the recording. Moreover, there were systematic racial and sex biases in the model, in part reflecting whether patients submitted videos when they were away from home, around strangers, or alone.

CONCLUSIONS

Advancing NLP measures for psychosis will require deliberate consideration of test-retest reliability, divergent validity, systematic biases and the potential role of moderators. In our example, a comprehensive psychometric evaluation revealed clear strengths and weaknesses that can be systematically addressed in future research.

Who does what to whom? graph representations of action-predication in speech relate to psychopathological dimensions of psychosis. SCHIZOPHRENIA (HEIDELBERG, GERMANY) 2022;8:58. [PMID: 35853912 PMCID: PMC9261087 DOI: 10.1038/s41537-022-00263-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 06/01/2022] [Indexed: 11/09/2022]

Hu HX, Lau WYS, Ma EPY, Hung KSY, Chen SY, Cheng KS, Cheung EFC, Lui SSY, Chan RCK. The Important Role of Motivation and Pleasure Deficits on Social Functioning in Patients With Schizophrenia: A Network Analysis. Schizophr Bull 2022;48:860-870. [PMID: 35524755 PMCID: PMC9212088 DOI: 10.1093/schbul/sbac017] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Cohen AS, Cox CR, Cowan T, Masucci MD, Le TP, Docherty AR, Bedwell JS. High Predictive Accuracy of Negative Schizotypy With Acoustic Measures. Clin Psychol Sci 2022;10:310-323. [PMID: 38031625 PMCID: PMC10686546 DOI: 10.1177/21677026211017835] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/01/2023]

Nahar JK, Lopez-Jimenez F. Utilizing Conversational Artificial Intelligence, Voice, and Phonocardiography Analytics in Heart Failure Care. Heart Fail Clin 2022;18:311-323. [DOI: 10.1016/j.hfc.2021.11.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Hitczenko K, Cowan HR, Goldrick M, Mittal VA. Racial and Ethnic Biases in Computational Approaches to Psychopathology. Schizophr Bull 2022;48:285-288. [PMID: 34729605 PMCID: PMC8886581 DOI: 10.1093/schbul/sbab131] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Birnbaum ML, Abrami A, Heisig S, Ali A, Arenare E, Agurto C, Lu N, Kane JM, Cecchi G. Acoustic and Facial Features From Clinical Interviews for Machine Learning-Based Psychiatric Diagnosis: Algorithm Development. JMIR Ment Health 2022;9:e24699. [PMID: 35072648 PMCID: PMC8822433 DOI: 10.2196/24699] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 04/29/2021] [Accepted: 12/01/2021] [Indexed: 01/26/2023] Open

Abstract

BACKGROUND

In contrast to all other areas of medicine, psychiatry is still nearly entirely reliant on subjective assessments such as patient self-report and clinical observation. The lack of objective information on which to base clinical decisions can contribute to reduced quality of care. Behavioral health clinicians need objective and reliable patient data to support effective targeted interventions.

OBJECTIVE

We aimed to investigate whether reliable inferences (psychiatric signs, symptoms, and diagnoses) can be extracted from audiovisual patterns in recorded evaluation interviews of participants with schizophrenia spectrum disorders and bipolar disorder.

METHODS

We obtained audiovisual data from 89 participants (mean age 25.3 years; male: 48/89, 53.9%; female: 41/89, 46.1%): individuals with schizophrenia spectrum disorders (n=41), individuals with bipolar disorder (n=21), and healthy volunteers (n=27). We developed machine learning models based on acoustic and facial movement features extracted from participant interviews to predict diagnoses and detect clinician-coded neuropsychiatric symptoms, and we assessed model performance using area under the receiver operating characteristic curve (AUROC) in 5-fold cross-validation.

RESULTS

The model successfully differentiated between schizophrenia spectrum disorders and bipolar disorder (AUROC 0.73) when aggregating face and voice features. Facial action units including cheek-raising muscle (AUROC 0.64) and chin-raising muscle (AUROC 0.74) provided the strongest signal for men. Vocal features, such as energy in the frequency band 1 to 4 kHz (AUROC 0.80) and spectral harmonicity (AUROC 0.78), provided the strongest signal for women. Lip corner-pulling muscle signal discriminated between diagnoses for both men (AUROC 0.61) and women (AUROC 0.62). Several psychiatric signs and symptoms were successfully inferred: blunted affect (AUROC 0.81), avolition (AUROC 0.72), lack of vocal inflection (AUROC 0.71), asociality (AUROC 0.63), and worthlessness (AUROC 0.61).

CONCLUSIONS

This study represents advancement in efforts to capitalize on digital data to improve diagnostic assessment and supports the development of a new generation of innovative clinical tools by employing acoustic and facial data analysis.
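The AUROC values reported in the abstract above can be read as the probability that a randomly chosen positive case receives a higher model score than a randomly chosen negative case. The sketch below computes that quantity directly by comparing all positive-negative pairs. The function name `auroc` and the toy scores are hypothetical, and the cross-validation step described in the Methods is not reproduced here.

```python
def auroc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 0/1 class labels (1 = positive).
    scores: model scores, higher meaning "more likely positive".
    Hypothetical helper for illustration only.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Toy example with made-up scores: 8 of the 9 positive-negative pairs
# are ordered correctly.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
toy_auc = auroc(toy_labels, toy_scores)
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation. The pairwise loop is quadratic in the number of cases, so it suits small samples; rank-based formulas are the usual choice for larger ones.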

Ferrer-I-Cancho R, Gómez-Rodríguez C, Esteban JL, Alemany-Puig L. Optimality of syntactic dependency distances. Phys Rev E 2022;105:014308. [PMID: 35193296 DOI: 10.1103/physreve.105.014308] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 11/10/2021] [Indexed: 06/14/2023]

Tan EJ, Meyer D, Neill E, Rossell SL. Investigating the diagnostic utility of speech patterns in schizophrenia and their symptom associations. Schizophr Res 2021;238:91-98. [PMID: 34649084 DOI: 10.1016/j.schres.2021.10.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 09/19/2021] [Accepted: 10/03/2021] [Indexed: 12/13/2022]

Fagherazzi G, Fischer A, Ismael M, Despotovic V. Voice for Health: The Use of Vocal Biomarkers from Research to Clinical Practice. Digit Biomark 2021;5:78-88. [PMID: 34056518 PMCID: PMC8138221 DOI: 10.1159/000515346] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 02/18/2021] [Indexed: 12/17/2022] Open

Cohen AS, Cox CR, Tucker RP, Mitchell KR, Schwartz EK, Le TP, Foltz PW, Holmlund TB, Elvevåg B. Validating Biobehavioral Technologies for Use in Clinical Psychiatry. Front Psychiatry 2021;12:503323. [PMID: 34177631 PMCID: PMC8225932 DOI: 10.3389/fpsyt.2021.503323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 05/11/2021] [Indexed: 11/14/2022] Open

Abstract

The last decade has witnessed the development of sophisticated biobehavioral and genetic, ambulatory, and other measures that promise unprecedented insight into psychiatric disorders. As yet, clinical sciences have struggled with implementing these objective measures and they have yet to move beyond "proof of concept." In part, this struggle reflects a traditional, and conceptually flawed, application of traditional psychometrics (i.e., reliability and validity) for evaluating them. This paper focuses on "resolution," concerning the degree to which changes in a signal can be detected and quantified, which is central to measurement evaluation in informatics, engineering, computational and biomedical sciences. We define and discuss resolution in terms of traditional reliability and validity evaluation for psychiatric measures, then highlight its importance in a study using acoustic features to predict self-injurious thoughts/behaviors (SITB). This study involved tracking natural language and self-reported symptoms in 124 psychiatric patients: (a) over 5-14 recording sessions, collected using a smart phone application, and (b) during a clinical interview. Importantly, the scope of these measures varied as a function of time (minutes, weeks) and spatial setting (i.e., smart phone vs. interview). Regarding reliability, acoustic features were temporally unstable until we specified the level of temporal/spatial resolution. Regarding validity, accuracy based on machine learning of acoustic features predicting SITB varied as a function of resolution. High accuracy was achieved (i.e., ~87%), but only when the acoustic and SITB measures were "temporally-matched" in resolution was the model generalizable to new data. Unlocking the potential of biobehavioral technologies for clinical psychiatry will require careful consideration of resolution.

Kelly DL, Spaderna M, Hodzic V, Nair S, Kitchen C, Werkheiser AE, Powell MM, Liu F, Coppersmith G, Chen S, Resnik P. Blinded Clinical Ratings of Social Media Data are Correlated with In-Person Clinical Ratings in Participants Diagnosed with Either Depression, Schizophrenia, or Healthy Controls. Psychiatry Res 2020;294:113496. [PMID: 33065372 DOI: 10.1016/j.psychres.2020.113496] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 05/13/2020] [Accepted: 10/01/2020] [Indexed: 12/16/2022]

Argolo F, Magnavita G, Mota NB, Ziebold C, Mabunda D, Pan PM, Zugman A, Gadelha A, Corcoran C, Bressan RA. Lowering costs for large-scale screening in psychosis: a systematic review and meta-analysis of performance and value of information for speech-based psychiatric evaluation. Revista Brasileira de Psiquiatria (Sao Paulo, Brazil : 1999) 2020;42:673-686. [PMID: 32321060 PMCID: PMC7678898 DOI: 10.1590/1516-4446-2019-0722] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 09/23/2019] [Accepted: 01/23/2020] [Indexed: 11/22/2022]

Robin J, Harrison JE, Kaufman LD, Rudzicz F, Simpson W, Yancheva M. Evaluation of Speech-Based Digital Biomarkers: Review and Recommendations. Digit Biomark 2020;4:99-108. [PMID: 33251474 DOI: 10.1159/000510820] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Received: 06/12/2020] [Accepted: 08/11/2020] [Indexed: 12/23/2022]

Cohen AS, Cox CR, Le TP, Cowan T, Masucci MD, Strauss GP, Kirkpatrick B. Using machine learning of computerized vocal expression to measure blunted vocal affect and alogia. NPJ Schizophrenia 2020;6:26. [PMID: 32978400 PMCID: PMC7519104 DOI: 10.1038/s41537-020-00115-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Received: 03/06/2020] [Accepted: 08/06/2020] [Indexed: 11/16/2022]

Digital Phenotyping Using Multimodal Data. Curr Behav Neurosci Rep 2020. [DOI: 10.1007/s40473-020-00215-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 12/26/2022]

Makowski C, Lewis JD, Lepage C, Malla AK, Joober R, Evans AC, Lepage M. Intersection of verbal memory and expressivity on cortical contrast and thickness in first episode psychosis. Psychol Med 2020;50:1923-1936. [PMID: 31456533 DOI: 10.1017/s0033291719002071] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 01/13/2023]

Abstract

BACKGROUND

Longitudinal studies of first episode of psychosis (FEP) patients are critical to understanding the dynamic clinical factors influencing functional outcomes; negative symptoms and verbal memory (VM) deficits are two such factors that remain a therapeutic challenge. This study uses white-gray matter contrast at the inner edge of the cortex, in addition to cortical thickness, to probe changes in microstructure and their relation with negative symptoms and possible intersections with verbal memory.

METHODS

T1-weighted images and clinical data were collected longitudinally for patients (N = 88) over a two-year period. Cognitive data were also collected at baseline. Relationships between baseline VM (immediate/delayed recall) and rate of change in two negative symptom dimensions, amotivation and expressivity, were assessed at the behavioral level, as well as at the level of brain structure.

RESULTS

VM, particularly immediate recall, was significantly and positively associated with a steeper rate of expressivity symptom decline (r = 0.32, q = 0.012). Significant interaction effects between baseline delayed recall and change in expressivity were uncovered in somatomotor regions bilaterally for both white-gray matter contrast and cortical thickness. Furthermore, interaction effects between immediate recall and change in expressivity on cortical thickness rates were uncovered across higher-order regions of the language processing network.

CONCLUSIONS

This study shows common neural correlates of language-related brain areas underlying expressivity and VM in FEP, suggesting deficits in these domains may be more linked to speech production rather than general cognitive capacity. Together, white-gray matter contrast and cortical thickness may optimally inform clinical investigations aiming to capture peri-cortical microstructural changes.

Cohen AS, Cowan T, Le TP, Schwartz EK, Kirkpatrick B, Raugh IM, Chapman HC, Strauss GP. Ambulatory digital phenotyping of blunted affect and alogia using objective facial and vocal analysis: Proof of concept. Schizophr Res 2020;220:141-146. [PMID: 32247747 PMCID: PMC7306442 DOI: 10.1016/j.schres.2020.03.043] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Received: 05/12/2019] [Revised: 01/10/2020] [Accepted: 03/21/2020] [Indexed: 11/28/2022]

Agurto C, Cecchi GA, Norel R, Ostrand R, Kirkpatrick M, Baggott MJ, Wardle MC, de Wit H, Bedi G. Detection of acute 3,4-methylenedioxymethamphetamine (MDMA) effects across protocols using automated natural language processing. Neuropsychopharmacology 2020;45:823-832. [PMID: 31978933 PMCID: PMC7075895 DOI: 10.1038/s41386-020-0620-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 08/23/2019] [Revised: 11/28/2019] [Accepted: 01/08/2020] [Indexed: 11/17/2022]

Abstract

The detection of changes in mental states such as those caused by psychoactive drugs relies on clinical assessments that are inherently subjective. Automated speech analysis may represent a novel method to detect objective markers, which could help improve the characterization of these mental states. In this study, we employed computer-extracted speech features from multiple domains (acoustic, semantic, and psycholinguistic) to assess mental states after controlled administration of 3,4-methylenedioxymethamphetamine (MDMA) and intranasal oxytocin. The training/validation set comprised within-participants data from 31 healthy adults who, over four sessions, were administered MDMA (0.75, 1.5 mg/kg), oxytocin (20 IU), and placebo in randomized, double-blind fashion. Participants completed two 5-min speech tasks during peak drug effects. Analyses included group-level comparisons of drug conditions and estimation of classification at the individual level within this dataset and on two independent datasets. Promising classification results were obtained to detect drug conditions, achieving cross-validated accuracies of up to 87% in training/validation and 92% in the independent datasets, suggesting that the detected patterns of speech variability are associated with drug consumption. Specifically, we found that oxytocin seems to be mostly driven by changes in emotion and prosody, which are mainly captured by acoustic features. In contrast, mental states driven by MDMA consumption appear to manifest in multiple domains of speech. Furthermore, we find that the experimental task has an effect on the speech response within these mental states, which can be attributed to presence or absence of an interaction with another individual. These results represent a proof-of-concept application of the potential of speech to provide an objective measurement of mental states elicited during intoxication.

Cohen AS, Schwartz E, Le T, Cowan T, Cox C, Tucker R, Foltz P, Holmlund TB, Elvevåg B. Validating digital phenotyping technologies for clinical use: the critical importance of "resolution". World Psychiatry 2020;19:114-115. [PMID: 31922662 PMCID: PMC6953543 DOI: 10.1002/wps.20703] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Indexed: 11/08/2022]

Parola A, Simonsen A, Bliksted V, Fusaroli R. Voice patterns in schizophrenia: A systematic review and Bayesian meta-analysis. Schizophr Res 2020;216:24-40. [PMID: 31839552 DOI: 10.1016/j.schres.2019.11.031] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Received: 04/18/2019] [Revised: 09/13/2019] [Accepted: 11/19/2019] [Indexed: 12/28/2022]

Low DM, Bentley KH, Ghosh SS. Automated assessment of psychiatric disorders using speech: A systematic review. Laryngoscope Investig Otolaryngol 2020;5:96-116. [PMID: 32128436 PMCID: PMC7042657 DOI: 10.1002/lio2.354] [Citation(s) in RCA: 156] [Impact Index Per Article: 39.0] [Received: 09/05/2019] [Revised: 12/31/2019] [Accepted: 01/17/2020] [Indexed: 12/31/2022]

Abstract

OBJECTIVE

There are many barriers to accessing mental health assessments including cost and stigma. Even when individuals receive professional care, assessments are intermittent and may be limited partly due to the episodic nature of psychiatric symptoms. Therefore, machine-learning technology using speech samples obtained in the clinic or remotely could one day be a biomarker to improve diagnosis and treatment. To date, reviews have only focused on using acoustic features from speech to detect depression and schizophrenia. Here, we present the first systematic review of studies using speech for automated assessments across a broader range of psychiatric disorders.

METHODS

We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines. We included studies from the last 10 years using speech to identify the presence or severity of disorders within the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). For each study, we describe sample size, clinical evaluation method, speech-eliciting tasks, machine learning methodology, performance, and other relevant findings.

RESULTS

A total of 1395 studies were screened, of which 127 met the inclusion criteria. The majority of studies were on depression, schizophrenia, and bipolar disorder, and the remainder on post-traumatic stress disorder, anxiety disorders, and eating disorders. Of the included studies, 63% built machine learning predictive models, and the remaining 37% performed null-hypothesis testing only. We provide an online database with our search results and synthesize how acoustic features appear in each disorder.

CONCLUSION

Speech processing technology could aid mental health assessments, but there are many obstacles to overcome, especially the need for comprehensive transdiagnostic and longitudinal studies. Given the diverse types of data sets, feature extraction, computational methodologies, and evaluation criteria, we provide guidelines for both acquiring data and building machine learning models with a focus on testing hypotheses, open science, reproducibility, and generalizability.

LEVEL OF EVIDENCE

3a.

Arevian AC, Bone D, Malandrakis N, Martinez VR, Wells KB, Miklowitz DJ, Narayanan S. Clinical state tracking in serious mental illness through computational analysis of speech. PLoS One 2020;15:e0225695. [PMID: 31940347 PMCID: PMC6961853 DOI: 10.1371/journal.pone.0225695] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 12/19/2017] [Accepted: 11/11/2019] [Indexed: 11/19/2022]

Abstract

Individuals with serious mental illness experience changes in their clinical states over time that are difficult to assess and that result in increased disease burden and care utilization. It is not known if features derived from speech can serve as a transdiagnostic marker of these clinical states. This study evaluates the feasibility of collecting speech samples from people with serious mental illness and explores the potential utility for tracking changes in clinical state over time. Patients (n = 47) were recruited from a community-based mental health clinic with diagnoses of bipolar disorder, major depressive disorder, schizophrenia or schizoaffective disorder. Patients used an interactive voice response system for at least 4 months to provide speech samples. Clinic providers (n = 13) reviewed responses and provided global assessment ratings. We computed features of speech and used machine learning to create models of outcome measures trained using either population data or an individual's own data over time. The system was feasible to use, recording 1101 phone calls and 117 hours of speech. Most (92%) of the patients agreed that it was easy to use. The individually-trained models demonstrated the highest correlation with provider ratings (rho = 0.78, p < 0.001). Population-level models demonstrated statistically significant correlations with provider global assessment ratings (rho = 0.44, p < 0.001), future provider ratings (rho = 0.33, p < 0.05), BASIS-24 summary score, depression sub score, and self-harm sub score (rho = 0.25, 0.25, and 0.28, respectively; p < 0.05), and the SF-12 mental health sub score (rho = 0.25, p < 0.05), but not with other BASIS-24 or SF-12 sub scores. This study brings together longitudinal collection of objective behavioral markers along with a transdiagnostic, personalized approach for tracking of mental health clinical state in a community-based clinical setting.

Lundin NB, Hochheiser J, Minor KS, Hetrick WP, Lysaker PH. Piecing together fragments: Linguistic cohesion mediates the relationship between executive function and metacognition in schizophrenia. Schizophr Res 2020;215:54-60. [PMID: 31784337 PMCID: PMC8106973 DOI: 10.1016/j.schres.2019.11.032] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 04/07/2019] [Revised: 08/24/2019] [Accepted: 11/19/2019] [Indexed: 12/28/2022]

Abstract

Speech disturbances are prevalent in psychosis. These may arise in part from executive function impairment, as research suggests that inhibition and monitoring are associated with production of cohesive discourse. However, it is not yet understood how linguistic and executive function impairments in psychosis interact with disrupted metacognition, or deficits in the ability to integrate information to form a complex sense of oneself and others and use that synthesis to respond to psychosocial challenges. Whereas discourse studies have historically employed manual hand-coding techniques, automated computational tools can characterize deep semantic structures that may be closely linked with metacognition. In the present study, we examined whether higher executive functioning promotes metacognition by way of altering linguistic cohesion. Ninety-four individuals with schizophrenia-spectrum disorders provided illness narratives and completed an executive function task battery (Delis-Kaplan Executive Function System). We assessed the narratives for linguistic cohesion (Coh-Metrix 3.0) and metacognitive capacity (Metacognition Assessment Scale - Abbreviated). Selected linguistic indices measured the frequency of connections between causal and intentional content (deep cohesion), word and theme overlap (referential cohesion), and unique word usage (lexical diversity). In path analyses using bootstrapped confidence intervals, we found that deep cohesion and lexical diversity independently mediated the relationship between executive functioning and metacognitive capacity. Findings suggest that executive control abilities support integration of mental experiences by way of increasing causal, goal-driven speech and word expression in individuals with schizophrenia. Metacognitive-based therapeutic interventions for psychosis may promote insight and recovery in part by scaffolding use of language that links ideas together.

Wang J, Zhang L, Liu T, Pan W, Hu B, Zhu T. Acoustic differences between healthy and depressed people: a cross-situation study. BMC Psychiatry 2019;19:300. [PMID: 31615470 PMCID: PMC6794822 DOI: 10.1186/s12888-019-2300-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Received: 08/06/2018] [Accepted: 09/20/2019] [Indexed: 11/29/2022]

Abstract

BACKGROUND

Abnormalities in vocal expression during a depressed episode have frequently been reported in people with depression, but less is known about whether these abnormalities exist only in special situations. In addition, the impacts of irrelevant demographic variables on voice were uncontrolled in previous studies. Therefore, this study compares the vocal differences between depressed and healthy people under various situations, with irrelevant variables treated as covariates.

METHODS

To examine whether the vocal abnormalities in people with depression exist only in special situations, this study compared the vocal differences between healthy people and patients with unipolar depression in 12 situations (speech scenarios). Positive, negative and neutral voice expressions between depressed and healthy people were compared in four tasks. Multivariate analysis of covariance (MANCOVA) was used to evaluate the main effect of group (depressed vs. healthy) on acoustic features. The significance of acoustic features was evaluated by both statistical significance and magnitude of effect size.

RESULTS

The results of multivariate analysis of covariance showed that significant differences between the two groups were observed in all 12 speech scenarios. Although significant acoustic features were not the same in different scenarios, we found that three acoustic features (loudness, MFCC5 and MFCC7) were consistently different between people with and without depression with large effect magnitude.

CONCLUSIONS

Vocal differences between depressed and healthy people exist in all 12 scenarios. Acoustic features including loudness, MFCC5 and MFCC7 have the potential to serve as indicators for identifying depression via voice analysis. These findings suggest that depressed people's voices include both situation-specific and cross-situational patterns of acoustic features.

Cohen AS, Fedechko T, Schwartz EK, Le TP, Foltz PW, Bernstein J, Cheng J, Rosenfeld E, Elvevåg B. Psychiatric Risk Assessment from the Clinician's Perspective: Lessons for the Future. Community Ment Health J 2019;55:1165-1172. [PMID: 31154587 DOI: 10.1007/s10597-019-00411-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 04/10/2018] [Accepted: 05/13/2019] [Indexed: 01/30/2023]

Minor KS, Willits JA, Marggraf MP, Jones MN, Lysaker PH. Measuring disorganized speech in schizophrenia: automated analysis explains variance in cognitive deficits beyond clinician-rated scales. Psychol Med 2019;49:440-448. [PMID: 29692287 DOI: 10.1017/s0033291718001046] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Indexed: 12/31/2022]

Abstract

BACKGROUND

Conveying information cohesively is an essential element of communication that is disrupted in schizophrenia. These disruptions are typically expressed through disorganized symptoms, which have been linked to neurocognitive, social cognitive, and metacognitive deficits. Automated analysis can objectively assess disorganization within sentences, between sentences, and across paragraphs by comparing explicit communication to a large text corpus.

METHOD

Little work in schizophrenia has tested: (1) links between disorganized symptoms measured via automated analysis and neurocognition, social cognition, or metacognition; and (2) if automated analysis explains incremental variance in cognitive processes beyond clinician-rated scales. Disorganization was measured in schizophrenia (n = 81) with Coh-Metrix 3.0, an automated program that calculates basic and complex language indices. Trained staff also assessed neurocognition, social cognition, metacognition, and clinician-rated disorganization.

RESULTS

Findings showed that all three cognitive processes were significantly associated with at least one automated index of disorganization. When automated analysis was compared with a clinician-rated scale, it accounted for significant variance in neurocognition and metacognition beyond the clinician-rated measure. When combined, these two methods explained 28-31% of the variance in neurocognition, social cognition, and metacognition.

CONCLUSIONS

This study illustrated how automated analysis can highlight the specific role of disorganization in neurocognition, social cognition, and metacognition. Generally, those with poor cognition also displayed more disorganization in their speech, making it difficult for listeners to process essential information needed to tie the speaker's ideas together. Our findings showcase how implementing a mixed-methods approach in schizophrenia can explain substantial variance in cognitive processes.

Ratana R, Sharifzadeh H, Krishnan J, Pang S. A Comprehensive Review of Computational Methods for Automatic Prediction of Schizophrenia With Insight Into Indigenous Populations. Front Psychiatry 2019;10:659. [PMID: 31607962 PMCID: PMC6759015 DOI: 10.3389/fpsyt.2019.00659] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 03/14/2019] [Accepted: 08/15/2019] [Indexed: 01/13/2023]

de Boer J, Voppel A, Begemann M, Schnack H, Wijnen F, Sommer I. Clinical use of semantic space models in psychiatry and neurology: A systematic review and meta-analysis. Neurosci Biobehav Rev 2018;93:85-92. [DOI: 10.1016/j.neubiorev.2018.06.008] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Received: 03/14/2018] [Revised: 06/07/2018] [Accepted: 06/07/2018] [Indexed: 01/17/2023]

Evidence of disturbances of deep levels of semantic cohesion within personal narratives in schizophrenia. Schizophr Res 2018;197:365-369. [PMID: 29153448 DOI: 10.1016/j.schres.2017.11.014] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Received: 02/09/2017] [Revised: 10/17/2017] [Accepted: 11/10/2017] [Indexed: 12/24/2022]

Pauselli L, Halpern B, Cleary SD, Ku BS, Covington MA, Compton MT. Computational linguistic analysis applied to a semantic fluency task to measure derailment and tangentiality in schizophrenia. Psychiatry Res 2018;263:74-79. [PMID: 29502041 PMCID: PMC6048590 DOI: 10.1016/j.psychres.2018.02.037] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Received: 06/02/2017] [Revised: 12/18/2017] [Accepted: 02/16/2018] [Indexed: 12/31/2022]

Discriminant document embeddings with an extreme learning machine for classifying clinical narratives. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2017.01.117] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Indexed: 11/20/2022]

Semantic coherence in psychometric schizotypy: An investigation using Latent Semantic Analysis. Psychiatry Res 2018;259:63-67. [PMID: 29028526 DOI: 10.1016/j.psychres.2017.09.078] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 06/26/2016] [Revised: 05/23/2017] [Accepted: 09/25/2017] [Indexed: 12/30/2022]

Cohen AS, Mitchell KR, Strauss GP, Blanchard JJ, Buchanan RW, Kelly DL, Gold J, McMahon RP, Adams HA, Carpenter WT. The effects of oxytocin and galantamine on objectively-defined vocal and facial expression: Data from the CIDAR study. Schizophr Res 2017;188:141-143. [PMID: 28130004 PMCID: PMC5524598 DOI: 10.1016/j.schres.2017.01.028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 11/10/2016] [Revised: 01/17/2017] [Accepted: 01/18/2017] [Indexed: 11/29/2022]