1. Thoret E, Andrillon T, Gauriau C, Léger D, Pressnitzer D. Sleep deprivation detected by voice analysis. PLoS Comput Biol 2024;20:e1011849. PMID: 38315733; PMCID: PMC10890756; DOI: 10.1371/journal.pcbi.1011849.
Abstract
Sleep deprivation has an ever-increasing impact on individuals and societies. Yet, to date, there is no quick and objective test for sleep deprivation. Here, we used automated acoustic analyses of the voice to detect sleep deprivation. Building on current machine-learning approaches, we focused on interpretability by introducing two novel ideas: the use of a fully generic auditory representation as input feature space, combined with an interpretation technique based on reverse correlation. The auditory representation consisted of a spectro-temporal modulation analysis derived from neurophysiology. The interpretation method aimed to reveal the regions of the auditory representation that supported the classifiers' decisions. Results showed that generic auditory features could be used to detect sleep deprivation successfully, with an accuracy comparable to state-of-the-art speech features. Furthermore, the interpretation revealed two distinct effects of sleep deprivation on the voice: changes in slow temporal modulations related to prosody and changes in spectral features related to voice quality. Importantly, the relative balance of the two effects varied widely across individuals, even though the amount of sleep deprivation was controlled, thus confirming the need to characterize sleep deprivation at the individual level. Moreover, while the prosody factor correlated with subjective sleepiness reports, the voice quality factor did not, consistent with the presence of both explicit and implicit consequences of sleep deprivation. Overall, the findings show that individual effects of sleep deprivation may be observed in vocal biomarkers. Future investigations correlating such markers with objective physiological measures of sleep deprivation could enable "sleep stethoscopes" for the cost-effective diagnosis of the individual effects of sleep deprivation.
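The reverse-correlation interpretation described above can be illustrated with a minimal sketch: randomly mask parts of the input feature space, record the classifier's decision score, and credit each feature region by how much its visibility co-varies with the score. Everything below (the function name, the toy classifier, the masking probability) is a hypothetical illustration of the general technique, not the authors' implementation.

```python
import random

def reverse_correlate(classify, n_features, n_trials=2000, keep_prob=0.5, seed=0):
    """Estimate how strongly each feature region supports a classifier's
    decision: randomly mask features, then compare the mean decision score
    when a region is visible against the overall mean score.
    `classify` maps a binary mask (list of 0/1) to a scalar decision score."""
    rng = random.Random(seed)
    sums = [0.0] * n_features   # sum of scores on trials where feature i is visible
    counts = [0] * n_features
    total, n = 0.0, 0
    for _ in range(n_trials):
        mask = [1 if rng.random() < keep_prob else 0 for _ in range(n_features)]
        score = classify(mask)
        total += score
        n += 1
        for i, visible in enumerate(mask):
            if visible:
                sums[i] += score
                counts[i] += 1
    mean = total / n
    # Diagnostic weight: mean score when visible minus overall mean score.
    return [sums[i] / counts[i] - mean if counts[i] else 0.0
            for i in range(n_features)]

# Toy classifier that only "listens" to feature regions 2 and 3.
weights = reverse_correlate(lambda m: float(m[2] + m[3]), n_features=6)
```

Regions the classifier actually uses come out with large positive weights; irrelevant regions hover near zero.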
Affiliation(s)
- Etienne Thoret
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, Paris, France
- Aix-Marseille University, CNRS, Institut de Neurosciences de la Timone (INT) UMR7289, Perception Representation Image Sound Music (PRISM) UMR7061, Laboratoire d’Informatique et Systèmes (LIS) UMR7020, Marseille, France
- Institute of Language Communication and the Brain, Aix-Marseille University, Marseille, France
- Thomas Andrillon
- Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Mov’it team, Inserm, CNRS, Paris, France
- Université Paris Cité, VIFASOM, ERC 7330, Vigilance Fatigue Sommeil et santé publique, Paris, France
- APHP, Hôtel-Dieu, Centre du Sommeil et de la Vigilance, Paris, France
- Caroline Gauriau
- Université Paris Cité, VIFASOM, ERC 7330, Vigilance Fatigue Sommeil et santé publique, Paris, France
- APHP, Hôtel-Dieu, Centre du Sommeil et de la Vigilance, Paris, France
- Damien Léger
- Université Paris Cité, VIFASOM, ERC 7330, Vigilance Fatigue Sommeil et santé publique, Paris, France
- APHP, Hôtel-Dieu, Centre du Sommeil et de la Vigilance, Paris, France
- Daniel Pressnitzer
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, Paris, France
Collapse
|
2
|
Virk JS, Singh M, Singh M, Panjwani U, Ray K. A Multimodal Feature Fusion Framework for Sleep-Deprived Fatigue Detection to Prevent Accidents. SENSORS (BASEL, SWITZERLAND) 2023; 23:4129. [PMID: 37112470 PMCID: PMC10144633 DOI: 10.3390/s23084129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 04/16/2023] [Accepted: 04/18/2023] [Indexed: 06/19/2023]
Abstract
A sleep-deprived, fatigued person is likely to commit more errors, some of which may even prove fatal, so it is necessary to recognize this fatigue. The novelty of the proposed detection approach is that it is nonintrusive and based on multimodal feature fusion. In the proposed methodology, fatigue is detected from features obtained in four domains: visual images, thermal images, keystroke dynamics, and voice. Samples from each volunteer (subject) are obtained in all four domains for feature extraction, and empirical weights are assigned to the four domains. Young, healthy volunteers (n = 60) aged 20 to 30 years participated in the experimental study; they abstained from alcohol, caffeine, and other drugs that could affect their sleep pattern during the study. Through this multimodal technique, appropriate weights are given to the features obtained from the four domains. The results are compared with k-nearest neighbors (kNN), support vector machine (SVM), random tree, random forest, and multilayer perceptron classifiers. The proposed nonintrusive technique obtained an average detection accuracy of 93.33% in 3-fold cross-validation.
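Late fusion with empirically assigned weights, as described above, can be sketched as a weighted average of per-modality scores. The modality names, weights, and decision threshold below are illustrative placeholders, not the values used in the study.

```python
def fuse_fatigue_scores(scores, weights):
    """Weighted late fusion of per-modality fatigue scores in [0, 1].
    `scores` and `weights` are dicts keyed by modality name; weights are
    normalized over the modalities present, so the fused score stays in [0, 1]."""
    common = scores.keys() & weights.keys()
    total_w = sum(weights[k] for k in common)
    return sum(scores[k] * weights[k] for k in common) / total_w

# Hypothetical empirical weights for the four domains (not the paper's values).
weights = {"visual": 0.35, "thermal": 0.25, "keystroke": 0.20, "voice": 0.20}
scores = {"visual": 0.8, "thermal": 0.6, "keystroke": 0.9, "voice": 0.4}

fused = fuse_fatigue_scores(scores, weights)
is_fatigued = fused >= 0.5  # illustrative decision threshold
```

Normalizing over the modalities actually present also lets the fusion degrade gracefully when one sensor is unavailable.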
Affiliation(s)
- Jitender Singh Virk
- EIE Department, Thapar Institute of Engineering and Technology, Patiala 147001, India
- Mandeep Singh
- EIE Department, Thapar Institute of Engineering and Technology, Patiala 147001, India
- Mandeep Singh
- EIE Department, Thapar Institute of Engineering and Technology, Patiala 147001, India
- Usha Panjwani
- DIPAS, Defence Research and Development Organisation, Delhi 110054, India
- Koushik Ray
- DIPAS, Defence Research and Development Organisation, Delhi 110054, India
3. Zhao Q, Fan HZ, Li YL, Liu L, Wu YX, Zhao YL, Tian ZX, Wang ZR, Tan YL, Tan SP. Vocal Acoustic Features as Potential Biomarkers for Identifying/Diagnosing Depression: A Cross-Sectional Study. Front Psychiatry 2022;13:815678. PMID: 35573349; PMCID: PMC9095973; DOI: 10.3389/fpsyt.2022.815678.
Abstract
BACKGROUND At present, there is no established biomarker for the diagnosis of depression, while studies show that acoustic features convey emotional information. This study therefore explored differences in acoustic characteristics between depressed patients and healthy individuals to investigate whether these characteristics can identify depression. METHODS Participants included 71 patients diagnosed with depression at a regional hospital in Beijing, China, and 62 normal controls from the greater community. We assessed the clinical symptoms of depression in all participants using the Hamilton Depression Scale (HAMD), the Hamilton Anxiety Scale (HAMA), and the Patient Health Questionnaire (PHQ-9), and recorded each participant's voice as they read positive, neutral, and negative texts. openSMILE was used to extract acoustic characteristics from the recordings. RESULTS There were significant differences between the depression and control groups in all acoustic characteristics (p < 0.05). Several mel-frequency cepstral coefficients (MFCCs), including MFCC2, MFCC3, MFCC8, and MFCC9, differed significantly between emotion tasks; MFCC4 and MFCC7 correlated positively with PHQ-9 scores, and these correlations were stable across all emotion tasks. The zero-crossing rate in the positive-emotion task correlated positively with the HAMA total score and the HAMA somatic anxiety score (r = 0.31 and r = 0.34, respectively), and MFCC9 in the neutral-emotion task correlated negatively with the HAMD anxiety/somatization score (r = -0.34). Linear regression showed that MFCC7 in the negative-emotion task predicted the PHQ-9 score (β = 0.90, p = 0.01) and that MFCC9 in the neutral-emotion task predicted the HAMD anxiety/somatization score (β = -0.45, p = 0.049). Logistic regression showed a superior discriminant effect, with a discrimination accuracy of 89.66%. CONCLUSION The acoustic expression of emotion among patients with depression differs from that of normal controls. Some acoustic characteristics are related to the severity of depressive symptoms and may serve as objective biomarkers of depression. A systematic method of assessing vocal acoustic characteristics could provide an accurate and discreet means of screening for depression, used instead of, or in conjunction with, traditional screening methods, as it is not subject to the limitations of self-reported assessments, in which subjects may provide socially acceptable rather than truthful responses.
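The feature-score correlations reported above are ordinary Pearson correlations. As a worked illustration (with made-up numbers, not study data), the coefficient can be computed directly:

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical per-participant values: one MFCC summary statistic and the
# corresponding PHQ-9 total (illustrative numbers only).
mfcc7 = [1.2, 0.8, 1.9, 2.4, 0.5, 1.7, 2.1, 0.9]
phq9  = [9,   6,   14,  18,  4,   12,  16,  7]

r = pearson_r(mfcc7, phq9)
```

A coefficient near +1 would indicate that higher values of the acoustic feature track higher depression scores, as reported for MFCC4 and MFCC7.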
Affiliation(s)
- Qing Zhao
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Hong-Zhen Fan
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Yan-Li Li
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Lei Liu
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Ya-Xue Wu
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Yan-Li Zhao
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Zhan-Xiao Tian
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Zhi-Ren Wang
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Yun-Long Tan
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
- Shu-Ping Tan
- Peking University HuiLongGuan Clinical Medical School, Beijing Huilongguan Hospital, Beijing, China
4. Martin VP, Rouas JL, Micoulaud-Franchi JA, Philip P, Krajewski J. How to Design a Relevant Corpus for Sleepiness Detection Through Voice? Front Digit Health 2021;3:686068. PMID: 34713156; PMCID: PMC8521834; DOI: 10.3389/fdgth.2021.686068.
Abstract
This article presents research on the detection, through automatic analysis, of pathologies that affect speech. Voice processing has indeed been used to evaluate several conditions such as Parkinson's disease, Alzheimer's disease, and depression. While some studies present results that seem sufficient for clinical applications, this is not the case for the detection of sleepiness: even two international challenges and the recent advent of deep learning techniques have not managed to change this situation. This article explores the hypothesis that the modest average performance of automatic processing stems from the design of the corpora. To this aim, we first discuss and refine the concept of sleepiness in relation to the ground-truth labels. Second, we present an in-depth study of four corpora, bringing to light the methodological choices that were made and the biases they may have induced. Finally, in light of this information, we propose guidelines for the design of new corpora.
Affiliation(s)
- Vincent P. Martin
- Laboratoire Bordelais de Recherche en Informatique, University of Bordeaux, CNRS–UMR 5800, Bordeaux INP, Talence, France
- Jean-Luc Rouas
- Laboratoire Bordelais de Recherche en Informatique, University of Bordeaux, CNRS–UMR 5800, Bordeaux INP, Talence, France
- Pierre Philip
- Sommeil, Addiction et Neuropsychiatrie, University of Bordeaux, CNRS–USR 3413, CHU Pellegrin, Bordeaux, France
- Jarek Krajewski
- Engineering Psychology, Rhenish University of Applied Science, Cologne, Germany
5. Kaduk SI, Roberts APJ, Stanton NA. The circadian effect on psychophysiological driver state monitoring. Theoretical Issues in Ergonomics Science 2020. DOI: 10.1080/1463922x.2020.1842548.
Affiliation(s)
- Sylwia I. Kaduk
- Human Factors Engineering, Transportation Research Group, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, United Kingdom
- Aaron P. J. Roberts
- Human Factors Engineering, Transportation Research Group, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, United Kingdom
- Neville A. Stanton
- Human Factors Engineering, Transportation Research Group, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, United Kingdom
6. Voleti R, Liss JM, Berisha V. A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders. IEEE Journal of Selected Topics in Signal Processing 2020;14:282-298. PMID: 33907590; PMCID: PMC8074691; DOI: 10.1109/jstsp.2019.2952087.
Abstract
It is widely accepted that information derived from analyzing speech (the acoustic signal) and language production (words and sentences) serves as a useful window into the health of an individual's cognitive ability. In fact, most neuropsychological testing batteries have a component related to speech and language in which clinicians elicit speech from patients for subjective evaluation across a broad set of dimensions. With advances in speech signal processing and natural language processing, there has been recent interest in developing tools to detect more subtle changes in cognitive-linguistic function. This work relies on extracting a set of features from recorded and transcribed speech for objective assessment of speech and language, early diagnosis of neurological disease, and tracking of disease after diagnosis. With an emphasis on cognitive and thought disorders, in this paper we provide a review of existing speech and language features used in this domain, discuss their clinical application, and highlight their advantages and disadvantages. Broadly speaking, the review is split into two categories: language features based on natural language processing and speech features based on speech signal processing. Within each category, we consider features that aim to measure complementary dimensions of cognitive-linguistic function, including language diversity, syntactic complexity, semantic coherence, and timing. We conclude the review with a proposal of new research directions to further advance the field.
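One of the simplest language-diversity features covered by such reviews is the type-token ratio. A minimal sketch (a generic illustration, not code from the review) of plain and windowed variants:

```python
def type_token_ratio(tokens):
    """Lexical diversity: number of unique word types divided by total tokens."""
    return len(set(tokens)) / len(tokens)

def moving_average_ttr(tokens, window=4):
    """Windowed TTR, less sensitive to transcript length than plain TTR:
    average the TTR over every contiguous window of `window` tokens."""
    ttrs = [type_token_ratio(tokens[i:i + window])
            for i in range(len(tokens) - window + 1)]
    return sum(ttrs) / len(ttrs)

sample = "the cat sat on the mat and the dog sat too".lower().split()
ttr = type_token_ratio(sample)          # 8 unique types over 11 tokens
ma_ttr = moving_average_ttr(sample)
```

Plain TTR shrinks as transcripts get longer, which is why length-robust variants such as the windowed average are preferred in practice.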
Affiliation(s)
- Rohit Voleti
- School of Electrical, Computer, & Energy Engineering, Arizona State University, Tempe, AZ, 85281 USA
7. Boyer S, Paubel PV, Ruiz R, El Yagoubi R, Daurat A. Human Voice as a Measure of Mental Load Level. Journal of Speech, Language, and Hearing Research 2018;61:2722-2734. PMID: 30383160; DOI: 10.1044/2018_jslhr-s-18-0066.
Abstract
PURPOSE The aim of this study was to determine a reliable and efficient set of acoustic parameters of the human voice able to estimate individuals' mental load level. Implementing detection methods and real-time analysis of mental load is a major challenge for monitoring and enhancing human task performance, especially during high-risk activities (e.g., flying aircraft). METHOD The voices of 32 participants were recorded during a cognitive task featuring word list recall. The difficulty of the task was manipulated by varying the number of words in each list (i.e., between 1 and 7, corresponding to 7 mental load conditions). Evoked pupillary response, known to be a useful proxy of mental load, was recorded simultaneously with speech to attest variations in mental load level during the experimental task. RESULTS Classic features (fundamental frequency, its standard deviation, number of periods) and original features (frequency modulation and short-term variation in digital amplitude length) of the acoustic signals were predictive of memory load condition. They varied significantly according to the number of words to recall, specifically beyond a threshold of 3-5 words to recall, that is, when memory performance started to decline. CONCLUSIONS Some acoustic parameters of the human voice could be an appropriate and efficient means for detecting mental load levels.
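Several of the classic features mentioned above derive from the fundamental frequency (F0). As a toy sketch only (synthetic tone, illustrative parameter choices, not the authors' analysis pipeline), one standard way to estimate F0 is to locate the autocorrelation peak within a plausible pitch range:

```python
import math

def estimate_f0(samples, sample_rate, fmin=75.0, fmax=400.0):
    """Crude F0 estimate: find the lag with maximum autocorrelation within
    the plausible pitch-period range for speech (fmin..fmax Hz)."""
    lag_min = int(sample_rate / fmax)
    lag_max = int(sample_rate / fmin)
    best_lag, best_corr = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        corr = sum(samples[i] * samples[i + lag]
                   for i in range(len(samples) - lag))
        if corr > best_corr:
            best_corr, best_lag = corr, lag
    return sample_rate / best_lag

# Synthetic 200 Hz tone sampled at 8 kHz (0.2 s of signal).
sr = 8000
tone = [math.sin(2 * math.pi * 200 * n / sr) for n in range(1600)]
f0 = estimate_f0(tone, sr)
```

Real voiced speech needs framing, windowing, and voicing detection on top of this, but the autocorrelation peak is the core of many F0 trackers.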
Affiliation(s)
- Stanislas Boyer
- Cognition, Languages, Language, Ergonomics-Work & Cognition Laboratory (CLLE-LTC), University of Toulouse and Centre National de la Recherche Scientifique, France
- Pierre-Vincent Paubel
- Cognition, Languages, Language, Ergonomics-Work & Cognition Laboratory (CLLE-LTC), University of Toulouse and Centre National de la Recherche Scientifique, France
- Robert Ruiz
- Audiovisual Research Laboratory (LARA), University of Toulouse, France
- Radouane El Yagoubi
- Cognition, Languages, Language, Ergonomics-Work & Cognition Laboratory (CLLE-LTC), University of Toulouse and Centre National de la Recherche Scientifique, France
- Agnès Daurat
- Cognition, Languages, Language, Ergonomics-Work & Cognition Laboratory (CLLE-LTC), University of Toulouse and Centre National de la Recherche Scientifique, France
8. Li L, Ngan CK. A weight-adjusted-voting framework on an ensemble of classifiers for improving sensitivity. Intell Data Anal 2017. DOI: 10.3233/ida-163184.
Affiliation(s)
- Lin Li
- Department of Computer Science and Software Engineering, Seattle University, Seattle, WA 98122, USA
- Chun-Kit Ngan
- Division of Engineering and Information Science, The Pennsylvania State University, Malvern, PA 19355, USA
9. Pattern Recognition Methods and Features Selection for Speech Emotion Recognition System. ScientificWorldJournal 2015;2015:573068. PMID: 26346654; PMCID: PMC4539500; DOI: 10.1155/2015/573068.
Abstract
This paper discusses the impact of the classification method and of feature selection on speech emotion recognition accuracy. Selecting the correct parameters in combination with the classifier is an important step in reducing the computational complexity of the system, which is necessary especially for systems to be deployed in real-time applications. The motivation for developing and improving speech emotion recognition systems is their wide applicability in today's automatic voice-controlled systems. The Berlin database of emotional recordings was used in this experiment. The classification accuracy of artificial neural networks, k-nearest neighbours, and Gaussian mixture models is measured for different selections of prosodic, spectral, and voice quality features. The purpose was to find an optimal combination of methods and feature groups for stress detection in human speech. The research contribution lies in the design of a speech emotion recognition system that balances accuracy and efficiency.
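Of the classifiers compared above, k-nearest neighbours is the easiest to sketch. The toy feature vectors and labels below are hypothetical, chosen only to show the mechanics of the distance-based majority vote:

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among the k nearest training points.
    `train` is a list of (feature_vector, label) pairs."""
    nearest = sorted(train, key=lambda pair: math.dist(pair[0], query))
    votes = Counter(label for _, label in nearest[:k])
    return votes.most_common(1)[0][0]

# Toy 2-D features (e.g., normalized mean F0 and mean energy) with
# hypothetical labels; not data from the Berlin database.
train = [((0.90, 0.80), "stressed"), ((0.80, 0.90), "stressed"),
         ((0.85, 0.75), "stressed"),
         ((0.20, 0.10), "neutral"), ((0.10, 0.20), "neutral"),
         ((0.15, 0.25), "neutral")]

label = knn_predict(train, (0.80, 0.80), k=3)
```

In a real system the feature vectors would be the selected prosodic, spectral, and voice-quality features, and k would be tuned by cross-validation.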
10. Zhou Y, Zhao H, Pan X, Shang L. Deception detecting from speech signal using relevance vector machine and non-linear dynamics features. Neurocomputing 2015. DOI: 10.1016/j.neucom.2014.04.083.
11. Schuller B, Steidl S, Batliner A, Schiel F, Krajewski J, Weninger F, Eyben F. Medium-term speaker states: a review on intoxication, sleepiness and the first challenge. Comput Speech Lang 2014. DOI: 10.1016/j.csl.2012.12.002.
12. Caraty MJ, Montacié C. Vocal fatigue induced by prolonged oral reading: Analysis and detection. Comput Speech Lang 2014. DOI: 10.1016/j.csl.2012.12.003.
13. Montero Benavides A, Fernández Pozo R, Toledano DT, Blanco Murillo JL, López Gonzalo E, Hernández Gómez L. Analysis of voice features related to obstructive sleep apnoea and their application in diagnosis support. Comput Speech Lang 2014. DOI: 10.1016/j.csl.2013.08.002.
14. Sustainable Reduction of Sleepiness through Salutogenic Self-Care Procedure in Lunch Breaks: A Pilot Study. Evidence-Based Complementary and Alternative Medicine 2014;2013:387356. PMID: 24381633; PMCID: PMC3870120; DOI: 10.1155/2013/387356.
Abstract
The aim of the study was to elucidate the immediate, intermediate, and anticipatory sleepiness-reducing effects of a salutogenic self-care procedure, progressive muscle relaxation (PMR), during lunch breaks. A second, exploratory aim was to determine the onset and long-term time course of changes in sleepiness. To evaluate the intraday range and interday change of the proposed relaxation effects, 14 call center agents were assigned to either a daily 20-minute self-administered PMR group or a small-talk (ST) group over a period of seven months. Participants' levels of sleepiness were analyzed in a controlled trial using anticipatory, post-lunchtime, and afternoon changes in sleepiness, as indicated by continuously recorded objective reaction-time measures (16,464 measurements) and by self-reports administered five times per day, once per month (490 measurements). The results indicate that, in comparison with ST, the PMR break (a) induces immediate, intermediate, and anticipatory reductions in sleepiness, and (b) these significant effects appear after one month, with sleepiness continuing to decrease for at least another five months. Although further research is required to identify the specific mediating variables responsible, our results suggest that relaxation-based lunch breaks are both accepted by employees and have a sustained impact on sleepiness.
15. An automated optimal engagement and attention detection system using electrocardiogram. Computational and Mathematical Methods in Medicine 2012;2012:528781. PMID: 22924060; PMCID: PMC3424596; DOI: 10.1155/2012/528781.
Abstract
This research proposes a monitoring system that uses the electrocardiogram (ECG) as a fundamental physiological signal to analyze and predict the presence or absence of cognitive attention in individuals during task execution. The primary focus of this study is to identify the correlation between fluctuating levels of attention and their implications for the cardiac rhythm recorded in the ECG. Furthermore, electroencephalogram (EEG) signals are also analyzed and classified to serve as a benchmark for comparison with the ECG analysis. Several advanced signal processing techniques were implemented and investigated to derive multiple latent and informative features from both physiological signals. Decomposition and feature extraction are performed using the Stockwell transform for the ECG signal, while the discrete wavelet transform (DWT) is used for the EEG. These features are then fed to various machine-learning algorithms to produce classification models capable of differentiating between the cases of a person being attentive and a person not being attentive. The presented results show that detection and classification of cognitive attention using the ECG are fairly comparable to the EEG.
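The EEG branch above relies on the discrete wavelet transform. As an illustration, here is a single decomposition level of the Haar DWT, the simplest wavelet; the abstract does not name the mother wavelet used, so Haar is an assumption made for brevity, and the signal values are made up.

```python
import math

def haar_dwt_level(signal):
    """One level of the Haar discrete wavelet transform for an even-length
    signal: returns (approximation, detail) coefficient lists. The scaling
    by 1/sqrt(2) makes the transform orthonormal (energy-preserving)."""
    approx = [(signal[i] + signal[i + 1]) / math.sqrt(2)
              for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / math.sqrt(2)
              for i in range(0, len(signal), 2)]
    return approx, detail

# Toy even-length signal standing in for an EEG epoch.
sig = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
approx, detail = haar_dwt_level(sig)
```

Feature extraction pipelines typically recurse on the approximation coefficients for several levels and then summarize each sub-band (energy, entropy, statistics) before classification.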