1
Zhang Z, Zhang S, Ni D, Wei Z, Yang K, Jin S, Huang G, Liang Z, Zhang L, Li L, Ding H, Zhang Z, Wang J. Multimodal Sensing for Depression Risk Detection: Integrating Audio, Video, and Text Data. Sensors (Basel) 2024; 24:3714. PMID: 38931497; PMCID: PMC11207438; DOI: 10.3390/s24123714.
Abstract
Depression is a major psychological disorder with a growing impact worldwide. Traditional methods for detecting depression risk, which rely predominantly on psychiatric evaluations and self-assessment questionnaires, are often criticized for their inefficiency and lack of objectivity. Advances in deep learning have paved the way for depression risk detection methods that fuse multimodal data. This paper introduces a novel framework, the Audio, Video, and Text Fusion-Three Branch Network (AVTF-TBN), designed to combine auditory, visual, and textual cues for a comprehensive analysis of depression risk. Our approach comprises three dedicated branches (Audio Branch, Video Branch, and Text Branch), each responsible for extracting salient features from the corresponding modality. These features are then fused through a multimodal fusion (MMF) module, yielding a robust feature vector that feeds into a predictive modeling layer. To support this research, we devised an emotion elicitation paradigm based on two distinct tasks (reading and interviewing) and used it to gather a rich, sensor-based depression risk detection dataset; the sensing equipment, such as cameras, captures the subtle facial expressions and vocal characteristics essential for our analysis. The study investigates the data generated by varying emotional stimuli and evaluates the contribution of each task to emotion evocation. In our experiments, the AVTF-TBN model performed best when data from the two tasks were used together for detection, achieving an F1 score of 0.78, a precision of 0.76, and a recall of 0.81. These results confirm the validity of the paradigm and demonstrate the efficacy of the AVTF-TBN model in detecting depression risk, showcasing the crucial role of sensor-based data in mental health detection.
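The branch-then-fuse design this abstract describes can be illustrated in a few lines. This is a stand-in sketch, not the authors' implementation: each branch is replaced by a fixed random feature vector, fusion is plain concatenation, and the predictive layer is a hypothetical linear-plus-sigmoid scorer.

```python
import math
import random

def make_branch(seed, dim=4):
    """Stand-in for a learned branch: returns a fixed pseudo-random feature vector."""
    rnd = random.Random(seed)
    return [rnd.uniform(-1.0, 1.0) for _ in range(dim)]

def fuse(*branch_outputs):
    """Simplest model-level fusion: concatenate the branch feature vectors."""
    fused = []
    for vec in branch_outputs:
        fused.extend(vec)
    return fused

def predict(fused, weights, bias=0.0):
    """Hypothetical prediction layer: linear score squashed to (0, 1) by a sigmoid."""
    z = sum(w * x for w, x in zip(weights, fused)) + bias
    return 1.0 / (1.0 + math.exp(-z))

audio = make_branch(seed=1)   # placeholder for the Audio Branch output
video = make_branch(seed=2)   # placeholder for the Video Branch output
text = make_branch(seed=3)    # placeholder for the Text Branch output

fused = fuse(audio, video, text)                 # 4 + 4 + 4 = 12-dimensional vector
weights = make_branch(seed=4, dim=len(fused))    # hypothetical learned weights
score = predict(fused, weights)
print(len(fused), 0.0 < score < 1.0)
```

In the real model each branch would be a trained network and the MMF module would be learned jointly; the sketch only shows how three modality-specific feature vectors become one fused representation that a scorer consumes.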
Affiliation(s)
- Zhenwei Zhang: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Shengming Zhang: Affiliated Mental Health Center, Southern University of Science and Technology, Shenzhen 518055, China
- Dong Ni: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Zhaoguo Wei: Shenzhen Kangning Hospital, Shenzhen 518020, China; Shenzhen Mental Health Center, Shenzhen 518020, China
- Kongjun Yang: Shenzhen Kangning Hospital, Shenzhen 518020, China; Shenzhen Mental Health Center, Shenzhen 518020, China
- Shan Jin: Shenzhen Kangning Hospital, Shenzhen 518020, China; Shenzhen Mental Health Center, Shenzhen 518020, China
- Gan Huang: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Zhen Liang: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Li Zhang: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Linling Li: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Huijun Ding: School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518060, China; Guangdong Provincial Key Laboratory of Biomedical Measurements and Ultrasound Imaging, Shenzhen 518060, China
- Zhiguo Zhang: School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518055, China; Peng Cheng Laboratory, Shenzhen 518055, China
- Jianhong Wang: Shenzhen Kangning Hospital, Shenzhen 518020, China; Shenzhen Mental Health Center, Shenzhen 518020, China
2
Shi H, Fan Y, Zhang Y, Li X, Shu Y, Deng X, Zhang Y, Zheng Y, Yang J. Intelligent bell facial paralysis assessment: a facial recognition model using improved SSD network. Sci Rep 2024; 14:12763. PMID: 38834661; DOI: 10.1038/s41598-024-63478-x.
Abstract
With the continuous progress of technology, the life sciences play an increasingly important role, and the application of artificial intelligence in the medical field has attracted growing attention. Bell's facial palsy, a neurological ailment characterized by facial muscle weakness or paralysis, profoundly affects patients' facial expressions and masticatory abilities, inflicting considerable distress on their overall quality of life and mental well-being. In this study, we designed a facial attribute recognition model specifically for individuals with Bell's facial palsy. The model uses an enhanced SSD network and scientific computing to perform a graded assessment of the patient's condition. By replacing the VGG network with a more efficient backbone, we improved the model's accuracy and significantly reduced its computational burden. The results show that the improved SSD network achieves an average precision of 87.9% in classifying mild, moderate, and severe facial palsy and effectively grades patients with facial palsy; the scientific calculations further increase the classification precision. This is one of the most significant contributions of this article, which provides intelligent means and objective data for future research on intelligent diagnosis and treatment as well as progressive rehabilitation.
Affiliation(s)
- Haiping Shi: The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, China; Anhui University of Chinese Medicine, Hefei, Anhui, China
- Yinqiu Fan: The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, China; Anhui University of Chinese Medicine, Hefei, Anhui, China
- Yu Zhang: The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, China; Anhui University of Chinese Medicine, Hefei, Anhui, China
- Xiaowei Li: Anhui University of Chinese Medicine, Hefei, Anhui, China
- Yuling Shu: Anhui University of Chinese Medicine, Hefei, Anhui, China
- Xinyuan Deng: Anhui University of Chinese Medicine, Hefei, Anhui, China
- Yating Zhang: Anhui University of Chinese Medicine, Hefei, Anhui, China
- Yunzi Zheng: Anhui University of Chinese Medicine, Hefei, Anhui, China
- Jun Yang: The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, China; Anhui University of Chinese Medicine, Hefei, Anhui, China
3
Xu X, Li J, Zhu Z, Zhao L, Wang H, Song C, Chen Y, Zhao Q, Yang J, Pei Y. A Comprehensive Review on Synergy of Multi-Modal Data and AI Technologies in Medical Diagnosis. Bioengineering (Basel) 2024; 11:219. PMID: 38534493; DOI: 10.3390/bioengineering11030219.
Abstract
Disease diagnosis represents a critical and arduous endeavor within the medical field. Artificial intelligence (AI) techniques, spanning from machine learning and deep learning to large model paradigms, stand poised to significantly augment physicians in rendering more evidence-based decisions, thus presenting a pioneering solution for clinical practice. Traditionally, the amalgamation of diverse medical data modalities (e.g., image, text, speech, genetic data, physiological signals) is imperative to facilitate a comprehensive disease analysis, a topic of burgeoning interest among both researchers and clinicians in recent times. Hence, there exists a pressing need to synthesize the latest strides in multi-modal data and AI technologies in the realm of medical diagnosis. In this paper, we narrow our focus to five specific disorders (Alzheimer's disease, breast cancer, depression, heart disease, epilepsy), elucidating advanced endeavors in their diagnosis and treatment through the lens of artificial intelligence. Our survey not only delineates detailed diagnostic methodologies across varying modalities but also underscores commonly utilized public datasets, the intricacies of feature engineering, prevalent classification models, and envisaged challenges for future endeavors. In essence, our research endeavors to contribute to the advancement of diagnostic methodologies, furnishing invaluable insights for clinical decision making.
Affiliation(s)
- Xi Xu: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Jianqiang Li: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Zhichao Zhu: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Linna Zhao: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Huina Wang: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Changwei Song: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Yining Chen: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Qing Zhao: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
- Jijiang Yang: Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
- Yan Pei: School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu 965-8580, Japan
4
Khoo LS, Lim MK, Chong CY, McNaney R. Machine Learning for Multimodal Mental Health Detection: A Systematic Review of Passive Sensing Approaches. Sensors (Basel) 2024; 24:348. PMID: 38257440; PMCID: PMC10820860; DOI: 10.3390/s24020348.
Abstract
As mental health (MH) disorders become increasingly prevalent, their multifaceted symptoms and comorbidities with other conditions introduce complexity to diagnosis, posing a risk of underdiagnosis. While machine learning (ML) has been explored to mitigate these challenges, we hypothesized that multiple data modalities support more comprehensive detection and that non-intrusive collection approaches better capture natural behaviors. To understand the current trends, we systematically reviewed 184 studies to assess feature extraction, feature fusion, and ML methodologies applied to detect MH disorders from passively sensed multimodal data, including audio and video recordings, social media, smartphones, and wearable devices. Our findings revealed varying correlations of modality-specific features in individualized contexts, potentially influenced by demographics and personalities. We also observed the growing adoption of neural network architectures for model-level fusion and as ML algorithms, which have demonstrated promising efficacy in handling high-dimensional features while modeling within and cross-modality relationships. This work provides future researchers with a clear taxonomy of methodological approaches to multimodal detection of MH disorders to inspire future methodological advancements. The comprehensive analysis also guides and supports future researchers in making informed decisions to select an optimal data source that aligns with specific use cases based on the MH disorder of interest.
Affiliation(s)
- Lin Sze Khoo: Department of Human-Centered Computing, Faculty of Information Technology, Monash University, Clayton, VIC 3800, Australia
- Mei Kuan Lim: School of Information Technology, Monash University Malaysia, Subang Jaya 46150, Malaysia
- Chun Yong Chong: School of Information Technology, Monash University Malaysia, Subang Jaya 46150, Malaysia
- Roisin McNaney: Department of Human-Centered Computing, Faculty of Information Technology, Monash University, Clayton, VIC 3800, Australia
5
Mao K, Wu Y, Chen J. A systematic review on automated clinical depression diagnosis. NPJ Mental Health Research 2023; 2:20. PMID: 38609509; PMCID: PMC10955993; DOI: 10.1038/s44184-023-00040-z.
Abstract
Assessing mental health disorders and determining treatment can be difficult for a number of reasons, including access to healthcare providers. Assessments and treatments may not be continuous and can be limited by the unpredictable nature of psychiatric symptoms. Machine-learning models using data collected in a clinical setting can improve diagnosis and treatment. Studies have used speech, text, and facial expression analysis to identify depression. Still, more research is needed to address challenges such as the need for multimodality machine-learning models for clinical use. We conducted a review of studies from the past decade that utilized speech, text, and facial expression analysis to detect depression, as defined by the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), using the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guideline. We provide information on the number of participants, techniques used to assess clinical outcomes, speech-eliciting tasks, machine-learning algorithms, metrics, and other important discoveries for each study. A total of 544 studies were examined, 264 of which satisfied the inclusion criteria. A database has been created containing the query results and a summary of how different features are used to detect depression. While machine learning shows its potential to enhance mental health disorder evaluations, some obstacles must be overcome, especially the requirement for more transparent machine-learning models for clinical purposes. Considering the variety of datasets, feature extraction techniques, and metrics used in this field, guidelines have been provided to collect data and train machine-learning models to guarantee reproducibility and generalizability across different contexts.
Affiliation(s)
- Kaining Mao: Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
- Yuqi Wu: Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
- Jie Chen: Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
6
Wang JZ, Zhao S, Wu C, Adams RB, Newman MG, Shafir T, Tsachor R. Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion: Drawing Insights From Psychology, Engineering, and the Arts, This Article Provides a Comprehensive Overview of the Field of Emotion Analysis in Visual Media and Discusses the Latest Research, Systems, Challenges, Ethical Implications, and Potential Impact of Artificial Emotional Intelligence on Society. Proceedings of the IEEE 2023; 111:1236-1286. PMID: 37859667; PMCID: PMC10586271; DOI: 10.1109/JPROC.2023.3273517.
Abstract
The emergence of artificial emotional intelligence technology is revolutionizing the fields of computers and robotics, allowing for a new level of communication and understanding of human behavior that was once thought impossible. While recent advancements in deep learning have transformed the field of computer vision, automated understanding of evoked or expressed emotions in visual media remains in its infancy. This foundering stems from the absence of a universally accepted definition of "emotion," coupled with the inherently subjective nature of emotions and their intricate nuances. In this article, we provide a comprehensive, multidisciplinary overview of the field of emotion analysis in visual media, drawing on insights from psychology, engineering, and the arts. We begin by exploring the psychological foundations of emotion and the computational principles that underpin the understanding of emotions from images and videos. We then review the latest research and systems within the field, accentuating the most promising approaches. We also discuss the current technological challenges and limitations of emotion analysis, underscoring the necessity for continued investigation and innovation. We contend that this represents a "Holy Grail" research problem in computing and delineate pivotal directions for future inquiry. Finally, we examine the ethical ramifications of emotion-understanding technologies and contemplate their potential societal impacts. Overall, this article endeavors to equip readers with a deeper understanding of the domain of emotion analysis in visual media and to inspire further research and development in this captivating and rapidly evolving field.
Affiliation(s)
- James Z Wang: College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802, USA
- Sicheng Zhao: Beijing National Research Center for Information Science and Technology (BNRist), Tsinghua University, Beijing 100084, China
- Chenyan Wu: College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802, USA
- Reginald B Adams: Department of Psychology, The Pennsylvania State University, University Park, PA 16802, USA
- Michelle G Newman: Department of Psychology, The Pennsylvania State University, University Park, PA 16802, USA
- Tal Shafir: Emily Sagol Creative Arts Therapies Research Center, University of Haifa, Haifa 3498838, Israel
- Rachelle Tsachor: School of Theatre and Music, University of Illinois at Chicago, Chicago, IL 60607, USA
7
Ma Y, Shen J, Zhao Z, Liang H, Tan Y, Liu Z, Qian K, Yang M, Hu B. What Can Facial Movements Reveal? Depression Recognition and Analysis Based on Optical Flow Using Bayesian Networks. IEEE Trans Neural Syst Rehabil Eng 2023; 31:3459-3468. PMID: 37581961; DOI: 10.1109/TNSRE.2023.3305351.
Abstract
Recent evidence has demonstrated that facial expressions can be a valid and important cue for depression recognition. Although much has been achieved in automatic depression recognition, it remains a challenge to explore the inherent nuances of facial expressions that might reveal the underlying differences between depressed patients and healthy subjects under different stimuli. There is a lack of an unobtrusive system that monitors depressive patients' mental states in various free-living scenarios, so this paper takes a step toward building a classification model in which data collection, feature extraction, depression recognition, and facial action analysis are conducted to infer the differences in facial movements between depressive patients and healthy subjects. In this study, we first present a scheme for dividing facial regions of interest to extract optical flow features of facial expressions for depression recognition. We then propose facial movement coefficients based on the discrete wavelet transform. Specifically, Bayesian Networks constructed from Pearson correlation coefficients of the wavelet-derived coefficients are learned, which allows movements of different facial regions to be analyzed. We evaluate our method on a clinically validated dataset of 30 depressed patients and 30 healthy control subjects; the experimental results achieve an accuracy of 81.7% and a recall of 96.7%, outperforming the features used for comparison. Most importantly, the Bayesian Networks built on the coefficients under different stimuli may reveal facial action patterns of depressed subjects, which have the potential to assist the automatic diagnosis of depression.
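The correlation step this abstract relies on (relating movement coefficients of different facial regions via Pearson correlation, before learning the Bayesian Network) can be illustrated with a minimal, self-contained sketch. The region names and per-frame values below are hypothetical, and the full pipeline additionally involves optical flow extraction and the discrete wavelet transform, which are omitted here.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical per-frame movement magnitudes for two facial regions of interest.
mouth = [0.10, 0.40, 0.35, 0.80, 0.60]
eyes = [0.20, 0.50, 0.45, 0.90, 0.70]

# A value near +1 means the two regions tend to move together; such pairwise
# coefficients would feed the structure learning of the Bayesian Network.
print(round(pearson(mouth, eyes), 3))
```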
8
Ye J, Yu Y, Fu G, Zheng Y, Liu Y, Zhu Y, Wang Q. Analysis and Recognition of Voluntary Facial Expression Mimicry Based on Depressed Patients. IEEE J Biomed Health Inform 2023; 27:3698-3709. PMID: 37030686; DOI: 10.1109/JBHI.2023.3260816.
Abstract
Many clinical studies have shown that facial expression recognition and cognitive function are impaired in depressed patients. In contrast to spontaneous facial expression mimicry (SFEM), 164 subjects (82 in a case group and 82 in a control group) participated in our voluntary facial expression mimicry (VFEM) experiment using expressions of neutrality, anger, disgust, fear, happiness, sadness, and surprise. Our research proceeds as follows. First, we collected a large amount of subject data for VFEM. Second, we extracted geometric features from the subjects' facial expression images and used Spearman correlation analysis, a random forest, and logistic regression-based recursive feature elimination (LR-RFE) to perform feature selection; the selected features reveal the differences between the case group and the control group. Third, we combined the geometric features with the original images and improved advanced deep learning facial expression recognition (FER) algorithms in different systems, proposing the E-ViT and E-ResNet models based on VFEM, whose accuracies and F1 scores exceed those of the respective baseline models. Our research shows that it is effective to use feature selection to screen geometric features and combine them with a deep learning model for depression facial expression recognition.
9
Kleine AK, Kokje E, Lermer E, Gaube S. Attitudes Toward the Adoption of 2 Artificial Intelligence-Enabled Mental Health Tools Among Prospective Psychotherapists: Cross-sectional Study. JMIR Hum Factors 2023; 10:e46859. PMID: 37436801; PMCID: PMC10372564; DOI: 10.2196/46859.
Abstract
BACKGROUND Despite growing efforts to develop user-friendly artificial intelligence (AI) applications for clinical care, their adoption remains limited because of barriers at the individual, organizational, and system levels. There is limited research on the intention to use AI systems in mental health care. OBJECTIVE This study aimed to address this gap by examining the predictors of psychology students' and early practitioners' intention to use 2 specific AI-enabled mental health tools based on the Unified Theory of Acceptance and Use of Technology. METHODS This cross-sectional study included 206 psychology students and psychotherapists in training to examine the predictors of their intention to use 2 AI-enabled mental health care tools. The first tool provides feedback to the psychotherapist on their adherence to motivational interviewing techniques. The second tool uses patient voice samples to derive mood scores that the therapists may use for treatment decisions. Participants were presented with graphic depictions of the tools' functioning mechanisms before measuring the variables of the extended Unified Theory of Acceptance and Use of Technology. In total, 2 structural equation models (1 for each tool) were specified, which included direct and mediated paths for predicting tool use intentions. RESULTS Perceived usefulness and social influence had a positive effect on the intention to use the feedback tool (P<.001) and the treatment recommendation tool (perceived usefulness, P=.01; social influence, P<.001). However, trust was unrelated to use intentions for both tools. Moreover, perceived ease of use was unrelated (feedback tool) or even negatively related (treatment recommendation tool) to use intentions when considering all predictors (P=.004). In addition, a positive relationship between cognitive technology readiness (P=.02) and the intention to use the feedback tool and a negative relationship between AI anxiety and the intention to use the feedback tool (P=.001) and the treatment recommendation tool (P<.001) were observed. CONCLUSIONS The results shed light on the general and tool-dependent drivers of AI technology adoption in mental health care. Future research may explore the technological and user group characteristics that influence the adoption of AI-enabled tools in mental health care.
Affiliation(s)
- Anne-Kathrin Kleine: Department of Psychology, Ludwig Maximilian University of Munich, Munich, Germany
- Eesha Kokje: Department of Psychology, Ludwig Maximilian University of Munich, Munich, Germany
- Eva Lermer: Department of Psychology, Ludwig Maximilian University of Munich, Munich, Germany; Technical University of Applied Sciences Augsburg, Augsburg, Germany
- Susanne Gaube: Department of Psychology, Ludwig Maximilian University of Munich, Munich, Germany
10
Li Y, Liu Z, Zhou L, Yuan X, Shangguan Z, Hu X, Hu B. A facial depression recognition method based on hybrid multi-head cross attention network. Front Neurosci 2023; 17:1188434. PMID: 37292164; PMCID: PMC10244529; DOI: 10.3389/fnins.2023.1188434.
Abstract
Introduction: Deep learning methods based on convolutional neural networks (CNNs) have demonstrated impressive performance in depression analysis. Nevertheless, some critical challenges remain: (1) because of spatial locality, it is still difficult for CNNs to learn long-range inductive biases during low-level feature extraction across different facial regions; (2) a model with only a single attention head struggles to concentrate on various parts of the face simultaneously, making it less sensitive to other important facial regions associated with depression. In facial depression recognition, many of the clues come from several areas of the face at once, e.g., the mouth and eyes. Methods: To address these issues, we present an end-to-end integrated framework called the Hybrid Multi-head Cross Attention Network (HMHN), which comprises two stages. The first stage consists of the Grid-Wise Attention block (GWA) and the Deep Feature Fusion block (DFF) for low-level visual depression feature learning. In the second stage, we obtain the global representation by encoding high-order interactions among local features with the Multi-head Cross Attention block (MAB) and the Attention Fusion block (AFB). Results: We experimented on the AVEC 2013 and AVEC 2014 depression datasets. The results on AVEC 2013 (RMSE = 7.38, MAE = 6.05) and AVEC 2014 (RMSE = 7.60, MAE = 6.01) demonstrate the efficacy of our method, which outperforms most state-of-the-art video-based depression recognition approaches. Discussion: We proposed a deep learning hybrid model for depression recognition that captures higher-order interactions between the depression features of multiple facial regions, which can effectively reduce recognition error and shows great potential for clinical experiments.
11
Liu Z, Yuan X, Li Y, Shangguan Z, Zhou L, Hu B. PRA-Net: Part-and-Relation Attention Network for depression recognition from facial expression. Comput Biol Med 2023; 157:106589. PMID: 36934531; DOI: 10.1016/j.compbiomed.2023.106589.
Abstract
Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators of psychiatric disorders. However, these methods suffer from insufficient representations of depression. To this end, we propose a novel Part-and-Relation Attention Network (PRA-Net), which can enhance depression representations by accurately focusing on features that are highly correlated with depression. Specifically, we first partition the feature map instead of the original image in order to obtain part features rich in semantic information. Self-attention is then used to calculate the weight of each part feature. Next, the relationship between each part feature and the global content representation is explored by relation attention to refine the weights. Finally, all features are aggregated via these weights into a more compact, depression-informative representation for depression score prediction. Extensive experiments demonstrate the superiority of our method: compared with other end-to-end methods, it achieves state-of-the-art performance on AVEC2013 and AVEC2014.
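The weight-then-aggregate step this abstract describes can be sketched in a few lines. This is a simplified illustration, not the PRA-Net code: the part features and raw attention scores below are hypothetical, the weights are normalized with a softmax, and the parts are pooled by a weighted sum.

```python
import math

def softmax(scores):
    """Numerically stable softmax: turns raw scores into weights summing to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate(part_features, scores):
    """Pool part features into one vector using softmax attention weights."""
    weights = softmax(scores)
    dim = len(part_features[0])
    pooled = [0.0] * dim
    for w, feat in zip(weights, part_features):
        for i, v in enumerate(feat):
            pooled[i] += w * v
    return weights, pooled

# Hypothetical 2-D features for three facial parts and their attention scores
# (in the real network both would be produced by learned layers).
parts = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
scores = [2.0, 0.5, 1.0]

weights, pooled = aggregate(parts, scores)
print(round(sum(weights), 6), len(pooled))
```

The part with the highest score dominates the pooled representation, which is the intuition behind letting attention emphasize depression-relevant facial regions.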
Affiliation(s)
- Zhenyu Liu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
- Xiaoyan Yuan
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
- Yutong Li
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
- Zixuan Shangguan
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
- Li Zhou
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
- Bin Hu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
12
Francese R, Attanasio P. Emotion detection for supporting depression screening. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:12771-12795. [PMID: 36570729 PMCID: PMC9761032 DOI: 10.1007/s11042-022-14290-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 10/14/2022] [Accepted: 12/03/2022] [Indexed: 06/17/2023]
Abstract
Depression is the most prevalent mental disorder in the world. One of the most widely adopted tools for depression screening is the Beck Depression Inventory-II (BDI-II) questionnaire, but patients may minimize or exaggerate their answers. Thus, to further examine the patient's mood while they fill in the questionnaire, we propose a mobile application that captures the patient's BDI-II responses together with their images and speech. Deep learning techniques such as Convolutional Neural Networks analyze the patient's audio and image data. At the end of the questionnaire, the application shows the clinician the correlation between the patient's emotional scores and BDI-II scores, indicating the relationship between the patient's emotional state and the depression screening score. We conducted a preliminary evaluation involving clinicians and patients to assess (i) the acceptability of the proposed application for use in clinics and (ii) the patient user experience. The participants were eight clinicians who tried the tool with 21 of their patients. The results seem to confirm the acceptability of the app in clinical practice.
Affiliation(s)
- Rita Francese
- Computer Science Department, Università degli Studi di Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano (SA), Italy
13
Automatic Identification of a Depressive State in Primary Care. Healthcare (Basel) 2022; 10:healthcare10122347. [PMID: 36553871 PMCID: PMC9777617 DOI: 10.3390/healthcare10122347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 11/04/2022] [Accepted: 11/19/2022] [Indexed: 11/24/2022] Open
Abstract
The Center for Epidemiologic Studies Depression Scale (CES-D) performs well in screening for depression in primary care, but alternatives are sought because it contains many items. With the popularity of social media platforms, facial movement can be recorded ecologically. Considering that nonverbal behaviors, including facial movement, are associated with a depressive state, this study aims to establish an automatic depression recognition model that can easily be used in primary healthcare. We integrated facial activities and gaze behaviors to establish a machine learning algorithm (Kernel Ridge Regression, KRR). We compared different algorithms and different features to achieve the best model. The results showed that the predictive power of combined facial and gaze features was higher than that of facial features alone. Of all the models we tried, the ridge model with a periodic kernel showed the best performance, with an R-squared (R2) value of 0.43 and a Pearson correlation coefficient (r) of 0.69 (p < 0.001). The study also identified the most relevant variables (e.g., gaze directions and facial action units).
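As an illustration of the ridge-with-periodic-kernel idea in the abstract above, the sketch below implements kernel ridge regression with an ExpSineSquared-style periodic kernel from scratch; the feature dimensions, kernel period, and regularization strength are illustrative assumptions, not the study's settings.

```python
import numpy as np

def periodic_kernel(A, B, period=1.0, length=1.0):
    """ExpSineSquared-style periodic kernel between rows of A and B."""
    d = np.sqrt(((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2))
    return np.exp(-2.0 * np.sin(np.pi * d / period) ** 2 / length ** 2)

class PeriodicKernelRidge:
    """Minimal kernel ridge regression: solve (K + lam*I) alpha = y."""
    def __init__(self, lam=1e-2, period=1.0, length=1.0):
        self.lam, self.period, self.length = lam, period, length

    def fit(self, X, y):
        self.X_train = X
        K = periodic_kernel(X, X, self.period, self.length)
        self.alpha = np.linalg.solve(K + self.lam * np.eye(len(X)), y)
        return self

    def predict(self, X):
        return periodic_kernel(X, self.X_train, self.period, self.length) @ self.alpha

# Toy data standing in for per-subject facial + gaze feature vectors
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 5))                      # 40 subjects, 5 features
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=40)   # synthetic severity score
pred = PeriodicKernelRidge().fit(X, y).predict(X)
K = periodic_kernel(X, X)                          # symmetric, unit diagonal
```

scikit-learn's `KernelRidge` with a `gaussian_process.kernels.ExpSineSquared` kernel object is an equivalent off-the-shelf route.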
14
Zhang B, Wei D, Yan G, Lei T, Cai H, Yang Z. Feature-level fusion based on spatial-temporal of pervasive EEG for depression recognition. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107113. [PMID: 36103735 DOI: 10.1016/j.cmpb.2022.107113] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 08/23/2022] [Accepted: 09/04/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE In view of the characteristics of depression, such as its high prevalence, high disability rate, high fatality rate, and high recurrence rate, early identification and early intervention are the most effective ways to prevent irreversible damage to brain function over time. The traditional method of depression recognition based on questionnaires and interviews is time-consuming and labor-intensive, and depends heavily on the doctor's subjective experience. Therefore, accurate, convenient, and effective recognition of depression has important social value and scientific significance. METHODS This paper proposes a depression recognition framework based on feature-level fusion of spatial-temporal pervasive electroencephalography (EEG). Time-series EEG data were collected by a portable three-electrode EEG acquisition instrument and mapped to a spatial complex network called a visibility graph (VG). Then temporal EEG features and spatial VG metric features were extracted and selected. Based on the correlation between features and categories, the differences in the contribution of individual features were explored, and different contribution coefficients were assigned to different features as the basis of feature-level fusion to ensure the diversity of the data. A cascade forest model based on three different decision forests was designed to realize efficient depression recognition using the spatial-temporal feature-level fusion data. RESULTS Experimental data were obtained from 26 depressed patients and 29 healthy controls (HC). The results of multiple control experiments show that, compared with single-type features, feature-level fusion without contribution coefficients, and independent classifiers, the feature-level fusion method with spatial-temporal contribution coefficients has stronger depression recognition ability, with a highest accuracy of 92.48%.
CONCLUSION The feature-level fusion method provides an effective computer-aided tool for rapid clinical diagnosis of depression.
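The visibility graph mapping used above has a compact definition worth spelling out: time points become nodes, and samples a and b are linked when every intermediate sample lies strictly below the straight line joining them. A minimal natural-VG sketch (the toy series is illustrative, not EEG data):

```python
import numpy as np

def visibility_graph(x):
    """Adjacency matrix of the natural visibility graph of series x.

    Nodes are time points; i and j (i < j) are linked when every
    intermediate sample k lies strictly below the line through
    (i, x[i]) and (j, x[j]).
    """
    n = len(x)
    A = np.zeros((n, n), dtype=int)
    for i in range(n):
        for j in range(i + 1, n):
            ks = np.arange(i + 1, j)
            # Height of the i-j sight line at each intermediate index k
            line = x[j] + (x[i] - x[j]) * (j - ks) / (j - i)
            if np.all(x[ks] < line):
                A[i, j] = A[j, i] = 1
    return A

x = np.array([3.0, 1.0, 2.0, 0.5, 4.0])   # toy 5-sample series
A = visibility_graph(x)
degree = A.sum(axis=0)                     # a typical spatial VG metric
```

Metrics such as node degree can then feed the feature-level fusion the abstract describes.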
Affiliation(s)
- Bingtao Zhang
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China.
- Dan Wei
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
- Guanghui Yan
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
- Tao Lei
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710021, China
- Haishu Cai
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
- Zhifei Yang
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
15
Cao XJ, Liu XQ. Artificial intelligence-assisted psychosis risk screening in adolescents: Practices and challenges. World J Psychiatry 2022; 12:1287-1297. [PMID: 36389087 PMCID: PMC9641379 DOI: 10.5498/wjp.v12.i10.1287] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 08/09/2022] [Accepted: 09/22/2022] [Indexed: 02/05/2023] Open
Abstract
Artificial intelligence-based technologies are gradually being applied to psychiatric research and practice. This paper reviews the primary literature concerning artificial intelligence-assisted psychosis risk screening in adolescents. In terms of the practice of psychosis risk screening, the application of two artificial intelligence-assisted screening methods, chatbot and large-scale social media data analysis, is summarized in detail. Regarding the challenges of psychiatric risk screening, ethical issues constitute the first challenge of psychiatric risk screening through artificial intelligence, which must comply with the four biomedical ethical principles of respect for autonomy, nonmaleficence, beneficence and impartiality such that the development of artificial intelligence can meet the moral and ethical requirements of human beings. By reviewing the pertinent literature concerning current artificial intelligence-assisted adolescent psychosis risk screens, we propose that, assuming they meet ethical requirements, there are three directions worth considering in the future development of artificial intelligence-assisted psychosis risk screening in adolescents: nonperceptual real-time artificial intelligence-assisted screening, further reducing the cost of artificial intelligence-assisted screening, and improving the ease of use of artificial intelligence-assisted screening techniques and tools.
Affiliation(s)
- Xiao-Jie Cao
- Graduate School of Education, Peking University, Beijing 100871, China
- Xin-Qiao Liu
- School of Education, Tianjin University, Tianjin 300350, China
16
Wu P, Wang R, Lin H, Zhang F, Tu J, Sun M. Automatic depression recognition by intelligent speech signal processing: A systematic survey. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2022. [DOI: 10.1049/cit2.12113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Pingping Wu
- Jiangsu Key Laboratory of Public Project Audit, School of Engineering Audit, Nanjing Audit University, Nanjing, China
- Ruihao Wang
- School of Information Engineering, Nanjing Audit University, Nanjing, China
- Han Lin
- Jiangsu Key Laboratory of Public Project Audit, School of Engineering Audit, Nanjing Audit University, Nanjing, China
- Fanlong Zhang
- School of Information Engineering, Nanjing Audit University, Nanjing, China
- Juan Tu
- Key Laboratory of Modern Acoustics (MOE), School of Physics, Nanjing University, Nanjing, China
- Miao Sun
- Faculty of Electrical Engineering, Mathematics & Computer Science, Delft University of Technology, Delft, The Netherlands
17
Wang J, Lv K, Liu C, Nie X, Gowda D, Luan S. Automatic Assessment for Severe Self-Reported Depressive Symptoms Using Speech Cues. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2020.3002512] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
18
Zhou Y, Jin L, Liu H, Song E. Color Facial Expression Recognition by Quaternion Convolutional Neural Network With Gabor Attention. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2020.3041642] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
19
Muzammel M, Salam H, Othmani A. End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 211:106433. [PMID: 34614452 DOI: 10.1016/j.cmpb.2021.106433] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 09/15/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Major Depressive Disorder is a highly prevalent and disabling mental health condition. Numerous studies explored multimodal fusion systems combining visual, audio, and textual features via deep learning architectures for clinical depression recognition. Yet, no comparative analysis for multimodal depression analysis has been proposed in the literature. METHODS In this paper, an up-to-date literature overview of multimodal depression recognition is presented and an extensive comparative analysis of different deep learning architectures for depression recognition is performed. First, audio features based Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) are studied. Then, early-level and model-level fusion of deep audio features with visual and textual features through LSTM and CNN architectures are investigated. RESULTS The performance of the proposed architectures using an hold-out strategy on the DAIC-WOZ dataset (80% training, 10% validation, 10% test split) for binary and severity levels of depression recognition is tested. Using this strategy, a set of experiments have been performed and they have demonstrated: (1) LSTM-based audio features perform slightly better than CNN ones with an accuracy of 66.25% versus 65.60% for binary depression classes. (2) the model level fusion of deep audio and visual features using LSTM network performed the best with an accuracy of 77.16%, a precision of 53% for the depressed class, and a precision of 83% for the non-depressed class. The given network obtained a normalized Root Mean Square Error (RMSE) of 0.15 for depression severity level prediction. Using a Leave-One-Subject-Out strategy, this network achieved an accuracy of 95.38% for binary depression detection, and a normalized RMSE of 0.1476 for depression severity level prediction. Our best-performing architecture outperforms all state-of-the-art approaches on DAIC-WOZ dataset. 
CONCLUSIONS The obtained results show that the proposed LSTM-based architectures surpass the proposed CNN-based ones, as they learn temporal dynamic representations of multimodal features. Furthermore, model-level fusion of audio and visual features using an LSTM network leads to the best performance. Our best-performing architecture successfully detects depression using a speech segment of less than 8 seconds, with an average prediction computation time of less than 6 ms, making it suitable for real-world clinical applications.
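The early-level versus model-level fusion contrast in this abstract can be sketched in a few lines. All dimensions below are arbitrary, and fixed random projections stand in for trained CNN/LSTM branches; this is a shape-level illustration, not the paper's architecture.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical per-segment inputs: raw audio and visual feature vectors
audio = rng.normal(size=(8, 40))    # 8 segments, 40 audio features
visual = rng.normal(size=(8, 60))   # 8 segments, 60 visual features

# Early-level fusion: concatenate raw modality features, then feed one model
early = np.concatenate([audio, visual], axis=1)     # shape (8, 100)

# Model-level fusion: each branch first produces its own learned embedding
# (random projections here stand in for trained branch networks)
W_a = rng.normal(size=(40, 16))
W_v = rng.normal(size=(60, 16))
emb_a = np.tanh(audio @ W_a)        # audio-branch embedding, (8, 16)
emb_v = np.tanh(visual @ W_v)       # visual-branch embedding, (8, 16)
fused = np.concatenate([emb_a, emb_v], axis=1)      # shape (8, 32)
```

The fused vectors would then go to a final classifier or regressor; the abstract reports that fusing at this model level worked best.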
Affiliation(s)
- Muhammad Muzammel
- Université Paris-Est Créteil (UPEC), LISSI, Vitry sur Seine 94400, France
- Hanan Salam
- New York University, SMART Lab, Saadiyat Island, Abu Dhabi
- Alice Othmani
- Université Paris-Est Créteil (UPEC), LISSI, Vitry sur Seine 94400, France.
20
He L, Guo C, Tiwari P, Su R, Pandey HM, Dang W. DepNet: An automated industrial intelligent system using deep learning for video‐based depression analysis. INT J INTELL SYST 2021. [DOI: 10.1002/int.22704] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Lang He
- School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
- Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi'an, Shaanxi, China
- Xi'an Key Laboratory of Big Data and Intelligent Computing, Xi'an, Shaanxi, China
- Chenguang Guo
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an, Shaanxi, China
- Prayag Tiwari
- Department of Computer Science, Aalto University, Espoo, Finland
- Rui Su
- School of Foreign Languages, Northwest University, Xi'an, Shaanxi, China
- Hari Mohan Pandey
- Department of Computer Science, Edge Hill University, Ormskirk, United Kingdom
- Wei Dang
- Xi'an Mental Health Center, Xi'an, Shaanxi, China
21
Zhao Y, Liang Z, Du J, Zhang L, Liu C, Zhao L. Multi-Head Attention-Based Long Short-Term Memory for Depression Detection From Speech. Front Neurorobot 2021; 15:684037. [PMID: 34512301 PMCID: PMC8426553 DOI: 10.3389/fnbot.2021.684037] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 07/19/2021] [Indexed: 11/13/2022] Open
Abstract
Depression is a mental disorder that threatens people's health and normal life. Hence, it is essential to provide an effective way to detect depression. However, research on depression detection has mainly focused on utilizing different parallel features from audio, video, and text for performance enhancement, without making full use of the information inherent in speech. To focus on the more emotionally salient regions of depressed speech, in this research we propose a multi-head time-dimension attention-based long short-term memory (LSTM) model. We first extract frame-level features to preserve the original temporal relationships of a speech sequence and then analyze how they differ between depressed and healthy speakers. We then study the performance of various features and use a modified feature set as the input of the LSTM layer. Instead of using the output of a traditional LSTM directly, multi-head time-dimension attention is employed to obtain the time information most relevant to depression detection by projecting the output into different subspaces. The experimental results show the proposed model yields improvements of 2.3% and 10.3% over the LSTM model on the Distress Analysis Interview Corpus-Wizard of Oz (DAIC-WOZ) and the Multi-modal Open Dataset for Mental-disorder Analysis (MODMA) corpus, respectively.
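The multi-head time-dimension attention idea — each head projects the LSTM outputs into its own subspace, scores every time step, and pools with a softmax over time — can be sketched as follows. Random projections stand in for learned parameters, and the dimensions are illustrative, not the paper's.

```python
import numpy as np

def multihead_time_attention(H, heads=4, seed=0):
    """Pool a sequence of hidden states H (T, d) with per-head
    attention over the time dimension; head outputs are concatenated."""
    rng = np.random.default_rng(seed)
    T, d = H.shape
    outs = []
    for _ in range(heads):
        W = rng.normal(size=(d, d // heads))   # per-head subspace projection
        w = rng.normal(size=(d // heads,))     # per-head scoring vector
        P = H @ W                              # (T, d/heads) projected states
        s = P @ w                              # (T,) time-step scores
        a = np.exp(s - s.max())
        a /= a.sum()                           # softmax over time
        outs.append(a @ P)                     # attention-weighted pooling
    return np.concatenate(outs)                # (d,) pooled utterance vector

# 50 frames of 32-dimensional hidden states standing in for LSTM outputs
H = np.random.default_rng(2).normal(size=(50, 32))
z = multihead_time_attention(H)
```

Each head can thus attend to a different temporal region of the utterance, which is the motivation the abstract gives for using multiple heads.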
Affiliation(s)
- Yan Zhao
- Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University, Nanjing, China
- Zhenlin Liang
- Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University, Nanjing, China
- Jing Du
- Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University, Nanjing, China
- Li Zhang
- Computational Intelligence Group, Northumbria University, Newcastle upon Tyne, United Kingdom
- National Subsea Centre, Robert Gordon University, Aberdeen, United Kingdom
- Chengyu Liu
- School of Instrument Science and Engineering, Southeast University, Nanjing, China
- Li Zhao
- Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University, Nanjing, China
22
Niu M, Liu B, Tao J, Li Q. A time-frequency channel attention and vectorization network for automatic depression level prediction. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.04.056] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
23
Dong Y, Yang X. A hierarchical depression detection model based on vocal and emotional cues. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.02.019] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
24
Guo W, Yang H, Liu Z, Xu Y, Hu B. Deep Neural Networks for Depression Recognition Based on 2D and 3D Facial Expressions Under Emotional Stimulus Tasks. Front Neurosci 2021; 15:609760. [PMID: 33967675 PMCID: PMC8102822 DOI: 10.3389/fnins.2021.609760] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 03/08/2021] [Indexed: 11/23/2022] Open
Abstract
The proportion of individuals with depression has rapidly increased along with the growth of the global population, and depression is currently the most prevalent mental health disorder. An effective depression recognition system is especially crucial for the early detection of potential depression risk. A depression-related dataset is also critical for evaluating systems for depression or potential depression risk detection. Due to the sensitive nature of clinical data, the availability and scale of such datasets are limited; to our knowledge, there are few practically usable depression datasets for the Chinese population. In this study, we first create a large-scale dataset by asking subjects to perform five mood-elicitation tasks. After each task, the subjects' audio and video are collected, including 3D (depth) information of facial expressions via a Kinect. The dataset was constructed in a real environment, i.e., several psychiatric hospitals, and is of substantial scale. We then propose a novel approach for potential depression risk recognition based on two different deep belief network (DBN) models: one extracts 2D appearance features from facial images collected by an optical camera, while the other extracts 3D dynamic features from 3D facial points collected by a Kinect. The final decision comes from the combination of the two models. Finally, we evaluate all the proposed deep models on our dataset. The experimental results demonstrate that (1) our proposed method is able to identify patients with potential depression risk; (2) the model combining 2D and 3D features outperforms models using either 2D or 3D features alone; and (3) recognition performance is higher under positive and negative emotional stimuli, and the recognition rate for females is generally higher than that for males. We also compare the performance with other methods on the same dataset. The experimental results show that our integrated 2D-and-3D-feature DBN is more reasonable and generalizable than the other methods, and that the experimental paradigm designed for depression is reasonable and practical.
Affiliation(s)
- Weitong Guo
- School of Information Science and Engineering, Lanzhou University, Lanzhou, China
- School of Educational Technology, Northwest Normal University, Lanzhou, China
- Gansu Provincial Key Laboratory of Wearable Computing, Lanzhou, China
- National and Provincial Joint Engineering Laboratory of Learning Analysis Technology in Online Education, Lanzhou, China
- Hongwu Yang
- School of Educational Technology, Northwest Normal University, Lanzhou, China
- National and Provincial Joint Engineering Laboratory of Learning Analysis Technology in Online Education, Lanzhou, China
- Zhenyu Liu
- School of Information Science and Engineering, Lanzhou University, Lanzhou, China
- Gansu Provincial Key Laboratory of Wearable Computing, Lanzhou, China
- Yaping Xu
- School of Information Science and Engineering, Lanzhou University, Lanzhou, China
- School of Educational Technology, Northwest Normal University, Lanzhou, China
- Bin Hu
- School of Information Science and Engineering, Lanzhou University, Lanzhou, China
- Gansu Provincial Key Laboratory of Wearable Computing, Lanzhou, China
25
He L, Guo C, Tiwari P, Pandey HM, Dang W. Intelligent system for depression scale estimation with facial expressions and case study in industrial intelligence. INT J INTELL SYST 2021. [DOI: 10.1002/int.22426] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Affiliation(s)
- Lang He
- Computer Science, School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
- Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
- Chenguang Guo
- Department of Electronics and Information Engineering, School of Electronics and Information, Northwestern Polytechnical University, Xi'an, China
- Prayag Tiwari
- Department of Computer Science, Aalto University, Espoo, Finland
- Wei Dang
- Shaanxi Mental Health Center, Xi'an, Shaanxi, China
26
Belouali A, Gupta S, Sourirajan V, Yu J, Allen N, Alaoui A, Dutton MA, Reinhard MJ. Acoustic and language analysis of speech for suicidal ideation among US veterans. BioData Min 2021; 14:11. [PMID: 33531048 PMCID: PMC7856815 DOI: 10.1186/s13040-021-00245-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 01/20/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Screening for suicidal ideation in high-risk groups such as U.S. veterans is crucial for early detection and suicide prevention. Currently, screening is based on clinical interviews or self-report measures; both approaches rely on subjects to disclose their suicidal thoughts. Innovative approaches are necessary to develop objective and clinically applicable assessments. Speech has been investigated as an objective marker for understanding various mental states, including suicidal ideation. In this work, we developed a machine learning and natural language processing classifier based on speech markers to screen for suicidal ideation in US veterans. METHODOLOGY Veterans submitted 588 narrative audio recordings via a mobile app in a real-life setting. In addition, participants completed self-report psychiatric scales and questionnaires. The recordings were analyzed to extract voice characteristics, including prosodic, phonation, and glottal features. The recordings were also transcribed to extract textual features for linguistic analysis. We evaluated the acoustic and linguistic features using both statistical significance testing and ensemble feature selection. We also examined the performance of different machine learning algorithms on multiple combinations of features to classify suicidal and non-suicidal recordings. RESULTS A combined set of 15 acoustic and linguistic speech features was identified by the ensemble feature selection. A Random Forest classifier using the selected features correctly identified suicidal ideation in veterans with 86% sensitivity, 70% specificity, and an area under the receiver operating characteristic curve (AUC) of 80%. CONCLUSIONS Speech analysis of recordings collected from veterans in everyday life settings using smartphones offers a promising approach for detecting suicidal ideation. A machine learning classifier may eventually help clinicians identify and monitor high-risk veterans.
Affiliation(s)
- Anas Belouali
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA.
- Samir Gupta
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
- Vaibhav Sourirajan
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
- Jiawei Yu
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
- Nathaniel Allen
- War Related Illness and Injury Study Center, Veterans Affairs Medical Center, Washington, DC, USA
- Adil Alaoui
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
- Mary Ann Dutton
- Department of Psychiatry, Georgetown University Medical Center, Washington, DC, USA
- Matthew J Reinhard
- War Related Illness and Injury Study Center, Veterans Affairs Medical Center, Washington, DC, USA
- Department of Psychiatry, Georgetown University Medical Center, Washington, DC, USA
27

28
Mohammadi Y, Moradi MH. Prediction of Depression Severity Scores Based on Functional Connectivity and Complexity of the EEG Signal. Clin EEG Neurosci 2021; 52:52-60. [PMID: 33040603 DOI: 10.1177/1550059420965431] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
BACKGROUND Depression is one of the most common mental disorders and a leading cause of functional disability. This study aims to determine whether the functional connectivity and complexity of brain activity can predict the severity of depression (Beck Depression Inventory-II scores). METHODS Resting-state, eyes-closed EEG data were recorded from 60 depressed patients. A phase synchronization measure was used to estimate functional connectivity between all pairs of EEG channels in the delta (1-4 Hz), theta (4-8 Hz), alpha (8-13 Hz), and beta (13-30 Hz) frequency bands. To quantify the local value of functional connectivity, two graph theory metrics, degree and clustering coefficient (CC), were measured. Moreover, Lempel-Ziv complexity (LZC) and fuzzy entropy (FuzzyEn) were used to measure the complexity of the EEG signal. RESULTS Correlation analysis revealed a significant negative relationship between graph metrics and depression severity in the alpha band. The association was strongly positive for the complexity measures in the alpha and delta bands. A linear regression model also predicted depression severity well from alpha-band EEG features (r = 0.839; P < .0001; root mean square error of 7.69). CONCLUSION We found that the brain activity of patients with depression was related to depression severity, with abnormal brain activity reflecting greater severity. The presented regression model provides a quantitative prediction of depression severity, which may inform EEG-based assessment and shows potential for application in the medical treatment of depressive disorder.
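The pipeline above — phase synchronization between channel pairs, then per-node degree and clustering coefficient — can be sketched compactly. The phase-locking value (PLV) below is one common phase synchronization measure (the abstract does not name its exact measure), and the toy phase series are synthetic, not EEG.

```python
import numpy as np

def plv(phi1, phi2):
    """Phase locking value between two instantaneous-phase series."""
    return np.abs(np.exp(1j * (phi1 - phi2)).mean())

def graph_metrics(C, thr=0.5):
    """Binarize connectivity C at thr; return node degree and clustering."""
    A = (C > thr).astype(int)
    np.fill_diagonal(A, 0)
    k = A.sum(axis=0)
    cc = np.zeros(len(A))
    for i in range(len(A)):
        nb = np.flatnonzero(A[i])
        if len(nb) > 1:
            links = A[np.ix_(nb, nb)].sum() / 2       # edges among neighbors
            cc[i] = 2 * links / (len(nb) * (len(nb) - 1))
    return k, cc

# Toy phases for 3 channels: ch0 and ch1 phase-locked, ch2 random
rng = np.random.default_rng(3)
t = np.linspace(0, 1, 500)
p0 = 2 * np.pi * 10 * t                    # 10 Hz phase ramp
p1 = p0 + 0.3                              # constant lag -> PLV = 1
p2 = rng.uniform(0, 2 * np.pi, 500)        # unrelated phases
phases = [p0, p1, p2]
C = np.array([[plv(a, b) for b in phases] for a in phases])
deg, cc = graph_metrics(C)
```

In practice, instantaneous phases would come from band-filtered EEG via the Hilbert transform (e.g., `scipy.signal.hilbert`), with one connectivity matrix per frequency band.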
Affiliation(s)
- Yousef Mohammadi
- Biomedical Engineering Department, Amirkabir University of Technology, Tehran, Islamic Republic of Iran
| | - Mohammad Hassan Moradi
- Biomedical Engineering Department, Amirkabir University of Technology, Tehran, Islamic Republic of Iran
| |
29
Karnati M, Seal A, Yazidi A, Krejcar O. LieNet: A Deep Convolution Neural Networks Framework for Detecting Deception. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2021.3086011] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
30
What reveals about depression level? The role of multimodal features at the level of interview questions. INFORMATION & MANAGEMENT 2020. [DOI: 10.1016/j.im.2020.103349] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
31
Otberdout N, Kacem A, Daoudi M, Ballihi L, Berretti S. Automatic Analysis of Facial Expressions Based on Deep Covariance Trajectories. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:3892-3905. [PMID: 31725395 DOI: 10.1109/tnnls.2019.2947244] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In this article, we propose a new approach for facial expression recognition (FER) using deep covariance descriptors. The solution is based on the idea of encoding local and global deep convolutional neural network (DCNN) features extracted from still images into compact local and global covariance descriptors. The space of covariance matrices has the geometry of symmetric positive definite (SPD) matrices. By classifying static facial expressions with a support vector machine (SVM) using a valid Gaussian kernel on the SPD manifold, we show that deep covariance descriptors are more effective than standard classification with fully connected layers and softmax. In addition, we propose a completely new and original solution that models the temporal dynamics of facial expressions as deep trajectories on the SPD manifold. As an extension of the classification pipeline for covariance descriptors, we apply SVM with valid positive definite kernels derived from global alignment to classify deep covariance trajectories. Extensive experiments on the Oulu-CASIA, CK+, static facial expression in the wild (SFEW), and acted facial expressions in the wild (AFEW) datasets show that both the proposed static and dynamic approaches achieve state-of-the-art FER performance, outperforming many recent approaches.
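A covariance descriptor and an SPD-aware distance are simple to compute. The sketch below builds a regularized covariance descriptor from local features and compares two descriptors with the log-Euclidean distance, a common surrogate for geodesic distance on the SPD manifold (the paper's Gaussian kernel construction differs in detail; the grid size and feature dimension here are illustrative).

```python
import numpy as np

def cov_descriptor(F, eps=1e-6):
    """Covariance descriptor of local features F (n_regions, d),
    regularized so the result is strictly positive definite."""
    C = np.cov(F, rowvar=False)
    return C + eps * np.eye(C.shape[0])

def spd_log(C):
    """Matrix logarithm of an SPD matrix via eigendecomposition."""
    w, V = np.linalg.eigh(C)
    return (V * np.log(w)) @ V.T   # V diag(log w) V^T

def log_euclidean_dist(C1, C2):
    """Log-Euclidean distance between SPD matrices; usable inside a
    Gaussian kernel, e.g., exp(-d^2 / (2 sigma^2)), for an SVM."""
    return np.linalg.norm(spd_log(C1) - spd_log(C2), ord="fro")

rng = np.random.default_rng(4)
F1 = rng.normal(size=(49, 8))         # e.g., DCNN features from a 7x7 grid
F2 = rng.normal(size=(49, 8)) + 1.0   # another image's local features
C1, C2 = cov_descriptor(F1), cov_descriptor(F2)
d12 = log_euclidean_dist(C1, C2)
d11 = log_euclidean_dist(C1, C1)      # distance of a descriptor to itself
```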
32
Liu Y, Zhang X, Lin Y, Wang H. Facial Expression Recognition via Deep Action Units Graph Network Based on Psychological Mechanism. IEEE Trans Cogn Dev Syst 2020. [DOI: 10.1109/tcds.2019.2917711] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Indexed: 11/10/2022]
33
Su C, Xu Z, Pathak J, Wang F. Deep learning in mental health outcome research: a scoping review. Transl Psychiatry 2020; 10:116. [PMID: 32532967 PMCID: PMC7293215 DOI: 10.1038/s41398-020-0780-3] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Received: 08/31/2019] [Revised: 02/17/2020] [Accepted: 02/26/2020] [Indexed: 12/17/2022]
Abstract
Mental illnesses, such as depression, are highly prevalent and have been shown to impact an individual's physical health. Recently, artificial intelligence (AI) methods have been introduced to assist mental health providers, including psychiatrists and psychologists, for decision-making based on patients' historical data (e.g., medical records, behavioral data, social media usage, etc.). Deep learning (DL), as one of the most recent generation of AI technologies, has demonstrated superior performance in many real-world applications ranging from computer vision to healthcare. The goal of this study is to review existing research on applications of DL algorithms in mental health outcome research. Specifically, we first briefly overview the state-of-the-art DL techniques. Then we review the literature relevant to DL applications in mental health outcomes. According to the application scenarios, we categorize these relevant articles into four groups: diagnosis and prognosis based on clinical data, analysis of genetics and genomics data for understanding mental health conditions, vocal and visual expression data analysis for disease detection, and estimation of risk of mental illness using social media data. Finally, we discuss challenges in using DL algorithms to improve our understanding of mental health conditions and suggest several promising directions for their applications in improving mental health diagnosis and treatment.
Affiliation(s)
- Chang Su
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY, USA
- Zhenxing Xu
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY, USA
- Jyotishman Pathak
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY, USA
- Fei Wang
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY, USA
34
Rana R, Latif S, Gururajan R, Gray A, Mackenzie G, Humphris G, Dunn J. Automated screening for distress: A perspective for the future. Eur J Cancer Care (Engl) 2019; 28:e13033. [DOI: 10.1111/ecc.13033] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 01/09/2019] [Revised: 02/05/2019] [Accepted: 02/18/2019] [Indexed: 01/13/2023]
Affiliation(s)
- Rajib Rana
- University of Southern Queensland, Springfield, Queensland, Australia
- Siddique Latif
- University of Southern Queensland, Springfield, Queensland, Australia
- Raj Gururajan
- University of Southern Queensland, Springfield, Queensland, Australia
- Anthony Gray
- University of Southern Queensland, Springfield, Queensland, Australia
- Jeff Dunn
- University of Southern Queensland, Springfield, Queensland, Australia
- Griffith University, Brisbane, Queensland, Australia
- University of Technology Sydney, Sydney, New South Wales, Australia