1
|
Rehmani F, Shaheen Q, Anwar M, Faheem M, Bhatti SS. Depression detection with machine learning of structural and non-structural dual languages. Healthc Technol Lett 2024; 11:218-226. [PMID: 39100503 PMCID: PMC11294929 DOI: 10.1049/htl2.12088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Revised: 04/29/2024] [Accepted: 05/30/2024] [Indexed: 08/06/2024] Open
Abstract
Depression is a serious mental state that negatively impacts thoughts, feelings, and actions. Social media use is rapidly growing, with people expressing themselves in their regional languages. In Pakistan and India, many people use Roman Urdu on social media. This makes Roman Urdu important for predicting depression in these regions. However, previous studies show no significant contribution in predicting depression through Roman Urdu or in combination with structured languages like English. The study aims to create a Roman Urdu dataset to predict depression risk in dual languages [Roman Urdu (non-structural language) + English (structural language)]. Two datasets were used: Roman Urdu data manually converted from English on Facebook, and English comments from Kaggle. These datasets were merged for the research experiments. Machine learning models, including Support Vector Machine (SVM), Support Vector Machine Radial Basis Function (SVM-RBF), Random Forest (RF), and Bidirectional Encoder Representations from Transformers (BERT), were tested. Depression risk was classified into not depressed, moderate, and severe. Experimental studies show that the SVM achieved the best result with anaccuracy of 0.84% compared to existing models. The presented study refines thearea of depression to predict the depression in Asian countries.
Collapse
Affiliation(s)
- Filza Rehmani
- Department of Computer Science & Information TechnologyThe Islamia University of BahawalpurBannuPakistan
| | - Qaisar Shaheen
- Department of Computer Science & Information TechnologyThe Islamia University of BahawalpurBannuPakistan
| | - Muhammad Anwar
- Department of Information Sciences, Division of Science and TechnologyUniversity of EducationLahorePakistan
| | - Muhammad Faheem
- School of Technology and InnovationsUniversity of VaasaVaasaFinland
| | | |
Collapse
|
2
|
Zhang W, Kong L, Lee S, Chen Y, Zhang G, Wang H, Song M. Detecting mental and physical disorders using multi-task learning equipped with knowledge graph attention network. Artif Intell Med 2024; 149:102812. [PMID: 38462270 DOI: 10.1016/j.artmed.2024.102812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 01/19/2024] [Accepted: 02/12/2024] [Indexed: 03/12/2024]
Abstract
Mental and physical disorders (MPD) are inextricably linked in many medical cases; psychosomatic diseases can be induced by mental concerns and psychological discomfort can ensue from physiological diseases. However, existing medical informatics studies focus on identifying mental or physical disorders from a unilateral perspective. Consequently, no existing domain knowledge base, corpus, or detection modeling approach considers mental as well as physical aspects concurrently. This paper proposes a joint modeling approach to detect MPD. First, we crawl through online medical consultation records of patients from websites and build an MPD knowledge ontology by extracting the core conceptual features of the text. Based on the ontology, an MPD knowledge graph containing 12,673 nodes and 82,195 relations is obtained using term matching with a domain thesaurus of each concept. Subsequently, an MPD corpus with fine-grained severities (None, Mild, Moderate, Severe, Dangerous) and 8909 records is constructed by formulating MPD classification criteria and a data annotation process under the guidance of domain experts. Taking the knowledge graph and corpus as the dataset, we design a multi-task learning model to detect the MPD severity, in which a knowledge graph attention network (KGAT) is embedded to better extract knowledge features. Experiments are performed to demonstrate the effectiveness of our model. Furthermore, we employ ontology-based and centrality-based methods to discover additional potential inferred knowledge, which can be captured by KGAT so as to improve the prediction performance and interpretability of our model. Our dataset has been made publicly available, so it can be further used as a medical informatics reference in the fields of psychosomatic medicine, psychiatrics, physical co-morbidity, and so on.
Collapse
Affiliation(s)
- Wei Zhang
- School of Information Management, Nanjing Agricultural University, Nanjing 210095, China; Department of Library and Information Science, Yonsei University, Seoul 03722, Republic of Korea
| | - Ling Kong
- School of Information Management, Nanjing Agricultural University, Nanjing 210095, China; Department of Library and Information Science, Yonsei University, Seoul 03722, Republic of Korea
| | - Soobin Lee
- Department of Library and Information Science, Yonsei University, Seoul 03722, Republic of Korea
| | - Yan Chen
- College of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Guangxu Zhang
- The Affiliated Brain Hospital of Nanjing Medical University, Nanjing 210029, China
| | - Hao Wang
- School of Information Management, Nanjing University, Nanjing 210023, China; Jiangsu Key Laboratory of Data Engineering and Knowledge Service, Nanjing 210023, China
| | - Min Song
- Department of Library and Information Science, Yonsei University, Seoul 03722, Republic of Korea.
| |
Collapse
|
3
|
What users’ musical preference on Twitter reveals about psychological disorders. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2023.103269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
4
|
Gu D, Li M, Yang X, Gu Y, Zhao Y, Liang C, Liu H. An analysis of cognitive change in online mental health communities: A textual data analysis based on post replies of support seekers. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2022.103192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
5
|
Kumar KS, Radhamani A, Kumar TA. Sentiment lexicon for cross-domain adaptation with multi-domain dataset in Indian languages enhanced with BERT classification model. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-220448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Many websites are attempting to offer a platform for users or customers to leave their reviews and comments about the products or services in their native languages. The cross-domain adaptation (CDA) analyses sentiment across domains. The sentiment lexicon falls short resulting in issues like feature mismatch, sparsity, polarity mismatch and polysemy. In this research, an augmented sentiment dictionary is developed in our native regional language (Tamil) that intends to construct the contextual links between terms in multi-domain datasets to reduce problems like polarity mismatch, feature mismatch, and polysemy. Data from the source domain and target domain both labeled and unlabeled are used in the proposed dictionary. To be more specific, the initial dictionary uses normalised pointwise mutual information (nPMI) to derive contextual weight, whereas the final dictionary uses the value of terms across all reviews to compute the accurate rank score. Here, a deep learning model called BERT is used for sentiment classification. For cross-domain adaptation, a modified multi-layer fuzzy-based convolutional neural network (M-FCNN) is deployed. This work aims to build a single dictionary using large number of vocabularies for classifying the reviews in Tamil for several target domains. This extendible dictionary enhances the accuracy of CDA greatly when compared to existing baseline techniques and easily handles a large number of terms in different domains.
Collapse
Affiliation(s)
- K. Suresh Kumar
- IFET College of Engineering (Autonomous), Villupuram, Tamilnadu, India
| | - A.S. Radhamani
- Adhi College of Engineering and Technology, Kanchipuram, Tamilnadu, India
| | - T. Ananth Kumar
- IFET College of Engineering (Autonomous), Villupuram, Tamilnadu, India
| |
Collapse
|
6
|
A mental state Knowledge–aware and Contrastive Network for early stress and depression detection on social media. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.102961] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
7
|
Bitar H, Babour A, Nafa F, Alzamzami O, Alismail S. Increasing Women's Knowledge about HPV Using BERT Text Summarization: An Online Randomized Study. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:8100. [PMID: 35805761 PMCID: PMC9265758 DOI: 10.3390/ijerph19138100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Revised: 06/28/2022] [Accepted: 06/29/2022] [Indexed: 01/09/2023]
Abstract
Despite the availability of online educational resources about human papillomavirus (HPV), many women around the world may be prevented from obtaining the necessary knowledge about HPV. One way to mitigate the lack of HPV knowledge is the use of auto-generated text summarization tools. This study compares the level of HPV knowledge between women who read an auto-generated summary of HPV made using the BERT deep learning model and women who read a long-form text of HPV. We randomly assigned 386 women to two conditions: half read an auto-generated summary text about HPV (n = 193) and half read an original text about HPV (n = 193). We administrated measures of HPV knowledge that consisted of 29 questions. As a result, women who read the original text were more likely to correctly answer two questions on the general HPV knowledge subscale than women who read the summarized text. For the HPV testing knowledge subscale, there was a statistically significant difference in favor of women who read the original text for only one question. The final subscale, HPV vaccination knowledge questions, did not significantly differ across groups. Using BERT for text summarization has shown promising effectiveness in increasing women's knowledge and awareness about HPV while saving their time.
Collapse
Affiliation(s)
- Hind Bitar
- Information Systems Department, King Abdulaziz University, P.O. Box 80200, Jeddah 21589, Saudi Arabia;
| | - Amal Babour
- Information Systems Department, King Abdulaziz University, P.O. Box 80200, Jeddah 21589, Saudi Arabia;
| | - Fatema Nafa
- Computer Science Department, Salem State University, Salem, MA 01970, USA;
| | - Ohoud Alzamzami
- Computer Science Department, King Abdulaziz University, P.O. Box 80200, Jeddah 21589, Saudi Arabia;
| | - Sarah Alismail
- Center for Information Systems and Technology, Claremont Graduate University, Claremont, CA 91711, USA;
- Beckman Research Institute, City of Hope, Duarte, CA 91010, USA
| |
Collapse
|
8
|
Hu Y, Ding J, Dou Z, Chang H. Short-Text Classification Detector: A Bert-Based Mental Approach. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:8660828. [PMID: 35310586 PMCID: PMC8930219 DOI: 10.1155/2022/8660828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 02/19/2022] [Indexed: 01/24/2023]
Abstract
With the continuous development of the Internet, social media based on short text has become popular. However, the sparsity and shortness of essays will restrict the accuracy of text classification. Therefore, based on the Bert model, we capture the mental feature of reviewers and apply them for short text classification to improve its classification accuracy. Specifically, we construct a model text at the language level and fine tune the model to better embed mental features. To verify the accuracy of this method, we compare a variety of machine learning methods, such as support vector machine, convolution neural networks, and recurrent neural networks. The results show the following: (1) Through feature comparison, it is found that mental features can significantly improve the accuracy of short text classification. (2) Combining mental features and text as input vectors can provide more classification accuracy than separating them as two independent vectors. (3) Through model comparison, it can be found that Bert model can integrate mental features and short text. Bert can better capture mental features to improve the accuracy of classification results. This will help to promote the development of short text classification.
Collapse
Affiliation(s)
- Yongjun Hu
- School of Management, Guangzhou University, Guangzhou 510006, China
| | - Jia Ding
- School of Data and Computer Science, Sun Yat-sen University, Guangzhou 510006, China
| | - Zixin Dou
- School of Management, Guangzhou University, Guangzhou 510006, China
- Research Center of e-commerce, Guangzhou University, Guangzhou 510006, China
| | - Huiyou Chang
- School of Data and Computer Science, Sun Yat-sen University, Guangzhou 510006, China
| |
Collapse
|