1
|
Mao K, Wu Y, Chen J. A systematic review on automated clinical depression diagnosis. NPJ MENTAL HEALTH RESEARCH 2023; 2:20. [PMID: 38609509 PMCID: PMC10955993 DOI: 10.1038/s44184-023-00040-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 09/27/2023] [Indexed: 04/14/2024]
Abstract
Assessing mental health disorders and determining treatment can be difficult for a number of reasons, including access to healthcare providers. Assessments and treatments may not be continuous and can be limited by the unpredictable nature of psychiatric symptoms. Machine-learning models using data collected in a clinical setting can improve diagnosis and treatment. Studies have used speech, text, and facial expression analysis to identify depression. Still, more research is needed to address challenges such as the need for multimodality machine-learning models for clinical use. We conducted a review of studies from the past decade that utilized speech, text, and facial expression analysis to detect depression, as defined by the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), using the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guideline. We provide information on the number of participants, techniques used to assess clinical outcomes, speech-eliciting tasks, machine-learning algorithms, metrics, and other important discoveries for each study. A total of 544 studies were examined, 264 of which satisfied the inclusion criteria. A database has been created containing the query results and a summary of how different features are used to detect depression. While machine learning shows its potential to enhance mental health disorder evaluations, some obstacles must be overcome, especially the requirement for more transparent machine-learning models for clinical purposes. Considering the variety of datasets, feature extraction techniques, and metrics used in this field, guidelines have been provided to collect data and train machine-learning models to guarantee reproducibility and generalizability across different contexts.
Collapse
Affiliation(s)
- Kaining Mao
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Yuqi Wu
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Jie Chen
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada.
| |
Collapse
|
2
|
Barclay N, Kelley KA, Brausch AM, Muehlenkamp JJ, Nadorff MR. Changes in the suicide risk behaviors of American college students over time: An analysis of three universities. Suicide Life Threat Behav 2023; 53:764-775. [PMID: 37515442 DOI: 10.1111/sltb.12981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 06/06/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023]
Abstract
INTRODUCTION Suicide-related behaviors are prevalent among college students, and several mental health problems associated with increased suicide risk have increased over time. Furthermore, notable cultural events (e.g., political changes, COVID-19) have occurred in the past decade, which likely impact trends in suicide-related behaviors. The current study examined how the prevalence of nonsuicidal self-injury (NSSI), suicidal ideation (SI), and suicide attempts has changed from 2012 to 2022 across three different universities. METHOD Archival datasets from multiple years of college student survey data were compiled, and different measures of NSSI, SI, and suicide attempts were dichotomized to assess prevalence. Chi-square goodness-of-fit tests were used to identify changes in suicide-related behaviors across time. RESULTS Results indicated significant increases in the prevalence of most behaviors across each university, with most increases occurring after 2018. Despite sharing a general trend of increased suicide-related behaviors, each university differed considerably in their respective trends between various timepoints, suggesting that unique factors may differentially contribute to growing risk among college students. CONCLUSION Overall, the current study identifies increasing trends in suicide-related behaviors over the past decade and highlights the value of investigating these behaviors at the university level.
Collapse
Affiliation(s)
- Nathan Barclay
- Department of Psychology, Mississippi State University, Starkville, Mississippi, USA
| | - Karen A Kelley
- Department of Psychology, Mississippi State University, Starkville, Mississippi, USA
| | - Amy M Brausch
- Department of Psychological Sciences, Western Kentucky University, Bowling Green, Kentucky, USA
| | - Jennifer J Muehlenkamp
- Department of Psychology, University of Wisconsin-Eau Claire, Eau Claire, Wisconsin, USA
| | - Michael R Nadorff
- Department of Psychology, Mississippi State University, Starkville, Mississippi, USA
| |
Collapse
|
3
|
Biester L, Pennebaker J, Mihalcea R. Emotional and cognitive changes surrounding online depression identity claims. PLoS One 2022; 17:e0278179. [PMID: 36454809 PMCID: PMC9714698 DOI: 10.1371/journal.pone.0278179] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 11/13/2022] [Indexed: 12/02/2022] Open
Abstract
As social media has proliferated, a key aspect to making meaningful connections with people online has been revealing important parts of one's identity. In this work, we study changes that occur in people's language use after they share a specific piece of their identity: a depression diagnosis. To do so, we collect data from over five thousand users who have made such a statement, which we refer to as an identity claim. Prior to making a depression identity claim, the Reddit user's language displays evidence of increasingly higher rates of anxiety, sadness, and cognitive processing language compared to matched controls. After the identity claim, these language markers decrease and more closely match the controls. Similarly, first person singular pronoun usage decreases following the identity claim, which was previously previously found to be indicative of self-focus and associated with depression. By further considering how and to whom people express their identity, we find that the observed longitudinal changes are larger for those who do so in ways that are more correlated with seeking help (sharing in a post instead of a comment; sharing in a mental health support forum). This work suggests that there may be benefits to sharing one's depression diagnosis, especially in a semi-anonymous forum where others are likely to be empathetic.
Collapse
Affiliation(s)
- Laura Biester
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| | - James Pennebaker
- Department of Psychology, The University of Texas at Austin, Austin, Texas, United States of America
| | - Rada Mihalcea
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
4
|
Identifying suicidal emotions on social media through transformer-based deep learning. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04060-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
5
|
Psycholinguistic changes in the communication of adolescent users in a suicidal ideation online community during the COVID-19 pandemic. Eur Child Adolesc Psychiatry 2022; 32:975-985. [PMID: 36018514 PMCID: PMC9415261 DOI: 10.1007/s00787-022-02067-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Accepted: 08/05/2022] [Indexed: 11/03/2022]
Abstract
Since the outbreak of the COVID-19 pandemic, increases in suicidal ideation and suicide attempts in adolescents have been registered. Many adolescents experiencing suicidal ideation turn to online communities for social support. In this retrospective observational study, we investigated the communication-language style, contents and user activity-in 7975 unique posts and 51,119 comments by N = 2862 active adolescent users in a large suicidal ideation support community (SISC) on the social media website reddit.com in the onset period of the COVID-19 pandemic. We found significant relative changes in language style markers for hopelessness such as negative emotion words (+ 10.00%) and positive emotion words (- 3.45%) as well as for social disengagement such as social references (- 8.63%) and 2nd person pronouns (- 33.97%) since the outbreak of the pandemic. Using topic modeling with Latent Dirichlet Allocation (LDA), we identified significant changes in content for the topics Hopelessness (+ 23.98%), Suicide Methods (+ 17.11%), Social Support (- 14.91%), and Reaching Out to users (- 28.97%). Changes in user activity point to an increased expression of mental health issues and decreased engagement with other users. The results indicate a potential shift in communication patterns with more adolescent users expressing their suicidal ideation rather than relating with or supporting other users during the COVID-19 pandemic.
Collapse
|
6
|
Salmi S, Mérelle S, Gilissen R, van der Mei R, Bhulai S. Detecting changes in help seeker conversations on a suicide prevention helpline during the COVID- 19 pandemic: in-depth analysis using encoder representations from transformers. BMC Public Health 2022; 22:530. [PMID: 35300638 PMCID: PMC8930480 DOI: 10.1186/s12889-022-12926-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 02/28/2022] [Indexed: 11/16/2022] Open
Abstract
Background Preventatives measures to combat the spread of COVID− 19 have introduced social isolation, loneliness and financial stress. This study aims to identify whether the COVID-19 pandemic is related to changes in suicide-related problems for help seekers on a suicide prevention helpline. Methods A retrospective cohort study was conducted using chat data from a suicide prevention helpline in the Netherlands. The natural language processing method BERTopic was used to detect common topics in messages from December 1, 2019 until June 1, 2020 (N = 8589). Relative topic occurrence was compared before and during the lock down starting on March 23, 2020. The observed changes in topic usage were likewise analyzed for male and female, younger and older help seekers and help seekers living alone. Results The topic of the COVID-19 pandemic saw an 808% increase in relative occurrence after the lockdown. Furthermore, the results show that help seeker increased mention of thanking the counsellor (+ 15%), and male and young help seekers were grateful for the conversation (+ 45% and + 32% respectively). Coping methods such as watching TV (− 21%) or listening to music (− 15%) saw a decreased mention. Plans for suicide (− 9%) and plans for suicide at a specific location (− 15%) also saw a decreased mention. However, plans for suicide were mentioned more frequently by help seekers over 30 years old (+ 11%) or who live alone and (+ 52%). Furthermore, male help seekers talked about contact with emergency care (+ 43%) and panic and anxiety (+ 24%) more often. Negative emotions (+ 22%) and lack of self-confidence (+ 15%) were mentioned more often by help seekers under 30, and help seekers over 30 saw an increased mention of substance abuse (+ 9%). Conclusion While mentions of distraction, social interaction and plans for suicide decreased, expressions of gratefulness for the helpline increased, highlighting the importance of contact to help seekers during the lockdown. Help seekers under 30, male or who live alone, showed changes that negatively related to suicidality and should be monitored closely.
Collapse
Affiliation(s)
- Salim Salmi
- Centrum Wiskunde & Informatica, Amsterdam, the Netherlands.
| | | | | | | | - Sandjai Bhulai
- Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| |
Collapse
|
7
|
Liu D, Feng XL, Ahmed F, Shahid M, Guo J. Detecting and Measuring Depression on Social Media Using a Machine Learning Approach: Systematic Review. JMIR Ment Health 2022; 9:e27244. [PMID: 35230252 PMCID: PMC8924784 DOI: 10.2196/27244] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 05/26/2021] [Accepted: 12/16/2021] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Detection of depression gained prominence soon after this troublesome disease emerged as a serious public health concern worldwide. OBJECTIVE This systematic review aims to summarize the findings of previous studies concerning applying machine learning (ML) methods to text data from social media to detect depressive symptoms and to suggest directions for future research in this area. METHODS A bibliographic search was conducted for the period of January 1990 to December 2020 in Google Scholar, PubMed, Medline, ERIC, PsycINFO, and BioMed. Two reviewers retrieved and independently assessed the 418 studies consisting of 322 articles identified through database searching and 96 articles identified through other sources; 17 of the studies met the criteria for inclusion. RESULTS Of the 17 studies, 10 had identified depression based on researcher-inferred mental status, 5 had identified it based on users' own descriptions of their mental status, and 2 were identified based on community membership. The ML approaches of 13 of the 17 studies were supervised learning approaches, while 3 used unsupervised learning approaches; the remaining 1 study did not describe its ML approach. Challenges in areas such as sampling, optimization of approaches to prediction and their features, generalizability, privacy, and other ethical issues call for further research. CONCLUSIONS ML approaches applied to text data from users on social media can work effectively in depression detection and could serve as complementary tools in public mental health practice.
Collapse
Affiliation(s)
- Danxia Liu
- School of Sociology, Huazhong University of Science and Technology, Wuhan, China
| | - Xing Lin Feng
- Department of Health Policy and Management, School of Public Health, Peking University, Beijing, China
| | - Farooq Ahmed
- Department of Anthropology, University of Washington Seattle, Seattle, WA, United States.,Department of Anthropology, Quaid-I-Azam University Islamabad, Islamabad, Pakistan
| | - Muhammad Shahid
- School of Insurance and Economics, University of International Business and Economics, Beijing, China
| | - Jing Guo
- Department of Health Policy and Management, School of Public Health, Peking University, Beijing, China
| |
Collapse
|
8
|
Morese R, Gruebner O, Sykora M, Elayan S, Fadda M, Albanese E. Detecting Suicide Ideation in the Era of Social Media: The Population Neuroscience Perspective. Front Psychiatry 2022; 13:652167. [PMID: 35492693 PMCID: PMC9046648 DOI: 10.3389/fpsyt.2022.652167] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 03/21/2022] [Indexed: 11/13/2022] Open
Abstract
Social media platforms are increasingly used across many population groups not only to communicate and consume information, but also to express symptoms of psychological distress and suicidal thoughts. The detection of suicidal ideation (SI) can contribute to suicide prevention. Twitter data suggesting SI have been associated with negative emotions (e.g., shame, sadness) and a number of geographical and ecological variables (e.g., geographic location, environmental stress). Other important research contributions on SI come from studies in neuroscience. To date, very few research studies have been conducted that combine different disciplines (epidemiology, health geography, neurosciences, psychology, and social media big data science), to build innovative research directions on this topic. This article aims to offer a new interdisciplinary perspective, that is, a Population Neuroscience perspective on SI in order to highlight new ways in which multiple scientific fields interact to successfully investigate emotions and stress in social media to detect SI in the population. We argue that a Population Neuroscience perspective may help to better understand the mechanisms underpinning SI and to promote more effective strategies to prevent suicide timely and at scale.
Collapse
Affiliation(s)
- Rosalba Morese
- Faculty of Biomedical Sciences, Università della Svizzera italiana, Lugano, Switzerland.,Faculty of Communication, Culture and Society, Università della Svizzera italiana, Lugano, Switzerland
| | - Oliver Gruebner
- Department of Epidemiology, Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zürich, Switzerland.,Department of Geography, University of Zurich, Zürich, Switzerland
| | - Martin Sykora
- Centre for Information Management (CIM), School of Business and Economics, Loughborough University, Loughborough, United Kingdom
| | - Suzanne Elayan
- Centre for Information Management (CIM), School of Business and Economics, Loughborough University, Loughborough, United Kingdom
| | - Marta Fadda
- Faculty of Biomedical Sciences, Università della Svizzera italiana, Lugano, Switzerland
| | - Emiliano Albanese
- Faculty of Biomedical Sciences, Università della Svizzera italiana, Lugano, Switzerland
| |
Collapse
|
9
|
Chancellor S, Sumner SA, David-Ferdon C, Ahmad T, De Choudhury M. Suicide Risk and Protective Factors in Online Support Forum Posts: Annotation Scheme Development and Validation Study. JMIR Ment Health 2021; 8:e24471. [PMID: 34747705 PMCID: PMC8663675 DOI: 10.2196/24471] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 03/17/2021] [Accepted: 06/03/2021] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Online communities provide support for individuals looking for help with suicidal ideation and crisis. As community data are increasingly used to devise machine learning models to infer who might be at risk, there have been limited efforts to identify both risk and protective factors in web-based posts. These annotations can enrich and augment computational assessment approaches to identify appropriate intervention points, which are useful to public health professionals and suicide prevention researchers. OBJECTIVE This qualitative study aims to develop a valid and reliable annotation scheme for evaluating risk and protective factors for suicidal ideation in posts in suicide crisis forums. METHODS We designed a valid, reliable, and clinically grounded process for identifying risk and protective markers in social media data. This scheme draws on prior work on construct validity and the social sciences of measurement. We then applied the scheme to annotate 200 posts from r/SuicideWatch-a Reddit community focused on suicide crisis. RESULTS We documented our results on producing an annotation scheme that is consistent with leading public health information coding schemes for suicide and advances attention to protective factors. Our study showed high internal validity, and we have presented results that indicate that our approach is consistent with findings from prior work. CONCLUSIONS Our work formalizes a framework that incorporates construct validity into the development of annotation schemes for suicide risk on social media. This study furthers the understanding of risk and protective factors expressed in social media data. This may help public health programming to prevent suicide and computational social science research and investigations that rely on the quality of labels for downstream machine learning tasks.
Collapse
Affiliation(s)
- Stevie Chancellor
- Department of Computer Science & Engineering, University of Minnesota - Twin Cities, Minneapolis, MN, United States
| | - Steven A Sumner
- Office of Strategy and Innovation, National Center for Injury Prevention and Control, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Corinne David-Ferdon
- Division of Violence Prevention, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Tahirah Ahmad
- School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA, United States
| | - Munmun De Choudhury
- School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA, United States
| |
Collapse
|
10
|
Zhang S, Liu M, Li Y, Chung JE. Teens' Social Media Engagement during the COVID-19 Pandemic: A Time Series Examination of Posting and Emotion on Reddit. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph181910079. [PMID: 34639381 PMCID: PMC8507823 DOI: 10.3390/ijerph181910079] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 09/18/2021] [Accepted: 09/22/2021] [Indexed: 12/26/2022]
Abstract
Research has rarely examined how the COVID-19 pandemic may affect teens' social media engagement and psychological wellbeing, and even less research has compared the difference between teens with and without mental health concerns. We collected and analyzed weekly data from January to December 2020 from teens in four Reddit communities (subreddits), including teens in r/Teenagers and teens who participated in three mental health subreddits (r/Depression, r/Anxiety, and r/SuicideWatch). The results showed that teens' weekly subreddit participation, posting/commenting frequency, and emotion expression were related to significant pandemic events. Teen Redditors on r/Teenagers had a higher posting/commenting frequency but lower negative emotion than teen Redditors on the three mental health subreddits. When comparing posts/comments on r/Teenagers, teens who ever visited one of the three mental health subreddits posted/commented twice as frequently as teens who did not, but their emotion expression was similar. The results from the Interrupted Time Series Analysis (ITSA) indicated that both teens with and without mental health concerns reversed the trend in posting frequency and negative emotion from declining to increasing right after the pandemic outbreak, and teens with mental health concerns had a more rapidly increasing trend in posting/commenting. The findings suggest that teens' social media engagement and emotion expression reflect the pandemic evolution. Teens with mental health concerns are more likely to reveal their emotions on specialized mental health subreddits rather than on the general r/Teenagers subreddit. In addition, the findings indicated that teens with mental health concerns had a strong social interaction desire that various barriers in the real world may inhibit. The findings call for more attention to understand the pandemic's influence on teens by monitoring and analyzing social media data and offering adequate support to teens regarding their mental health wellbeing.
Collapse
Affiliation(s)
- Saijun Zhang
- Department of Social Work, University of Mississippi, Oxford, MS 38677, USA
- Correspondence:
| | - Meirong Liu
- School of Social Work, Howard University, Washington, DC 20059, USA;
| | - Yeefay Li
- Thomas Jefferson High School for Science and Technology, Alexandria, VA 22312, USA
| | - Jae Eun Chung
- Cathy Hughes School of Communications, Howard University, Washington, DC 20059, USA;
| |
Collapse
|
11
|
D'Hotman D, Loh E. AI enabled suicide prediction tools: a qualitative narrative review. BMJ Health Care Inform 2021; 27:bmjhci-2020-100175. [PMID: 33037037 PMCID: PMC7549453 DOI: 10.1136/bmjhci-2020-100175] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 08/19/2020] [Accepted: 08/21/2020] [Indexed: 12/18/2022] Open
Abstract
Background: Suicide poses a significant health burden worldwide. In many cases, people at risk of suicide do not engage with their doctor or community due to concerns about stigmatisation and forced medical treatment; worse still, people with mental illness (who form a majority of people who die from suicide) may have poor insight into their mental state, and not self-identify as being at risk. These issues are exacerbated by the fact that doctors have difficulty in identifying those at risk of suicide when they do present to medical services. Advances in artificial intelligence (AI) present opportunities for the development of novel tools for predicting suicide. Method: We searched Google Scholar and PubMed for articles relating to suicide prediction using artificial intelligence from 2017 onwards. Conclusions: This paper presents a qualitative narrative review of research focusing on two categories of suicide prediction tools: medical suicide prediction and social suicide prediction. Initial evidence is promising: AI-driven suicide prediction could improve our capacity to identify those at risk of suicide, and, potentially, save lives. Medical suicide prediction may be relatively uncontroversial when it pays respect to ethical and legal principles; however, further research is required to determine the validity of these tools in different contexts. Social suicide prediction offers an exciting opportunity to help identify suicide risk among those who do not engage with traditional health services. Yet, efforts by private companies such as Facebook to use online data for suicide prediction should be the subject of independent review and oversight to confirm safety, effectiveness and ethical permissibility.
Collapse
Affiliation(s)
- Daniel D'Hotman
- Oxford Uehiro Centre for Practical Ethics, University of Oxford, Oxford, United Kingdom
| | - Erwin Loh
- Monash Centre for Health Research and Implementation, Monash University, Clayton, Victoria, Australia.,Group Chief Medical Officer, St Vincent's Health Australia Ltd, East Melbourne, Victoria, Australia
| |
Collapse
|
12
|
Li J, Zhang S, Zhang Y, Lin H, Wang J. Multifeature Fusion Attention Network for Suicide Risk Assessment Based on Social Media: Algorithm Development and Validation. JMIR Med Inform 2021; 9:e28227. [PMID: 34255687 PMCID: PMC8304127 DOI: 10.2196/28227] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 04/30/2021] [Accepted: 05/05/2021] [Indexed: 11/13/2022] Open
Abstract
Background Suicide has become the fifth leading cause of death worldwide. With development of the internet, social media has become an imperative source for studying psychological illnesses such as depression and suicide. Many methods have been proposed for suicide risk assessment. However, most of the existing methods cannot grasp the key information of the text. To solve this problem, we propose an efficient method to extract the core information from social media posts for suicide risk assessment. Objective We developed a multifeature fusion recurrent attention model for suicide risk assessment. Methods We used the bidirectional long short-term memory network to create the text representation with context information from social media posts. We further introduced a self-attention mechanism to extract the core information. We then fused linguistic features to improve our model. Results We evaluated our model on the dataset delivered by the Computational Linguistics and Clinical Psychology 2019 shared task. The experimental results showed that our model improves the risk-F1, urgent-F1, and existence-F1 by 3.3%, 0.9%, and 3.7%, respectively. Conclusions We found that bidirectional long short-term memory performs well for long text representation, and the attention mechanism can identify the key information in the text. The external features can complete the semantic information lost by the neural network during feature extraction and further improve the performance of the model. The experimental results showed that our model performs better than the state-of-the-art method. Our work has theoretical and practical value for suicidal risk assessment.
Collapse
Affiliation(s)
- Jiacheng Li
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Shaowu Zhang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Yijia Zhang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Hongfei Lin
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Jian Wang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| |
Collapse
|
13
|
Dutta R, Gkotsis G, Velupillai S, Bakolis I, Stewart R. Temporal and diurnal variation in social media posts to a suicide support forum. BMC Psychiatry 2021; 21:259. [PMID: 34011346 PMCID: PMC8136175 DOI: 10.1186/s12888-021-03268-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 05/06/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Rates of suicide attempts and deaths are highest on Mondays and these occur more frequently in the morning or early afternoon, suggesting weekly temporal and diurnal variation in suicidal behaviour. It is unknown whether there are similar time trends on social media, of posts relevant to suicide. We aimed to determine temporal and diurnal variation in posting patterns on the Reddit forum SuicideWatch, an online community for individuals who might be at risk of, or who know someone at risk of suicide. METHODS We used time series analysis to compare date and time stamps of 90,518 SuicideWatch posts from 1st December 2008 to 31st August 2015 to (i) 6,616,431 posts on the most commonly subscribed general subreddit, AskReddit and (ii) 66,934 of these AskReddit posts, which were posted by the SuicideWatch authors. RESULTS Mondays showed the highest proportion of posts on SuicideWatch. Clear diurnal variation was observed, with a peak in the early morning (2:00-5:00 h), and a subsequent decrease to a trough in late morning/early afternoon (11:00-14:00 h). Conversely, the highest volume of posts in the control data was between 20:00-23:00 h. CONCLUSIONS Posts on SuicideWatch occurred most frequently on Mondays: the day most associated with suicide risk. The early morning peak in SuicideWatch posts precedes the time of day during which suicide attempts and deaths most commonly occur. Further research of these weekly and diurnal rhythms should help target populations with support and suicide prevention interventions when needed most.
Collapse
Affiliation(s)
- Rina Dutta
- Department of Psychological Medicine, School of Academic Psychiatry, King’s College London, IoPPN, PO Box 84, 3rd Floor East Wing, Room E3.07, De Crespigny Park, London, SE5 8AF UK
- South London and Maudsley NHS Foundation Trust, London, UK
| | - George Gkotsis
- Department of Psychological Medicine, School of Academic Psychiatry, King’s College London, IoPPN, PO Box 84, 3rd Floor East Wing, Room E3.07, De Crespigny Park, London, SE5 8AF UK
| | - Sumithra Velupillai
- Department of Psychological Medicine, School of Academic Psychiatry, King’s College London, IoPPN, PO Box 84, 3rd Floor East Wing, Room E3.07, De Crespigny Park, London, SE5 8AF UK
- School of Electrical Engineering and Computer Science, KTH, Stockholm, Sweden
| | - Ioannis Bakolis
- Department of Psychological Medicine, School of Academic Psychiatry, King’s College London, IoPPN, PO Box 84, 3rd Floor East Wing, Room E3.07, De Crespigny Park, London, SE5 8AF UK
| | - Robert Stewart
- Department of Psychological Medicine, School of Academic Psychiatry, King’s College London, IoPPN, PO Box 84, 3rd Floor East Wing, Room E3.07, De Crespigny Park, London, SE5 8AF UK
- South London and Maudsley NHS Foundation Trust, London, UK
| |
Collapse
|
14
|
Mok K, Chen N. Characteristics of Online Comments Following High-Profile Celebrity Suicide. CRISIS 2021; 43:348-351. [PMID: 34003023 DOI: 10.1027/0227-5910/a000791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Background: On the Internet, individuals can freely read about or talk to others about suicide. However, little is known about the nature of these online interactions and the potential impact on users. Aims: This study aimed to examine the characteristics of online comments following high-profile celebrity suicide, comparing top-rated comments with controversial comments. Method: Comments from a popular thread on Reddit made following the deaths of Kate Spade and Anthony Bourdain were examined using Linguistic Inquiry and Word Count and qualitative content analysis. Results: Top-scoring comments were associated with a higher level of authenticity, a higher word count, and a greater focus on the past. These comments were characterized by personal stories of experiences with suicidality or knowing someone who had attempted/died by suicide. Limitations: Our small sample size was underpowered for the linguistic characteristic analyses, and differences in some characteristics may not have been identified. Conclusions: Despite concerns over the potential dangers of the Internet on suicide, it can serve as a place for individuals to share personal stories and obtain support from others.
Collapse
Affiliation(s)
- Katherine Mok
- Black Dog Institute, Prince of Wales Hospital, Randwick, NSW, Australia.,Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Nicola Chen
- Black Dog Institute, Prince of Wales Hospital, Randwick, NSW, Australia.,Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|
15
|
Kaufman MR, Bazell AT, Collaco A, Sedoc J. "This show hits really close to home on so many levels": An analysis of Reddit comments about HBO's Euphoria to understand viewers' experiences of and reactions to substance use and mental illness. Drug Alcohol Depend 2021; 220:108468. [PMID: 33540349 PMCID: PMC8183393 DOI: 10.1016/j.drugalcdep.2020.108468] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 11/20/2020] [Accepted: 11/29/2020] [Indexed: 12/28/2022]
Abstract
BACKGROUND Public health has begun using social media forums such as Reddit to enhance surveillance and modernize interventions for young people. The current study's objective was to examine Reddit posts about the HBO series Euphoria to identify show themes that resonate with adolescent and young adult viewers in order to inform future social media interventions. METHODS Reddit comments in the r/television community from June to August 2019 were downloaded. Following filtering, 725 comments were analyzed and coded using a codebook and ATLAS.ti. Coded comments were analyzed for themes relevant to Redditor substance use, reactions to Euphoria and the main character (Rue), and mental health concerns. RESULTS During their discussion of the show, Redditors disclosed both personal recreational and prescription drug use, including substance use to cope with mental illness symptoms. There were approximately equal numbers of comments with positive and negative reactions to the show overall and to the main character, Rue. Redditors often found Euphoria's storyline and portrayed events to be relatable and realistic to the experience of young people who use drugs, as well as sometimes triggering. Overall, Redditors thought Rue accurately depicted an individual's struggle with a substance use disorder. CONCLUSIONS This exploratory study highlights how television and social media can contribute to young peoples' understanding of substance use disorders and mental health. Findings could inform the design of social media interventions for adolescents and young adults on a variety of substance use issues, including stigma and the interconnectedness of substance use and mental health challenges.
Collapse
Affiliation(s)
| | - Alicia T Bazell
- Johns Hopkins Bloomberg School of Public Health, United States
| | - Anne Collaco
- Johns Hopkins Bloomberg School of Public Health, United States
| | - João Sedoc
- New York University, Stern School of Business, United States
| |
Collapse
|
16
|
Abstract
OBJECTIVE To introduce the research methods of computerized text mining and its possible applications in suicide research and to demonstrate the procedures of applying a specific text mining area, document classification, to a suicide-related study. METHOD A systematic search of academic papers that applied text mining methods to suicide research was conducted. Relevant papers were reviewed focusing on their research objectives and sources of data. Furthermore, a case of using natural language processing and document classification methods to analyze a large amount of suicide news was elaborated to showcase the methods. RESULTS Eighty-six papers using text mining methods for suicide research have been published since 2001. The most common research objective (72.1%) was to classify which documents exhibit suicide risk or were written by suicidal people. The most frequently used data source was online social media posts (45.3%), followed by e-healthcare records (25.6%). For the news classification case, the top three classifiers trained for classification tasks achieved 84% or higher accuracy. CONCLUSIONS Computerized text mining methods can help to scale up content analysis capacity and efficiency and uncover new insights and perspectives for suicide research.
Collapse
Affiliation(s)
- Qijin Cheng
- Department of Social Work, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Carrie S M Lui
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
17
|
Yao H, Rashidian S, Dong X, Duanmu H, Rosenthal RN, Wang F. Detection of Suicidality Among Opioid Users on Reddit: Machine Learning-Based Approach. J Med Internet Res 2020; 22:e15293. [PMID: 33245287 PMCID: PMC7732714 DOI: 10.2196/15293] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 06/14/2020] [Accepted: 09/15/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND In recent years, both suicide and overdose rates have been increasing. Many individuals who struggle with opioid use disorder are prone to suicidal ideation; this may often result in overdose. However, these fatal overdoses are difficult to classify as intentional or unintentional. Intentional overdose is difficult to detect, partially due to the lack of predictors and social stigmas that push individuals away from seeking help. These individuals may instead use web-based means to articulate their concerns. OBJECTIVE This study aimed to extract posts of suicidality among opioid users on Reddit using machine learning methods. The performance of the models is derivative of the data purity, and the results will help us to better understand the rationale of these users, providing new insights into individuals who are part of the opioid epidemic. METHODS Reddit posts between June 2017 and June 2018 were collected from r/suicidewatch, r/depression, a set of opioid-related subreddits, and a control subreddit set. We first classified suicidal versus nonsuicidal languages and then classified users with opioid usage versus those without opioid usage. Several traditional baselines and neural network (NN) text classifiers were trained using subreddit names as the labels and combinations of semantic inputs. We then attempted to extract out-of-sample data belonging to the intersection of suicide ideation and opioid abuse. Amazon Mechanical Turk was used to provide labels for the out-of-sample data. RESULTS Classification results were at least 90% across all models for at least one combination of input; the best classifier was convolutional neural network, which obtained an F1 score of 96.6%. When predicting out-of-sample data for posts containing both suicidal ideation and signs of opioid addiction, NN classifiers produced more false positives and traditional methods produced more false negatives, which is less desirable for predicting suicidal sentiments. CONCLUSIONS Opioid abuse is linked to the risk of unintentional overdose and suicide risk. Social media platforms such as Reddit contain metadata that can aid machine learning and provide information at a personal level that cannot be obtained elsewhere. We demonstrate that it is possible to use NNs as a tool to predict an out-of-sample target with a model built from data sets labeled by characteristics we wish to distinguish in the out-of-sample target.
Collapse
Affiliation(s)
- Hannah Yao
- Stony Brook University, Stony Brook, NY, United States
| | | | - Xinyu Dong
- Stony Brook University, Stony Brook, NY, United States
| | - Hongyi Duanmu
- Stony Brook University, Stony Brook, NY, United States
| | - Richard N Rosenthal
- Renaissance School of Medicine at Stony Brook University, Stony Brook, NY, United States
| | - Fusheng Wang
- Stony Brook University, Stony Brook, NY, United States
| |
Collapse
|
18
|
Gozzi N, Tizzani M, Starnini M, Ciulla F, Paolotti D, Panisson A, Perra N. Collective Response to Media Coverage of the COVID-19 Pandemic on Reddit and Wikipedia: Mixed-Methods Analysis. J Med Internet Res 2020; 22:e21597. [PMID: 32960775 PMCID: PMC7553788 DOI: 10.2196/21597] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/31/2020] [Accepted: 09/09/2020] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND The exposure and consumption of information during epidemic outbreaks may alter people's risk perception and trigger behavioral changes, which can ultimately affect the evolution of the disease. It is thus of utmost importance to map the dissemination of information by mainstream media outlets and the public response to this information. However, our understanding of this exposure-response dynamic during the COVID-19 pandemic is still limited. OBJECTIVE The goal of this study is to characterize the media coverage and collective internet response to the COVID-19 pandemic in four countries: Italy, the United Kingdom, the United States, and Canada. METHODS We collected a heterogeneous data set including 227,768 web-based news articles and 13,448 YouTube videos published by mainstream media outlets, 107,898 user posts and 3,829,309 comments on the social media platform Reddit, and 278,456,892 views of COVID-19-related Wikipedia pages. To analyze the relationship between media coverage, epidemic progression, and users' collective web-based response, we considered a linear regression model that predicts the public response for each country given the amount of news exposure. We also applied topic modelling to the data set using nonnegative matrix factorization. RESULTS Our results show that public attention, quantified as user activity on Reddit and active searches on Wikipedia pages, is mainly driven by media coverage; meanwhile, this activity declines rapidly while news exposure and COVID-19 incidence remain high. Furthermore, using an unsupervised, dynamic topic modeling approach, we show that while the levels of attention dedicated to different topics by media outlets and internet users are in good accordance, interesting deviations emerge in their temporal patterns. CONCLUSIONS Overall, our findings offer an additional key to interpret public perception and response to the current global health emergency and raise questions about the effects of attention saturation on people's collective awareness and risk perception and thus on their tendencies toward behavioral change.
Collapse
|
19
|
Rosen G, Kreiner H, Levi-Belz Y. Public Response to Suicide News Reports as Reflected in Computerized Text Analysis of Online Reader Comments. Arch Suicide Res 2020; 24:243-259. [PMID: 30636527 DOI: 10.1080/13811118.2018.1563578] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Previous research has documented the rise in rates of suicidal behaviors following media reports of celebrity suicide. Whereas most research has focused on documenting and analyzing suicide rates, little is known about more subtle psychological effects of celebrity suicide on the public, such as despair and feelings of abandonment. The Internet has revolutionized the responses to news reports, enabling immediate and anonymous responses potentially reflecting these psychological processes. Thus, the current study explored the unique psychological impact of a celebrity suicide on the public by analyzing the big data of readers' comments to suicide news reports, using computational linguistics methods. Readers' comments (N = 14,506) to suicide news reports were retrieved from 4 leading online news sites. The comments were posted in response to 1 of 1 types of reports: a celebrity suicide (Robin Williams), a non-celebrity suicide, and general reports of suicide as a social phenomenon. LIWC software for computerized linguistic analysis was used to calculate the frequency of the various types of words used. Comparison of the responses to the 3 types of suicide reports revealed higher frequency of first-person pronouns and for emotionally charged words on comments to a celebrity suicide, compared with comments to the other types of suicide reports. The findings suggest that celebrity suicide news reports evoke the expression of positive emotions, possibly related to the venerated celebrity, alongside negative, internalized emotions, and feelings of social isolation. Theoretical, practical, and methodological implications are discussed.
Collapse
|
20
|
Suicide on Facebook-the tales of unnoticed departure in Bangladesh. Glob Ment Health (Camb) 2020; 7:e12. [PMID: 32742670 PMCID: PMC7379323 DOI: 10.1017/gmh.2020.5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 03/06/2020] [Accepted: 03/29/2020] [Indexed: 02/06/2023] Open
Abstract
Facebook has transformed social communication and offers the opportunity to share personal thoughts to people including suicide ideas, plans and attempts. Suicide after Facebook posts has been reported in different parts of the world and it has become a potential area of research for suicide prevention. The analysis of Facebook posts prior to suicide or Facebook live streaming may help in understanding the etiological factors, patterns of communication and possible prevention approaches for a particular community. However, there is a dearth of evidence about suicide incidents after Facebook posts and Facebook live streaming in low and middle-income countries. This study aims to explore the trends and phenomena of suicide after Facebook posts and live streaming in Bangladesh. We conducted an online search using the Google, Facebook and five daily online newspaper archives from 15th August to 15th September 2019. Two research assistants independently conducted the initial searching to find out people who committed suicide after Facebook posts or live streamed suicide in Bangladesh and documented 21 cases. After further evaluation of each of the 21 cases we confirmed 19 cases that met the selection criteria. All of them were under 35-years of age. We observed sucide after Facebook posts were more common in male(78%) e and students. Hanging was the most frequently used method of suicide followed by poisoning. Their Facebook posts and livestream videos indicated relationship problems, academic stress and mental disorders were the common stressors for their suicide. This study lays the foundation for the future researchers to work on suicidal posts on Facebook in Bangladesh and develop culture-specific, real-time suicide preventive systems using a social media platform.
Collapse
|
21
|
Gunn Iii JF, Goldstein SE, Lester D. The Impact of Widely Publicized Suicides on Search Trends: Using Google Trends to Test the Werther and Papageno Effects. Arch Suicide Res 2020; 24:142-155. [PMID: 30300114 DOI: 10.1080/13811118.2018.1522284] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
The objective of this study was to examine the impact of widely publicized suicides on the Werther and Papageno Effects using internet search trends. A list of widely publicized suicides from 2010 through 2018 was compiled along with dates of death for each of these individuals. Google.com/trends data were then collected for searches for "how to suicide" and "suicide prevention" for 14 days prior to a widely publicized suicide/14 days after a widely publicized suicide and 7 days prior to a widely publicized suicide/7 days after a widely publicized suicide. Comparisons were then made between these time periods for "how to suicide" and "suicide prevention." Some celebrities, such as Robin Williams (2014) and Aaron Hernandez (2017) were associated with increased searches. However, for many there was no increase in search trends. Limited support was found for the impact of widely publicized suicides on internet search trends with one case supporting a Werther Effect and one case supporting a Papageno Effect. The finding that only some celebrities were associated with increased searches may be a byproduct of the impact of celebrity status on these effects, with more prominent celebrities having the greatest impact.
Collapse
Affiliation(s)
- John F Gunn Iii
- Family Science and Human Development, Montclair State University, Montclair, NJ, USA
| | - Sara E Goldstein
- Family Science and Human Development, Montclair State University, Montclair, NJ, USA
| | - David Lester
- School of Social and Behavioral Sciences, Stockton University, Galloway Township, NJ, USA
| |
Collapse
|
22
|
Sarker A, Belousov M, Friedrichs J, Hakala K, Kiritchenko S, Mehryary F, Han S, Tran T, Rios A, Kavuluru R, de Bruijn B, Ginter F, Mahata D, Mohammad SM, Nenadic G, Gonzalez-Hernandez G. Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task. J Am Med Inform Assoc 2019; 25:1274-1283. [PMID: 30272184 PMCID: PMC6188524 DOI: 10.1093/jamia/ocy114] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 08/02/2018] [Indexed: 12/19/2022] Open
Abstract
Objective We executed the Social Media Mining for Health (SMM4H) 2017 shared tasks to enable the community-driven development and large-scale evaluation of automatic text processing methods for the classification and normalization of health-related text from social media. An additional objective was to publicly release manually annotated data. Materials and Methods We organized 3 independent subtasks: automatic classification of self-reports of 1) adverse drug reactions (ADRs) and 2) medication consumption, from medication-mentioning tweets, and 3) normalization of ADR expressions. Training data consisted of 15 717 annotated tweets for (1), 10 260 for (2), and 6650 ADR phrases and identifiers for (3); and exhibited typical properties of social-media-based health-related texts. Systems were evaluated using 9961, 7513, and 2500 instances for the 3 subtasks, respectively. We evaluated performances of classes of methods and ensembles of system combinations following the shared tasks. Results Among 55 system runs, the best system scores for the 3 subtasks were 0.435 (ADR class F1-score) for subtask-1, 0.693 (micro-averaged F1-score over two classes) for subtask-2, and 88.5% (accuracy) for subtask-3. Ensembles of system combinations obtained best scores of 0.476, 0.702, and 88.7%, outperforming individual systems. Discussion Among individual systems, support vector machines and convolutional neural networks showed high performance. Performance gains achieved by ensembles of system combinations suggest that such strategies may be suitable for operational systems relying on difficult text classification tasks (eg, subtask-1). Conclusions Data imbalance and lack of context remain challenges for natural language processing of social media text. Annotated data from the shared task have been made available as reference standards for future studies (http://dx.doi.org/10.17632/rxwfb3tysd.1).
Collapse
Affiliation(s)
- Abeed Sarker
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Maksim Belousov
- School of Computer Science, University of Manchester, Manchester, UK
| | | | - Kai Hakala
- Turku NLP Group, Department of Future Technologies, University of Turku, Turku, Finland.,The University of Turku Graduate School, University of Turku, Turku, Finland
| | - Svetlana Kiritchenko
- Digital Technologies Research Centre, National Research Council Canada, Ottawa, Canada
| | - Farrokh Mehryary
- Turku NLP Group, Department of Future Technologies, University of Turku, Turku, Finland.,The University of Turku Graduate School, University of Turku, Turku, Finland
| | - Sifei Han
- Department of Computer Science, University of Kentucky, Lexington, Kentucky, USA
| | - Tung Tran
- Department of Computer Science, University of Kentucky, Lexington, Kentucky, USA
| | - Anthony Rios
- Department of Computer Science, University of Kentucky, Lexington, Kentucky, USA
| | - Ramakanth Kavuluru
- Department of Computer Science, University of Kentucky, Lexington, Kentucky, USA.,Division of Biomedical Informatics, Department of Internal Medicine, University of Kentucky, Lexington, Kentucky, USA
| | - Berry de Bruijn
- Digital Technologies Research Centre, National Research Council Canada, Ottawa, Canada
| | - Filip Ginter
- Turku NLP Group, Department of Future Technologies, University of Turku, Turku, Finland
| | | | - Saif M Mohammad
- Digital Technologies Research Centre, National Research Council Canada, Ottawa, Canada
| | - Goran Nenadic
- School of Computer Science, University of Manchester, Manchester, UK
| | - Graciela Gonzalez-Hernandez
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
23
|
Conway M, Hu M, Chapman WW. Recent Advances in Using Natural Language Processing to Address Public Health Research Questions Using Social Media and ConsumerGenerated Data. Yearb Med Inform 2019; 28:208-217. [PMID: 31419834 PMCID: PMC6697505 DOI: 10.1055/s-0039-1677918] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
OBJECTIVE We present a narrative review of recent work on the utilisation of Natural Language Processing (NLP) for the analysis of social media (including online health communities) specifically for public health applications. METHODS We conducted a literature review of NLP research that utilised social media or online consumer-generated text for public health applications, focussing on the years 2016 to 2018. Papers were identified in several ways, including PubMed searches and the inspection of recent conference proceedings from the Association of Computational Linguistics (ACL), the Conference on Human Factors in Computing Systems (CHI), and the International AAAI (Association for the Advancement of Artificial Intelligence) Conference on Web and Social Media (ICWSM). Popular data sources included Twitter, Reddit, various online health communities, and Facebook. RESULTS In the recent past, communicable diseases (e.g., influenza, dengue) have been the focus of much social media-based NLP health research. However, mental health and substance use and abuse (including the use of tobacco, alcohol, marijuana, and opioids) have been the subject of an increasing volume of research in the 2016 - 2018 period. Associated with this trend, the use of lexicon-based methods remains popular given the availability of psychologically validated lexical resources suitable for mental health and substance abuse research. Finally, we found that in the period under review "modern" machine learning methods (i.e. deep neural-network-based methods), while increasing in popularity, remain less widely used than "classical" machine learning methods.
Collapse
Affiliation(s)
- Mike Conway
- Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah, United States
| | - Mengke Hu
- Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah, United States
| | - Wendy W Chapman
- Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah, United States
| |
Collapse
|
24
|
Yin Z, Sulieman LM, Malin BA. A systematic literature review of machine learning in online personal health data. J Am Med Inform Assoc 2019; 26:561-576. [PMID: 30908576 PMCID: PMC7647332 DOI: 10.1093/jamia/ocz009] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2018] [Revised: 01/06/2019] [Accepted: 01/11/2019] [Indexed: 02/07/2023] Open
Abstract
OBJECTIVE User-generated content (UGC) in online environments provides opportunities to learn an individual's health status outside of clinical settings. However, the nature of UGC brings challenges in both data collecting and processing. The purpose of this study is to systematically review the effectiveness of applying machine learning (ML) methodologies to UGC for personal health investigations. MATERIALS AND METHODS We searched PubMed, Web of Science, IEEE Library, ACM library, AAAI library, and the ACL anthology. We focused on research articles that were published in English and in peer-reviewed journals or conference proceedings between 2010 and 2018. Publications that applied ML to UGC with a focus on personal health were identified for further systematic review. RESULTS We identified 103 eligible studies which we summarized with respect to 5 research categories, 3 data collection strategies, 3 gold standard dataset creation methods, and 4 types of features applied in ML models. Popular off-the-shelf ML models were logistic regression (n = 22), support vector machines (n = 18), naive Bayes (n = 17), ensemble learning (n = 12), and deep learning (n = 11). The most investigated problems were mental health (n = 39) and cancer (n = 15). Common health-related aspects extracted from UGC were treatment experience, sentiments and emotions, coping strategies, and social support. CONCLUSIONS The systematic review indicated that ML can be effectively applied to UGC in facilitating the description and inference of personal health. Future research needs to focus on mitigating bias introduced when building study cohorts, creating features from free text, improving clinical creditability of UGC, and model interpretability.
Collapse
Affiliation(s)
- Zhijun Yin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Lina M Sulieman
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Bradley A Malin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, Tennessee, USA
| |
Collapse
|
25
|
Liu X, Liu X, Sun J, Yu NX, Sun B, Li Q, Zhu T. Proactive Suicide Prevention Online (PSPO): Machine Identification and Crisis Management for Chinese Social Media Users With Suicidal Thoughts and Behaviors. J Med Internet Res 2019; 21:e11705. [PMID: 31344675 PMCID: PMC6682269 DOI: 10.2196/11705] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 12/02/2018] [Accepted: 03/30/2019] [Indexed: 12/29/2022] Open
Abstract
Background Suicide is a great public health challenge. Two hundred million people attempt suicide in China annually. Existing suicide prevention programs require the help-seeking initiative of suicidal individuals, but many of them have a low motivation to seek the required help. We propose that a proactive and targeted suicide prevention strategy can prompt more people with suicidal thoughts and behaviors to seek help. Objective The goal of the research was to test the feasibility and acceptability of Proactive Suicide Prevention Online (PSPO), a new approach based on social media that combines proactive identification of suicide-prone individuals with specialized crisis management. Methods We first located a microblog group online. Their comments on a suicide note were analyzed by experts to provide a training set for the machine learning models for suicide identification. The best-performing model was used to automatically identify posts that suggested suicidal thoughts and behaviors. Next, a microblog direct message containing crisis management information, including measures that covered suicide-related issues, depression, help-seeking behavior and an acceptability test, was sent to users who had been identified by the model to be at risk of suicide. For those who replied to the message, trained counselors provided tailored crisis management. The Simplified Chinese Linguistic Inquiry and Word Count was also used to analyze the users’ psycholinguistic texts in 1-month time slots prior to and postconsultation. Results A total of 27,007 comments made in April 2017 were analyzed. Among these, 2786 (10.32%) were classified as indicative of suicidal thoughts and behaviors. The performance of the detection model was good, with high precision (.86), recall (.78), F-measure (.86), and accuracy (.88). Between July 3, 2017, and July 3, 2018, we sent out a total of 24,727 direct messages to 12,486 social media users, and 5542 (44.39%) responded. Over one-third of the users who were contacted completed the questionnaires included in the direct message. Of the valid responses, 89.73% (1259/1403) reported suicidal ideation, but more than half (725/1403, 51.67%) reported that they had not sought help. The 9-Item Patient Health Questionnaire (PHQ-9) mean score was 17.40 (SD 5.98). More than two-thirds of the participants (968/1403, 69.00%) thought the PSPO approach was acceptable. Moreover, 2321 users replied to the direct message. In a comparison of the frequency of word usage in their microblog posts 1-month before and after the consultation, we found that the frequency of death-oriented words significantly declined while the frequency of future-oriented words significantly increased. Conclusions The PSPO model is suitable for identifying populations that are at risk of suicide. When followed up with proactive crisis management, it may be a useful supplement to existing prevention programs because it has the potential to increase the accessibility of antisuicide information to people with suicidal thoughts and behaviors but a low motivation to seek help.
Collapse
Affiliation(s)
- Xingyun Liu
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China.,Department of Psychology, University of Chinese Academy of Sciences, Beijing, China.,Department of Social and Behavioural Sciences, City University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Xiaoqian Liu
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China
| | - Jiumo Sun
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China.,Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Nancy Xiaonan Yu
- Department of Social and Behavioural Sciences, City University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Bingli Sun
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China
| | - Qing Li
- Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China (Hong Kong)
| | - Tingshao Zhu
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
26
|
Tuomchomtam S, Soonthornphisaj N. Community recommendation for text post in social media: A case study on Reddit. INTELL DATA ANAL 2019. [DOI: 10.3233/ida-183861] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
27
|
Nitzburg G, Weber I, Yom-Tov E. Internet Searches for Medical Symptoms Before Seeking Information on 12-Step Addiction Treatment Programs: A Web-Search Log Analysis. J Med Internet Res 2019; 21:e10946. [PMID: 31066685 PMCID: PMC6533047 DOI: 10.2196/10946] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Revised: 11/28/2018] [Accepted: 01/26/2019] [Indexed: 12/12/2022] Open
Abstract
Background Brief intervention is a critical method for identifying patients with problematic substance use in primary care settings and for motivating them to consider treatment options. However, despite considerable evidence of delay discounting in patients with substance use disorders, most brief advice by physicians focuses on the long-term negative medical consequences, which may not be the best way to motivate patients to seek treatment information. Objective Identification of the specific symptoms that most motivate individuals to seek treatment information may offer insights for further improving brief interventions. To this end, we used anonymized internet search engine data to investigate which medical conditions and symptoms preceded searches for 12-step meeting locators and general 12-step information. Methods We extracted all queries made by people in the United States on the Bing search engine from November 2016 to July 2017. These queries were filtered for those who mentioned seeking Alcoholics Anonymous (AA) or Narcotics Anonymous (NA); in addition, queries that contained a medical symptom or condition or a synonym thereof were analyzed. We identified medical symptoms and conditions that predicted searches for seeking treatment at different time lags. Specifically, symptom queries were first determined to be significantly predictive of subsequent 12-step queries if the probability of querying a medical symptom by those who later sought information about the 12-step program exceeded the probability of that same query being made by a comparison group of all other Bing users in the United States. Second, we examined symptom queries preceding queries on the 12-step program at time lags of 0-7 days, 7-14 days, and 14-30 days, where the probability of asking about a medical symptom was greater in the 30-day time window preceding 12-step program information-seeking as compared to all previous times that the symptom was queried. Results In our sample of 11,784 persons, we found 10 medical symptoms that predicted AA information seeking and 9 symptoms that predicted NA information seeking. Of these symptoms, a substantial number could be categorized as nonsevere in nature. Moreover, when medical symptom persistence was examined across a 1-month time period, a substantial number of nonsevere, yet persistent, symptoms were identified. Conclusions Our results suggest that many common or nonsevere medical symptoms and conditions motivate subsequent interest in AA and NA programs. In addition to highlighting severe long-term consequences, brief interventions could be restructured to highlight how increasing substance misuse can worsen discomfort from common medical symptoms in the short term, as well as how these worsening symptoms could exacerbate social embarrassment or decrease physical attractiveness.
Collapse
Affiliation(s)
- George Nitzburg
- Teachers College, Columbia University, New York, NY, United States
| | - Ingmar Weber
- Social Computing Department, Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Elad Yom-Tov
- Microsoft Research, Redmond, WA, United States.,Microsoft Research, Herzeliya, Israel.,Faculty of Industrial Engineering and Management, Technion - Israel Institute of Technology, Haifa, Israel
| |
Collapse
|
28
|
Çakıt E, Karwowski W, Servi L. Application of soft computing techniques for estimating emotional states expressed in Twitter® time series data. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04048-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
29
|
Wang Z, Yu G, Tian X. Exploring Behavior of People with Suicidal Ideation in a Chinese Online Suicidal Community. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2018; 16:ijerph16010054. [PMID: 30587805 PMCID: PMC6339245 DOI: 10.3390/ijerph16010054] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Revised: 12/23/2018] [Accepted: 12/24/2018] [Indexed: 12/19/2022]
Abstract
People with suicidal ideation (PSI) are increasingly using social media to express suicidal feelings. Researchers have found that their internet-based communication may lead to the spread of suicidal ideation, which presents a set of challenges for suicide prevention. To develop effective prevention and intervention strategies that can be efficiently applied in online communities, we need to understand the behavior of PSI in internet-based communities. However, to date there have been no studies that specifically focus on the behavior of PSI in Chinese online communities. A total of 4489 postings in which users explicitly expressed their suicidal ideation were labeled from 560,000 postings in an internet-based suicidal community on Weibo (one of the biggest social media platforms in China) to explore their behavior. The results reveal that PSI are significantly more active than other users in the community. With the use of social network analysis, we also found that the more frequently users communicate with PSI, the more likely that users would become suicidal. In addition, Chinese women may be more likely to be at risk of suicide than men in the community. This study enriches our knowledge of PSI’s behavior in online communities, which may contribute to detecting and assisting PSI on social media.
Collapse
Affiliation(s)
- Zheng Wang
- School of Management, Harbin Institute of Technology, Harbin 150001, China.
| | - Guang Yu
- School of Management, Harbin Institute of Technology, Harbin 150001, China.
| | - Xianyun Tian
- School of Management, Harbin Institute of Technology, Harbin 150001, China.
| |
Collapse
|
30
|
Coppersmith G, Leary R, Crutchley P, Fine A. Natural Language Processing of Social Media as Screening for Suicide Risk. BIOMEDICAL INFORMATICS INSIGHTS 2018; 10:1178222618792860. [PMID: 30158822 PMCID: PMC6111391 DOI: 10.1177/1178222618792860] [Citation(s) in RCA: 112] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 06/20/2018] [Indexed: 12/05/2022]
Abstract
Suicide is among the 10 most common causes of death, as assessed by the World
Health Organization. For every death by suicide, an estimated 138 people’s lives
are meaningfully affected, and almost any other statistic around suicide deaths
is equally alarming. The pervasiveness of social media—and the near-ubiquity of
mobile devices used to access social media networks—offers new types of data for
understanding the behavior of those who (attempt to) take their own lives and
suggests new possibilities for preventive intervention. We demonstrate the
feasibility of using social media data to detect those at risk for suicide.
Specifically, we use natural language processing and machine learning
(specifically deep learning) techniques to detect quantifiable signals around
suicide attempts, and describe designs for an automated system for estimating
suicide risk, usable by those without specialized mental health training (eg, a
primary care doctor). We also discuss the ethical use of such technology and
examine privacy implications. Currently, this technology is only used for
intervention for individuals who have “opted in” for the analysis and
intervention, but the technology enables scalable screening for suicide risk,
potentially identifying many people who are at risk preventively and prior to
any engagement with a health care system. This raises a significant cultural
question about the trade-off between privacy and prevention—we have potentially
life-saving technology that is currently reaching only a fraction of the
possible people at risk because of respect for their privacy. Is the current
trade-off between privacy and prevention the right one?
Collapse
|
31
|
Characterizing negative sentiments in at-risk populations via crowd computing: a computational social science approach. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2018. [DOI: 10.1007/s41060-018-0135-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
32
|
Grant RN, Kucher D, León AM, Gemmell JF, Raicu DS, Fodeh SJ. Automatic extraction of informal topics from online suicidal ideation. BMC Bioinformatics 2018; 19:211. [PMID: 29897319 PMCID: PMC5998765 DOI: 10.1186/s12859-018-2197-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Background Suicide is an alarming public health problem accounting for a considerable number of deaths each year worldwide. Many more individuals contemplate suicide. Understanding the attributes, characteristics, and exposures correlated with suicide remains an urgent and significant problem. As social networking sites have become more common, users have adopted these sites to talk about intensely personal topics, among them their thoughts about suicide. Such data has previously been evaluated by analyzing the language features of social media posts and using factors derived by domain experts to identify at-risk users. Results In this work, we automatically extract informal latent recurring topics of suicidal ideation found in social media posts. Our evaluation demonstrates that we are able to automatically reproduce many of the expertly determined risk factors for suicide. Moreover, we identify many informal latent topics related to suicide ideation such as concerns over health, work, self-image, and financial issues. Conclusions These informal topics topics can be more specific or more general. Some of our topics express meaningful ideas not contained in the risk factors and some risk factors do not have complimentary latent topics. In short, our analysis of the latent topics extracted from social media containing suicidal ideations suggests that users of these systems express ideas that are complementary to the topics defined by experts but differ in their scope, focus, and precision of language.
Collapse
Affiliation(s)
| | | | - Ana M León
- University of Juárez Autónoma de Tabasco, Villahermosa, Tab., 86040, Mexico
| | | | | | - Samah J Fodeh
- Yale Center for Medical Informatics, Yale University, New Haven, CT, USA
| |
Collapse
|
33
|
Park A, Conway M. Tracking Health Related Discussions on Reddit for Public Health Applications. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2018; 2017:1362-1371. [PMID: 29854205 PMCID: PMC5977623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
We use Reddit to demonstrate social media's potential for public health applications. First, we employ a lexicon-based approach to track the prevalence of keywords indicating public interest in Ebola, electronic cigarette, influenza, and marijuana. Second, to better understand the public reactions, we use the Latent Dirichlet Allocation algorithm, to identify either the general themes or motivations for extreme changes in the volume of discussion over time. We observe that discussions related to Ebola and influenza, infectious diseases of public health interests, surged when the first case of Ebola was diagnosed and a new strain of H1N1 influenza virus was confirmed in the United States. We also observed that discussions of a controversial health topic like marijuana increased with the announcement of a major change in United States federal policy. Discussions of electronic cigarette highlighted opportunities for better health education. Lastly, we discuss the implications of our findings for utilizing Reddit data for public health applications.
Collapse
Affiliation(s)
- Albert Park
- Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah
| | - Mike Conway
- Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah
| |
Collapse
|
34
|
Lee KS, Lee H, Myung W, Song GY, Lee K, Kim H, Carroll BJ, Kim DK. Advanced Daily Prediction Model for National Suicide Numbers with Social Media Data. Psychiatry Investig 2018; 15:344-354. [PMID: 29614852 PMCID: PMC5912497 DOI: 10.30773/pi.2017.10.15] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Revised: 08/03/2017] [Accepted: 10/15/2017] [Indexed: 11/27/2022] Open
Abstract
OBJECTIVE Suicide is a significant public health concern worldwide. Social media data have a potential role in identifying high suicide risk individuals and also in predicting suicide rate at the population level. In this study, we report an advanced daily suicide prediction model using social media data combined with economic/meteorological variables along with observed suicide data lagged by 1 week. METHODS The social media data were drawn from weblog posts. We examined a total of 10,035 social media keywords for suicide prediction. We made predictions of national suicide numbers 7 days in advance daily for 2 years, based on a daily moving 5-year prediction modeling period. RESULTS Our model predicted the likely range of daily national suicide numbers with 82.9% accuracy. Among the social media variables, words denoting economic issues and mood status showed high predictive strength. Observed number of suicides one week previously, recent celebrity suicide, and day of week followed by stock index, consumer price index, and sunlight duration 7 days before the target date were notable predictors along with the social media variables. CONCLUSION These results strengthen the case for social media data to supplement classical social/economic/climatic data in forecasting national suicide events.
Collapse
Affiliation(s)
- Kyung Sang Lee
- Department of Psychiatry, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| | - Hyewon Lee
- Department of Public Health Sciences, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea
| | - Woojae Myung
- Department of Neuropsychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| | | | - Kihwang Lee
- The Mining Company, Daumsoft, Seoul, Republic of Korea
| | - Ho Kim
- Department of Public Health Sciences, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea
- Institute of Health and Environment, Seoul National University, Seoul, Republic of Korea
| | - Bernard J. Carroll
- Department of Psychiatry, Emeritus, Duke University Medical Center, Durham, NC, USA
| | - Doh Kwan Kim
- Department of Psychiatry, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| |
Collapse
|
35
|
Fink DS, Santaella-Tenorio J, Keyes KM. Increase in suicides the months after the death of Robin Williams in the US. PLoS One 2018. [PMID: 29415016 DOI: 10.1371/journal.pone.0191405] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Investigating suicides following the death of Robin Williams, a beloved actor and comedian, on August 11th, 2014, we used time-series analysis to estimate the expected number of suicides during the months following Williams' death. Monthly suicide count data in the US (1999-2015) were from the Centers for Disease Control and Prevention Wide-ranging ONline Data for Epidemiologic Research (CDC WONDER). Expected suicides were calculated using a seasonal autoregressive integrated moving averages model to account for both the seasonal patterns and autoregression. Time-series models indicated that we would expect 16,849 suicides from August to December 2014; however, we observed 18,690 suicides in that period, suggesting an excess of 1,841 cases (9.85% increase). Although excess suicides were observed across gender and age groups, males and persons aged 30-44 had the greatest increase in excess suicide events. This study documents associations between Robin Williams' death and suicide deaths in the population thereafter.
Collapse
Affiliation(s)
- David S Fink
- Department of Epidemiology, Columbia University, New York, New York, United States of America
| | | | - Katherine M Keyes
- Department of Epidemiology, Columbia University, New York, New York, United States of America
| |
Collapse
|
36
|
Fink DS, Santaella-Tenorio J, Keyes KM. Increase in suicides the months after the death of Robin Williams in the US. PLoS One 2018; 13:e0191405. [PMID: 29415016 PMCID: PMC5802858 DOI: 10.1371/journal.pone.0191405] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 01/04/2018] [Indexed: 11/18/2022] Open
Abstract
Investigating suicides following the death of Robin Williams, a beloved actor and comedian, on August 11th, 2014, we used time-series analysis to estimate the expected number of suicides during the months following Williams' death. Monthly suicide count data in the US (1999-2015) were from the Centers for Disease Control and Prevention Wide-ranging ONline Data for Epidemiologic Research (CDC WONDER). Expected suicides were calculated using a seasonal autoregressive integrated moving averages model to account for both the seasonal patterns and autoregression. Time-series models indicated that we would expect 16,849 suicides from August to December 2014; however, we observed 18,690 suicides in that period, suggesting an excess of 1,841 cases (9.85% increase). Although excess suicides were observed across gender and age groups, males and persons aged 30-44 had the greatest increase in excess suicide events. This study documents associations between Robin Williams' death and suicide deaths in the population thereafter.
Collapse
Affiliation(s)
- David S. Fink
- Department of Epidemiology, Columbia University, New York, New York, United States of America
- * E-mail:
| | | | - Katherine M. Keyes
- Department of Epidemiology, Columbia University, New York, New York, United States of America
| |
Collapse
|
37
|
Latent sentiment topic modelling and nonparametric discovery of online mental health-related communities. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2017. [DOI: 10.1007/s41060-017-0073-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
38
|
Park A, Conway M, Chen AT. Examining Thematic Similarity, Difference, and Membership in Three Online Mental Health Communities from Reddit: A Text Mining and Visualization Approach. COMPUTERS IN HUMAN BEHAVIOR 2017; 78:98-112. [PMID: 29456286 DOI: 10.1016/j.chb.2017.09.001] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Objectives Social media, including online health communities, have become popular platforms for individuals to discuss health challenges and exchange social support with others. These platforms can provide support for individuals who are concerned about social stigma and discrimination associated with their illness. Although mental health conditions can share similar symptoms and even co-occur, the extent to which discussion topics in online mental health communities are similar, different, or overlapping is unknown. Discovering the topical similarities and differences could potentially inform the design of related mental health communities and patient education programs. This study employs text mining, qualitative analysis, and visualization techniques to compare discussion topics in publicly accessible online mental health communities for three conditions: Anxiety, Depression and Post-Traumatic Stress Disorder. Methods First, online discussion content for the three conditions was collected from three Reddit communities (r/Anxiety, r/Depression, and r/PTSD). Second, content was pre-processed, and then clustered using the k-means algorithm to identify themes that were commonly discussed by members. Third, we qualitatively examined the common themes to better understand them, as well as their similarities and differences. Fourth, we employed multiple visualization techniques to form a deeper understanding of the relationships among the identified themes for the three mental health conditions. Results The three mental health communities shared four themes: sharing of positive emotion, gratitude for receiving emotional support, and sleep- and work-related issues. Depression clusters tended to focus on self-expressed contextual aspects of depression, whereas the Anxiety Disorders and Post-Traumatic Stress Disorder clusters addressed more treatment- and medication-related issues. Visualizations showed that discussion topics from the Anxiety Disorders and Post-Traumatic Stress Disorder subreddits shared more similarities to one another than to the depression subreddit. Conclusions We observed that the members of the three communities shared several overlapping concerns (i.e., sleep- and work-related problems) and discussion patterns (i.e., sharing of positive emotion and showing gratitude for receiving emotional support). We also highlighted that the discussions from the r/Anxiety and r/PTSD communities were more similar to one another than to discussions from the r/Depression community. The r/Anxiety and r/PTSD subreddit members are more likely to be individuals whose experiences with a condition are long-term, and who are interested in treatments and medications. The r/Depression subreddit members may be a comparatively diffuse group, many of whom are dealing with transient issues that cause depressed mood. The findings from this study could be used to inform the design of online mental health communities and patient education programs for these conditions. Moreover, we suggest that researchers employ multiple methods to fully understand the subtle differences when comparing similar discussions from online health communities.
Collapse
Affiliation(s)
- Albert Park
- Department of Biomedical Informatics, School of Medicine University of Utah 421 Wakara Way Ste 140, Salt Lake City, UT 84108-3514, USA
| | - Mike Conway
- Department of Biomedical Informatics, School of Medicine University of Utah 421 Wakara Way Ste 140, Salt Lake City, UT 84108-3514, USA
| | - Annie T Chen
- Department of Biomedical Informatics and Medical Education, School of Medicine University of Washington Box SLU-BIME 358047, 850 Republican St, Building C, Seattle, WA 98109-4714, USA
| |
Collapse
|
39
|
Burnap P, Colombo G, Amery R, Hodorog A, Scourfield J. Multi-class machine classification of suicide-related communication on Twitter. ACTA ACUST UNITED AC 2017; 2:32-44. [PMID: 29278258 PMCID: PMC5732584 DOI: 10.1016/j.osnem.2017.08.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Revised: 08/04/2017] [Accepted: 08/08/2017] [Indexed: 01/03/2023]
Abstract
The World Wide Web, and online social networks in particular, have increased connectivity between people such that information can spread to millions of people in a matter of minutes. This form of online collective contagion has provided many benefits to society, such as providing reassurance and emergency management in the immediate aftermath of natural disasters. However, it also poses a potential risk to vulnerable Web users who receive this information and could subsequently come to harm. One example of this would be the spread of suicidal ideation in online social networks, about which concerns have been raised. In this paper we report the results of a number of machine classifiers built with the aim of classifying text relating to suicide on Twitter. The classifier distinguishes between the more worrying content, such as suicidal ideation, and other suicide-related topics such as reporting of a suicide, memorial, campaigning and support. It also aims to identify flippant references to suicide. We built a set of baseline classifiers using lexical, structural, emotive and psychological features extracted from Twitter posts. We then improved on the baseline classifiers by building an ensemble classifier using the Rotation Forest algorithm and a Maximum Probability voting classification decision method, based on the outcome of base classifiers. This achieved an F-measure of 0.728 overall (for 7 classes, including suicidal ideation) and 0.69 for the suicidal ideation class. We summarise the results by reflecting on the most significant predictive principle components of the suicidal ideation class to provide insight into the language used on Twitter to express suicidal ideation. Finally, we perform a 12-month case study of suicide-related posts where we further evaluate the classification approach - showing a sustained classification performance and providing anonymous insights into the trends and demographic profile of Twitter users posting content of this type.
Collapse
Affiliation(s)
- Pete Burnap
- School of Computer Science & Informatics, Cardiff University, UK
| | | | | | - Andrei Hodorog
- School of Computer Science & Informatics, Cardiff University, UK
| | | |
Collapse
|
40
|
Gonzalez-Hernandez G, Sarker A, O’Connor K, Savova G. Capturing the Patient's Perspective: a Review of Advances in Natural Language Processing of Health-Related Text. Yearb Med Inform 2017; 26:214-227. [PMID: 29063568 PMCID: PMC6250990 DOI: 10.15265/iy-2017-029] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Background: Natural Language Processing (NLP) methods are increasingly being utilized to mine knowledge from unstructured health-related texts. Recent advances in noisy text processing techniques are enabling researchers and medical domain experts to go beyond the information encapsulated in published texts (e.g., clinical trials and systematic reviews) and structured questionnaires, and obtain perspectives from other unstructured sources such as Electronic Health Records (EHRs) and social media posts. Objectives: To review the recently published literature discussing the application of NLP techniques for mining health-related information from EHRs and social media posts. Methods: Literature review included the research published over the last five years based on searches of PubMed, conference proceedings, and the ACM Digital Library, as well as on relevant publications referenced in papers. We particularly focused on the techniques employed on EHRs and social media data. Results: A set of 62 studies involving EHRs and 87 studies involving social media matched our criteria and were included in this paper. We present the purposes of these studies, outline the key NLP contributions, and discuss the general trends observed in the field, the current state of research, and important outstanding problems. Conclusions: Over the recent years, there has been a continuing transition from lexical and rule-based systems to learning-based approaches, because of the growth of annotated data sets and advances in data science. For EHRs, publicly available annotated data is still scarce and this acts as an obstacle to research progress. On the contrary, research on social media mining has seen a rapid growth, particularly because the large amount of unlabeled data available via this resource compensates for the uncertainty inherent to the data. Effective mechanisms to filter out noise and for mapping social media expressions to standard medical concepts are crucial and latent research problems. Shared tasks and other competitive challenges have been driving factors behind the implementation of open systems, and they are likely to play an imperative role in the development of future systems.
Collapse
Affiliation(s)
- G. Gonzalez-Hernandez
- Department of Epidemiology, Biostatistics, and Informatics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - A. Sarker
- Department of Epidemiology, Biostatistics, and Informatics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - K. O’Connor
- Department of Epidemiology, Biostatistics, and Informatics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - G. Savova
- Boston Children’s Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
41
|
Cheng Q, Li TM, Kwok CL, Zhu T, Yip PS. Assessing Suicide Risk and Emotional Distress in Chinese Social Media: A Text Mining and Machine Learning Study. J Med Internet Res 2017; 19:e243. [PMID: 28694239 PMCID: PMC5525005 DOI: 10.2196/jmir.7276] [Citation(s) in RCA: 82] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Revised: 04/07/2017] [Accepted: 04/24/2017] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Early identification and intervention are imperative for suicide prevention. However, at-risk people often neither seek help nor take professional assessment. A tool to automatically assess their risk levels in natural settings can increase the opportunity for early intervention. OBJECTIVE The aim of this study was to explore whether computerized language analysis methods can be utilized to assess one's suicide risk and emotional distress in Chinese social media. METHODS A Web-based survey of Chinese social media (ie, Weibo) users was conducted to measure their suicide risk factors including suicide probability, Weibo suicide communication (WSC), depression, anxiety, and stress levels. Participants' Weibo posts published in the public domain were also downloaded with their consent. The Weibo posts were parsed and fitted into Simplified Chinese-Linguistic Inquiry and Word Count (SC-LIWC) categories. The associations between SC-LIWC features and the 5 suicide risk factors were examined by logistic regression. Furthermore, the support vector machine (SVM) model was applied based on the language features to automatically classify whether a Weibo user exhibited any of the 5 risk factors. RESULTS A total of 974 Weibo users participated in the survey. Those with high suicide probability were marked by a higher usage of pronoun (odds ratio, OR=1.18, P=.001), prepend words (OR=1.49, P=.02), multifunction words (OR=1.12, P=.04), a lower usage of verb (OR=0.78, P<.001), and a greater total word count (OR=1.007, P=.008). Second-person plural was positively associated with severe depression (OR=8.36, P=.01) and stress (OR=11, P=.005), whereas work-related words were negatively associated with WSC (OR=0.71, P=.008), severe depression (OR=0.56, P=.005), and anxiety (OR=0.77, P=.02). Inconsistently, third-person plural was found to be negatively associated with WSC (OR=0.02, P=.047) but positively with severe stress (OR=41.3, P=.04). Achievement-related words were positively associated with depression (OR=1.68, P=.003), whereas health- (OR=2.36, P=.004) and death-related (OR=2.60, P=.01) words positively associated with stress. The machine classifiers did not achieve satisfying performance in the full sample set but could classify high suicide probability (area under the curve, AUC=0.61, P=.04) and severe anxiety (AUC=0.75, P<.001) among those who have exhibited WSC. CONCLUSIONS SC-LIWC is useful to examine language markers of suicide risk and emotional distress in Chinese social media and can identify characteristics different from previous findings in the English literature. Some findings are leading to new hypotheses for future verification. Machine classifiers based on SC-LIWC features are promising but still require further optimization for application in real life.
Collapse
Affiliation(s)
- Qijin Cheng
- HKJC Center for Suicide Research and Prevention, The University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Tim Mh Li
- Department of Paediatrics & Adolescent Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Chi-Leung Kwok
- HKJC Center for Suicide Research and Prevention, The University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Tingshao Zhu
- Institute of Psychology & Insititute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Paul Sf Yip
- HKJC Center for Suicide Research and Prevention, The University of Hong Kong, Hong Kong, China (Hong Kong)
| |
Collapse
|
42
|
Bagroy S, Kumaraguru P, De Choudhury M. A Social Media Based Index of Mental Well-Being in College Campuses. PROCEEDINGS OF THE SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS. CHI CONFERENCE 2017; 2017:1634-1646. [PMID: 28840202 DOI: 10.1145/3025453.3025909] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Psychological distress in the form of depression, anxiety and other mental health challenges among college students is a growing health concern. Dearth of accurate, continuous, and multi-campus data on mental well-being presents significant challenges to intervention and mitigation efforts in college campuses. We examine the potential of social media as a new "barometer" for quantifying the mental well-being of college populations. Utilizing student-contributed data in Reddit communities of over 100 universities, we first build and evaluate a transfer learning based classification approach that can detect mental health expressions with 97% accuracy. Thereafter, we propose a robust campus-specific Mental Well-being Index: MWI. We find that MWI is able to reveal meaningful temporal patterns of mental well-being in campuses, and to assess how their expressions relate to university attributes like size, academic prestige, and student demographics. We discuss the implications of our work for improving counselor efforts, and in the design of tools that can enable better assessment of the mental health climate of college campuses.
Collapse
|
43
|
Park A, Conway M. Longitudinal Changes in Psychological States in Online Health Community Members: Understanding the Long-Term Effects of Participating in an Online Depression Community. J Med Internet Res 2017; 19:e71. [PMID: 28320692 PMCID: PMC5379019 DOI: 10.2196/jmir.6826] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2016] [Revised: 01/03/2017] [Accepted: 02/08/2017] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Major depression is a serious challenge at both the individual and population levels. Although online health communities have shown the potential to reduce the symptoms of depression, emotional contagion theory suggests that negative emotion can spread within a community, and prolonged interactions with other depressed individuals has potential to worsen the symptoms of depression. OBJECTIVE The goals of our study were to investigate longitudinal changes in psychological states that are manifested through linguistic changes in depression community members who are interacting with other depressed individuals. METHODS We examined emotion-related language usages using the Linguistic Inquiry and Word Count (LIWC) program for each member of a depression community from Reddit. To measure the changes, we applied linear least-squares regression to the LIWC scores against the interaction sequence for each member. We measured the differences in linguistic changes against three online health communities focusing on positive emotion, diabetes, and irritable bowel syndrome. RESULTS On average, members of an online depression community showed improvement in 9 of 10 prespecified linguistic dimensions: "positive emotion," "negative emotion," "anxiety," "anger," "sadness," "first person singular," "negation," "swear words," and "death." Moreover, these members improved either significantly or at least as much as members of other online health communities. CONCLUSIONS We provide new insights into the impact of prolonged participation in an online depression community and highlight the positive emotion change in members. The findings of this study should be interpreted with caution, because participating in an online depression community is not the sole factor for improvement or worsening of depressive symptoms. Still, the consistent statistical results including comparative analyses with different communities could indicate that the emotion-related language usage of depression community members are improving either significantly or at least as much as members of other online communities. On the basis of these findings, we contribute practical suggestions for designing online depression communities to enhance psychosocial benefit gains for members. We consider these results to be an important step toward a better understanding of the impact of prolonged participation in an online depression community, in addition to providing insights into the long-term psychosocial well-being of members.
Collapse
Affiliation(s)
- Albert Park
- Department of Biomedical Informatics, School of Medicine, University of Utah, Salt Lake City, UT, United States
| | - Mike Conway
- Department of Biomedical Informatics, School of Medicine, University of Utah, Salt Lake City, UT, United States
| |
Collapse
|
44
|
Braithwaite SR, Giraud-Carrier C, West J, Barnes MD, Hanson CL. Validating Machine Learning Algorithms for Twitter Data Against Established Measures of Suicidality. JMIR Ment Health 2016; 3:e21. [PMID: 27185366 PMCID: PMC4886102 DOI: 10.2196/mental.4822] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Revised: 02/04/2016] [Accepted: 02/25/2016] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND One of the leading causes of death in the United States (US) is suicide and new methods of assessment are needed to track its risk in real time. OBJECTIVE Our objective is to validate the use of machine learning algorithms for Twitter data against empirically validated measures of suicidality in the US population. METHODS Using a machine learning algorithm, the Twitter feeds of 135 Mechanical Turk (MTurk) participants were compared with validated, self-report measures of suicide risk. RESULTS Our findings show that people who are at high suicidal risk can be easily differentiated from those who are not by machine learning algorithms, which accurately identify the clinically significant suicidal rate in 92% of cases (sensitivity: 53%, specificity: 97%, positive predictive value: 75%, negative predictive value: 93%). CONCLUSIONS Machine learning algorithms are efficient in differentiating people who are at a suicidal risk from those who are not. Evidence for suicidality can be measured in nonclinical populations using social media data.
Collapse
Affiliation(s)
- Scott R Braithwaite
- Computational Health Science Research Group, Department of Psychology, Brigham Young University, Provo, UT, United States
| | | | | | | | | |
Collapse
|
45
|
De Choudhury M, Kiciman E, Dredze M, Coppersmith G, Kumar M. Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media. PROCEEDINGS OF THE SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS. CHI CONFERENCE 2016; 2016:2098-2110. [PMID: 29082385 PMCID: PMC5659860 DOI: 10.1145/2858036.2858207] [Citation(s) in RCA: 156] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
History of mental illness is a major factor behind suicide risk and ideation. However research efforts toward characterizing and forecasting this risk is limited due to the paucity of information regarding suicide ideation, exacerbated by the stigma of mental illness. This paper fills gaps in the literature by developing a statistical methodology to infer which individuals could undergo transitions from mental health discourse to suicidal ideation. We utilize semi-anonymous support communities on Reddit as unobtrusive data sources to infer the likelihood of these shifts. We develop language and interactional measures for this purpose, as well as a propensity score matching based statistical approach. Our approach allows us to derive distinct markers of shifts to suicidal ideation. These markers can be modeled in a prediction framework to identify individuals likely to engage in suicidal ideation in the future. We discuss societal and ethical implications of this research.
Collapse
Affiliation(s)
| | | | - Mark Dredze
- Johns Hopkins University, Baltimore MD 21218
| | | | | |
Collapse
|