1
|
Carpenter KA, Nguyen AT, Smith DA, Samori IA, Humphreys K, Lembke A, Kiang MV, Eichstaedt JC, Altman RB. Which social media platforms facilitate monitoring the opioid crisis? MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.07.06.24310035. [PMID: 39006412 PMCID: PMC11245080 DOI: 10.1101/2024.07.06.24310035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Social media can provide real-time insight into trends in substance use, addiction, and recovery. Prior studies have used platforms such as Reddit and X (formerly Twitter), but evolving policies around data access have threatened these platforms' usability in research. We evaluate the potential of a broad set of platforms to detect emerging trends in the opioid epidemic. From these, we created a shortlist of 11 platforms, for which we documented official policies regulating drug-related discussion, data accessibility, geolocatability, and prior use in opioid-related studies. We quantified their volumes of opioid discussion, capturing informal language by including slang generated using a large language model. Beyond the most commonly used Reddit and X, the platforms with high potential for use in opioid-related surveillance are TikTok, YouTube, and Facebook. Leveraging many different social platforms, instead of a single platform, safeguards against sudden changes to data access and may better capture all populations that use opioids than any single platform.
Collapse
|
2
|
Almeida A, Patton T, Conway M, Gupta A, Strathdee SA, Bórquez A. The Use of Natural Language Processing Methods in Reddit to Investigate Opioid Use: Scoping Review. JMIR INFODEMIOLOGY 2024; 4:e51156. [PMID: 39269743 PMCID: PMC11437337 DOI: 10.2196/51156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Revised: 06/01/2024] [Accepted: 06/18/2024] [Indexed: 09/15/2024]
Abstract
BACKGROUND The growing availability of big data spontaneously generated by social media platforms allows us to leverage natural language processing (NLP) methods as valuable tools to understand the opioid crisis. OBJECTIVE We aimed to understand how NLP has been applied to Reddit (Reddit Inc) data to study opioid use. METHODS We systematically searched for peer-reviewed studies and conference abstracts in PubMed, Scopus, PsycINFO, ACL Anthology, IEEE Xplore, and Association for Computing Machinery data repositories up to July 19, 2022. Inclusion criteria were studies investigating opioid use, using NLP techniques to analyze the textual corpora, and using Reddit as the social media data source. We were specifically interested in mapping studies' overarching goals and findings, methodologies and software used, and main limitations. RESULTS In total, 30 studies were included, which were classified into 4 nonmutually exclusive overarching goal categories: methodological (n=6, 20% studies), infodemiology (n=22, 73% studies), infoveillance (n=7, 23% studies), and pharmacovigilance (n=3, 10% studies). NLP methods were used to identify content relevant to opioid use among vast quantities of textual data, to establish potential relationships between opioid use patterns or profiles and contextual factors or comorbidities, and to anticipate individuals' transitions between different opioid-related subreddits, likely revealing progression through opioid use stages. Most studies used an embedding technique (12/30, 40%), prediction or classification approach (12/30, 40%), topic modeling (9/30, 30%), and sentiment analysis (6/30, 20%). The most frequently used programming languages were Python (20/30, 67%) and R (2/30, 7%). Among the studies that reported limitations (20/30, 67%), the most cited was the uncertainty regarding whether redditors participating in these forums were representative of people who use opioids (8/20, 40%). The papers were very recent (28/30, 93%), from 2019 to 2022, with authors from a range of disciplines. CONCLUSIONS This scoping review identified a wide variety of NLP techniques and applications used to support surveillance and social media interventions addressing the opioid crisis. Despite the clear potential of these methods to enable the identification of opioid-relevant content in Reddit and its analysis, there are limits to the degree of interpretive meaning that they can provide. Moreover, we identified the need for standardized ethical guidelines to govern the use of Reddit data to safeguard the anonymity and privacy of people using these forums.
Collapse
Affiliation(s)
- Alexandra Almeida
- Scientific Computing Program, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
- San Diego State University, School of Social Work, San Diego, CA, United States
- Department of Medicine, University of California San Diego, San Diego, CA, United States
| | - Thomas Patton
- Department of Medicine, University of California San Diego, San Diego, CA, United States
| | - Mike Conway
- School of Computing and Information Systems, The University of Melbourne, Melbourne, Australia
| | - Amarnath Gupta
- San Diego Supercomputer Center, University of California San Diego, San Diego, CA, United States
| | - Steffanie A Strathdee
- Department of Medicine, University of California San Diego, San Diego, CA, United States
| | - Annick Bórquez
- Department of Medicine, University of California San Diego, San Diego, CA, United States
| |
Collapse
|
3
|
Lokala U, Phukan OC, Dastidar TG, Lamy F, Daniulaityte R, Sheth A. Detecting Substance Use Disorder Using Social Media Data and the Dark Web: Time- and Knowledge-Aware Study. JMIRX MED 2024; 5:e48519. [PMID: 38717384 PMCID: PMC11084118 DOI: 10.2196/48519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 02/16/2024] [Accepted: 02/21/2024] [Indexed: 07/10/2024]
Abstract
Background Opioid and substance misuse has become a widespread problem in the United States, leading to the "opioid crisis." The relationship between substance misuse and mental health has been extensively studied, with one possible relationship being that substance misuse causes poor mental health. However, the lack of evidence on the relationship has resulted in opioids being largely inaccessible through legal means. objectives This study aims to analyze social media posts related to substance use and opioids being sold through cryptomarket listings. The study aims to use state-of-the-art deep learning models to generate sentiment and emotion from social media posts to understand users' perceptions of social media. The study also aims to investigate questions such as which synthetic opioids people are optimistic, neutral, or negative about; what kind of drugs induced fear and sorrow; what kind of drugs people love or are thankful about; which drugs people think negatively about; and which opioids cause little to no sentimental reaction. Methods The study used the drug abuse ontology and state-of-the-art deep learning models, including knowledge-aware Bidirectional Encoder Representations From Transformers-based models, to generate sentiment and emotion from social media posts related to substance use and opioids being sold through cryptomarket listings. The study crawled cryptomarket data and extracted posts for fentanyl, fentanyl analogs, and other novel synthetic opioids. The study performed topic analysis associated with the generated sentiments and emotions to understand which topics correlate with people's responses to various drugs. Additionally, the study analyzed time-aware neural models built on these features while considering historical sentiment and emotional activity of posts related to a drug. Results The study found that the most effective model performed well (statistically significant, with a macro-F1-score of 82.12 and recall of 83.58) in identifying substance use disorder. The study also found that there were varying levels of sentiment and emotion associated with different synthetic opioids, with some drugs eliciting more positive or negative responses than others. The study identified topics that correlated with people's responses to various drugs, such as pain relief, addiction, and withdrawal symptoms. Conclusions The study provides insight into users' perceptions of synthetic opioids based on sentiment and emotion expressed in social media posts. The study's findings can be used to inform interventions and policies aimed at reducing substance misuse and addressing the opioid crisis. The study demonstrates the potential of deep learning models for analyzing social media data to gain insights into public health issues.
Collapse
Affiliation(s)
- Usha Lokala
- Department of Computer Science and Computer Engineering, Artificial Intelligence Institute, University of South Carolina, Columbia, SC, United States
| | - Orchid Chetia Phukan
- Department of Computer Science and Engineering, Indraprastha Institute of Information Technology, Delhi, India
| | - Triyasha Ghosh Dastidar
- Department of Computer Science and Engineering, Birla Institute of Technology & Science Pilani, Hyderabad, India
| | - Francois Lamy
- Department of Society and Health, Mahildol University, Salaya, Thailand
| | - Raminta Daniulaityte
- College of Health Solutions, Institute for Social Science Research, Arizona State University, Phoneix, AZ, United States
| | - Amit Sheth
- Department of Computer Science and Computer Engineering, Artificial Intelligence Institute, University of South Carolina, Columbia, SC, United States
| |
Collapse
|
4
|
Eschliman EL, Choe K, DeLucia A, Addison E, Jackson VW, Murray SM, German D, Genberg BL, Kaufman MR. First-hand accounts of structural stigma toward people who use opioids on Reddit. Soc Sci Med 2024; 347:116772. [PMID: 38502980 PMCID: PMC11031276 DOI: 10.1016/j.socscimed.2024.116772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 03/06/2024] [Accepted: 03/08/2024] [Indexed: 03/21/2024]
Abstract
People who use opioids face multilevel stigma that negatively affects their health and well-being and drives opioid-related overdose. Little research has focused on lived experience of the structural levels of stigma toward opioid use. This study identified and qualitatively analyzed Reddit content about structural stigma toward opioid use. Iterative, human-in-the-loop natural language processing methods were used to identify relevant posts and comments from an opioid-related subforum. Ultimately, 273 posts and comments were qualitatively analyzed via directed content analysis guided by a prominent conceptualization of stigma. Redditors described how structures-including governmental programs and policies, the pharmaceutical industry, and healthcare systems-stigmatize people who use opioids. Structures were reported to stigmatize through labeling (i.e., particularly in medical settings), perpetuating negative stereotypes, separating people who use opioids into those who use opioids "legitimately" versus "illegitimately," and engendering status loss and discrimination (e.g., denial of healthcare, loss of employment). Redditors also posted robust formulations of structural stigma, mostly describing how it manifests in the criminalization of substance use, is often driven by profit motive, and leads to the pervasiveness of fentanyl in the drug supply and the current state of the overdose crisis. Some posts and comments highlighted interpersonal and structural resources (e.g., other people who use opioids, harm reduction programs, telemedicine) leveraged to navigate structural stigma and its effects. These findings reveal key ways by which structural stigma can pervade the lives of people who use opioids and show the value of social media data for investigating complex social processes. Particularly, this study's findings related to structural separation may help encourage efforts to promote solidarity among people who use opioids. Attending to first-hand accounts of structural stigma can help interventions aiming to reduce opioid-related stigma be more responsive to these stigmatizing structural forces and their felt effects.
Collapse
Affiliation(s)
- Evan L Eschliman
- Department of Epidemiology, Columbia University Mailman School of Public Health, USA; Department of Health, Behavior and Society, Johns Hopkins Bloomberg School of Public Health, USA.
| | - Karen Choe
- Department of Social and Behavioral Sciences, School of Global Public Health, New York University, USA
| | - Alexandra DeLucia
- Center for Language and Speech Processing, Johns Hopkins University, USA
| | | | - Valerie W Jackson
- Department of Anesthesia and Perioperative Care, University of California, San Francisco, USA
| | - Sarah M Murray
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, USA
| | - Danielle German
- Department of Health, Behavior and Society, Johns Hopkins Bloomberg School of Public Health, USA
| | - Becky L Genberg
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, USA
| | - Michelle R Kaufman
- Department of Health, Behavior and Society, Johns Hopkins Bloomberg School of Public Health, USA; Department of International Health, Johns Hopkins Bloomberg School of Public Health, USA
| |
Collapse
|
5
|
Chi Y, Chen HY. Investigating Substance Use via Reddit: Systematic Scoping Review. J Med Internet Res 2023; 25:e48905. [PMID: 37878361 PMCID: PMC10637357 DOI: 10.2196/48905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 08/15/2023] [Accepted: 09/13/2023] [Indexed: 10/26/2023] Open
Abstract
BACKGROUND Reddit's (Reddit Inc) large user base, diverse communities, and anonymity make it a useful platform for substance use research. Despite a growing body of literature on substance use on Reddit, challenges and limitations must be carefully considered. However, no systematic scoping review has been conducted on the use of Reddit as a data source for substance use research. OBJECTIVE This review aims to investigate the use of Reddit for studying substance use by examining previous studies' objectives, reasons, limitations, and methods for using Reddit. In addition, we discuss the implications and contributions of previous studies and identify gaps in the literature that require further attention. METHODS A total of 7 databases were searched using keyword combinations including Reddit and substance-related keywords in April 2022. The initial search resulted in 456 articles, and 227 articles remained after removing duplicates. All included studies were peer reviewed, empirical, available in full text, and pertinent to Reddit and substance use, and they were all written in English. After screening, 60 articles met the eligibility criteria for the review, with 57 articles identified from the initial database search and 3 from the ancestry search. A codebook was developed, and qualitative content analysis was performed to extract relevant evidence related to the research questions. RESULTS The use of Reddit for studying substance use has grown steadily since 2015, with a sharp increase in 2021. The primary objective was to identify tendencies and patterns in various types of substance use discussions (52/60, 87%). Reddit was also used to explore unique user experiences, propose methodologies, investigate user interactions, and develop interventions. A total of 9 reasons for using Reddit to study substance use were identified, such as the platform's anonymity, its widespread popularity, and the explicit topics of subreddits. However, 7 limitations were noted, including the platform's low representativeness of the general population with substance use and the lack of demographic information. Most studies use application programming interfaces for data collection and quantitative approaches for analysis, with few using qualitative approaches. Machine learning algorithms are commonly used for natural language processing tasks. The theoretical, methodological, and practical implications and contributions of the included articles are summarized and discussed. The most prevalent practical implications are investigating prevailing topics in Reddit discussions, providing recommendations for clinical practices and policies, and comparing Reddit discussions on substance use across various sources. CONCLUSIONS This systematic scoping review provides an overview of Reddit's use as a data source for substance use research. Although the limitations of Reddit data must be considered, analyzing them can be useful for understanding patterns and user experiences related to substance use. Our review also highlights gaps in the literature and suggests avenues for future research.
Collapse
Affiliation(s)
- Yu Chi
- School of Information Science, University of Kentucky, Lexington, KY, United States
| | - Huai-Yu Chen
- Department of Communication, University of Kentucky, Lexington, KY, United States
| |
Collapse
|
6
|
Grouin C, Grabar N. Year 2022 in Medical Natural Language Processing: Availability of Language Models as a Step in the Democratization of NLP in the Biomedical Area. Yearb Med Inform 2023; 32:244-252. [PMID: 38147866 PMCID: PMC10751107 DOI: 10.1055/s-0043-1768752] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023] Open
Abstract
OBJECTIVES To analyse the content of publications within the medical Natural Language Processing (NLP) domain in 2022. METHODS Automatic and manual preselection of publications to be reviewed, and selection of the best NLP papers of the year. Analysis of the important issues. RESULTS Three best papers have been selected. We also propose an analysis of the content of the NLP publications in 2022, stressing on some of the topics. CONCLUSION The main trend in 2022 is certainly related to the availability of large language models, especially those based on Transformers, and to their use by non-NLP researchers. This leads to the democratization of the NLP methods. We also observe the renewal of interest to languages other than English, the continuation of research on information extraction and prediction, the massive use of data from social media, and the consideration of needs and interests of patients.
Collapse
Affiliation(s)
- Cyril Grouin
- Université Paris Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, 91400 Orsay, France
| | - Natalia Grabar
- UMR8163 STL, CNRS, Université de Lille, Domaine du Pont-de-bois, 59653 Villeneuve-d'Ascq cedex, France
| | | |
Collapse
|
7
|
Bremer W, Plaisance K, Walker D, Bonn M, Love JS, Perrone J, Sarker A. Barriers to opioid use disorder treatment: A comparison of self-reported information from social media with barriers found in literature. Front Public Health 2023; 11:1141093. [PMID: 37151596 PMCID: PMC10158842 DOI: 10.3389/fpubh.2023.1141093] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Accepted: 03/21/2023] [Indexed: 05/09/2023] Open
Abstract
Introduction Medications such as buprenorphine and methadone are effective for treating opioid use disorder (OUD), but many patients face barriers related to treatment and access. We analyzed two sources of data-social media and published literature-to categorize and quantify such barriers. Methods In this mixed methods study, we analyzed social media (Reddit) posts from three OUD-related forums (subreddits): r/suboxone, r/Methadone, and r/naltrexone. We applied natural language processing to identify posts relevant to treatment barriers, categorized them into insurance- and non-insurance-related, and manually subcategorized them into fine-grained topics. For comparison, we used substance use-, OUD- and barrier-related keywords to identify relevant articles from PubMed published between 2006 and 2022. We searched publications for language expressing fear of barriers, and hesitation or disinterest in medication treatment because of barriers, paying particular attention to the affected population groups described. Results On social media, the top three insurance-related barriers included having no insurance (22.5%), insurance not covering OUD treatment (24.7%), and general difficulties of using insurance for OUD treatment (38.2%); while the top two non-insurance-related barriers included stigma (47.6%), and financial difficulties (26.2%). For published literature, stigma was the most prominently reported barrier, occurring in 78.9% of the publications reviewed, followed by financial and/or logistical issues to receiving medication treatment (73.7%), gender-specific barriers (36.8%), and fear (31.5%). Conclusion The stigma associated with OUD and/or seeking treatment and insurance/cost are the two most common types of barriers reported in the two sources combined. Harm reduction efforts addressing barriers to recovery may benefit from leveraging multiple data sources.
Collapse
Affiliation(s)
- Whitney Bremer
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA, United States
- Department of Biomedical Informatics, School of Medicine, College of Engineering and Applied Sciences, Stony Brook University, Stony Brook, NY, United States
| | - Karma Plaisance
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, United States
| | - Drew Walker
- Department of Behavioral, Social and Health Education Sciences, Rollins School of Public Health, Emory University, Atlanta, GA, United States
| | - Matthew Bonn
- Canadian Association of People Who Use Drugs, Dartmouth, NS, Canada
| | - Jennifer S. Love
- Department of Emergency Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Jeanmarie Perrone
- Department of Emergency Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Abeed Sarker
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA, United States
| |
Collapse
|
8
|
Krawczyk N, Rivera BD, Levin E, Dooling BCE. Synthesising evidence of the effects of COVID-19 regulatory changes on methadone treatment for opioid use disorder: implications for policy. Lancet Public Health 2023; 8:e238-e246. [PMID: 36841564 PMCID: PMC9949855 DOI: 10.1016/s2468-2667(23)00023-3] [Citation(s) in RCA: 47] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 01/16/2023] [Accepted: 01/17/2023] [Indexed: 02/25/2023]
Abstract
As the USA faces a worsening overdose crisis, improving access to evidence-based treatment for opioid use disorder (OUD) remains a policy priority. Federal regulatory changes in response to the COVID-19 pandemic substantially expanded flexibilities on take-home doses for methadone treatment for OUD. These changes have fuelled questions about the effect of new regulations on OUD outcomes and the potential effect on health of permanently integrating these flexibilities into treatment policy going forward. To aide US policy makers as they consider implementing permanent methadone regulatory changes, we conducted a review synthesising peer-reviewed research on the effect of the flexibilities of methadone take-home policies introduced during COVID-19 on methadone programme operations, OUD patient and provider experiences, and patient health outcomes. We interpret the findings in the context of the federal rule-making process and discuss avenues by which these findings can be incorporated and implemented into US policies on substance use treatment going forward.
Collapse
Affiliation(s)
- Noa Krawczyk
- Department of Population Health, Center for Opioid Epidemiology and Policy (COEP), NYU Grossman School of Medicine, New York, NY, USA.
| | - Bianca D Rivera
- Department of Population Health, Center for Opioid Epidemiology and Policy (COEP), NYU Grossman School of Medicine, New York, NY, USA
| | - Emily Levin
- Regulatory Studies Center, The George Washington University, Washington, DC, USA
| | - Bridget C E Dooling
- Regulatory Studies Center, The George Washington University, Washington, DC, USA
| |
Collapse
|
9
|
Karas B, Qu S, Xu Y, Zhu Q. Experiments with LDA and Top2Vec for embedded topic discovery on social media data—A case study of cystic fibrosis. Front Artif Intell 2022; 5:948313. [PMID: 36062265 PMCID: PMC9433987 DOI: 10.3389/frai.2022.948313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 07/25/2022] [Indexed: 11/18/2022] Open
Abstract
Social media has become an important resource for discussing, sharing, and seeking information pertinent to rare diseases by patients and their families, given the low prevalence in the extraordinarily sparse populations. In our previous study, we identified prevalent topics from Reddit via topic modeling for cystic fibrosis (CF). While we were able to derive/access concerns/needs/questions of patients with CF, we observed challenges and issues with the traditional techniques of topic modeling, e.g., Latent Dirichlet Allocation (LDA), for fulfilling the task of topic extraction. Thus, here we present our experiments to extend the previous study with an aim of improving the performance of topic modeling, by experimenting with LDA model optimization and examination of the Top2Vec model with different embedding models. With the demonstrated results with higher coherence and qualitatively higher human readability of derived topics, we implemented the Top2Vec model with doc2vec as the embedding model as our final model to extract topics from a subreddit of CF (“r/CysticFibrosis”) and proposed to expand its use with other types of social media data for other rare diseases for better assessing patients' needs with social media data.
Collapse
Affiliation(s)
- Bradley Karas
- Division of Rare Diseases Research Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Bethesda, MD, United States
| | - Sue Qu
- Division of Rare Diseases Research Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Bethesda, MD, United States
| | - Yanji Xu
- Division of Rare Diseases Research Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Bethesda, MD, United States
- *Correspondence: Yanji Xu
| | - Qian Zhu
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, (NCATS), National Institutes of Health (NIH), Rockville, MD, United States
- Qian Zhu
| |
Collapse
|