Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

231
(from Reference Citation Analysis)

Article PDFs (60)

Cited by > 0 (123)

Searched Name

topic modeling

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Zhang JM, Wang Y, Mouton M, Zhang J, Shi M. Public Discourse, User Reactions, and Conspiracy Theories on the X Platform About HIV Vaccines: Data Mining and Content Analysis. J Med Internet Res 2024;26:e53375. [PMID: 38568723 PMCID: PMC11024739 DOI: 10.2196/53375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 11/08/2023] [Accepted: 02/28/2024] [Indexed: 04/05/2024] Open

Abstract

BACKGROUND

The initiation of clinical trials for messenger RNA (mRNA) HIV vaccines in early 2022 revived public discussion on HIV vaccines after 3 decades of unsuccessful research. These trials followed the success of mRNA technology in COVID-19 vaccines but unfolded amid intense vaccine debates during the COVID-19 pandemic. It is crucial to gain insights into public discourse and reactions about potential new vaccines, and social media platforms such as X (formerly known as Twitter) provide important channels.

OBJECTIVE

Drawing from infodemiology and infoveillance research, this study investigated the patterns of public discourse and message-level drivers of user reactions on X regarding HIV vaccines by analyzing posts using machine learning algorithms. We examined how users used different post types to contribute to topics and valence and how these topics and valence influenced like and repost counts. In addition, the study identified salient aspects of HIV vaccines related to COVID-19 and prominent anti-HIV vaccine conspiracy theories through manual coding.

METHODS

We collected 36,424 English-language original posts about HIV vaccines on the X platform from January 1, 2022, to December 31, 2022. We used topic modeling and sentiment analysis to uncover latent topics and valence, which were subsequently analyzed across post types in cross-tabulation analyses and integrated into linear regression models to predict user reactions, specifically likes and reposts. Furthermore, we manually coded the 1000 most engaged posts about HIV and COVID-19 to uncover salient aspects of HIV vaccines related to COVID-19 and the 1000 most engaged negative posts to identify prominent anti-HIV vaccine conspiracy theories.

RESULTS

Topic modeling revealed 3 topics: HIV and COVID-19, mRNA HIV vaccine trials, and HIV vaccine and immunity. HIV and COVID-19 underscored the connections between HIV vaccines and COVID-19 vaccines, as evidenced by subtopics about their reciprocal impact on development and various comparisons. The overall valence of the posts was marginally positive. Compared to self-composed posts initiating new conversations, there was a higher proportion of HIV and COVID-19-related and negative posts among quote posts and replies, which contribute to existing conversations. The topic of mRNA HIV vaccine trials, most evident in self-composed posts, increased repost counts. Positive valence increased like and repost counts. Prominent anti-HIV vaccine conspiracy theories often falsely linked HIV vaccines to concurrent COVID-19 and other HIV-related events.

CONCLUSIONS

The results highlight COVID-19 as a significant context for public discourse and reactions regarding HIV vaccines from both positive and negative perspectives. The success of mRNA COVID-19 vaccines shed a positive light on HIV vaccines. However, COVID-19 also situated HIV vaccines in a negative context, as observed in some anti-HIV vaccine conspiracy theories misleadingly connecting HIV vaccines with COVID-19. These findings have implications for public health communication strategies concerning HIV vaccines.

Collapse

Sun S, Zack T, Williams CYK, Sushil M, Butte AJ. Topic modeling on clinical social work notes for exploring social determinants of health factors. JAMIA Open 2024;7:ooad112. [PMID: 38223407 PMCID: PMC10788143 DOI: 10.1093/jamiaopen/ooad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 12/17/2023] [Accepted: 12/23/2023] [Indexed: 01/16/2024] Open

Abstract

Objective

Existing research on social determinants of health (SDoH) predominantly focuses on physician notes and structured data within electronic medical records. This study posits that social work notes are an untapped, potentially rich source for SDoH information. We hypothesize that clinical notes recorded by social workers, whose role is to ameliorate social and economic factors, might provide a complementary information source of data on SDoH compared to physician notes, which primarily concentrate on medical diagnoses and treatments. We aimed to use word frequency analysis and topic modeling to identify prevalent terms and robust topics of discussion within a large cohort of social work notes including both outpatient and in-patient consultations.

Materials and methods

We retrieved a diverse, deidentified corpus of 0.95 million clinical social work notes from 181 644 patients at the University of California, San Francisco. We conducted word frequency analysis related to ICD-10 chapters to identify prevalent terms within the notes. We then applied Latent Dirichlet Allocation (LDA) topic modeling analysis to characterize this corpus and identify potential topics of discussion, which was further stratified by note types and disease groups.

Results

Word frequency analysis primarily identified medical-related terms associated with specific ICD10 chapters, though it also detected some subtle SDoH terms. In contrast, the LDA topic modeling analysis extracted 11 topics explicitly related to social determinants of health risk factors, such as financial status, abuse history, social support, risk of death, and mental health. The topic modeling approach effectively demonstrated variations between different types of social work notes and across patients with different types of diseases or conditions.

Discussion

Our findings highlight LDA topic modeling's effectiveness in extracting SDoH-related themes and capturing variations in social work notes, demonstrating its potential for informing targeted interventions for at-risk populations.

Conclusion

Social work notes offer a wealth of unique and valuable information on an individual's SDoH. These notes present consistent and meaningful topics of discussion that can be effectively analyzed and utilized to improve patient care and inform targeted interventions for at-risk populations.

Collapse

Karabacak M, Jagtiani P, Jain A, Panov F, Margetis K. Tracing topics and trends in drug-resistant epilepsy research using a natural language processing-based topic modeling approach. Epilepsia 2024;65:861-872. [PMID: 38314969 DOI: 10.1111/epi.17890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 01/08/2024] [Accepted: 01/09/2024] [Indexed: 02/07/2024]

Abstract

Epilepsy is a common neurological disorder affecting over 70 million people worldwide. Although many patients achieve seizure control with anti-epileptic drugs (AEDs), 30%-40% develop drug-resistant epilepsy (DRE), where seizures persist despite adequate trials of AEDs. DRE is associated with reduced quality of life, increased mortality and morbidity, and greater socioeconomic challenges. The continued intractability of DRE has fueled exponential growth in research that aims to understand and treat this serious condition. However, synthesizing this vast and continuously expanding DRE literature to derive insights poses considerable difficulties for investigators and clinicians. Conventional review methods are often prolonged, hampering the timely application of findings. More-efficient approaches to analyze the voluminous research are needed. In this study, we utilize a natural language processing (NLP)-based topic modeling approach to examine the DRE publication landscape, uncovering key topics and trends. Documents were retrieved from Scopus, preprocessed, and modeled using BERTopic. This technique employs transformer models like BERT (Bidirectional Encoder Representations from Transformers) for contextual understanding, thereby enabling accurate topic categorization. Analysis revealed 18 distinct topics spanning various DRE research areas. The 10 most common topics, including "AEDs," "Neuromodulation Therapy," and "Genomics," were examined further. "Cannabidiol," "Functional Brain Mapping," and "Autoimmune Encephalitis" emerged as the hottest topics of the current decade, and were examined further. This NLP methodology provided valuable insights into the evolving DRE research landscape, revealing shifting priorities and declining interests. Moreover, we demonstrate an efficient approach to synthesizing and visualizing patterns within extensive literature that could be applied to other research fields.

Collapse

Ruocco L, Zhuang Y, Ng R, Munthali RJ, Hudec KL, Wang AY, Vereschagin M, Vigo DV. A platform for connecting social media data to domain-specific topics using large language models: an application to student mental health. JAMIA Open 2024;7:ooae001. [PMID: 38250583 PMCID: PMC10799551 DOI: 10.1093/jamiaopen/ooae001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 12/20/2023] [Accepted: 01/05/2024] [Indexed: 01/23/2024] Open

Malagoli G, Valle F, Barillot E, Caselle M, Martignetti L. Identification of Interpretable Clusters and Associated Signatures in Breast Cancer Single-Cell Data: A Topic Modeling Approach. Cancers (Basel) 2024;16:1350. [PMID: 38611028 PMCID: PMC11011054 DOI: 10.3390/cancers16071350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 03/25/2024] [Accepted: 03/28/2024] [Indexed: 04/14/2024] Open

He Y, Zhu W, Wang T, Chen H, Xin J, Liu Y, Lei J, Liang J. Mining User Reviews From Hypertension Management Mobile Health Apps to Explore Factors Influencing User Satisfaction and Their Asymmetry: Comparative Study. JMIR Mhealth Uhealth 2024;12:e55199. [PMID: 38547475 PMCID: PMC11009850 DOI: 10.2196/55199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 12/19/2023] [Accepted: 03/14/2024] [Indexed: 04/02/2024] Open

Abstract

BACKGROUND

Hypertension significantly impacts the well-being and health of individuals globally. Hypertension management apps (HMAs) have been shown to assist patients in controlling blood pressure (BP), with their efficacy validated in clinical trials. However, the utilization of HMAs continues to be suboptimal. Presently, there is a dearth of real-world research based on big data and exploratory mining that compares Chinese and American HMAs.

OBJECTIVE

This study aims to systematically gather HMAs and their user reviews from both China and the United States. Subsequently, using data mining techniques, the study aims to compare the user experience, satisfaction levels, influencing factors, and asymmetry between Chinese and American users of HMAs. In addition, the study seeks to assess the disparities in satisfaction and its determinants while delving into the asymmetry of these factors.

METHODS

The study sourced HMAs and user reviews from 10 prominent Chinese and American app stores globally. Using the latent Dirichlet allocation (LDA) topic model, the research identified various topics within user reviews. Subsequently, the Tobit model was used to investigate the impact and distinctions of each topic on user satisfaction. The Wald test was applied to analyze differences in effects across various factors.

RESULTS

We examined a total of 261 HMAs along with their associated user reviews, amounting to 116,686 reviews in total. In terms of quantity and overall satisfaction levels, Chinese HMAs (n=91) and corresponding reviews (n=16,561) were notably fewer compared with their American counterparts (n=220 HMAs and n=100,125 reviews). The overall satisfaction rate among HMA users was 75.22% (87,773/116,686), with Chinese HMAs demonstrating a higher satisfaction rate (13,866/16,561, 83.73%) compared with that for American HMAs (73,907/100,125, 73.81%). Chinese users primarily focus on reliability (2165/16,561, 13.07%) and measurement accuracy (2091/16,561, 12.63%) when considering HMAs, whereas American users prioritize BP tracking (17,285/100,125, 17.26%) and data synchronization (12,837/100,125, 12.82%). Seven factors (easy to use: P<.001; measurement accuracy: P<.001; compatibility: P<.001; cost: P<.001; heart rate detection function: P=.02; blood pressure tracking function: P<.001; and interface design: P=.01) significantly influenced the positive deviation (PD) of Chinese HMA user satisfaction, while 8 factors (easy to use: P<.001; reliability: P<.001; measurement accuracy: P<.001; compatibility: P<.001; cost: P<.001; interface design: P<.001; real-time: P<.001; and data privacy: P=.001) affected the negative deviation (ND). Notably, BP tracking had the greatest effect on PD (β=.354, P<.001), while cost had the most significant impact on ND (β=3.703, P<.001). All 12 factors (easy to use: P<.001; blood pressure tracking function: P<.001; data synchronization: P<.001; blood pressure management effect: P<.001; heart rate detection function: P<.001; data sharing: P<.001; reliability: P<.001; compatibility: P<.001; interface design: P<.001; advertisement distribution: P<.001; measurement accuracy: P<.001; and cost: P<.001) significantly influenced the PD and ND of American HMA user satisfaction. Notably, BP tracking had the greatest effect on PD (β=0.312, P<.001), while data synchronization had the most significant impact on ND (β=2.662, P<.001). In addition, the influencing factors of PD and ND in user satisfaction of HMA in China and the United States are different.

CONCLUSIONS

User satisfaction factors varied significantly between different countries, showing considerable asymmetry. For Chinese HMA users, ease of use and interface design emerged as motivational factors, while factors such as cost, measurement accuracy, and compatibility primarily contributed to user dissatisfaction. For American HMA users, motivational factors were ease of use, BP tracking, BP management effect, interface design, measurement accuracy, and cost. Moreover, users expect features such as data sharing, synchronization, software reliability, compatibility, heart rate detection, and nonintrusive advertisement distribution. Tailored experience plans should be devised for different user groups in various countries to address these diverse preferences and requirements.

Collapse

Amjad T, Timakum T, Xie Q, Song M. Editorial: Text mining-based mental health research. Front Res Metr Anal 2024;9:1388691. [PMID: 38585672 PMCID: PMC10995400 DOI: 10.3389/frma.2024.1388691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 03/12/2024] [Indexed: 04/09/2024] Open

Molenaar A, Lukose D, Brennan L, Jenkins EL, McCaffrey TA. Using Natural Language Processing to Explore Social Media Opinions on Food Security: Sentiment Analysis and Topic Modeling Study. J Med Internet Res 2024;26:e47826. [PMID: 38512326 PMCID: PMC10995791 DOI: 10.2196/47826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 12/05/2023] [Accepted: 12/20/2023] [Indexed: 03/22/2024] Open

Abstract

BACKGROUND

Social media has the potential to be of great value in understanding patterns in public health using large-scale analysis approaches (eg, data science and natural language processing [NLP]), 2 of which have been used in public health: sentiment analysis and topic modeling; however, their use in the area of food security and public health nutrition is limited.

OBJECTIVE

This study aims to explore the potential use of NLP tools to gather insights from real-world social media data on the public health issue of food security.

METHODS

A search strategy for obtaining tweets was developed using food security terms. Tweets were collected using the Twitter application programming interface from January 1, 2019, to December 31, 2021, filtered for Australia-based users only. Sentiment analysis of the tweets was performed using the Valence Aware Dictionary and Sentiment Reasoner. Topic modeling exploring the content of tweets was conducted using latent Dirichlet allocation with BigML (BigML, Inc). Sentiment, topic, and engagement (the sum of likes, retweets, quotations, and replies) were compared across years.

RESULTS

In total, 38,070 tweets were collected from 14,880 Twitter users. Overall, the sentiment when discussing food security was positive, although this varied across the 3 years. Positive sentiment remained higher during the COVID-19 lockdown periods in Australia. The topic model contained 10 topics (in order from highest to lowest probability in the data set): "Global production," "Food insecurity and health," "Use of food banks," "Giving to food banks," "Family poverty," "Food relief provision," "Global food insecurity," "Climate change," "Australian food insecurity," and "Human rights." The topic "Giving to food banks," which focused on support and donation, had the highest proportion of positive sentiment, and "Global food insecurity," which covered food insecurity prevalence worldwide, had the highest proportion of negative sentiment. When compared with news, there were some events, such as COVID-19 support payment introduction and bushfires across Australia, that were associated with high periods of positive or negative sentiment. Topics related to food insecurity prevalence, poverty, and food relief in Australia were not consistently more prominent during the COVID-19 pandemic than before the pandemic. Negative tweets received substantially higher engagement across 2019 and 2020. There was no clear relationship between topics that were more likely to be positive or negative and have higher or lower engagement, indicating that the identified topics are discrete issues.

CONCLUSIONS

In this study, we demonstrated the potential use of sentiment analysis and topic modeling to explore evolution in conversations on food security using social media data. Future use of NLP in food security requires the context of and interpretation by public health experts and the use of broader data sets, with the potential to track dimensions or events related to food security to inform evidence-based decision-making in this area.

Collapse

Ehnert P, Schröter J. Key point generation as an instrument for generating core statements of a political debate on Twitter. Front Artif Intell 2024;7:1200949. [PMID: 38576459 PMCID: PMC10993730 DOI: 10.3389/frai.2024.1200949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 03/01/2024] [Indexed: 04/06/2024] Open

Abstract

Identifying key statements in large volumes of short, user-generated texts is essential for decision-makers to quickly grasp their key content. To address this need, this research introduces a novel abstractive key point generation (KPG) approach applicable to unlabeled text corpora, using an unsupervised approach, a feature not yet seen in existing abstractive KPG methods. The proposed method uniquely combines topic modeling for unsupervised data space segmentation with abstractive summarization techniques to efficiently generate semantically representative key points from text collections. This is further enhanced by hyperparameter tuning to optimize both the topic modeling and abstractive summarization processes. The hyperparameter tuning of the topic modeling aims at making the cluster assignment more deterministic as the probabilistic nature of the process would otherwise lead to high variability in the output. The abstractive summarization process is optimized using a Davies-Bouldin Index specifically adapted to this use case, so that the generated key points more accurately reflect the characteristic properties of this cluster. In addition, our research recommends an automated evaluation that provides a quantitative complement to the traditional qualitative analysis of KPG. This method regards KPG as a specialized form of Multidocument summarization (MDS) and employs both word-based and word-embedding-based metrics for evaluation. These criteria allow for a comprehensive and nuanced analysis of the KPG output. Demonstrated through application to a political debate on Twitter, the versatility of this approach extends to various domains, such as product review analysis and survey evaluation. This research not only paves the way for innovative development in abstractive KPG methods but also sets a benchmark for their evaluation.

Collapse

Chen P, Jin Y, Ma X, Lin Y. Public perception on active aging after COVID-19: an unsupervised machine learning analysis of 44,343 posts. Front Public Health 2024;12:1329704. [PMID: 38515596 PMCID: PMC10956692 DOI: 10.3389/fpubh.2024.1329704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 02/27/2024] [Indexed: 03/23/2024] Open

Karabacak M, Jain A, Jagtiani P, Hickman ZL, Dams-O'Connor K, Margetis K. Exploiting Natural Language Processing to Unveil Topics and Trends of Traumatic Brain Injury Research. Neurotrauma Rep 2024;5:203-214. [PMID: 38463422 PMCID: PMC10924051 DOI: 10.1089/neur.2023.0102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2024] Open

Abstract

Traumatic brain injury (TBI) has evolved from a topic of relative obscurity to one of widespread scientific and lay interest. The scope and focus of TBI research have shifted, and research trends have changed in response to public and scientific interest. This study has two primary goals: first, to identify the predominant themes in TBI research; and second, to delineate "hot" and "cold" areas of interest by evaluating the current popularity or decline of these topics. Hot topics may be dwarfed in absolute numbers by other, larger TBI research areas but are rapidly gaining interest. Likewise, cold topics may present opportunities for researchers to revisit unanswered questions. We utilized BERTopic, an advanced natural language processing (NLP)-based technique, to analyze TBI research articles published since 1990. This approach facilitated the identification of key topics by extracting sets of distinctive keywords representative of each article's core themes. Using these topics' probabilities, we trained linear regression models to detect trends over time, recognizing topics that were gaining (hot) or losing (cold) relevance. Additionally, we conducted a specific analysis focusing on the trends observed in TBI research in the current decade (the 2020s). Our topic modeling analysis categorized 42,422 articles into 27 distinct topics. The 10 most frequently occurring topics were: "Rehabilitation," "Molecular Mechanisms of TBI," "Concussion," "Repetitive Head Impacts," "Surgical Interventions," "Biomarkers," "Intracranial Pressure," "Posttraumatic Neurodegeneration," "Chronic Traumatic Encephalopathy," and "Blast Induced TBI," while our trend analysis indicated that the hottest topics of the current decade were "Genomics," "Sex Hormones," and "Diffusion Tensor Imaging," while the cooling topics were "Posttraumatic Sleep," "Sensory Functions," and "Hyperosmolar Therapies." This study highlights the dynamic nature of TBI research and underscores the shifting emphasis within the field. The findings from our analysis can aid in the identification of emerging topics of interest and areas where there is little new research reported. By utilizing NLP to effectively synthesize and analyze an extensive collection of TBI-related scholarly literature, we demonstrate the potential of machine learning techniques in understanding and guiding future research prospects. This approach sets the stage for similar analyses in other medical disciplines, offering profound insights and opportunities for further exploration.

Collapse

Aldosery A, Carruthers R, Kay K, Cave C, Reynolds P, Kostkova P. Enhancing public health response: a framework for topics and sentiment analysis of COVID-19 in the UK using Twitter and the embedded topic model. Front Public Health 2024;12:1105383. [PMID: 38450124 PMCID: PMC10915179 DOI: 10.3389/fpubh.2024.1105383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 01/10/2024] [Indexed: 03/08/2024] Open

Abstract

Introduction

To protect citizens during the COVID-19 pandemic unprecedented public health restrictions were imposed on everyday life in the UK and around the world. In emergencies like COVID-19, it is crucial for policymakers to be able to gauge the public response and sentiment to such measures in almost real-time and establish best practices for the use of social media for emergency response.

Methods

In this study, we explored Twitter as a data source for assessing public reaction to the pandemic. We conducted an analysis of sentiment by topic using 25 million UK tweets, collected from 26th May 2020 to 8th March 2021. We combined an innovative combination of sentiment analysis via a recurrent neural network and topic clustering through an embedded topic model.

Results

The results demonstrated interpretable per-topic sentiment signals across time and geography in the UK that could be tied to specific public health and policy events during the pandemic. Unique to this investigation is the juxtaposition of derived sentiment trends against behavioral surveys conducted by the UK Office for National Statistics, providing a robust gauge of the public mood concurrent with policy announcements.

Discussion

While much of the existing research focused on specific questions or new techniques, we developed a comprehensive framework for the assessment of public response by policymakers for COVID-19 and generalizable for future emergencies. The emergent methodology not only elucidates the public's stance on COVID-19 policies but also establishes a generalizable framework for public policymakers to monitor and assess the buy-in and acceptance of their policies almost in real-time. Further, the proposed approach is generalizable as a tool for policymakers and could be applied to further subjects of political and public interest.

Collapse

Oh WO, Lee E, Heo YJ, Jung MJ, Han J. Understanding global research trends in the control and prevention of infectious diseases for children: Insights from text mining and topic modeling. J Nurs Scholarsh 2024. [PMID: 38380588 DOI: 10.1111/jnu.12963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 12/14/2023] [Accepted: 01/30/2024] [Indexed: 02/22/2024]

Abstract

INTRODUCTION

The emergence of novel infectious diseases has amplified the urgent need for effective prevention strategies, especially ones targeting vulnerable populations such as children. Factors such as the high incidence of both emerging and existing infectious diseases, delays in vaccinations, and routine exposure in communal settings heighten children's susceptibility to infections. Despite this pressing need, a comprehensive exploration of research trends in this domain remains lacking. This study aims to address this gap by employing text mining and modeling techniques to conduct a comprehensive analysis of the existing literature, thereby identifying emerging research trends in infectious disease prevention among children.

METHODS

A cross-sectional text mining approach was adopted, focusing on journal articles published between January 1, 2003, and August 31, 2022. These articles, related to infectious disease prevention in children, were sourced from databases such as PubMed, CINAHL, MEDLINE (Ovid), Scopus, and Korean RISS. The data underwent preprocessing using the Natural Language Toolkit (NLTK) in Python, with a semantic network analysis and topic modeling conducted using R software.

RESULTS

The final dataset comprised 509 journal articles extracted from multiple databases. The study began with a word frequency analysis to pinpoint relevant themes, subsequently visualized through a word cloud. Dominant terms encompassed "vaccination," "adolescent," "infant," "parent," "family," "school," "country," "household," "community," "HIV," "HPV," "COVID-19," "influenza," and "diarrhea." The semantic analysis identified "age" as a key term across infection, control, and intervention discussions. Notably, the relationship between "hand" and "handwashing" was prominent, especially in educational contexts linked with "school" and "absence." Latent Dirichlet Allocation (LDA) topic modeling further delineated seven topics related to infectious disease prevention for children, encompassing (1) educational programs, (2) vaccination efforts, (3) family-level responses, (4) care for immunocompromised individuals, (5) country-specific responses, (6) school-based strategies, and (7) persistent threats from established infectious diseases.

CONCLUSION

The study emphasizes the indispensable role of personalized interventions tailored for various child demographics, highlighting the pivotal contributions of both parental guidance and school participation.

CLINICAL RELEVANCE

The study provides insights into the complex public health challenges associated with preventing and managing infectious diseases in children. The insights derived could inform the formulation of evidence-based public health policies, steering practical interventions and fostering interdisciplinary synergy for holistic prevention strategies.

Collapse

Ramamoorthy T, Kulothungan V, Mappillairaju B. Topic modeling and social network analysis approach to explore diabetes discourse on Twitter in India. Front Artif Intell 2024;7:1329185. [PMID: 38410423 PMCID: PMC10895681 DOI: 10.3389/frai.2024.1329185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2023] [Accepted: 01/22/2024] [Indexed: 02/28/2024] Open

Abstract

Introduction

The utilization of social media presents a promising avenue for the prevention and management of diabetes. To effectively cater to the diabetes-related knowledge, support, and intervention needs of the community, it is imperative to attain a deeper understanding of the extent and content of discussions pertaining to this health issue. This study aims to assess and compare various topic modeling techniques to determine the most effective model for identifying the core themes in diabetes-related tweets, the sources responsible for disseminating this information, the reach of these themes, and the influential individuals within the Twitter community in India.

Methods

Twitter messages from India, dated between 7 November 2022 and 28 February 2023, were collected using the Twitter API. The unsupervised machine learning topic models, namely, Latent Dirichlet Allocation (LDA), non-negative matrix factorization (NMF), BERTopic, and Top2Vec, were compared, and the best-performing model was used to identify common diabetes-related topics. Influential users were identified through social network analysis.

Results

The NMF model outperformed the LDA model, whereas BERTopic performed better than Top2Vec. Diabetes-related conversations revolved around eight topics, namely, promotion, management, drug and personal story, consequences, risk factors and research, raising awareness and providing support, diet, and opinion and lifestyle changes. The influential nodes identified were mainly health professionals and healthcare organizations.

Discussion

The study identified important topics of discussion along with health professionals and healthcare organizations involved in sharing diabetes-related information with the public. Collaborations among influential healthcare organizations, health professionals, and the government can foster awareness and prevent noncommunicable diseases.

Collapse

Rhee JU, Huang Y, Soroosh AJ, Alsudais S, Ni S, Kumar A, Paredes J, Li C, Timberlake DS. The Marketing and Perceptions of Non-Tobacco Blunt Wraps on Twitter. Subst Use Misuse 2024;59:469-477. [PMID: 37982451 DOI: 10.1080/10826084.2023.2280572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/21/2023]

Guo F, Liu Z, Lu Q, Ji S, Zhang C. Public Opinion About COVID-19 on a Microblog Platform in China: Topic Modeling and Multidimensional Sentiment Analysis of Social Media. J Med Internet Res 2024;26:e47508. [PMID: 38294856 PMCID: PMC10833090 DOI: 10.2196/47508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 09/09/2023] [Accepted: 12/20/2023] [Indexed: 02/01/2024] Open

Ari A, Wang T, Hudgins AM. Impact of Program Components on Perceived Organizational Support in Respiratory Care Education. Respir Care 2024;69:210-217. [PMID: 37643868 PMCID: PMC10898472 DOI: 10.4187/respcare.11225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]

Abstract

BACKGROUND

Perceived organizational support has been linked to employee commitment and job satisfaction. Understanding the effects of perceived organizational support on employees allows leaders to improve employees' performance and the success of their organizations. The purpose of this study was to identify the perceived organizational support across different respiratory care education programs in the United States.

METHODS

All chairs and program directors of bachelor's of science and master's of science degree respiratory care education programs in the United States were surveyed (N = 97). The Survey of Perceived Organizational Support was modified after written approval, and the final instrument included 31 items with a Likert scale (1 = strongly disagree, 7 = strongly agree). Descriptive statistics, multiple regression, and topic modeling were used for data analysis (P < .05).

RESULTS

A total of 67 respondents responded to the perceived organizational support survey; a 69% response rate. They were satisfied with their job and committed to their institutions. They also reported that faculty salaries were equitable relative to the national average, and their institutions encouraged teamwork among faculty. The respondents' titles, total years of administrative experience, students' scores on the national credentialing therapist multiple choice examination (TMC), and institutions that offer both bachelor's of science and master's of science degree programs had a direct relationship with perceived organizational support in respiratory care education programs. Age and sex were inversely related to perceived organizational support. A topic modeling analysis based on the respondents' opinions about perceived organizational support showed that the respondents frequently mentioned the words support, institution, budget, year, nursing, and experience. The respondents emphasized the importance of support, institution marketing, their years of experience, and the program budget. They also mentioned that nursing programs overshadowed respiratory care education programs at their institutions.

CONCLUSIONS

Age, sex, job title, years of administrative experience, students' TMC scores, and the type of programs offered impacted perceived organizational support by respiratory care directors. Student-, program- and participant-related factors can be used to improve perceived organizational support in respiratory care education.

Collapse

Yin S, Chen S, Ge Y. Dynamic Associations Between Centers for Disease Control and Prevention Social Media Contents and Epidemic Measures During COVID-19: Infoveillance Study. JMIR Infodemiology 2024;4:e49756. [PMID: 38261367 PMCID: PMC10848128 DOI: 10.2196/49756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 10/02/2023] [Accepted: 10/14/2023] [Indexed: 01/24/2024]

Abstract

BACKGROUND

Health agencies have been widely adopting social media to disseminate important information, educate the public on emerging health issues, and understand public opinions. The Centers for Disease Control and Prevention (CDC) widely used social media platforms during the COVID-19 pandemic to communicate with the public and mitigate the disease in the United States. It is crucial to understand the relationships between the CDC's social media communications and the actual epidemic metrics to improve public health agencies' communication strategies during health emergencies.

OBJECTIVE

This study aimed to identify key topics in tweets posted by the CDC during the pandemic, investigate the temporal dynamics between these key topics and the actual COVID-19 epidemic measures, and make recommendations for the CDC's digital health communication strategies for future health emergencies.

METHODS

Two types of data were collected: (1) a total of 17,524 COVID-19-related English tweets posted by the CDC between December 7, 2019, and January 15, 2022, and (2) COVID-19 epidemic measures in the United States from the public GitHub repository of Johns Hopkins University from January 2020 to July 2022. Latent Dirichlet allocation topic modeling was applied to identify key topics from all COVID-19-related tweets posted by the CDC, and the final topics were determined by domain experts. Various multivariate time series analysis techniques were applied between each of the identified key topics and actual COVID-19 epidemic measures to quantify the dynamic associations between these 2 types of time series data.

RESULTS

Four major topics from the CDC's COVID-19 tweets were identified: (1) information on the prevention of health outcomes of COVID-19; (2) pediatric intervention and family safety; (3) updates of the epidemic situation of COVID-19; and (4) research and community engagement to curb COVID-19. Multivariate analyses showed that there were significant variabilities of progression between the CDC's topics and the actual COVID-19 epidemic measures. Some CDC topics showed substantial associations with the COVID-19 measures over different time spans throughout the pandemic, expressing similar temporal dynamics between these 2 types of time series data.

CONCLUSIONS

Our study is the first to comprehensively investigate the dynamic associations between topics discussed by the CDC on Twitter and the COVID-19 epidemic measures in the United States. We identified 4 major topic themes via topic modeling and explored how each of these topics was associated with each major epidemic measure by performing various multivariate time series analyses. We recommend that it is critical for public health agencies, such as the CDC, to update and disseminate timely and accurate information to the public and align major topics with key epidemic measures over time. We suggest that social media can help public health agencies to inform the public on health emergencies and to mitigate them effectively.

Collapse

Shiferaw KB, Wali P, Waltemath D, Zeleke AA. Navigating the AI frontiers in cardiovascular research: a bibliometric exploration and topic modeling. Front Cardiovasc Med 2024;10:1308668. [PMID: 38235288 PMCID: PMC10793658 DOI: 10.3389/fcvm.2023.1308668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 12/04/2023] [Indexed: 01/19/2024] Open

Abstract

Artificial intelligence (AI) has emerged as a promising field in cardiovascular disease (CVD) research, offering innovative approaches to enhance diagnosis, treatment, and patient outcomes. In this study, we conducted bibliometric analysis combined with topic modeling to provide a comprehensive overview of the AI research landscape in CVD. Our analysis included 23,846 studies from Web of Science and PubMed, capturing the latest advancements and trends in this rapidly evolving field. By employing LDA (Latent Dirichlet Allocation) we identified key research themes, trends, and collaborations within the AI-CVD domain. The findings revealed the exponential growth of AI-related research in CVD, underscoring its immense potential to revolutionize cardiovascular healthcare. The annual scientific publication of machine learning papers in CVD increases continuously and significantly since 2016, with an overall annual growth rate of 22.8%. Almost half (46.2%) of the growth happened in the last 5 years. USA, China, India, UK and Korea were the top five productive countries in number of publications. UK, Germany and Australia were the most collaborative countries with a multiple country publication (MCP) value of 42.8%, 40.3% and 40.0% respectively. We observed the emergence of twenty-two distinct research topics, including "stroke and robotic rehabilitation therapy," "robotic-assisted cardiac surgery," and "cardiac image analysis," which persisted as major topics throughout the years. Other topics, such as "retinal image analysis and CVD" and "biomarker and wearable signal analyses," have recently emerged as dominant areas of research in cardiovascular medicine. Convolutional neural network appears to be the most mentioned algorithm followed by LSTM (Long Short-Term Memory) and KNN (K-Nearest Neighbours). This indicates that the future direction of AI cardiovascular research is predominantly directing toward neural networks and image analysis. As AI continues to shape the landscape of CVD research, our study serves as a comprehensive guide for researchers, practitioners, and policymakers, providing valuable insights into the current state of AI in CVD research. This study offers a deep understanding of research trends and paves the way for future directions to maximiz the potential of AI to effectively combat cardiovascular diseases.

Collapse

Serrano-Guerrero J, Bani-Doumi M, Chiclana F, Romero FP, Olivas JA. How satisfied are patients with nursing care and why? A comprehensive study based on social media and opinion mining. Inform Health Soc Care 2024;49:14-27. [PMID: 38178275 DOI: 10.1080/17538157.2023.2297307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]

Park B, Jang IS, Kwak D. Sentiment analysis of the COVID-19 vaccine perception. Health Informatics J 2024;30:14604582241236131. [PMID: 38403926 DOI: 10.1177/14604582241236131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]

Kim A, Choi S, Woo K. Navigating the Landscape of Telemedicine Research: A Topic Modeling Approach for the Present and Future. Telemed J E Health 2023. [PMID: 38153985 DOI: 10.1089/tmj.2023.0523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2023] Open

Abstract

Introduction: Telemedicine, which is the provision of remote clinical services via telecommunication technology, has undergone an upsurge since the COVID-19 pandemic. To capture this paradigm, this study surveyed telemedicine literature, including postpandemic publications, to identify dominant research themes and temporal trends and suggest directions for future research. Methods: A corpus of 56,445 telemedicine studies is sourced from PubMed. Latent Dirichlet allocation (LDA) topic modeling performed using the Konstanz Information Miner platform. The textual data for topic modeling were processed by following standard procedures for natural language processing. Moreover, the term frequency-inverse document frequency approach was used to capture the importance of words within the corpus. We assessed perplexity, coherence, and the elbow method to determine the optimal number of topics for modeling. Results: The findings confirm the surge in telemedicine research after 2020, signifying its prominence. LDA topic modeling reveals seven distinct research themes, with the most prominent topic being "patient satisfaction" (21.38%) followed by "perspectives and challenges" (17.95%), and "smartphone apps" (14.32%). Furthermore, the results demonstrate a noticeable shift in topics from screening to therapeutic applications of telemedicine. Conclusions: This study serves as a guide for a broad range of telemedicine research topics. This synthesis of themes reflects the commitment of scholars to address the changing dynamics and health care needs, such as the COVID-19 pandemic, aging in place, smartphone usage, and technological advancement. The analysis also reveals flexible research responses to policy and contextual shifts, highlighting the collective drive to broaden the application of telemedicine in community health care.

Collapse

Smith BP, Hoots B, DePadilla L, Roehler DR, Holland KM, Bowen DA, Sumner SA. Using Transformer-Based Topic Modeling to Examine Discussions of Delta-8 Tetrahydrocannabinol: Content Analysis. J Med Internet Res 2023;25:e49469. [PMID: 38127427 PMCID: PMC10767625 DOI: 10.2196/49469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 08/15/2023] [Accepted: 10/11/2023] [Indexed: 12/23/2023] Open

Abstract

BACKGROUND

Delta-8 tetrahydrocannabinol (THC) is a psychoactive cannabinoid found in small amounts naturally in the cannabis plant; it can also be synthetically produced in larger quantities from hemp-derived cannabidiol. Most states permit the sale of hemp and hemp-derived cannabidiol products; thus, hemp-derived delta-8 THC products have become widely available in many state hemp marketplaces, even where delta-9 THC, the most prominently occurring THC isomer in cannabis, is not currently legal. Health concerns related to the processing of delta-8 THC products and their psychoactive effects remain understudied.

OBJECTIVE

The goal of this study is to implement a novel topic modeling approach based on transformers, a state-of-the-art natural language processing architecture, to identify and describe emerging trends and topics of discussion about delta-8 THC from social media discourse, including potential symptoms and adverse health outcomes experienced by people using delta-8 THC products.

METHODS

Posts from January 2008 to December 2021 discussing delta-8 THC were isolated from cannabis-related drug forums on Reddit (Reddit Inc), a social media platform that hosts the largest web-based drug forums worldwide. Unsupervised topic modeling with state-of-the-art transformer-based models was used to cluster posts into topics and assign labels describing the kinds of issues being discussed with respect to delta-8 THC. Results were then validated by human subject matter experts.

RESULTS

There were 41,191 delta-8 THC posts identified and 81 topics isolated, the most prevalent being (1) discussion of specific brands or products, (2) comparison of delta-8 THC to other hemp-derived cannabinoids, and (3) safety warnings. About 5% (n=1220) of posts from the resulting topics included content discussing health-related symptoms such as anxiety, sleep disturbance, and breathing problems. Until 2020, Reddit posts contained fewer than 10 mentions of delta-8-THC for every 100,000 cannabis posts annually. However, in 2020, these rates increased by 13 times the 2019 rate (to 99.2 mentions per 100,000 cannabis posts) and continued to increase into 2021 (349.5 mentions per 100,000 cannabis posts).

CONCLUSIONS

Our study provides insights into emerging public health concerns around delta-8 THC, a novel substance about which little is known. Furthermore, we demonstrate the use of transformer-based unsupervised learning approaches to derive intelligible topics from highly unstructured discussions of delta-8 THC, which may help improve the timeliness of identification of emerging health concerns related to new substances.

Collapse

Shi J, Bendig D, Vollmar HC, Rasche P. Mapping the Bibliometrics Landscape of AI in Medicine: Methodological Study. J Med Internet Res 2023;25:e45815. [PMID: 38064255 PMCID: PMC10746970 DOI: 10.2196/45815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 08/16/2023] [Accepted: 09/30/2023] [Indexed: 12/18/2023] Open

Abstract

BACKGROUND

Artificial intelligence (AI), conceived in the 1950s, has permeated numerous industries, intensifying in tandem with advancements in computing power. Despite the widespread adoption of AI, its integration into medicine trails other sectors. However, medical AI research has experienced substantial growth, attracting considerable attention from researchers and practitioners.

OBJECTIVE

In the absence of an existing framework, this study aims to outline the current landscape of medical AI research and provide insights into its future developments by examining all AI-related studies within PubMed over the past 2 decades. We also propose potential data acquisition and analysis methods, developed using Python (version 3.11) and to be executed in Spyder IDE (version 5.4.3), for future analogous research.

METHODS

Our dual-pronged approach involved (1) retrieving publication metadata related to AI from PubMed (spanning 2000-2022) via Python, including titles, abstracts, authors, journals, country, and publishing years, followed by keyword frequency analysis and (2) classifying relevant topics using latent Dirichlet allocation, an unsupervised machine learning approach, and defining the research scope of AI in medicine. In the absence of a universal medical AI taxonomy, we used an AI dictionary based on the European Commission Joint Research Centre AI Watch report, which emphasizes 8 domains: reasoning, planning, learning, perception, communication, integration and interaction, service, and AI ethics and philosophy.

RESULTS

From 2000 to 2022, a comprehensive analysis of 307,701 AI-related publications from PubMed highlighted a 36-fold increase. The United States emerged as a clear frontrunner, producing 68,502 of these articles. Despite its substantial contribution in terms of volume, China lagged in terms of citation impact. Diving into specific AI domains, as the Joint Research Centre AI Watch report categorized, the learning domain emerged dominant. Our classification analysis meticulously traced the nuanced research trajectories across each domain, revealing the multifaceted and evolving nature of AI's application in the realm of medicine.

CONCLUSIONS

The research topics have evolved as the volume of AI studies increases annually. Machine learning remains central to medical AI research, with deep learning expected to maintain its fundamental role. Empowered by predictive algorithms, pattern recognition, and imaging analysis capabilities, the future of AI research in medicine is anticipated to concentrate on medical diagnosis, robotic intervention, and disease management. Our topic modeling outcomes provide a clear insight into the focus of AI research in medicine over the past decades and lay the groundwork for predicting future directions. The domains that have attracted considerable research attention, primarily the learning domain, will continue to shape the trajectory of AI in medicine. Given the observed growing interest, the domain of AI ethics and philosophy also stands out as a prospective area of increased focus.

Collapse

Scholz S, Berns I, Winkler C. Listen to the patients! Identifying CML patients' needs analyzing patient-generated content with AI-driven methodologies. Front Digit Health 2023;5:1243215. [PMID: 38116100 PMCID: PMC10729659 DOI: 10.3389/fdgth.2023.1243215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 11/20/2023] [Indexed: 12/21/2023] Open

Abstract

Background

Various patient support programs exist to provide successful therapy options for patients. Pharmaceutical companies are increasingly recognizing the importance of actively supporting patients in their long-term treatment. In order to effectively assist patients, it is crucial to understand their current needs by taking a look at the patients' opinions.

Objective

This study focuses specifically on chronic myeloid leukemia (CML) and aims to determine if the current patient engagement offerings from pharmaceutical companies adequately address the needs of CML patients. To achieve this, the study uses content generated by CML patients to assess the patient engagement strategies of selected pharmaceutical companies, explore the relevance of medication, their products, and services, and analyze key concerns from the perspective of the patients.

Methods

To address the research questions, various methodologies were employed. Initially, desk research was conducted to identify relevant pharmaceutical companies and internet forums related to CML. Subsequently, content generated by patients was acquired and AI-driven techniques such as topic modeling and topic evolution analyses were used to examine this user-generated content (UGC) within the identified public forums. This involved analyzing topic models and tracking topic changes over time.

Results

The desk research revealed that pharmaceutical companies primarily offer information about the disease and available treatment options. The UGC analysis confirmed the significant role played by the industry in supporting CML patients. Key areas of interest for patients include the disease itself, potential treatment methods and associated side effects, dosage of active substances, and the possibility of switching therapies due to treatment failure or resistance. Stem cell transplantation was also discussed.

Conclusions

Overall, the pharmaceutical industry adequately addresses the needs of CML patients. However, there is room for improvement in educating patients about treatment options, drugs, and their side effects. Psychological support should not be neglected. Since CML patients frequently engage with clinical trial outcomes, there is potential for increased patient involvement in such trials. Further research in this area is recommended.

Collapse

Thornton C, Lanyi K, Wilkins G, Potter R, Hunter E, Kolehmainen N, Pearson F. Scoping the Priorities and Concerns of Parents: Infodemiology Study of Posts on Mumsnet and Reddit. J Med Internet Res 2023;25:e47849. [PMID: 38015600 PMCID: PMC10716753 DOI: 10.2196/47849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 09/18/2023] [Accepted: 09/28/2023] [Indexed: 11/29/2023] Open

Abstract

BACKGROUND

Health technology innovation is increasingly supported by a bottom-up approach to priority setting, aiming to better reflect the concerns of its intended beneficiaries. Web-based forums provide parents with an outlet to share concerns, advice, and information related to parenting and the health and well-being of their children. They provide a rich source of data on parenting concerns and priorities that could inform future child health research and innovation.

OBJECTIVE

The aim of the study is to identify common concerns expressed on 2 major web-based forums and cluster these to identify potential family health concern topics as indicative priority areas for future research and innovation.

METHODS

We text-mined the r/Parenting subreddit (69,846 posts) and the parenting section of Mumsnet (99,848 posts) to create a large corpus of posts. A generative statistical model (latent Dirichlet allocation) was used to identify the most discussed topics in the corpus, and content analysis was applied to identify the parenting concerns found in a subset of posts.

RESULTS

A model with 25 topics produced the highest coherence and a wide range of meaningful parenting concern topics. The most frequently expressed parenting concerns are related to their child's sleep, self-care, eating (and food), behavior, childcare context, and the parental context including parental conflict. Topics directly associated with infants, such as potty training and bottle feeding, were more common on Mumsnet, while parental context and screen time were more common on r/Parenting.

CONCLUSIONS

Latent Dirichlet allocation topic modeling can be applied to gain a rapid, yet meaningful overview of parent concerns expressed on a large and diverse set of social media posts and used to complement traditional insight gathering methods. Parents framed their concerns in terms of children's everyday health concerns, generating topics that overlap significantly with established family health concern topics. We provide evidence of the range of family health concerns found at these sources and hope this can be used to generate material for use alongside traditional insight gathering methods.

Collapse

Ni C, Song Q, Malin B, Song L, Commiskey P, Stratton L, Yin Z. Examining Online Behaviors of Adult-Child and Spousal Caregivers for People Living With Alzheimer Disease or Related Dementias: Comparative Study in an Open Online Community. J Med Internet Res 2023;25:e48193. [PMID: 37976095 PMCID: PMC10692884 DOI: 10.2196/48193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Revised: 09/05/2023] [Accepted: 10/11/2023] [Indexed: 11/19/2023] Open

Abstract

BACKGROUND

Alzheimer disease or related dementias (ADRD) are severe neurological disorders that impair the thinking and memory skills of older adults. Most persons living with dementia receive care at home from their family members or other unpaid informal caregivers; this results in significant mental, physical, and financial challenges for these caregivers. To combat these challenges, many informal ADRD caregivers seek social support in online environments. Although research examining online caregiving discussions is growing, few investigations have distinguished caregivers according to their kin relationships with persons living with dementias. Various studies have suggested that caregivers in different relationships experience distinct caregiving challenges and support needs.

OBJECTIVE

This study aims to examine and compare the online behaviors of adult-child and spousal caregivers, the 2 largest groups of informal ADRD caregivers, in an open online community.

METHODS

We collected posts from ALZConnected, an online community managed by the Alzheimer's Association. To gain insights into online behaviors, we first applied structural topic modeling to identify topics and topic prevalence between adult-child and spousal caregivers. Next, we applied VADER (Valence Aware Dictionary for Sentiment Reasoning) and LIWC (Linguistic Inquiry and Word Count) to evaluate sentiment changes in the online posts over time for both types of caregivers. We further built machine learning models to distinguish the posts of each caregiver type and evaluated them in terms of precision, recall, F1-score, and area under the precision-recall curve. Finally, we applied the best prediction model to compare the temporal trend of relationship-predicting capacities in posts between the 2 types of caregivers.

RESULTS

Our analysis showed that the number of posts from both types of caregivers followed a long-tailed distribution, indicating that most caregivers in this online community were infrequent users. In comparison with adult-child caregivers, spousal caregivers tended to be more active in the community, publishing more posts and engaging in discussions on a wider range of caregiving topics. Spousal caregivers also exhibited slower growth in positive emotional communication over time. The best machine learning model for predicting adult-child, spousal, or other caregivers achieved an area under the precision-recall curve of 81.3%. The subsequent trend analysis showed that it became more difficult to predict adult-child caregiver posts than spousal caregiver posts over time. This suggests that adult-child and spousal caregivers might gradually shift their discussions from questions that are more directly related to their own experiences and needs to questions that are more general and applicable to other types of caregivers.

CONCLUSIONS

Our findings suggest that it is important for researchers and community organizers to consider the heterogeneity of caregiving experiences and subsequent online behaviors among different types of caregivers when tailoring online peer support to meet the specific needs of each caregiver group.

Collapse

Beierle F, Pryss R, Aizawa A. Sentiments about Mental Health on Twitter-Before and during the COVID-19 Pandemic. Healthcare (Basel) 2023;11:2893. [PMID: 37958038 PMCID: PMC10647444 DOI: 10.3390/healthcare11212893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 10/23/2023] [Accepted: 11/01/2023] [Indexed: 11/15/2023] Open

Timakum T, Xie Q, Lee S. Identifying mental health discussion topic in social media community: subreddit of bipolar disorder analysis. Front Res Metr Anal 2023;8:1243407. [PMID: 38025958 PMCID: PMC10654961 DOI: 10.3389/frma.2023.1243407] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 10/17/2023] [Indexed: 12/01/2023] Open

Ozkara BB, Karabacak M, Margetis K, Yedavalli VS, Wintermark M, Bisdas S. Assessment of Computed Tomography Perfusion Research Landscape: A Topic Modeling Study. Tomography 2023;9:2016-2028. [PMID: 37987344 PMCID: PMC10661298 DOI: 10.3390/tomography9060158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 10/22/2023] [Accepted: 10/23/2023] [Indexed: 11/22/2023] Open

Lan D, Ren W, Ni K, Zhu Y. Topic and Trend Analysis of Weibo Discussions About COVID-19 Medications Before and After China's Exit from the Zero-COVID Policy: Retrospective Infoveillance Study. J Med Internet Res 2023;25:e48789. [PMID: 37889532 PMCID: PMC10638631 DOI: 10.2196/48789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 07/27/2023] [Accepted: 09/26/2023] [Indexed: 10/28/2023] Open

Abstract

BACKGROUND

After 3 years of its zero-COVID policy, China lifted its stringent pandemic control measures with the announcement of the 10 new measures on December 7, 2022. Existing estimates suggest 90%-97% of the total population was infected during December. This change created a massive demand for COVID-19 medications and treatments, either modern medicines or traditional Chinese medicine (TCM).

OBJECTIVE

This study aimed to explore (1) how China's exit from the zero-COVID policy impacted media and the public's attention to COVID-19 medications; (2) how social COVID-19 medication discussions were related to existing model estimates of daily cases during that period; (3) what the diversified themes and topics were and how they changed and developed from November 1 to December 31, 2022; and (4) which topics about COVID-19 medications were focused on by mainstream and self-media accounts during the exit. The answers to these questions could help us better understand the consequences of exit strategies and explore the utilities of Sina Weibo data for future infoveillance studies.

METHODS

Using a scrapper for data retrieval and the structural topic modeling (STM) algorithm for analysis, this study built 3 topic models (all data, before a policy change, and after a policy change) of relevant discussions on the Chinese social media platform Weibo. We compared topic distributions against existing estimates of daily cases and between models before and after the change. We also compared proportions of weibos published by mainstream versus self-media accounts over time on different topics.

RESULTS

We found that Weibo discussions shifted sharply from concerns of social risks (case tracking, governmental regulations, etc) to those of personal risks (symptoms, purchases, etc) surrounding COVID-19 infection after the exit from the zero-COVID policy. Weibo topics of "symptom sharing" and "purchase and shortage" of modern medicines correlated more strongly with existing susceptible-exposed-infected-recovered (SEIR) model estimates compared to "TCM formulae" and other topics. During the exit, mainstream accounts showed efforts to specifically engage in topics related to worldwide pandemic control policy comparison and regulations about import and reimbursement of medications.

CONCLUSIONS

The exit from the zero-COVID policy in China was accompanied by a sudden increase in social media discussions about COVID-19 medications, the demand for which substantially increased after the exit. A large proportion of Weibo discussions were emotional and expressed increased risk concerns over medication shortage, unavailability, and delay in delivery. Topic keywords showed that self-medication was sometimes practiced alone or with unprofessional help from others, while mainstream accounts also tried to provide certain medication instructions. Of the 16 topics identified in all 3 STM models, only "symptom sharing" and "purchase and shortage" showed a considerable correlation with SEIR model estimates of daily cases. Future studies could consider topic exploration before conducting predictive infoveillance analysis, even with narrowly defined search criteria with Weibo data.

Collapse

Unlu A, Truong S, Tammi T, Lohiniva AL. Exploring Political Mistrust in Pandemic Risk Communication: Mixed-Method Study Using Social Media Data Analysis. J Med Internet Res 2023;25:e50199. [PMID: 37862088 PMCID: PMC10625074 DOI: 10.2196/50199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 09/08/2023] [Accepted: 09/18/2023] [Indexed: 10/21/2023] Open

Abstract

BACKGROUND

This research extends prior studies by the Finnish Institute for Health and Welfare on pandemic-related risk perception, concentrating on the role of trust in health authorities and its impact on public health outcomes.

OBJECTIVE

The paper aims to investigate variations in trust levels over time and across social media platforms, as well as to further explore 12 subcategories of political mistrust. It seeks to understand the dynamics of political trust, including mistrust accumulation, fluctuations over time, and changes in topic relevance. Additionally, the study aims to compare qualitative research findings with those obtained through computational methods.

METHODS

Data were gathered from a large-scale data set consisting of 13,629 Twitter and Facebook posts from 2020 to 2023 related to COVID-19. For analysis, a fine-tuned FinBERT model with an 80% accuracy rate was used for predicting political mistrust. The BERTopic model was also used for superior topic modeling performance.

RESULTS

Our preliminary analysis identifies 43 mistrust-related topics categorized into 9 major themes. The most salient topics include COVID-19 mortality, coping strategies, polymerase chain reaction testing, and vaccine efficacy. Discourse related to mistrust in authority is associated with perceptions of disease severity, willingness to adopt health measures, and information-seeking behavior. Our findings highlight that the distinct user engagement mechanisms and platform features of Facebook and Twitter contributed to varying patterns of mistrust and susceptibility to misinformation during the pandemic.

CONCLUSIONS

The study highlights the effectiveness of computational methods like natural language processing in managing large-scale engagement and misinformation. It underscores the critical role of trust in health authorities for effective risk communication and public compliance. The findings also emphasize the necessity for transparent communication from authorities, concluding that a holistic approach to public health communication is integral for managing health crises effectively.

Collapse

Lee JW, Song S, Kim Y, Park SB, Han DH. Soccer's AI transformation: deep learning's analysis of soccer's pandemic research evolution. Front Psychol 2023;14:1244404. [PMID: 37908810 PMCID: PMC10613686 DOI: 10.3389/fpsyg.2023.1244404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 09/13/2023] [Indexed: 11/02/2023] Open

Takano R, Matsuo A, Kawano K. Development of a Japanese version of the Awe Experience Scale (AWE-S): A structural topic modeling approach. F1000Res 2023;12:515. [PMID: 37900197 PMCID: PMC10611950 DOI: 10.12688/f1000research.134275.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/10/2023] [Indexed: 10/31/2023] Open

Voskergian D, Bakir-Gungor B, Yousef M. TextNetTopics Pro, a topic model-based text classification for short text by integration of semantic and document-topic distribution information. Front Genet 2023;14:1243874. [PMID: 37867598 PMCID: PMC10585361 DOI: 10.3389/fgene.2023.1243874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 08/30/2023] [Indexed: 10/24/2023] Open

Abstract

With the exponential growth in the daily publication of scientific articles, automatic classification and categorization can assist in assigning articles to a predefined category. Article titles are concise descriptions of the articles' content with valuable information that can be useful in document classification and categorization. However, shortness, data sparseness, limited word occurrences, and the inadequate contextual information of scientific document titles hinder the direct application of conventional text mining and machine learning algorithms on these short texts, making their classification a challenging task. This study firstly explores the performance of our earlier study, TextNetTopics on the short text. Secondly, here we propose an advanced version called TextNetTopics Pro, which is a novel short-text classification framework that utilizes a promising combination of lexical features organized in topics of words and topic distribution extracted by a topic model to alleviate the data-sparseness problem when classifying short texts. We evaluate our proposed approach using nine state-of-the-art short-text topic models on two publicly available datasets of scientific article titles as short-text documents. The first dataset is related to the Biomedical field, and the other one is related to Computer Science publications. Additionally, we comparatively evaluate the predictive performance of the models generated with and without using the abstracts. Finally, we demonstrate the robustness and effectiveness of the proposed approach in handling the imbalanced data, particularly in the classification of Drug-Induced Liver Injury articles as part of the CAMDA challenge. Taking advantage of the semantic information detected by topic models proved to be a reliable way to improve the overall performance of ML classifiers.

Collapse

Chandrasekaran R, Bapat P, Jeripity Venkata P, Moustakas E. Do Patients Assess Physicians Differently in Video Visits as Compared with In-Person Visits? Insights from Text-Mining Online Physician Reviews. Telemed J E Health 2023;29:1557-1565. [PMID: 36847352 DOI: 10.1089/tmj.2022.0507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023] Open

Abstract

Introduction: Use of both in-person and video visits have become a common norm in health care delivery, especially after the COVID-19 pandemic. It is imperative to understand how patients feel about their providers and their experiences during in-person and video visits. This study examines the important factors that patients use in their reviews and differences in the relative importance. Methods: We performed sentiment analysis and topic modeling on online physician reviews from April 2020 to April 2022. Our dataset comprised 34,824 reviews posted by patients after completing in-person or video visits. Results: Sentiment analysis yielded 27,507 (92.69%) positive and 2,168 (7.31%) negative reviews for in-person visits, and 4,610 (89.53%) positive and 539 (10.47%) negative reviews for video visits. Topic modeling identified seven factors patients used in their reviews: Bedside manners, Medical Expertise, Communication, Visit Environment, Scheduling and Follow-up, Wait times, and Costs and insurance. Patients who gave positive reviews after in-person consultations more frequently mentioned communication, office environment and staff, and bedside manners. Those who gave negative reviews after in-person visits mentioned longer wait times, providers' office and staff, medical expertise, and costs and insurance problems. Patients with positive reviews after video visits emphasized communication, bedside manners, and medical expertise. However, patients posting negative reviews after video visits frequently mentioned problems with appointment scheduling and follow-up, medical expertise, wait times, costs and insurance, and technical problems in video visits. Conclusions: This study identified key factors that influence patients' assessment of their providers in in-person and video visits. Paying attention to these factors can help improve the overall patient experience.

Collapse

Lee JW, Han DH. Data Analysis of Psychological Approaches to Soccer Research: Using LDA Topic Modeling. Behav Sci (Basel) 2023;13:787. [PMID: 37887437 PMCID: PMC10604603 DOI: 10.3390/bs13100787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/03/2023] [Accepted: 09/12/2023] [Indexed: 10/28/2023] Open

Abstract

This study identifies the topical areas of research that have attempted a psychological approach to soccer research over the last 33 years (1990-2022) and explored the growth and stagnation of the topic as well as research contributions to soccer development. Data were obtained from 1863 papers from the Web of Science database. The data were collected through keyword text mining and data preprocessing to determine the keywords needed for analysis. Based on the keywords, latent Dirichlet allocation-based topic modeling analysis was performed to analyze the topic distribution of papers and explore research trends by topic area. The topic modeling process included four topic area and fifty topics. The "Coaching Essentials in Football" topic area had the highest frequency, but it was not statistically identified as a trend. However, coaching, including training, is expected to continue to be an important research topic, as it is a key requirement for success in the highly competitive elite football world. Interest in the research field of "Psychological Skills for Performance Development" has waned in recent years. This may be due to the predominance of other subject areas rather than a lack of interest. Various high-tech interventions and problem-solving attempts are being made in this field, providing opportunities for qualitative and quantitative expansion. "Motivation, cognition, and emotion" is a largely underrated subject area in soccer psychology. This could be because survey-based psychological evaluation attempts have decreased as the importance of rapid field application has been emphasized in recent soccer-related studies. However, measuring psychological factors contributes to the study of football psychology through a new methodology and theoretical background. Recognizing the important role of psychological factors in player performance and mental management, as well as presenting new research directions and approaches that can be directly applied to the field, will advance soccer psychology research.

Collapse

Carbonetto P, Luo K, Sarkar A, Hung A, Tayeb K, Pott S, Stephens M. GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership. bioRxiv 2023:2023.03.03.531029. [PMID: 36945441 PMCID: PMC10028846 DOI: 10.1101/2023.03.03.531029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]

Dasgupta P, Amin J, Paris C, MacIntyre CR. News Coverage of Face Masks in Australia During the Early COVID-19 Pandemic: Topic Modeling Study. JMIR Infodemiology 2023;3:e43011. [PMID: 37379362 PMCID: PMC10434701 DOI: 10.2196/43011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 04/20/2023] [Accepted: 05/02/2023] [Indexed: 06/30/2023]

Abstract

BACKGROUND

During the COVID-19 pandemic, web-based media coverage of preventative strategies proliferated substantially. News media was constantly informing people about changes in public health policy and practices such as mask-wearing. Hence, exploring news media content on face mask use is useful to analyze dominant topics and their trends.

OBJECTIVE

The aim of the study was to examine news related to face masks as well as to identify related topics and temporal trends in Australian web-based news media during the early COVID-19 pandemic period.

METHODS

Following data collection from the Google News platform, a trend analysis on the mask-related news titles from Australian news publishers was conducted. Then, a latent Dirichlet allocation topic modeling algorithm was applied along with evaluation matrices (quantitative and qualitative measures). Afterward, topic trends were developed and analyzed in the context of mask use during the pandemic.

RESULTS

A total of 2345 face mask-related eligible news titles were collected from January 25, 2020, to January 25, 2021. Mask-related news showed an increasing trend corresponding to increasing COVID-19 cases in Australia. The best-fitted latent Dirichlet allocation model discovered 8 different topics with a coherence score of 0.66 and a perplexity measure of -11.29. The major topics were T1 (mask-related international affairs), T2 (introducing mask mandate in places such as Melbourne and Sydney), and T4 (antimask sentiment). Topic trends revealed that T2 was the most frequent topic in January 2021 (77 news titles), corresponding to the mandatory mask-wearing policy in Sydney.

CONCLUSIONS

This study demonstrated that Australian news media reflected a wide range of community concerns about face masks, peaking as COVID-19 incidence increased. Harnessing the news media platforms for understanding the media agenda and community concerns may assist in effective health communication during a pandemic response.

Collapse

Goel R, Modhukur V, Täär K, Salumets A, Sharma R, Peters M. Users' Concerns About Endometriosis on Social Media: Sentiment Analysis and Topic Modeling Study. J Med Internet Res 2023;25:e45381. [PMID: 37581905 PMCID: PMC10466158 DOI: 10.2196/45381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 06/14/2023] [Accepted: 07/04/2023] [Indexed: 08/16/2023] Open

Abstract

BACKGROUND

Endometriosis is a debilitating and difficult-to-diagnose gynecological disease. Owing to limited information and awareness, women often rely on social media platforms as a support system to engage in discussions regarding their disease-related concerns.

OBJECTIVE

This study aimed to apply computational techniques to social media posts to identify discussion topics about endometriosis and to identify themes that require more attention from health care professionals and researchers. We also aimed to explore whether, amid the challenging nature of the disease, there are themes within the endometriosis community that gather posts with positive sentiments.

METHODS

We retrospectively extracted posts from the subreddits r/Endo and r/endometriosis from January 2011 to April 2022. We analyzed 45,693 Reddit posts using sentiment analysis and topic modeling-based methods in machine learning.

RESULTS

Since 2011, the number of posts and comments has increased steadily. The posts were categorized into 11 categories, and the highest number of posts were related to either asking for information (Question); sharing the experiences (Rant/Vent); or diagnosing and treating endometriosis, especially surgery (Surgery related). Sentiment analysis revealed that 92.09% (42,077/45,693) of posts were associated with negative sentiments, only 2.3% (1053/45,693) expressed positive feelings, and there were no categories with more positive than negative posts. Topic modeling revealed 27 major topics, and the most popular topics were Surgery, Questions/Advice, Diagnosis, and Pain. The Survey/Research topic, which brought together most research-related posts, was the last in terms of posts.

CONCLUSIONS

Our study shows that posts on social media platforms can provide insights into the concerns of women with endometriosis symptoms. The analysis of the posts confirmed that women with endometriosis have to face negative emotions and pain daily. The large number of posts related to asking questions shows that women do not receive sufficient information from physicians and need community support to cope with the disease. Health care professionals should pay more attention to the symptoms and diagnosis of endometriosis, discuss these topics with patients to reduce their dissatisfaction with doctors, and contribute more to the overall well-being of women with endometriosis. Researchers should also become more involved in social media and share new science-based knowledge regarding endometriosis.

Collapse

Zhang Y, Jiang X, Mentzer AJ, McVean G, Lunter G. Topic modeling identifies novel genetic loci associated with multimorbidities in UK Biobank. Cell Genom 2023;3:100371. [PMID: 37601973 PMCID: PMC10435382 DOI: 10.1016/j.xgen.2023.100371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 05/04/2023] [Accepted: 07/07/2023] [Indexed: 08/22/2023]

Zaidi Z, Ye M, Samon F, Jama A, Gopalakrishnan B, Gu C, Karunasekera S, Evans J, Kashima Y. Topics in Antivax and Provax Discourse: Yearlong Synoptic Study of COVID-19 Vaccine Tweets. J Med Internet Res 2023;25:e45069. [PMID: 37552535 PMCID: PMC10411425 DOI: 10.2196/45069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 05/14/2023] [Accepted: 06/06/2023] [Indexed: 08/09/2023] Open

Abstract

BACKGROUND

Developing an understanding of the public discourse on COVID-19 vaccination on social media is important not only for addressing the ongoing COVID-19 pandemic but also for future pathogen outbreaks. There are various research efforts in this domain, although, a need still exists for a comprehensive topic-wise analysis of tweets in favor of and against COVID-19 vaccines.

OBJECTIVE

This study characterizes the discussion points in favor of and against COVID-19 vaccines posted on Twitter during the first year of the pandemic. The aim of this study was primarily to contrast the views expressed by both camps, their respective activity patterns, and their correlation with vaccine-related events. A further aim was to gauge the genuineness of the concerns expressed in antivax tweets.

METHODS

We examined a Twitter data set containing 75 million English tweets discussing the COVID-19 vaccination from March 2020 to March 2021. We trained a stance detection algorithm using natural language processing techniques to classify tweets as antivax or provax and examined the main topics of discourse using topic modeling techniques.

RESULTS

Provax tweets (37 million) far outnumbered antivax tweets (10 million) and focused mostly on vaccine development, whereas antivax tweets covered a wide range of topics, including opposition to vaccine mandate and concerns about safety. Although some antivax tweets included genuine concerns, there was a large amount of falsehood. Both stances discussed many of the same topics from opposite viewpoints. Memes and jokes were among the most retweeted messages. Most tweets from both stances (9,007,481/10,566,679, 85.24% antivax and 24,463,708/37,044,507, 66.03% provax tweets) came from dual-stance users who posted both provax and antivax tweets during the observation period.

CONCLUSIONS

This study is a comprehensive account of COVID-19 vaccine discourse in the English language on Twitter from March 2020 to March 2021. The broad range of discussion points covered almost the entire conversation, and their temporal dynamics revealed a significant correlation with COVID-19 vaccine-related events. We did not find any evidence of polarization and prevalence of antivax discourse over Twitter. However, targeted countering of falsehoods is important because only a small fraction of antivax discourse touched on a genuine issue. Future research should examine the role of memes and humor in driving web-based social media activity.

Collapse

Haßler J, Wurst AK, Jungblut M, Schlosser K. Influence of the pandemic lockdown on Fridays for Future's hashtag activism. New Media Soc 2023;25:1991-2013. [PMID: 37461409 PMCID: PMC10345819 DOI: 10.1177/14614448211026575] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/20/2023]

Yune SJ, Kim Y, Lee JW. Data Analysis of Physician Competence Research Trend: Social Network Analysis and Topic Modeling Approach. JMIR Med Inform 2023;11:e47934. [PMID: 37467028 PMCID: PMC10398558 DOI: 10.2196/47934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 05/15/2023] [Accepted: 05/16/2023] [Indexed: 07/20/2023] Open

Abstract

BACKGROUND

Studies on competency in medical education often explore the acquisition, performance, and evaluation of particular skills, knowledge, or behaviors that constitute physician competency. As physician competency reflects social demands according to changes in the medical environment, analyzing the research trends of physician competency by period is necessary to derive major research topics for future studies. Therefore, a more macroscopic method is required to analyze the core competencies of physicians in this era.

OBJECTIVE

This study aimed to analyze research trends related to physicians' competency in reflecting social needs according to changes in the medical environment.

METHODS

We used topic modeling to identify potential research topics by analyzing data from studies related to physician competency published between 2011 and 2020. We preprocessed 1354 articles and extracted 272 keywords.

RESULTS

The terms that appeared most frequently in the research related to physician competency since 2010 were knowledge, hospital, family, job, guidelines, management, and communication. The terms that appeared in most studies were education, model, knowledge, and hospital. Topic modeling revealed that the main topics about physician competency included Evidence-based clinical practice, Community-based healthcare, Patient care, Career and self-management, Continuous professional development, and Communication and cooperation. We divided the studies into 4 periods (2011-2013, 2014-2016, 2017-2019, and 2020-2021) and performed a linear regression analysis. The results showed a change in topics by period. The hot topics that have shown increased interest among scholars over time include Community-based healthcare, Career and self-management, and Continuous professional development.

CONCLUSIONS

On the basis of the analysis of research trends, it is predicted that physician professionalism and community-based medicine will continue to be studied in future studies on physician competency.

Collapse

Min S, Han J. Topic Modeling Analysis of Diabetes-Related Health Information during the Coronavirus Disease Pandemic. Healthcare (Basel) 2023;11:1871. [PMID: 37444705 DOI: 10.3390/healthcare11131871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 06/15/2023] [Accepted: 06/22/2023] [Indexed: 07/15/2023] Open

Barnett GA, Calabrese C, Ruiz JB. A comparison of three methods to determine the subject matter in textual data. Front Res Metr Anal 2023;8:1104691. [PMID: 37334104 PMCID: PMC10272525 DOI: 10.3389/frma.2023.1104691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 05/05/2023] [Indexed: 06/20/2023] Open

Fruchart M, Verdier L, Beuscart JB, Lamer A. Publication Dynamics on Social Media During the Orpea Nursing Homes Scandal: A Twitter Analysis. Stud Health Technol Inform 2023;302:502-503. [PMID: 37203735 DOI: 10.3233/shti230191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Moy AJ, Withall J, Hobensack M, Yeji Lee R, Levy DR, Rossetti SC, Rosenbloom ST, Johnson K, Cato K. Eliciting Insights From Chat Logs of the 25X5 Symposium to Reduce Documentation Burden: Novel Application of Topic Modeling. J Med Internet Res 2023;25:e45645. [PMID: 37195741 PMCID: PMC10233429 DOI: 10.2196/45645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 03/03/2023] [Accepted: 03/30/2023] [Indexed: 04/03/2023] Open

Abstract

BACKGROUND

Addressing clinician documentation burden through "targeted solutions" is a growing priority for many organizations ranging from government and academia to industry. Between January and February 2021, the 25 by 5: Symposium to Reduce Documentation Burden on US Clinicians by 75% (25X5 Symposium) convened across 2 weekly 2-hour sessions among experts and stakeholders to generate actionable goals for reducing clinician documentation over the next 5 years. Throughout this web-based symposium, we passively collected attendees' contributions to a chat functionality-with their knowledge that the content would be deidentified and made publicly available. This presented a novel opportunity to synthesize and understand participants' perceptions and interests from chat messages. We performed a content analysis of 25X5 Symposium chat logs to identify themes about reducing clinician documentation burden.

OBJECTIVE

The objective of this study was to explore unstructured chat log content from the web-based 25X5 Symposium to elicit latent insights on clinician documentation burden among clinicians, health care leaders, and other stakeholders using topic modeling.

METHODS

Across the 6 sessions, we captured 1787 messages among 167 unique chat participants cumulatively; 14 were private messages not included in the analysis. We implemented a latent Dirichlet allocation (LDA) topic model on the aggregated dataset to identify clinician documentation burden topics mentioned in the chat logs. Coherence scores and manual examination informed optimal model selection. Next, 5 domain experts independently and qualitatively assigned descriptive labels to model-identified topics and classified them into higher-level categories, which were finalized through a panel consensus.

RESULTS

We uncovered ten topics using the LDA model: (1) determining data and documentation needs (422/1773, 23.8%); (2) collectively reassessing documentation requirements in electronic health records (EHRs) (252/1773, 14.2%); (3) focusing documentation on patient narrative (162/1773, 9.1%); (4) documentation that adds value (147/1773, 8.3%); (5) regulatory impact on clinician burden (142/1773, 8%); (6) improved EHR user interface and design (128/1773, 7.2%); (7) addressing poor usability (122/1773, 6.9%); (8) sharing 25X5 Symposium resources (122/1773, 6.9%); (9) capturing data related to clinician practice (113/1773, 6.4%); and (10) the role of quality measures and technology in burnout (110/1773, 6.2%). Among these 10 topics, 5 high-level categories emerged: consensus building (821/1773, 46.3%), burden sources (365/1773, 20.6%), EHR design (250/1773, 14.1%), patient-centered care (162/1773, 9.1%), and symposium comments (122/1773, 6.9%).

CONCLUSIONS

We conducted a topic modeling analysis on 25X5 Symposium multiparticipant chat logs to explore the feasibility of this novel application and elicit additional insights on clinician documentation burden among attendees. Based on the results of our LDA analysis, consensus building, burden sources, EHR design, and patient-centered care may be important themes to consider when addressing clinician documentation burden. Our findings demonstrate the value of topic modeling in discovering topics associated with clinician documentation burden using unstructured textual content. Topic modeling may be a suitable approach to examine latent themes presented in web-based symposium chat logs.

Collapse

Verza M, Camanzi L, Rota C, Cerjak M, Mulazzani L, Malorgio G. Consumer Sentiments and Emotions in New Seafood Product Concept Development: A Co-Creation Approach Using Online Discussion Rooms in Croatia, Italy and Spain. Foods 2023;12:foods12081729. [PMID: 37107524 PMCID: PMC10138044 DOI: 10.3390/foods12081729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 04/04/2023] [Accepted: 04/19/2023] [Indexed: 04/29/2023] Open

Wu X, Li Z, Xu L, Li P, Liu M, Huang C. COVID-19 Vaccine-Related Information on the WeChat Public Platform: Topic Modeling and Content Analysis. J Med Internet Res 2023;25:e45051. [PMID: 37058349 PMCID: PMC10132036 DOI: 10.2196/45051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 03/04/2023] [Accepted: 03/23/2023] [Indexed: 04/15/2023] Open

Abstract

BACKGROUND

The COVID-19 vaccine is an effective tool in the fight against the COVID-19 outbreak. As the main channel of information dissemination in the context of the epidemic, social media influences public trust and acceptance of the vaccine. The rational application of health behavior theory is a guarantee of effective public health information dissemination. However, little is known about the application of health behavior theory in web-based COVID-19 vaccine messages, especially from Chinese social media posts.

OBJECTIVE

This study aimed to understand the main topics and communication characteristics of hot papers related to COVID-19 vaccine on the WeChat platform and assess the health behavior theory application with the aid of health belief model (HBM).

METHODS

A systematic search was conducted on the Chinese social media platform WeChat to identify COVID-19 vaccine-related papers. A coding scheme was established based on the HBM, and the sample was managed and coded using NVivo 12 (QSR International) to assess the application of health behavior theory. The main topics of the papers were extracted through the Latent Dirichlet Allocation algorithm. Finally, temporal analysis was used to explore trends in the evolution of themes and health belief structures in the papers.

RESULTS

A total of 757 papers were analyzed. Almost all (671/757, 89%) of the papers did not have an original logo. By topic modeling, 5 topics were identified, which were vaccine development and effectiveness (267/757, 35%), disease infection and protection (197/757, 26%), vaccine safety and adverse reactions (52/757, 7%), vaccine access (136/757, 18%), and vaccination science popularization (105/757, 14%). All papers identified at least one structure in the extended HBM, but only 29 papers included all of the structures. Descriptions of solutions to obstacles (585/757, 77%) and benefit (468/757, 62%) were the most emphasized components in all samples. Relatively few elements of susceptibility (208/757, 27%) and the least were descriptions of severity (135/757, 18%). Heat map visualization revealed the change in health belief structure before and after vaccine entry into the market.

CONCLUSIONS

To the best of our knowledge, this is the first study to assess the structural expression of health beliefs in information related to the COVID-19 vaccine on the WeChat public platform based on an HBM. The study also identified topics and communication characteristics before and after the market entry of vaccines. Our findings can inform customized education and communication strategies to promote vaccination not only in this pandemic but also in future pandemics.

Collapse