1
|
Zhang Z, Hua Y, Zhou P, Lin S, Li M, Zhang Y, Zhou L, Liao Y, Yang J. Sexual and Gender-Diverse Individuals Face More Health Challenges during COVID-19: A Large-Scale Social Media Analysis with Natural Language Processing. HEALTH DATA SCIENCE 2024; 4:0127. [PMID: 39247070 PMCID: PMC11378377 DOI: 10.34133/hds.0127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 07/25/2024] [Indexed: 09/10/2024]
Abstract
Background: The COVID-19 pandemic has caused a disproportionate impact on the sexual and gender-diverse (SGD) community. Compared with non-SGD populations, their social relations and health status are more vulnerable, whereas public health data regarding SGD are scarce. Methods: To analyze the concerns and health status of SGD individuals, this cohort study leveraged 471,371,477 tweets from 251,455 SGD and 22,644,411 non-SGD users, spanning from 2020 February 1 to 2022 April 30. The outcome measures comprised the distribution and dynamics of COVID-related topics, attitudes toward vaccines, and the prevalence of symptoms. Results: Topic analysis revealed that SGD users engaged more frequently in discussions related to "friends and family" (20.5% vs. 13.1%, P < 0.001) and "wear masks" (10.1% vs. 8.3%, P < 0.001) compared to non-SGD users. Additionally, SGD users exhibited a marked higher proportion of positive sentiment in tweets about vaccines, including Moderna, Pfizer, AstraZeneca, and Johnson & Johnson. Among 102,464 users who self-reported COVID-19 diagnoses, SGD users disclosed significantly higher frequencies of mentioning 61 out of 69 COVID-related symptoms than non-SGD users, encompassing both physical and mental health challenges. Conclusion: The results provide insights into an understanding of the unique needs and experiences of the SGD community during the pandemic, emphasizing the value of social media data in epidemiological and public health research.
Collapse
Affiliation(s)
- Zhiyun Zhang
- Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, China
| | - Yining Hua
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Peilin Zhou
- Thrust of Data Science and Analytics, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China
| | - Shixu Lin
- Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, China
| | - Minghui Li
- Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, China
| | - Yujie Zhang
- Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, China
| | - Li Zhou
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Yanhui Liao
- Department of Psychiatry, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Jie Yang
- Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, China
| |
Collapse
|
2
|
Hua Y, Wu J, Lin S, Li M, Zhang Y, Foer D, Wang S, Zhou P, Yang J, Zhou L. Streamlining social media information retrieval for public health research with deep learning. J Am Med Inform Assoc 2024; 31:1569-1577. [PMID: 38718216 PMCID: PMC11187427 DOI: 10.1093/jamia/ocae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/28/2024] [Accepted: 05/07/2024] [Indexed: 06/21/2024] Open
Abstract
OBJECTIVE Social media-based public health research is crucial for epidemic surveillance, but most studies identify relevant corpora with keyword-matching. This study develops a system to streamline the process of curating colloquial medical dictionaries. We demonstrate the pipeline by curating a Unified Medical Language System (UMLS)-colloquial symptom dictionary from COVID-19-related tweets as proof of concept. METHODS COVID-19-related tweets from February 1, 2020, to April 30, 2022 were used. The pipeline includes three modules: a named entity recognition module to detect symptoms in tweets; an entity normalization module to aggregate detected entities; and a mapping module that iteratively maps entities to Unified Medical Language System concepts. A random 500 entity samples were drawn from the final dictionary for accuracy validation. Additionally, we conducted a symptom frequency distribution analysis to compare our dictionary to a pre-defined lexicon from previous research. RESULTS We identified 498 480 unique symptom entity expressions from the tweets. Pre-processing reduces the number to 18 226. The final dictionary contains 38 175 unique expressions of symptoms that can be mapped to 966 UMLS concepts (accuracy = 95%). Symptom distribution analysis found that our dictionary detects more symptoms and is effective at identifying psychiatric disorders like anxiety and depression, often missed by pre-defined lexicons. CONCLUSIONS This study advances public health research by implementing a novel, systematic pipeline for curating symptom lexicons from social media data. The final lexicon's high accuracy, validated by medical professionals, underscores the potential of this methodology to reliably interpret, and categorize vast amounts of unstructured social media data into actionable medical insights across diverse linguistic and regional landscapes.
Collapse
Affiliation(s)
- Yining Hua
- Department of Epidemiology, Harvard Chan School of Public Health, Boston, MA 02115, United States
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, United States
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02145, United States
| | - Jiageng Wu
- School of Public Health, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310058, China
| | - Shixu Lin
- School of Public Health, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310058, China
| | - Minghui Li
- School of Public Health, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310058, China
| | - Yujie Zhang
- School of Public Health, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310058, China
| | - Dinah Foer
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02145, United States
| | - Siwen Wang
- Department of Epidemiology, Harvard Chan School of Public Health, Boston, MA 02115, United States
| | - Peilin Zhou
- Thrust of Data Science and Analytics, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, Guangdong, 511458, China
| | - Jie Yang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02120, United States
| | - Li Zhou
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02145, United States
| |
Collapse
|
3
|
Elmer T. Computational social science is growing up: why puberty consists of embracing measurement validation, theory development, and open science practices. EPJ DATA SCIENCE 2023; 12:58. [PMID: 38098785 PMCID: PMC10716103 DOI: 10.1140/epjds/s13688-023-00434-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 11/30/2023] [Indexed: 12/17/2023]
Abstract
Puberty is a phase in which individuals often test the boundaries of themselves and surrounding others and further define their identity - and thus their uniqueness compared to other individuals. Similarly, as Computational Social Science (CSS) grows up, it must strike a balance between its own practices and those of neighboring disciplines to achieve scientific rigor and refine its identity. However, there are certain areas within CSS that are reluctant to adopt rigorous scientific practices from other fields, which can be observed through an overreliance on passively collected data (e.g., through digital traces, wearables) without questioning the validity of such data. This paper argues that CSS should embrace the potential of combining both passive and active measurement practices to capitalize on the strengths of each approach, including objectivity and psychological quality. Additionally, the paper suggests that CSS would benefit from integrating practices and knowledge from other established disciplines, such as measurement validation, theoretical embedding, and open science practices. Based on this argument, the paper provides ten recommendations for CSS to mature as an interdisciplinary field of research.
Collapse
Affiliation(s)
- Timon Elmer
- Department of Psychology, Applied Social and Health Psychology, University of Zurich, Binzmühlestrasse 14/14, 8050 Zurich, Switzerland
| |
Collapse
|
4
|
Waters AR, Turner C, Easterly CW, Tovar I, Mulvaney M, Poquadeck M, Johnston H, Ghazal LV, Rains SA, Cloyes KG, Kirchhoff AC, Warner EL. Exploring Online Crowdfunding for Cancer-Related Costs Among LGBTQ+ (Lesbian, Gay, Bisexual, Transgender, Queer, Plus) Cancer Survivors: Integration of Community-Engaged and Technology-Based Methodologies. JMIR Cancer 2023; 9:e51605. [PMID: 37902829 PMCID: PMC10644187 DOI: 10.2196/51605] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 09/14/2023] [Accepted: 09/22/2023] [Indexed: 10/31/2023] Open
Abstract
BACKGROUND Cancer survivors frequently experience cancer-related financial burdens. The extent to which Lesbian, Gay, Bisexual, Transgender, Queer, Plus (LGBTQ+) populations experience cancer-related cost-coping behaviors such as crowdfunding is largely unknown, owing to a lack of sexual orientation and gender identity data collection and social stigma. Web-scraping has previously been used to evaluate inequities in online crowdfunding, but these methods alone do not adequately engage populations facing inequities. OBJECTIVE We describe the methodological process of integrating technology-based and community-engaged methods to explore the financial burden of cancer among LGBTQ+ individuals via online crowdfunding. METHODS To center the LGBTQ+ community, we followed community engagement guidelines by forming a study advisory board (SAB) of LGBTQ+ cancer survivors, caregivers, and professionals who were involved in every step of the research. SAB member engagement was tracked through quarterly SAB meeting attendance and an engagement survey. We then used web-scraping methods to extract a data set of online crowdfunding campaigns. The study team followed an integrated technology-based and community-engaged process to develop and refine term dictionaries for analyses. Term dictionaries were developed and refined in order to identify crowdfunding campaigns that were cancer- and LGBTQ+-related. RESULTS Advisory board engagement was high according to metrics of meeting attendance, meeting participation, and anonymous board feedback. In collaboration with the SAB, the term dictionaries were iteratively edited and refined. The LGBTQ+ term dictionary was developed by the study team, while the cancer term dictionary was refined from an existing dictionary. The advisory board and analytic team members manually coded against the term dictionary and performed quality checks until high confidence in correct classification was achieved using pairwise agreement. Through each phase of manual coding and quality checks, the advisory board identified more misclassified campaigns than the analytic team alone. When refining the LGBTQ+ term dictionary, the analytic team identified 11.8% misclassification while the SAB identified 20.7% misclassification. Once each term dictionary was finalized, the LGBTQ+ term dictionary resulted in a 95% pairwise agreement, while the cancer term dictionary resulted in an 89.2% pairwise agreement. CONCLUSIONS The classification tools developed by integrating community-engaged and technology-based methods were more accurate because of the equity-based approach of centering LGBTQ+ voices and their lived experiences. This exemplar suggests integrating community-engaged and technology-based methods to study inequities is highly feasible and has applications beyond LGBTQ+ financial burden research.
Collapse
Affiliation(s)
- Austin R Waters
- Department of Health Policy and Management, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, United States
- Cancer Control and Population Sciences, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
| | - Cindy Turner
- Cancer Control and Population Sciences, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- College of Nursing, University of Utah, Salt Lake City, UT, United States
| | - Caleb W Easterly
- Department of Health Policy and Management, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, United States
| | - Ida Tovar
- Cancer Control and Population Sciences, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- College of Nursing, University of Utah, Salt Lake City, UT, United States
| | - Megan Mulvaney
- Crowdfunding Cancer Costs LGBT Study Advisory Board, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- School of Public Health, Indiana University Bloomington, Bloomington, IN, United States
| | - Matt Poquadeck
- Crowdfunding Cancer Costs LGBT Study Advisory Board, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- Wilmot Cancer Institute, University of Rochester Medical Center, Rochester, NY, United States
| | - Hailey Johnston
- Crowdfunding Cancer Costs LGBT Study Advisory Board, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
| | - Lauren V Ghazal
- Crowdfunding Cancer Costs LGBT Study Advisory Board, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- School of Nursing, University of Rochester, Rochester, NY, United States
| | - Stephen A Rains
- Department of Communication, University of Arizona, Tucson, AZ, United States
| | - Kristin G Cloyes
- School of Nursing, Oregon Health & Science University, Portland, OR, United States
| | - Anne C Kirchhoff
- Cancer Control and Population Sciences, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, UT, United States
| | - Echo L Warner
- Cancer Control and Population Sciences, Huntsman Cancer Institute at the University of Utah, Salt Lake City, UT, United States
- College of Nursing, University of Utah, Salt Lake City, UT, United States
| |
Collapse
|
5
|
Unlu A, Truong S, Tammi T, Lohiniva AL. Exploring Political Mistrust in Pandemic Risk Communication: Mixed-Method Study Using Social Media Data Analysis. J Med Internet Res 2023; 25:e50199. [PMID: 37862088 PMCID: PMC10625074 DOI: 10.2196/50199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 09/08/2023] [Accepted: 09/18/2023] [Indexed: 10/21/2023] Open
Abstract
BACKGROUND This research extends prior studies by the Finnish Institute for Health and Welfare on pandemic-related risk perception, concentrating on the role of trust in health authorities and its impact on public health outcomes. OBJECTIVE The paper aims to investigate variations in trust levels over time and across social media platforms, as well as to further explore 12 subcategories of political mistrust. It seeks to understand the dynamics of political trust, including mistrust accumulation, fluctuations over time, and changes in topic relevance. Additionally, the study aims to compare qualitative research findings with those obtained through computational methods. METHODS Data were gathered from a large-scale data set consisting of 13,629 Twitter and Facebook posts from 2020 to 2023 related to COVID-19. For analysis, a fine-tuned FinBERT model with an 80% accuracy rate was used for predicting political mistrust. The BERTopic model was also used for superior topic modeling performance. RESULTS Our preliminary analysis identifies 43 mistrust-related topics categorized into 9 major themes. The most salient topics include COVID-19 mortality, coping strategies, polymerase chain reaction testing, and vaccine efficacy. Discourse related to mistrust in authority is associated with perceptions of disease severity, willingness to adopt health measures, and information-seeking behavior. Our findings highlight that the distinct user engagement mechanisms and platform features of Facebook and Twitter contributed to varying patterns of mistrust and susceptibility to misinformation during the pandemic. CONCLUSIONS The study highlights the effectiveness of computational methods like natural language processing in managing large-scale engagement and misinformation. It underscores the critical role of trust in health authorities for effective risk communication and public compliance. The findings also emphasize the necessity for transparent communication from authorities, concluding that a holistic approach to public health communication is integral for managing health crises effectively.
Collapse
Affiliation(s)
- Ali Unlu
- Finnish Institute for Health and Welfare, Helsinki, Finland
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Sophie Truong
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Tuukka Tammi
- Finnish Institute for Health and Welfare, Helsinki, Finland
| | | |
Collapse
|
6
|
Ng YMM. A Cross-National Study of Fear Appeal Messages in YouTube Trending
Videos About COVID-19. THE AMERICAN BEHAVIORAL SCIENTIST 2023:00027642231155363. [PMCID: PMC9947390 DOI: 10.1177/00027642231155363] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
The COVID-19 pandemic has underlined the need for investigating the prevalence and nature of health communication on social media. Applying the Extended Parallel Process Model, this study analyzes the use of fear appeals in 2,152 YouTube trending videos across six countries (the United States, Brazil, Russia, Taiwan, Canada, and New Zealand) from January to May 2020. The findings reveal that, during the early stage of the outbreak, COVID-19-themed videos gained early attention in Taiwan but encountered a prolonged delay in the United States and Brazil. Specifically, COVID-19 videos featured the least in Brazil’s trending list. The results from a supervised machine learning coding approach further suggest that videos’ threat levels exceeded efficacy beliefs across all countries. This imbalance of threat–efficacy messages was most significant in hard-hit countries Brazil and Russia, which social media may run the risk of feeding fear to the public agenda. These findings alert content creators and social media platforms to create a threat–efficacy equilibrium, prioritizing content that promotes a sense of self- and community efficacy and increases people’s belief that effective protective actions are available.
Collapse
Affiliation(s)
- Yee Man Margaret Ng
- Department of Journalism and Institute
of Communications Research, University of Illinois at Urbana-Champaign, Urbana, IL,
USA
| |
Collapse
|
7
|
Fanelli S, Pratici L, Salvatore FP, Donelli CC, Zangrandi A. Big data analysis for decision-making processes: challenges and opportunities for the management of health-care organizations. MANAGEMENT RESEARCH REVIEW 2022. [DOI: 10.1108/mrr-09-2021-0648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
This study aims to provide a picture of the current state of art in the use of big data for decision-making processes for the management of health-care organizations.
Design/methodology/approach
A systematic literature review was carried out. The research uses two analyses: descriptive analysis, describing the evolution of citations; keywords; and the ten most influential papers, and bibliometric analysis, for content evaluation, for which a cluster analysis was performed.
Findings
A total of 48 articles were selected for bibliographic coupling out of an initial sample of more than 5,000 papers. Of the 48 articles, 29 are linked on the basis of their bibliography. Clustering the 29 articles on the basis of actual content, four research areas emerged: quality of care, quality of service, crisis management and data management.
Originality/value
Health-care organizations believe strongly that big data can become the most effective tool for correctly influencing the decision-making processes. Thus, more and more organizations continue to invest in big data analytics, and the literature on this topic has expanded rapidly. This study seeks to provide a comprehensive picture of the different streams of literature existing, together with gaps in research and future perspectives. The literature is mature enough for an analysis to be made and provide managers with useful insights on opportunities, criticisms and perspectives on the use of big data for health-care organizations. However, to date, there is no comprehensive literature review on the big data analysis in health care. Furthermore, as big data is a “sexy catchphrase,” more clarity on its usage may be needed. It represents an important tool to be investigated and its great potential is often yet to be discovered. This study thus sheds light on emerging issues and suggests further research that may be needed.
Collapse
|
8
|
Luo C, Chen A, Cui B, Liao W. Exploring public perceptions of the COVID-19 vaccine online from a cultural perspective: Semantic network analysis of two social media platforms in the United States and China. TELEMATICS AND INFORMATICS 2021; 65:101712. [PMID: 34887618 PMCID: PMC8429027 DOI: 10.1016/j.tele.2021.101712] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Revised: 07/28/2021] [Accepted: 08/30/2021] [Indexed: 01/14/2023]
Abstract
The development and uptake of the COVID-19 (coronavirus disease 2019) vaccine is a top priority in stifling the COVID-19 pandemic. How the public perceives the COVID-19 vaccine is directly associated with vaccine compliance and vaccination coverage. This study takes a cultural sensitivity perspective and adopts two well-known social media platforms in the United States (Twitter) and China (Weibo) to conduct a public perception comparison around the COVID-19 vaccine. By implementing semantic network analysis, results demonstrate that the two countries' social media users overlapped in themes concerning domestic vaccination policies, priority groups, challenges from COVID-19 variants, and the global pandemic situation. However, Twitter users were prone to disclose individual vaccination experiences, express anti-vaccine attitudes. In comparison, Weibo users manifested evident deference to authorities and exhibited more positive feelings toward the COVID-19 vaccine. Those disparities were explained by the cultural characteristics' differences between the two countries. The findings provide insights into comprehending public health issues in cross-cultural contexts and illustrate the potential of utilizing social media to conduct health informatics studies and investigate public perceptions during public health crisis time.
Collapse
Affiliation(s)
- Chen Luo
- School of Journalism and Communication, Tsinghua University, Haidian District, Beijing, China
| | - Anfan Chen
- School of Journalism and Communication, Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Botao Cui
- New China Asset Management Company, Chaoyang District, Beijing, China
| | - Wang Liao
- Department of Communication, University of California, Davis, CA, United States
| |
Collapse
|
9
|
Luo C, Ji K, Tang Y, Du Z. Exploring the Expression Differences Between Professionals and Laypeople Toward the COVID-19 Vaccine: Text Mining Approach. J Med Internet Res 2021; 23:e30715. [PMID: 34346885 PMCID: PMC8404777 DOI: 10.2196/30715] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 06/20/2021] [Accepted: 08/01/2021] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND COVID-19 is still rampant all over the world. Until now, the COVID-19 vaccine is the most promising measure to subdue contagion and achieve herd immunity. However, public vaccination intention is suboptimal. A clear division lies between medical professionals and laypeople. While most professionals eagerly promote the vaccination campaign, some laypeople exude suspicion, hesitancy, and even opposition toward COVID-19 vaccines. OBJECTIVE This study aims to employ a text mining approach to examine expression differences and thematic disparities between the professionals and laypeople within the COVID-19 vaccine context. METHODS We collected 3196 answers under 65 filtered questions concerning the COVID-19 vaccine from the China-based question and answer forum Zhihu. The questions were classified into 5 categories depending on their contents and description: adverse reactions, vaccination, vaccine effectiveness, social implications of vaccine, and vaccine development. Respondents were also manually coded into two groups: professional and laypeople. Automated text analysis was performed to calculate fundamental expression characteristics of the 2 groups, including answer length, attitude distribution, and high-frequency words. Furthermore, structural topic modeling (STM), as a cutting-edge branch in the topic modeling family, was used to extract topics under each question category, and thematic disparities were evaluated between the 2 groups. RESULTS Laypeople are more prevailing in the COVID-19 vaccine-related discussion. Regarding differences in expression characteristics, the professionals posted longer answers and showed a conservative stance toward vaccine effectiveness than did laypeople. Laypeople mentioned countries more frequently, while professionals were inclined to raise medical jargon. STM discloses prominent topics under each question category. Statistical analysis revealed that laypeople preferred the "safety of Chinese-made vaccine" topic and other vaccine-related issues in other countries. However, the professionals paid more attention to medical principles and professional standards underlying the COVID-19 vaccine. With respect to topics associated with the social implications of vaccines, the 2 groups showed no significant difference. CONCLUSIONS Our findings indicate that laypeople and professionals share some common grounds but also hold divergent focuses toward the COVID-19 vaccine issue. These incongruities can be summarized as "qualitatively different" in perspective rather than "quantitatively different" in scientific knowledge. Among those questions closely associated with medical expertise, the "qualitatively different" characteristic is quite conspicuous. This study boosts the current understanding of how the public perceives the COVID-19 vaccine, in a more nuanced way. Web-based question and answer forums are a bonanza for examining perception discrepancies among various identities. STM further exhibits unique strengths over the traditional topic modeling method in statistically testing the topic preference of diverse groups. Public health practitioners should be keenly aware of the cognitive differences between professionals and laypeople, and pay special attention to the topics with significant inconsistency across groups to build consensus and promote vaccination effectively.
Collapse
Affiliation(s)
- Chen Luo
- School of Journalism and Communication, Tsinghua University, Beijing, China
- The Faculty of International Media, Communication University of China, Beijing, China
| | - Kaiyuan Ji
- School of Journalism and Communication, Tsinghua University, Beijing, China
| | - Yulong Tang
- Institute of Communication Studies, Communication University of China, Beijing, China
| | - Zhiyuan Du
- School of Journalism and Communication, Tsinghua University, Beijing, China
| |
Collapse
|
10
|
Abstract
In this study, we examined the activities of automated social media accounts or bots that tweet or retweet referencing #COVID-19 and #COVID19. From a total sample of over 50 million tweets, we used a mixed method to extract more than 185,000 messages posted by 127 bots. Our findings show that the majority of these bots tweet, retweet and mention mainstream media outlets, promote health protection and telemedicine, and disseminate breaking news on the number of casualties and deaths caused by COVID-19. We argue that some of these bots are motivated by financial incentives, while other bots actively support the survivalist movement by emphasizing the need to prepare for the pandemic and learn survival skills. We only found a few bots that showed some suspicious activity probably due to the fact that our dataset was limited to two hashtags often used by official health bodies and academic communities.
Collapse
|