1
|
Sasse K, Mahabir R, Gkountouna O, Crooks A, Croitoru A. Understanding the determinants of vaccine hesitancy in the United States: A comparison of social surveys and social media. PLoS One 2024; 19:e0301488. [PMID: 38843170 PMCID: PMC11156396 DOI: 10.1371/journal.pone.0301488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 03/12/2024] [Indexed: 06/09/2024] Open
Abstract
The COVID-19 pandemic prompted governments worldwide to implement a range of containment measures, including mass gathering restrictions, social distancing, and school closures. Despite these efforts, vaccines continue to be the safest and most effective means of combating such viruses. Yet, vaccine hesitancy persists, posing a significant public health concern, particularly with the emergence of new COVID-19 variants. To effectively address this issue, timely data is crucial for understanding the various factors contributing to vaccine hesitancy. While previous research has largely relied on traditional surveys for this information, recent sources of data, such as social media, have gained attention. However, the potential of social media data as a reliable proxy for information on population hesitancy, especially when compared with survey data, remains underexplored. This paper aims to bridge this gap. Our approach uses social, demographic, and economic data to predict vaccine hesitancy levels in the ten most populous US metropolitan areas. We employ machine learning algorithms to compare a set of baseline models that contain only these variables with models that incorporate survey data and social media data separately. Our results show that XGBoost algorithm consistently outperforms Random Forest and Linear Regression, with marginal differences between Random Forest and XGBoost. This was especially the case with models that incorporate survey or social media data, thus highlighting the promise of the latter data as a complementary information source. Results also reveal variations in influential variables across the five hesitancy classes, such as age, ethnicity, occupation, and political inclination. Further, the application of models to different MSAs yields mixed results, emphasizing the uniqueness of communities and the need for complementary data approaches. In summary, this study underscores social media data's potential for understanding vaccine hesitancy, emphasizes the importance of tailoring interventions to specific communities, and suggests the value of combining different data sources.
Collapse
Affiliation(s)
- Kuleen Sasse
- Department of Computer Science, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Ron Mahabir
- Geographic Data Science Lab, Department of Geography and Planning, University of Liverpool, Liverpool, United Kingdom
| | - Olga Gkountouna
- Geographic Data Science Lab, Department of Geography and Planning, University of Liverpool, Liverpool, United Kingdom
| | - Andrew Crooks
- Department of Geography, University at Buffalo, Buffalo, New York, United States of America
| | - Arie Croitoru
- Department of Computational and Data Sciences, George Mason University, Fairfax, Virginia, United States of America
| |
Collapse
|
2
|
Biswas MR, Shah Z. Extracting factors associated with vaccination from Twitter data and mapping to behavioral models. Hum Vaccin Immunother 2023; 19:2281729. [PMID: 38013461 PMCID: PMC10760324 DOI: 10.1080/21645515.2023.2281729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 11/05/2023] [Indexed: 11/29/2023] Open
Abstract
Social media platform, particularly Twitter, is a rich data source that allows monitoring of public opinions and attitudes toward vaccines.Established behavioral models like the 5C psychological antecedents model and the Health Belief Model (HBM) provide a well-structured framework for analyzing shifts in vaccine-related behavior. This study examines if the extracted data from Twitter contains valuable insights regarding public attitudes toward vaccines and can be mapped to two behavioral models. This study focuses on the Arab population, and a search was carried out on Twitter using: ' تلقيحي OR تطعيم OR تطعيمات OR لقاح OR لقاحات' for two years from January 2020 to January 2022. Then, BERTopicmodeling was applied, and several topics were extracted. Finally, the topics were manually mapped to the factors of the 5C model and HBM. 1,068,466 unique users posted 3,368,258 vaccine-related tweets in Arabic. Topic modeling generated 25 topics, which were mapped to the 15 factors of the 5C model and HBM. Among the users, 32.87%were male, and 18.06% were female. A significant 55.77% of the users were from the MENA (Middle East and North Africa) region. Twitter users were more inclined to accept vaccines when they trusted vaccine safety and effectiveness, but vaccine hesitancy increased due to conspiracy theories and misinformation. The association of topics with these theoretical frameworks reveals the availability and diversity of Twitter data that can predict behavioral change toward vaccines. It allows the preparation of timely and effective interventions for vaccination programs compared to traditional methods.
Collapse
Affiliation(s)
- Md. Rafiul Biswas
- Division of Information and Computing Technology, College of Science and Engineering, Hamad Bin Khalifa University, Qatar Foundation, Doha, Qatar
| | - Zubair Shah
- Division of Information and Computing Technology, College of Science and Engineering, Hamad Bin Khalifa University, Qatar Foundation, Doha, Qatar
| |
Collapse
|
3
|
Qorib M, Oladunni T, Denis M, Ososanya E, Cotae P. Covid-19 vaccine hesitancy: Text mining, sentiment analysis and machine learning on COVID-19 vaccination Twitter dataset. EXPERT SYSTEMS WITH APPLICATIONS 2023; 212:118715. [PMID: 36092862 PMCID: PMC9443617 DOI: 10.1016/j.eswa.2022.118715] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Revised: 07/14/2022] [Accepted: 08/26/2022] [Indexed: 05/20/2023]
Abstract
In 2019 there was an outbreak of coronavirus pandemic also known as COVID-19. Many scientists believe that the pandemic originated from Wuhan, China, before spreading to other parts of the globe. To reduce the spread of the disease, decision makers encouraged measures such as hand washing, face masking, and social distancing. In early 2021, some countries including the United States began administering COVID-19 vaccines. Vaccination brought a relief to the public; it also generated a lot of debates from anti-vaccine and pro-vaccine groups. The controversy and debate surrounding COVID-19 vaccine influenced the decision of several people in either to accept or reject vaccination. Because of data limitations, social media data, collected through live streaming public tweets using an Application Programming Interface (API) search, is considered a viable and reliable resource to study the opinion of the public on Covid-19 vaccine hesitancy. Thus, this study examines 3 sentiment computation methods (Azure Machine Learning, VADER, and TextBlob) to analyze COVID-19 vaccine hesitancy. Five learning algorithms (Random Forest, Logistics Regression, Decision Tree, LinearSVC, and Naïve Bayes) with different combination of three vectorization methods (Doc2Vec, CountVectorizer, and TF-IDF) were deployed. Vocabulary normalization was threefold; potter stemming, lemmatization, and potter stemming with lemmatization. For each vocabulary normalization strategy, we designed, developed, and evaluated 42 models. The study shows that Covid-19 vaccine hesitancy slowly decreases over time; suggesting that the public gradually feels warm and optimistic about COVID-19 vaccination. Moreover, combining potter stemming and lemmatization increased model performances. Finally, the result of our experiment shows that TextBlob + TF-IDF + LinearSVC has the best performance in classifying public sentiment into positive, neutral, or negative with an accuracy, precision, recall and F1 score of 0.96752, 0.96921, 0.92807 and 0.94702 respectively. It means that the best performance was achieved when using TextBlob sentiment score, with TF-IDF vectorization and LinearSVC classification model. We also found out that combining two vectorizations (CountVectorizer and TF-IDF) decreases model accuracy.
Collapse
Affiliation(s)
- Miftahul Qorib
- Department of Computer Science and Information Technology, University of the District of Columbia, Washington, DC, United States
| | - Timothy Oladunni
- Department of Computer Science, Morgan State University, Baltimore, MD, United States
| | - Max Denis
- Department of Mechanical and Biomedical Engineering, University of the District of Columbia, Washington, DC, United States
| | - Esther Ososanya
- Department of Electrical and Computer Engineering, University of the District of Columbia, Washington, DC, United States
| | - Paul Cotae
- Department of Electrical and Computer Engineering, University of the District of Columbia, Washington, DC, United States
| |
Collapse
|
4
|
Cascini F, Pantovic A, Al-Ajlouni YA, Failla G, Puleo V, Melnyk A, Lontano A, Ricciardi W. Social media and attitudes towards a COVID-19 vaccination: A systematic review of the literature. EClinicalMedicine 2022; 48:101454. [PMID: 35611343 PMCID: PMC9120591 DOI: 10.1016/j.eclinm.2022.101454] [Citation(s) in RCA: 88] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 04/24/2022] [Accepted: 04/27/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Vaccine hesitancy continues to limit global efforts in combatting the COVID-19 pandemic. Emerging research demonstrates the role of social media in disseminating information and potentially influencing people's attitudes towards public health campaigns. This systematic review sought to synthesize the current evidence regarding the potential role of social media in shaping COVID-19 vaccination attitudes, and to explore its potential for shaping public health interventions to address the issue of vaccine hesitancy. METHODS We performed a systematic review of the studies published from inception to 13 of March2022 by searching PubMed, Web of Science, Embase, PsychNET, Scopus, CINAHL, and MEDLINE. Studies that reported outcomes related to coronavirus disease 2019 (COVID-19) vaccine (attitudes, opinion, etc.) gathered from the social media platforms, and those analyzing the relationship between social media use and COVID-19 hesitancy/acceptance were included. Studies that reported no outcome of interest or analyzed data from sources other than social media (websites, newspapers, etc.) will be excluded. The Newcastle Ottawa Scale (NOS) was used to assess the quality of all cross-sectional studies included in this review. This study is registered with PROSPERO (CRD42021283219). FINDINGS Of the 2539 records identified, a total of 156 articles fully met the inclusion criteria. Overall, the quality of the cross-sectional studies was moderate - 2 studies received 10 stars, 5 studies received 9 stars, 9 studies were evaluated with 8, 12 studies with 7,16 studies with 6, 11 studies with 5, and 6 studies with 4 stars. The included studies were categorized into four categories. Cross-sectional studies reporting the association between reliance on social media and vaccine intentions mainly observed a negative relationship. Studies that performed thematic analyses of extracted social media data, mainly observed a domination of vaccine hesitant topics. Studies that explored the degree of polarization of specific social media contents related to COVID-19 vaccines observed a similar degree of content for both positive and negative tone posted on different social media platforms. Finally, studies that explored the fluctuations of vaccination attitudes/opinions gathered from social media identified specific events as significant cofactors that affect and shape vaccination intentions of individuals. INTERPRETATION This thorough examination of the various roles social media can play in disseminating information to the public, as well as how individuals behave on social media in the context of public health events, articulates the potential of social media as a platform of public health intervention to address vaccine hesitancy. FUNDING None.
Collapse
Affiliation(s)
- Fidelia Cascini
- Department of Life Sciences and Public Health, Section of Hygiene and Public Health, Università Cattolica del Sacro Cuore, L.go Francesco Vito 1, Rome 00168, Italy
- Corresponding author.
| | - Ana Pantovic
- Faculty of Biology, University of Belgrade, Belgrade, Serbia
| | | | - Giovanna Failla
- Department of Life Sciences and Public Health, Section of Hygiene and Public Health, Università Cattolica del Sacro Cuore, L.go Francesco Vito 1, Rome 00168, Italy
| | - Valeria Puleo
- Department of Life Sciences and Public Health, Section of Hygiene and Public Health, Università Cattolica del Sacro Cuore, L.go Francesco Vito 1, Rome 00168, Italy
| | - Andriy Melnyk
- Department of Life Sciences and Public Health, Section of Hygiene and Public Health, Università Cattolica del Sacro Cuore, L.go Francesco Vito 1, Rome 00168, Italy
| | - Alberto Lontano
- Department of Life Sciences and Public Health, Section of Hygiene and Public Health, Università Cattolica del Sacro Cuore, L.go Francesco Vito 1, Rome 00168, Italy
| | - Walter Ricciardi
- Department of Life Sciences and Public Health, Section of Hygiene and Public Health, Università Cattolica del Sacro Cuore, L.go Francesco Vito 1, Rome 00168, Italy
| |
Collapse
|
5
|
New Wave of COVID-19 Vaccine Opinions in the Month the 3rd Booster Dose Arrived. Vaccines (Basel) 2022; 10:vaccines10060881. [PMID: 35746490 PMCID: PMC9228932 DOI: 10.3390/vaccines10060881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 05/20/2022] [Accepted: 05/27/2022] [Indexed: 02/05/2023] Open
Abstract
Vaccination has been proposed as one of the most effective methods to combat the COVID-19 pandemic. Since the day the first vaccine, with an efficiency of more than 90%, was announced, the entire vaccination process and its possible consequences in large populations have generated a series of discussions on social media. Whereas the opinions triggered by the administration of the initial COVID-19 vaccine doses have been discussed in depth in the scientific literature, the approval of the so-called 3rd booster dose has only been analyzed in country-specific studies, primarily using questionnaires. In this context, the present paper conducts a stance analysis using a transformer-based deep learning model on a dataset containing 3,841,594 tweets in English collected between 12 July 2021 and 11 August 2021 (the month in which the 3rd dose arrived) and compares the opinions (in favor, neutral and against) with the ones extracted at the beginning of the vaccination process. In terms of COVID-19 vaccination hesitance, an analysis based on hashtags, n-grams and latent Dirichlet allocation is performed that highlights the main reasons behind the reluctance to vaccinate. The proposed approach can be useful in the context of the campaigns related to COVID-19 vaccination as it provides insights related to the public opinion and can be useful in creating communication messages to support the vaccination campaign.
Collapse
|