1
|
Ismail H, Hussein N, Elabyad R, Abdelhalim S, Elhadef M. Aspect-based classification of vaccine misinformation: a spatiotemporal analysis using Twitter chatter. BMC Public Health 2023; 23:1193. [PMID: 37340455 DOI: 10.1186/s12889-023-16067-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 06/06/2023] [Indexed: 06/22/2023] Open
Abstract
BACKGROUND The spread of misinformation of all types threatens people's safety and interrupts resolutions. COVID-19 vaccination has been a widely discussed topic on social media platforms with numerous misleading and fallacious information. This false information has a critical impact on the safety of society as it prevents many people from taking the vaccine, decelerating the world's ability to go back to normal. Therefore, it is vital to analyze the content shared on social media platforms, detect misinformation, identify aspects of misinformation, and efficiently represent related statistics to combat the spread of misleading information about the vaccine. This paper aims to support stakeholders in decision-making by providing solid and current insights into the spatiotemporal progression of the common misinformation aspects of the various available vaccines. METHODS Approximately 3800 tweets were annotated into four expert-verified aspects of vaccine misinformation obtained from reliable medical resources. Next, an Aspect-based Misinformation Analysis Framework was designed using the Light Gradient Boosting Machine (LightGBM) model, which is one of the most advanced, fast, and efficient machine learning models to date. Based on this dataset, spatiotemporal statistical analysis was performed to infer insights into the progression of aspects of vaccine misinformation among the public. Finally, the Pearson correlation coefficient and p-values are calculated for the global misinformation count against the vaccination counts of 43 countries from December 2020 until July 2021. RESULTS The optimized classification per class (i.e., per an aspect of misinformation) accuracy was 87.4%, 92.7%, 80.1%, and 82.5% for the "Vaccine Constituent," "Adverse Effects," "Agenda," "Efficacy and Clinical Trials" aspects, respectively. The model achieved an Area Under the ROC Curve (AUC) of 90.3% and 89.6% for validation and testing, respectively, which indicates the reliability of the proposed framework in detecting aspects of vaccine misinformation on Twitter. The correlation analysis shows that 37% of the countries addressed in this study were negatively affected by the spread of misinformation on Twitter resulting in reduced number of administered vaccines during the same timeframe. CONCLUSIONS Twitter is a rich source of insight on the progression of vaccine misinformation among the public. Machine Learning models, such as LightGBM, are efficient for multi-class classification and proved reliable in classifying vaccine misinformation aspects even with limited samples in social media datasets.
Collapse
Affiliation(s)
- Heba Ismail
- College of Engineering, Computer Science and Information Technology Department, Abu Dhabi University, Abu Dhabi, United Arab Emirates.
| | - Nada Hussein
- College of Engineering, Electrical, Computer, and Biomedical Engineering Department, Abu Dhabi University, Abu Dhabi, United Arab Emirates
| | - Rawan Elabyad
- College of Engineering, Computer Science and Information Technology Department, Abu Dhabi University, Abu Dhabi, United Arab Emirates
| | - Salma Abdelhalim
- College of Engineering, Computer Science and Information Technology Department, Abu Dhabi University, Abu Dhabi, United Arab Emirates
| | - Mourad Elhadef
- College of Engineering, Computer Science and Information Technology Department, Abu Dhabi University, Abu Dhabi, United Arab Emirates
| |
Collapse
|
2
|
Paul SG, Saha A, Biswas AA, Zulfiker MS, Arefin MS, Rahman MM, Reza AW. Combating Covid-19 using machine learning and deep learning: Applications, challenges, and future perspectives. ARRAY 2023; 17:100271. [PMID: 36530931 PMCID: PMC9737520 DOI: 10.1016/j.array.2022.100271] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 12/05/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open
Abstract
COVID-19, a worldwide pandemic that has affected many people and thousands of individuals have died due to COVID-19, during the last two years. Due to the benefits of Artificial Intelligence (AI) in X-ray image interpretation, sound analysis, diagnosis, patient monitoring, and CT image identification, it has been further researched in the area of medical science during the period of COVID-19. This study has assessed the performance and investigated different machine learning (ML), deep learning (DL), and combinations of various ML, DL, and AI approaches that have been employed in recent studies with diverse data formats to combat the problems that have arisen due to the COVID-19 pandemic. Finally, this study shows the comparison among the stand-alone ML and DL-based research works regarding the COVID-19 issues with the combinations of ML, DL, and AI-based research works. After in-depth analysis and comparison, this study responds to the proposed research questions and presents the future research directions in this context. This review work will guide different research groups to develop viable applications based on ML, DL, and AI models, and will also guide healthcare institutes, researchers, and governments by showing them how these techniques can ease the process of tackling the COVID-19.
Collapse
Affiliation(s)
- Showmick Guha Paul
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Arpa Saha
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Al Amin Biswas
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh,Corresponding author
| | - Md. Sabab Zulfiker
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Mohammad Shamsul Arefin
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh,Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chittagong, Bangladesh
| | - Md. Mahfujur Rahman
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Ahmed Wasif Reza
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| |
Collapse
|
3
|
Alawadh HM, Alabrah A, Meraj T, Rauf HT. Attention-Enriched Mini-BERT Fake News Analyzer Using the Arabic Language. FUTURE INTERNET 2023; 15:44. [DOI: 10.3390/fi15020044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2023] Open
Abstract
Internet use resulted in people becoming more reliant on social media. Social media have become the main source of fake news or rumors. They spread uncertainty in each sector of the real world, whether in politics, sports, or celebrities’ lives—all are affected by the uncontrolled behavior of social media platforms. Intelligent methods used to control this fake news in various languages have already been much discussed and frequently proposed by researchers. However, Arabic grammar and language are a far more complex and crucial language to learn. Therefore, work on Arabic fake-news-based datasets and related studies is much needed to control the spread of fake news on social media and other Internet media. The current study uses a recently published dataset of Arabic fake news annotated by experts. Further, Arabic-language-based embeddings are given to machine learning (ML) classifiers, and the Arabic-language-based trained minibidirectional encoder representations from transformers (BERT) is used to obtain the sentiments of Arabic grammar and feed a deep learning (DL) classifier. The holdout validation schemes are applied to both ML classifiers and mini-BERT-based deep neural classifiers. The results show a consistent improvement in the performance of mini-BERT-based classifiers, which outperformed ML classifiers, by increasing the training data. A comparison with previous Arabic fake news detection studies is shown where results of the current study show greater improvement.
Collapse
Affiliation(s)
- Husam M. Alawadh
- Department of English Language and Translation, College of Languages and Translation, King Saud University, Riyadh 11451, Saudi Arabia
| | - Amerah Alabrah
- Department of Information Systems, College of Computer and Information Sciences, King Saud University, Riyadh 11451, Saudi Arabia
| | - Talha Meraj
- Department of Computer Science, COMSATS University Islamabad—Wah Campus, Wah Cantt 47040, Pakistan
| | - Hafiz Tayyab Rauf
- Centre for Smart Systems, AI and Cybersecurity, Staffordshire University, Stoke-on-Trent ST4 2DE, UK
| |
Collapse
|
4
|
Bai S, Yu D, Yang M, Tang R, He H, Zhao J, Huang P. Is negative e-WOM more powerful? Multimodal data analysis on air passengers’ perception of COVID-19 safety measures. Front Psychol 2022; 13:983987. [PMID: 36329743 PMCID: PMC9624191 DOI: 10.3389/fpsyg.2022.983987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/12/2022] [Indexed: 11/22/2022] Open
Abstract
During the normalization stage of the COVID-19 epidemic prevention and control, the safety threats caused by improper epidemic prevention measures of airlines have become the primary concern for air passengers. Negative e-WOM related to safety perception obtained based on online multimodal reviews of travel websites has become an important decision-making basis for potential air passengers when making airline choices. This study aims to examine the relationship between potential air passengers’ negative safety perception and the usefulness of online reviews, as well as to test the moderating effect of review modality and airline type. It also further explores the effectiveness and feasibility of applying big data sentiment analysis to e-WOM management. To this end, the theoretical model of negative safety perception, review modality, and airline type affecting review usefulness was constructed. Then we select 10 low-cost airlines and 10 full-service airlines, respectively, according to the number of reviews sorted by the TripAdvisor website, and use crawling techniques to obtain 10,485 reviews related to COVID-19 safety of the above companies from December 2019 to date, and conduct safety perception sentiment analysis based on Python’s Textblob library. Finally, to avoid data overdispersion, the model is empirically analyzed by negative binomial regression using R software. The results indicate that (1) Negative safety perception significantly and negatively affects review usefulness, that is, extreme negative safety perception can provide higher review usefulness for potential air passengers. (2) Review modality and airline type have a significant moderating effect on the relationship between negative safety perception and review usefulness, in which multimodal reviews and full-service airlines both weakened the negative impact of negative safety perception on review usefulness. The theoretical model in this paper is both an extension of the application of big data sentiment analysis techniques and a beneficial supplement to current research findings of e-WOM, providing an important reference for potential air passengers to identify useful reviews accurately and thus reduce safety risks in online decision-making.
Collapse
Affiliation(s)
- Shizhen Bai
- School of Management, Harbin University of Commerce, Harbin, China
| | - Dingyao Yu
- School of Management, Harbin University of Commerce, Harbin, China
| | - Mu Yang
- Department of Management, Birkbeck, University of London, London, United Kingdom
- *Correspondence: Mu Yang,
| | - Rui Tang
- School of Economics Teaching and Research, Party School of the Central Committee of C.P.C (Chinese Academy of Governance), Beijing, China
| | - Hao He
- School of Management, Harbin University of Commerce, Harbin, China
| | - Jiayuan Zhao
- School of Computer and Information Engineering, Harbin University of Commerce, Harbin, China
| | - Peihua Huang
- School of Management, Harbin University of Commerce, Harbin, China
| |
Collapse
|
5
|
Abstract
Sentiment analysis has been one of the most active research areas in the past decade due to its vast applications. Sentiment quantification, a new research problem in this field, extends sentiment analysis from individual documents to an aggregated collection of documents. Sentiment analysis has been widely researched, but sentiment quantification has drawn less attention despite offering a greater potential to enhance current business intelligence systems. In this research, to perform sentiment quantification, a framework based on feature engineering is proposed to exploit diverse feature sets such as sentiment, content, and part of speech, as well as deep features including word2vec and GloVe. Different machine learning algorithms, including conventional, ensemble learners, and deep learning approaches, have been investigated on standard datasets of SemEval2016, SemEval2017, STS-Gold, and Sanders. The empirical-based results reveal the effectiveness of the proposed feature sets in the process of sentiment quantification when applied to machine learning algorithms. The results also reveal that the ensemble-based algorithm AdaBoost outperforms other conventional machine learning algorithms using a combination of proposed feature sets. The deep learning algorithm RNN, on the other hand, shows optimal results using word embedding-based features. This research has the potential to help diverse applications of sentiment quantification, including polling, trend analysis, automatic summarization, and rumor or fake news detection.
Collapse
|
6
|
A Novel text2IMG Mechanism of Credit Card Fraud Detection: A Deep Learning Approach. ELECTRONICS 2022. [DOI: 10.3390/electronics11050756] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Online sales and purchases are increasing daily, and they generally involve credit card transactions. This not only provides convenience to the end-user but also increases the frequency of online credit card fraud. In the recent years, in some countries, this fraud increase has led to an exponential increase in credit card fraud detection, which has become increasingly important to address this security issue. Recent studies have proposed machine learning (ML)-based solutions for detecting fraudulent credit card transactions, but their detection scores still need improvement due to the imbalance of classes in any given dataset. Few approaches have achieved exceptional results on different datasets. In this study, the Kaggle dataset was used to develop a deep learning (DL)-based approach to solve the text data problem. A novel text2IMG conversion technique is proposed that generates small images. The images are fed into a CNN architecture with class weights using the inverse frequency method to resolve the class imbalance issue. DL and ML approaches were applied to verify the robustness and validity of the proposed system. An accuracy of 99.87% was achieved by Coarse-KNN using deep features of the proposed CNN.
Collapse
|