1
|
Zhang H, Shafiq MO. Survey of transformers and towards ensemble learning using transformers for natural language processing. JOURNAL OF BIG DATA 2024; 11:25. [PMID: 38321999 PMCID: PMC10838835 DOI: 10.1186/s40537-023-00842-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 10/11/2023] [Indexed: 02/08/2024]
Abstract
The transformer model is a famous natural language processing model proposed by Google in 2017. Now, with the extensive development of deep learning, many natural language processing tasks can be solved by deep learning methods. After the BERT model was proposed, many pre-trained models such as the XLNet model, the RoBERTa model, and the ALBERT model were also proposed in the research community. These models perform very well in various natural language processing tasks. In this paper, we describe and compare these well-known models. In addition, we also apply several types of existing and well-known models which are the BERT model, the XLNet model, the RoBERTa model, the GPT2 model, and the ALBERT model to different existing and well-known natural language processing tasks, and analyze each model based on their performance. There are a few papers that comprehensively compare various transformer models. In our paper, we use six types of well-known tasks, such as sentiment analysis, question answering, text generation, text summarization, name entity recognition, and topic modeling tasks to compare the performance of various transformer models. In addition, using the existing models, we also propose ensemble learning models for the different natural language processing tasks. The results show that our ensemble learning models perform better than a single classifier on specific tasks. Graphical Abstract
Collapse
Affiliation(s)
- Hongzhi Zhang
- School of Information Technology, Carleton University, Ottawa, ON Canada
| | - M. Omair Shafiq
- School of Information Technology, Carleton University, Ottawa, ON Canada
| |
Collapse
|
2
|
Stefanis C, Giorgi E, Kalentzis K, Tselemponis A, Nena E, Tsigalou C, Kontogiorgis C, Kourkoutas Y, Chatzak E, Dokas I, Constantinidis T, Bezirtzoglou E. Sentiment analysis of epidemiological surveillance reports on COVID-19 in Greece using machine learning models. Front Public Health 2023; 11:1191730. [PMID: 37533519 PMCID: PMC10392838 DOI: 10.3389/fpubh.2023.1191730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 06/30/2023] [Indexed: 08/04/2023] Open
Abstract
The present research deals with sentiment analysis performed with Microsoft Azure Machine Learning Studio to classify Facebook posts on the Greek National Public Health Organization (EODY) from November 2021 to January 2022 during the pandemic. Positive, negative and neutral sentiments were included after processing 300 reviews. This approach involved analyzing the words appearing in the comments and exploring the sentiments related to daily surveillance reports of COVID-19 published on the EODY Facebook page. Moreover, machine learning algorithms were implemented to predict the classification of sentiments. This research assesses the efficiency of a few popular machine learning models, which is one of the initial efforts in Greece in this domain. People have negative sentiments toward COVID surveillance reports. Words with the highest frequency of occurrence include government, vaccinated people, unvaccinated, telephone communication, health measures, virus, COVID-19 rapid/molecular tests, and of course, COVID-19. The experimental results disclose additionally that two classifiers, namely two class Neural Network and two class Bayes Point Machine, achieved high sentiment analysis accuracy and F1 score, particularly 87% and over 35%. A significant limitation of this study may be the need for more comparison with other research attempts that identified the sentiments of the EODY surveillance reports of COVID in Greece. Machine learning models can provide critical information combating public health hazards and enrich communication strategies and proactive actions in public health issues and opinion management during the COVID-19 pandemic.
Collapse
Affiliation(s)
- Christos Stefanis
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Elpida Giorgi
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Konstantinos Kalentzis
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Athanasios Tselemponis
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Evangelia Nena
- Pre-Clinical Education, Laboratory of Social Medicine, Medical School, Democritus University of Thrace, Alexandroupolis, Greece
| | - Christina Tsigalou
- Laboratory of Microbiology, Medical School, Democritus University of Thrace, Alexandroupolis, Greece
| | - Christos Kontogiorgis
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Yiannis Kourkoutas
- Laboratory of Applied Microbiology, Department of Molecular Biology and Genetics, Democritus University of Thrace, Alexandroupolis, Greece
| | - Ekaterini Chatzak
- Laboratory of Pharmacology, Medical School, Democritus University of Thrace, Alexandroupolis, Greece
| | - Ioannis Dokas
- Department of Civil Engineering, Democritus University of Thrace, Komotini, Greece
| | - Theodoros Constantinidis
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Eugenia Bezirtzoglou
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| |
Collapse
|
3
|
Alshahrani SM, Khan NA. COVID-19 advising application development for Apple devices (iOS). PeerJ Comput Sci 2023; 9:e1274. [PMID: 37346730 PMCID: PMC10280587 DOI: 10.7717/peerj-cs.1274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 02/13/2023] [Indexed: 06/23/2023]
Abstract
One of humanity's most devastating health crises was COVID-19. Billions of people suffered during this pandemic. In comparison with previous global pandemics that have been faced by the world before, societies were more accurate with the technical support system during this natural disaster. The intersection of data from healthcare units and the analysis of this data into various sophisticated systems were critical factors. Different healthcare units have taken special consideration to advance technical inputs to fight against such situations. The field of natural language processing (NLP) has dramatically supported this. Despite the primitive methods for monitoring the bio-metric factors of a person, the use of cognitive science has emerged as one of the most critical features during this pandemic era. One of the essential features is the potential to understand the data based on various texts and user inputs. The deployment of various NLP systems is one of the most challenging factors in handling the bulk amount of data flowing from multiple sources. This study focused on developing a powerful application to advise patients suffering from ailments related to COVID-19. The use of NLP refers to facilitating a user to identify the present critical situation and make necessary decisions while getting infected. This article also summarises the challenges associated with NLP and its usage for future NLP-based applications focusing on healthcare units. There are a couple of applications that reside for android-based systems as well as web-based chat-bot systems. In terms of security and safety, application development for iOS is more advanced. This study also explains the block meant of an application for advising COVID-19 infection. A natural language processing powered application for an iOS operating system is indeed one of its kind, which will help people who need to advise proper guidance. The article also portrays NLP-based application development for healthcare problems associated with personal reporting systems.
Collapse
Affiliation(s)
- Saeed M. Alshahrani
- Department of Computer Science, College of Computing and Information Technology, Shaqra University, Shaqra, Riyadh, Saudi Arabia
| | - Nayyar Ahmed Khan
- Department of Computer Science, College of Computing and Information Technology, Shaqra University, Shaqra, Riyadh, Saudi Arabia
| |
Collapse
|
4
|
Al-Garadi MA, Yang YC, Sarker A. The Role of Natural Language Processing during the COVID-19 Pandemic: Health Applications, Opportunities, and Challenges. Healthcare (Basel) 2022; 10:2270. [PMID: 36421593 PMCID: PMC9690240 DOI: 10.3390/healthcare10112270] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 11/03/2022] [Accepted: 11/06/2022] [Indexed: 07/30/2023] Open
Abstract
The COVID-19 pandemic is the most devastating public health crisis in at least a century and has affected the lives of billions of people worldwide in unprecedented ways. Compared to pandemics of this scale in the past, societies are now equipped with advanced technologies that can mitigate the impacts of pandemics if utilized appropriately. However, opportunities are currently not fully utilized, particularly at the intersection of data science and health. Health-related big data and technological advances have the potential to significantly aid the fight against such pandemics, including the current pandemic's ongoing and long-term impacts. Specifically, the field of natural language processing (NLP) has enormous potential at a time when vast amounts of text-based data are continuously generated from a multitude of sources, such as health/hospital systems, published medical literature, and social media. Effectively mitigating the impacts of the pandemic requires tackling challenges associated with the application and deployment of NLP systems. In this paper, we review the applications of NLP to address diverse aspects of the COVID-19 pandemic. We outline key NLP-related advances on a chosen set of topics reported in the literature and discuss the opportunities and challenges associated with applying NLP during the current pandemic and future ones. These opportunities and challenges can guide future research aimed at improving the current health and social response systems and pandemic preparedness.
Collapse
Affiliation(s)
- Mohammed Ali Al-Garadi
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37240, USA
| | - Yuan-Chi Yang
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Abeed Sarker
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA 30322, USA
| |
Collapse
|