1
|
Ravaut M, Zhao R, Phung D, Qin VM, Milovanovic D, Pienkowska A, Bojic I, Car J, Joty S. Targeting COVID-19 and Human Resources for Health News Information Extraction: Algorithm Development and Validation. JMIR AI 2024; 3:e55059. [PMID: 39475833 PMCID: PMC11561429 DOI: 10.2196/55059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 03/17/2024] [Accepted: 07/10/2024] [Indexed: 11/17/2024]
Abstract
BACKGROUND Global pandemics like COVID-19 put a high amount of strain on health care systems and health workers worldwide. These crises generate a vast amount of news information published online across the globe. This extensive corpus of articles has the potential to provide valuable insights into the nature of ongoing events and guide interventions and policies. However, the sheer volume of information is beyond the capacity of human experts to process and analyze effectively. OBJECTIVE The aim of this study was to explore how natural language processing (NLP) can be leveraged to build a system that allows for quick analysis of a high volume of news articles. Along with this, the objective was to create a workflow comprising human-computer symbiosis to derive valuable insights to support health workforce strategic policy dialogue, advocacy, and decision-making. METHODS We conducted a review of open-source news coverage from January 2020 to June 2022 on COVID-19 and its impacts on the health workforce from the World Health Organization (WHO) Epidemic Intelligence from Open Sources (EIOS) by synergizing NLP models, including classification and extractive summarization, and human-generated analyses. Our DeepCovid system was trained on 2.8 million news articles in English from more than 3000 internet sources across hundreds of jurisdictions. RESULTS Rules-based classification with hand-designed rules narrowed the data set to 8508 articles with high relevancy confirmed in the human-led evaluation. DeepCovid's automated information targeting component reached a very strong binary classification performance of 98.98 for the area under the receiver operating characteristic curve (ROC-AUC) and 47.21 for the area under the precision recall curve (PR-AUC). Its information extraction component attained good performance in automatic extractive summarization with a mean Recall-Oriented Understudy for Gisting Evaluation (ROUGE) score of 47.76. DeepCovid's final summaries were used by human experts to write reports on the COVID-19 pandemic. CONCLUSIONS It is feasible to synergize high-performing NLP models and human-generated analyses to benefit open-source health workforce intelligence. The DeepCovid approach can contribute to an agile and timely global view, providing complementary information to scientific literature.
Collapse
Affiliation(s)
| | - Ruochen Zhao
- Nanyang Technological University, Singapore, Singapore
| | - Duy Phung
- Nanyang Technological University, Singapore, Singapore
| | | | | | | | - Iva Bojic
- Nanyang Technological University, Singapore, Singapore
| | - Josip Car
- King's College London, London, United Kingdom
| | - Shafiq Joty
- Nanyang Technological University, Singapore, Singapore
- Salesforce Research, San Francisco, CA, United States
| |
Collapse
|
2
|
Dhariwal N, Sengupta N, Madiajagan M, Patro KK, Kumari PL, Abdel Samee N, Tadeusiewicz R, Pławiak P, Prakash AJ. A pilot study on AI-driven approaches for classification of mental health disorders. Front Hum Neurosci 2024; 18:1376338. [PMID: 38660009 PMCID: PMC11039883 DOI: 10.3389/fnhum.2024.1376338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Accepted: 03/20/2024] [Indexed: 04/26/2024] Open
Abstract
The increasing prevalence of mental disorders among youth worldwide is one of society's most pressing issues. The proposed methodology introduces an artificial intelligence-based approach for comprehending and analyzing the prevalence of neurological disorders. This work draws upon the analysis of the Cities Health Initiative dataset. It employs advanced machine learning and deep learning techniques, integrated with data science, statistics, optimization, and mathematical modeling, to correlate various lifestyle and environmental factors with the incidence of these mental disorders. In this work, a variety of machine learning and deep learning models with hyper-parameter tuning are utilized to forecast trends in the occurrence of mental disorders about lifestyle choices such as smoking and alcohol consumption, as well as environmental factors like air and noise pollution. Among these models, the convolutional neural network (CNN) architecture, termed as DNN1 in this paper, accurately predicts mental health occurrences relative to the population mean with a maximum accuracy of 99.79%. Among the machine learning models, the XGBoost technique yields an accuracy of 95.30%, with an area under the ROC curve of 0.9985, indicating robust training. The research also involves extracting feature importance scores for the XGBoost classifier, with Stroop test performance results attaining the highest importance score of 0.135. Attributes related to addiction, namely smoking and alcohol consumption, hold importance scores of 0.0273 and 0.0212, respectively. Statistical tests on the training models reveal that XGBoost performs best on the mean squared error and R-squared tests, achieving scores of 0.013356 and 0.946481, respectively. These statistical evaluations bolster the models' credibility and affirm the best-fit models' accuracy. The proposed research in the domains of mental health, addiction, and pollution stands to aid healthcare professionals in diagnosing and treating neurological disorders in both youth and adults promptly through the use of predictive models. Furthermore, it aims to provide valuable insights for policymakers in formulating new regulations on pollution and addiction.
Collapse
Affiliation(s)
- Naman Dhariwal
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Nidhi Sengupta
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - M. Madiajagan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Kiran Kumar Patro
- Department of ECE, Aditya Institute of Technology and Management (A), Tekkali, Andhra Pradesh, India
| | - P. Lalitha Kumari
- School of Computer Science and Engineering, Vellore Institute of Technology, Amaravati, Andhra Pradesh, India
| | - Nagwan Abdel Samee
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Ryszard Tadeusiewicz
- Department of Biocybernetics and Biomedical Engineering, AGH University of Science and Technology, Krakow, Poland
| | - Paweł Pławiak
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Krakow, Poland
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Gliwice, Poland
| | - Allam Jaya Prakash
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| |
Collapse
|
3
|
Rezapour M, Elmshaeuser SK. Artificial intelligence-based analytics for impacts of COVID-19 and online learning on college students' mental health. PLoS One 2022; 17:e0276767. [PMID: 36399458 PMCID: PMC9674166 DOI: 10.1371/journal.pone.0276767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 10/13/2022] [Indexed: 11/19/2022] Open
Abstract
COVID-19, the disease caused by the novel coronavirus (SARS-CoV-2), first emerged in Wuhan, China late in December 2019. Not long after, the virus spread worldwide and was declared a pandemic by the World Health Organization in March 2020. This caused many changes around the world and in the United States, including an educational shift towards online learning. In this paper, we seek to understand how the COVID-19 pandemic and the increase in online learning impact college students' emotional wellbeing. We use several machine learning and statistical models to analyze data collected by the Faculty of Public Administration at the University of Ljubljana, Slovenia in conjunction with an international consortium of universities, other higher education institutions, and students' associations. Our results indicate that features related to students' academic life have the largest impact on their emotional wellbeing. Other important factors include students' satisfaction with their university's and government's handling of the pandemic as well as students' financial security.
Collapse
Affiliation(s)
- Mostafa Rezapour
- Department of Mathematics, Wake Forest University, Winston-Salem, NC, United States of America
| | - Scott K. Elmshaeuser
- Department of Mathematics, Wake Forest University, Winston-Salem, NC, United States of America
| |
Collapse
|
4
|
Hammoud B, Semaan A, Elhajj I, Benova L. Can machine learning models predict maternal and newborn healthcare providers' perception of safety during the COVID-19 pandemic? A cross-sectional study of a global online survey. HUMAN RESOURCES FOR HEALTH 2022; 20:63. [PMID: 35986293 PMCID: PMC9389509 DOI: 10.1186/s12960-022-00758-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 08/08/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND Maternal and newborn healthcare providers are essential professional groups vulnerable to physical and psychological risks associated with the COVID-19 pandemic. This study uses machine learning algorithms to create a predictive tool for maternal and newborn healthcare providers' perception of being safe in the workplace globally during the pandemic. METHODS We used data collected between 24 March and 5 July 2020 through a global online survey of maternal and newborn healthcare providers. The questionnaire was available in 12 languages. To predict healthcare providers' perception of safety in the workplace, we used features collected in the questionnaire, in addition to publicly available national economic and COVID-19-related factors. We built, trained and tested five machine learning models: Support Vector Machine (SVM), Random Forest (RF), XGBoost, CatBoost and Artificial Neural Network (ANN) for classification and regression. We extracted from RF models the relative contribution of features in output prediction. RESULTS Models included data from 941 maternal and newborn healthcare providers from 89 countries. ML models performed well in classification and regression tasks, whereby RF had 82% cross-validated accuracy for classification, and CatBoost with 0.46 cross-validated root mean square error for regression. In both classification and regression, the most important features contributing to output prediction were classified as three themes: (1) information accessibility, clarity and quality; (2) availability of support and means of protection; and (3) COVID-19 epidemiology. CONCLUSION This study identified salient features contributing to maternal and newborn healthcare providers perception of safety in the workplace. The developed tool can be used by health systems globally to allow real-time learning from data collected during a health system shock. By responding in real-time to the needs of healthcare providers, health systems could prevent potential negative consequences on the quality of care offered to women and newborns.
Collapse
Affiliation(s)
- Bassel Hammoud
- Biomedical Engineering Program, Faculty of Medicine-Maroun Semaan Faculty of Engineering and Architecture, American University of Beirut, Beirut, Lebanon
| | - Aline Semaan
- Department of Public Health, Institute of Tropical Medicine, Antwerp, Belgium
| | - Imad Elhajj
- Electrical and Computer Engineering Department, Maroun Semaan Faculty of Engineering and Architecture, American University of Beirut, Beirut, Lebanon
| | - Lenka Benova
- Department of Public Health, Institute of Tropical Medicine, Antwerp, Belgium
| |
Collapse
|