Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zunic A, Corcoran P, Spasic I. Sentiment Analysis in Health and Well-Being: Systematic Review. JMIR Med Inform 2020;8:e16023. [PMID: 32012057 PMCID: PMC7013658 DOI: 10.2196/16023] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 10/26/2019] [Accepted: 10/27/2019] [Indexed: 12/22/2022] Open

For:	Zunic A, Corcoran P, Spasic I. Sentiment Analysis in Health and Well-Being: Systematic Review. JMIR Med Inform 2020;8:e16023. [PMID: 32012057 PMCID: PMC7013658 DOI: 10.2196/16023] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 10/26/2019] [Accepted: 10/27/2019] [Indexed: 12/22/2022] Open

Number

Cited by Other Article(s)

Wu D, Shead H, Ren Y, Raynor P, Tao Y, Villanueva H, Hung P, Li X, Brookshire RG, Eichelberger K, Guille C, Litwin AH, Olatosi B. Uncovering the Complexity of Perinatal Polysubstance Use Disclosure Patterns on X: Mixed Methods Study. J Med Internet Res 2024;26:e53171. [PMID: 39302713 DOI: 10.2196/53171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 05/06/2024] [Accepted: 06/11/2024] [Indexed: 09/22/2024] Open

Abstract

BACKGROUND

According to the Morbidity and Mortality Weekly Report, polysubstance use among pregnant women is prevalent, with 38.2% of those who consume alcohol also engaging in the use of one or more additional substances. However, the underlying mechanisms, contexts, and experiences of polysubstance use are unclear. Organic information is abundant on social media such as X (formerly Twitter). Traditional quantitative and qualitative methods, as well as natural language processing techniques, can be jointly used to derive insights into public opinions, sentiments, and clinical and public health policy implications.

OBJECTIVE

Based on perinatal polysubstance use (PPU) data that we extracted on X from May 1, 2019, to October 31, 2021, we proposed two primary research questions: (1) What is the overall trend and sentiment of PPU discussions on X? (2) Are there any distinct patterns in the discussion trends of PPU-related tweets? If so, what are the implications for perinatal care and associated public health policies?

METHODS

We used X's application programming interface to extract >6 million raw tweets worldwide containing ≥2 prenatal health- and substance-related keywords provided by our clinical team. After removing all non-English-language tweets, non-US tweets, and US tweets without disclosed geolocations, we obtained 4848 PPU-related US tweets. We then evaluated them using a mixed methods approach. The quantitative analysis applied frequency, trend analysis, and several natural language processing techniques such as sentiment analysis to derive statistics to preview the corpus. To further understand semantics and clinical insights among these tweets, we conducted an in-depth thematic content analysis with a random sample of 500 PPU-related tweets with a satisfying κ score of 0.7748 for intercoder reliability.

RESULTS

Our quantitative analysis indicates the overall trends, bigram and trigram patterns, and negative sentiments were more dominant in PPU tweets (2490/4848, 51.36%) than in the non-PPU sample (1323/4848, 27.29%). Paired polysubstance use (4134/4848, 85.27%) was the most common, with the combination alcohol and drugs identified as the most mentioned. From the qualitative analysis, we identified 3 main themes: nonsubstance, single substance, and polysubstance, and 4 subthemes to contextualize the rationale of underlying PPU behaviors: lifestyle, perceptions of others' drug use, legal implications, and public health.

CONCLUSIONS

This study identified underexplored, emerging, and important topics related to perinatal PPU, with significant stigmas and legal ramifications discussed on X. Overall, public sentiments on PPU were mixed, encompassing negative (2490/4848, 51.36%), positive (1884/4848, 38.86%), and neutral (474/4848, 9.78%) sentiments. The leading substances in PPU were alcohol and drugs, and the normalization of PPU discussed on X is becoming more prevalent. Thus, this study provides valuable insights to further understand the complexity of PPU and its implications for public health practitioners and policy makers to provide proper access and support to individuals with PPU.

Collapse

Das S, Catterall J, Stone R, Clough AR. "The reasons you believe …": An exploratory study of text driven evidence gathering and prediction from first responder records justifying state authorised intervention for mental health episodes. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024;254:108257. [PMID: 38901271 DOI: 10.1016/j.cmpb.2024.108257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 05/13/2024] [Accepted: 05/28/2024] [Indexed: 06/22/2024]

Abstract

Objective First responders' mandatory reports of mental health episodes requiring emergency hospital care contain rich information about patients and their needs. In Queensland (Australia) much of the information contained in Emergency Examination Authorities (EEAs) remains unused. We propose and demonstrate a methodology to extract and translate vital information embedded in reports like EEAs and to use it to investigate the extreme propensity of incidence of serious mental health episodes. Methods The proposed method integrates clinical, demographic, spatial and free text information into a single data collection. The data is subjected to exploratory analysis for spatial pattern recognition leading to an observational epidemiology model for the association of maximum spatial recurrence of EEA episodes. Results Sentiment analysis revealed that among EEA presentations hospital and health service (HHS) region #4 had the lowest proportion of positive sentiments (18 %) compared to 33 % for HHS region #1 pointing to spatial differentiation of sentiments immanent in mandated free text which required more detailed analysis. At the postcode geographical level, we found that variation in maximum spatial recurrence of EEAs was significantly positively associated with spatial range of sentiments (0.29, p < 0.001) and the postcode-referenced sex ratio (0.45, p = 0.01). The volatility of sentiments significantly correlated with extremes of recurrence of EEA episodes. The predicted (probabilistic) incidence rate when mapped reflected this correlation. Conclusions The paper demonstrates the efficacy of integrating, machine extracted, human sentiments (as potential surrogates) with conventional exposure variables for evidence-based methods for mental health spatial epidemiology. Such insights from informatics-driven epidemiological observations may inform the strategic allocation of health system resources to address the highest levels of need and to improve the standard of care for mental patients while also enhancing their safe and humane treatment and management.

Collapse

Paradise Vit A, Magid A. Differences in Fear and Negativity Levels Between Formal and Informal Health-Related Websites: Analysis of Sentiments and Emotions. J Med Internet Res 2024;26:e55151. [PMID: 39120928 PMCID: PMC11344190 DOI: 10.2196/55151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 05/19/2024] [Accepted: 06/07/2024] [Indexed: 08/10/2024] Open

Abstract

BACKGROUND

Searching for web-based health-related information is frequently performed by the public and may affect public behavior regarding health decision-making. Particularly, it may result in anxiety, erroneous, and harmful self-diagnosis. Most searched health-related topics are cancer, cardiovascular diseases, and infectious diseases. A health-related web-based search may result in either formal or informal medical website, both of which may evoke feelings of fear and negativity.

OBJECTIVE

Our study aimed to assess whether there is a difference in fear and negativity levels between information appearing on formal and informal health-related websites.

METHODS

A web search was performed to retrieve the contents of websites containing symptoms of selected diseases, using selected common symptoms. Retrieved websites were classified into formal and informal websites. Fear and negativity of each content were evaluated using 3 transformer models. A fourth transformer model was fine-tuned using an existing emotion data set obtained from a web-based health community. For formal and informal websites, fear and negativity levels were aggregated. t tests were conducted to evaluate the differences in fear and negativity levels between formal and informal websites.

RESULTS

In this study, unique websites (N=1448) were collected, of which 534 were considered formal and 914 were considered informal. There were 1820 result pages from formal websites and 1494 result pages from informal websites. According to our findings, fear levels were statistically higher (t2753=3.331; P<.001) on formal websites (mean 0.388, SD 0.177) than on informal websites (mean 0.366, SD 0.168). The results also show that the level of negativity was statistically higher (t2753=2.726; P=.006) on formal websites (mean 0.657, SD 0.211) than on informal websites (mean 0.636, SD 0.201).

CONCLUSIONS

Positive texts may increase the credibility of formal health websites and increase their usage by the general public and the public's compliance to the recommendations. Increasing the usage of natural language processing tools before publishing health-related information to achieve a more positive and less stressful text to be disseminated to the public is recommended.

Collapse

Yoon S, Jang J, Son G, Park S, Hwang J, Choeh JY, Choi KH. Predicting neuroticism with open-ended response using natural language processing. Front Psychiatry 2024;15:1437569. [PMID: 39149156 PMCID: PMC11324482 DOI: 10.3389/fpsyt.2024.1437569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 07/17/2024] [Indexed: 08/17/2024] Open

Gondode P, Duggal S, Garg N, Sethupathy S, Asai O, Lohakare P. Comparing patient education tools for chronic pain medications: Artificial intelligence chatbot versus traditional patient information leaflets. Indian J Anaesth 2024;68:631-636. [PMID: 39081915 PMCID: PMC11285886 DOI: 10.4103/ija.ija_204_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2024] [Revised: 05/03/2024] [Accepted: 05/05/2024] [Indexed: 08/02/2024] Open

Abstract

Background and Aims

Artificial intelligence (AI) chatbots like Conversational Generative Pre-trained Transformer (ChatGPT) have recently created much buzz, especially regarding patient education. Such informed patients understand and adhere to the management and get involved in shared decision making. The accuracy and understandability of the generated educational material are prime concerns. Thus, we compared ChatGPT with traditional patient information leaflets (PILs) about chronic pain medications.

Methods

Patients' frequently asked questions were generated from PILs available on the official websites of the British Pain Society (BPS) and the Faculty of Pain Medicine. Eight blinded annexures were prepared for evaluation, consisting of traditional PILs from the BPS and AI-generated patient information materials structured similar to PILs by ChatGPT. The authors performed a comparative analysis to assess materials' readability, emotional tone, accuracy, actionability, and understandability. Readability was measured using Flesch Reading Ease (FRE), Gunning Fog Index (GFI), and Flesch-Kincaid Grade Level (FKGL). Sentiment analysis determined emotional tone. An expert panel evaluated accuracy and completeness. Actionability and understandability were assessed with the Patient Education Materials Assessment Tool.

Results

Traditional PILs generally exhibited higher readability (P values < 0.05), with [mean (standard deviation)] FRE [62.25 (1.6) versus 48 (3.7)], GFI [11.85 (0.9) versus 13.65 (0.7)], and FKGL [8.33 (0.5) versus 10.23 (0.5)] but varied emotional tones, often negative, compared to more positive sentiments in ChatGPT-generated texts. Accuracy and completeness did not significantly differ between the two. Actionability and understandability scores were comparable.

Conclusion

While AI chatbots offer efficient information delivery, ensuring accuracy and readability, patient-centeredness remains crucial. It is imperative to balance innovation with evidence-based practice.

Collapse

Molenaar A, Jenkins EL, Brennan L, Lukose D, McCaffrey TA. The use of sentiment and emotion analysis and data science to assess the language of nutrition-, food- and cooking-related content on social media: a systematic scoping review. Nutr Res Rev 2024;37:43-78. [PMID: 36991525 DOI: 10.1017/s0954422423000069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]

Menger NS, Tognetti A, Farruggia MC, Mucignat C, Bhutani S, Cooper KW, Rohlfs Domínguez P, Heinbockel T, Shields VDC, D'Errico A, Pereda-Loth V, Pierron D, Koyama S, Croijmans I. Giving a Voice to Patients With Smell Disorders Associated With COVID-19: Cross-Sectional Longitudinal Analysis Using Natural Language Processing of Self-Reports. JMIR Public Health Surveill 2024;10:e47064. [PMID: 38728069 PMCID: PMC11127136 DOI: 10.2196/47064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 10/26/2023] [Accepted: 03/11/2024] [Indexed: 05/12/2024] Open

Abstract

BACKGROUND

Smell disorders are commonly reported with COVID-19 infection. The smell-related issues associated with COVID-19 may be prolonged, even after the respiratory symptoms are resolved. These smell dysfunctions can range from anosmia (complete loss of smell) or hyposmia (reduced sense of smell) to parosmia (smells perceived differently) or phantosmia (smells perceived without an odor source being present). Similar to the difficulty that people experience when talking about their smell experiences, patients find it difficult to express or label the symptoms they experience, thereby complicating diagnosis. The complexity of these symptoms can be an additional burden for patients and health care providers and thus needs further investigation.

OBJECTIVE

This study aims to explore the smell disorder concerns of patients and to provide an overview for each specific smell disorder by using the longitudinal survey conducted in 2020 by the Global Consortium for Chemosensory Research, an international research group that has been created ad hoc for studying chemosensory dysfunctions. We aimed to extend the existing knowledge on smell disorders related to COVID-19 by analyzing a large data set of self-reported descriptive comments by using methods from natural language processing.

METHODS

We included self-reported data on the description of changes in smell provided by 1560 participants at 2 timepoints (second survey completed between 23 and 291 days). Text data from participants who still had smell disorders at the second timepoint (long-haulers) were compared with the text data of those who did not (non-long-haulers). Specifically, 3 aims were pursued in this study. The first aim was to classify smell disorders based on the participants' self-reports. The second aim was to classify the sentiment of each self-report by using a machine learning approach, and the third aim was to find particular food and nonfood keywords that were more salient among long-haulers than those among non-long-haulers.

RESULTS

We found that parosmia (odds ratio [OR] 1.78, 95% CI 1.35-2.37; P<.001) as well as hyposmia (OR 1.74, 95% CI 1.34-2.26; P<.001) were more frequently reported in long-haulers than in non-long-haulers. Furthermore, a significant relationship was found between long-hauler status and sentiment of self-report (P<.001). Finally, we found specific keywords that were more typical for long-haulers than those for non-long-haulers, for example, fire, gas, wine, and vinegar.

CONCLUSIONS

Our work shows consistent findings with those of previous studies, which indicate that self-reports, which can easily be extracted online, may offer valuable information to health care and understanding of smell disorders. At the same time, our study on self-reports provides new insights for future studies investigating smell disorders.

Collapse

Reis FJJ, Bonfim IDS, Corrêa LA, Nogueira LC, Meziat-Filho N, Almeida RSD. Uncovering emotional and network dynamics in the speech of patients with chronic low back pain. Musculoskelet Sci Pract 2024;70:102925. [PMID: 38430821 DOI: 10.1016/j.msksp.2024.102925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 01/26/2024] [Accepted: 02/23/2024] [Indexed: 03/05/2024]

Ntiamoah M, Xavier T, Lambert J. Sentiment Analysis of Patient- and Family-Related Sepsis Events: Exploratory Study. JMIR Nurs 2024;7:e51720. [PMID: 38557694 PMCID: PMC11019419 DOI: 10.2196/51720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 01/24/2024] [Accepted: 02/07/2024] [Indexed: 04/04/2024] Open

Abstract

BACKGROUND

Despite the life-threatening nature of sepsis, little is known about the emotional experiences of patients and their families during sepsis events. We conducted a sentiment analysis pertaining to sepsis incidents involving patients and families, leveraging textual data retrieved from a publicly available blog post disseminated by the Centers for Disease Control and Prevention (CDC).

OBJECTIVE

This investigation involved a sentiment analysis of patient- and family-related sepsis events, leveraging text responses sourced from a publicly accessible blog post disseminated by the CDC. Driven by the imperative to elucidate the emotional dynamics encountered by patients and their families throughout sepsis incidents, the overarching aims centered on elucidating the emotional ramifications of sepsis on both patients and their families and discerning potential avenues for enhancing the quality of sepsis care.

METHODS

The research used a cross-sectional data mining methodology to investigate the sentiments and emotional aspects linked to sepsis, using a data set sourced from the CDC, which encompasses 170 responses from both patients and caregivers, spanning the period between September 2014 and September 2020. This investigation used the National Research Council Canada Emotion Lexicon for sentiment analysis, coupled with a combination of manual and automated techniques to extract salient features from textual responses. The study used negative binomial least absolute shrinkage and selection operator regressions to ascertain significant textual features that correlated with specific emotional states. Moreover, the visualization of Plutchik's Wheel of Emotions facilitated the discernment of prevailing emotions within the data set.

RESULTS

The results showed that patients and their families experienced a range of emotions during sepsis events, including fear, anxiety, sadness, and gratitude. Our analyses revealed an estimated incidence rate ratio (IRR) of 1.35 for fear-related words and a 1.51 IRR for sadness-related words when mentioning "hospital" in sepsis-related experiences. Similarly, mentions of "intensive care unit" were associated with an average occurrence of 12.3 fear-related words and 10.8 sadness-related words. Surviving patients' experiences had an estimated 1.15 IRR for joy-related words, contrasting with discussions around organ failure, which were associated with multiple negative emotions including disgust, anger, fear, and sadness. Furthermore, mentions of "death" were linked to more fear and anger words but fewer joy-related words. Conversely, longer timelines in sepsis events were associated with more joy-related words and fewer fear-related words, potentially indicating improved emotional adaptation over time.

CONCLUSIONS

The study's outcomes underscore the imperative for health care providers to integrate emotional support alongside medical interventions for patients and families affected by sepsis, emphasizing the emotional toll incurred and highlighting the necessity of acknowledgment and resolution, advocating for the use of sentiment analysis as a means to tailor personalized emotional aid, and thereby potentially augmenting both patient and family welfare and overall outcomes.

Collapse

Farokhnia Hamedani M, Esmaeili M, Sun Y, Sheybani E, Javidi G. Paving the way for COVID survivors' psychosocial rehabilitation: Mining topics, sentiments, and their trajectories over time from Reddit. Health Informatics J 2024;30:14604582241240680. [PMID: 38739488 DOI: 10.1177/14604582241240680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]

Abstract

Objective: This study examined major themes and sentiments and their trajectories and interactions over time using subcategories of Reddit data. The aim was to facilitate decision-making for psychosocial rehabilitation. Materials and Methods: We utilized natural language processing techniques, including topic modeling and sentiment analysis, on a dataset consisting of more than 38,000 topics, comments, and posts collected from a subreddit dedicated to the experiences of people who tested positive for COVID-19. In this longitudinal exploratory analysis, we studied the dynamics between the most dominant topics and subjects' emotional states over an 18-month period. Results: Our findings highlight the evolution of the textual and sentimental status of major topics discussed by COVID survivors over an extended period of time during the pandemic. We particularly studied pre- and post-vaccination eras as a turning point in the timeline of the pandemic. The results show that not only does the relevance of topics change over time, but the emotions attached to them also vary. Major social events, such as the administration of vaccines or enforcement of nationwide policies, are also reflected through the discussions and inquiries of social media users. In particular, the emotional state (i.e., sentiments and polarity of their feelings) of those who have experienced COVID personally. Discussion: Cumulative societal knowledge regarding the COVID-19 pandemic impacts the patterns with which people discuss their experiences, concerns, and opinions. The subjects' emotional state with respect to different topics was also impacted by extraneous factors and events, such as vaccination. Conclusion: By mining major topics, sentiments, and trajectories demonstrated in COVID-19 survivors' interactions on Reddit, this study contributes to the emerging body of scholarship on COVID-19 survivors' mental health outcomes, providing insights into the design of mental health support and rehabilitation services for COVID-19 survivors.

Collapse

Molenaar A, Lukose D, Brennan L, Jenkins EL, McCaffrey TA. Using Natural Language Processing to Explore Social Media Opinions on Food Security: Sentiment Analysis and Topic Modeling Study. J Med Internet Res 2024;26:e47826. [PMID: 38512326 PMCID: PMC10995791 DOI: 10.2196/47826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 12/05/2023] [Accepted: 12/20/2023] [Indexed: 03/22/2024] Open

Abstract

BACKGROUND

Social media has the potential to be of great value in understanding patterns in public health using large-scale analysis approaches (eg, data science and natural language processing [NLP]), 2 of which have been used in public health: sentiment analysis and topic modeling; however, their use in the area of food security and public health nutrition is limited.

OBJECTIVE

This study aims to explore the potential use of NLP tools to gather insights from real-world social media data on the public health issue of food security.

METHODS

A search strategy for obtaining tweets was developed using food security terms. Tweets were collected using the Twitter application programming interface from January 1, 2019, to December 31, 2021, filtered for Australia-based users only. Sentiment analysis of the tweets was performed using the Valence Aware Dictionary and Sentiment Reasoner. Topic modeling exploring the content of tweets was conducted using latent Dirichlet allocation with BigML (BigML, Inc). Sentiment, topic, and engagement (the sum of likes, retweets, quotations, and replies) were compared across years.

RESULTS

In total, 38,070 tweets were collected from 14,880 Twitter users. Overall, the sentiment when discussing food security was positive, although this varied across the 3 years. Positive sentiment remained higher during the COVID-19 lockdown periods in Australia. The topic model contained 10 topics (in order from highest to lowest probability in the data set): "Global production," "Food insecurity and health," "Use of food banks," "Giving to food banks," "Family poverty," "Food relief provision," "Global food insecurity," "Climate change," "Australian food insecurity," and "Human rights." The topic "Giving to food banks," which focused on support and donation, had the highest proportion of positive sentiment, and "Global food insecurity," which covered food insecurity prevalence worldwide, had the highest proportion of negative sentiment. When compared with news, there were some events, such as COVID-19 support payment introduction and bushfires across Australia, that were associated with high periods of positive or negative sentiment. Topics related to food insecurity prevalence, poverty, and food relief in Australia were not consistently more prominent during the COVID-19 pandemic than before the pandemic. Negative tweets received substantially higher engagement across 2019 and 2020. There was no clear relationship between topics that were more likely to be positive or negative and have higher or lower engagement, indicating that the identified topics are discrete issues.

CONCLUSIONS

In this study, we demonstrated the potential use of sentiment analysis and topic modeling to explore evolution in conversations on food security using social media data. Future use of NLP in food security requires the context of and interpretation by public health experts and the use of broader data sets, with the potential to track dimensions or events related to food security to inform evidence-based decision-making in this area.

Collapse

Khine AH, Wettayaprasit W, Duangsuwan J. A new word embedding model integrated with medical knowledge for deep learning-based sentiment classification. Artif Intell Med 2024;148:102758. [PMID: 38325934 DOI: 10.1016/j.artmed.2023.102758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 11/19/2023] [Accepted: 12/29/2023] [Indexed: 02/09/2024]

Abstract

The development of intelligent systems that use social media data for decision-making processes in numerous domains such as politics, business, marketing, and finance, has been made possible by the popularity of social media platforms. However, the utilization of textual data from social media in the healthcare management industry is still somewhat limited when it is compared to other industries. Investigating how current machine learning and natural language processing technologies can be used in the healthcare industry to gauge public sentiment is an important study. Earlier works on healthcare sentiment analysis have utilized traditional word embedding models trained on the general and medical corpus. However, integration of medical knowledge to pre-trained word embedding models has not been considered yet. Word embedding models trained on the general corpus led to the problem of lacking medical knowledge and the models trained on the small size of the medical corpus have limitations in capturing semantic and syntactic properties. This research proposes a new word embedding model named Word Embedding Integrated with Medical Knowledge Vector (WE-iMKVec). The proposed model integrates sentiment lexicons and medical knowledgebases into the pre-trained word embedding to enrich the properties of word embedding. A new medical-aware sentiment polarity score is proposed for the utilization in learning neural-network sentiment and these vectors incorporate with the original pre-trained word vectors. The resulting vectors are enriched with lexicon vectors and the medical knowledge vectors: Adverse Drug Reaction (ADR) vector and Unified Medical Language System (UMLS) vector are used to build the proposed WE-iMKVec model. WE-iMKVec is validated on the five different social media healthcare review datasets and the empirical results showed its superiority over traditional word embedding models in medical sentiment analysis. The highest improvement can be found in the patients.info medical condition dataset where the proposed model outperforms three conventional word2vec models (Google-News, PubMed-PMC, and Drug Reviews) by 12.7 %, 31.4 %, and 25.4 % respectively in terms of F1 score.

Collapse

Lossio-Ventura JA, Weger R, Lee AY, Guinee EP, Chung J, Atlas L, Linos E, Pereira F. A Comparison of ChatGPT and Fine-Tuned Open Pre-Trained Transformers (OPT) Against Widely Used Sentiment Analysis Tools: Sentiment Analysis of COVID-19 Survey Data. JMIR Ment Health 2024;11:e50150. [PMID: 38271138 PMCID: PMC10813836 DOI: 10.2196/50150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 11/16/2023] [Accepted: 11/17/2023] [Indexed: 01/27/2024] Open

Abstract

BACKGROUND

Health care providers and health-related researchers face significant challenges when applying sentiment analysis tools to health-related free-text survey data. Most state-of-the-art applications were developed in domains such as social media, and their performance in the health care context remains relatively unknown. Moreover, existing studies indicate that these tools often lack accuracy and produce inconsistent results.

OBJECTIVE

This study aims to address the lack of comparative analysis on sentiment analysis tools applied to health-related free-text survey data in the context of COVID-19. The objective was to automatically predict sentence sentiment for 2 independent COVID-19 survey data sets from the National Institutes of Health and Stanford University.

METHODS

Gold standard labels were created for a subset of each data set using a panel of human raters. We compared 8 state-of-the-art sentiment analysis tools on both data sets to evaluate variability and disagreement across tools. In addition, few-shot learning was explored by fine-tuning Open Pre-Trained Transformers (OPT; a large language model [LLM] with publicly available weights) using a small annotated subset and zero-shot learning using ChatGPT (an LLM without available weights).

RESULTS

The comparison of sentiment analysis tools revealed high variability and disagreement across the evaluated tools when applied to health-related survey data. OPT and ChatGPT demonstrated superior performance, outperforming all other sentiment analysis tools. Moreover, ChatGPT outperformed OPT, exhibited higher accuracy by 6% and higher F-measure by 4% to 7%.

CONCLUSIONS

This study demonstrates the effectiveness of LLMs, particularly the few-shot learning and zero-shot learning approaches, in the sentiment analysis of health-related survey data. These results have implications for saving human labor and improving efficiency in sentiment analysis tasks, contributing to advancements in the field of automated sentiment analysis.

Collapse

Galvez-Hernandez P, Gonzalez-Viana A, Gonzalez-de Paz L, Shankardass K, Muntaner C. Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis. JMIR Public Health Surveill 2024;10:e50379. [PMID: 38190245 PMCID: PMC10804251 DOI: 10.2196/50379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/20/2023] [Accepted: 11/28/2023] [Indexed: 01/09/2024] Open

Abstract

BACKGROUND

Contextual variables that capture the characteristics of delimited geographic or jurisdictional areas are vital for health and social research. However, obtaining data sets with contextual-level data can be challenging in the absence of monitoring systems or public census data.

OBJECTIVE

We describe and implement an 8-step method that combines web scraping, text mining, and spatial overlay analysis (WeTMS) to transform extensive text data from government websites into analyzable data sets containing contextual data for jurisdictional areas.

METHODS

This tutorial describes the method and provides resources for its application by health and social researchers. We used this method to create data sets of health assets aimed at enhancing older adults' social connections (eg, activities and resources such as walking groups and senior clubs) across the 374 health jurisdictions in Catalonia from 2015 to 2022. These assets are registered on a web-based government platform by local stakeholders from various health and nonhealth organizations as part of a national public health program. Steps 1 to 3 involved defining the variables of interest, identifying data sources, and using Python to extract information from 50,000 websites linked to the platform. Steps 4 to 6 comprised preprocessing the scraped text, defining new variables to classify health assets based on social connection constructs, analyzing word frequencies in titles and descriptions of the assets, creating topic-specific dictionaries, implementing a rule-based classifier in R, and verifying the results. Steps 7 and 8 integrate the spatial overlay analysis to determine the geographic location of each asset. We conducted a descriptive analysis of the data sets to report the characteristics of the assets identified and the patterns of asset registrations across areas.

RESULTS

We identified and extracted data from 17,305 websites describing health assets. The titles and descriptions of the activities and resources contained 12,560 and 7301 unique words, respectively. After applying our classifier and spatial analysis algorithm, we generated 2 data sets containing 9546 health assets (5022 activities and 4524 resources) with the potential to enhance social connections among older adults. Stakeholders from 318 health jurisdictions registered identified assets on the platform between July 2015 and December 2022. The agreement rate between the classification algorithm and verified data sets ranged from 62.02% to 99.47% across variables. Leisure and skill development activities were the most prevalent (1844/5022, 36.72%). Leisure and cultural associations, such as social clubs for older adults, were the most common resources (878/4524, 19.41%). Health asset registration varied across areas, ranging between 0 and 263 activities and 0 and 265 resources.

CONCLUSIONS

The sequential use of WeTMS offers a robust method for generating data sets containing contextual-level variables from internet text data. This study can guide health and social researchers in efficiently generating ready-to-analyze data sets containing contextual variables.

Collapse

Niazi F, Elkaim LM, Zadeh Khomami NM, Levett JJ, Weil AG, Hodaie M, Alotaibi NM. Microvascular Decompression and Trigeminal Neuralgia: Patient Sentiment Analysis Using Natural Language Processing. World Neurosurg 2023;180:e528-e536. [PMID: 37778624 DOI: 10.1016/j.wneu.2023.09.107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 09/25/2023] [Indexed: 10/03/2023]

Humayun MM, Brouillette MJ, Fellows LK, Mayo NE. The Patient Generated Index (PGI) as an early-warning system for predicting brain health challenges: a prospective cohort study for people living with Human Immunodeficiency Virus (HIV). Qual Life Res 2023;32:3439-3452. [PMID: 37428407 DOI: 10.1007/s11136-023-03475-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/28/2023] [Indexed: 07/11/2023]

Abstract

PURPOSE

In research people are often asked to fill out questionnaires about their health and functioning and some of the questions refer to serious health concerns. Typically, these concerns are not identified until the statistician analyses the data. An alternative is to use an individualized measure, the Patient Generated Index (PGI) where people are asked to self-nominate areas of concern which can then be dealt with in real-time. This study estimates the extent to which self-nominated areas of concern related to mood, anxiety and cognition predict the presence or occurrence of brain health outcomes such as depression, anxiety, psychological distress, or cognitive impairment among people aging with HIV at study entry and for successive assessments over 27 months.

METHODS

The data comes from participants enrolled in the Positive Brain Health Now (+ BHN) cohort (n = 856). We analyzed the self-nominated areas that participants wrote on the PGI and classified them into seven sentiment groups according to the type of sentiment expressed: emotional, interpersonal, anxiety, depressogenic, somatic, cognitive and positive sentiments. Tokenization was used to convert qualitative data into quantifiable tokens. A longitudinal design was used to link these sentiment groups to the presence or emergence of brain health outcomes as assessed using standardized measures of these constructs: the Hospital Anxiety and Depression Scale (HADS), the Mental Health Index (MHI) of the RAND-36, the Communicating Cognitive Concerns Questionnaire (C3Q) and the Brief Cognitive Ability Measure (B-CAM). Logistic regressions were used to estimate the goodness of fit of each model using the c-statistic.

RESULTS

Emotional sentiments predicted all of the brain health outcomes at all visits with adjusted odds ratios (OR) ranging from 1.61 to 2.00 and c-statistics > 0.73 (good to excellent prediction). Nominating an anxiety sentiment was specific to predicting anxiety and psychological distress (OR 1.65 & 1.52); nominating a cognitive concern was specific to predicting self-reported cognitive ability (OR 4.78). Positive sentiments were predictive of good cognitive function (OR 0.36) and protective of depressive symptoms (OR 0.55).

CONCLUSIONS

This study indicates the value of using this semi-qualitative approach as an early-warning system in predicting brain health outcomes.

Collapse

Levett JJ, Elkaim LM, Weber MH, Yuh SJ, Lasry O, Alotaibi NM, Georgiopoulos M, Berven SH, Weil AG. A twitter analysis of patient and family experience in pediatric spine surgery. Childs Nerv Syst 2023;39:3483-3490. [PMID: 37354288 DOI: 10.1007/s00381-023-06019-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 06/04/2023] [Indexed: 06/26/2023]

Zargaran D, Zargaran A, Sousi S, Knight D, Cook H, Woollard A, Davies J, Weyrich T, Mosahebi A. Quantitative and qualitative analysis of individual experiences post botulinum toxin injection - United Kingdom Survey. SKIN HEALTH AND DISEASE 2023;3:e265. [PMID: 37799369 PMCID: PMC10549845 DOI: 10.1002/ski2.265] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 05/29/2023] [Accepted: 06/13/2023] [Indexed: 10/07/2023]

Abstract

Introduction

In the United Kingdom (UK), complications that arise following the administration of Botulinum Toxin are reported to the Medicines and Health Regulatory Agency (MHRA) via the Yellow Card Reporting Scheme. Over the past decade, there has been a significant increase in the number of non-surgical aesthetic procedures. Concerns have been raised that the MHRA is not fully capturing complications in terms of volume and impact on patients.

Aim

This novel study explores the lived experiences of individuals who have experienced an adverse event following administration of Botulinum Toxin for aesthetic purposes. Using a combination of qualitative and quantitative methodologies, this analysis evaluates data relating to long-lasting physical, psychological, emotional, and financial sequelae of complications arising from cosmetic Botulinum Toxin injections in the UK.

Methods

A mixed method, qualitative and quantitative approach was adopted to gain comprehensive insights into patients' experiences. A focus group which comprised patient representatives, psychologists, and researchers reached a consensus on a 17-question survey which was disseminated via social media channels. Deductive thematic analysis was used to analyse coded themes. Furthermore, for secondary analysis, sentiment analysis was used computationally as an innovative approach to identify and categorise free text responses associated with sentiments using natural language processing (NLP).

Results

In the study, 655 responses were received, with 287 (44%) of respondents completing all questions. The mean age of respondents was 42.6 years old. 94.1% of respondents identified as female. In the sample, 79% of respondents reported an adverse event following their procedure, with the most common event being reported as 'anxiety'. Findings revealed that 69% of respondents reported long-lasting adverse effects. From the responses, 68.4% reported not having recovered physically, 63.5% of respondents stated that they had not recovered emotionally from complications, and 61.7% said that they have not recovered psychologically. In addition, 84% of respondents stated that they do not know who regulates the aesthetics industry. Furthermore, 92% of participants reported that their clinic or practitioner did not inform them about the Yellow Card Reporting Scheme. The sentiment analysis using the AFINN Lexicon yielded adjusted scores ranging from -3 to +2, with a mean value of -1.58.

Conclusion

This is the largest survey in the UK completed by patients who experienced an adverse outcome following the aesthetic administration of Botulinum Toxin. Our study highlights the extent of the challenges faced by patients who experience an adverse event from physical, emotional, psychological, and financial perspectives. The lack of awareness of MHRA reporting structures and the lack of regulation within the UK's cosmetic injectables sector represent a significant public health challenge.

Collapse

Li C, Fu J, Lai J, Sun L, Zhou C, Li W, Jian B, Deng S, Zhang Y, Guo Z, Liu Y, Zhou Y, Xie S, Hou M, Wang R, Chen Q, Wu Y. Construction of an Emotional Lexicon of Patients With Breast Cancer: Development and Sentiment Analysis. J Med Internet Res 2023;25:e44897. [PMID: 37698914 PMCID: PMC10523220 DOI: 10.2196/44897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 08/17/2023] [Accepted: 08/18/2023] [Indexed: 09/13/2023] Open

Abstract

BACKGROUND

The innovative method of sentiment analysis based on an emotional lexicon shows prominent advantages in capturing emotional information, such as individual attitudes, experiences, and needs, which provides a new perspective and method for emotion recognition and management for patients with breast cancer (BC). However, at present, sentiment analysis in the field of BC is limited, and there is no emotional lexicon for this field. Therefore, it is necessary to construct an emotional lexicon that conforms to the characteristics of patients with BC so as to provide a new tool for accurate identification and analysis of the patients' emotions and a new method for their personalized emotion management.

OBJECTIVE

This study aimed to construct an emotional lexicon of patients with BC.

METHODS

Emotional words were obtained by merging the words in 2 general sentiment lexicons, the Chinese Linguistic Inquiry and Word Count (C-LIWC) and HowNet, and the words in text corpora acquired from patients with BC via Weibo, semistructured interviews, and expressive writing. The lexicon was constructed using manual annotation and classification under the guidance of Russell's valence-arousal space. Ekman's basic emotional categories, Lazarus' cognitive appraisal theory of emotion, and a qualitative text analysis based on the text corpora of patients with BC were combined to determine the fine-grained emotional categories of the lexicon we constructed. Precision, recall, and the F1-score were used to evaluate the lexicon's performance.

RESULTS

The text corpora collected from patients in different stages of BC included 150 written materials, 17 interviews, and 6689 original posts and comments from Weibo, with a total of 1,923,593 Chinese characters. The emotional lexicon of patients with BC contained 9357 words and covered 8 fine-grained emotional categories: joy, anger, sadness, fear, disgust, surprise, somatic symptoms, and BC terminology. Experimental results showed that precision, recall, and the F1-score of positive emotional words were 98.42%, 99.73%, and 99.07%, respectively, and those of negative emotional words were 99.73%, 98.38%, and 99.05%, respectively, which all significantly outperformed the C-LIWC and HowNet.

CONCLUSIONS

The emotional lexicon with fine-grained emotional categories conforms to the characteristics of patients with BC. Its performance related to identifying and classifying domain-specific emotional words in BC is better compared to the C-LIWC and HowNet. This lexicon not only provides a new tool for sentiment analysis in the field of BC but also provides a new perspective for recognizing the specific emotional state and needs of patients with BC and formulating tailored emotional management plans.

Collapse

Di Cara NH, Maggio V, Davis OSP, Haworth CMA. Methodologies for Monitoring Mental Health on Twitter: Systematic Review. J Med Internet Res 2023;25:e42734. [PMID: 37155236 PMCID: PMC10203928 DOI: 10.2196/42734] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 11/23/2022] [Accepted: 03/15/2023] [Indexed: 05/10/2023] Open

Abstract

BACKGROUND

The use of social media data to predict mental health outcomes has the potential to allow for the continuous monitoring of mental health and well-being and provide timely information that can supplement traditional clinical assessments. However, it is crucial that the methodologies used to create models for this purpose are of high quality from both a mental health and machine learning perspective. Twitter has been a popular choice of social media because of the accessibility of its data, but access to big data sets is not a guarantee of robust results.

OBJECTIVE

This study aims to review the current methodologies used in the literature for predicting mental health outcomes from Twitter data, with a focus on the quality of the underlying mental health data and the machine learning methods used.

METHODS

A systematic search was performed across 6 databases, using keywords related to mental health disorders, algorithms, and social media. In total, 2759 records were screened, of which 164 (5.94%) papers were analyzed. Information about methodologies for data acquisition, preprocessing, model creation, and validation was collected, as well as information about replicability and ethical considerations.

RESULTS

The 164 studies reviewed used 119 primary data sets. There were an additional 8 data sets identified that were not described in enough detail to include, and 6.1% (10/164) of the papers did not describe their data sets at all. Of these 119 data sets, only 16 (13.4%) had access to ground truth data (ie, known characteristics) about the mental health disorders of social media users. The other 86.6% (103/119) of data sets collected data by searching keywords or phrases, which may not be representative of patterns of Twitter use for those with mental health disorders. The annotation of mental health disorders for classification labels was variable, and 57.1% (68/119) of the data sets had no ground truth or clinical input on this annotation. Despite being a common mental health disorder, anxiety received little attention.

CONCLUSIONS

The sharing of high-quality ground truth data sets is crucial for the development of trustworthy algorithms that have clinical and research utility. Further collaboration across disciplines and contexts is encouraged to better understand what types of predictions will be useful in supporting the management and identification of mental health disorders. A series of recommendations for researchers in this field and for the wider research community are made, with the aim of enhancing the quality and utility of future outputs.

Collapse

Malhotra K, Dagli MM, Santangelo G, Wathen C, Ghenbot Y, Goyal K, Bawa A, Ozturk AK, Welch WC. The Digital Impact of Neurosurgery Awareness Month: Retrospective Infodemiology Study. JMIR Form Res 2023;7:e44754. [PMID: 37155226 DOI: 10.2196/44754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 02/07/2023] [Accepted: 04/17/2023] [Indexed: 05/10/2023] Open

Abstract

BACKGROUND

Neurosurgery Awareness Month (August) was initiated by the American Association of Neurological Surgeons with the aim of bringing neurological conditions to the forefront and educating the public about these conditions. Digital media is an important tool for disseminating information and connecting with influencers, general public, and other stakeholders. Hence, it is crucial to understand the impact of awareness campaigns such as Neurosurgery Awareness Month to optimize resource allocation, quantify the efficiency and reach of these initiatives, and identify areas for improvement.

OBJECTIVE

The purpose of our study was to examine the digital impact of Neurosurgery Awareness Month globally and identify areas for further improvement.

METHODS

We used 4 social media (Twitter) assessment tools (Sprout Social, SocioViz, Sentiment Viz, and Symplur) and Google Trends to extract data using various search queries. Using regression analysis, trends were studied in the total number of tweets posted in August between 2014 and 2022. Two search queries were used in this analysis: one specifically targeting tweets related to Neurosurgery Awareness Month and the other isolating all neurosurgery-related posts. Total impressions and top influencers for #neurosurgery were calculated using Symplur's machine learning algorithm. To study the context of the tweets, we used SocioViz to isolate the top 100 popular hashtags, keywords, and collaborations between influencers. Network analysis was performed to illustrate the interactions and connections within the digital media environment using ForceAtlas2 model. Sentiment analysis was done to study the underlying emotion of the tweets. Google Trends was used to study the global search interest by studying relative search volume data.

RESULTS

A total of 10,007 users were identified as tweeting about neurosurgery during Neurosurgery Awareness Month using the "#neurosurgery" hashtag. These tweets generated over 29.14 million impressions globally. Of the top 10 most influential users, 5 were faculty neurosurgeons at US university hospitals. Other influential users included notable organizations and journals in the field of neurosurgery. The network analysis of the top 100 influencers showed a collaboration rate of 81%. However, only 1.6% of the total neurosurgery tweets were advocating about neurosurgery awareness during Neurosurgery Awareness Month, and only 13 tweets were posted by verified users using the #neurosurgeryawarenessmonth hashtag. The sentiment analysis revealed that the majority of the tweets about Neurosurgery Awareness Month were pleasant with subdued emotion.

CONCLUSIONS

The global digital impact of Neurosurgery Awareness Month is nascent, and support from other international organizations and neurosurgical influencers is needed to yield a significant digital reach. Increasing collaboration and involvement from underrepresented communities may help to increase the global reach. By better understanding the digital impact of Neurosurgery Awareness Month, future health care awareness campaigns can be optimized to increase global awareness of neurosurgery and the challenges facing the field.

Collapse

Khaleghparast S, Maleki M, Hajianfar G, Soumari E, Oveisi M, Golandouz HM, Noohi F, Dehaki MG, Golpira R, Mazloomzadeh S, Arabian M, Kalayinia S. Development of a patients' satisfaction analysis system using machine learning and lexicon-based methods. BMC Health Serv Res 2023;23:280. [PMID: 36959630 PMCID: PMC10037842 DOI: 10.1186/s12913-023-09260-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Accepted: 03/07/2023] [Indexed: 03/25/2023] Open

Abstract

BACKGROUND

Patients' rights are integral to medical ethics. This study aimed to perform sentiment analysis and opinion mining on patients' messages by a combination of lexicon-based and machine learning methods to identify positive or negative comments and to determine the different ward and staff names mentioned in patients' messages.

METHODS

The level of satisfaction and observance of the rights of 250 service recipients of the hospital was evaluated through the related checklists by the evaluator. In total, 822 Persian messages, composed of 540 negative and 282 positive comments, were collected and labeled by the evaluator. Pre-processing was performed on the messages and followed by 2 feature vectors which were extracted from the messages, including the term frequency-inverse document frequency (TFIDF) vector and a combination of the multifeature (MF) (a lexicon-based method) and TFIDF (MF + TFIDF) vectors. Six feature selectors and 5 classifiers were used in this study. For the evaluations, 5-fold cross-validation with different metrics including area under the receiver operating characteristic curve (AUC), accuracy (ACC), F1 score, sensitivity (SEN), specificity (SPE) and Precision-Recall Curves (PRC) were reported. Message tag detection, which featured different hospital wards and identified staff names mentioned in the study patients' messages, was implemented by the lexicon-based method.

RESULTS

The best classifier was Multinomial Naïve Bayes in combination with MF + TFIDF feature vector and SelectFromModel (SFM) feature selection (ACC = 0.89 ± 0.03, AUC = 0.87 ± 0.03, F1 = 0.92 ± 0.03, SEN = 0.93 ± 0.04, and SPE = 0.82 ± 0.02, PRC-AUC = 0.97). Two methods of assessment by the evaluator and artificial intelligence as well as survey systems were compared.

CONCLUSION

Our results demonstrated that the lexicon-based method, in combination with machine learning classifiers, could extract sentiments in patients' comments and classify them into positive and negative categories. We also developed an online survey system to analyze patients' satisfaction in different wards and to remove conventional assessments by the evaluator.

Collapse

Denecke K, Reichenpfader D. Sentiment analysis of clinical narratives: A scoping review. J Biomed Inform 2023;140:104336. [PMID: 36958461 DOI: 10.1016/j.jbi.2023.104336] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 03/06/2023] [Accepted: 03/10/2023] [Indexed: 03/25/2023]

Abstract

A clinical sentiment is a judgment, thought or attitude promoted by an observation with respect to the health of an individual. Sentiment analysis has drawn attention in the healthcare domain for secondary use of data from clinical narratives, with a variety of applications including predicting the likelihood of emerging mental illnesses or clinical outcomes. The current state of research has not yet been summarized. This study presents results from a scoping review aiming at providing an overview of sentiment analysis of clinical narratives in order to summarize existing research and identify open research gaps. The scoping review was carried out in line with the PRISMA-ScR (Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews) guideline. Studies were identified by searching 4 electronic databases (e.g., PubMed, IEEE Xplore) in addition to conducting backward and forward reference list checking of the included studies. We extracted information on use cases, methods and tools applied, used datasets and performance of the sentiment analysis approach. Of 1,200 citations retrieved, 29 unique studies were included in the review covering a period of 8 years. Most studies apply general domain tools (e.g. TextBlob) and sentiment lexicons (e.g. SentiWordNet) for realizing use cases such as prediction of clinical outcomes; others proposed new domain-specific sentiment analysis approaches based on machine learning. Accuracy values between 71.5-88.2% are reported. Data used for evaluation and test are often retrieved from MIMIC databases or i2b2 challenges. Latest developments related to artificial neural networks are not yet fully considered in this domain. We conclude that future research should focus on developing a gold standard sentiment lexicon, adapted to the specific characteristics of clinical narratives. Efforts have to be made to either augment existing or create new high-quality labeled data sets of clinical narratives. Last, the suitability of state-of-the-art machine learning methods for natural language processing and in particular transformer-based models should be investigated for their application for sentiment analysis of clinical narratives.

Collapse

Wu J, Zhang G, Xing Y, Liu Y, Zhang Z, Dong Y, Herrera-Viedma E. A sentiment analysis driven method based on public and personal preferences with correlated attributes to select online doctors. APPL INTELL 2023;53:1-22. [PMID: 36844914 PMCID: PMC9940095 DOI: 10.1007/s10489-023-04485-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/23/2023] [Indexed: 02/25/2023]

Cui J, Wang Z, Ho SB, Cambria E. Survey on sentiment analysis: evolution of research methods and topics. Artif Intell Rev 2023;56:1-42. [PMID: 36628328 PMCID: PMC9816550 DOI: 10.1007/s10462-022-10386-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/29/2022] [Indexed: 01/09/2023]

Textual emotion detection in health: Advances and applications. J Biomed Inform 2023;137:104258. [PMID: 36528329 DOI: 10.1016/j.jbi.2022.104258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 11/24/2022] [Accepted: 11/27/2022] [Indexed: 12/23/2022]

Abstract

Textual Emotion Detection (TED) is a rapidly growing area in Natural Language Processing (NLP) that aims to detect emotions expressed through text. In this paper, we provide a review of the latest research and development in TED as applied in health and medicine. We focus on medical and non-medical data types, use cases, and methods where TED has been integral in supporting decision-making. The application of NLP technologies in health, and particularly TED, requires high confidence that these technologies and technology-aided treatment will first, do no harm. Therefore, this review also aims to assess the accuracy of TED systems and provide an update on the state of the technology. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines were used in this review. With a specific focus on the identification of different human emotions in text, the more general sentiment analysis studies that only recognize the polarity of text were excluded. A total of 66 papers met the inclusion criteria. This review found that TED in health and medicine is mainly used in the detection of depression, suicidal ideation, and the mental status of patients with asthma, Alzheimer's disease, cancer, and diabetes with major data sources of social media, healthcare services, and counseling centers. Approximately, 44% of the research in the domain is related to COVID-19, investigating the public health response to vaccinations and the emotional response of the public. In most cases, deep learning-based NLP techniques were found to be preferred over other methods due to their superior performance. Developing methods for implementing and evaluating dimensional emotional models, resolving annotation challenges by utilizing health-related lexicons, and using deep learning techniques for multi-faceted and real-time applications were found to be among the main avenues for further development of TED applications in health.

Collapse

Goldman HH, Porcino J, Divita G, Zirikly A, Desmet B, Sacco M, Marfeo E, McDonough C, Rasch E, Chan L. Informatics Research on Mental Health Functioning: Decision Support for the Social Security Administration Disability Program. Psychiatr Serv 2023;74:56-62. [PMID: 35652194 PMCID: PMC10501504 DOI: 10.1176/appi.ps.202200056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

Affiliation(s)

Howard H Goldman Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Julia Porcino Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Guy Divita Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Ayah Zirikly Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Bart Desmet Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Maryanne Sacco Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Elizabeth Marfeo Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Christine McDonough Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Elizabeth Rasch Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
Leighton Chan Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)

Collapse

Canny A, Mason B, Atkins C, Patterson R, Moussa L, Boyd K. Online public information about advance care planning: An evaluation of UK and international websites. Digit Health 2023;9:20552076231180438. [PMID: 37377564 PMCID: PMC10291539 DOI: 10.1177/20552076231180438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 05/19/2023] [Indexed: 06/29/2023] Open

Abstract

Introduction

Healthcare information is increasingly internet-based. Standards require websites to be 'perceivable, operable, understandable and robust' with relevant content for citizens in appropriate language. This study examined UK and international websites offering public healthcare information on advance care planning (ACP) using current recommendations for website accessibility and content and informed by a public engagement exercise.

Methods

Google searches identified websites in English from health service providers, governmental or third sector organisations based in the UK and internationally. Target keywords that would be used by a member of the public informed the search terms. Data extraction used criterion-based assessment and web content analysis of the first two pages of each search result. Public patient representatives as key members of the multidisciplinary research team guided the development of the evaluation criteria.

Results

A total of 1158 online searches identified 89 websites, reduced to 29 by inclusion/exclusion criteria. Most sites met international recommendations for knowledge/understanding about ACP. Differences in terminology, lack of information about ACP limitations and non-adherence to recommended reading levels, accessibility standards and translation options were apparent. Sites targeting members of the public used more positive, non-technical language than those for both professional and lay users.

Conclusions

Some websites met accepted standards required to facilitate understanding and public engagement in ACP. Others could be improved significantly. Website providers have important roles and responsibilities in increasing people's understanding of their health conditions, future care options and ability to take an active role in planning for their health and care.

Collapse

Lee ITL, Juang SE, Chen ST, Ko C, Ma KSK. Sentiment analysis of tweets on alopecia areata, hidradenitis suppurativa, and psoriasis: Revealing the patient experience. Front Med (Lausanne) 2022;9:996378. [PMID: 36388938 PMCID: PMC9660311 DOI: 10.3389/fmed.2022.996378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/26/2022] [Indexed: 11/26/2022] Open

Abstract

Background

Chronic dermatologic disorders can cause significant emotional distress. Sentiment analysis of disease-related tweets helps identify patients' experiences of skin disease.

Objective

To analyze the expressed sentiments in tweets related to alopecia areata (AA), hidradenitis suppurativa (HS), and psoriasis (PsO) in comparison to fibromyalgia (FM).

Methods

This is a cross-sectional analysis of Twitter users' expressed sentiment on AA, HS, PsO, and FM. Tweets related to the diseases of interest were identified with keywords and hashtags for one month (April, 2022) using the Twitter standard application programming interface (API). Text, account types, and numbers of retweets and likes were collected. The sentiment analysis was performed by the R "tidytext" package using the AFINN lexicon.

Results

A total of 1,505 tweets were randomly extracted, of which 243 (16.15%) referred to AA, 186 (12.36%) to HS, 510 (33.89%) to PsO, and 566 (37.61%) to FM. The mean sentiment score was -0.239 ± 2.90. AA, HS, and PsO had similar sentiment scores (p = 0.482). Although all skin conditions were associated with a negative polarity, their average was significantly less negative than FM (p < 0.0001). Tweets from private accounts were more negative, especially for AA (p = 0.0082). Words reflecting patients' psychological states varied in different diseases. "Anxiety" was observed in posts on AA and FM but not posts on HS and PsO, while "crying" was frequently used in posts on HS. There was no definite correlation between the sentiment score and the number of retweets or likes, although negative AA tweets from public accounts received more retweets (p = 0.03511) and likes (p = 0.0228).

Conclusion

The use of Twitter sentiment analysis is a promising method to document patients' experience of skin diseases, which may improve patient care through bridging misconceptions and knowledge gaps between patients and healthcare professionals.

Collapse

Natural Language Processing Techniques for Text Classification of Biomedical Documents: A Systematic Review. INFORMATION 2022. [DOI: 10.3390/info13100499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Chen JS, Baxter SL. Applications of natural language processing in ophthalmology: present and future. Front Med (Lausanne) 2022;9:906554. [PMID: 36004369 PMCID: PMC9393550 DOI: 10.3389/fmed.2022.906554] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 05/31/2022] [Indexed: 11/13/2022] Open

Abstract

Advances in technology, including novel ophthalmic imaging devices and adoption of the electronic health record (EHR), have resulted in significantly increased data available for both clinical use and research in ophthalmology. While artificial intelligence (AI) algorithms have the potential to utilize these data to transform clinical care, current applications of AI in ophthalmology have focused mostly on image-based deep learning. Unstructured free-text in the EHR represents a tremendous amount of underutilized data in big data analyses and predictive AI. Natural language processing (NLP) is a type of AI involved in processing human language that can be used to develop automated algorithms using these vast quantities of available text data. The purpose of this review was to introduce ophthalmologists to NLP by (1) reviewing current applications of NLP in ophthalmology and (2) exploring potential applications of NLP. We reviewed current literature published in Pubmed and Google Scholar for articles related to NLP and ophthalmology, and used ancestor search to expand our references. Overall, we found 19 published studies of NLP in ophthalmology. The majority of these publications (16) focused on extracting specific text such as visual acuity from free-text notes for the purposes of quantitative analysis. Other applications included: domain embedding, predictive modeling, and topic modeling. Future ophthalmic applications of NLP may also focus on developing search engines for data within free-text notes, cleaning notes, automated question-answering, and translating ophthalmology notes for other specialties or for patients, especially with a growing interest in open notes. As medicine becomes more data-oriented, NLP offers increasing opportunities to augment our ability to harness free-text data and drive innovations in healthcare delivery and treatment of ophthalmic conditions.

Collapse

Ramjee D, Smith LH, Doanvo A, Charpignon ML, McNulty-Nebel A, Lett E, Desai AN, Majumder MS. Evaluating criminal justice reform during COVID-19: The need for a novel sentiment analysis package. PLOS DIGITAL HEALTH 2022;1:e0000063. [PMID: 36812565 PMCID: PMC9931240 DOI: 10.1371/journal.pdig.0000063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 05/15/2022] [Indexed: 11/18/2022]

Abstract

The health and safety of incarcerated persons and correctional personnel have been prominent in the U.S. news media discourse during the COVID-19 pandemic. Examining changing attitudes toward the health of the incarcerated population is imperative to better assess the extent to which the general public favors criminal justice reform. However, existing natural language processing lexicons that underlie current sentiment analysis (SA) algorithms may not perform adequately on news articles related to criminal justice due to contextual complexities. News discourse during the pandemic has highlighted the need for a novel SA lexicon and algorithm (i.e., an SA package) tailored for examining public health policy in the context of the criminal justice system. We analyzed the performance of existing SA packages on a corpus of news articles at the intersection of COVID-19 and criminal justice collected from state-level outlets between January and May 2020. Our results demonstrated that sentence sentiment scores provided by three popular SA packages can differ considerably from manually-curated ratings. This dissimilarity was especially pronounced when the text was more polarized, whether negatively or positively. A randomly selected set of 1,000 manually scored sentences, and the corresponding binary document term matrices, were used to train two new sentiment prediction algorithms (i.e., linear regression and random forest regression) to verify the performance of the manually-curated ratings. By better accounting for the unique context in which incarceration-related terminologies are used in news media, both of our proposed models outperformed all existing SA packages considered for comparison. Our findings suggest that there is a need to develop a novel lexicon, and potentially an accompanying algorithm, for analysis of text related to public health within the criminal justice system, as well as criminal justice more broadly.

Collapse

Kingsley J, Diekmann L, Egerer MH, Lin BB, Ossola A, Marsh P. Experiences of gardening during the early stages of the COVID-19 pandemic. Health Place 2022;76:102854. [PMID: 35842955 PMCID: PMC9242931 DOI: 10.1016/j.healthplace.2022.102854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 06/22/2022] [Accepted: 06/23/2022] [Indexed: 11/24/2022]

Walsh J, Dwumfour C, Cave J, Griffiths F. Spontaneously generated online patient experience data - how and why is it being used in health research: an umbrella scoping review. BMC Med Res Methodol 2022;22:139. [PMID: 35562661 PMCID: PMC9106384 DOI: 10.1186/s12874-022-01610-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 04/13/2022] [Indexed: 11/10/2022] Open

The Case of Aspect in Sentiment Analysis: Seeking Attention or Co-Dependency? MACHINE LEARNING AND KNOWLEDGE EXTRACTION 2022. [DOI: 10.3390/make4020021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Conducting sentiment analysis: Lei L. & Liu D. Elements in Corpus Linguistics, CUP. LANG RESOUR EVAL 2022. [DOI: 10.1007/s10579-022-09593-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Jang J, Yoon S, Son G, Kang M, Choeh JY, Choi KH. Predicting Personality and Psychological Distress Using Natural Language Processing: A Study Protocol. Front Psychol 2022;13:865541. [PMID: 35465529 PMCID: PMC9022676 DOI: 10.3389/fpsyg.2022.865541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 03/22/2022] [Indexed: 11/13/2022] Open

Abstract

Background

Self-report multiple choice questionnaires have been widely utilized to quantitatively measure one's personality and psychological constructs. Despite several strengths (e.g., brevity and utility), self-report multiple choice questionnaires have considerable limitations in nature. With the rise of machine learning (ML) and Natural language processing (NLP), researchers in the field of psychology are widely adopting NLP to assess psychological construct to predict human behaviors. However, there is a lack of connections between the work being performed in computer science and that of psychology due to small data sets and unvalidated modeling practices.

Aims

The current article introduces the study method and procedure of phase II which includes the interview questions for the five-factor model (FFM) of personality developed in phase I. This study aims to develop the interview (semi-structured) and open-ended questions for the FFM-based personality assessments, specifically designed with experts in the field of clinical and personality psychology (phase 1), and to collect the personality-related text data using the interview questions and self-report measures on personality and psychological distress (phase 2). The purpose of the study includes examining the relationship between natural language data obtained from the interview questions, measuring the FFM personality constructs, and psychological distress to demonstrate the validity of the natural language-based personality prediction.

Methods

Phase I (pilot) study was conducted to fifty-nine native Korean adults to acquire the personality-related text data from the interview (semi-structured) and open-ended questions based on the FFM of personality. The interview questions were revised and finalized with the feedback from the external expert committee, consisting of personality and clinical psychologists. Based on the established interview questions, a total of 300 Korean adults will be recruited using a convenience sampling method via online survey. The text data collected from interviews will be analyzed using the natural language processing. The results of the online survey including demographic data, depression, anxiety, and personality inventories will be analyzed together in the model to predict individuals' FFM of personality and the level of psychological distress (phase 2).

Collapse

Verma S. Sentiment analysis of public services for smart society: Literature review and future research directions. GOVERNMENT INFORMATION QUARTERLY 2022. [DOI: 10.1016/j.giq.2022.101708] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Davis BD, McKnight DE, Teodorescu D, Quan-Haase A, Chunara R, Fyshe A, Lizotte DJ. Quantifying depression-related language on social media during the COVID-19 pandemic. Int J Popul Data Sci 2022;5:1716. [PMID: 35516163 PMCID: PMC9052361 DOI: 10.23889/ijpds.v5i4.1716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open

Lu TJ, Nguyen AXL, Trinh XV, Wu AY. Sentiment Analysis Surrounding Blepharoplasty in Online Health Forums. PLASTIC AND RECONSTRUCTIVE SURGERY-GLOBAL OPEN 2022;10:e4213. [PMID: 35492229 PMCID: PMC9038503 DOI: 10.1097/gox.0000000000004213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 01/31/2022] [Indexed: 06/14/2023]

Ramadi KB, Mehta R, He D, Chao S, Chu Z, Atun R, Nguyen FT. Grass-roots entrepreneurship complements traditional top-down innovation in lung and breast cancer. NPJ Digit Med 2022;5:10. [PMID: 35064182 PMCID: PMC8782943 DOI: 10.1038/s41746-021-00545-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 11/30/2021] [Indexed: 11/08/2022] Open

Park S, Choi SH, Song YK, Kwon JW. Comparison of Online Patient Reviews and National Pharmacovigilance Data for Tramadol-Related Adverse Events: Comparative Observational Study. JMIR Public Health Surveill 2022;8:e33311. [PMID: 34982723 PMCID: PMC8767477 DOI: 10.2196/33311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 11/08/2021] [Accepted: 11/27/2021] [Indexed: 11/13/2022] Open

Artificial Intelligence in Surgery. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Rahim AIA, Ibrahim MI, Chua SL, Musa KI. Hospital Facebook Reviews Analysis Using a Machine Learning Sentiment Analyzer and Quality Classifier. Healthcare (Basel) 2021;9:1679. [PMID: 34946405 PMCID: PMC8701188 DOI: 10.3390/healthcare9121679] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 11/30/2021] [Accepted: 12/02/2021] [Indexed: 02/05/2023] Open

Kentour M, Lu J. An investigation into the deep learning approach in sentimental analysis using graph-based theories. PLoS One 2021;16:e0260761. [PMID: 34855856 PMCID: PMC8638889 DOI: 10.1371/journal.pone.0260761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 11/16/2021] [Indexed: 11/24/2022] Open

Abstract

Sentiment analysis is a branch of natural language analytics that aims to correlate what is expressed which comes normally within unstructured format with what is believed and learnt. Several attempts have tried to address this gap (i.e., Naive Bayes, RNN, LSTM, word embedding, etc.), even though the deep learning models achieved high performance, their generative process remains a "black-box" and not fully disclosed due to the high dimensional feature and the non-deterministic weights assignment. Meanwhile, graphs are becoming more popular when modeling complex systems while being traceable and understood. Here, we reveal that a good trade-off transparency and efficiency could be achieved with a Deep Neural Network by exploring the Credit Assignment Paths theory. To this end, we propose a novel algorithm which alleviates the features' extraction mechanism and attributes an importance level of selected neurons by applying a deterministic edge/node embeddings with attention scores on the input unit and backward path respectively. We experiment on the Twitter Health News dataset were the model has been extended to approach different approximations (tweet/aspect and tweets' source levels, frequency, polarity/subjectivity), it was also transparent and traceable. Moreover, results of comparing with four recent models on same data corpus for tweets analysis showed a rapid convergence with an overall accuracy of ≈83% and 94% of correctly identified true positive sentiments. Therefore, weights can be ideally assigned to specific active features by following the proposed method. As opposite to other compared works, the inferred features are conditioned through the users' preferences (i.e., frequency degree) and via the activation's derivatives (i.e., reject feature if not scored). Future direction will address the inductive aspect of graph embeddings to include dynamic graph structures and expand the model resiliency by considering other datasets like SemEval task7, covid-19 tweets, etc.

Collapse

How Does the World View China’s Carbon Policy? A Sentiment Analysis on Twitter Data. ENERGIES 2021. [DOI: 10.3390/en14227782] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Walsh J, Cave J, Griffiths F. Spontaneously Generated Online Patient Experience of Modafinil: A Qualitative and NLP Analysis. Front Digit Health 2021;3:598431. [PMID: 34713085 PMCID: PMC8521895 DOI: 10.3389/fdgth.2021.598431] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 01/27/2021] [Indexed: 11/16/2022] Open

Abstract

Objective: To compare the findings from a qualitative and a natural language processing (NLP) based analysis of online patient experience posts on patient experience of the effectiveness and impact of the drug Modafinil.

Methods: Posts (n = 260) from 5 online social media platforms where posts were publicly available formed the dataset/corpus. Three platforms asked posters to give a numerical rating of Modafinil. Thematic analysis: data was coded and themes generated. Data were categorized into PreModafinil, Acquisition, Dosage, and PostModafinil and compared to identify each poster's own view of whether taking Modafinil was linked to an identifiable outcome. We classified this as positive, mixed, negative, or neutral and compared this with numerical ratings. NLP: Corpus text was speech tagged and keywords and key terms extracted. We identified the following entities: drug names, condition names, symptoms, actions, and side-effects. We searched for simple relationships, collocations, and co-occurrences of entities. To identify causal text, we split the corpus into PreModafinil and PostModafinil and used n-gram analysis. To evaluate sentiment, we calculated the polarity of each post between −1 (negative) and +1 (positive). NLP results were mapped to qualitative results.

Results: Posters had used Modafinil for 33 different primary conditions. Eight themes were identified: the reason for taking (condition or symptom), impact of symptoms, acquisition, dosage, side effects, other interventions tried or compared to, effectiveness of Modafinil, and quality of life outcomes. Posters reported perceived effectiveness as follows: 68% positive, 12% mixed, 18% negative. Our classification was consistent with poster ratings. Of the most frequent 100 keywords/keyterms identified by term extraction 88/100 keywords and 84/100 keyterms mapped directly to the eight themes. Seven keyterms indicated negation and temporal states. Sentiment was as follows 72% positive sentiment 4% neutral 24% negative. Matching of sentiment between the qualitative and NLP methods was accurate in 64.2% of posts. If we allow for one category difference matching was accurate in 85% of posts.

Conclusions: User generated patient experience is a rich resource for evaluating real world effectiveness, understanding patient perspectives, and identifying research gaps. Both methods successfully identified the entities and topics contained in the posts. In contrast to current evidence, posters with a wide range of other conditions found Modafinil effective. Perceived causality and effectiveness were identified by both methods demonstrating the potential to augment existing knowledge.

Collapse

Rahim AIA, Ibrahim MI, Musa KI, Chua SL, Yaacob NM. Patient Satisfaction and Hospital Quality of Care Evaluation in Malaysia Using SERVQUAL and Facebook. Healthcare (Basel) 2021;9:1369. [PMID: 34683050 PMCID: PMC8544585 DOI: 10.3390/healthcare9101369] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 09/27/2021] [Accepted: 10/12/2021] [Indexed: 02/05/2023] Open

Abstract

Social media sites, dubbed patient online reviews (POR), have been proposed as new methods for assessing patient satisfaction and monitoring quality of care. However, the unstructured nature of POR data derived from social media creates a number of challenges. The objectives of this research were to identify service quality (SERVQUAL) dimensions automatically from hospital Facebook reviews using a machine learning classifier, and to examine their associations with patient dissatisfaction. From January 2017 to December 2019, empirical research was conducted in which POR were gathered from the official Facebook page of Malaysian public hospitals. To find SERVQUAL dimensions in POR, a machine learning topic classification utilising supervised learning was developed, and this study's objective was established using logistic regression analysis. It was discovered that 73.5% of patients were satisfied with the public hospital service, whereas 26.5% were dissatisfied. SERVQUAL dimensions identified were 13.2% reviews of tangible, 68.9% of reliability, 6.8% of responsiveness, 19.5% of assurance, and 64.3% of empathy. After controlling for hospital variables, all SERVQUAL dimensions except tangible and assurance were shown to be significantly related with patient dissatisfaction (reliability, p < 0.001; responsiveness, p = 0.016; and empathy, p < 0.001). Rural hospitals had a higher probability of patient dissatisfaction (p < 0.001). Therefore, POR, assisted by machine learning technologies, provided a pragmatic and feasible way for capturing patient perceptions of care quality and supplementing conventional patient satisfaction surveys. The findings offer critical information that will assist healthcare authorities in capitalising on POR by monitoring and evaluating the quality of services in real time.

Collapse

A. Rahim AI, Ibrahim MI, Musa KI, Chua SL, Yaacob NM. Assessing Patient-Perceived Hospital Service Quality and Sentiment in Malaysian Public Hospitals Using Machine Learning and Facebook Reviews. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:9912. [PMID: 34574835 PMCID: PMC8466628 DOI: 10.3390/ijerph18189912] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 09/17/2021] [Accepted: 09/18/2021] [Indexed: 02/05/2023]

Abstract

Social media is emerging as a new avenue for hospitals and patients to solicit input on the quality of care. However, social media data is unstructured and enormous in volume. Moreover, no empirical research on the use of social media data and perceived hospital quality of care based on patient online reviews has been performed in Malaysia. The purpose of this study was to investigate the determinants of positive sentiment expressed in hospital Facebook reviews in Malaysia, as well as the association between hospital accreditation and sentiments expressed in Facebook reviews. From 2017 to 2019, we retrieved comments from 48 official public hospitals' Facebook pages. We used machine learning to build a sentiment analyzer and service quality (SERVQUAL) classifier that automatically classifies the sentiment and SERVQUAL dimensions. We utilized logistic regression analysis to determine our goals. We evaluated a total of 1852 reviews and our machine learning sentiment analyzer detected 72.1% of positive reviews and 27.9% of negative reviews. We classified 240 reviews as tangible, 1257 reviews as trustworthy, 125 reviews as responsive, 356 reviews as assurance, and 1174 reviews as empathy using our machine learning SERVQUAL classifier. After adjusting for hospital characteristics, all SERVQUAL dimensions except Tangible were associated with positive sentiment. However, no significant relationship between hospital accreditation and online sentiment was discovered. Facebook reviews powered by machine learning algorithms provide valuable, real-time data that may be missed by traditional hospital quality assessments. Additionally, online patient reviews offer a hitherto untapped indication of quality that may benefit all healthcare stakeholders. Our results confirm prior studies and support the use of Facebook reviews as an adjunct method for assessing the quality of hospital services in Malaysia.

Collapse

Žunić A, Corcoran P, Spasić I. Aspect-based sentiment analysis with graph convolution over syntactic dependencies. Artif Intell Med 2021;119:102138. [PMID: 34531007 DOI: 10.1016/j.artmed.2021.102138] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 06/05/2021] [Accepted: 08/03/2021] [Indexed: 11/18/2022]