1
|
Wu D, Shead H, Ren Y, Raynor P, Tao Y, Villanueva H, Hung P, Li X, Brookshire RG, Eichelberger K, Guille C, Litwin AH, Olatosi B. Uncovering the Complexity of Perinatal Polysubstance Use Disclosure Patterns on X: Mixed Methods Study. J Med Internet Res 2024; 26:e53171. [PMID: 39302713 DOI: 10.2196/53171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 05/06/2024] [Accepted: 06/11/2024] [Indexed: 09/22/2024] Open
Abstract
BACKGROUND According to the Morbidity and Mortality Weekly Report, polysubstance use among pregnant women is prevalent, with 38.2% of those who consume alcohol also engaging in the use of one or more additional substances. However, the underlying mechanisms, contexts, and experiences of polysubstance use are unclear. Organic information is abundant on social media such as X (formerly Twitter). Traditional quantitative and qualitative methods, as well as natural language processing techniques, can be jointly used to derive insights into public opinions, sentiments, and clinical and public health policy implications. OBJECTIVE Based on perinatal polysubstance use (PPU) data that we extracted on X from May 1, 2019, to October 31, 2021, we proposed two primary research questions: (1) What is the overall trend and sentiment of PPU discussions on X? (2) Are there any distinct patterns in the discussion trends of PPU-related tweets? If so, what are the implications for perinatal care and associated public health policies? METHODS We used X's application programming interface to extract >6 million raw tweets worldwide containing ≥2 prenatal health- and substance-related keywords provided by our clinical team. After removing all non-English-language tweets, non-US tweets, and US tweets without disclosed geolocations, we obtained 4848 PPU-related US tweets. We then evaluated them using a mixed methods approach. The quantitative analysis applied frequency, trend analysis, and several natural language processing techniques such as sentiment analysis to derive statistics to preview the corpus. To further understand semantics and clinical insights among these tweets, we conducted an in-depth thematic content analysis with a random sample of 500 PPU-related tweets with a satisfying κ score of 0.7748 for intercoder reliability. RESULTS Our quantitative analysis indicates the overall trends, bigram and trigram patterns, and negative sentiments were more dominant in PPU tweets (2490/4848, 51.36%) than in the non-PPU sample (1323/4848, 27.29%). Paired polysubstance use (4134/4848, 85.27%) was the most common, with the combination alcohol and drugs identified as the most mentioned. From the qualitative analysis, we identified 3 main themes: nonsubstance, single substance, and polysubstance, and 4 subthemes to contextualize the rationale of underlying PPU behaviors: lifestyle, perceptions of others' drug use, legal implications, and public health. CONCLUSIONS This study identified underexplored, emerging, and important topics related to perinatal PPU, with significant stigmas and legal ramifications discussed on X. Overall, public sentiments on PPU were mixed, encompassing negative (2490/4848, 51.36%), positive (1884/4848, 38.86%), and neutral (474/4848, 9.78%) sentiments. The leading substances in PPU were alcohol and drugs, and the normalization of PPU discussed on X is becoming more prevalent. Thus, this study provides valuable insights to further understand the complexity of PPU and its implications for public health practitioners and policy makers to provide proper access and support to individuals with PPU.
Collapse
Affiliation(s)
- Dezhi Wu
- Department of Integrated Information Technology, University of South Carolina, Columbia, SC, United States
| | - Hannah Shead
- Department of Mathematics, Augusta University, Augusta, GA, United States
| | - Yang Ren
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, United States
| | - Phyllis Raynor
- College of Nursing, University of South Carolina, Columbia, SC, United States
| | - Youyou Tao
- Department of Information Systems and Business Analytics, Loyola Marymount University, Los Angeles, CA, United States
| | - Harvey Villanueva
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, United States
| | - Peiyin Hung
- Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Xiaoming Li
- Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Robert G Brookshire
- Department of Integrated Information Technology, University of South Carolina, Columbia, SC, United States
| | - Kacey Eichelberger
- School of Medicine Greenville, University of South Carolina, Greenville, SC, United States
- Prisma Health, Greenville, SC, United States
| | - Constance Guille
- College of Medicine, Medical University of South Carolina, Charleston, SC, United States
| | - Alain H Litwin
- School of Medicine Greenville, University of South Carolina, Greenville, SC, United States
- Prisma Health, Greenville, SC, United States
| | - Bankole Olatosi
- Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| |
Collapse
|
2
|
Das S, Catterall J, Stone R, Clough AR. "The reasons you believe …": An exploratory study of text driven evidence gathering and prediction from first responder records justifying state authorised intervention for mental health episodes. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 254:108257. [PMID: 38901271 DOI: 10.1016/j.cmpb.2024.108257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 05/13/2024] [Accepted: 05/28/2024] [Indexed: 06/22/2024]
Abstract
Objective First responders' mandatory reports of mental health episodes requiring emergency hospital care contain rich information about patients and their needs. In Queensland (Australia) much of the information contained in Emergency Examination Authorities (EEAs) remains unused. We propose and demonstrate a methodology to extract and translate vital information embedded in reports like EEAs and to use it to investigate the extreme propensity of incidence of serious mental health episodes. Methods The proposed method integrates clinical, demographic, spatial and free text information into a single data collection. The data is subjected to exploratory analysis for spatial pattern recognition leading to an observational epidemiology model for the association of maximum spatial recurrence of EEA episodes. Results Sentiment analysis revealed that among EEA presentations hospital and health service (HHS) region #4 had the lowest proportion of positive sentiments (18 %) compared to 33 % for HHS region #1 pointing to spatial differentiation of sentiments immanent in mandated free text which required more detailed analysis. At the postcode geographical level, we found that variation in maximum spatial recurrence of EEAs was significantly positively associated with spatial range of sentiments (0.29, p < 0.001) and the postcode-referenced sex ratio (0.45, p = 0.01). The volatility of sentiments significantly correlated with extremes of recurrence of EEA episodes. The predicted (probabilistic) incidence rate when mapped reflected this correlation. Conclusions The paper demonstrates the efficacy of integrating, machine extracted, human sentiments (as potential surrogates) with conventional exposure variables for evidence-based methods for mental health spatial epidemiology. Such insights from informatics-driven epidemiological observations may inform the strategic allocation of health system resources to address the highest levels of need and to improve the standard of care for mental patients while also enhancing their safe and humane treatment and management.
Collapse
Affiliation(s)
- Sourav Das
- School of Electrical Engineering, Computing, and Mathematical Sciences, Curtin University, Perth, WA, Australia.
| | - Janet Catterall
- Liaison Librarian, Library and Information Services, Division of Student Life, James Cook University, PO Box 6811. Cairns, QLD, Australia
| | - Richard Stone
- Director of Emergency Medicine, Cairns Hospital, Cairns and Hinterland Hospital and Health Service, Cairns, QLD, Australia
| | - Alan R Clough
- Professorial Research Fellow, College of Public Health, Medical and Veterinary Sciences, and Australian Institute of Tropical Health and Medicine, James Cook University, PO Box 6811. Cairns, QLD, Australia
| |
Collapse
|
3
|
Paradise Vit A, Magid A. Differences in Fear and Negativity Levels Between Formal and Informal Health-Related Websites: Analysis of Sentiments and Emotions. J Med Internet Res 2024; 26:e55151. [PMID: 39120928 PMCID: PMC11344190 DOI: 10.2196/55151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 05/19/2024] [Accepted: 06/07/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND Searching for web-based health-related information is frequently performed by the public and may affect public behavior regarding health decision-making. Particularly, it may result in anxiety, erroneous, and harmful self-diagnosis. Most searched health-related topics are cancer, cardiovascular diseases, and infectious diseases. A health-related web-based search may result in either formal or informal medical website, both of which may evoke feelings of fear and negativity. OBJECTIVE Our study aimed to assess whether there is a difference in fear and negativity levels between information appearing on formal and informal health-related websites. METHODS A web search was performed to retrieve the contents of websites containing symptoms of selected diseases, using selected common symptoms. Retrieved websites were classified into formal and informal websites. Fear and negativity of each content were evaluated using 3 transformer models. A fourth transformer model was fine-tuned using an existing emotion data set obtained from a web-based health community. For formal and informal websites, fear and negativity levels were aggregated. t tests were conducted to evaluate the differences in fear and negativity levels between formal and informal websites. RESULTS In this study, unique websites (N=1448) were collected, of which 534 were considered formal and 914 were considered informal. There were 1820 result pages from formal websites and 1494 result pages from informal websites. According to our findings, fear levels were statistically higher (t2753=3.331; P<.001) on formal websites (mean 0.388, SD 0.177) than on informal websites (mean 0.366, SD 0.168). The results also show that the level of negativity was statistically higher (t2753=2.726; P=.006) on formal websites (mean 0.657, SD 0.211) than on informal websites (mean 0.636, SD 0.201). CONCLUSIONS Positive texts may increase the credibility of formal health websites and increase their usage by the general public and the public's compliance to the recommendations. Increasing the usage of natural language processing tools before publishing health-related information to achieve a more positive and less stressful text to be disseminated to the public is recommended.
Collapse
Affiliation(s)
- Abigail Paradise Vit
- Department of Information Systems, The Max Stern Yezreel Valley College, Emek Yezreel, Israel
| | - Avi Magid
- Department of Information Systems, The Max Stern Yezreel Valley College, Emek Yezreel, Israel
- Management, Rambam Health Care Campus, Haifa, Israel
| |
Collapse
|
4
|
Yoon S, Jang J, Son G, Park S, Hwang J, Choeh JY, Choi KH. Predicting neuroticism with open-ended response using natural language processing. Front Psychiatry 2024; 15:1437569. [PMID: 39149156 PMCID: PMC11324482 DOI: 10.3389/fpsyt.2024.1437569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 07/17/2024] [Indexed: 08/17/2024] Open
Abstract
Introduction With rapid advancements in natural language processing (NLP), predicting personality using this technology has become a significant research interest. In personality prediction, exploring appropriate questions that elicit natural language is particularly important because questions determine the context of responses. This study aimed to predict levels of neuroticism-a core psychological trait known to predict various psychological outcomes-using responses to a series of open-ended questions developed based on the five-factor model of personality. This study examined the model's accuracy and explored the influence of item content in predicting neuroticism. Methods A total of 425 Korean adults were recruited and responded to 18 open-ended questions about their personalities, along with the measurement of the Five-Factor Model traits. In total, 30,576 Korean sentences were collected. To develop the prediction models, the pre-trained language model KoBERT was used. Accuracy, F1 Score, Precision, and Recall were calculated as evaluation metrics. Results The results showed that items inquiring about social comparison, unintended harm, and negative feelings performed better in predicting neuroticism than other items. For predicting depressivity, items related to negative feelings, social comparison, and emotions showed superior performance. For dependency, items related to unintended harm, social dominance, and negative feelings were the most predictive. Discussion We identified items that performed better at neuroticism prediction than others. Prediction models developed based on open-ended questions that theoretically aligned with neuroticism exhibited superior predictive performance.
Collapse
Affiliation(s)
- Seowon Yoon
- School of Psychology, Korea University, Seoul, Republic of Korea
- KU Mind Health Institute, Korea University, Seoul, Republic of Korea
| | - Jihee Jang
- School of Psychology, Korea University, Seoul, Republic of Korea
| | - Gaeun Son
- School of Psychology, Korea University, Seoul, Republic of Korea
| | - Soohyun Park
- School of Psychology, Korea University, Seoul, Republic of Korea
| | - Jueun Hwang
- School of Psychology, Korea University, Seoul, Republic of Korea
| | - Joon Yeon Choeh
- Department of Software, Sejong University, Seoul, Republic of Korea
| | - Kee-Hong Choi
- School of Psychology, Korea University, Seoul, Republic of Korea
- KU Mind Health Institute, Korea University, Seoul, Republic of Korea
| |
Collapse
|
5
|
Gondode P, Duggal S, Garg N, Sethupathy S, Asai O, Lohakare P. Comparing patient education tools for chronic pain medications: Artificial intelligence chatbot versus traditional patient information leaflets. Indian J Anaesth 2024; 68:631-636. [PMID: 39081915 PMCID: PMC11285886 DOI: 10.4103/ija.ija_204_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2024] [Revised: 05/03/2024] [Accepted: 05/05/2024] [Indexed: 08/02/2024] Open
Abstract
Background and Aims Artificial intelligence (AI) chatbots like Conversational Generative Pre-trained Transformer (ChatGPT) have recently created much buzz, especially regarding patient education. Such informed patients understand and adhere to the management and get involved in shared decision making. The accuracy and understandability of the generated educational material are prime concerns. Thus, we compared ChatGPT with traditional patient information leaflets (PILs) about chronic pain medications. Methods Patients' frequently asked questions were generated from PILs available on the official websites of the British Pain Society (BPS) and the Faculty of Pain Medicine. Eight blinded annexures were prepared for evaluation, consisting of traditional PILs from the BPS and AI-generated patient information materials structured similar to PILs by ChatGPT. The authors performed a comparative analysis to assess materials' readability, emotional tone, accuracy, actionability, and understandability. Readability was measured using Flesch Reading Ease (FRE), Gunning Fog Index (GFI), and Flesch-Kincaid Grade Level (FKGL). Sentiment analysis determined emotional tone. An expert panel evaluated accuracy and completeness. Actionability and understandability were assessed with the Patient Education Materials Assessment Tool. Results Traditional PILs generally exhibited higher readability (P values < 0.05), with [mean (standard deviation)] FRE [62.25 (1.6) versus 48 (3.7)], GFI [11.85 (0.9) versus 13.65 (0.7)], and FKGL [8.33 (0.5) versus 10.23 (0.5)] but varied emotional tones, often negative, compared to more positive sentiments in ChatGPT-generated texts. Accuracy and completeness did not significantly differ between the two. Actionability and understandability scores were comparable. Conclusion While AI chatbots offer efficient information delivery, ensuring accuracy and readability, patient-centeredness remains crucial. It is imperative to balance innovation with evidence-based practice.
Collapse
Affiliation(s)
- Prakash Gondode
- Department of Anesthesiology Pain Medicine and Critical Care, All India Institute of Medical Sciences (AIIMS) New Delhi, India
| | - Sakshi Duggal
- Department of Anesthesiology Pain Medicine and Critical Care, All India Institute of Medical Sciences (AIIMS) New Delhi, India
| | - Neha Garg
- Department of Anesthesiology Pain Medicine and Critical Care, All India Institute of Medical Sciences (AIIMS) New Delhi, India
| | - Surrender Sethupathy
- Department of Anesthesiology Pain Medicine and Critical Care, All India Institute of Medical Sciences (AIIMS) New Delhi, India
| | - Omshubham Asai
- Department of Anesthesiology, All India Institute of Medical Sciences, Nagpur, Maharashtra, India
| | - Pooja Lohakare
- Department of Microbiology, Mahatma Gandhi Institute of Medical Sciences, Wardha, Maharashtra, India
| |
Collapse
|
6
|
Molenaar A, Jenkins EL, Brennan L, Lukose D, McCaffrey TA. The use of sentiment and emotion analysis and data science to assess the language of nutrition-, food- and cooking-related content on social media: a systematic scoping review. Nutr Res Rev 2024; 37:43-78. [PMID: 36991525 DOI: 10.1017/s0954422423000069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
Abstract
Social media data are rapidly evolving and accessible, which presents opportunities for research. Data science techniques, such as sentiment or emotion analysis which analyse textual emotion, provide an opportunity to gather insight from social media. This paper describes a systematic scoping review of interdisciplinary evidence to explore how sentiment or emotion analysis methods alongside other data science methods have been used to examine nutrition, food and cooking social media content. A PRISMA search strategy was used to search nine electronic databases in November 2020 and January 2022. Of 7325 studies identified, thirty-six studies were selected from seventeen countries, and content was analysed thematically and summarised in an evidence table. Studies were published between 2014 and 2022 and used data from seven different social media platforms (Twitter, YouTube, Instagram, Reddit, Pinterest, Sina Weibo and mixed platforms). Five themes of research were identified: dietary patterns, cooking and recipes, diet and health, public health and nutrition and food in general. Papers developed a sentiment or emotion analysis tool or used available open-source tools. Accuracy to predict sentiment ranged from 33·33% (open-source engine) to 98·53% (engine developed for the study). The average proportion of sentiment was 38·8% positive, 46·6% neutral and 28·0% negative. Additional data science techniques used included topic modelling and network analysis. Future research requires optimising data extraction processes from social media platforms, the use of interdisciplinary teams to develop suitable and accurate methods for the subject and the use of complementary methods to gather deeper insights into these complex data.
Collapse
Affiliation(s)
- Annika Molenaar
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, VIC3168, Australia
| | - Eva L Jenkins
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, VIC3168, Australia
| | - Linda Brennan
- School of Media and Communication, RMIT University, 124 La Trobe St, MelbourneVIC3004, Australia
| | - Dickson Lukose
- Monash Data Futures Institute, Monash University, Level 2, 13 Rainforest Walk, Monash University, ClaytonVIC3800, Australia
| | - Tracy A McCaffrey
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, VIC3168, Australia
| |
Collapse
|
7
|
Menger NS, Tognetti A, Farruggia MC, Mucignat C, Bhutani S, Cooper KW, Rohlfs Domínguez P, Heinbockel T, Shields VDC, D'Errico A, Pereda-Loth V, Pierron D, Koyama S, Croijmans I. Giving a Voice to Patients With Smell Disorders Associated With COVID-19: Cross-Sectional Longitudinal Analysis Using Natural Language Processing of Self-Reports. JMIR Public Health Surveill 2024; 10:e47064. [PMID: 38728069 PMCID: PMC11127136 DOI: 10.2196/47064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 10/26/2023] [Accepted: 03/11/2024] [Indexed: 05/12/2024] Open
Abstract
BACKGROUND Smell disorders are commonly reported with COVID-19 infection. The smell-related issues associated with COVID-19 may be prolonged, even after the respiratory symptoms are resolved. These smell dysfunctions can range from anosmia (complete loss of smell) or hyposmia (reduced sense of smell) to parosmia (smells perceived differently) or phantosmia (smells perceived without an odor source being present). Similar to the difficulty that people experience when talking about their smell experiences, patients find it difficult to express or label the symptoms they experience, thereby complicating diagnosis. The complexity of these symptoms can be an additional burden for patients and health care providers and thus needs further investigation. OBJECTIVE This study aims to explore the smell disorder concerns of patients and to provide an overview for each specific smell disorder by using the longitudinal survey conducted in 2020 by the Global Consortium for Chemosensory Research, an international research group that has been created ad hoc for studying chemosensory dysfunctions. We aimed to extend the existing knowledge on smell disorders related to COVID-19 by analyzing a large data set of self-reported descriptive comments by using methods from natural language processing. METHODS We included self-reported data on the description of changes in smell provided by 1560 participants at 2 timepoints (second survey completed between 23 and 291 days). Text data from participants who still had smell disorders at the second timepoint (long-haulers) were compared with the text data of those who did not (non-long-haulers). Specifically, 3 aims were pursued in this study. The first aim was to classify smell disorders based on the participants' self-reports. The second aim was to classify the sentiment of each self-report by using a machine learning approach, and the third aim was to find particular food and nonfood keywords that were more salient among long-haulers than those among non-long-haulers. RESULTS We found that parosmia (odds ratio [OR] 1.78, 95% CI 1.35-2.37; P<.001) as well as hyposmia (OR 1.74, 95% CI 1.34-2.26; P<.001) were more frequently reported in long-haulers than in non-long-haulers. Furthermore, a significant relationship was found between long-hauler status and sentiment of self-report (P<.001). Finally, we found specific keywords that were more typical for long-haulers than those for non-long-haulers, for example, fire, gas, wine, and vinegar. CONCLUSIONS Our work shows consistent findings with those of previous studies, which indicate that self-reports, which can easily be extracted online, may offer valuable information to health care and understanding of smell disorders. At the same time, our study on self-reports provides new insights for future studies investigating smell disorders.
Collapse
Affiliation(s)
- Nick S Menger
- Institute of Medical Psychology and Behavioural Neurobiology, University of Tübingen, Tübingen, Germany
| | - Arnaud Tognetti
- Department of Clinical Neuroscience, Division of Psychology, Karolinska Institutet, Stockholm, Sweden
- Centre d'Economie de l'Environnement Montpellier, Centre National de la Recherche Scientifique, Institut National de Recherche pour l'Agriculture l'Alimentation et l'Environnement, Institut Agro, Université de Montpellier, Montpellier, France
| | - Michael C Farruggia
- Interdepartmental Neuroscience Program, Yale University, New Haven, CT, United States
| | - Carla Mucignat
- Department of Molecular Medicine, University of Padova, Padua, Italy
| | - Surabhi Bhutani
- School of Exercise and Nutritional Sciences, San Diego State University, San Diego, CA, United States
| | - Keiland W Cooper
- Department of Neurobiology and Behavior, University of California, Irvine, CA, United States
| | - Paloma Rohlfs Domínguez
- Department of Developmental and Educational Psychology, University of Basque Country, Leioa, Spain
| | - Thomas Heinbockel
- Department of Anatomy, Howard University College of Medicine, Washington, DC, United States
| | - Vonnie D C Shields
- Biological Sciences Department, Fisher College of Science and Mathematics, Towson University, Towson, MD, United States
| | - Anna D'Errico
- Goethe University of Frankfurt, Frankfurt am Main, Germany
| | | | - Denis Pierron
- Laboratoire Évolution et Santé Orale, Université Toulouse III, Toulouse, France
| | - Sachiko Koyama
- Department of Medicine, School of Medicine, Indiana University, Indianapolis, IN, United States
| | - Ilja Croijmans
- Language and Communication Department, Faculty of Arts, Radboud University, Nijmegen, Netherlands
- Centre for Language Studies, Radboud University, Nijmegen, Netherlands
| |
Collapse
|
8
|
Reis FJJ, Bonfim IDS, Corrêa LA, Nogueira LC, Meziat-Filho N, Almeida RSD. Uncovering emotional and network dynamics in the speech of patients with chronic low back pain. Musculoskelet Sci Pract 2024; 70:102925. [PMID: 38430821 DOI: 10.1016/j.msksp.2024.102925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 01/26/2024] [Accepted: 02/23/2024] [Indexed: 03/05/2024]
Abstract
BACKGROUND Computational linguistics allows an understanding of language structure and different forms of expression of patients' perceptions. AIMS The aims of this study were (i) to carry out a descriptive analysis of the discourse of people with chronic low back pain using sentiment analysis (SA) and network analysis; (ii) to verify the correlation between patients' profiles, pain intensity and disability levels with SA and network analysis; and (iii) to identify clusters in our sample according to language and SA using an unsupervised machine learning technique. METHODS We performed a secondary analysis of a qualitative study including participants with chronic non-specific low back pain. We used the data related to participants' feelings when they received the diagnosis. The SA and network analysis were performed using the Valence Aware Dictionary and sEntiment Reasoner, and the Speech Graph, respectively. Clustering was performed using the K-means algorithm. RESULTS In the SA, the mean composite score was -0.31 (Sd. = 0.58). Most participants presented a negative discourse (n = 41; 72%). Word Count (WC) and Largest Strongly connected Component (LSC) positively correlated with education. No statistically significant correlations were observed between pain intensity, disability levels, SA, and network analysis. Two clusters were identified in our sample. CONCLUSION The SA showed that participants reported their feeling when describing the moment of the diagnosis using sentences with negative discourse. We did not find a statistically significant correlation between pain intensity, disability levels, SA, and network analysis. Education level presented positive correlation with WC and LSC.
Collapse
Affiliation(s)
- Felipe J J Reis
- Physical Therapy Department, Instituto Federal do Rio de Janeiro (IFRJ), Rio de Janeiro, RJ, Brazil; Pain in Motion Research Group, Department of Physiotherapy, Human Physiology and Anatomy, Faculty of Physical Education & Physiotherapy, Vrije Universiteit Brussel, Brussels, Belgium.
| | - Igor da Silva Bonfim
- Postgraduate Program in Rehabilitation Sciences, Centro Universitário Augusto Motta (UNISUAM), Rio de Janeiro, RJ, Brazil
| | - Leticia Amaral Corrêa
- Department of Chiropractic, Faculty of Medicine, Health and Human Sciences, Macquarie University, Sydney, Australia
| | - Leandro Calazans Nogueira
- Physical Therapy Department, Instituto Federal do Rio de Janeiro (IFRJ), Rio de Janeiro, RJ, Brazil; Postgraduate Program in Rehabilitation Sciences, Centro Universitário Augusto Motta (UNISUAM), Rio de Janeiro, RJ, Brazil
| | - Ney Meziat-Filho
- Postgraduate Program in Rehabilitation Sciences, Centro Universitário Augusto Motta (UNISUAM), Rio de Janeiro, RJ, Brazil
| | - Renato Santos de Almeida
- Physical Therapy Department, Instituto Federal do Rio de Janeiro (IFRJ), Rio de Janeiro, RJ, Brazil; Postgraduate Program in Rehabilitation Sciences, Centro Universitário Augusto Motta (UNISUAM), Rio de Janeiro, RJ, Brazil
| |
Collapse
|
9
|
Ntiamoah M, Xavier T, Lambert J. Sentiment Analysis of Patient- and Family-Related Sepsis Events: Exploratory Study. JMIR Nurs 2024; 7:e51720. [PMID: 38557694 PMCID: PMC11019419 DOI: 10.2196/51720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 01/24/2024] [Accepted: 02/07/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND Despite the life-threatening nature of sepsis, little is known about the emotional experiences of patients and their families during sepsis events. We conducted a sentiment analysis pertaining to sepsis incidents involving patients and families, leveraging textual data retrieved from a publicly available blog post disseminated by the Centers for Disease Control and Prevention (CDC). OBJECTIVE This investigation involved a sentiment analysis of patient- and family-related sepsis events, leveraging text responses sourced from a publicly accessible blog post disseminated by the CDC. Driven by the imperative to elucidate the emotional dynamics encountered by patients and their families throughout sepsis incidents, the overarching aims centered on elucidating the emotional ramifications of sepsis on both patients and their families and discerning potential avenues for enhancing the quality of sepsis care. METHODS The research used a cross-sectional data mining methodology to investigate the sentiments and emotional aspects linked to sepsis, using a data set sourced from the CDC, which encompasses 170 responses from both patients and caregivers, spanning the period between September 2014 and September 2020. This investigation used the National Research Council Canada Emotion Lexicon for sentiment analysis, coupled with a combination of manual and automated techniques to extract salient features from textual responses. The study used negative binomial least absolute shrinkage and selection operator regressions to ascertain significant textual features that correlated with specific emotional states. Moreover, the visualization of Plutchik's Wheel of Emotions facilitated the discernment of prevailing emotions within the data set. RESULTS The results showed that patients and their families experienced a range of emotions during sepsis events, including fear, anxiety, sadness, and gratitude. Our analyses revealed an estimated incidence rate ratio (IRR) of 1.35 for fear-related words and a 1.51 IRR for sadness-related words when mentioning "hospital" in sepsis-related experiences. Similarly, mentions of "intensive care unit" were associated with an average occurrence of 12.3 fear-related words and 10.8 sadness-related words. Surviving patients' experiences had an estimated 1.15 IRR for joy-related words, contrasting with discussions around organ failure, which were associated with multiple negative emotions including disgust, anger, fear, and sadness. Furthermore, mentions of "death" were linked to more fear and anger words but fewer joy-related words. Conversely, longer timelines in sepsis events were associated with more joy-related words and fewer fear-related words, potentially indicating improved emotional adaptation over time. CONCLUSIONS The study's outcomes underscore the imperative for health care providers to integrate emotional support alongside medical interventions for patients and families affected by sepsis, emphasizing the emotional toll incurred and highlighting the necessity of acknowledgment and resolution, advocating for the use of sentiment analysis as a means to tailor personalized emotional aid, and thereby potentially augmenting both patient and family welfare and overall outcomes.
Collapse
Affiliation(s)
| | - Teenu Xavier
- University of Cincinnati, Cincinnati, OH, United States
| | | |
Collapse
|
10
|
Farokhnia Hamedani M, Esmaeili M, Sun Y, Sheybani E, Javidi G. Paving the way for COVID survivors' psychosocial rehabilitation: Mining topics, sentiments, and their trajectories over time from Reddit. Health Informatics J 2024; 30:14604582241240680. [PMID: 38739488 DOI: 10.1177/14604582241240680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Objective: This study examined major themes and sentiments and their trajectories and interactions over time using subcategories of Reddit data. The aim was to facilitate decision-making for psychosocial rehabilitation. Materials and Methods: We utilized natural language processing techniques, including topic modeling and sentiment analysis, on a dataset consisting of more than 38,000 topics, comments, and posts collected from a subreddit dedicated to the experiences of people who tested positive for COVID-19. In this longitudinal exploratory analysis, we studied the dynamics between the most dominant topics and subjects' emotional states over an 18-month period. Results: Our findings highlight the evolution of the textual and sentimental status of major topics discussed by COVID survivors over an extended period of time during the pandemic. We particularly studied pre- and post-vaccination eras as a turning point in the timeline of the pandemic. The results show that not only does the relevance of topics change over time, but the emotions attached to them also vary. Major social events, such as the administration of vaccines or enforcement of nationwide policies, are also reflected through the discussions and inquiries of social media users. In particular, the emotional state (i.e., sentiments and polarity of their feelings) of those who have experienced COVID personally. Discussion: Cumulative societal knowledge regarding the COVID-19 pandemic impacts the patterns with which people discuss their experiences, concerns, and opinions. The subjects' emotional state with respect to different topics was also impacted by extraneous factors and events, such as vaccination. Conclusion: By mining major topics, sentiments, and trajectories demonstrated in COVID-19 survivors' interactions on Reddit, this study contributes to the emerging body of scholarship on COVID-19 survivors' mental health outcomes, providing insights into the design of mental health support and rehabilitation services for COVID-19 survivors.
Collapse
Affiliation(s)
- Moez Farokhnia Hamedani
- Bryan School of Business and Economics, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Mostafa Esmaeili
- Muma College of Business, University of South Florida, Tampa, FL, USA
| | - Yao Sun
- College of Science and Liberal Arts, New Jersey Institute of Technology, Newark, NJ, USA
| | - Ehsan Sheybani
- Muma College of Business, University of South Florida, Tampa, FL, USA
| | - Giti Javidi
- Muma College of Business, University of South Florida, Tampa, FL, USA
| |
Collapse
|
11
|
Molenaar A, Lukose D, Brennan L, Jenkins EL, McCaffrey TA. Using Natural Language Processing to Explore Social Media Opinions on Food Security: Sentiment Analysis and Topic Modeling Study. J Med Internet Res 2024; 26:e47826. [PMID: 38512326 PMCID: PMC10995791 DOI: 10.2196/47826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 12/05/2023] [Accepted: 12/20/2023] [Indexed: 03/22/2024] Open
Abstract
BACKGROUND Social media has the potential to be of great value in understanding patterns in public health using large-scale analysis approaches (eg, data science and natural language processing [NLP]), 2 of which have been used in public health: sentiment analysis and topic modeling; however, their use in the area of food security and public health nutrition is limited. OBJECTIVE This study aims to explore the potential use of NLP tools to gather insights from real-world social media data on the public health issue of food security. METHODS A search strategy for obtaining tweets was developed using food security terms. Tweets were collected using the Twitter application programming interface from January 1, 2019, to December 31, 2021, filtered for Australia-based users only. Sentiment analysis of the tweets was performed using the Valence Aware Dictionary and Sentiment Reasoner. Topic modeling exploring the content of tweets was conducted using latent Dirichlet allocation with BigML (BigML, Inc). Sentiment, topic, and engagement (the sum of likes, retweets, quotations, and replies) were compared across years. RESULTS In total, 38,070 tweets were collected from 14,880 Twitter users. Overall, the sentiment when discussing food security was positive, although this varied across the 3 years. Positive sentiment remained higher during the COVID-19 lockdown periods in Australia. The topic model contained 10 topics (in order from highest to lowest probability in the data set): "Global production," "Food insecurity and health," "Use of food banks," "Giving to food banks," "Family poverty," "Food relief provision," "Global food insecurity," "Climate change," "Australian food insecurity," and "Human rights." The topic "Giving to food banks," which focused on support and donation, had the highest proportion of positive sentiment, and "Global food insecurity," which covered food insecurity prevalence worldwide, had the highest proportion of negative sentiment. When compared with news, there were some events, such as COVID-19 support payment introduction and bushfires across Australia, that were associated with high periods of positive or negative sentiment. Topics related to food insecurity prevalence, poverty, and food relief in Australia were not consistently more prominent during the COVID-19 pandemic than before the pandemic. Negative tweets received substantially higher engagement across 2019 and 2020. There was no clear relationship between topics that were more likely to be positive or negative and have higher or lower engagement, indicating that the identified topics are discrete issues. CONCLUSIONS In this study, we demonstrated the potential use of sentiment analysis and topic modeling to explore evolution in conversations on food security using social media data. Future use of NLP in food security requires the context of and interpretation by public health experts and the use of broader data sets, with the potential to track dimensions or events related to food security to inform evidence-based decision-making in this area.
Collapse
Affiliation(s)
- Annika Molenaar
- Department of Nutrition, Dietetics and Food, Monash University, Notting Hill, Australia
| | | | - Linda Brennan
- School of Media and Communication, RMIT University, Melbourne, Australia
| | - Eva L Jenkins
- Department of Nutrition, Dietetics and Food, Monash University, Notting Hill, Australia
| | - Tracy A McCaffrey
- Department of Nutrition, Dietetics and Food, Monash University, Notting Hill, Australia
| |
Collapse
|
12
|
Khine AH, Wettayaprasit W, Duangsuwan J. A new word embedding model integrated with medical knowledge for deep learning-based sentiment classification. Artif Intell Med 2024; 148:102758. [PMID: 38325934 DOI: 10.1016/j.artmed.2023.102758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 11/19/2023] [Accepted: 12/29/2023] [Indexed: 02/09/2024]
Abstract
The development of intelligent systems that use social media data for decision-making processes in numerous domains such as politics, business, marketing, and finance, has been made possible by the popularity of social media platforms. However, the utilization of textual data from social media in the healthcare management industry is still somewhat limited when it is compared to other industries. Investigating how current machine learning and natural language processing technologies can be used in the healthcare industry to gauge public sentiment is an important study. Earlier works on healthcare sentiment analysis have utilized traditional word embedding models trained on the general and medical corpus. However, integration of medical knowledge to pre-trained word embedding models has not been considered yet. Word embedding models trained on the general corpus led to the problem of lacking medical knowledge and the models trained on the small size of the medical corpus have limitations in capturing semantic and syntactic properties. This research proposes a new word embedding model named Word Embedding Integrated with Medical Knowledge Vector (WE-iMKVec). The proposed model integrates sentiment lexicons and medical knowledgebases into the pre-trained word embedding to enrich the properties of word embedding. A new medical-aware sentiment polarity score is proposed for the utilization in learning neural-network sentiment and these vectors incorporate with the original pre-trained word vectors. The resulting vectors are enriched with lexicon vectors and the medical knowledge vectors: Adverse Drug Reaction (ADR) vector and Unified Medical Language System (UMLS) vector are used to build the proposed WE-iMKVec model. WE-iMKVec is validated on the five different social media healthcare review datasets and the empirical results showed its superiority over traditional word embedding models in medical sentiment analysis. The highest improvement can be found in the patients.info medical condition dataset where the proposed model outperforms three conventional word2vec models (Google-News, PubMed-PMC, and Drug Reviews) by 12.7 %, 31.4 %, and 25.4 % respectively in terms of F1 score.
Collapse
Affiliation(s)
- Aye Hninn Khine
- Artificial Intelligence Research Lab, Division of Computational Science, Faculty of Science, Prince of Songkla University, Hat Yai, Thailand
| | - Wiphada Wettayaprasit
- Artificial Intelligence Research Lab, Division of Computational Science, Faculty of Science, Prince of Songkla University, Hat Yai, Thailand
| | - Jarunee Duangsuwan
- Artificial Intelligence Research Lab, Division of Computational Science, Faculty of Science, Prince of Songkla University, Hat Yai, Thailand.
| |
Collapse
|
13
|
Lossio-Ventura JA, Weger R, Lee AY, Guinee EP, Chung J, Atlas L, Linos E, Pereira F. A Comparison of ChatGPT and Fine-Tuned Open Pre-Trained Transformers (OPT) Against Widely Used Sentiment Analysis Tools: Sentiment Analysis of COVID-19 Survey Data. JMIR Ment Health 2024; 11:e50150. [PMID: 38271138 PMCID: PMC10813836 DOI: 10.2196/50150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 11/16/2023] [Accepted: 11/17/2023] [Indexed: 01/27/2024] Open
Abstract
BACKGROUND Health care providers and health-related researchers face significant challenges when applying sentiment analysis tools to health-related free-text survey data. Most state-of-the-art applications were developed in domains such as social media, and their performance in the health care context remains relatively unknown. Moreover, existing studies indicate that these tools often lack accuracy and produce inconsistent results. OBJECTIVE This study aims to address the lack of comparative analysis on sentiment analysis tools applied to health-related free-text survey data in the context of COVID-19. The objective was to automatically predict sentence sentiment for 2 independent COVID-19 survey data sets from the National Institutes of Health and Stanford University. METHODS Gold standard labels were created for a subset of each data set using a panel of human raters. We compared 8 state-of-the-art sentiment analysis tools on both data sets to evaluate variability and disagreement across tools. In addition, few-shot learning was explored by fine-tuning Open Pre-Trained Transformers (OPT; a large language model [LLM] with publicly available weights) using a small annotated subset and zero-shot learning using ChatGPT (an LLM without available weights). RESULTS The comparison of sentiment analysis tools revealed high variability and disagreement across the evaluated tools when applied to health-related survey data. OPT and ChatGPT demonstrated superior performance, outperforming all other sentiment analysis tools. Moreover, ChatGPT outperformed OPT, exhibited higher accuracy by 6% and higher F-measure by 4% to 7%. CONCLUSIONS This study demonstrates the effectiveness of LLMs, particularly the few-shot learning and zero-shot learning approaches, in the sentiment analysis of health-related survey data. These results have implications for saving human labor and improving efficiency in sentiment analysis tasks, contributing to advancements in the field of automated sentiment analysis.
Collapse
Affiliation(s)
| | - Rachel Weger
- School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Angela Y Lee
- Department of Communication, Stanford University, Stanford, CA, United States
| | - Emily P Guinee
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Joyce Chung
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Lauren Atlas
- National Center For Complementary and Alternative Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Eleni Linos
- School of Medicine, Stanford University, Stanford, CA, United States
| | - Francisco Pereira
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| |
Collapse
|
14
|
Galvez-Hernandez P, Gonzalez-Viana A, Gonzalez-de Paz L, Shankardass K, Muntaner C. Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis. JMIR Public Health Surveill 2024; 10:e50379. [PMID: 38190245 PMCID: PMC10804251 DOI: 10.2196/50379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/20/2023] [Accepted: 11/28/2023] [Indexed: 01/09/2024] Open
Abstract
BACKGROUND Contextual variables that capture the characteristics of delimited geographic or jurisdictional areas are vital for health and social research. However, obtaining data sets with contextual-level data can be challenging in the absence of monitoring systems or public census data. OBJECTIVE We describe and implement an 8-step method that combines web scraping, text mining, and spatial overlay analysis (WeTMS) to transform extensive text data from government websites into analyzable data sets containing contextual data for jurisdictional areas. METHODS This tutorial describes the method and provides resources for its application by health and social researchers. We used this method to create data sets of health assets aimed at enhancing older adults' social connections (eg, activities and resources such as walking groups and senior clubs) across the 374 health jurisdictions in Catalonia from 2015 to 2022. These assets are registered on a web-based government platform by local stakeholders from various health and nonhealth organizations as part of a national public health program. Steps 1 to 3 involved defining the variables of interest, identifying data sources, and using Python to extract information from 50,000 websites linked to the platform. Steps 4 to 6 comprised preprocessing the scraped text, defining new variables to classify health assets based on social connection constructs, analyzing word frequencies in titles and descriptions of the assets, creating topic-specific dictionaries, implementing a rule-based classifier in R, and verifying the results. Steps 7 and 8 integrate the spatial overlay analysis to determine the geographic location of each asset. We conducted a descriptive analysis of the data sets to report the characteristics of the assets identified and the patterns of asset registrations across areas. RESULTS We identified and extracted data from 17,305 websites describing health assets. The titles and descriptions of the activities and resources contained 12,560 and 7301 unique words, respectively. After applying our classifier and spatial analysis algorithm, we generated 2 data sets containing 9546 health assets (5022 activities and 4524 resources) with the potential to enhance social connections among older adults. Stakeholders from 318 health jurisdictions registered identified assets on the platform between July 2015 and December 2022. The agreement rate between the classification algorithm and verified data sets ranged from 62.02% to 99.47% across variables. Leisure and skill development activities were the most prevalent (1844/5022, 36.72%). Leisure and cultural associations, such as social clubs for older adults, were the most common resources (878/4524, 19.41%). Health asset registration varied across areas, ranging between 0 and 263 activities and 0 and 265 resources. CONCLUSIONS The sequential use of WeTMS offers a robust method for generating data sets containing contextual-level variables from internet text data. This study can guide health and social researchers in efficiently generating ready-to-analyze data sets containing contextual variables.
Collapse
Affiliation(s)
- Pablo Galvez-Hernandez
- Lawrence S Bloomberg Faculty of Nursing, University of Toronto, Toronto, ON, Canada
- Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | | | - Luis Gonzalez-de Paz
- Primary Healthcare Transversal Research Group, Institut d'Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
- Consorci d'Atenció Primària de Salut Barcelona Esquerra, Barcelona, Spain
| | - Ketan Shankardass
- Department of Heath Sciences, Wilfrid Laurier University, Waterloo, ON, Canada
- MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, ON, Canada
| | - Carles Muntaner
- Lawrence S Bloomberg Faculty of Nursing, University of Toronto, Toronto, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
15
|
Niazi F, Elkaim LM, Zadeh Khomami NM, Levett JJ, Weil AG, Hodaie M, Alotaibi NM. Microvascular Decompression and Trigeminal Neuralgia: Patient Sentiment Analysis Using Natural Language Processing. World Neurosurg 2023; 180:e528-e536. [PMID: 37778624 DOI: 10.1016/j.wneu.2023.09.107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 09/25/2023] [Indexed: 10/03/2023]
Abstract
OBJECTIVE Microvascular decompression (MVD) as a treatment for trigeminal neuralgia (TGN) has high success rate but is associated with risks of complication. This study analyzes Twitter to provide insights into discussions surrounding MVD for patients with TGN. METHODS A Twitter search performed in April 2022 yielded 491 tweets from 426 accounts. Tweets and accounts were classified thematically, and descriptive statistics were used for various social media metrics. Using a natural language processing machine learning algorithm, sentiment analysis (SA) was performed to evaluate patient perspectives before and after surgery, and a multivariate regression model was used to identify predictors of higher engagement metrics (likes, retweets, quote tweets, replies). RESULTS Most accounts were patients, caregivers, and other members of the public (70%). The most encountered themes were research (47%) and personal experiences (33.4%). SA of tweets about patient experiences showed that 40.2% of tweets were positive, 31.1% were neutral and 28.7% were negative. Negative tweets decreased significantly in postoperative tweets and mostly discussed complications or failure of surgery (63%). On multivariate analysis, only inclusion of media (photo or video) in a Tweet was associated with higher engagement metrics. CONCLUSIONS This study provides a comprehensive review of Twitter use discussing MVD in TGN and is the first to assess patient satisfaction after treatment using SA. The data presented on patient perspectives on social media could help physicians establish direct lines of communication with patients, fostering a more patient-focused care.
Collapse
Affiliation(s)
- Farbod Niazi
- Department of Medicine, Université de Montréal, Montreal, Quebec, Canada
| | - Lior M Elkaim
- Department of Neurology and Neurosurgery, McGill University, Montreal, Quebec, Canada.
| | | | - Jordan J Levett
- Department of Medicine, Université de Montréal, Montreal, Quebec, Canada
| | - Alexander G Weil
- Division of Neurosurgery, Sainte Justine Hospital, Montreal, Quebec, Canada
| | - Mojgan Hodaie
- Department of Surgery, Division of Neurosurgery, University of Toronto, Toronto, Ontario, Canada
| | - Naif M Alotaibi
- Department of Neurosurgery, National Neuroscience Institute, King Fahad Medical City, Riyadh, Saudi Arabia
| |
Collapse
|
16
|
Humayun MM, Brouillette MJ, Fellows LK, Mayo NE. The Patient Generated Index (PGI) as an early-warning system for predicting brain health challenges: a prospective cohort study for people living with Human Immunodeficiency Virus (HIV). Qual Life Res 2023; 32:3439-3452. [PMID: 37428407 DOI: 10.1007/s11136-023-03475-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/28/2023] [Indexed: 07/11/2023]
Abstract
PURPOSE In research people are often asked to fill out questionnaires about their health and functioning and some of the questions refer to serious health concerns. Typically, these concerns are not identified until the statistician analyses the data. An alternative is to use an individualized measure, the Patient Generated Index (PGI) where people are asked to self-nominate areas of concern which can then be dealt with in real-time. This study estimates the extent to which self-nominated areas of concern related to mood, anxiety and cognition predict the presence or occurrence of brain health outcomes such as depression, anxiety, psychological distress, or cognitive impairment among people aging with HIV at study entry and for successive assessments over 27 months. METHODS The data comes from participants enrolled in the Positive Brain Health Now (+ BHN) cohort (n = 856). We analyzed the self-nominated areas that participants wrote on the PGI and classified them into seven sentiment groups according to the type of sentiment expressed: emotional, interpersonal, anxiety, depressogenic, somatic, cognitive and positive sentiments. Tokenization was used to convert qualitative data into quantifiable tokens. A longitudinal design was used to link these sentiment groups to the presence or emergence of brain health outcomes as assessed using standardized measures of these constructs: the Hospital Anxiety and Depression Scale (HADS), the Mental Health Index (MHI) of the RAND-36, the Communicating Cognitive Concerns Questionnaire (C3Q) and the Brief Cognitive Ability Measure (B-CAM). Logistic regressions were used to estimate the goodness of fit of each model using the c-statistic. RESULTS Emotional sentiments predicted all of the brain health outcomes at all visits with adjusted odds ratios (OR) ranging from 1.61 to 2.00 and c-statistics > 0.73 (good to excellent prediction). Nominating an anxiety sentiment was specific to predicting anxiety and psychological distress (OR 1.65 & 1.52); nominating a cognitive concern was specific to predicting self-reported cognitive ability (OR 4.78). Positive sentiments were predictive of good cognitive function (OR 0.36) and protective of depressive symptoms (OR 0.55). CONCLUSIONS This study indicates the value of using this semi-qualitative approach as an early-warning system in predicting brain health outcomes.
Collapse
Affiliation(s)
- Muhammad Mustafa Humayun
- Division of Experimental Medicine, Faculty of Medicine and Health Sciences, McGill University, 5252 de Maisonneuve, Montreal, QC, H4A 3S5, Canada.
- Center for Outcome Research and Evaluation (CORE), Research Institute of the McGill University Health Center, Montreal, QC, Canada.
| | - Marie-Josée Brouillette
- Department of Psychiatry, Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Lesley K Fellows
- Department of Neurology and Neurosurgery, Faculty of Medicine, McGill University, Montreal, QC, Canada
| | - Nancy E Mayo
- Center for Outcome Research and Evaluation (CORE), Research Institute of the McGill University Health Center, Montreal, QC, Canada
- School of Physical and Occupational Therapy, Faculty of Medicine, McGill University, Montreal, QC, Canada
| |
Collapse
|
17
|
Levett JJ, Elkaim LM, Weber MH, Yuh SJ, Lasry O, Alotaibi NM, Georgiopoulos M, Berven SH, Weil AG. A twitter analysis of patient and family experience in pediatric spine surgery. Childs Nerv Syst 2023; 39:3483-3490. [PMID: 37354288 DOI: 10.1007/s00381-023-06019-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 06/04/2023] [Indexed: 06/26/2023]
Abstract
BACKGROUND There is little data on patient and caregiver perceptions of spine surgery in children and youth. This study aims to characterize the personal experiences of patients, caregivers, and family members surrounding pediatric spine surgery through a qualitative and quantitative social media analysis. METHODS The Twitter application programming interface was searched for keywords related to pediatric spine surgery from inception to March 2022. Relevant tweets and accounts were extracted and subsequently classified using thematic labels. Tweet metadata was collected to measure user engagement via multivariable regression. Sentiment analysis using Natural Language Processing was performed on all tweets with a focus on tweets discussing the personal experiences of patients and caregivers. RESULTS 2424 tweets from 1847 individual accounts were retrieved for analysis. Patients and caregivers represented 1459 (79.0%) of all accounts. Posts discussed the personal experiences of patients and caregivers in 83.5% of tweets. Pediatric spine surgery research was discussed in few posts (n=90, 3.7%). Within the personal experience category, 975 (48.17%) tweets were positive, 516 (25.49%) were negative, and 533 (26.34%) were neutral. Presence of a tag (beta: -6.1, 95% CI -9.7 to -2.5) and baseline follower count (beta<0.001, 95% CI <0.001 to <0.001) significantly affected tweet engagement negatively and positively, respectively. CONCLUSIONS Patients and caregivers actively discuss topics related to pediatric spine surgery on Twitter. Posts discussing personal experience are most prevalent, while posts on research are scarce, unlike previous social media studies. Pediatric spine surgeons can leverage this dialogue to better understand the worries and needs of patients and their families.
Collapse
Affiliation(s)
- Jordan J Levett
- Faculty of Medicine, University of Montreal, Montreal, Quebec, Canada
| | - Lior M Elkaim
- Department of Neurology and Neurosurgery, McGill University, Montreal, Quebec, Canada.
| | - Michael H Weber
- Department of Orthopaedic Surgery, McGill University, Montreal, Quebec, Canada
| | - Sung-Joo Yuh
- Department of Neurosurgery, Centre Hospitalier de l'Université de Montréal, Montreal, Quebec, Canada
| | - Oliver Lasry
- Department of Neurology and Neurosurgery, McGill University, Montreal, Quebec, Canada
- Department of Epidemiology, and Occupational Health, McGill University, BiostatisticsMontreal, Quebec, Canada
| | - Naif M Alotaibi
- Department of Neurosurgery, King Fahad Medical City, National Neuroscience Institute, Riyadh, Saudi Arabia
| | | | - Sigurd H Berven
- Department of Orthopaedic Surgery, University of California San Francisco, San Francisco, California, United States
| | - Alexander G Weil
- Centre Hospitalier Universitaire Sainte-Justine, Montreal, Quebec, Canada
| |
Collapse
|
18
|
Zargaran D, Zargaran A, Sousi S, Knight D, Cook H, Woollard A, Davies J, Weyrich T, Mosahebi A. Quantitative and qualitative analysis of individual experiences post botulinum toxin injection - United Kingdom Survey. SKIN HEALTH AND DISEASE 2023; 3:e265. [PMID: 37799369 PMCID: PMC10549845 DOI: 10.1002/ski2.265] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 05/29/2023] [Accepted: 06/13/2023] [Indexed: 10/07/2023]
Abstract
Introduction In the United Kingdom (UK), complications that arise following the administration of Botulinum Toxin are reported to the Medicines and Health Regulatory Agency (MHRA) via the Yellow Card Reporting Scheme. Over the past decade, there has been a significant increase in the number of non-surgical aesthetic procedures. Concerns have been raised that the MHRA is not fully capturing complications in terms of volume and impact on patients. Aim This novel study explores the lived experiences of individuals who have experienced an adverse event following administration of Botulinum Toxin for aesthetic purposes. Using a combination of qualitative and quantitative methodologies, this analysis evaluates data relating to long-lasting physical, psychological, emotional, and financial sequelae of complications arising from cosmetic Botulinum Toxin injections in the UK. Methods A mixed method, qualitative and quantitative approach was adopted to gain comprehensive insights into patients' experiences. A focus group which comprised patient representatives, psychologists, and researchers reached a consensus on a 17-question survey which was disseminated via social media channels. Deductive thematic analysis was used to analyse coded themes. Furthermore, for secondary analysis, sentiment analysis was used computationally as an innovative approach to identify and categorise free text responses associated with sentiments using natural language processing (NLP). Results In the study, 655 responses were received, with 287 (44%) of respondents completing all questions. The mean age of respondents was 42.6 years old. 94.1% of respondents identified as female. In the sample, 79% of respondents reported an adverse event following their procedure, with the most common event being reported as 'anxiety'. Findings revealed that 69% of respondents reported long-lasting adverse effects. From the responses, 68.4% reported not having recovered physically, 63.5% of respondents stated that they had not recovered emotionally from complications, and 61.7% said that they have not recovered psychologically. In addition, 84% of respondents stated that they do not know who regulates the aesthetics industry. Furthermore, 92% of participants reported that their clinic or practitioner did not inform them about the Yellow Card Reporting Scheme. The sentiment analysis using the AFINN Lexicon yielded adjusted scores ranging from -3 to +2, with a mean value of -1.58. Conclusion This is the largest survey in the UK completed by patients who experienced an adverse outcome following the aesthetic administration of Botulinum Toxin. Our study highlights the extent of the challenges faced by patients who experience an adverse event from physical, emotional, psychological, and financial perspectives. The lack of awareness of MHRA reporting structures and the lack of regulation within the UK's cosmetic injectables sector represent a significant public health challenge.
Collapse
Affiliation(s)
- David Zargaran
- Department of Plastic SurgeryUniversity College LondonLondonUK
- British Association of Aesthetic Plastic Surgeons (BAAPS) AcademyLondonUK
| | | | - Sara Sousi
- Department of Plastic SurgeryUniversity College LondonLondonUK
| | | | - Hannah Cook
- Department of Plastic SurgeryUniversity College LondonLondonUK
| | - Alexander Woollard
- Department of Plastic SurgeryUniversity College LondonLondonUK
- Cosmetic Practice Standards Authority (CPSA)LondonUK
| | - Julie Davies
- UCL Global Business School for HealthUniversity College LondonLondonUK
| | - Tim Weyrich
- Department of Computer ScienceUniversity College LondonLondonUK
- Friedrich‐Alexander University (FAU) Erlangen‐NürnbergErlangenGermany
| | - Afshin Mosahebi
- Department of Plastic SurgeryUniversity College LondonLondonUK
- British Association of Aesthetic Plastic Surgeons (BAAPS) AcademyLondonUK
| |
Collapse
|
19
|
Li C, Fu J, Lai J, Sun L, Zhou C, Li W, Jian B, Deng S, Zhang Y, Guo Z, Liu Y, Zhou Y, Xie S, Hou M, Wang R, Chen Q, Wu Y. Construction of an Emotional Lexicon of Patients With Breast Cancer: Development and Sentiment Analysis. J Med Internet Res 2023; 25:e44897. [PMID: 37698914 PMCID: PMC10523220 DOI: 10.2196/44897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 08/17/2023] [Accepted: 08/18/2023] [Indexed: 09/13/2023] Open
Abstract
BACKGROUND The innovative method of sentiment analysis based on an emotional lexicon shows prominent advantages in capturing emotional information, such as individual attitudes, experiences, and needs, which provides a new perspective and method for emotion recognition and management for patients with breast cancer (BC). However, at present, sentiment analysis in the field of BC is limited, and there is no emotional lexicon for this field. Therefore, it is necessary to construct an emotional lexicon that conforms to the characteristics of patients with BC so as to provide a new tool for accurate identification and analysis of the patients' emotions and a new method for their personalized emotion management. OBJECTIVE This study aimed to construct an emotional lexicon of patients with BC. METHODS Emotional words were obtained by merging the words in 2 general sentiment lexicons, the Chinese Linguistic Inquiry and Word Count (C-LIWC) and HowNet, and the words in text corpora acquired from patients with BC via Weibo, semistructured interviews, and expressive writing. The lexicon was constructed using manual annotation and classification under the guidance of Russell's valence-arousal space. Ekman's basic emotional categories, Lazarus' cognitive appraisal theory of emotion, and a qualitative text analysis based on the text corpora of patients with BC were combined to determine the fine-grained emotional categories of the lexicon we constructed. Precision, recall, and the F1-score were used to evaluate the lexicon's performance. RESULTS The text corpora collected from patients in different stages of BC included 150 written materials, 17 interviews, and 6689 original posts and comments from Weibo, with a total of 1,923,593 Chinese characters. The emotional lexicon of patients with BC contained 9357 words and covered 8 fine-grained emotional categories: joy, anger, sadness, fear, disgust, surprise, somatic symptoms, and BC terminology. Experimental results showed that precision, recall, and the F1-score of positive emotional words were 98.42%, 99.73%, and 99.07%, respectively, and those of negative emotional words were 99.73%, 98.38%, and 99.05%, respectively, which all significantly outperformed the C-LIWC and HowNet. CONCLUSIONS The emotional lexicon with fine-grained emotional categories conforms to the characteristics of patients with BC. Its performance related to identifying and classifying domain-specific emotional words in BC is better compared to the C-LIWC and HowNet. This lexicon not only provides a new tool for sentiment analysis in the field of BC but also provides a new perspective for recognizing the specific emotional state and needs of patients with BC and formulating tailored emotional management plans.
Collapse
Affiliation(s)
- Chaixiu Li
- Nanfang Hospital, Southern Medical University, Guangzhou, China
- School of Nursing, Southern Medical University, Guangzhou, China
| | - Jiaqi Fu
- Nanfang Hospital, Southern Medical University, Guangzhou, China
- School of Nursing, Southern Medical University, Guangzhou, China
| | - Jie Lai
- Nanfang Hospital, Southern Medical University, Guangzhou, China
- School of Nursing, Southern Medical University, Guangzhou, China
| | - Lijun Sun
- China Electronic Product Reliability and Environmental Testing Institute, Guangzhou, China
| | - Chunlan Zhou
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Wenji Li
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Biao Jian
- China Electronic Product Reliability and Environmental Testing Institute, Guangzhou, China
| | - Shisi Deng
- Nanfang Hospital, Southern Medical University, Guangzhou, China
- School of Nursing, Southern Medical University, Guangzhou, China
| | - Yujie Zhang
- Nanfang Hospital, Southern Medical University, Guangzhou, China
- School of Nursing, Southern Medical University, Guangzhou, China
| | - Zihan Guo
- Nanfang Hospital, Southern Medical University, Guangzhou, China
- School of Nursing, Southern Medical University, Guangzhou, China
| | - Yusheng Liu
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Yanni Zhou
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Shihui Xie
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Mingyue Hou
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Ru Wang
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Qinjie Chen
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Yanni Wu
- Nanfang Hospital, Southern Medical University, Guangzhou, China
| |
Collapse
|
20
|
Di Cara NH, Maggio V, Davis OSP, Haworth CMA. Methodologies for Monitoring Mental Health on Twitter: Systematic Review. J Med Internet Res 2023; 25:e42734. [PMID: 37155236 PMCID: PMC10203928 DOI: 10.2196/42734] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 11/23/2022] [Accepted: 03/15/2023] [Indexed: 05/10/2023] Open
Abstract
BACKGROUND The use of social media data to predict mental health outcomes has the potential to allow for the continuous monitoring of mental health and well-being and provide timely information that can supplement traditional clinical assessments. However, it is crucial that the methodologies used to create models for this purpose are of high quality from both a mental health and machine learning perspective. Twitter has been a popular choice of social media because of the accessibility of its data, but access to big data sets is not a guarantee of robust results. OBJECTIVE This study aims to review the current methodologies used in the literature for predicting mental health outcomes from Twitter data, with a focus on the quality of the underlying mental health data and the machine learning methods used. METHODS A systematic search was performed across 6 databases, using keywords related to mental health disorders, algorithms, and social media. In total, 2759 records were screened, of which 164 (5.94%) papers were analyzed. Information about methodologies for data acquisition, preprocessing, model creation, and validation was collected, as well as information about replicability and ethical considerations. RESULTS The 164 studies reviewed used 119 primary data sets. There were an additional 8 data sets identified that were not described in enough detail to include, and 6.1% (10/164) of the papers did not describe their data sets at all. Of these 119 data sets, only 16 (13.4%) had access to ground truth data (ie, known characteristics) about the mental health disorders of social media users. The other 86.6% (103/119) of data sets collected data by searching keywords or phrases, which may not be representative of patterns of Twitter use for those with mental health disorders. The annotation of mental health disorders for classification labels was variable, and 57.1% (68/119) of the data sets had no ground truth or clinical input on this annotation. Despite being a common mental health disorder, anxiety received little attention. CONCLUSIONS The sharing of high-quality ground truth data sets is crucial for the development of trustworthy algorithms that have clinical and research utility. Further collaboration across disciplines and contexts is encouraged to better understand what types of predictions will be useful in supporting the management and identification of mental health disorders. A series of recommendations for researchers in this field and for the wider research community are made, with the aim of enhancing the quality and utility of future outputs.
Collapse
Affiliation(s)
- Nina H Di Cara
- School of Psychological Science, University of Bristol, Bristol, United Kingdom
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
| | - Valerio Maggio
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Oliver S P Davis
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
- The Alan Turing Institute, London, United Kingdom
| | - Claire M A Haworth
- School of Psychological Science, University of Bristol, Bristol, United Kingdom
- The Alan Turing Institute, London, United Kingdom
| |
Collapse
|
21
|
Malhotra K, Dagli MM, Santangelo G, Wathen C, Ghenbot Y, Goyal K, Bawa A, Ozturk AK, Welch WC. The Digital Impact of Neurosurgery Awareness Month: Retrospective Infodemiology Study. JMIR Form Res 2023; 7:e44754. [PMID: 37155226 DOI: 10.2196/44754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 02/07/2023] [Accepted: 04/17/2023] [Indexed: 05/10/2023] Open
Abstract
BACKGROUND Neurosurgery Awareness Month (August) was initiated by the American Association of Neurological Surgeons with the aim of bringing neurological conditions to the forefront and educating the public about these conditions. Digital media is an important tool for disseminating information and connecting with influencers, general public, and other stakeholders. Hence, it is crucial to understand the impact of awareness campaigns such as Neurosurgery Awareness Month to optimize resource allocation, quantify the efficiency and reach of these initiatives, and identify areas for improvement. OBJECTIVE The purpose of our study was to examine the digital impact of Neurosurgery Awareness Month globally and identify areas for further improvement. METHODS We used 4 social media (Twitter) assessment tools (Sprout Social, SocioViz, Sentiment Viz, and Symplur) and Google Trends to extract data using various search queries. Using regression analysis, trends were studied in the total number of tweets posted in August between 2014 and 2022. Two search queries were used in this analysis: one specifically targeting tweets related to Neurosurgery Awareness Month and the other isolating all neurosurgery-related posts. Total impressions and top influencers for #neurosurgery were calculated using Symplur's machine learning algorithm. To study the context of the tweets, we used SocioViz to isolate the top 100 popular hashtags, keywords, and collaborations between influencers. Network analysis was performed to illustrate the interactions and connections within the digital media environment using ForceAtlas2 model. Sentiment analysis was done to study the underlying emotion of the tweets. Google Trends was used to study the global search interest by studying relative search volume data. RESULTS A total of 10,007 users were identified as tweeting about neurosurgery during Neurosurgery Awareness Month using the "#neurosurgery" hashtag. These tweets generated over 29.14 million impressions globally. Of the top 10 most influential users, 5 were faculty neurosurgeons at US university hospitals. Other influential users included notable organizations and journals in the field of neurosurgery. The network analysis of the top 100 influencers showed a collaboration rate of 81%. However, only 1.6% of the total neurosurgery tweets were advocating about neurosurgery awareness during Neurosurgery Awareness Month, and only 13 tweets were posted by verified users using the #neurosurgeryawarenessmonth hashtag. The sentiment analysis revealed that the majority of the tweets about Neurosurgery Awareness Month were pleasant with subdued emotion. CONCLUSIONS The global digital impact of Neurosurgery Awareness Month is nascent, and support from other international organizations and neurosurgical influencers is needed to yield a significant digital reach. Increasing collaboration and involvement from underrepresented communities may help to increase the global reach. By better understanding the digital impact of Neurosurgery Awareness Month, future health care awareness campaigns can be optimized to increase global awareness of neurosurgery and the challenges facing the field.
Collapse
Affiliation(s)
- Kashish Malhotra
- Department of Surgery, Dayanand Medical College and Hospital, Ludhiana, India
| | - Mert Marcel Dagli
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Gabrielle Santangelo
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Connor Wathen
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Yohannes Ghenbot
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Kashish Goyal
- Department of Surgery, Dayanand Medical College and Hospital, Ludhiana, India
| | - Ashvind Bawa
- Department of Surgery, Dayanand Medical College and Hospital, Ludhiana, India
| | - Ali K Ozturk
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - William C Welch
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
22
|
Khaleghparast S, Maleki M, Hajianfar G, Soumari E, Oveisi M, Golandouz HM, Noohi F, Dehaki MG, Golpira R, Mazloomzadeh S, Arabian M, Kalayinia S. Development of a patients' satisfaction analysis system using machine learning and lexicon-based methods. BMC Health Serv Res 2023; 23:280. [PMID: 36959630 PMCID: PMC10037842 DOI: 10.1186/s12913-023-09260-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Accepted: 03/07/2023] [Indexed: 03/25/2023] Open
Abstract
BACKGROUND Patients' rights are integral to medical ethics. This study aimed to perform sentiment analysis and opinion mining on patients' messages by a combination of lexicon-based and machine learning methods to identify positive or negative comments and to determine the different ward and staff names mentioned in patients' messages. METHODS The level of satisfaction and observance of the rights of 250 service recipients of the hospital was evaluated through the related checklists by the evaluator. In total, 822 Persian messages, composed of 540 negative and 282 positive comments, were collected and labeled by the evaluator. Pre-processing was performed on the messages and followed by 2 feature vectors which were extracted from the messages, including the term frequency-inverse document frequency (TFIDF) vector and a combination of the multifeature (MF) (a lexicon-based method) and TFIDF (MF + TFIDF) vectors. Six feature selectors and 5 classifiers were used in this study. For the evaluations, 5-fold cross-validation with different metrics including area under the receiver operating characteristic curve (AUC), accuracy (ACC), F1 score, sensitivity (SEN), specificity (SPE) and Precision-Recall Curves (PRC) were reported. Message tag detection, which featured different hospital wards and identified staff names mentioned in the study patients' messages, was implemented by the lexicon-based method. RESULTS The best classifier was Multinomial Naïve Bayes in combination with MF + TFIDF feature vector and SelectFromModel (SFM) feature selection (ACC = 0.89 ± 0.03, AUC = 0.87 ± 0.03, F1 = 0.92 ± 0.03, SEN = 0.93 ± 0.04, and SPE = 0.82 ± 0.02, PRC-AUC = 0.97). Two methods of assessment by the evaluator and artificial intelligence as well as survey systems were compared. CONCLUSION Our results demonstrated that the lexicon-based method, in combination with machine learning classifiers, could extract sentiments in patients' comments and classify them into positive and negative categories. We also developed an online survey system to analyze patients' satisfaction in different wards and to remove conventional assessments by the evaluator.
Collapse
Affiliation(s)
- Shiva Khaleghparast
- Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Majid Maleki
- Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Ghasem Hajianfar
- Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Esmaeil Soumari
- Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Mehrdad Oveisi
- Department of Computer Science, University of British Columbia, Vancouver, BC, Canada
| | | | - Feridoun Noohi
- Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Maziar Gholampour Dehaki
- Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Reza Golpira
- Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Saeideh Mazloomzadeh
- Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Maedeh Arabian
- Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Samira Kalayinia
- Cardiogenetic Research Center, Medical and Research Center, Rajaie Cardiovascular, University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
23
|
Denecke K, Reichenpfader D. Sentiment analysis of clinical narratives: A scoping review. J Biomed Inform 2023; 140:104336. [PMID: 36958461 DOI: 10.1016/j.jbi.2023.104336] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 03/06/2023] [Accepted: 03/10/2023] [Indexed: 03/25/2023]
Abstract
A clinical sentiment is a judgment, thought or attitude promoted by an observation with respect to the health of an individual. Sentiment analysis has drawn attention in the healthcare domain for secondary use of data from clinical narratives, with a variety of applications including predicting the likelihood of emerging mental illnesses or clinical outcomes. The current state of research has not yet been summarized. This study presents results from a scoping review aiming at providing an overview of sentiment analysis of clinical narratives in order to summarize existing research and identify open research gaps. The scoping review was carried out in line with the PRISMA-ScR (Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews) guideline. Studies were identified by searching 4 electronic databases (e.g., PubMed, IEEE Xplore) in addition to conducting backward and forward reference list checking of the included studies. We extracted information on use cases, methods and tools applied, used datasets and performance of the sentiment analysis approach. Of 1,200 citations retrieved, 29 unique studies were included in the review covering a period of 8 years. Most studies apply general domain tools (e.g. TextBlob) and sentiment lexicons (e.g. SentiWordNet) for realizing use cases such as prediction of clinical outcomes; others proposed new domain-specific sentiment analysis approaches based on machine learning. Accuracy values between 71.5-88.2% are reported. Data used for evaluation and test are often retrieved from MIMIC databases or i2b2 challenges. Latest developments related to artificial neural networks are not yet fully considered in this domain. We conclude that future research should focus on developing a gold standard sentiment lexicon, adapted to the specific characteristics of clinical narratives. Efforts have to be made to either augment existing or create new high-quality labeled data sets of clinical narratives. Last, the suitability of state-of-the-art machine learning methods for natural language processing and in particular transformer-based models should be investigated for their application for sentiment analysis of clinical narratives.
Collapse
Affiliation(s)
- Kerstin Denecke
- Bern University of Applied Sciences, Institute for Medical Informatics, Quellgasse 21, Biel/Bienne, 2502, Bern, Switzerland.
| | - Daniel Reichenpfader
- Bern University of Applied Sciences, Institute for Medical Informatics, Quellgasse 21, Biel/Bienne, 2502, Bern, Switzerland
| |
Collapse
|
24
|
Wu J, Zhang G, Xing Y, Liu Y, Zhang Z, Dong Y, Herrera-Viedma E. A sentiment analysis driven method based on public and personal preferences with correlated attributes to select online doctors. APPL INTELL 2023; 53:1-22. [PMID: 36844914 PMCID: PMC9940095 DOI: 10.1007/s10489-023-04485-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/23/2023] [Indexed: 02/25/2023]
Abstract
This paper proposes a method to assist patients in finding the most appropriate doctor for online medical consultation. To do that, it constructs an online doctor selection decision-making method that considers the correlation attributes, in which the measure of attribute correlation is derived from the history real decision data. To combine public and personal preference with correlated attributes, it proposes a Choquet integral based comprehensive online doctor ranking method. In detail, a two stage classification model based on BERT (Bidirectional Encoder Representations from Transformers) is used to extract service features from unstructured text reviews. Then, 2-additive fuzzy measure is adopted to represent the patient public group aggregated attribute preference. Next, a novel optimization model is proposed to combine the public preference and personal preference. Finally, a case study of dxy.com is carried out to illustrate the procedure of the method. The comparison result between proposed method and other traditional MADM (multi-attribute decision-making) methods prove its rationality.
Collapse
Affiliation(s)
- Jian Wu
- School of Economics and Management, Shanghai Maritime University, Shanghai, 201306 China
- Center for Artificial Intelligence and Decision Sciences, Shanghai Maritime University, Shanghai, 201306 China
| | - Guangyin Zhang
- School of Economics and Management, Shanghai Maritime University, Shanghai, 201306 China
- Center for Artificial Intelligence and Decision Sciences, Shanghai Maritime University, Shanghai, 201306 China
| | - Yumei Xing
- School of Economics and Management, Shanghai Maritime University, Shanghai, 201306 China
- Center for Artificial Intelligence and Decision Sciences, Shanghai Maritime University, Shanghai, 201306 China
| | - Yujia Liu
- School of Economics and Management, Shanghai Maritime University, Shanghai, 201306 China
- Center for Artificial Intelligence and Decision Sciences, Shanghai Maritime University, Shanghai, 201306 China
| | - Zhen Zhang
- Institute of Systems Engineering, Dalian University of Technology, Dalian, 116024 China
| | - Yucheng Dong
- Business School, Sichuan University, Chengdu, 610065 China
| | | |
Collapse
|
25
|
Cui J, Wang Z, Ho SB, Cambria E. Survey on sentiment analysis: evolution of research methods and topics. Artif Intell Rev 2023; 56:1-42. [PMID: 36628328 PMCID: PMC9816550 DOI: 10.1007/s10462-022-10386-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/29/2022] [Indexed: 01/09/2023]
Abstract
Sentiment analysis, one of the research hotspots in the natural language processing field, has attracted the attention of researchers, and research papers on the field are increasingly published. Many literature reviews on sentiment analysis involving techniques, methods, and applications have been produced using different survey methodologies and tools, but there has not been a survey dedicated to the evolution of research methods and topics of sentiment analysis. There have also been few survey works leveraging keyword co-occurrence on sentiment analysis. Therefore, this study presents a survey of sentiment analysis focusing on the evolution of research methods and topics. It incorporates keyword co-occurrence analysis with a community detection algorithm. This survey not only compares and analyzes the connections between research methods and topics over the past two decades but also uncovers the hotspots and trends over time, thus providing guidance for researchers. Furthermore, this paper presents broad practical insights into the methods and topics of sentiment analysis, while also identifying technical directions, limitations, and future work.
Collapse
Affiliation(s)
- Jingfeng Cui
- Institute of High Performance Computing, A*STAR, 1 Fusionopolis Way, Singapore, 138632 Singapore
- School of Information Management, Nanjing Agricultural University, 1 Weigang, Nanjing, 210095 China
| | - Zhaoxia Wang
- School of Computing and Information Systems, Singapore Management University, 80 Stamford Rd, Singapore, 178902 Singapore
| | - Seng-Beng Ho
- Institute of High Performance Computing, A*STAR, 1 Fusionopolis Way, Singapore, 138632 Singapore
| | - Erik Cambria
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798 Singapore
| |
Collapse
|
26
|
Textual emotion detection in health: Advances and applications. J Biomed Inform 2023; 137:104258. [PMID: 36528329 DOI: 10.1016/j.jbi.2022.104258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 11/24/2022] [Accepted: 11/27/2022] [Indexed: 12/23/2022]
Abstract
Textual Emotion Detection (TED) is a rapidly growing area in Natural Language Processing (NLP) that aims to detect emotions expressed through text. In this paper, we provide a review of the latest research and development in TED as applied in health and medicine. We focus on medical and non-medical data types, use cases, and methods where TED has been integral in supporting decision-making. The application of NLP technologies in health, and particularly TED, requires high confidence that these technologies and technology-aided treatment will first, do no harm. Therefore, this review also aims to assess the accuracy of TED systems and provide an update on the state of the technology. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines were used in this review. With a specific focus on the identification of different human emotions in text, the more general sentiment analysis studies that only recognize the polarity of text were excluded. A total of 66 papers met the inclusion criteria. This review found that TED in health and medicine is mainly used in the detection of depression, suicidal ideation, and the mental status of patients with asthma, Alzheimer's disease, cancer, and diabetes with major data sources of social media, healthcare services, and counseling centers. Approximately, 44% of the research in the domain is related to COVID-19, investigating the public health response to vaccinations and the emotional response of the public. In most cases, deep learning-based NLP techniques were found to be preferred over other methods due to their superior performance. Developing methods for implementing and evaluating dimensional emotional models, resolving annotation challenges by utilizing health-related lexicons, and using deep learning techniques for multi-faceted and real-time applications were found to be among the main avenues for further development of TED applications in health.
Collapse
|
27
|
Goldman HH, Porcino J, Divita G, Zirikly A, Desmet B, Sacco M, Marfeo E, McDonough C, Rasch E, Chan L. Informatics Research on Mental Health Functioning: Decision Support for the Social Security Administration Disability Program. Psychiatr Serv 2023; 74:56-62. [PMID: 35652194 PMCID: PMC10501504 DOI: 10.1176/appi.ps.202200056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The disability determination process of the Social Security Administration's (SSA's) disability program requires assessing work-related functioning for individual claimants alleging disability due to mental impairment. This task is particularly challenging because the determination process involves the review of a large file of information, including objective medical evidence and self-reports from claimants, families, and former employers. To improve this decision-making process, SSA entered an interagency agreement with the Rehabilitation Medicine Department, Epidemiology and Biostatistics Section, in the Clinical Center of the National Institutes of Health, intending to use data science and informatics to develop decision support tools. This collaborative effort over the past decade has led to the development of the Work Disability-Functional Assessment Battery and has initiated an approach to applying natural language processing to the review of claimants' files for information on mental health functioning. This informatics research collaboration holds promise for improving the process of disability determination for individuals with mental impairments who make claims at the SSA.
Collapse
Affiliation(s)
- Howard H Goldman
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Julia Porcino
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Guy Divita
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Ayah Zirikly
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Bart Desmet
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Maryanne Sacco
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Elizabeth Marfeo
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Christine McDonough
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Elizabeth Rasch
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| | - Leighton Chan
- Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland (all authors); Department of Psychiatry, University of Maryland School of Medicine, Baltimore (Goldman); Department of Computer Science, Johns Hopkins University, Baltimore (Zirikly); Department of Occupational Therapy, Tufts University, Medford, Massachusetts (Marfeo); Department of Physical Therapy, University of Pittsburgh, Pittsburgh (McDonough)
| |
Collapse
|
28
|
Canny A, Mason B, Atkins C, Patterson R, Moussa L, Boyd K. Online public information about advance care planning: An evaluation of UK and international websites. Digit Health 2023; 9:20552076231180438. [PMID: 37377564 PMCID: PMC10291539 DOI: 10.1177/20552076231180438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 05/19/2023] [Indexed: 06/29/2023] Open
Abstract
Introduction Healthcare information is increasingly internet-based. Standards require websites to be 'perceivable, operable, understandable and robust' with relevant content for citizens in appropriate language. This study examined UK and international websites offering public healthcare information on advance care planning (ACP) using current recommendations for website accessibility and content and informed by a public engagement exercise. Methods Google searches identified websites in English from health service providers, governmental or third sector organisations based in the UK and internationally. Target keywords that would be used by a member of the public informed the search terms. Data extraction used criterion-based assessment and web content analysis of the first two pages of each search result. Public patient representatives as key members of the multidisciplinary research team guided the development of the evaluation criteria. Results A total of 1158 online searches identified 89 websites, reduced to 29 by inclusion/exclusion criteria. Most sites met international recommendations for knowledge/understanding about ACP. Differences in terminology, lack of information about ACP limitations and non-adherence to recommended reading levels, accessibility standards and translation options were apparent. Sites targeting members of the public used more positive, non-technical language than those for both professional and lay users. Conclusions Some websites met accepted standards required to facilitate understanding and public engagement in ACP. Others could be improved significantly. Website providers have important roles and responsibilities in increasing people's understanding of their health conditions, future care options and ability to take an active role in planning for their health and care.
Collapse
Affiliation(s)
- Anne Canny
- Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh, Edinburgh, UK
| | - Bruce Mason
- Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh, Edinburgh, UK
| | - Clare Atkins
- Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh, Edinburgh, UK
| | | | - Lorna Moussa
- Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh, Edinburgh, UK
| | - Kirsty Boyd
- Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
29
|
Lee ITL, Juang SE, Chen ST, Ko C, Ma KSK. Sentiment analysis of tweets on alopecia areata, hidradenitis suppurativa, and psoriasis: Revealing the patient experience. Front Med (Lausanne) 2022; 9:996378. [PMID: 36388938 PMCID: PMC9660311 DOI: 10.3389/fmed.2022.996378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/26/2022] [Indexed: 11/26/2022] Open
Abstract
Background Chronic dermatologic disorders can cause significant emotional distress. Sentiment analysis of disease-related tweets helps identify patients' experiences of skin disease. Objective To analyze the expressed sentiments in tweets related to alopecia areata (AA), hidradenitis suppurativa (HS), and psoriasis (PsO) in comparison to fibromyalgia (FM). Methods This is a cross-sectional analysis of Twitter users' expressed sentiment on AA, HS, PsO, and FM. Tweets related to the diseases of interest were identified with keywords and hashtags for one month (April, 2022) using the Twitter standard application programming interface (API). Text, account types, and numbers of retweets and likes were collected. The sentiment analysis was performed by the R "tidytext" package using the AFINN lexicon. Results A total of 1,505 tweets were randomly extracted, of which 243 (16.15%) referred to AA, 186 (12.36%) to HS, 510 (33.89%) to PsO, and 566 (37.61%) to FM. The mean sentiment score was -0.239 ± 2.90. AA, HS, and PsO had similar sentiment scores (p = 0.482). Although all skin conditions were associated with a negative polarity, their average was significantly less negative than FM (p < 0.0001). Tweets from private accounts were more negative, especially for AA (p = 0.0082). Words reflecting patients' psychological states varied in different diseases. "Anxiety" was observed in posts on AA and FM but not posts on HS and PsO, while "crying" was frequently used in posts on HS. There was no definite correlation between the sentiment score and the number of retweets or likes, although negative AA tweets from public accounts received more retweets (p = 0.03511) and likes (p = 0.0228). Conclusion The use of Twitter sentiment analysis is a promising method to document patients' experience of skin diseases, which may improve patient care through bridging misconceptions and knowledge gaps between patients and healthcare professionals.
Collapse
Affiliation(s)
- Irene Tai-Lin Lee
- Department of Radiology, Far Eastern Memorial Hospital, New Taipei City, Taiwan
| | - Sin-Ei Juang
- Department of Anesthesiology, College of Medicine, Kaohsiung Chang Gung Memorial Hospital, Chang Gung University, Kaohsiung, Taiwan
| | - Steven T. Chen
- Department of Dermatology, Harvard Medical School, Massachusetts General Hospital, Boston, MA, United States
| | - Christine Ko
- Department of Dermatology, Yale University, New Haven, CT, United States
- Department of Pathology, Yale University, New Haven, CT, United States
| | - Kevin Sheng-Kai Ma
- Department of Dermatology, Harvard Medical School, Massachusetts General Hospital, Boston, MA, United States
- Center for Global Health, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, United States
- College of Electrical Engineering and Computer Science, Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
30
|
Natural Language Processing Techniques for Text Classification of Biomedical Documents: A Systematic Review. INFORMATION 2022. [DOI: 10.3390/info13100499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The classification of biomedical literature is engaged in a number of critical issues that physicians are expected to answer. In many cases, these issues are extremely difficult. This can be conducted for jobs such as diagnosis and treatment, as well as efficient representations of ideas such as medications, procedure codes, and patient visits, as well as in the quick search of a document or disease classification. Pathologies are being sought from clinical notes, among other sources. The goal of this systematic review is to analyze the literature on various problems of classification of medical texts of patients based on criteria such as: the quality of the evaluation metrics used, the different methods of machine learning applied, the different data sets, to highlight the best methods in this type of problem, and to identify the different challenges associated. The study covers the period from 1 January 2016 to 10 July 2022. We used multiple databases and archives of research articles, including Web Of Science, Scopus, MDPI, arXiv, IEEE, and ACM, to find 894 articles dealing with the subject of text classification, which we were able to filter using inclusion and exclusion criteria. Following a thorough review, we selected 33 articles dealing with biological text categorization issues. Following our investigation, we discovered two major issues linked to the methodology and data used for biomedical text classification. First, there is the data-centric challenge, followed by the data quality challenge.
Collapse
|
31
|
Chen JS, Baxter SL. Applications of natural language processing in ophthalmology: present and future. Front Med (Lausanne) 2022; 9:906554. [PMID: 36004369 PMCID: PMC9393550 DOI: 10.3389/fmed.2022.906554] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 05/31/2022] [Indexed: 11/13/2022] Open
Abstract
Advances in technology, including novel ophthalmic imaging devices and adoption of the electronic health record (EHR), have resulted in significantly increased data available for both clinical use and research in ophthalmology. While artificial intelligence (AI) algorithms have the potential to utilize these data to transform clinical care, current applications of AI in ophthalmology have focused mostly on image-based deep learning. Unstructured free-text in the EHR represents a tremendous amount of underutilized data in big data analyses and predictive AI. Natural language processing (NLP) is a type of AI involved in processing human language that can be used to develop automated algorithms using these vast quantities of available text data. The purpose of this review was to introduce ophthalmologists to NLP by (1) reviewing current applications of NLP in ophthalmology and (2) exploring potential applications of NLP. We reviewed current literature published in Pubmed and Google Scholar for articles related to NLP and ophthalmology, and used ancestor search to expand our references. Overall, we found 19 published studies of NLP in ophthalmology. The majority of these publications (16) focused on extracting specific text such as visual acuity from free-text notes for the purposes of quantitative analysis. Other applications included: domain embedding, predictive modeling, and topic modeling. Future ophthalmic applications of NLP may also focus on developing search engines for data within free-text notes, cleaning notes, automated question-answering, and translating ophthalmology notes for other specialties or for patients, especially with a growing interest in open notes. As medicine becomes more data-oriented, NLP offers increasing opportunities to augment our ability to harness free-text data and drive innovations in healthcare delivery and treatment of ophthalmic conditions.
Collapse
Affiliation(s)
- Jimmy S. Chen
- Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, CA, United States
- Health Department of Biomedical Informatics, University of California San Diego, La Jolla, CA, United States
| | - Sally L. Baxter
- Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, CA, United States
- Health Department of Biomedical Informatics, University of California San Diego, La Jolla, CA, United States
| |
Collapse
|
32
|
Ramjee D, Smith LH, Doanvo A, Charpignon ML, McNulty-Nebel A, Lett E, Desai AN, Majumder MS. Evaluating criminal justice reform during COVID-19: The need for a novel sentiment analysis package. PLOS DIGITAL HEALTH 2022; 1:e0000063. [PMID: 36812565 PMCID: PMC9931240 DOI: 10.1371/journal.pdig.0000063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 05/15/2022] [Indexed: 11/18/2022]
Abstract
The health and safety of incarcerated persons and correctional personnel have been prominent in the U.S. news media discourse during the COVID-19 pandemic. Examining changing attitudes toward the health of the incarcerated population is imperative to better assess the extent to which the general public favors criminal justice reform. However, existing natural language processing lexicons that underlie current sentiment analysis (SA) algorithms may not perform adequately on news articles related to criminal justice due to contextual complexities. News discourse during the pandemic has highlighted the need for a novel SA lexicon and algorithm (i.e., an SA package) tailored for examining public health policy in the context of the criminal justice system. We analyzed the performance of existing SA packages on a corpus of news articles at the intersection of COVID-19 and criminal justice collected from state-level outlets between January and May 2020. Our results demonstrated that sentence sentiment scores provided by three popular SA packages can differ considerably from manually-curated ratings. This dissimilarity was especially pronounced when the text was more polarized, whether negatively or positively. A randomly selected set of 1,000 manually scored sentences, and the corresponding binary document term matrices, were used to train two new sentiment prediction algorithms (i.e., linear regression and random forest regression) to verify the performance of the manually-curated ratings. By better accounting for the unique context in which incarceration-related terminologies are used in news media, both of our proposed models outperformed all existing SA packages considered for comparison. Our findings suggest that there is a need to develop a novel lexicon, and potentially an accompanying algorithm, for analysis of text related to public health within the criminal justice system, as well as criminal justice more broadly.
Collapse
Affiliation(s)
- Divya Ramjee
- Department of Justice, Law and Criminology, School of Public Affairs, American University, Washington, District of Columbia, United States of America
- * E-mail: (DR); , (AND)
| | - Louisa H. Smith
- Roux Institute, Northeastern University, Portland, Maine, United States of America
| | - Anhvinh Doanvo
- COVID-19 Dispersed Volunteer Research Network, Boston, Massachusetts, United States of America
| | - Marie-Laure Charpignon
- Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Alyssa McNulty-Nebel
- Department of Epidemiology and Biostatistics, School of Public Health, Texas A&M University, College Station, Texas, United States of America
| | - Elle Lett
- Computational Health Informatics Program, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Perelman School of Medicine, University of Pennsylvania, 3400 Civic Center Boulevard, Philadelphia, Pennsylvania, United States of America
| | - Angel N. Desai
- Division of Infectious Disease, University of California Davis Health, Sacramento, California, United States of America
- * E-mail: (DR); , (AND)
| | - Maimuna S. Majumder
- Computational Health Informatics Program, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
33
|
Kingsley J, Diekmann L, Egerer MH, Lin BB, Ossola A, Marsh P. Experiences of gardening during the early stages of the COVID-19 pandemic. Health Place 2022; 76:102854. [PMID: 35842955 PMCID: PMC9242931 DOI: 10.1016/j.healthplace.2022.102854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 06/22/2022] [Accepted: 06/23/2022] [Indexed: 11/24/2022]
Abstract
Gardening has the potential to improve health and wellbeing, especially during crises. Using an international survey of gardeners (n = 3743), this study aimed to understand everyday gardening experiences, perspectives and attitudes during early stages of the COVID-19 pandemic in 2020. Our qualitative reflexive thematic and sentiment analyses show that during the first months of the COVID-19 pandemic, gardening seemed to create a safe and positive space where people could socially connect, learn and be creative. Participants had more time to garden during the pandemic, which led to enhanced connections with family members and neighbours, and the ability to spend time in a safe outdoor environment. More time gardening allowed for innovative and new gardening practices that provided enjoyment for many participants. However, our research also highlighted barriers to gardening (e.g. lack of access to garden spaces and materials). Our results illustrate the multiple benefits of gardening apparent during COVID-19 through a lens of the social-ecological model of health.
Collapse
Affiliation(s)
- Jonathan Kingsley
- School of Health Sciences, Swinburne University of Technology, 12 Wakefield Street (Swinburne Place West), Hawthorn, Victoria, 3122, Australia; Centre of Urban Transitions, Swinburne University of Technology, Level 1 EW Building, Hawthorn, Victoria, 3122, Australia.
| | - Lucy Diekmann
- University of California Cooperative Extension, 1553 Berger Dr., San Jose, CA, 95112, USA.
| | - Monika H Egerer
- Department of Life Science Systems, School of Life Sciences, Technical University of Munich, Hans Carl-von-Carlowitz-Platz 2, 85354, Freising, Germany.
| | - Brenda B Lin
- CSIRO Land & Water, GPO Box 2583, Brisbane, QLD, 4001, Australia.
| | - Alessandro Ossola
- University of California, Davis, CA, USA; Macquarie University Sydney and University of Melbourne, Australia.
| | - Pauline Marsh
- Centre for Rural Health, University of Tasmania, 1 Liverpool St, Hobart, Tasmania, 7001, Australia.
| |
Collapse
|
34
|
Walsh J, Dwumfour C, Cave J, Griffiths F. Spontaneously generated online patient experience data - how and why is it being used in health research: an umbrella scoping review. BMC Med Res Methodol 2022; 22:139. [PMID: 35562661 PMCID: PMC9106384 DOI: 10.1186/s12874-022-01610-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 04/13/2022] [Indexed: 11/10/2022] Open
Abstract
PURPOSE Social media has led to fundamental changes in the way that people look for and share health related information. There is increasing interest in using this spontaneously generated patient experience data as a data source for health research. The aim was to summarise the state of the art regarding how and why SGOPE data has been used in health research. We determined the sites and platforms used as data sources, the purposes of the studies, the tools and methods being used, and any identified research gaps. METHODS A scoping umbrella review was conducted looking at review papers from 2015 to Jan 2021 that studied the use of SGOPE data for health research. Using keyword searches we identified 1759 papers from which we included 58 relevant studies in our review. RESULTS Data was used from many individual general or health specific platforms, although Twitter was the most widely used data source. The most frequent purposes were surveillance based, tracking infectious disease, adverse event identification and mental health triaging. Despite the developments in machine learning the reviews included lots of small qualitative studies. Most NLP used supervised methods for sentiment analysis and classification. Very early days, methods need development. Methods not being explained. Disciplinary differences - accuracy tweaks vs application. There is little evidence of any work that either compares the results in both methods on the same data set or brings the ideas together. CONCLUSION Tools, methods, and techniques are still at an early stage of development, but strong consensus exists that this data source will become very important to patient centred health research.
Collapse
Affiliation(s)
- Julia Walsh
- Warwick Medical School, University of Warwick, Coventry, UK.
| | | | - Jonathan Cave
- Department of Economics, University of Warwick, Coventry, UK
| | - Frances Griffiths
- Warwick Medical School, University of Warwick, Coventry, UK.,Centre for Health Policy, University of the Witwatersrand, Johannesburg, South Africa
| |
Collapse
|
35
|
The Case of Aspect in Sentiment Analysis: Seeking Attention or Co-Dependency? MACHINE LEARNING AND KNOWLEDGE EXTRACTION 2022. [DOI: 10.3390/make4020021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
(1) Background: Aspect-based sentiment analysis (SA) is a natural language processing task, the aim of which is to classify the sentiment associated with a specific aspect of a written text. The performance of SA methods applied to texts related to health and well-being lags behind that of other domains. (2) Methods: In this study, we present an approach to aspect-based SA of drug reviews. Specifically, we analysed signs and symptoms, which were extracted automatically using the Unified Medical Language System. This information was then passed onto the BERT language model, which was extended by two layers to fine-tune the model for aspect-based SA. The interpretability of the model was analysed using an axiomatic attribution method. We performed a correlation analysis between the attribution scores and syntactic dependencies. (3) Results: Our fine-tuned model achieved accuracy of approximately 95% on a well-balanced test set. It outperformed our previous approach, which used syntactic information to guide the operation of a neural network and achieved an accuracy of approximately 82%. (4) Conclusions: We demonstrated that a BERT-based model of SA overcomes the negative bias associated with health-related aspects and closes the performance gap against the state-of-the-art in other domains.
Collapse
|
36
|
Conducting sentiment analysis: Lei L. & Liu D. Elements in Corpus Linguistics, CUP. LANG RESOUR EVAL 2022. [DOI: 10.1007/s10579-022-09593-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
37
|
Jang J, Yoon S, Son G, Kang M, Choeh JY, Choi KH. Predicting Personality and Psychological Distress Using Natural Language Processing: A Study Protocol. Front Psychol 2022; 13:865541. [PMID: 35465529 PMCID: PMC9022676 DOI: 10.3389/fpsyg.2022.865541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 03/22/2022] [Indexed: 11/13/2022] Open
Abstract
Background Self-report multiple choice questionnaires have been widely utilized to quantitatively measure one's personality and psychological constructs. Despite several strengths (e.g., brevity and utility), self-report multiple choice questionnaires have considerable limitations in nature. With the rise of machine learning (ML) and Natural language processing (NLP), researchers in the field of psychology are widely adopting NLP to assess psychological construct to predict human behaviors. However, there is a lack of connections between the work being performed in computer science and that of psychology due to small data sets and unvalidated modeling practices. Aims The current article introduces the study method and procedure of phase II which includes the interview questions for the five-factor model (FFM) of personality developed in phase I. This study aims to develop the interview (semi-structured) and open-ended questions for the FFM-based personality assessments, specifically designed with experts in the field of clinical and personality psychology (phase 1), and to collect the personality-related text data using the interview questions and self-report measures on personality and psychological distress (phase 2). The purpose of the study includes examining the relationship between natural language data obtained from the interview questions, measuring the FFM personality constructs, and psychological distress to demonstrate the validity of the natural language-based personality prediction. Methods Phase I (pilot) study was conducted to fifty-nine native Korean adults to acquire the personality-related text data from the interview (semi-structured) and open-ended questions based on the FFM of personality. The interview questions were revised and finalized with the feedback from the external expert committee, consisting of personality and clinical psychologists. Based on the established interview questions, a total of 300 Korean adults will be recruited using a convenience sampling method via online survey. The text data collected from interviews will be analyzed using the natural language processing. The results of the online survey including demographic data, depression, anxiety, and personality inventories will be analyzed together in the model to predict individuals' FFM of personality and the level of psychological distress (phase 2).
Collapse
Affiliation(s)
- Jihee Jang
- School of Psychology, Korea University, Seoul, South Korea
| | - Seowon Yoon
- School of Psychology, Korea University, Seoul, South Korea
| | - Gaeun Son
- School of Psychology, Korea University, Seoul, South Korea
| | - Minjung Kang
- School of Psychology, Korea University, Seoul, South Korea
| | - Joon Yeon Choeh
- Department of Software, Sejong University, Seoul, South Korea
| | - Kee-Hong Choi
- School of Psychology, Korea University, Seoul, South Korea
- KU Mind Health Institute, Korea University, Seoul, South Korea
| |
Collapse
|
38
|
Verma S. Sentiment analysis of public services for smart society: Literature review and future research directions. GOVERNMENT INFORMATION QUARTERLY 2022. [DOI: 10.1016/j.giq.2022.101708] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
39
|
Davis BD, McKnight DE, Teodorescu D, Quan-Haase A, Chunara R, Fyshe A, Lizotte DJ. Quantifying depression-related language on social media during the COVID-19 pandemic. Int J Popul Data Sci 2022; 5:1716. [PMID: 35516163 PMCID: PMC9052361 DOI: 10.23889/ijpds.v5i4.1716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
Abstract
Introduction The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. Objectives 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. Methods We create a word embedding based on the posts in Reddit's /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic's beginning through June 2021. Results We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. Conclusions Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health.
Collapse
Affiliation(s)
- Brent D. Davis
- Department of Computer Science, Western University, London, ON, Canada, N6A 3K7
| | - Dawn Estes McKnight
- Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2R3
| | - Daniela Teodorescu
- Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2R3
| | - Anabel Quan-Haase
- Department of Sociology, Western University, London, ON, Canada, N6A 3K7
- Faculty of Information and Media Studies, Western University, London, ON, Canada, N6A 3K7
| | - Rumi Chunara
- Department of Computer Science & Engineering, New York University, New York, NY, 10003
- Department of Biostatistics, New York University, New York, NY, 10003
| | - Alona Fyshe
- Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2R3
- Department of Psychology, University of Alberta, Edmonton, AB, Canada, T6G2R3
| | - Daniel J. Lizotte
- Department of Computer Science, Western University, London, ON, Canada, N6A 3K7
- Department of Epidemiology and Biostatistics, Western University, London, ON,N6A 3K7
| |
Collapse
|
40
|
Lu TJ, Nguyen AXL, Trinh XV, Wu AY. Sentiment Analysis Surrounding Blepharoplasty in Online Health Forums. PLASTIC AND RECONSTRUCTIVE SURGERY-GLOBAL OPEN 2022; 10:e4213. [PMID: 35492229 PMCID: PMC9038503 DOI: 10.1097/gox.0000000000004213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 01/31/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Upper and lower blepharoplasty are among the most common procedures in aesthetic surgery and are often emotionally laden due to the subjective nature of outcomes and implications with beauty and self-identity. This article capitalizes on the increasing wealth of patient-provided health information online and is the first to analyze the emotions surrounding blepharoplasty discussions in an open internet health forum, MedHelp. METHODS We used Python to scrape MedHelp for threads that contained "blepharoplasty" and then used IBM Watson Natural Language Understanding to perform sentiment analyses, calculating a general sentiment score (-1 to +1) as well as emotion scores for anger, sadness, joy, fear, and disgust (0 to 1) for posts and keywords contained within the posts. Keywords were then manually grouped into five distinct clinical categories: symptoms, doctor, treatment, medication, and body. RESULTS We collected 52 threads containing "blepharoplasty," yielding 154 posts and 1365 keywords. The average sentiment score was negative among all posts (-0.15) and keywords (-0.30). Among all posts and keywords, sadness had the highest score and disgust had the lowest score. CONCLUSIONS Fear and sadness are the predominant emotions for blepharoplasty patients online, and the most negative symptoms cited are not ones that surgeons typically expect.
Collapse
Affiliation(s)
- Tracy J. Lu
- From the Department of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, Calif
| | | | - Xuan-Vi Trinh
- Department of Computer Science, McGill University, Montreal, QC, Canada
| | - Albert Y. Wu
- Department of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, Calif
| |
Collapse
|
41
|
Ramadi KB, Mehta R, He D, Chao S, Chu Z, Atun R, Nguyen FT. Grass-roots entrepreneurship complements traditional top-down innovation in lung and breast cancer. NPJ Digit Med 2022; 5:10. [PMID: 35064182 PMCID: PMC8782943 DOI: 10.1038/s41746-021-00545-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 11/30/2021] [Indexed: 11/08/2022] Open
Abstract
The majority of biomedical research is funded by public, governmental, and philanthropic grants. These initiatives often shape the avenues and scope of research across disease areas. However, the prioritization of disease-specific funding is not always reflective of the health and social burden of each disease. We identify a prioritization disparity between lung and breast cancers, whereby lung cancer contributes to a substantially higher socioeconomic cost on society yet receives significantly less funding than breast cancer. Using search engine results and natural language processing (NLP) of Twitter tweets, we show that this disparity correlates with enhanced public awareness and positive sentiment for breast cancer. Interestingly, disease-specific venture activity does not correlate with funding or public opinion. We use outcomes from recent early-stage innovation events focused on lung cancer to highlight the complementary mechanism by which bottom-up "grass-roots" initiatives can identify and tackle under-prioritized conditions.
Collapse
Affiliation(s)
- Khalil B Ramadi
- Hacking Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA.
- School of Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.
- Division of Engineering, New York University Abu Dhabi, Abu Dhabi, UAE.
- Tandon School of Engineering, New York University, New York, NY, USA.
| | - Rhea Mehta
- Hacking Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Economics, Wellesley College, Wellesley, MA, USA
| | - David He
- Hacking Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- School of Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Sichen Chao
- Hacking Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- School of Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Zen Chu
- Hacking Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Rifat Atun
- Division of Engineering, New York University Abu Dhabi, Abu Dhabi, UAE
| | - Freddy T Nguyen
- Hacking Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- Innovation Initiative, Massachusetts Institute of Technology, Cambridge, MA, USA
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
42
|
Park S, Choi SH, Song YK, Kwon JW. Comparison of Online Patient Reviews and National Pharmacovigilance Data for Tramadol-Related Adverse Events: Comparative Observational Study. JMIR Public Health Surveill 2022; 8:e33311. [PMID: 34982723 PMCID: PMC8767477 DOI: 10.2196/33311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 11/08/2021] [Accepted: 11/27/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Tramadol is known to cause fewer adverse events (AEs) than other opioids. However, recent research has raised concerns about various safety issues. OBJECTIVE We aimed to explore these new AEs related to tramadol using social media and conventional pharmacovigilance data. METHODS This study used 2 data sets, 1 from patients' drug reviews on WebMD (January 2007 to January 2021) and 1 from the US Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS; January 2016 to December 2020). We analyzed 2062 and 29,350 patient reports from WebMD and FAERS, respectively. Patient posts on WebMD were manually assigned the preferred terms of the Medical Dictionary for Regulatory Activities. To analyze AEs from FAERS, a disproportionality analysis was performed with 3 measures: proportional reporting ratio, reporting odds ratio, and information component. RESULTS From the 869 AEs reported, we identified 125 new signals related to tramadol use not listed on the drug label that satisfied all 3 signal detection criteria. In addition, 20 serious AEs were selected from new signals. Among new serious AEs, vascular disorders had the largest signal detection criteria value. Based on the disproportionality analysis and patients' symptom descriptions, tramadol-induced pain might also be an unexpected AE. CONCLUSIONS This study detected several novel signals related to tramadol use, suggesting newly identified possible AEs. Additionally, this study indicates that unexpected AEs can be detected using social media analysis alongside traditional pharmacovigilance data.
Collapse
Affiliation(s)
- Susan Park
- BK21 FOUR Community-Based Intelligent Novel Drug Discovery Education Unit, College of Pharmacy, Research Institute of Pharmaceutical Sciences, Kyungpook National University, Daegu, Republic of Korea
| | - So Hyun Choi
- Department of Statistics, Kyungpook National University, Daegu, Republic of Korea
| | - Yun-Kyoung Song
- College of Pharmacy, Daegu Catholic University, Gyeongsan-si, Gyeongbuk, Republic of Korea
| | - Jin-Won Kwon
- BK21 FOUR Community-Based Intelligent Novel Drug Discovery Education Unit, College of Pharmacy, Research Institute of Pharmaceutical Sciences, Kyungpook National University, Daegu, Republic of Korea
| |
Collapse
|
43
|
Artificial Intelligence in Surgery. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
44
|
Rahim AIA, Ibrahim MI, Chua SL, Musa KI. Hospital Facebook Reviews Analysis Using a Machine Learning Sentiment Analyzer and Quality Classifier. Healthcare (Basel) 2021; 9:1679. [PMID: 34946405 PMCID: PMC8701188 DOI: 10.3390/healthcare9121679] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 11/30/2021] [Accepted: 12/02/2021] [Indexed: 02/05/2023] Open
Abstract
While experts have recognised the significance and necessity of social media integration in healthcare, no systematic method has been devised in Malaysia or Southeast Asia to include social media input into the hospital quality improvement process. The goal of this work is to explain how to develop a machine learning system for classifying Facebook reviews of public hospitals in Malaysia by using service quality (SERVQUAL) dimensions and sentiment analysis. We developed a Machine Learning Quality Classifier (MLQC) based on the SERVQUAL model and a Machine Learning Sentiment Analyzer (MLSA) by manually annotated multiple batches of randomly chosen reviews. Logistic regression (LR), naive Bayes (NB), support vector machine (SVM), and other methods were used to train the classifiers. The performance of each classifier was tested using 5-fold cross validation. For topic classification, the average F1-score was between 0.687 and 0.757 for all models. In a 5-fold cross validation of each SERVQUAL dimension and in sentiment analysis, SVM consistently outperformed other methods. The study demonstrates how to use supervised learning to automatically identify SERVQUAL domains and sentiments from patient experiences on a hospital's Facebook page. Malaysian healthcare providers can gather and assess data on patient care via the use of these content analysis technology to improve hospital quality of care.
Collapse
Affiliation(s)
- Afiq Izzudin A. Rahim
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Mohd Ismail Ibrahim
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Sook-Ling Chua
- Faculty of Computing and Informatics, Multimedia University, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| |
Collapse
|
45
|
Kentour M, Lu J. An investigation into the deep learning approach in sentimental analysis using graph-based theories. PLoS One 2021; 16:e0260761. [PMID: 34855856 PMCID: PMC8638889 DOI: 10.1371/journal.pone.0260761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 11/16/2021] [Indexed: 11/24/2022] Open
Abstract
Sentiment analysis is a branch of natural language analytics that aims to correlate what is expressed which comes normally within unstructured format with what is believed and learnt. Several attempts have tried to address this gap (i.e., Naive Bayes, RNN, LSTM, word embedding, etc.), even though the deep learning models achieved high performance, their generative process remains a "black-box" and not fully disclosed due to the high dimensional feature and the non-deterministic weights assignment. Meanwhile, graphs are becoming more popular when modeling complex systems while being traceable and understood. Here, we reveal that a good trade-off transparency and efficiency could be achieved with a Deep Neural Network by exploring the Credit Assignment Paths theory. To this end, we propose a novel algorithm which alleviates the features' extraction mechanism and attributes an importance level of selected neurons by applying a deterministic edge/node embeddings with attention scores on the input unit and backward path respectively. We experiment on the Twitter Health News dataset were the model has been extended to approach different approximations (tweet/aspect and tweets' source levels, frequency, polarity/subjectivity), it was also transparent and traceable. Moreover, results of comparing with four recent models on same data corpus for tweets analysis showed a rapid convergence with an overall accuracy of ≈83% and 94% of correctly identified true positive sentiments. Therefore, weights can be ideally assigned to specific active features by following the proposed method. As opposite to other compared works, the inferred features are conditioned through the users' preferences (i.e., frequency degree) and via the activation's derivatives (i.e., reject feature if not scored). Future direction will address the inductive aspect of graph embeddings to include dynamic graph structures and expand the model resiliency by considering other datasets like SemEval task7, covid-19 tweets, etc.
Collapse
Affiliation(s)
- Mohamed Kentour
- School of Computing and Engineering, University of Huddersfield, Huddersfield, West- Yorkshire, United Kingdom
| | - Joan Lu
- School of Computing and Engineering, University of Huddersfield, Huddersfield, West- Yorkshire, United Kingdom
| |
Collapse
|
46
|
How Does the World View China’s Carbon Policy? A Sentiment Analysis on Twitter Data. ENERGIES 2021. [DOI: 10.3390/en14227782] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
China has recently put forth an ambitious plan to achieve carbon peak around 2030 and carbon neutrality around 2060. However, there are quite a few differences regarding the public views about China’s carbon policy between the Chinese people and the people from other countries, especially concerning the doubt of foreign people about the fidelity of China’s carbon policy goals. Based on Twitter data related to China’s carbon policy topics from 2008 to 2020, this study shows the inter- and intra-annual trends in the count of tweets about China’s carbon policy, conducts sentiment analysis, extracts top frequency words from different attitudes, and analyzes the impact of China’s official Twitter accounts on the global view of China’s carbon policy. Our results show: (1) the global attention to China’s carbon policy gradually rises and occasionally rises suddenly due to important carbon events; (2) the proportion of Twitter users with negative sentiment about China’s carbon policy has increased rapidly and has exceeded the proportion of Twitter users with positive sentiment since 2019; (3) people in developing countries hold more positive or neutral attitudes towards China’s carbon policy, while developed countries hold more negative attitudes; (4) China’s official Twitter accounts serve to improve the global views on China’s carbon policy.
Collapse
|
47
|
Walsh J, Cave J, Griffiths F. Spontaneously Generated Online Patient Experience of Modafinil: A Qualitative and NLP Analysis. Front Digit Health 2021; 3:598431. [PMID: 34713085 PMCID: PMC8521895 DOI: 10.3389/fdgth.2021.598431] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 01/27/2021] [Indexed: 11/16/2022] Open
Abstract
Objective: To compare the findings from a qualitative and a natural language processing (NLP) based analysis of online patient experience posts on patient experience of the effectiveness and impact of the drug Modafinil. Methods: Posts (n = 260) from 5 online social media platforms where posts were publicly available formed the dataset/corpus. Three platforms asked posters to give a numerical rating of Modafinil. Thematic analysis: data was coded and themes generated. Data were categorized into PreModafinil, Acquisition, Dosage, and PostModafinil and compared to identify each poster's own view of whether taking Modafinil was linked to an identifiable outcome. We classified this as positive, mixed, negative, or neutral and compared this with numerical ratings. NLP: Corpus text was speech tagged and keywords and key terms extracted. We identified the following entities: drug names, condition names, symptoms, actions, and side-effects. We searched for simple relationships, collocations, and co-occurrences of entities. To identify causal text, we split the corpus into PreModafinil and PostModafinil and used n-gram analysis. To evaluate sentiment, we calculated the polarity of each post between −1 (negative) and +1 (positive). NLP results were mapped to qualitative results. Results: Posters had used Modafinil for 33 different primary conditions. Eight themes were identified: the reason for taking (condition or symptom), impact of symptoms, acquisition, dosage, side effects, other interventions tried or compared to, effectiveness of Modafinil, and quality of life outcomes. Posters reported perceived effectiveness as follows: 68% positive, 12% mixed, 18% negative. Our classification was consistent with poster ratings. Of the most frequent 100 keywords/keyterms identified by term extraction 88/100 keywords and 84/100 keyterms mapped directly to the eight themes. Seven keyterms indicated negation and temporal states. Sentiment was as follows 72% positive sentiment 4% neutral 24% negative. Matching of sentiment between the qualitative and NLP methods was accurate in 64.2% of posts. If we allow for one category difference matching was accurate in 85% of posts. Conclusions: User generated patient experience is a rich resource for evaluating real world effectiveness, understanding patient perspectives, and identifying research gaps. Both methods successfully identified the entities and topics contained in the posts. In contrast to current evidence, posters with a wide range of other conditions found Modafinil effective. Perceived causality and effectiveness were identified by both methods demonstrating the potential to augment existing knowledge.
Collapse
Affiliation(s)
- Julia Walsh
- Warwick Medical School, University of Warwick, Coventry, United Kingdom
| | - Jonathan Cave
- Department of Economics, University of Warwick, Coventry, United Kingdom
| | - Frances Griffiths
- Warwick Medical School, University of Warwick, Coventry, United Kingdom
| |
Collapse
|
48
|
Rahim AIA, Ibrahim MI, Musa KI, Chua SL, Yaacob NM. Patient Satisfaction and Hospital Quality of Care Evaluation in Malaysia Using SERVQUAL and Facebook. Healthcare (Basel) 2021; 9:1369. [PMID: 34683050 PMCID: PMC8544585 DOI: 10.3390/healthcare9101369] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 09/27/2021] [Accepted: 10/12/2021] [Indexed: 02/05/2023] Open
Abstract
Social media sites, dubbed patient online reviews (POR), have been proposed as new methods for assessing patient satisfaction and monitoring quality of care. However, the unstructured nature of POR data derived from social media creates a number of challenges. The objectives of this research were to identify service quality (SERVQUAL) dimensions automatically from hospital Facebook reviews using a machine learning classifier, and to examine their associations with patient dissatisfaction. From January 2017 to December 2019, empirical research was conducted in which POR were gathered from the official Facebook page of Malaysian public hospitals. To find SERVQUAL dimensions in POR, a machine learning topic classification utilising supervised learning was developed, and this study's objective was established using logistic regression analysis. It was discovered that 73.5% of patients were satisfied with the public hospital service, whereas 26.5% were dissatisfied. SERVQUAL dimensions identified were 13.2% reviews of tangible, 68.9% of reliability, 6.8% of responsiveness, 19.5% of assurance, and 64.3% of empathy. After controlling for hospital variables, all SERVQUAL dimensions except tangible and assurance were shown to be significantly related with patient dissatisfaction (reliability, p < 0.001; responsiveness, p = 0.016; and empathy, p < 0.001). Rural hospitals had a higher probability of patient dissatisfaction (p < 0.001). Therefore, POR, assisted by machine learning technologies, provided a pragmatic and feasible way for capturing patient perceptions of care quality and supplementing conventional patient satisfaction surveys. The findings offer critical information that will assist healthcare authorities in capitalising on POR by monitoring and evaluating the quality of services in real time.
Collapse
Affiliation(s)
- Afiq Izzudin A. Rahim
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Mohd Ismail Ibrahim
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Sook-Ling Chua
- Faculty of Computing and Informatics, Multimedia University, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia;
| | - Najib Majdi Yaacob
- Unit of Biostatistics and Research Methodology, Health Campus, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia;
| |
Collapse
|
49
|
A. Rahim AI, Ibrahim MI, Musa KI, Chua SL, Yaacob NM. Assessing Patient-Perceived Hospital Service Quality and Sentiment in Malaysian Public Hospitals Using Machine Learning and Facebook Reviews. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:9912. [PMID: 34574835 PMCID: PMC8466628 DOI: 10.3390/ijerph18189912] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 09/17/2021] [Accepted: 09/18/2021] [Indexed: 02/05/2023]
Abstract
Social media is emerging as a new avenue for hospitals and patients to solicit input on the quality of care. However, social media data is unstructured and enormous in volume. Moreover, no empirical research on the use of social media data and perceived hospital quality of care based on patient online reviews has been performed in Malaysia. The purpose of this study was to investigate the determinants of positive sentiment expressed in hospital Facebook reviews in Malaysia, as well as the association between hospital accreditation and sentiments expressed in Facebook reviews. From 2017 to 2019, we retrieved comments from 48 official public hospitals' Facebook pages. We used machine learning to build a sentiment analyzer and service quality (SERVQUAL) classifier that automatically classifies the sentiment and SERVQUAL dimensions. We utilized logistic regression analysis to determine our goals. We evaluated a total of 1852 reviews and our machine learning sentiment analyzer detected 72.1% of positive reviews and 27.9% of negative reviews. We classified 240 reviews as tangible, 1257 reviews as trustworthy, 125 reviews as responsive, 356 reviews as assurance, and 1174 reviews as empathy using our machine learning SERVQUAL classifier. After adjusting for hospital characteristics, all SERVQUAL dimensions except Tangible were associated with positive sentiment. However, no significant relationship between hospital accreditation and online sentiment was discovered. Facebook reviews powered by machine learning algorithms provide valuable, real-time data that may be missed by traditional hospital quality assessments. Additionally, online patient reviews offer a hitherto untapped indication of quality that may benefit all healthcare stakeholders. Our results confirm prior studies and support the use of Facebook reviews as an adjunct method for assessing the quality of hospital services in Malaysia.
Collapse
Affiliation(s)
- Afiq Izzudin A. Rahim
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Mohd Ismail Ibrahim
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia; (A.I.A.R.); (K.I.M.)
| | - Sook-Ling Chua
- Faculty of Computing and Informatics, Multimedia University, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia;
| | - Najib Majdi Yaacob
- Units of Biostatistics and Research Methodology, School of Medical Sciences, Health Campus, Universiti Sains Malaysia, Kubang Kerian, Kota Bharu 16150, Kelantan, Malaysia;
| |
Collapse
|
50
|
Žunić A, Corcoran P, Spasić I. Aspect-based sentiment analysis with graph convolution over syntactic dependencies. Artif Intell Med 2021; 119:102138. [PMID: 34531007 DOI: 10.1016/j.artmed.2021.102138] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 06/05/2021] [Accepted: 08/03/2021] [Indexed: 11/18/2022]
Abstract
Aspect-based sentiment analysis is a natural language processing task whose aim is to automatically classify the sentiment associated with a specific aspect of a written text. In this study, we propose a novel model for aspect-based sentiment analysis, which exploits the dependency parse tree of a sentence using graph convolution to classify the sentiment of a given aspect. To evaluate this model in the domain of health and well-being, where this task is biased toward negative sentiment, we used a corpus of drug reviews. Specific aspects were grounded in the Unified Medical Language System, a large repository of inter-related biomedical concepts and the corresponding terminology. Our experiments demonstrated that graph convolution approach outperforms standard deep learning architectures on the task of aspect-based sentiment analysis. Moreover, graph convolution over dependency parse trees (F-score of 0.8179) outperforms the same approach over a flat sequence representation of sentences (F-score of 0.7332). These results bring the performance of sentiment analysis in health and well-being in line with the state of the art in other domains.
Collapse
Affiliation(s)
- Anastazia Žunić
- School of Computer Science & Informatics, Cardiff University, The Parade, Cardiff CF24 3AA, United Kingdom
| | - Padraig Corcoran
- School of Computer Science & Informatics, Cardiff University, The Parade, Cardiff CF24 3AA, United Kingdom
| | - Irena Spasić
- School of Computer Science & Informatics, Cardiff University, The Parade, Cardiff CF24 3AA, United Kingdom.
| |
Collapse
|