Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Okon E, Rachakonda V, Hong HJ, Callison-Burch C, Lipoff JB. Natural language processing of Reddit data to evaluate dermatology patient experiences and therapeutics. J Am Acad Dermatol 2020;83:803-8. [PMID: 31306722 DOI: 10.1016/j.jaad.2019.07.014] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 06/30/2019] [Accepted: 07/03/2019] [Indexed: 11/24/2022]

For:	Okon E, Rachakonda V, Hong HJ, Callison-Burch C, Lipoff JB. Natural language processing of Reddit data to evaluate dermatology patient experiences and therapeutics. J Am Acad Dermatol 2020;83:803-8. [PMID: 31306722 DOI: 10.1016/j.jaad.2019.07.014] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 06/30/2019] [Accepted: 07/03/2019] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Madrid-García A, Merino-Barbancho B, Freites-Núñez D, Rodríguez-Rodríguez L, Menasalvas-Ruíz E, Rodríguez-González A, Peñas A. From Web to RheumaLpack: Creating a Linguistic Corpus for Exploitation and Knowledge Discovery in Rheumatology. Comput Biol Med 2024;179:108920. [PMID: 39047506 DOI: 10.1016/j.compbiomed.2024.108920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/30/2024] [Accepted: 07/16/2024] [Indexed: 07/27/2024]

Abstract

This study introduces RheumaLinguisticpack (RheumaLpack), the first specialised linguistic web corpus designed for the field of musculoskeletal disorders. By combining web mining (i.e., web scraping) and natural language processing (NLP) techniques, as well as clinical expertise, RheumaLpack systematically captures and curates structured and unstructured data across a spectrum of web sources including clinical trials registers (i.e., ClinicalTrials.gov), bibliographic databases (i.e., PubMed), medical agencies (i.e. European Medicines Agency), social media (i.e., Reddit), and accredited health websites (i.e., MedlinePlus, Harvard Health Publishing, and Cleveland Clinic). Given the complexity of rheumatic and musculoskeletal diseases (RMDs) and their significant impact on quality of life, this resource can be proposed as a useful tool to train algorithms that could mitigate the diseases' effects. Therefore, the corpus aims to improve the training of artificial intelligence (AI) algorithms and facilitate knowledge discovery in RMDs. The development of RheumaLpack involved a systematic six-step methodology covering data identification, characterisation, selection, collection, processing, and corpus description. The result is a non-annotated, monolingual, and dynamic corpus, featuring almost 3 million records spanning from 2000 to 2023. RheumaLpack represents a pioneering contribution to rheumatology research, providing a useful resource for the development of advanced AI and NLP applications. This corpus highlights the value of web data to address the challenges posed by musculoskeletal diseases, illustrating the corpus's potential to improve research and treatment paradigms in rheumatology. Finally, the methodology shown can be replicated to obtain data from other medical specialities. The code and details on how to build RheumaLpack are also provided to facilitate the dissemination of such resource.

Collapse

Paganelli A, Spadafora M, Navarrete-Dechent C, Guida S, Pellacani G, Longo C. Natural language processing in dermatology: A systematic literature review and state of the art. J Eur Acad Dermatol Venereol 2024. [PMID: 39150311 DOI: 10.1111/jdv.20286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Accepted: 07/19/2024] [Indexed: 08/17/2024]

Wessel D, Pogrebnyakov N. Using Social Media as a Source of Real-World Data for Pharmaceutical Drug Development and Regulatory Decision Making. Drug Saf 2024;47:495-511. [PMID: 38446405 PMCID: PMC11018692 DOI: 10.1007/s40264-024-01409-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/07/2024] [Indexed: 03/07/2024]

Abstract

INTRODUCTION

While pharmaceutical companies aim to leverage real-world data (RWD) to bridge the gap between clinical drug development and real-world patient outcomes, extant research has mainly focused on the use of social media in a post-approval safety-surveillance setting. Recent regulatory and technological developments indicate that social media may serve as a rich source to expand the evidence base to pre-approval and drug development activities. However, use cases related to drug development have been largely omitted, thereby missing some of the benefits of RWD. In addition, an applied end-to-end understanding of RWD rooted in both industry and regulations is lacking.

OBJECTIVE

We aimed to investigate how social media can be used as a source of RWD to support regulatory decision making and drug development in the pharmaceutical industry. We aimed to specifically explore the data pipeline and examine how social-media derived RWD can align with regulatory guidance from the US Food and Drug Administration and industry needs.

METHODS

A machine learning pipeline was developed to extract patient insights related to anticoagulants from X (Twitter) data. These findings were then analysed from an industry perspective, and complemented by interviews with professionals from a pharmaceutical company.

RESULTS

The analysis reveals several use cases where RWD derived from social media can be beneficial, particularly in generating hypotheses around patient and therapeutic area needs. We also note certain limitations of social media data, particularly around inferring causality.

CONCLUSIONS

Social media display considerable potential as a source of RWD for guiding efforts in pharmaceutical drug development and pre-approval settings. Although further regulatory guidance on the use of social media for RWD is needed to encourage its use, regulatory and technological developments are suggested to warrant at least exploratory uses for drug development.

Collapse

Lindquist EG, West AE. What Do My (Online) Friends Think? A Topic Modeling Approach to Identifying Patterns of Response to Self-Injurious Behaviors on Reddit. Arch Suicide Res 2024;28:537-553. [PMID: 36998237 DOI: 10.1080/13811118.2023.2193594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/01/2023]

Murray C, Mitchell L, Tuke J, Mackay M. Revealing patient-reported experiences in healthcare from social media using thedesign-acquire-process-model-analyse-visualise framework. Digit Health 2024;10:20552076241251715. [PMID: 38757085 PMCID: PMC11097732 DOI: 10.1177/20552076241251715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 04/10/2024] [Indexed: 05/18/2024] Open

Joly-Chevrier M, Aly S, Bahous K, Lefrançois P. Atopic Dermatitis Patient Needs Assessed through the Largest Online Patient Community: A Cross-Sectional Reddit Analysis. J Cutan Med Surg 2024;28:75-77. [PMID: 38229271 PMCID: PMC10908197 DOI: 10.1177/12034754231221988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2024]

Rajalingam K, Levin N, Marques O, Grichnik J, Lin A, Chen WS. Treatment options and emotional well-being in patients with rosacea: An unsupervised machine learning analysis of over 200,000 posts. JAAD Int 2023;13:172-178. [PMID: 37823041 PMCID: PMC10562143 DOI: 10.1016/j.jdin.2023.07.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/06/2023] [Indexed: 10/13/2023] Open

Thornton C, Lanyi K, Wilkins G, Potter R, Hunter E, Kolehmainen N, Pearson F. Scoping the Priorities and Concerns of Parents: Infodemiology Study of Posts on Mumsnet and Reddit. J Med Internet Res 2023;25:e47849. [PMID: 38015600 PMCID: PMC10716753 DOI: 10.2196/47849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 09/18/2023] [Accepted: 09/28/2023] [Indexed: 11/29/2023] Open

Abstract

BACKGROUND

Health technology innovation is increasingly supported by a bottom-up approach to priority setting, aiming to better reflect the concerns of its intended beneficiaries. Web-based forums provide parents with an outlet to share concerns, advice, and information related to parenting and the health and well-being of their children. They provide a rich source of data on parenting concerns and priorities that could inform future child health research and innovation.

OBJECTIVE

The aim of the study is to identify common concerns expressed on 2 major web-based forums and cluster these to identify potential family health concern topics as indicative priority areas for future research and innovation.

METHODS

We text-mined the r/Parenting subreddit (69,846 posts) and the parenting section of Mumsnet (99,848 posts) to create a large corpus of posts. A generative statistical model (latent Dirichlet allocation) was used to identify the most discussed topics in the corpus, and content analysis was applied to identify the parenting concerns found in a subset of posts.

RESULTS

A model with 25 topics produced the highest coherence and a wide range of meaningful parenting concern topics. The most frequently expressed parenting concerns are related to their child's sleep, self-care, eating (and food), behavior, childcare context, and the parental context including parental conflict. Topics directly associated with infants, such as potty training and bottle feeding, were more common on Mumsnet, while parental context and screen time were more common on r/Parenting.

CONCLUSIONS

Latent Dirichlet allocation topic modeling can be applied to gain a rapid, yet meaningful overview of parent concerns expressed on a large and diverse set of social media posts and used to complement traditional insight gathering methods. Parents framed their concerns in terms of children's everyday health concerns, generating topics that overlap significantly with established family health concern topics. We provide evidence of the range of family health concerns found at these sources and hope this can be used to generate material for use alongside traditional insight gathering methods.

Collapse

Fu J, Yang J, Li Q, Huang D, Yang H, Xie X, Xu H, Zhang M, Zheng C. What can we learn from a Chinese social media used by glaucoma patients? BMC Ophthalmol 2023;23:470. [PMID: 37986061 PMCID: PMC10661764 DOI: 10.1186/s12886-023-03208-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 11/07/2023] [Indexed: 11/22/2023] Open

Abstract

PURPOSE

Our study aims to discuss glaucoma patients' needs and Internet habits using big data analysis and Natural Language Processing (NLP) based on deep learning (DL).

METHODS

In this retrospective study, we used web crawler technology to crawl glaucoma-related topic posts from the glaucoma bar of Baidu Tieba, China. According to the contents of topic posts, we classified them into posts with seeking medical advice and without seeking medical advice (social support, expressing emotions, sharing knowledge, and others). Word Cloud and frequency statistics were used to analyze the contents and visualize the keywords of topic posts. Two DL models, Bidirectional Long Short-Term Memory (Bi-LSTM) and Bidirectional Encoder Representations from Transformers (BERT), were trained to identify the posts seeking medical advice. The evaluation matrices included: accuracy, F1 value, and the area under the ROC curve (AUC).

RESULTS

A total of 10,892 topic posts were included, among them, most were seeking medical advice (N = 7071, 64.91%), and seeking advice regarding symptoms or examination (N = 4913, 45.11%) dominated the majority. The following were searching for social support (N = 2362, 21.69%), expressing emotions (N = 497, 4.56%), and sharing knowledge (N = 527, 4.84%) in sequence. The word cloud analysis results showed that ocular pressure, visual field, examination, and operation were the most frequent words. The accuracy, F1 score, and AUC were 0.891, 0.891, and 0.931 for the BERT model, 0.82, 0.821, and 0.890 for the Bi-LSTM model.

CONCLUSION

Social media can help enhance the patient-doctor relationship by providing patients' concerns and cognition about glaucoma in China. NLP can be a powerful tool to reflect patients' focus on diseases. DL models performed well in classifying Chinese medical-related texts, which could play an important role in public health monitoring.

Collapse

Zulbayar S, Mollayeva T, Colantonio A, Chan V, Escobar M. Integrating unsupervised and supervised learning techniques to predict traumatic brain injury: A population-based study. INTELLIGENCE-BASED MEDICINE 2023;8:100118. [PMID: 38222038 PMCID: PMC10785655 DOI: 10.1016/j.ibmed.2023.100118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]

Abstract

This work aimed to identify pre-existing health conditions of patients with traumatic brain injury (TBI) and develop predictive models for the first TBI event and its external causes by employing a combination of unsupervised and supervised learning algorithms. We acquired up to five years of pre-injury diagnoses for 488,107 patients with TBI and 488,107 matched control patients who entered the emergency department or acute care hospitals between April 1st, 2002, and March 31st, 2020. Diagnoses were obtained from the Ontario Health Insurance Plan (OHIP) database which contains province-wide claims data by physicians in Ontario, Canada for inpatient and outpatient services. A screening process was conducted on the OHIP diagnostic codes to limit the subsequent analysis to codes that were predictive of TBI, which concluded that 314 codes were significantly associated with TBI. The Latent Dirichlet Allocation (LDA) model was applied to the diagnostic codes and generated an optimal number of 19 topics that concur with published literature but also suggest other unexplored areas. Estimated word-topic probabilities from the LDA model helped us detect pre-morbid conditions among patients with TBI by uncovering the underlying patterns of diagnoses, meanwhile estimated document-topic probabilities were utilized in variable creation as form of a dimension reduction. We created 19 topic scores for each patient in the cohort which were utilized along with socio-demographic factors for Random Forest binary classifier models. Test set performances evaluated using area under the receiver operating characteristic curve (AUC) were: TBI event (AUC = 0.85), external cause of injury: falls (AUC = 0.85), struck by/against (AUC = 0.83), cyclist collision (AUC = 0.76), motor vehicle collision (AUC = 0.83). Our analysis successfully demonstrated the feasibility of using machine learning to predict TBI due to various external causes and identified the most important factors that contribute to this prediction.

Collapse

Affiliation(s)

Suvd Zulbayar Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada Institute of Health and Policy, Management and Evaluation, University of Toronto, M5T 3M6, Canada
Tatyana Mollayeva Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON M5G 1V7, Canada Acquired Brain Injury Research Lab, Department of Occupational Science and Occupational Therapy, University of Toronto, Toronto, ON M5G 1V7, Canada KITE Research Institute, Toronto Rehabilitation Institute-University Health Network, Toronto, ON M5G 2A2, Canada
Angela Colantonio Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON M5G 1V7, Canada Acquired Brain Injury Research Lab, Department of Occupational Science and Occupational Therapy, University of Toronto, Toronto, ON M5G 1V7, Canada KITE Research Institute, Toronto Rehabilitation Institute-University Health Network, Toronto, ON M5G 2A2, Canada Institute of Health and Policy, Management and Evaluation, University of Toronto, M5T 3M6, Canada ICES, Toronto, ON, M4N 3M5, Canada
Vincy Chan Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON M5G 1V7, Canada Acquired Brain Injury Research Lab, Department of Occupational Science and Occupational Therapy, University of Toronto, Toronto, ON M5G 1V7, Canada KITE Research Institute, Toronto Rehabilitation Institute-University Health Network, Toronto, ON M5G 2A2, Canada Institute of Health and Policy, Management and Evaluation, University of Toronto, M5T 3M6, Canada
Michael Escobar Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada

Collapse

Omiye JA, Gui H, Daneshjou R, Cai ZR, Muralidharan V. Principles, applications, and future of artificial intelligence in dermatology. Front Med (Lausanne) 2023;10:1278232. [PMID: 37901399 PMCID: PMC10602645 DOI: 10.3389/fmed.2023.1278232] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 09/27/2023] [Indexed: 10/31/2023] Open

Cummins JA, Zhou G, Nambudiri VE. Natural Language Processing for Large-Scale Analysis of Eczema and Psoriasis Social Media Comments. JID INNOVATIONS 2023;3:100210. [PMID: 37564106 PMCID: PMC10410170 DOI: 10.1016/j.xjidi.2023.100210] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 02/19/2023] [Accepted: 02/27/2023] [Indexed: 08/12/2023] Open

Emanuel RHK, Docherty PD, Lunt H, Campbell RE. Comparing Literature- and Subreddit-Derived Laboratory Values in Polycystic Ovary Syndrome (PCOS): Validation of Clinical Data Posted on PCOS Reddit Forums. JMIR Form Res 2023;7:e44810. [PMID: 37624626 PMCID: PMC10492173 DOI: 10.2196/44810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 05/16/2023] [Accepted: 05/24/2023] [Indexed: 08/26/2023] Open

Abstract

BACKGROUND

Polycystic ovary syndrome (PCOS) is a heterogeneous condition that affects 4% to 21% of people with ovaries. Inaccessibility or dissatisfaction with clinical treatment for PCOS has led to some individuals with the condition discussing their experiences in specialized web-based forums.

OBJECTIVE

This study explores the feasibility of using such web-based forums for clinical research purposes by gathering and analyzing laboratory test results posted in an active PCOS forum, specifically the PCOS subreddit hosted on Reddit.

METHODS

We gathered around 45,000 posts from the PCOS subreddit. A random subset of 5000 posts was manually read, and the presence of laboratory test results was labeled. These labeled posts were used to train a machine learning model to identify which of the remaining posts contained laboratory results. The laboratory results were extracted manually from the identified posts. These self-reported laboratory test results were compared with values in the published literature to assess whether the results were concordant with researcher-published values for PCOS cohorts. A total of 10 papers were chosen to represent published PCOS literature, with selection criteria including the Rotterdam diagnostic criteria for PCOS, a publication date within the last 20 years, and at least 50 participants with PCOS.

RESULTS

Overall, the general trends observed in the laboratory test results from the PCOS web-based forum were consistent with clinically reported PCOS. A number of results, such as follicle stimulating hormone, fasting insulin, and anti-Mullerian hormone, were concordant with published values for patients with PCOS. The high consistency of these results among the literature and when compared to the subreddit suggests that follicle stimulating hormone, fasting insulin, and anti-Mullerian hormone are more consistent across PCOS phenotypes than other test results. Some results, such as testosterone, sex hormone-binding globulin, and homeostasis model assessment-estimated insulin resistance index, were between those of PCOS literature values and normal values, as defined by clinical testing limits. Interestingly, other results, including dehydroepiandrosterone sulfate, luteinizing hormone, and fasting glucose, appeared to be slightly more dysregulated than those reported in the literature.

CONCLUSIONS

The differences between the forum-posted results and those published in the literature may be due to the selection process in clinical studies and the possibility that the forum disproportionally describes PCOS phenotypes that are less likely to be alleviated with medical intervention. However, the degree of concordance in most laboratory test values implied that the PCOS web-based forum participants were representative of research-identified PCOS cohorts. This validation of the PCOS subreddit grants the possibility for more research into the contents of the subreddit and the idea of undertaking similar research using the contents of other medical internet forums.

Collapse

Laureate CDP, Buntine W, Linger H. A systematic review of the use of topic models for short text social media analysis. Artif Intell Rev 2023:1-33. [PMID: 37362887 PMCID: PMC10150353 DOI: 10.1007/s10462-023-10471-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2023] [Indexed: 06/28/2023]

El-Jack K, Henderson K, Andy AU, Southwick L. Reddit Users' Questions and Concerns about Anesthesia. INTERNATIONAL JOURNAL OF MEDICAL STUDENTS 2023. [DOI: 10.5195/ijms.2022.1687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023] Open

Chen Z, Zeng G, Zhong S, Wang L. From the exotic to the everyday: The Avocado crossing borders via cyberspace. Appetite 2023;180:106362. [PMID: 36368563 DOI: 10.1016/j.appet.2022.106362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Revised: 10/28/2022] [Accepted: 10/29/2022] [Indexed: 11/09/2022]

Fu J, Yang J, Li Q, Huang D, Yang H, Xie X, Xu H, Zhang M, Zheng C. Glaucoma-related posts from a Chinese social media: An exploratory study.. [DOI: 10.21203/rs.3.rs-2312218/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]

Joly-Chevrier M, Aly S, Lefrançois P. Comparison of Basal Cell Carcinoma Posts, Comments and Authors Between Reddit and Quora Forums. J Cutan Med Surg 2022;26:634-635. [PMID: 36200886 DOI: 10.1177/12034754221129872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Rajagopalan D, Thomas J, Ring D, Fatehi A. Quantitative Patient-Reported Experience Measures Derived From Natural Language Processing Have a Normal Distribution and No Ceiling Effect. Qual Manag Health Care 2022;31:210-218. [PMID: 35383720 DOI: 10.1097/qmh.0000000000000355] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Das N, Sadhukhan B, Chatterjee T, Chakrabarti S. Effect of public sentiment on stock market movement prediction during the COVID-19 outbreak. SOCIAL NETWORK ANALYSIS AND MINING 2022;12:92. [PMID: 35911484 PMCID: PMC9325657 DOI: 10.1007/s13278-022-00919-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 06/27/2022] [Accepted: 07/04/2022] [Indexed: 11/28/2022]

Dave AD, Zhu D. Ophthalmology Inquiries on Reddit: What Should Physicians Know? Clin Ophthalmol 2022;16:2923-2931. [PMID: 36071726 PMCID: PMC9441589 DOI: 10.2147/opth.s375822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 08/24/2022] [Indexed: 11/23/2022] Open

Abstract

Purpose

Patients are seeking answers to personal medical questions on social media. Reddit, a popular social media site, has been overlooked as a source of data in the field of ophthalmology. We analyzed posts in the subreddit, r/Ophthalmology, to better understand the most common ophthalmic conditions patients are discussing online and how often those making posts are being advised to seek professional medical care.

Patients and Methods

This cross-sectional study analyzed posts and comments from March 18, 2018 to November 9, 2020. All posts and comments on r/Ophthalmology are public and were accessed using the Python Reddit API Wrapper. This text was analyzed for unique references to common ophthalmic conditions and for mentions and recommendations to different types of medical care.

Results

Nine hundred and nineteen posts were collected from the above timeframe. An auto-moderator makes a comment on every post to discourage patients from asking personal medical questions. Despite this, over two-thirds of posts discussed medical treatment for eye conditions in either a post, comment, or both. Almost half of all posts mention “ophthalmology”, but only 9% go as far as to recommend ophthalmic care. One-third of posts have no reference to medical care. Within posts, flashes and floaters were the most common condition mentioned, making up 15% of unique mentions, followed by glaucoma (7.4%) and retinal detachments (6.7%). Within comments, cataracts were most commonly discussed, making up 12% of unique mentions, followed by glaucoma (8.9%).

Conclusion

These findings show that patients are seeking information about their eye health on the r/Ophthalmology subreddit and that Reddit users are engaging with these types of posts, instead of recommending professional care in most cases. It is important for ophthalmologists to recognize the most common conditions patients are asking about online and learn how they can do a better job of educating their patients in the office.

Collapse

Barrutia L, Vega-Gutiérrez J, Santamarina-Albertos A. Benefits, drawbacks, and challenges of social media use in dermatology: A systematic review. J DERMATOL TREAT 2022;33:2738-2757. [PMID: 35506617 DOI: 10.1080/09546634.2022.2069661] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Ahmed F, Lipoff JB. The role of dermatologists in social media: exploring the benefits and risks. DER HAUTARZT 2022;73:401-404. [PMID: 35133443 PMCID: PMC8830428 DOI: 10.1007/s00105-022-04946-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 01/07/2022] [Indexed: 11/28/2022]

Markides BR, Laws R, Hesketh K, Maddison R, Denney‐Wilson E, Campbell KJ. A thematic cluster analysis of parents' online discussions about fussy eating. MATERNAL & CHILD NUTRITION 2022;18:e13316. [PMID: 35132813 PMCID: PMC8932712 DOI: 10.1111/mcn.13316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Revised: 11/21/2021] [Accepted: 12/14/2021] [Indexed: 11/30/2022]

Abstract

Food fussiness is associated with non‐responsive parent feeding practices, such as persuasive and instrumental feeding. Although most children described as ‘fussy eaters’ are likely exhibiting developmentally typical behaviours, up to half of the parents of children 2–5 years old express concerns. Concern for fussy eating may mediate the use of non‐responsive feeding practices and so must be addressed in parent feeding interventions. Therefore, it is critical to better understand parents' concerns and how they may relate to feeding practices. This study aimed to explore how parents' feeding practices and the social cognitive factors that may drive them clustered based on parents' concern for fussy eating. Data were collected from parent discussions of fussy eating on a Reddit forum (80,366 posts). Latent Dirichlet allocation was used to identify discussions of fussy eating. Relevant posts (1542) made by users who identified as a parent of a fussy eater (n = 630) underwent qualitative coding and thematic analysis. Five clusters of parents were identified, ranging in size from 53 to 189 users. These were primarily characterised by parents' degree of concern and feeding practices: (1) High concern, nonresponsive; (2) Concerned, nonresponsive; (3) Low concern, responsive; (4) Low concern, mixed strategies; (5) Low concern, indulgent. Parents who used responsive practices tended to be less concerned for fussy eating, have greater trust in their child's ability to self‐regulate hunger, have longer‐term feeding goals, and exhibit greater ability for personal self‐regulation. Future research should further examine how these constructs may be leveraged in parent feeding interventions.

Parents using nonresponsive practices to manage fussy eating may have more concerns about fussy eating, less trust in their child's ability to self‐regulate hunger, shorter‐term feeding goals, and less developed self‐regulation skills.

As most children described as “fussy eaters” are likely exhibiting developmentally typical behaviours, interventions addressing nutrition and feeding practices in early childhood should aim to alleviate parental concern for fussy eating, for example, by helping them recognise developmentally normal eating behaviours.

Early childhood feeding interventions may also be enhanced by helping parents to develop long‐term feeding goals and improved self‐regulation skills in the context of their feeding practices.

Collapse

Persistence and Attrition among Participants in a Multi-Page Online Survey Recruited via Reddit’s Social Media Network. SOCIAL SCIENCES 2022. [DOI: 10.3390/socsci11020031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Natural Language Processing: A Window to Understanding Skincare Trends. Int J Med Inform 2022;160:104705. [DOI: 10.1016/j.ijmedinf.2022.104705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 01/09/2022] [Accepted: 01/20/2022] [Indexed: 11/21/2022]

Sager MA, Kashyap AM, Tamminga M, Ravoori S, Callison-Burch C, Lipoff JB. Identifying and Responding to Health Misinformation on Reddit Dermatology Forums With Artificially Intelligent Bots Using Natural Language Processing: Design and Evaluation Study. JMIR DERMATOLOGY 2021;4:e20975. [PMID: 37632809 PMCID: PMC10334965 DOI: 10.2196/20975] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 07/08/2021] [Accepted: 08/05/2021] [Indexed: 11/13/2022] Open

Wu W, Lyu H, Luo J. Characterizing Discourse about COVID-19 Vaccines: A Reddit Version of the Pandemic Story. HEALTH DATA SCIENCE 2021;2021:9837856. [PMID: 36405359 PMCID: PMC9629685 DOI: 10.34133/2021/9837856] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 06/28/2021] [Indexed: 12/18/2022]

Classifying patient and professional voice in social media health posts. BMC Med Inform Decis Mak 2021;21:244. [PMID: 34407807 PMCID: PMC8371035 DOI: 10.1186/s12911-021-01577-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/06/2021] [Indexed: 11/10/2022] Open

Abstract

Background

Patient-based analysis of social media is a growing research field with the aim of delivering precision medicine but it requires accurate classification of posts relating to patients’ experiences. We motivate the need for this type of classification as a pre-processing step for further analysis of social media data in the context of related work in this area. In this paper we present experiments for a three-way document classification by patient voice, professional voice or other. We present results for a convolutional neural network classifier trained on English data from two different data sources (Reddit and Twitter) and two domains (cardiovascular and skin diseases).

Results

We found that document classification by patient voice, professional voice or other can be done consistently manually (0.92 accuracy). Annotators agreed roughly equally for each domain (cardiovascular and skin) but they agreed more when annotating Reddit posts compared to Twitter posts. Best classification performance was obtained when training two separate classifiers for each data source, one for Reddit and one for Twitter posts, when evaluating on in-source test data for both test sets combined with an overall accuracy of 0.95 (and macro-average F1 of 0.92) and an F1-score of 0.95 for patient voice only.

Conclusion

The main conclusion resulting from this work is that combining social media data from platforms with different characteristics for training a patient and professional voice classifier does not result in best possible performance. We showed that it is best to train separate models per data source (Reddit and Twitter) instead of a model using the combined training data from both sources. We also found that it is preferable to train separate models per domain (cardiovascular and skin) while showing that the difference to the combined model is only minor (0.01 accuracy). Our highest overall F1-score (0.95) obtained for classifying posts as patient voice is a very good starting point for further analysis of social media data reflecting the experience of patients.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12911-021-01577-9.

Collapse

Fairie P, Zhang Z, D'Souza AG, Walsh T, Quan H, Santana MJ. Categorising patient concerns using natural language processing techniques. BMJ Health Care Inform 2021;28:e100274. [PMID: 34193519 PMCID: PMC8246286 DOI: 10.1136/bmjhci-2020-100274] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 05/20/2021] [Indexed: 11/23/2022] Open

Abstract

OBJECTIVES

Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-modelling approach to analyse a corpus of patient feedback.

METHODS

Patient concerns were received by Alberta Health Services between 2011 and 2018 (n=76 163), regarding 806 care facilities in 163 municipalities, including hospitals, clinics, community care centres and retirement homes, in a province of 4.4 million. Their existing framework requires manual labelling of pre-defined categories. We applied an automated latent Dirichlet allocation (LDA)-based topic modelling algorithm to identify the topics present in these concerns, and thereby produce a framework-free categorisation.

RESULTS

The LDA model produced 40 topics which, following manual interpretation by researchers, were reduced to 28 coherent topics. The most frequent topics identified were communication issues causing delays (frequency: 10.58%), community care for elderly patients (8.82%), interactions with nurses (8.80%) and emergency department care (7.52%). Many patient concerns were categorised into multiple topics. Some were more specific versions of categories from the existing framework (eg, communication issues causing delays), while others were novel (eg, smoking in inappropriate settings).

DISCUSSION

LDA-generated topics were more nuanced than the manually labelled categories. For example, LDA found that concerns with community care were related to concerns about nursing for seniors, providing opportunities for insight and action.

CONCLUSION

Our findings outline the range of concerns patients share in a large health system and demonstrate the usefulness of using LDA to identify categories of patient concerns.

Collapse

Yu K, Syed MN, Bernardis E, Gelfand JM. Machine Learning Applications in the Evaluation and Management of Psoriasis: A Systematic Review. ACTA ACUST UNITED AC 2021;5:147-159. [PMID: 33733038 PMCID: PMC7963214 DOI: 10.1177/2475530320950267] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Eapen BR. Artificial Intelligence in Dermatology: A Practical Introduction to a Paradigm Shift. Indian Dermatol Online J 2020;11:881-889. [PMID: 33344334 PMCID: PMC7735013 DOI: 10.4103/idoj.idoj_388_20] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Revised: 07/27/2020] [Accepted: 09/13/2020] [Indexed: 12/13/2022] Open

Jelodar H, Wang Y, Orji R, Huang S. Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach. IEEE J Biomed Health Inform 2020;24:2733-2742. [PMID: 32750931 DOI: 10.1101/2020.04.22.054973] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Chan S, Reddy V, Myers B, Thibodeaux Q, Brownstone N, Liao W. Machine Learning in Dermatology: Current Applications, Opportunities, and Limitations. Dermatol Ther (Heidelb) 2020;10:365-386. [PMID: 32253623 PMCID: PMC7211783 DOI: 10.1007/s13555-020-00372-0] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Indexed: 12/14/2022] Open

Xu C, Yang H, Sun L, Cao X, Hou Y, Cai Q, Jia P, Wang Y. Detecting Lung Cancer Trends by Leveraging Real-World and Internet-Based Data: Infodemiology Study. J Med Internet Res 2020;22:e16184. [PMID: 32163035 PMCID: PMC7099398 DOI: 10.2196/16184] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Revised: 10/28/2019] [Accepted: 02/22/2020] [Indexed: 01/13/2023] Open

Abstract

Background

Internet search data on health-related terms can reflect people’s concerns about their health status in near real time, and hence serve as a supplementary metric of disease characteristics. However, studies using internet search data to monitor and predict chronic diseases at a geographically finer state-level scale are sparse.

Objective

The aim of this study was to explore the associations of internet search volumes for lung cancer with published cancer incidence and mortality data in the United States.

Methods

We used Google relative search volumes, which represent the search frequency of specific search terms in Google. We performed cross-sectional analyses of the original and disease metrics at both national and state levels. A smoothed time series of relative search volumes was created to eliminate the effects of irregular changes on the search frequencies and obtain the long-term trends of search volumes for lung cancer at both the national and state levels. We also performed analyses of decomposed Google relative search volume data and disease metrics at the national and state levels.

Results

The monthly trends of lung cancer-related internet hits were consistent with the trends of reported lung cancer rates at the national level. Ohio had the highest frequency for lung cancer-related search terms. At the state level, the relative search volume was significantly correlated with lung cancer incidence rates in 42 states, with correlation coefficients ranging from 0.58 in Virginia to 0.94 in Oregon. Relative search volume was also significantly correlated with mortality in 47 states, with correlation coefficients ranging from 0.58 in Oklahoma to 0.94 in North Carolina. Both the incidence and mortality rates of lung cancer were correlated with decomposed relative search volumes in all states excluding Vermont.

Conclusions

Internet search behaviors could reflect public awareness of lung cancer. Research on internet search behaviors could be a novel and timely approach to monitor and estimate the prevalence, incidence, and mortality rates of a broader range of cancers and even more health issues.

Collapse

Yousaf A, Hagen R, Delaney E, Davis S, Zinn Z. The influence of social media on acne treatment: A cross-sectional survey. Pediatr Dermatol 2020;37:301-304. [PMID: 31944359 PMCID: PMC7453954 DOI: 10.1111/pde.14091] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Du C, Lee W, Moskowitz D, Lucioni A, Kobashi KC, Lee UJ. I leaked, then I Reddit: experiences and insight shared on urinary incontinence by Reddit users. Int Urogynecol J 2019;31:243-248. [DOI: 10.1007/s00192-019-04165-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 10/21/2019] [Indexed: 10/25/2022]