1
|
Madrid-García A, Merino-Barbancho B, Freites-Núñez D, Rodríguez-Rodríguez L, Menasalvas-Ruíz E, Rodríguez-González A, Peñas A. From Web to RheumaLpack: Creating a Linguistic Corpus for Exploitation and Knowledge Discovery in Rheumatology. Comput Biol Med 2024; 179:108920. [PMID: 39047506 DOI: 10.1016/j.compbiomed.2024.108920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/30/2024] [Accepted: 07/16/2024] [Indexed: 07/27/2024]
Abstract
This study introduces RheumaLinguisticpack (RheumaLpack), the first specialised linguistic web corpus designed for the field of musculoskeletal disorders. By combining web mining (i.e., web scraping) and natural language processing (NLP) techniques, as well as clinical expertise, RheumaLpack systematically captures and curates structured and unstructured data across a spectrum of web sources including clinical trials registers (i.e., ClinicalTrials.gov), bibliographic databases (i.e., PubMed), medical agencies (i.e. European Medicines Agency), social media (i.e., Reddit), and accredited health websites (i.e., MedlinePlus, Harvard Health Publishing, and Cleveland Clinic). Given the complexity of rheumatic and musculoskeletal diseases (RMDs) and their significant impact on quality of life, this resource can be proposed as a useful tool to train algorithms that could mitigate the diseases' effects. Therefore, the corpus aims to improve the training of artificial intelligence (AI) algorithms and facilitate knowledge discovery in RMDs. The development of RheumaLpack involved a systematic six-step methodology covering data identification, characterisation, selection, collection, processing, and corpus description. The result is a non-annotated, monolingual, and dynamic corpus, featuring almost 3 million records spanning from 2000 to 2023. RheumaLpack represents a pioneering contribution to rheumatology research, providing a useful resource for the development of advanced AI and NLP applications. This corpus highlights the value of web data to address the challenges posed by musculoskeletal diseases, illustrating the corpus's potential to improve research and treatment paradigms in rheumatology. Finally, the methodology shown can be replicated to obtain data from other medical specialities. The code and details on how to build RheumaLpack are also provided to facilitate the dissemination of such resource.
Collapse
Affiliation(s)
- Alfredo Madrid-García
- Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, Madrid, 28040, Spain.
| | - Beatriz Merino-Barbancho
- Escuela Técnica Superior de Ingenieros de Telecomunicación Universidad Politécnica de Madrid, Avenida Complutense, 30, Madrid, 28040, Spain
| | - Dalifer Freites-Núñez
- Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, Madrid, 28040, Spain
| | - Luis Rodríguez-Rodríguez
- Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, Madrid, 28040, Spain
| | - Ernestina Menasalvas-Ruíz
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, 28223, Spain; Escuela Técnica Superior de Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, Madrid, 28660, Spain
| | - Alejandro Rodríguez-González
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, 28223, Spain; Escuela Técnica Superior de Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, Madrid, 28660, Spain
| | - Anselmo Peñas
- UNED NLP & IR Group Universidad Nacional de Educación a Distancia, Juan del Rosal 16, 28040, Madrid, Spain
| |
Collapse
|
2
|
Paganelli A, Spadafora M, Navarrete-Dechent C, Guida S, Pellacani G, Longo C. Natural language processing in dermatology: A systematic literature review and state of the art. J Eur Acad Dermatol Venereol 2024. [PMID: 39150311 DOI: 10.1111/jdv.20286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Accepted: 07/19/2024] [Indexed: 08/17/2024]
Abstract
BACKGROUND Natural Language Processing (NLP) is a field of both computational linguistics and artificial intelligence (AI) dedicated to analysis and interpretation of human language. OBJECTIVES This systematic review aims at exploring all the possible applications of NLP techniques in the dermatological setting. METHODS Extensive search on 'natural language processing' and 'dermatology' was performed on MEDLINE and Scopus electronic databases. Only journal articles with full text electronically available and English translation were considered. The PICO (Population, Intervention or exposure, Comparison, Outcome) algorithm was applied to our study protocol. RESULTS Natural Language Processing (NLP) techniques have been utilized across various dermatological domains, including atopic dermatitis, acne/rosacea, skin infections, non-melanoma skin cancers (NMSCs), melanoma and skincare. There is versatility of NLP in data extraction from diverse sources such as electronic health records (EHRs), social media platforms and online forums. We found extensive utilization of NLP techniques across diverse dermatological domains, showcasing its potential in extracting valuable insights from various sources and informing diagnosis, treatment optimization, patient preferences and unmet needs in dermatological research and clinical practice. CONCLUSIONS While NLP shows promise in enhancing dermatological research and clinical practice, challenges such as data quality, ambiguity, lack of standardization and privacy concerns necessitate careful consideration. Collaborative efforts between dermatologists, data scientists and ethicists are essential for addressing these challenges and maximizing the potential of NLP in dermatology.
Collapse
Affiliation(s)
- Alessia Paganelli
- Dermatology Unit, Azienda Unità Sanitaria Locale-IRCCS di Reggio Emilia, Reggio Emilia, Italy
| | - Marco Spadafora
- University of Modena and Reggio Emilia, Modena, Italy
- Azienda Unità Sanitaria Locale-IRCCS di Reggio Emilia, Skin Cancer Center, Reggio Emilia, Italy
| | - Cristian Navarrete-Dechent
- Melanoma and Skin Cancer Unit, Department of Dermatology, Escuela de Medicina, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Stefania Guida
- Dermatology Clinic, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Giovanni Pellacani
- Department of Clinical Internal, Anesthesiological and Cardiovascular Sciences, Dermatology Clinic, Sapienza University of Rome, Rome, Italy
| | - Caterina Longo
- University of Modena and Reggio Emilia, Modena, Italy
- Azienda Unità Sanitaria Locale-IRCCS di Reggio Emilia, Skin Cancer Center, Reggio Emilia, Italy
| |
Collapse
|
3
|
Wessel D, Pogrebnyakov N. Using Social Media as a Source of Real-World Data for Pharmaceutical Drug Development and Regulatory Decision Making. Drug Saf 2024; 47:495-511. [PMID: 38446405 PMCID: PMC11018692 DOI: 10.1007/s40264-024-01409-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/07/2024] [Indexed: 03/07/2024]
Abstract
INTRODUCTION While pharmaceutical companies aim to leverage real-world data (RWD) to bridge the gap between clinical drug development and real-world patient outcomes, extant research has mainly focused on the use of social media in a post-approval safety-surveillance setting. Recent regulatory and technological developments indicate that social media may serve as a rich source to expand the evidence base to pre-approval and drug development activities. However, use cases related to drug development have been largely omitted, thereby missing some of the benefits of RWD. In addition, an applied end-to-end understanding of RWD rooted in both industry and regulations is lacking. OBJECTIVE We aimed to investigate how social media can be used as a source of RWD to support regulatory decision making and drug development in the pharmaceutical industry. We aimed to specifically explore the data pipeline and examine how social-media derived RWD can align with regulatory guidance from the US Food and Drug Administration and industry needs. METHODS A machine learning pipeline was developed to extract patient insights related to anticoagulants from X (Twitter) data. These findings were then analysed from an industry perspective, and complemented by interviews with professionals from a pharmaceutical company. RESULTS The analysis reveals several use cases where RWD derived from social media can be beneficial, particularly in generating hypotheses around patient and therapeutic area needs. We also note certain limitations of social media data, particularly around inferring causality. CONCLUSIONS Social media display considerable potential as a source of RWD for guiding efforts in pharmaceutical drug development and pre-approval settings. Although further regulatory guidance on the use of social media for RWD is needed to encourage its use, regulatory and technological developments are suggested to warrant at least exploratory uses for drug development.
Collapse
Affiliation(s)
- Didrik Wessel
- Copenhagen Business School, Frederiksberg, Denmark.
- , Nørrebrogade 18A 3TH, 2200, Copenhagen N, Denmark.
| | | |
Collapse
|
4
|
Lindquist EG, West AE. What Do My (Online) Friends Think? A Topic Modeling Approach to Identifying Patterns of Response to Self-Injurious Behaviors on Reddit. Arch Suicide Res 2024; 28:537-553. [PMID: 36998237 DOI: 10.1080/13811118.2023.2193594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/01/2023]
Abstract
OBJECTIVE Approximately 17% of adolescents and young adults will engage in non-suicidal self-injury (NSSI) at least once in their lifetime, leading the World Health Organization to identify self-injury as one of the top five public health concerns for adolescents. Despite the widespread prevalence of this behavior, NSSI continues to be heavily stigmatized in both medical and community settings, deterring many engaged in NSSI from seeking informal support from friends and family as well as formal psychological or psychiatric treatment. In contrast to the low rates of in-person help-seeking for NSSI, online support groups are highly utilized by those engaged in NSSI. Thus, an empirical study of responses to frequent, voluntary disclosure of NSSI on social media is needed to better understand how these communities meet the needs of those who self-injure. METHOD The current project used latent Dirichlet allocation to identify frequent and favored themes in response to self-injury content in the largest self-injury group on Reddit (over 100,000 members). Reddit, the 9th most visited website in the world, is a chat-based social media platform that has 430+ million active users and billions of site visits, with current estimates suggesting that ∼63% of the U.S. population are Reddit users. RESULTS Identified themes included: (1) recovery encouragement; (2) provision of social and instrumental support; and (3) daily realities of living with NSSI. Responses that encouraged recovery received more upvotes on Reddit than any other type of comment. CONCLUSION These results can inform evidence-based, person-centered, dimensional treatments for NSSI.
Collapse
|
5
|
Murray C, Mitchell L, Tuke J, Mackay M. Revealing patient-reported experiences in healthcare from social media using thedesign-acquire-process-model-analyse-visualise framework. Digit Health 2024; 10:20552076241251715. [PMID: 38757085 PMCID: PMC11097732 DOI: 10.1177/20552076241251715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 04/10/2024] [Indexed: 05/18/2024] Open
Abstract
Understanding patient experience in healthcare is increasingly important and desired by medical professionals in a patient-centred care approach. Healthcare discourse on social media presents an opportunity to gain a unique perspective on patient-reported experiences, complementing traditional survey data. These social media reports often appear as first-hand accounts of patients' journeys through the healthcare system, whose details extend beyond the confines of structured surveys and at a far larger scale than focus groups. However, in contrast with the vast presence of patient-experience data on social media and the potential benefits the data offers, it attracts comparatively little research attention due to the technical proficiency required for text analysis. In this article, we introduce the design-acquire-process-model-analyse-visualise framework to provide an overview of techniques and an approach to capture patient-reported experiences from social media data. We apply this framework in a case study on prostate cancer data from /r/ProstateCancer, demonstrate the framework's value in capturing specific aspects of patient concern (such as sexual dysfunction), provide an overview of the discourse, and show narrative and emotional progression through these stories. We anticipate this framework to apply to a wide variety of areas in healthcare, including capturing and differentiating experiences across minority groups, geographic boundaries, and types of illnesses.
Collapse
Affiliation(s)
- Curtis Murray
- School of Mathematical Sciences, The University of Adelaide, Adelaide, SA, Australia
| | - Lewis Mitchell
- School of Mathematical Sciences, The University of Adelaide, Adelaide, SA, Australia
| | - Jonathan Tuke
- School of Mathematical Sciences, The University of Adelaide, Adelaide, SA, Australia
| | - Mark Mackay
- College of Public Heath, Medical and Veterinary Scienc, James Cook University, Townsville, QLD, Australia
| |
Collapse
|
6
|
Joly-Chevrier M, Aly S, Bahous K, Lefrançois P. Atopic Dermatitis Patient Needs Assessed through the Largest Online Patient Community: A Cross-Sectional Reddit Analysis. J Cutan Med Surg 2024; 28:75-77. [PMID: 38229271 PMCID: PMC10908197 DOI: 10.1177/12034754231221988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2024]
Affiliation(s)
| | - Safin Aly
- Faculty of Medicine, Université de Montréal, Montréal, QC, Canada
| | - Kaoutar Bahous
- Department of Computer and Software Engineering, Polytechnique Montréal, Montréal, QC, Canada
| | - Philippe Lefrançois
- Division of Dermatology, Department of Medicine, McGill University, Montréal, QC, Canada
- Division of Dermatology, Department of Medicine, Jewish General Hospital, Montréal, QC, Canada
- Lady Davis Institute for Medical Research, Montréal, QC, Canada
| |
Collapse
|
7
|
Rajalingam K, Levin N, Marques O, Grichnik J, Lin A, Chen WS. Treatment options and emotional well-being in patients with rosacea: An unsupervised machine learning analysis of over 200,000 posts. JAAD Int 2023; 13:172-178. [PMID: 37823041 PMCID: PMC10562143 DOI: 10.1016/j.jdin.2023.07.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/06/2023] [Indexed: 10/13/2023] Open
Abstract
Background Many patients with rosacea join online support groups to gather and disseminate information about disease management and provide emotional support for others. Objective To better understand rosacea patient's primary concerns for the disease as well as their disease search patterns online. Methods Overall, 207,038 posts by 41,400 users were collected from June 1, 2017, to June 1, 2022, in a popular online forum. We applied Latent Dirichlet Allocation (LDA), an unsupervised machine learning model, to organize the posts into topics. Keywords for each topic supplied by LDA were used to manually assign topic and category labels. Results Twenty-three significant topics of conversation were identified and organized into 4 major categories, including Management (50.33%), Clinical Presentation (24.14%), Emotion (21.97%), and Information Appraisal (3.57%). Limitations Although we analyzed the largest forum on the internet for rosacea, generalizability is limited given the presence of other smaller forums and the skewed demographics of forum users. Conclusion Social media forums play an important role for disease discussion and emotional venting. Although rosacea management was the most frequently discussed topic, emotional posting was a significantly prevalent occurrence.
Collapse
Affiliation(s)
- Karan Rajalingam
- Charles E. Schmidt College of Medicine, Florida Atlantic University, Boca Raton, Florida
| | - Nicole Levin
- Charles E. Schmidt College of Medicine, Florida Atlantic University, Boca Raton, Florida
| | - Oge Marques
- Department of Computer & Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida
| | - James Grichnik
- Department of Dermatology and Cutaneous Surgery, University of South Florida Morsani College of Medicine, Tampa, Florida
| | - Ann Lin
- Department of Dermatology and Cutaneous Surgery, University of South Florida Morsani College of Medicine, Tampa, Florida
| | - Wei-Shen Chen
- Department of Dermatology and Cutaneous Surgery, University of South Florida Morsani College of Medicine, Tampa, Florida
| |
Collapse
|
8
|
Thornton C, Lanyi K, Wilkins G, Potter R, Hunter E, Kolehmainen N, Pearson F. Scoping the Priorities and Concerns of Parents: Infodemiology Study of Posts on Mumsnet and Reddit. J Med Internet Res 2023; 25:e47849. [PMID: 38015600 PMCID: PMC10716753 DOI: 10.2196/47849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 09/18/2023] [Accepted: 09/28/2023] [Indexed: 11/29/2023] Open
Abstract
BACKGROUND Health technology innovation is increasingly supported by a bottom-up approach to priority setting, aiming to better reflect the concerns of its intended beneficiaries. Web-based forums provide parents with an outlet to share concerns, advice, and information related to parenting and the health and well-being of their children. They provide a rich source of data on parenting concerns and priorities that could inform future child health research and innovation. OBJECTIVE The aim of the study is to identify common concerns expressed on 2 major web-based forums and cluster these to identify potential family health concern topics as indicative priority areas for future research and innovation. METHODS We text-mined the r/Parenting subreddit (69,846 posts) and the parenting section of Mumsnet (99,848 posts) to create a large corpus of posts. A generative statistical model (latent Dirichlet allocation) was used to identify the most discussed topics in the corpus, and content analysis was applied to identify the parenting concerns found in a subset of posts. RESULTS A model with 25 topics produced the highest coherence and a wide range of meaningful parenting concern topics. The most frequently expressed parenting concerns are related to their child's sleep, self-care, eating (and food), behavior, childcare context, and the parental context including parental conflict. Topics directly associated with infants, such as potty training and bottle feeding, were more common on Mumsnet, while parental context and screen time were more common on r/Parenting. CONCLUSIONS Latent Dirichlet allocation topic modeling can be applied to gain a rapid, yet meaningful overview of parent concerns expressed on a large and diverse set of social media posts and used to complement traditional insight gathering methods. Parents framed their concerns in terms of children's everyday health concerns, generating topics that overlap significantly with established family health concern topics. We provide evidence of the range of family health concerns found at these sources and hope this can be used to generate material for use alongside traditional insight gathering methods.
Collapse
Affiliation(s)
- Christopher Thornton
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - Kate Lanyi
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - Georgina Wilkins
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - Rhiannon Potter
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - Emily Hunter
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - Niina Kolehmainen
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - Fiona Pearson
- National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| |
Collapse
|
9
|
Fu J, Yang J, Li Q, Huang D, Yang H, Xie X, Xu H, Zhang M, Zheng C. What can we learn from a Chinese social media used by glaucoma patients? BMC Ophthalmol 2023; 23:470. [PMID: 37986061 PMCID: PMC10661764 DOI: 10.1186/s12886-023-03208-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 11/07/2023] [Indexed: 11/22/2023] Open
Abstract
PURPOSE Our study aims to discuss glaucoma patients' needs and Internet habits using big data analysis and Natural Language Processing (NLP) based on deep learning (DL). METHODS In this retrospective study, we used web crawler technology to crawl glaucoma-related topic posts from the glaucoma bar of Baidu Tieba, China. According to the contents of topic posts, we classified them into posts with seeking medical advice and without seeking medical advice (social support, expressing emotions, sharing knowledge, and others). Word Cloud and frequency statistics were used to analyze the contents and visualize the keywords of topic posts. Two DL models, Bidirectional Long Short-Term Memory (Bi-LSTM) and Bidirectional Encoder Representations from Transformers (BERT), were trained to identify the posts seeking medical advice. The evaluation matrices included: accuracy, F1 value, and the area under the ROC curve (AUC). RESULTS A total of 10,892 topic posts were included, among them, most were seeking medical advice (N = 7071, 64.91%), and seeking advice regarding symptoms or examination (N = 4913, 45.11%) dominated the majority. The following were searching for social support (N = 2362, 21.69%), expressing emotions (N = 497, 4.56%), and sharing knowledge (N = 527, 4.84%) in sequence. The word cloud analysis results showed that ocular pressure, visual field, examination, and operation were the most frequent words. The accuracy, F1 score, and AUC were 0.891, 0.891, and 0.931 for the BERT model, 0.82, 0.821, and 0.890 for the Bi-LSTM model. CONCLUSION Social media can help enhance the patient-doctor relationship by providing patients' concerns and cognition about glaucoma in China. NLP can be a powerful tool to reflect patients' focus on diseases. DL models performed well in classifying Chinese medical-related texts, which could play an important role in public health monitoring.
Collapse
Affiliation(s)
- Junxia Fu
- Department of Ophthalmology, School of Medicine, Xinhua Hospital Affiliated to Shanghai Jiao Tong University, 200092, Shanghai, China
- Institute of Hospital Development Strategy, China Hospital Development Institute, Shanghai Jiao Tong University, 200092, Shanghai, China
| | - Junrui Yang
- Joint Shantou International Eye Center of Shantou University and the Chinese University of Hong Kong, Shantou University Medical College, Shantou, Guangdong, China
- Department of Ophthalmology, The 74th Army Group Hospital, Guangzhou, Guangdong, China
| | - Qiuman Li
- Department of Pediatric Cardiology, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Danqing Huang
- Institute of Hospital Development Strategy, China Hospital Development Institute, Shanghai Jiao Tong University, 200092, Shanghai, China
| | - Hongyang Yang
- Institute of Hospital Development Strategy, China Hospital Development Institute, Shanghai Jiao Tong University, 200092, Shanghai, China
| | - Xiaoling Xie
- Joint Shantou International Eye Center of Shantou University and the Chinese University of Hong Kong, Shantou University Medical College, Shantou, Guangdong, China
| | - Huaxin Xu
- The Faculty of Science, University of Technology Sydney, Sydney, Australia
| | - Mingzhi Zhang
- Joint Shantou International Eye Center of Shantou University and the Chinese University of Hong Kong, Shantou University Medical College, Shantou, Guangdong, China.
| | - Ce Zheng
- Department of Ophthalmology, School of Medicine, Xinhua Hospital Affiliated to Shanghai Jiao Tong University, 200092, Shanghai, China.
- Institute of Hospital Development Strategy, China Hospital Development Institute, Shanghai Jiao Tong University, 200092, Shanghai, China.
| |
Collapse
|
10
|
Zulbayar S, Mollayeva T, Colantonio A, Chan V, Escobar M. Integrating unsupervised and supervised learning techniques to predict traumatic brain injury: A population-based study. INTELLIGENCE-BASED MEDICINE 2023; 8:100118. [PMID: 38222038 PMCID: PMC10785655 DOI: 10.1016/j.ibmed.2023.100118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
This work aimed to identify pre-existing health conditions of patients with traumatic brain injury (TBI) and develop predictive models for the first TBI event and its external causes by employing a combination of unsupervised and supervised learning algorithms. We acquired up to five years of pre-injury diagnoses for 488,107 patients with TBI and 488,107 matched control patients who entered the emergency department or acute care hospitals between April 1st, 2002, and March 31st, 2020. Diagnoses were obtained from the Ontario Health Insurance Plan (OHIP) database which contains province-wide claims data by physicians in Ontario, Canada for inpatient and outpatient services. A screening process was conducted on the OHIP diagnostic codes to limit the subsequent analysis to codes that were predictive of TBI, which concluded that 314 codes were significantly associated with TBI. The Latent Dirichlet Allocation (LDA) model was applied to the diagnostic codes and generated an optimal number of 19 topics that concur with published literature but also suggest other unexplored areas. Estimated word-topic probabilities from the LDA model helped us detect pre-morbid conditions among patients with TBI by uncovering the underlying patterns of diagnoses, meanwhile estimated document-topic probabilities were utilized in variable creation as form of a dimension reduction. We created 19 topic scores for each patient in the cohort which were utilized along with socio-demographic factors for Random Forest binary classifier models. Test set performances evaluated using area under the receiver operating characteristic curve (AUC) were: TBI event (AUC = 0.85), external cause of injury: falls (AUC = 0.85), struck by/against (AUC = 0.83), cyclist collision (AUC = 0.76), motor vehicle collision (AUC = 0.83). Our analysis successfully demonstrated the feasibility of using machine learning to predict TBI due to various external causes and identified the most important factors that contribute to this prediction.
Collapse
Affiliation(s)
- Suvd Zulbayar
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada
- Institute of Health and Policy, Management and Evaluation, University of Toronto, M5T 3M6, Canada
| | - Tatyana Mollayeva
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON M5G 1V7, Canada
- Acquired Brain Injury Research Lab, Department of Occupational Science and Occupational Therapy, University of Toronto, Toronto, ON M5G 1V7, Canada
- KITE Research Institute, Toronto Rehabilitation Institute-University Health Network, Toronto, ON M5G 2A2, Canada
| | - Angela Colantonio
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON M5G 1V7, Canada
- Acquired Brain Injury Research Lab, Department of Occupational Science and Occupational Therapy, University of Toronto, Toronto, ON M5G 1V7, Canada
- KITE Research Institute, Toronto Rehabilitation Institute-University Health Network, Toronto, ON M5G 2A2, Canada
- Institute of Health and Policy, Management and Evaluation, University of Toronto, M5T 3M6, Canada
- ICES, Toronto, ON, M4N 3M5, Canada
| | - Vincy Chan
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON M5G 1V7, Canada
- Acquired Brain Injury Research Lab, Department of Occupational Science and Occupational Therapy, University of Toronto, Toronto, ON M5G 1V7, Canada
- KITE Research Institute, Toronto Rehabilitation Institute-University Health Network, Toronto, ON M5G 2A2, Canada
- Institute of Health and Policy, Management and Evaluation, University of Toronto, M5T 3M6, Canada
| | - Michael Escobar
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada
| |
Collapse
|
11
|
Omiye JA, Gui H, Daneshjou R, Cai ZR, Muralidharan V. Principles, applications, and future of artificial intelligence in dermatology. Front Med (Lausanne) 2023; 10:1278232. [PMID: 37901399 PMCID: PMC10602645 DOI: 10.3389/fmed.2023.1278232] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 09/27/2023] [Indexed: 10/31/2023] Open
Abstract
This paper provides an overview of artificial-intelligence (AI), as applied to dermatology. We focus our discussion on methodology, AI applications for various skin diseases, limitations, and future opportunities. We review how the current image-based models are being implemented in dermatology across disease subsets, and highlight the challenges facing widespread adoption. Additionally, we discuss how the future of AI in dermatology might evolve and the emerging paradigm of large language, and multi-modal models to emphasize the importance of developing responsible, fair, and equitable models in dermatology.
Collapse
Affiliation(s)
| | - Haiwen Gui
- Department of Dermatology, Stanford University, Stanford, CA, United States
| | - Roxana Daneshjou
- Department of Dermatology, Stanford University, Stanford, CA, United States
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Zhuo Ran Cai
- Department of Dermatology, Stanford University, Stanford, CA, United States
| | | |
Collapse
|
12
|
Cummins JA, Zhou G, Nambudiri VE. Natural Language Processing for Large-Scale Analysis of Eczema and Psoriasis Social Media Comments. JID INNOVATIONS 2023; 3:100210. [PMID: 37564106 PMCID: PMC10410170 DOI: 10.1016/j.xjidi.2023.100210] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 02/19/2023] [Accepted: 02/27/2023] [Indexed: 08/12/2023] Open
Abstract
Social media tools are widely used by dermatologic patients. Eczema and psoriasis, two of the most common inflammatory skin diseases, are well-represented on the social media site Reddit. We used natural language processing tools to examine comments in subreddits r/psoriasis and r/eczema (combined user base >187,000), tracking commenters' interest levels and sentiments related to common treatments for psoriasis and eczema as well as discussions of adverse drug reactions. All comments from 2014-2020 from the subreddits r/eczema (n = 196,571) and r/psoriasis (n = 123,144) were retrieved and processed using natural language processing tools. Comment volume in r/eczema related to antibacterial therapies, lifestyle changes, and prednisone decreased from 2014-2020, whereas phototherapy comments remained stable, and dupilumab comment volume increased. Comment volume in r/psoriasis for newer therapeutics (including biologics and apremilast) increased after Food and Drug Administration approval, whereas older therapies such as etanercept, adalimumab, and methotrexate decreased over time. Sentiment scores tended to decrease in the years after Food and Drug Administration approval. Among psoriasis treatments, calcipotriene and branded calcipotriene/betamethasone foam had the highest sentiment, whereas apremilast had the lowest overall sentiment score. These analyses also identified changes in patient interest levels and sentiment related to eczema and psoriasis treatments, suggesting an area for additional research.
Collapse
Affiliation(s)
| | - Guohai Zhou
- Department of Dermatology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | - Vinod E. Nambudiri
- Department of Dermatology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| |
Collapse
|
13
|
Emanuel RHK, Docherty PD, Lunt H, Campbell RE. Comparing Literature- and Subreddit-Derived Laboratory Values in Polycystic Ovary Syndrome (PCOS): Validation of Clinical Data Posted on PCOS Reddit Forums. JMIR Form Res 2023; 7:e44810. [PMID: 37624626 PMCID: PMC10492173 DOI: 10.2196/44810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 05/16/2023] [Accepted: 05/24/2023] [Indexed: 08/26/2023] Open
Abstract
BACKGROUND Polycystic ovary syndrome (PCOS) is a heterogeneous condition that affects 4% to 21% of people with ovaries. Inaccessibility or dissatisfaction with clinical treatment for PCOS has led to some individuals with the condition discussing their experiences in specialized web-based forums. OBJECTIVE This study explores the feasibility of using such web-based forums for clinical research purposes by gathering and analyzing laboratory test results posted in an active PCOS forum, specifically the PCOS subreddit hosted on Reddit. METHODS We gathered around 45,000 posts from the PCOS subreddit. A random subset of 5000 posts was manually read, and the presence of laboratory test results was labeled. These labeled posts were used to train a machine learning model to identify which of the remaining posts contained laboratory results. The laboratory results were extracted manually from the identified posts. These self-reported laboratory test results were compared with values in the published literature to assess whether the results were concordant with researcher-published values for PCOS cohorts. A total of 10 papers were chosen to represent published PCOS literature, with selection criteria including the Rotterdam diagnostic criteria for PCOS, a publication date within the last 20 years, and at least 50 participants with PCOS. RESULTS Overall, the general trends observed in the laboratory test results from the PCOS web-based forum were consistent with clinically reported PCOS. A number of results, such as follicle stimulating hormone, fasting insulin, and anti-Mullerian hormone, were concordant with published values for patients with PCOS. The high consistency of these results among the literature and when compared to the subreddit suggests that follicle stimulating hormone, fasting insulin, and anti-Mullerian hormone are more consistent across PCOS phenotypes than other test results. Some results, such as testosterone, sex hormone-binding globulin, and homeostasis model assessment-estimated insulin resistance index, were between those of PCOS literature values and normal values, as defined by clinical testing limits. Interestingly, other results, including dehydroepiandrosterone sulfate, luteinizing hormone, and fasting glucose, appeared to be slightly more dysregulated than those reported in the literature. CONCLUSIONS The differences between the forum-posted results and those published in the literature may be due to the selection process in clinical studies and the possibility that the forum disproportionally describes PCOS phenotypes that are less likely to be alleviated with medical intervention. However, the degree of concordance in most laboratory test values implied that the PCOS web-based forum participants were representative of research-identified PCOS cohorts. This validation of the PCOS subreddit grants the possibility for more research into the contents of the subreddit and the idea of undertaking similar research using the contents of other medical internet forums.
Collapse
Affiliation(s)
- Rebecca H K Emanuel
- Department of Mechanical Engineering, University of Canterbury, Christchurch, New Zealand
| | - Paul D Docherty
- Department of Mechanical Engineering, University of Canterbury, Christchurch, New Zealand
| | - Helen Lunt
- Diabetes Services, Te Whatu Ora Waitaha Canterbury, Canterbury, New Zealand
- Department of Medicine, University of Otago, Canterbury, New Zealand
| | - Rebecca E Campbell
- Department of Physiology, School of Biomedical Sciences, Centre for Neuroendocrinology, University of Otago, Dunedin, New Zealand
| |
Collapse
|
14
|
Laureate CDP, Buntine W, Linger H. A systematic review of the use of topic models for short text social media analysis. Artif Intell Rev 2023:1-33. [PMID: 37362887 PMCID: PMC10150353 DOI: 10.1007/s10462-023-10471-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2023] [Indexed: 06/28/2023]
Abstract
Recently, research on short text topic models has addressed the challenges of social media datasets. These models are typically evaluated using automated measures. However, recent work suggests that these evaluation measures do not inform whether the topics produced can yield meaningful insights for those examining social media data. Efforts to address this issue, including gauging the alignment between automated and human evaluation tasks, are hampered by a lack of knowledge about how researchers use topic models. Further problems could arise if researchers do not construct topic models optimally or use them in a way that exceeds the models' limitations. These scenarios threaten the validity of topic model development and the insights produced by researchers employing topic modelling as a methodology. However, there is currently a lack of information about how and why topic models are used in applied research. As such, we performed a systematic literature review of 189 articles where topic modelling was used for social media analysis to understand how and why topic models are used for social media analysis. Our results suggest that the development of topic models is not aligned with the needs of those who use them for social media analysis. We have found that researchers use topic models sub-optimally. There is a lack of methodological support for researchers to build and interpret topics. We offer a set of recommendations for topic model researchers to address these problems and bridge the gap between development and applied research on short text topic models. Supplementary Information The online version contains supplementary material available at 10.1007/s10462-023-10471-x.
Collapse
Affiliation(s)
| | - Wray Buntine
- College of Engineering and Computer Science, VinUniversity, Vinhomes Ocean Park, Gia Lam District, Hanoi 10000 Vietnam
| | - Henry Linger
- Faculty of IT, Monash University, Wellington Rd, Clayton, VIC 3800 Australia
| |
Collapse
|
15
|
El-Jack K, Henderson K, Andy AU, Southwick L. Reddit Users' Questions and Concerns about Anesthesia. INTERNATIONAL JOURNAL OF MEDICAL STUDENTS 2023. [DOI: 10.5195/ijms.2022.1687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023] Open
Abstract
Background: Patients utilize social media in search of support networks. Reddit is one of the most popular social media sites and allows users to anonymously connect. Anesthesia patients are actively using Reddit to discuss their treatment options and experiences within the medical system.
Methods: Posts published on an active Reddit forum on Anesthesia (i.e., /r/Anesthesia) were used. Big Query was used to collect posts from /r/Anesthesia. We collected 3,288 posts published between December 2015 and August 2019. We collected a control group of 3,288 posts from a Reddit forum not related to Anesthesia. Using latent Dirichlet allocation (LDA) we extracted 20 topics from our data set. The LDA topic themes most associated with posts in /r/Anesthesia compared to the control group were determined.
Results: LDA analysis of posts in /r/Anesthesia relative to a control group produced 6 distinct categories of posts (Table 1). The posts most associated with /r/Anesthesia when compared to a control group were posts belonging to the “Physician-Patient Experience” category (Cohen’s d= 0.389) while the posts least associated with /r/Anesthesia were from the “Uncertainties” category of posts (Cohen’s d= 0.147). Example experiences from members of the /r/Anesthesia forum highlight subjective experiences of patients undergoing anesthesia.
Conclusions: The language used on social media can provide insights into an individual's experience with anesthesia and inform physicians about patient concerns. Anesthesiologists are poised to address these concerns and prevent anonymous misinformation by providing verified physician insights on the forum /r/Anesthesia.
Collapse
|
16
|
Chen Z, Zeng G, Zhong S, Wang L. From the exotic to the everyday: The Avocado crossing borders via cyberspace. Appetite 2023; 180:106362. [PMID: 36368563 DOI: 10.1016/j.appet.2022.106362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Revised: 10/28/2022] [Accepted: 10/29/2022] [Indexed: 11/09/2022]
Abstract
With the globalization of food sales and consumption, exotic foods are now regularly crossing geographical and cultural borders and moving into local areas. This process is attracting ever-increasing attention from academics. Taking avocado consumption presented on Sina Weibo as an example, this research analyzes avocado related user-generated content on Sina Weibo over three years- 2013, 2015, and 2017- and employs topic modeling and semantic network methods to obtain the mechanism by which exotic food cross borders to appear in local consumers' daily food choices. Two specific links are explored: online information dissemination and offline daily consumption. The result indicates that a selective geographical narrative and framework for avocado information influence local consumers' choice of exotic foods according to three aspects: edibility, accessibility, and acceptability. For local consumers, the avocado is now connected with local objects and spaces, gradually transforming from a novelty to functional daily food and from low to high-frequency consumption to high-frequency consumption, escaping the marginal and penetrating into the core cultural context and completing the process of embedment into the everyday. This study refutes the assertion that "globalized diets bring about homogenized diets," explores the mechanism of influence by which information dissemination in cyberspace affects cultural borders, complements the study of food consumption in Southern countries, and provides new thoughts on the theoretical and practical exploration of food globalization from the perspective of food geography.
Collapse
Affiliation(s)
- Zheng Chen
- School of Tourism Management, Sun Yat-Sen University, China.
| | - Guojun Zeng
- School of Tourism Management, Sun Yat-Sen University, China.
| | - Shuru Zhong
- School of Tourism Management, Sun Yat-Sen University, China.
| | - Longjie Wang
- School of Management, Zhejiang University, China.
| |
Collapse
|
17
|
Fu J, Yang J, Li Q, Huang D, Yang H, Xie X, Xu H, Zhang M, Zheng C. Glaucoma-related posts from a Chinese social media: An exploratory study.. [DOI: 10.21203/rs.3.rs-2312218/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Abstract
Purpose: Our study aims to discuss glaucoma patients' needs and Internet habits using big data analysis and Natural Language Processing (NLP) based on deep learning (DL). We also developed and validated DL models to recognize social media data.
Methods: In this retrospective study, we used web crawler technology to crawl glaucoma-related topic posts from the glaucoma bar of Baidu Tieba. According to the contents of topic posts, we classified them into posts with or without seeking medical advice. Word Cloud and frequency statistics were used to analyze the contents and visualize the keywords. Two DL models, Bidirectional Long Short-Term Memory (Bi-LSTM) and Bidirectional Encoder Representations from Transformers (BERT), were trained to identify the posts seeking medical advice. The evaluation matrices included: accuracy, F1 value, and the area under the ROC curve (AUC).
Results: A total of 10,892 topic posts were included, among them, most were seeking medical advice (N=7071, 64.91%), and seeking advice regarding symptoms or examination (N=4913, 45.11%) dominated the majority, followed by searching for social support , expressing emotions, and sharing knowledge. The word cloud analysis showed that ocular pressure, visual field, examination, and operation were the most frequent words. The accuracy, F1 score, and AUC were 0.891, 0.891, and 0.931 for BERT model, 0.82, 0.821, and 0.890 for Bi-LSTM model.
Conclusion: Social media can help enhance the patient-doctor relationship by providing patients’ concerns and cognition about glaucoma. DL models performed well in classifying Chinese medical-related texts, which could play an important role in public health monitoring.
Collapse
Affiliation(s)
| | - Junrui Yang
- Joint Shantou International Eye Center of Shantou University, Chinese University of Hong Kong, Shantou University Medical College
| | - Qiuman Li
- Guangzhou Women and Children’s Medical Center
| | | | | | - Xiaoling Xie
- Joint Shantou International Eye Center of Shantou University, Chinese University of Hong Kong, Shantou University Medical College
| | | | - Mingzhi Zhang
- Joint Shantou International Eye Center of Shantou University, Chinese University of Hong Kong, Shantou University Medical College
| | | |
Collapse
|
18
|
Joly-Chevrier M, Aly S, Lefrançois P. Comparison of Basal Cell Carcinoma Posts, Comments and Authors Between Reddit and Quora Forums. J Cutan Med Surg 2022; 26:634-635. [PMID: 36200886 DOI: 10.1177/12034754221129872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Affiliation(s)
| | - Safin Aly
- 5622 Faculty of Medicine, Université de Montréal, Montreal, QC, Canada
| | - Philippe Lefrançois
- 12367 Division of Dermatology, Department of Medicine, McGill University, Montreal, QC, Canada.,Division of Dermatology, Department of Medicine, Jewish General Hospital, Montreal, QC, Canada.,Lady Davis Institute for Medical Research, Montreal, QC, Canada
| |
Collapse
|
19
|
Rajagopalan D, Thomas J, Ring D, Fatehi A. Quantitative Patient-Reported Experience Measures Derived From Natural Language Processing Have a Normal Distribution and No Ceiling Effect. Qual Manag Health Care 2022; 31:210-218. [PMID: 35383720 DOI: 10.1097/qmh.0000000000000355] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
BACKGROUND AND OBJECTIVES Patient-reported experience measures have the potential to guide improvement in health care delivery. Many patient-reported experience measures are limited by the presence of strong ceiling effects that limit their analytical utility. METHODS We used natural language processing to develop 2 new methods of evaluating patient experience using text comments and associated ordinal and categorical ratings of willingness to recommend from 1390 patients receiving specialty or nonspecialty care at our offices. One method used multivariable analysis based on linguistic factors to derive a formula to estimate the ordinal likelihood to recommend. The other method used the meaning extraction method of thematic analysis to identify words associated with categorical ratings of likelihood to recommend with which we created an equation to compute an experience score. We measured normality of the 2 score distributions and ceiling effects. RESULTS Spearman rank-order correlation analysis identified 36 emotional and linguistic constructs associated with ordinal rating of likelihood to recommend, 9 of which were independently associated in multivariable analysis. The calculation derived from this model corresponded with the original ordinal rating with an accuracy within 0.06 units on a 0 to 10 scale. This score and the score developed from thematic analysis both had a relatively normal distribution and limited or no ceiling effect. CONCLUSIONS Quantitative ratings of patient experience developed using natural language processing of text comments can have relatively normal distributions and no ceiling effect.
Collapse
Affiliation(s)
- Dayal Rajagopalan
- Department of Surgery and Perioperative Care, Dell Medical School, The University of Texas at Austin
| | | | | | | |
Collapse
|
20
|
Das N, Sadhukhan B, Chatterjee T, Chakrabarti S. Effect of public sentiment on stock market movement prediction during the COVID-19 outbreak. SOCIAL NETWORK ANALYSIS AND MINING 2022; 12:92. [PMID: 35911484 PMCID: PMC9325657 DOI: 10.1007/s13278-022-00919-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 06/27/2022] [Accepted: 07/04/2022] [Indexed: 11/28/2022]
Abstract
Forecasting the stock market is one of the most difficult undertakings in the financial industry due to its complex, volatile, noisy, and nonparametric character. However, as computer science advances, an intelligent model can help investors and analysts minimize investment risk. Public opinion on social media and other online portals is an important factor in stock market predictions. The COVID-19 pandemic stimulates online activities since individuals are compelled to remain at home, bringing about a massive quantity of public opinion and emotion. This research focuses on stock market movement prediction with public sentiments using the long short-term memory network (LSTM) during the COVID-19 flare-up. Here, seven different sentiment analysis tools, VADER, logistic regression, Loughran–McDonald, Henry, TextBlob, Linear SVC, and Stanford, are used for sentiment analysis on web scraped data from four online sources: stock-related articles headlines, tweets, financial news from "Economic Times" and Facebook comments. Predictions are made utilizing both feeling scores and authentic stock information for every one of the 28 opinion measures processed. An accuracy of 98.11% is achieved by using linear SVC to calculate sentiment ratings from Facebook comments. Thereafter, the four estimated sentiment scores from each of the seven instruments are integrated with stock data in a step-by-step fashion to determine the overall influence on the stock market. When all four sentiment scores are paired with stock data, the forecast accuracy for five out of seven tools is at its most noteworthy, with linear SVC computed scores assisting stock data to arrive at its most elevated accuracy of 98.32%.
Collapse
Affiliation(s)
- Nabanita Das
- Department of Computer Science & Engineering, Techno International New Town, Kolkata, West Bengal India
| | - Bikash Sadhukhan
- Department of Computer Science & Engineering, Techno International New Town, Kolkata, West Bengal India
| | - Tanusree Chatterjee
- Department of Computer Science & Engineering, Techno International New Town, Kolkata, West Bengal India
| | | |
Collapse
|
21
|
Dave AD, Zhu D. Ophthalmology Inquiries on Reddit: What Should Physicians Know? Clin Ophthalmol 2022; 16:2923-2931. [PMID: 36071726 PMCID: PMC9441589 DOI: 10.2147/opth.s375822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 08/24/2022] [Indexed: 11/23/2022] Open
Abstract
Purpose Patients are seeking answers to personal medical questions on social media. Reddit, a popular social media site, has been overlooked as a source of data in the field of ophthalmology. We analyzed posts in the subreddit, r/Ophthalmology, to better understand the most common ophthalmic conditions patients are discussing online and how often those making posts are being advised to seek professional medical care. Patients and Methods This cross-sectional study analyzed posts and comments from March 18, 2018 to November 9, 2020. All posts and comments on r/Ophthalmology are public and were accessed using the Python Reddit API Wrapper. This text was analyzed for unique references to common ophthalmic conditions and for mentions and recommendations to different types of medical care. Results Nine hundred and nineteen posts were collected from the above timeframe. An auto-moderator makes a comment on every post to discourage patients from asking personal medical questions. Despite this, over two-thirds of posts discussed medical treatment for eye conditions in either a post, comment, or both. Almost half of all posts mention “ophthalmology”, but only 9% go as far as to recommend ophthalmic care. One-third of posts have no reference to medical care. Within posts, flashes and floaters were the most common condition mentioned, making up 15% of unique mentions, followed by glaucoma (7.4%) and retinal detachments (6.7%). Within comments, cataracts were most commonly discussed, making up 12% of unique mentions, followed by glaucoma (8.9%). Conclusion These findings show that patients are seeking information about their eye health on the r/Ophthalmology subreddit and that Reddit users are engaging with these types of posts, instead of recommending professional care in most cases. It is important for ophthalmologists to recognize the most common conditions patients are asking about online and learn how they can do a better job of educating their patients in the office.
Collapse
Affiliation(s)
- Amisha D Dave
- University of Connecticut School of Medicine, Farmington, CT, USA
- Correspondence: Amisha D Dave, University of Connecticut School of Medicine, 263 Farmington Ave, Farmington, CT, 06030, USA, Tel +1 203-903-2625, Email
| | - Dagny Zhu
- NVISION Eye Centers, Rowland Heights, CA, USA
| |
Collapse
|
22
|
Barrutia L, Vega-Gutiérrez J, Santamarina-Albertos A. Benefits, drawbacks, and challenges of social media use in dermatology: A systematic review. J DERMATOL TREAT 2022; 33:2738-2757. [PMID: 35506617 DOI: 10.1080/09546634.2022.2069661] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The presence of dermatological information on social media has grown exponentially over the last two decades. Consequently, the recent literature on this topic is abundant. Many authors have highlighted that social media constitutes a unique opportunity for patient education. Additionally, numerous other benefits of these platforms have been reported. However, other authors have focused on the potential risks that these networks involve. The main concerns are patient confidentiality, legal considerations and ethical issues. Therefore, we stand at a crossroads where the many advantages of social media use in dermatology seem to be underestimated due to the presence of potential drawbacks. At this point, we propose that a systematic review of the positive and negative aspects of using social media in dermatology is necessary. We carried out a comprehensive systematic review dating from inception to July 2021. Finally, 161 articles were included. Fifteen benefits, 11 drawbacks and 10 challenges of social media use in dermatology were identified and discussed. Suggested strategies to address the identified drawbacks were provided. Overall, while there are risks to using social media, they are outnumbered by their benefits. Therefore, dermatologists should embrace this opportunity to educate patients and aim to create rigorous and engaging content.
Collapse
Affiliation(s)
- Leire Barrutia
- Dermatology, Medicine and Toxicology Department, University of Valladolid, Valladolid, Spain.,Dermatology Department, Clinical University Hospital of Valladolid, Valladolid, Spain
| | - Jesús Vega-Gutiérrez
- Dermatology, Medicine and Toxicology Department, University of Valladolid, Valladolid, Spain.,Dermatology Department, Río Hortega University Hospital, Valladolid, Spain
| | - Alba Santamarina-Albertos
- Dermatology, Medicine and Toxicology Department, University of Valladolid, Valladolid, Spain.,Dermatology Department, Clinical University Hospital of Valladolid, Valladolid, Spain
| |
Collapse
|
23
|
Ahmed F, Lipoff JB. The role of dermatologists in social media: exploring the benefits and risks. DER HAUTARZT 2022; 73:401-404. [PMID: 35133443 PMCID: PMC8830428 DOI: 10.1007/s00105-022-04946-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 01/07/2022] [Indexed: 11/28/2022]
Affiliation(s)
- Fahad Ahmed
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA USA
| | - Jules B. Lipoff
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA USA
- Penn Medicine University City, 3737 Market Street, Suite 1100, 19104 Philadelphia, PA USA
- Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, PA USA
| |
Collapse
|
24
|
Markides BR, Laws R, Hesketh K, Maddison R, Denney‐Wilson E, Campbell KJ. A thematic cluster analysis of parents' online discussions about fussy eating. MATERNAL & CHILD NUTRITION 2022; 18:e13316. [PMID: 35132813 PMCID: PMC8932712 DOI: 10.1111/mcn.13316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Revised: 11/21/2021] [Accepted: 12/14/2021] [Indexed: 11/30/2022]
Abstract
Food fussiness is associated with non‐responsive parent feeding practices, such as persuasive and instrumental feeding. Although most children described as ‘fussy eaters’ are likely exhibiting developmentally typical behaviours, up to half of the parents of children 2–5 years old express concerns. Concern for fussy eating may mediate the use of non‐responsive feeding practices and so must be addressed in parent feeding interventions. Therefore, it is critical to better understand parents' concerns and how they may relate to feeding practices. This study aimed to explore how parents' feeding practices and the social cognitive factors that may drive them clustered based on parents' concern for fussy eating. Data were collected from parent discussions of fussy eating on a Reddit forum (80,366 posts). Latent Dirichlet allocation was used to identify discussions of fussy eating. Relevant posts (1542) made by users who identified as a parent of a fussy eater (n = 630) underwent qualitative coding and thematic analysis. Five clusters of parents were identified, ranging in size from 53 to 189 users. These were primarily characterised by parents' degree of concern and feeding practices: (1) High concern, nonresponsive; (2) Concerned, nonresponsive; (3) Low concern, responsive; (4) Low concern, mixed strategies; (5) Low concern, indulgent. Parents who used responsive practices tended to be less concerned for fussy eating, have greater trust in their child's ability to self‐regulate hunger, have longer‐term feeding goals, and exhibit greater ability for personal self‐regulation. Future research should further examine how these constructs may be leveraged in parent feeding interventions. Parents using nonresponsive practices to manage fussy eating may have more concerns about fussy eating, less trust in their child's ability to self‐regulate hunger, shorter‐term feeding goals, and less developed self‐regulation skills. As most children described as “fussy eaters” are likely exhibiting developmentally typical behaviours, interventions addressing nutrition and feeding practices in early childhood should aim to alleviate parental concern for fussy eating, for example, by helping them recognise developmentally normal eating behaviours. Early childhood feeding interventions may also be enhanced by helping parents to develop long‐term feeding goals and improved self‐regulation skills in the context of their feeding practices.
Collapse
Affiliation(s)
- Brittany R. Markides
- Institute for Physical Activity and Nutrition (IPAN), School of Exercise and Nutrition Sciences Deakin University Burwood Victoria Australia
| | - Rachel Laws
- Institute for Physical Activity and Nutrition (IPAN), School of Exercise and Nutrition Sciences Deakin University Burwood Victoria Australia
| | - Kylie Hesketh
- Institute for Physical Activity and Nutrition (IPAN), School of Exercise and Nutrition Sciences Deakin University Burwood Victoria Australia
| | - Ralph Maddison
- Institute for Physical Activity and Nutrition (IPAN), School of Exercise and Nutrition Sciences Deakin University Burwood Victoria Australia
| | - Elizabeth Denney‐Wilson
- Susan Wakil School of Nursing and Midwifery, Faculty of Medicine and Health The University of Sydney Camperdown New South Wales Australia
| | - Karen J. Campbell
- Institute for Physical Activity and Nutrition (IPAN), School of Exercise and Nutrition Sciences Deakin University Burwood Victoria Australia
| |
Collapse
|
25
|
Persistence and Attrition among Participants in a Multi-Page Online Survey Recruited via Reddit’s Social Media Network. SOCIAL SCIENCES 2022. [DOI: 10.3390/socsci11020031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Participant attrition is a major concern for the validity of longer or complex surveys. Unlike paper-based surveys, which may be discarded even if partially completed, multi-page online surveys capture responses from all completed pages until the time of abandonment. This can result in different item response rates, with pages earlier in the sequence showing more completions than later pages. Using data from a multi-page online survey administered to cohorts recruited on Reddit, this paper analyses the pattern of attrition at various stages of the survey instrument and examines the effects of survey length, time investment, survey format and complexity, and survey delivery on participant attrition. The participant attrition rate (PAR) differed between cohorts, with cohorts drawn from Reddit showing a higher PAR than cohorts targeted by other means. Common to all was that the PAR was higher among younger respondents and among men. Changes in survey question design resulted in the greatest rise in PAR irrespective of age, gender or cohort.
Collapse
|
26
|
Natural Language Processing: A Window to Understanding Skincare Trends. Int J Med Inform 2022; 160:104705. [DOI: 10.1016/j.ijmedinf.2022.104705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 01/09/2022] [Accepted: 01/20/2022] [Indexed: 11/21/2022]
|
27
|
Sager MA, Kashyap AM, Tamminga M, Ravoori S, Callison-Burch C, Lipoff JB. Identifying and Responding to Health Misinformation on Reddit Dermatology Forums With Artificially Intelligent Bots Using Natural Language Processing: Design and Evaluation Study. JMIR DERMATOLOGY 2021; 4:e20975. [PMID: 37632809 PMCID: PMC10334965 DOI: 10.2196/20975] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 07/08/2021] [Accepted: 08/05/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Reddit, the fifth most popular website in the United States, boasts a large and engaged user base on its dermatology forums where users crowdsource free medical opinions. Unfortunately, much of the advice provided is unvalidated and could lead to the provision of inappropriate care. Initial testing has revealed that artificially intelligent bots can detect misinformation regarding tanning and essential oils on Reddit dermatology forums and may be able to produce responses to posts containing misinformation. OBJECTIVE To analyze the ability of bots to find and respond to tanning and essential oil-related health misinformation on Reddit's dermatology forums in a controlled test environment. METHODS Using natural language processing techniques, we trained bots to target misinformation, using relevant keywords and to post prefabricated responses. By evaluating different model architectures across a held-out test set, we compared performances. RESULTS Our models yielded data test accuracies ranging 95%-100%, with a Bidirectional Encoder Representations from Transformers (BERT) fine-tuned model resulting in the highest level of test accuracy. Bots were then able to post corrective prefabricated responses to misinformation in a test environment. CONCLUSIONS Using a limited data set, bots accurately detected examples of health misinformation within Reddit dermatology forums. Given that these bots can then post prefabricated responses, this technique may allow for interception of misinformation. Providing correct information does not mean that users will be receptive or find such interventions persuasive. Further studies should investigate this strategy's effectiveness to inform future deployment of bots as a technique in combating health misinformation.
Collapse
Affiliation(s)
- Monique A Sager
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Aditya M Kashyap
- Department of Computer Science, University of Pennsylvania, Philadelphia, PA, United States
| | - Mila Tamminga
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Sadhana Ravoori
- Department of Computer Science, University of Pennsylvania, Philadelphia, PA, United States
| | | | - Jules B Lipoff
- Department of Dermatology, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
28
|
Wu W, Lyu H, Luo J. Characterizing Discourse about COVID-19 Vaccines: A Reddit Version of the Pandemic Story. HEALTH DATA SCIENCE 2021; 2021:9837856. [PMID: 36405359 PMCID: PMC9629685 DOI: 10.34133/2021/9837856] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 06/28/2021] [Indexed: 12/18/2022]
Abstract
It has been one year since the outbreak of the COVID-19 pandemic. The good news is that vaccines developed by several manufacturers are being actively distributed worldwide. However, as more and more vaccines become available to the public, various concerns related to vaccines become the primary barriers that may hinder the public from getting vaccinated. Considering the complexities of these concerns and their potential hazards, this study is aimed at offering a clear understanding about different population groups' underlying concerns when they talk about COVID-19 vaccines-particularly those active on Reddit. The goal is achieved by applying LDA and LIWC to characterize the pertaining discourse with insights generated through a combination of quantitative and qualitative comparisons. Findings include the following: (1) during the pandemic, the proportion of Reddit comments predominated by conspiracy theories outweighed that of any other topics; (2) each subreddit has its own user bases, so information posted in one subreddit may not reach that from other subreddits; and (3) since users' concerns vary across time and subreddits, communication strategies must be adjusted according to specific needs. The results of this study manifest challenges as well as opportunities in the process of designing effective communication and immunization programs.
Collapse
Affiliation(s)
- Wei Wu
- Goergen Institute for Data Science, University of Rochester, Rochester, USA
| | - Hanjia Lyu
- Goergen Institute for Data Science, University of Rochester, Rochester, USA
| | - Jiebo Luo
- Department of Computer Science, University of Rochester, Rochester, USA
| |
Collapse
|
29
|
Classifying patient and professional voice in social media health posts. BMC Med Inform Decis Mak 2021; 21:244. [PMID: 34407807 PMCID: PMC8371035 DOI: 10.1186/s12911-021-01577-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/06/2021] [Indexed: 11/10/2022] Open
Abstract
Background Patient-based analysis of social media is a growing research field with the aim of delivering precision medicine but it requires accurate classification of posts relating to patients’ experiences. We motivate the need for this type of classification as a pre-processing step for further analysis of social media data in the context of related work in this area. In this paper we present experiments for a three-way document classification by patient voice, professional voice or other. We present results for a convolutional neural network classifier trained on English data from two different data sources (Reddit and Twitter) and two domains (cardiovascular and skin diseases). Results We found that document classification by patient voice, professional voice or other can be done consistently manually (0.92 accuracy). Annotators agreed roughly equally for each domain (cardiovascular and skin) but they agreed more when annotating Reddit posts compared to Twitter posts. Best classification performance was obtained when training two separate classifiers for each data source, one for Reddit and one for Twitter posts, when evaluating on in-source test data for both test sets combined with an overall accuracy of 0.95 (and macro-average F1 of 0.92) and an F1-score of 0.95 for patient voice only. Conclusion The main conclusion resulting from this work is that combining social media data from platforms with different characteristics for training a patient and professional voice classifier does not result in best possible performance. We showed that it is best to train separate models per data source (Reddit and Twitter) instead of a model using the combined training data from both sources. We also found that it is preferable to train separate models per domain (cardiovascular and skin) while showing that the difference to the combined model is only minor (0.01 accuracy). Our highest overall F1-score (0.95) obtained for classifying posts as patient voice is a very good starting point for further analysis of social media data reflecting the experience of patients. Supplementary Information The online version contains supplementary material available at 10.1186/s12911-021-01577-9.
Collapse
|
30
|
Fairie P, Zhang Z, D'Souza AG, Walsh T, Quan H, Santana MJ. Categorising patient concerns using natural language processing techniques. BMJ Health Care Inform 2021; 28:e100274. [PMID: 34193519 PMCID: PMC8246286 DOI: 10.1136/bmjhci-2020-100274] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 05/20/2021] [Indexed: 11/23/2022] Open
Abstract
OBJECTIVES Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-modelling approach to analyse a corpus of patient feedback. METHODS Patient concerns were received by Alberta Health Services between 2011 and 2018 (n=76 163), regarding 806 care facilities in 163 municipalities, including hospitals, clinics, community care centres and retirement homes, in a province of 4.4 million. Their existing framework requires manual labelling of pre-defined categories. We applied an automated latent Dirichlet allocation (LDA)-based topic modelling algorithm to identify the topics present in these concerns, and thereby produce a framework-free categorisation. RESULTS The LDA model produced 40 topics which, following manual interpretation by researchers, were reduced to 28 coherent topics. The most frequent topics identified were communication issues causing delays (frequency: 10.58%), community care for elderly patients (8.82%), interactions with nurses (8.80%) and emergency department care (7.52%). Many patient concerns were categorised into multiple topics. Some were more specific versions of categories from the existing framework (eg, communication issues causing delays), while others were novel (eg, smoking in inappropriate settings). DISCUSSION LDA-generated topics were more nuanced than the manually labelled categories. For example, LDA found that concerns with community care were related to concerns about nursing for seniors, providing opportunities for insight and action. CONCLUSION Our findings outline the range of concerns patients share in a large health system and demonstrate the usefulness of using LDA to identify categories of patient concerns.
Collapse
Affiliation(s)
- Paul Fairie
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Alberta Strategy for Patient-Oriented Research Patient Engagement Platform, Calgary, Alberta, Canada
| | - Zilong Zhang
- Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Adam G D'Souza
- Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Alberta Health Services, Calgary, Alberta, Canada
| | - Tara Walsh
- Alberta Health Services, Calgary, Alberta, Canada
| | - Hude Quan
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Maria J Santana
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Alberta Strategy for Patient-Oriented Research Patient Engagement Platform, Calgary, Alberta, Canada
- Department of Pediatrics, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| |
Collapse
|
31
|
Yu K, Syed MN, Bernardis E, Gelfand JM. Machine Learning Applications in the Evaluation and Management of Psoriasis: A Systematic Review. ACTA ACUST UNITED AC 2021; 5:147-159. [PMID: 33733038 PMCID: PMC7963214 DOI: 10.1177/2475530320950267] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Background Machine learning (ML), a subset of artificial intelligence (AI) that aims to teach machines to automatically learn tasks by inferring patterns from data, holds significant promise to aid psoriasis care. Applications include evaluation of skin images for screening and diagnosis as well as clinical management including treatment and complication prediction. Objective To summarize literature on ML applications to psoriasis evaluation and management and to discuss challenges and opportunities for future advances. Methods We searched MEDLINE, Google Scholar, ACM Digital Library, and IEEE Xplore for peer-reviewed publications published in English through December 1, 2019. Our search queries identified publications with any of the 10 computing-related keywords and "psoriasis" in the title and/or abstract. Results Thirty-three studies were identified. Articles were organized by topic and synthesized as evaluation- or management-focused articles covering 5 content categories: (A) Evaluation using skin images: (1) identification and differential diagnosis of psoriasis lesions, (2) lesion segmentation, and (3) lesion severity and area scoring; (B) clinical management: (1) prediction of complications and (2) treatment. Conclusion Machine learning has significant potential to aid psoriasis evaluation and management. Current topics popular in ML research on psoriasis are the evaluation of medical images, prediction of complications, and treatment discovery. For patients to derive the greatest benefit from ML advancements, it is helpful for dermatologists to have an understanding of ML and how it can effectively aid their assessments and decision-making.
Collapse
Affiliation(s)
- Kimberley Yu
- Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
| | - Maha N Syed
- Department of Dermatology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
| | - Elena Bernardis
- Department of Dermatology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
| | - Joel M Gelfand
- Department of Dermatology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
32
|
Eapen BR. Artificial Intelligence in Dermatology: A Practical Introduction to a Paradigm Shift. Indian Dermatol Online J 2020; 11:881-889. [PMID: 33344334 PMCID: PMC7735013 DOI: 10.4103/idoj.idoj_388_20] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Revised: 07/27/2020] [Accepted: 09/13/2020] [Indexed: 12/13/2022] Open
Abstract
Artificial Intelligence (AI) has surpassed dermatologists in skin cancer detection, but dermatology still lags behind radiology in its broader adoption. Building and using AI applications are becoming increasingly accessible. However, complex use cases may still require specialized expertise for design and deployment. AI has many applications in dermatology ranging from fundamental research, diagnostics, therapeutics, and cosmetic dermatology. The lack of standardization of images and privacy concerns are the foremost challenges stifling AI adoption. Dermatologists have a significant role to play in standardized data collection, curating data for machine learning, clinically validating AI solutions, and ultimately adopting this paradigm shift that is changing the way we practice.
Collapse
Affiliation(s)
- Bell R. Eapen
- Information Systems, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
33
|
Jelodar H, Wang Y, Orji R, Huang S. Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach. IEEE J Biomed Health Inform 2020; 24:2733-2742. [PMID: 32750931 DOI: 10.1101/2020.04.22.054973] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Internet forums and public social media, such as online healthcare forums, provide a convenient channel for users (people/patients) concerned about health issues to discuss and share information with each other. In late December 2019, an outbreak of a novel coronavirus (infection from which results in the disease named COVID-19) was reported, and, due to the rapid spread of the virus in other parts of the world, the World Health Organization declared a state of emergency. In this paper, we used automated extraction of COVID-19-related discussions from social media and a natural language process (NLP) method based on topic modeling to uncover various issues related to COVID-19 from public opinions. Moreover, we also investigate how to use LSTM recurrent neural network for sentiment classification of COVID-19 comments. Our findings shed light on the importance of using public opinions and suitable computational techniques to understand issues surrounding COVID-19 and to guide related decision-making. In addition, experiments demonstrated that the research model achieved an accuracy of 81.15% - a higher accuracy than that of several other well-known machine-learning algorithms for COVID-19-Sentiment Classification.
Collapse
|
34
|
Chan S, Reddy V, Myers B, Thibodeaux Q, Brownstone N, Liao W. Machine Learning in Dermatology: Current Applications, Opportunities, and Limitations. Dermatol Ther (Heidelb) 2020; 10:365-386. [PMID: 32253623 PMCID: PMC7211783 DOI: 10.1007/s13555-020-00372-0] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Indexed: 12/14/2022] Open
Abstract
Machine learning (ML) has the potential to improve the dermatologist's practice from diagnosis to personalized treatment. Recent advancements in access to large datasets (e.g., electronic medical records, image databases, omics), faster computing, and cheaper data storage have encouraged the development of ML algorithms with human-like intelligence in dermatology. This article is an overview of the basics of ML, current applications of ML, and potential limitations and considerations for further development of ML. We have identified five current areas of applications for ML in dermatology: (1) disease classification using clinical images; (2) disease classification using dermatopathology images; (3) assessment of skin diseases using mobile applications and personal monitoring devices; (4) facilitating large-scale epidemiology research; and (5) precision medicine. The purpose of this review is to provide a guide for dermatologists to help demystify the fundamentals of ML and its wide range of applications in order to better evaluate its potential opportunities and challenges.
Collapse
Affiliation(s)
- Stephanie Chan
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
| | - Vidhatha Reddy
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
| | - Bridget Myers
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
| | - Quinn Thibodeaux
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
| | - Nicholas Brownstone
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
| | - Wilson Liao
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
35
|
Xu C, Yang H, Sun L, Cao X, Hou Y, Cai Q, Jia P, Wang Y. Detecting Lung Cancer Trends by Leveraging Real-World and Internet-Based Data: Infodemiology Study. J Med Internet Res 2020; 22:e16184. [PMID: 32163035 PMCID: PMC7099398 DOI: 10.2196/16184] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Revised: 10/28/2019] [Accepted: 02/22/2020] [Indexed: 01/13/2023] Open
Abstract
Background Internet search data on health-related terms can reflect people’s concerns about their health status in near real time, and hence serve as a supplementary metric of disease characteristics. However, studies using internet search data to monitor and predict chronic diseases at a geographically finer state-level scale are sparse. Objective The aim of this study was to explore the associations of internet search volumes for lung cancer with published cancer incidence and mortality data in the United States. Methods We used Google relative search volumes, which represent the search frequency of specific search terms in Google. We performed cross-sectional analyses of the original and disease metrics at both national and state levels. A smoothed time series of relative search volumes was created to eliminate the effects of irregular changes on the search frequencies and obtain the long-term trends of search volumes for lung cancer at both the national and state levels. We also performed analyses of decomposed Google relative search volume data and disease metrics at the national and state levels. Results The monthly trends of lung cancer-related internet hits were consistent with the trends of reported lung cancer rates at the national level. Ohio had the highest frequency for lung cancer-related search terms. At the state level, the relative search volume was significantly correlated with lung cancer incidence rates in 42 states, with correlation coefficients ranging from 0.58 in Virginia to 0.94 in Oregon. Relative search volume was also significantly correlated with mortality in 47 states, with correlation coefficients ranging from 0.58 in Oklahoma to 0.94 in North Carolina. Both the incidence and mortality rates of lung cancer were correlated with decomposed relative search volumes in all states excluding Vermont. Conclusions Internet search behaviors could reflect public awareness of lung cancer. Research on internet search behaviors could be a novel and timely approach to monitor and estimate the prevalence, incidence, and mortality rates of a broader range of cancers and even more health issues.
Collapse
Affiliation(s)
- Chenjie Xu
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Hongxi Yang
- School of Public Health, Tianjin Medical University, Tianjin, China.,School of Public Health, Yale University, New Haven, CT, United States
| | - Li Sun
- School of Nursing, Tianjin Medical University, Tianjin, China
| | - Xinxi Cao
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Yabing Hou
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Qiliang Cai
- School of Public Health, Tianjin Medical University, Tianjin, China.,The Second Hospital of Tianjin Medical University, Tianjin, China
| | - Peng Jia
- Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong, China.,International Initiative on Spatial Lifecourse Epidemiology, Hong Kong, China.,Faculty of Geo-information Science and Earth Observation, University of Twente, Enschede, Netherlands
| | - Yaogang Wang
- School of Public Health, Tianjin Medical University, Tianjin, China
| |
Collapse
|
36
|
Yousaf A, Hagen R, Delaney E, Davis S, Zinn Z. The influence of social media on acne treatment: A cross-sectional survey. Pediatr Dermatol 2020; 37:301-304. [PMID: 31944359 PMCID: PMC7453954 DOI: 10.1111/pde.14091] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
BACKGROUND/OBJECTIVES Social media use has been suggested to worsen psychiatric health among adolescents, especially those with visible skin lesions including acne. However, little is known about social media's impact on acne treatment. The purpose of the study sought to characterize the influence of social media use on acne treatment. METHODS We conducted a cross-sectional survey of West Virginia University ambulatory patients whose chief complaint was acne was conducted. The survey collected sociodemographics and queried whether individuals accessed social media for acne treatment advice or not, whether changes to acne care were made based on social media, and whether these changes aligned with the American Academy of Dermatology (AAD) clinical guidelines for acne management. RESULTS Of 130 respondents, 45% consulted social media for acne treatment advice (54% of women vs 31% of men). 41% of adolescents and 51% of adults consulted social media. The most used platforms were YouTube and Instagram (58% each). Social media users often tried an OTC treatment (81%) or dietary modification (40%). However, only 31% of participants consulting social media made changes fully aligned with AAD clinical guidelines. CONCLUSIONS Social media-influenced acne treatment advice is prevalent, especially among women, adolescents, and young adults. This treatment advice frequently does not align with AAD guidelines, with notably 40% of respondents choosing dietary modification for acne management. These results suggest that dermatologists should inquire about social media acne treatment advice and directly address misinformation.
Collapse
Affiliation(s)
- Ahmed Yousaf
- Department of Dermatology, West Virginia University, Morgantown, WV, USA
| | - Rachael Hagen
- Department of Dermatology, West Virginia University, Morgantown, WV, USA.,West Virginia School of Osteopathic Medicine, Lewisburg, WV, USA
| | - Emily Delaney
- Department of Dermatology, West Virginia University, Morgantown, WV, USA
| | - Stephen Davis
- Department of Health Policy, Management & Leadership, West Virginia University, Morgantown, WV, USA.,Department of Emergency Medicine, West Virginia University, Morgantown, WV, USA
| | - Zachary Zinn
- Department of Dermatology, West Virginia University, Morgantown, WV, USA
| |
Collapse
|
37
|
Du C, Lee W, Moskowitz D, Lucioni A, Kobashi KC, Lee UJ. I leaked, then I Reddit: experiences and insight shared on urinary incontinence by Reddit users. Int Urogynecol J 2019; 31:243-248. [DOI: 10.1007/s00192-019-04165-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 10/21/2019] [Indexed: 10/25/2022]
|