Shah AM, Lee KY, Hidayat A, Falchook A, Muhammad W. A text analytics approach for mining public discussions in online cancer forum: Analysis of multi-intent lung cancer treatment dataset.
Int J Med Inform 2024; 184:105375. [PMID: 38367390 DOI: 10.1016/j.ijmedinf.2024.105375]
[Received: 06/05/2023] [Revised: 01/25/2024] [Accepted: 02/07/2024] [Indexed: 02/19/2024]
Abstract
BACKGROUND
Online cancer forums (OCFs) are increasingly popular platforms where patients and caregivers discuss, seek information on, and share opinions about diseases and treatments. These interactions generate a substantial amount of unstructured text data that warrants deeper exploration. Using time series data, our study applies topic modeling to OCFs, a novel domain for the technique, to identify meaningful topics and the changing dynamics of online discussion across different lung cancer treatment intent groups.
METHODS
For this purpose, a dataset comprising 27,998 forum posts about lung cancer was collected from three OCFs: lungcancer.net, lungevity.org, and reddit.com, spanning the years 2016 to 2018.
RESULTS
The analysis reflects the public discussion on multi-intent lung cancer treatment over time, taking into account seasonal variations. Discussions on cancer symptoms and prevention garnered the most attention, dominating both curative and palliative care discussions. There were distinct seasonal peaks: curative care topics surged from winter to late spring, while palliative care topics peaked from late summer to mid-autumn. Keyword analysis highlighted that lung cancer diagnosis and treatment were primary topics, whereas cancer prevention and treatment outcomes were predominant across multi-care contexts. For the study period, curative care discussions predominantly revolved around informational support and disease syndromes. In contrast, social support and cancer prevention prevailed in the palliative care context. Notably, topics such as cancer screening and cancer treatment exhibited pronounced seasonal variations in curative care, peaking in frequency during the summers (May to August) of the study period. Meanwhile, the topic of tumor control within palliative care showed significant seasonal influence during the winters and summers of 2017 and 2018.
CONCLUSION
Our text analysis approach using OCF data shows the potential of computational methods in this novel domain to yield insights into trends in public cancer communication and its seasonal variations, which may help improve personalized care, decision support, treatment outcomes, and quality of life.