Chanudom I, Tharavichitkul E, Laosiritaworn W. Prediction of Cervical Cancer Patients' Survival Period with Machine Learning Techniques.
Healthc Inform Res 2024;
30:60-72. [PMID:
38359850 PMCID:
PMC10879821 DOI:
10.4258/hir.2024.30.1.60]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 01/02/2024] [Accepted: 01/13/2024] [Indexed: 02/17/2024] Open
Abstract
OBJECTIVES
The objective of this research is to apply machine learning (ML) algorithms to predict the survival of cervical cancer patients. The aim was to address the limitations of traditional statistical methods, which often fail to provide accurate answers due to the complexity of the problem.
METHODS
This research employed visualization techniques for initial data understanding. Subsequently, ML algorithms were used to develop both classification and regression models for survival prediction. In the classification models, we trained the algorithms to predict the time interval between the initial diagnosis and the patient's death. The intervals were categorized as "<6 months," "6 months to 3 years," "3 years to 5 years," and ">5 years." The regression model aimed to predict survival time (in months). We used attribute weights to gain insights into the model, highlighting features with a significant impact on predictions and offering valuable insights into the model's behavior and decision-making process.
RESULTS
The gradient boosting trees algorithm achieved an 81.55% accuracy in the classification model, while the random forest algorithm excelled in the regression model, with a root mean square error of 22.432. Notably, radiation doses around the affected areas significantly influenced survival duration.
CONCLUSIONS
Machine learning demonstrated the ability to provide high-accuracy predictions of survival periods in both classification and regression problems. This suggests its potential use as a decision-support tool in the process of treatment planning and resource allocation for each patient.
Collapse