1
|
Zeinali N, Youn N, Albashayreh A, Fan W, Gilbertson White S. Machine Learning Approaches to Predict Symptoms in People With Cancer: Systematic Review. JMIR Cancer 2024; 10:e52322. [PMID: 38502171 PMCID: PMC10988375 DOI: 10.2196/52322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 01/18/2024] [Accepted: 01/19/2024] [Indexed: 03/20/2024] Open
Abstract
BACKGROUND People with cancer frequently experience severe and distressing symptoms associated with cancer and its treatments. Predicting symptoms in patients with cancer continues to be a significant challenge for both clinicians and researchers. The rapid evolution of machine learning (ML) highlights the need for a current systematic review to improve cancer symptom prediction. OBJECTIVE This systematic review aims to synthesize the literature that has used ML algorithms to predict the development of cancer symptoms and to identify the predictors of these symptoms. This is essential for integrating new developments and identifying gaps in existing literature. METHODS We conducted this systematic review in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist. We conducted a systematic search of CINAHL, Embase, and PubMed for English records published from 1984 to August 11, 2023, using the following search terms: cancer, neoplasm, specific symptoms, neural networks, machine learning, specific algorithm names, and deep learning. All records that met the eligibility criteria were individually reviewed by 2 coauthors, and key findings were extracted and synthesized. We focused on studies using ML algorithms to predict cancer symptoms, excluding nonhuman research, technical reports, reviews, book chapters, conference proceedings, and inaccessible full texts. RESULTS A total of 42 studies were included, the majority of which were published after 2017. Most studies were conducted in North America (18/42, 43%) and Asia (16/42, 38%). The sample sizes in most studies (27/42, 64%) typically ranged from 100 to 1000 participants. The most prevalent category of algorithms was supervised ML, accounting for 39 (93%) of the 42 studies. Each of the methods-deep learning, ensemble classifiers, and unsupervised ML-constituted 3 (3%) of the 42 studies. The ML algorithms with the best performance were logistic regression (9/42, 17%), random forest (7/42, 13%), artificial neural networks (5/42, 9%), and decision trees (5/42, 9%). The most commonly included primary cancer sites were the head and neck (9/42, 22%) and breast (8/42, 19%), with 17 (41%) of the 42 studies not specifying the site. The most frequently studied symptoms were xerostomia (9/42, 14%), depression (8/42, 13%), pain (8/42, 13%), and fatigue (6/42, 10%). The significant predictors were age, gender, treatment type, treatment number, cancer site, cancer stage, chemotherapy, radiotherapy, chronic diseases, comorbidities, physical factors, and psychological factors. CONCLUSIONS This review outlines the algorithms used for predicting symptoms in individuals with cancer. Given the diversity of symptoms people with cancer experience, analytic approaches that can handle complex and nonlinear relationships are critical. This knowledge can pave the way for crafting algorithms tailored to a specific symptom. In addition, to improve prediction precision, future research should compare cutting-edge ML strategies such as deep learning and ensemble methods with traditional statistical models.
Collapse
Affiliation(s)
- Nahid Zeinali
- Department of Computer Science and Informatics, University of Iowa, Iowa City, IA, United States
| | - Nayung Youn
- College of Nursing, University of Iowa, Iowa City, IA, United States
| | - Alaa Albashayreh
- College of Nursing, University of Iowa, Iowa City, IA, United States
| | - Weiguo Fan
- Department of Business Analytics, University of Iowa, Iowa City, IA, United States
| | | |
Collapse
|
2
|
Choe YH, Lee S, Lim Y, Kim SH. Machine learning-derived model for predicting poor post-treatment quality of life in Korean cancer survivors. Support Care Cancer 2024; 32:143. [PMID: 38315224 DOI: 10.1007/s00520-024-08347-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Accepted: 01/25/2024] [Indexed: 02/07/2024]
Abstract
PURPOSE A substantial number of cancer survivors have poor quality of life (QOL) even after completing cancer treatment. Thus, in this study, we used machine learning (ML) to develop predictive models for poor QOL in post-treatment cancer survivors in South Korea. METHODS This cross-sectional study used online survey data from 1,005 post-treatment cancer survivors in South Korea. The outcome variable was QOL, which was measured using the global QOL subscale of the European Organization of Cancer and Treatment for Cancer Quality of Life Questionnaire, where a global QOL score < 60.4 was defined as poor QOL. Three ML models (random forest (RF), support vector machine, and extreme gradient boosting) and three deep learning models were used to develop predictive models for poor QOL. Model performance regarding accuracy, area under the receiver operating characteristic curve, F1 score, precision, and recall was evaluated. The SHapely Additive exPlanation (SHAP) method was used to identify important features. RESULTS Of the 1,005 participants, 65.1% had poor QOL. Among the six models, the RF model had the best performance (accuracy = 0.85, F1 = 0.90). The SHAP method revealed that survivorship concerns (e.g., distress, pain, and fatigue) were the most important factors that affected poor QOL. CONCLUSIONS The ML-based prediction model developed to predict poor QOL in Korean post-treatment cancer survivors showed good accuracy. The ML model proposed in this study can be used to support clinical decision-making in identifying survivors at risk of poor QOL.
Collapse
Affiliation(s)
- Yu Hyeon Choe
- Department of Nursing, Inha University, Incheon, Republic of Korea
| | - Sujee Lee
- Department of Industrial and Information Systems Engineering, Soongsil University, Seoul, Republic of Korea
| | - Yooseok Lim
- Department of Industrial and Information Systems Engineering, Soongsil University, Seoul, Republic of Korea
| | - Soo Hyun Kim
- Department of Nursing, Inha University, Incheon, Republic of Korea.
| |
Collapse
|
3
|
Alghafees M, Seyam RM, Al-Hussain T, Amin TM, Altaweel W, Sabbah BN, Sabbah AN, Almesned R, Alessa L. Using machine learning models to predict synchronous genitourinary cancers among gastrointestinal stromal tumor patients. Urol Ann 2024; 16:94-97. [PMID: 38415235 PMCID: PMC10896329 DOI: 10.4103/ua.ua_32_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 08/24/2023] [Accepted: 10/13/2023] [Indexed: 02/29/2024] Open
Abstract
Objectives Gastrointestinal stromal tumors (GISTs) can occur synchronously with other neoplasms, including the genitourinary (GU) system. Machine learning (ML) may be a valuable tool in predicting synchronous GU tumors in GIST patients, and thus improving prognosis. This study aims to evaluate the use of ML algorithms to predict synchronous GU tumors among GIST patients in a specialist research center in Saudi Arabia. Materials and Methods We analyzed data from all patients with histopathologically confirmed GIST at our facility from 2003 to 2020. Patient files were reviewed for the presence of renal cell carcinoma, adrenal tumors, or other GU cancers. Three supervised ML algorithms were used: logistic regression, XGBoost Regressor, and random forests (RFs). A set of variables, including independent attributes, was entered into the models. Results A total of 170 patients were included in the study, with 58.8% (n = 100) being male. The median age was 57 (range: 9-91) years. The majority of GISTs were gastric (60%, n = 102) with a spindle cell histology. The most common stage at diagnosis was T2 (27.6%, n = 47) and N0 (20%, n = 34). Six patients (3.5%) had synchronous GU tumors. The RF model achieved the highest accuracy with 97.1%. Conclusion Our study suggests that the RF model is an effective tool for predicting synchronous GU tumors in GIST patients. Larger multicenter studies, utilizing more powerful algorithms such as deep learning and other artificial intelligence subsets, are necessary to further refine and improve these predictions.
Collapse
Affiliation(s)
- Mohammad Alghafees
- Department of Urology, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Raouf M Seyam
- Department of Urology, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Turki Al-Hussain
- Department of Pathology and Laboratory Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Tarek Mahmoud Amin
- Department of Surgical Oncology, Oncology Center, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Waleed Altaweel
- Department of Urology, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | | | | | - Razan Almesned
- Department of Urology, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Laila Alessa
- Department of Urology, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| |
Collapse
|
4
|
Song W, Wu F, Yan Y, Li Y, Wang Q, Hu X, Li Y. Gut microbiota landscape and potential biomarker identification in female patients with systemic lupus erythematosus using machine learning. Front Cell Infect Microbiol 2023; 13:1289124. [PMID: 38169617 PMCID: PMC10758415 DOI: 10.3389/fcimb.2023.1289124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/28/2023] [Indexed: 01/05/2024] Open
Abstract
Objectives Systemic Lupus Erythematosus (SLE) is a complex autoimmune disease that disproportionately affects women. Early diagnosis and prevention are crucial for women's health, and the gut microbiota has been found to be strongly associated with SLE. This study aimed to identify potential biomarkers for SLE by characterizing the gut microbiota landscape using feature selection and exploring the use of machine learning (ML) algorithms with significantly dysregulated microbiotas (SDMs) for early identification of SLE patients. Additionally, we used the SHapley Additive exPlanations (SHAP) interpretability framework to visualize the impact of SDMs on the risk of developing SLE in females. Methods Stool samples were collected from 54 SLE patients and 55 Negative Controls (NC) for microbiota analysis using 16S rRNA sequencing. Feature selection was performed using Elastic Net and Boruta on species-level taxonomy. Subsequently, four ML algorithms, namely logistic regression (LR), Adaptive Boosting (AdaBoost), Random Forest (RF), and eXtreme gradient boosting (XGBoost), were used to achieve early identification of SLE with SDMs. Finally, the best-performing algorithm was combined with SHAP to explore how SDMs affect the risk of developing SLE in females. Results Both alpha and beta diversity were found to be different in SLE group. Following feature selection, 68 and 21 microbiota were retained in Elastic Net and Boruta, respectively, with 16 microbiota overlapping between the two, i.e., SDMs for SLE. The four ML algorithms with SDMs could effectively identify SLE patients, with XGBoost performing the best, achieving Accuracy, Sensitivity, Specificity, Positive Predictive Value, Negative Predictive Value, and AUC values of 0.844, 0.750, 0.938, 0.923, 0.790, and 0.930, respectively. The SHAP interpretability framework showed a complex non-linear relationship between the relative abundance of SDMs and the risk of SLE, with Escherichia_fergusonii having the largest SHAP value. Conclusions This study revealed dysbiosis in the gut microbiota of female SLE patients. ML classifiers combined with SDMs can facilitate early identification of female patients with SLE, particularly XGBoost. The SHAP interpretability framework provides insight into the impact of SDMs on the risk of SLE and may inform future scientific treatment for SLE.
Collapse
Affiliation(s)
- Wenzhu Song
- School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Feng Wu
- Department of Nephrology, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
| | - Yan Yan
- Department of Nephrology, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
| | - Yaheng Li
- Shanxi Provincial Key Laboratory of Kidney Disease, Taiyuan, Shanxiuan, China
| | - Qian Wang
- Shanxi Provincial Key Laboratory of Kidney Disease, Taiyuan, Shanxiuan, China
| | - Xueli Hu
- Department of Nephrology, Hejin People’s Hospital, Yuncheng, Shanxi, China
| | - Yafeng Li
- Department of Nephrology, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
- Shanxi Provincial Key Laboratory of Kidney Disease, Taiyuan, Shanxiuan, China
- Core Laboratory, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
- Academy of Microbial Ecology, Shanxi Medical University, Taiyuan, China
| |
Collapse
|
5
|
Jin Y, Lan A, Dai Y, Jiang L, Liu S. Development and testing of a random forest-based machine learning model for predicting events among breast cancer patients with a poor response to neoadjuvant chemotherapy. Eur J Med Res 2023; 28:394. [PMID: 37777809 PMCID: PMC10543332 DOI: 10.1186/s40001-023-01361-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 09/11/2023] [Indexed: 10/02/2023] Open
Abstract
BACKGROUND Breast cancer (BC) is the most common malignant tumor around the world. Timely detection of the tumor progression after treatment could improve the survival outcome of patients. This study aimed to develop machine learning models to predict events (defined as either (1) the first tumor relapse locally, regionally, or distantly; (2) a diagnosis of secondary malignant tumor; or (3) death because of any reason.) in BC patients post-treatment. METHODS The patients with the response of stable disease (SD) and progressive disease (PD) after neoadjuvant chemotherapy (NAC) were selected. The clinicopathological features and the survival data were recorded in 1 year and 5 years, respectively. Patients were randomly divided into the training set and test set in the ratio of 8:2. A random forest (RF) and a logistic regression were established in both of 1-year cohort and the 5-year cohort. The performance was compared between the two models. The models were validated using data from the Surveillance, Epidemiology, and End Results (SEER) database. RESULTS A total of 315 patients were included. In the 1-year cohort, 197 patients were divided into a training set while 87 were into a test set. The specificity, sensitivity, and AUC were 0.800, 0.833, and 0.810 in the RF model. And 0.520, 0.833, and 0.653 of the logistic regression. In the 5-year cohort, 132 patients were divided into the training set while 33 were into the test set. The specificity, sensitivity, and AUC were 0.882, 0.750, and 0.829 in the RF model. And 0.882, 0.688, and 0.752 of the logistic regression. In the external validation set, of the RF model, the specificity, sensitivity, and AUC were 0.765, 0.812, and 0.779. Of the logistics regression model, the specificity, sensitivity, and AUC were 0.833, 0.376, and 0.619. CONCLUSION The RF model has a good performance in predicting events among BC patients with SD and PD post-NAC. It may be beneficial to BC patients, assisting in detecting tumor recurrence.
Collapse
Affiliation(s)
- Yudi Jin
- Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
- Department of Pathology, Chongqing Key Laboratory for Intelligent Oncology in Breast Cancer (iCQBC), Chongqing University Cancer Hospital, Chongqing, 400030, China
| | - Ailin Lan
- Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
| | - Yuran Dai
- Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
| | - Linshan Jiang
- Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
| | - Shengchun Liu
- Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China.
| |
Collapse
|
6
|
Toro-Tobon D, Loor-Torres R, Duran M, Fan JW, Singh Ospina N, Wu Y, Brito JP. Artificial Intelligence in Thyroidology: A Narrative Review of the Current Applications, Associated Challenges, and Future Directions. Thyroid 2023; 33:903-917. [PMID: 37279303 PMCID: PMC10440669 DOI: 10.1089/thy.2023.0132] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Background: The use of artificial intelligence (AI) in health care has grown exponentially with the promise of facilitating biomedical research and enhancing diagnosis, treatment, monitoring, disease prevention, and health care delivery. We aim to examine the current state, limitations, and future directions of AI in thyroidology. Summary: AI has been explored in thyroidology since the 1990s, and currently, there is an increasing interest in applying AI to improve the care of patients with thyroid nodules (TNODs), thyroid cancer, and functional or autoimmune thyroid disease. These applications aim to automate processes, improve the accuracy and consistency of diagnosis, personalize treatment, decrease the burden for health care professionals, improve access to specialized care in areas lacking expertise, deepen the understanding of subtle pathophysiologic patterns, and accelerate the learning curve of less experienced clinicians. There are promising results for many of these applications. Yet, most are in the validation or early clinical evaluation stages. Only a few are currently adopted for risk stratification of TNODs by ultrasound and determination of the malignant nature of indeterminate TNODs by molecular testing. Challenges of the currently available AI applications include the lack of prospective and multicenter validations and utility studies, small and low diversity of training data sets, differences in data sources, lack of explainability, unclear clinical impact, inadequate stakeholder engagement, and inability to use outside of the research setting, which might limit the value of their future adoption. Conclusions: AI has the potential to improve many aspects of thyroidology; however, addressing the limitations affecting the suitability of AI interventions in thyroidology is a prerequisite to ensure that AI provides added value for patients with thyroid disease.
Collapse
Affiliation(s)
- David Toro-Tobon
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Ricardo Loor-Torres
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Mayra Duran
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Jungwei W. Fan
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota, USA
| | - Naykky Singh Ospina
- Division of Endocrinology, Department of Medicine, University of Florida, Gainesville, Florida, USA
| | - Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Juan P. Brito
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| |
Collapse
|
7
|
Nuutinen M, Hiltunen AM, Korhonen S, Haavisto I, Poikonen-Saksela P, Mattson J, Manikis G, Kondylakis H, Simos P, Mazzocco K, Pat-Horenczyk R, Sousa B, Cardoso F, Manica I, Kudel I, Leskelä RL. Aid of a machine learning algorithm can improve clinician predictions of patient quality of life during breast cancer treatments. HEALTH AND TECHNOLOGY 2023. [DOI: 10.1007/s12553-023-00733-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
|
8
|
Pan LC, Wu XR, Lu Y, Zhang HQ, Zhou YL, Liu X, Liu SL, Yan QY. Artificial intelligence empowered Digital Health Technologies in Cancer Survivorship Care: a scoping review. Asia Pac J Oncol Nurs 2022; 9:100127. [PMID: 36176267 PMCID: PMC9513729 DOI: 10.1016/j.apjon.2022.100127] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 07/29/2022] [Indexed: 12/03/2022] Open
Abstract
Objective The objectives of this systematic review are to describe features and specific application scenarios for current cancer survivorship care services of Artificial intelligence (AI)-driven digital health technologies (DHTs) and to explore the acceptance and briefly evaluate its feasibility in the application process. Methods Search for literatures published from 2010 to 2022 on sites MEDLINE, IEEE-Xplor, PubMed, Embase, Cochrane Central Register of Controlled Trials and Scopus systematically. The types of literatures include original research, descriptive study, randomized controlled trial, pilot study, and feasible or acceptable study. The literatures above described current status and effectiveness of digital medical technologies based on AI and used in cancer survivorship care services. Additionally, we use QuADS quality assessment tool to evaluate the quality of literatures included in this review. Results 43 studies that met the inclusion criteria were analyzed and qualitatively synthesized. The current status and results related to the application of AI-driven DHTs in cancer survivorship care were reviewed. Most of these studies were designed specifically for breast cancer survivors’ care and focused on the areas of recurrence or secondary cancer prediction, clinical decision support, cancer survivability prediction, population or treatment stratified, anti-cancer treatment-induced adverse reaction prediction, and so on. Applying AI-based DHTs to cancer survivors actually has shown some positive outcomes, including increased motivation of patient-reported outcomes (PROs), reduce fatigue and pain levels, improved quality of life, and physical function. However, current research mostly explored the technology development and formation (testing) phases, with limited-scale population, and single-center trial. Therefore, it is not suitable to draw conclusions that the effectiveness of AI-based DHTs in supportive cancer care, as most of applications are still in the early stage of development and feasibility testing. Conclusions While digital therapies are promising in the care of cancer patients, more high-quality studies are still needed in the future to demonstrate the effectiveness of digital therapies in cancer care. Studies should explore how to develop uniform standards for measuring patient-related outcomes, ensure the scientific validity of research methods, and emphasize patient and health practitioner involvement in the development and use of technology.
Collapse
Affiliation(s)
- Lu-Chen Pan
- Department of Nursing, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Xiao-Ru Wu
- School of Nursing, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Ying Lu
- Department of Nursing, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
- School of Nursing, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Han-Qing Zhang
- Health Science Center, Yangtze University, Jinzhou 434023, China
| | - Yao-Ling Zhou
- Department of Nursing, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
- School of Nursing, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Xue Liu
- School of Nursing, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Sheng-Lin Liu
- Department of Medical Engineering, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
- Corresponding authors.
| | - Qiao-Yuan Yan
- Department of Nursing, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
- Corresponding authors.
| |
Collapse
|