1
|
Sadeghi H, Seif F, Farahani EH, Khanmohammadi S, Nahidinezhad S. Utilizing patient data: A tutorial on predicting second cancer with machine learning models. Cancer Med 2024; 13:e70231. [PMID: 39300964 PMCID: PMC11413496 DOI: 10.1002/cam4.70231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 08/28/2024] [Accepted: 09/03/2024] [Indexed: 09/22/2024] Open
Abstract
BACKGROUND The article explores the potential risk of secondary cancer (SC) due to radiation therapy (RT) and highlights the necessity for new modeling techniques to mitigate this risk. METHODS By employing machine learning (ML) models, specifically decision trees, in the research process, a practical framework is established for forecasting the occurrence of SC using patient data. RESULTS & DISCUSSION This framework aids in categorizing patients into high-risk or low-risk groups, thereby enabling personalized treatment plans and interventions. The paper also underscores the many factors that contribute to the likelihood of SC, such as radiation dosage, patient age, and genetic predisposition, while emphasizing the limitations of current models in encompassing all relevant parameters. These limitations arise from the non-linear dependencies between variables and the failure to consider factors such as genetics, hormones, lifestyle, radiation from secondary particles, and imaging dosage. To instruct and assess ML models for predicting the occurrence of SC based on patient data, the paper utilizes a dataset consisting of instances and attributes. CONCLUSION The practical implications of this research lie in enhancing our understanding and prediction of SC following RT, facilitating personalized treatment approaches, and establishing a framework for leveraging patient data within the realm of ML models.
Collapse
Affiliation(s)
- Hossein Sadeghi
- Department of Physics, Faculty of SciencesArak UniversityArakIran
| | - Fatemeh Seif
- Department of Radiotherapy and Medical PhysicsArak University of Medical Sciences & Khansari HospitalArakIran
| | | | | | | |
Collapse
|
2
|
Zabihiyeganeh M, Mirzaei A, Tabrizian P, Rezaee A, Sheikhtaheri A, Kadijani AA, Kadijani BA, Sharifi Kia A. Prediction of subsequent fragility fractures: application of machine learning. BMC Musculoskelet Disord 2024; 25:438. [PMID: 38834975 DOI: 10.1186/s12891-024-07559-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 05/29/2024] [Indexed: 06/06/2024] Open
Abstract
BACKGROUND Machine learning (ML) has shown exceptional promise in various domains of medical research. However, its application in predicting subsequent fragility fractures is still largely unknown. In this study, we aim to evaluate the predictive power of different ML algorithms in this area and identify key features associated with the risk of subsequent fragility fractures in osteoporotic patients. METHODS We retrospectively analyzed data from patients presented with fragility fractures at our Fracture Liaison Service, categorizing them into index fragility fracture (n = 905) and subsequent fragility fracture groups (n = 195). We independently trained ML models using 27 features for both male and female cohorts. The algorithms tested include Random Forest, XGBoost, CatBoost, Logistic Regression, LightGBM, AdaBoost, Multi-Layer Perceptron, and Support Vector Machine. Model performance was evaluated through 10-fold cross-validation. RESULTS The CatBoost model outperformed other models, achieving 87% accuracy and an AUC of 0.951 for females, and 93.4% accuracy with an AUC of 0.990 for males. The most significant predictors for females included age, serum C-reactive protein (CRP), 25(OH)D, creatinine, blood urea nitrogen (BUN), parathyroid hormone (PTH), femoral neck Z-score, menopause age, number of pregnancies, phosphorus, calcium, and body mass index (BMI); for males, the predictors were serum CRP, femoral neck T-score, PTH, hip T-score, BMI, BUN, creatinine, alkaline phosphatase, and spinal Z-score. CONCLUSION ML models, especially CatBoost, offer a valuable approach for predicting subsequent fragility fractures in osteoporotic patients. These models hold the potential to enhance clinical decision-making by supporting the development of personalized preventative strategies.
Collapse
Affiliation(s)
- Mozhdeh Zabihiyeganeh
- Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran
| | - Alireza Mirzaei
- Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran
- Department of Orthopaedic Surgery, University of Minnesota, Minneapolis, MN, USA
| | - Pouria Tabrizian
- Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran
| | - Aryan Rezaee
- Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran
- Student Research Committee, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Abbas Sheikhtaheri
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Azade Amini Kadijani
- Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran
| | - Bahare Amini Kadijani
- Department of Medical Physics, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Ali Sharifi Kia
- Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran.
- Department of Computer Science, Faculty of Science, Western University, London, ON, Canada.
| |
Collapse
|
3
|
Seth I, Lim B, Joseph K, Gracias D, Xie Y, Ross RJ, Rozen WM. Use of artificial intelligence in breast surgery: a narrative review. Gland Surg 2024; 13:395-411. [PMID: 38601286 PMCID: PMC11002485 DOI: 10.21037/gs-23-414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 02/21/2024] [Indexed: 04/12/2024]
Abstract
Background and Objective We have witnessed tremendous advances in artificial intelligence (AI) technologies. Breast surgery, a subspecialty of general surgery, has notably benefited from AI technologies. This review aims to evaluate how AI has been integrated into breast surgery practices, to assess its effectiveness in improving surgical outcomes and operational efficiency, and to identify potential areas for future research and application. Methods Two authors independently conducted a comprehensive search of PubMed, Google Scholar, EMBASE, and Cochrane CENTRAL databases from January 1, 1950, to September 4, 2023, employing keywords pertinent to AI in conjunction with breast surgery or cancer. The search focused on English language publications, where relevance was determined through meticulous screening of titles, abstracts, and full-texts, followed by an additional review of references within these articles. The review covered a range of studies illustrating the applications of AI in breast surgery encompassing lesion diagnosis to postoperative follow-up. Publications focusing specifically on breast reconstruction were excluded. Key Content and Findings AI models have preoperative, intraoperative, and postoperative applications in the field of breast surgery. Using breast imaging scans and patient data, AI models have been designed to predict the risk of breast cancer and determine the need for breast cancer surgery. In addition, using breast imaging scans and histopathological slides, models were used for detecting, classifying, segmenting, grading, and staging breast tumors. Preoperative applications included patient education and the display of expected aesthetic outcomes. Models were also designed to provide intraoperative assistance for precise tumor resection and margin status assessment. As well, AI was used to predict postoperative complications, survival, and cancer recurrence. Conclusions Extra research is required to move AI models from the experimental stage to actual implementation in healthcare. With the rapid evolution of AI, further applications are expected in the coming years including direct performance of breast surgery. Breast surgeons should be updated with the advances in AI applications in breast surgery to provide the best care for their patients.
Collapse
Affiliation(s)
- Ishith Seth
- Department of Plastic Surgery, Peninsula Health, Melbourne, Victoria, Australia
- Central Clinical School at Monash University, The Alfred Centre, Melbourne, Victoria, Australia
| | - Bryan Lim
- Department of Plastic Surgery, Peninsula Health, Melbourne, Victoria, Australia
- Central Clinical School at Monash University, The Alfred Centre, Melbourne, Victoria, Australia
| | - Konrad Joseph
- Department of Surgery, Port Macquarie Base Hospital, New South Wales, Australia
| | - Dylan Gracias
- Department of Surgery, Townsville Hospital, Queensland, Australia
| | - Yi Xie
- Department of Plastic Surgery, Peninsula Health, Melbourne, Victoria, Australia
| | - Richard J. Ross
- Department of Plastic Surgery, Peninsula Health, Melbourne, Victoria, Australia
- Central Clinical School at Monash University, The Alfred Centre, Melbourne, Victoria, Australia
| | - Warren M. Rozen
- Department of Plastic Surgery, Peninsula Health, Melbourne, Victoria, Australia
- Central Clinical School at Monash University, The Alfred Centre, Melbourne, Victoria, Australia
| |
Collapse
|
4
|
Cheng K, Wang J, Liu J, Zhang X, Shen Y, Su H. Public health implications of computer-aided diagnosis and treatment technologies in breast cancer care. AIMS Public Health 2023; 10:867-895. [PMID: 38187901 PMCID: PMC10764974 DOI: 10.3934/publichealth.2023057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 10/10/2023] [Indexed: 01/09/2024] Open
Abstract
Breast cancer remains a significant public health issue, being a leading cause of cancer-related mortality among women globally. Timely diagnosis and efficient treatment are crucial for enhancing patient outcomes, reducing healthcare burdens and advancing community health. This systematic review, following the PRISMA guidelines, aims to comprehensively synthesize the recent advancements in computer-aided diagnosis and treatment for breast cancer. The study covers the latest developments in image analysis and processing, machine learning and deep learning algorithms, multimodal fusion techniques and radiation therapy planning and simulation. The results of the review suggest that machine learning, augmented and virtual reality and data mining are the three major research hotspots in breast cancer management. Moreover, this paper discusses the challenges and opportunities for future research in this field. The conclusion highlights the importance of computer-aided techniques in the management of breast cancer and summarizes the key findings of the review.
Collapse
Affiliation(s)
- Kai Cheng
- Yantai Affiliated Hospital of Binzhou Medical University, Yantai, 264100, China
| | - Jiangtao Wang
- Yantai Affiliated Hospital of Binzhou Medical University, Yantai, 264100, China
| | - Jian Liu
- Yantai Affiliated Hospital of Binzhou Medical University, Yantai, 264100, China
| | - Xiangsheng Zhang
- Yantai Affiliated Hospital of Binzhou Medical University, Yantai, 264100, China
| | - Yuanyuan Shen
- Yantai Affiliated Hospital of Binzhou Medical University, Yantai, 264100, China
| | - Hang Su
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
| |
Collapse
|
5
|
Syleouni ME, Karavasiloglou N, Manduchi L, Wanner M, Korol D, Ortelli L, Bordoni A, Rohrmann S. Predicting second breast cancer among women with primary breast cancer using machine learning algorithms, a population-based observational study. Int J Cancer 2023. [PMID: 37243372 DOI: 10.1002/ijc.34568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 04/26/2023] [Accepted: 05/05/2023] [Indexed: 05/28/2023]
Abstract
Breast cancer survivors often experience recurrence or a second primary cancer. We developed an automated approach to predict the occurrence of any second breast cancer (SBC) using patient-level data and explored the generalizability of the models with an external validation data source. Breast cancer patients from the cancer registry of Zurich, Zug, Schaffhausen, Schwyz (N = 3213; training dataset) and the cancer registry of Ticino (N = 1073; external validation dataset), diagnosed between 2010 and 2018, were used for model training and validation, respectively. Machine learning (ML) methods, namely a feed-forward neural network (ANN), logistic regression, and extreme gradient boosting (XGB) were employed for classification. The best-performing model was selected based on the receiver operating characteristic (ROC) curve. Key characteristics contributing to a high SBC risk were identified. SBC was diagnosed in 6% of all cases. The most important features for SBC prediction were age at incidence, year of birth, stage, and extent of the pathological primary tumor. The ANN model had the highest area under the ROC curve with 0.78 (95% confidence interval [CI] 0.750.82) in the training data and 0.70 (95% CI 0.61-0.79) in the external validation data. Investigating the generalizability of different ML algorithms, we found that the ANN generalized better than the other models on the external validation data. This research is a first step towards the development of an automated tool that could assist clinicians in the identification of women at high risk of developing an SBC and potentially preventing it.
Collapse
Affiliation(s)
- Maria-Eleni Syleouni
- Division of Chronic Disease Epidemiology, Epidemiology Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland
- Cancer Registry Zurich, Zug, Schaffhausen and Schwyz, University Hospital Zurich, Zurich, Switzerland
| | - Nena Karavasiloglou
- Division of Chronic Disease Epidemiology, Epidemiology Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland
- European Food Safety Authority, Parma, Italy
| | | | - Miriam Wanner
- Cancer Registry Zurich, Zug, Schaffhausen and Schwyz, University Hospital Zurich, Zurich, Switzerland
| | - Dimitri Korol
- Cancer Registry Zurich, Zug, Schaffhausen and Schwyz, University Hospital Zurich, Zurich, Switzerland
| | - Laura Ortelli
- Ticino Cancer Registry, Public Health Division of Canton Ticino, Locarno, Switzerland
| | - Andrea Bordoni
- Ticino Cancer Registry, Public Health Division of Canton Ticino, Locarno, Switzerland
| | - Sabine Rohrmann
- Division of Chronic Disease Epidemiology, Epidemiology Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland
- Cancer Registry Zurich, Zug, Schaffhausen and Schwyz, University Hospital Zurich, Zurich, Switzerland
| |
Collapse
|
6
|
Xie F, Wang ZH, Wu SS, Gang TR, Gao GX, Qu X, Zhang ZT. Comparing outcomes of single-port insufflation endoscopic breast-conserving surgery and conventional open approach for breast cancer. World J Surg Oncol 2022; 20:335. [PMID: 36203177 PMCID: PMC9535932 DOI: 10.1186/s12957-022-02798-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 09/24/2022] [Indexed: 12/09/2022] Open
Abstract
Background In the surgical treatment of breast cancer, the goal of surgeons is to continually create and improve minimally invasive surgical techniques to increase patients’ quality of life. Currently, routine breast-conserving surgery is often performed using two obvious incisions. Here, we compare the clinical efficacy and aesthetic outcomes of a novel technique using one incision, called ‘single-port insufflation endoscopic breast-conserving surgery’ (SIE-BCS), vs. conventional breast-conserving surgery (C-BCS) in patients with early-stage breast cancer. Methods A total of 180 patients with stage I or stage II breast cancer participated in this study, of whom 63 underwent SIE-BCS and 117 underwent C-BCS. Logistic regression analysis was conducted to assess the risk of local recurrence and metastasis. Aesthetic outcomes were evaluated using the BREAST-Q scale. Results The mean operation time was significantly longer for SIE-BCS (194.9 ± 71.5 min) than for C-BCS (140.3 ± 56.9 min), but the mean incision length was significantly shorter for SIE-BCS than for C-BCS (3.4 ± 1.2 cm vs. 8.6 ± 2.3 cm). While both surgeries yielded similar BREAST-Q ratings for satisfaction with breasts and sexual well-being, SIE-BCS was associated with significantly better ratings for physical well-being (chest area) and psychological well-being. Additionally, SIE-BCS was associated with decreased rates of adverse effects of radiation. The preliminary analysis showed that SIE-BCS did not increase the risk of local recurrence or metastasis. Conclusion The novel single-port insufflation endoscopic assisted BCS technique is feasible, safe, and improves patients’ postoperative comfort and psychological well-being, as compared to the conventional technique.
Collapse
Affiliation(s)
- Fang Xie
- General Surgery Department, Beijing Friendship Hospital, Capital Medical University, 95 Yong-an Road, Xi-Cheng District, Beijing, 100050, China.,Beijing Hospital of Traditional Chinese Medicine, Capital Medical University, 23 Mei shu guan Back Street, Dong-Cheng District, Beijing, 100010, China
| | - Zi-Han Wang
- Department of Breast Disease, Peking University People's Hospital, 11 Xi zhi men South Street, Xi-Cheng District, Beijing, 100044, China
| | - Shan-Shan Wu
- Department of Clinical Epidemiology and Evidence-Based Medicine, Beijing Friendship Hospital, 95 Yong-an Road, Xi-Cheng District, Beijing, 100050, China
| | - Tian-Ran Gang
- General Surgery Department, Beijing Friendship Hospital, Capital Medical University, 95 Yong-an Road, Xi-Cheng District, Beijing, 100050, China
| | - Guo-Xuan Gao
- General Surgery Department, Beijing Friendship Hospital, Capital Medical University, 95 Yong-an Road, Xi-Cheng District, Beijing, 100050, China
| | - Xiang Qu
- General Surgery Department, Beijing Friendship Hospital, Capital Medical University, 95 Yong-an Road, Xi-Cheng District, Beijing, 100050, China.
| | - Zhong-Tao Zhang
- General Surgery Department, Beijing Friendship Hospital, Capital Medical University, 95 Yong-an Road, Xi-Cheng District, Beijing, 100050, China.
| |
Collapse
|
7
|
Mazo C, Aura C, Rahman A, Gallagher WM, Mooney C. Application of Artificial Intelligence Techniques to Predict Risk of Recurrence of Breast Cancer: A Systematic Review. J Pers Med 2022; 12:1496. [PMID: 36143281 PMCID: PMC9500690 DOI: 10.3390/jpm12091496] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 09/05/2022] [Accepted: 09/09/2022] [Indexed: 12/31/2022] Open
Abstract
Breast cancer is the most common disease among women, with over 2.1 million new diagnoses each year worldwide. About 30% of patients initially presenting with early stage disease have a recurrence of cancer within 10 years. Predicting who will have a recurrence and who will not remains challenging, with consequent implications for associated treatment. Artificial intelligence strategies that can predict the risk of recurrence of breast cancer could help breast cancer clinicians avoid ineffective overtreatment. Despite its significance, most breast cancer recurrence datasets are insufficiently large, not publicly available, or imbalanced, making these studies more difficult. This systematic review investigates the role of artificial intelligence in the prediction of breast cancer recurrence. We summarise common techniques, features, training and testing methodologies, metrics, and discuss current challenges relating to implementation in clinical practice. We systematically reviewed works published between 1 January 2011 and 1 November 2021 using the methodology of Kitchenham and Charter. We leveraged Springer, Google Scholar, PubMed, and IEEE search engines. This review found three areas that require further work. First, there is no agreement on artificial intelligence methodologies, feature predictors, or assessment metrics. Second, issues such as sampling strategies, missing data, and class imbalance problems are rarely addressed or discussed. Third, representative datasets for breast cancer recurrence are scarce, which hinders model validation and deployment. We conclude that predicting breast cancer recurrence remains an open problem despite the use of artificial intelligence.
Collapse
Affiliation(s)
- Claudia Mazo
- UCD School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Claudia Aura
- UCD School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Arman Rahman
- UCD School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - William M. Gallagher
- UCD School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Catherine Mooney
- UCD School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland
| |
Collapse
|
8
|
Yaghoobi Notash A, Yaghoobi Notash A, Omidi Z, Haghighat S. Prediction of lymphedema occurrence in patients with breast cancer using the optimized combination of ensemble learning algorithm and feature selection. BMC Med Inform Decis Mak 2022; 22:195. [PMID: 35879760 PMCID: PMC9310496 DOI: 10.1186/s12911-022-01937-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 07/15/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Breast cancer-related lymphedema is one of the most important complications that adversely affect patients' quality of life. Lymphedema can be managed if its risk factors are known and can be modified. This study aimed to select an appropriate model to predict the risk of lymphedema and determine the factors affecting lymphedema. METHOD This study was conducted on data of 970 breast cancer patients with lymphedema referred to a lymphedema clinic. This study was designed in two phases: developing an appropriate model to predict the risk of lymphedema and identifying the risk factors. The first phase included data preprocessing, optimizing feature selection for each base learner by the Genetic algorithm, optimizing the combined ensemble learning method, and estimating fitness function for evaluating an appropriate model. In the second phase, the influential variables were assessed and introduced based on the average number of variables in the output of the proposed algorithm. RESULT Once the sensitivity and accuracy of the algorithms were evaluated and compared, the Support Vector Machine algorithm showed the highest sensitivity and was found to be the superior model for predicting lymphedema. Meanwhile, the combined method had an accuracy coefficient of 91%. The extracted significant features in the proposed model were the number of lymph nodes to the number of removed lymph nodes ratio (68%), feeling of heaviness (67%), limited range of motion in the affected limb (65%), the number of the removed lymph nodes ( 64%), receiving radiotherapy (63%), misalignment of the dominant and the involved limb (62%), presence of fibrotic tissue (62%), type of surgery (62%), tingling sensation (62%), the number of the involved lymph nodes (61%), body mass index (61%), the number of chemotherapy sessions (60%), age (58%), limb injury (53%), chemotherapy regimen (53%), and occupation (50%). CONCLUSION Applying a combination of ensemble learning approach with the selected classification algorithms, feature selection, and optimization by Genetic algorithm, Lymphedema can be predicted with appropriate accuracy. Developing applications by effective variables to determine the risk of lymphedema can help lymphedema clinics choose the proper preventive and therapeutic method.
Collapse
Affiliation(s)
- Anaram Yaghoobi Notash
- The Computer Engineering Department, Rasht Branch, Islamic Azad University, Rasht, Iran
- Shariati Hospital, Tehran University of Medical Science (TUMS), Tehran, Iran
| | | | - Zahra Omidi
- Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran
| | - Shahpar Haghighat
- Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran.
| |
Collapse
|
9
|
Khair S, Dort JC, Quan ML, Cheung WY, Sauro KM, Nakoneshny SC, Popowich BL, Liu P, Wu G, Xu Y. Validated algorithms for identifying timing of second event of oropharyngeal squamous cell carcinoma using real-world data. Head Neck 2022; 44:1909-1917. [PMID: 35653151 DOI: 10.1002/hed.27109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 04/29/2022] [Accepted: 05/18/2022] [Indexed: 11/07/2022] Open
Abstract
BACKGROUND Understanding occurrence and timing of second events (recurrence and second primary cancer) is essential for cancer specific survival analysis. However, this information is not readily available in administrative data. METHODS Alberta Cancer Registry, physician claims, and other administrative data were used. Timing of second event was estimated based on our developed algorithm. For validation, the difference, in days between the algorithm estimated and the chart-reviewed timing of second event. Further, the result of Cox-regression modeling cancer-free survival was compared to chart review data. RESULTS Majority (74.3%) of the patients had a difference between the chart-reviewed and algorithm-estimated timing of second event falling within the 0-60 days window. Kaplan-Meier curves generated from the estimated data and chart review data were comparable with a 5-year second-event-free survival rate of 75.4% versus 72.5%. CONCLUSION The algorithm provided an estimated timing of second event similar to that of the chart review.
Collapse
Affiliation(s)
- Shahreen Khair
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Joseph C Dort
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada
| | - May Lynn Quan
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada.,Department of Oncology, Cumming School of Medicine, University of Calgary, Tom Baker, Cancer Centre, Calgary, Alberta, Canada
| | - Winson Y Cheung
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada
| | - Khara M Sauro
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada.,Department of Oncology, Cumming School of Medicine, University of Calgary, Tom Baker, Cancer Centre, Calgary, Alberta, Canada
| | - Steven C Nakoneshny
- The Ohlson Research Initiative, Arnie Charbonneau Cancer Institute, University of Calgary, Calgary, Alberta, Canada
| | - Brittany Lynn Popowich
- Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Teaching Research and Wellness (TRW), Calgary, Alberta, Canada
| | - Ping Liu
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Guosong Wu
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Teaching Research and Wellness (TRW), Calgary, Alberta, Canada
| | - Yuan Xu
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada.,Department of Oncology, Cumming School of Medicine, University of Calgary, Tom Baker, Cancer Centre, Calgary, Alberta, Canada.,Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Teaching Research and Wellness (TRW), Calgary, Alberta, Canada
| |
Collapse
|
10
|
Rabiei R, Ayyoubzadeh SM, Sohrabei S, Esmaeili M, Atashi A. Prediction of Breast Cancer using Machine Learning Approaches. J Biomed Phys Eng 2022; 12:297-308. [PMID: 35698545 PMCID: PMC9175124 DOI: 10.31661/jbpe.v0i0.2109-1403] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 03/05/2022] [Indexed: 05/27/2023]
Abstract
BACKGROUND Breast cancer is considered one of the most common cancers in women caused by various clinical, lifestyle, social, and economic factors. Machine learning has the potential to predict breast cancer based on features hidden in data. OBJECTIVE This study aimed to predict breast cancer using different machine-learning approaches applying demographic, laboratory, and mammographic data. MATERIAL AND METHODS In this analytical study, the database, including 5,178 independent records, 25% of which belonged to breast cancer patients with 24 attributes in each record was obtained from Motamed cancer institute (ACECR), Tehran, Iran. The database contained 5,178 independent records, 25% of which belonged to breast cancer patients containing 24 attributes in each record. The random forest (RF), neural network (MLP), gradient boosting trees (GBT), and genetic algorithms (GA) were used in this study. Models were initially trained with demographic and laboratory features (20 features). The models were then trained with all demographic, laboratory, and mammographic features (24 features) to measure the effectiveness of mammography features in predicting breast cancer. RESULTS RF presented higher performance compared to other techniques (accuracy 80%, sensitivity 95%, specificity 80%, and the area under the curve (AUC) 0.56). Gradient boosting (AUC=0.59) showed a stronger performance compared to the neural network. CONCLUSION Combining multiple risk factors in modeling for breast cancer prediction could help the early diagnosis of the disease with necessary care plans. Collection, storage, and management of different data and intelligent systems based on multiple factors for predicting breast cancer are effective in disease management.
Collapse
Affiliation(s)
- Reza Rabiei
- PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Seyed Mohammad Ayyoubzadeh
- PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Tehran University of Medical Science, Tehran, Iran
| | - Solmaz Sohrabei
- MSc, Department Deputy of Development, Management and Resources, Office of Statistic and Information Technology Management, Zanjan University of Medical Sciences, Zanjan, Iran
| | - Marzieh Esmaeili
- PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Tehran University of Medical Science, Tehran, Iran
| | - Alireza Atashi
- PhD, Department of E-Health, Virtual School, Tehran University of Medical Sciences, Medical Informatics Research Group, Clinical Research Department, Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran
| |
Collapse
|
11
|
Machine Learning Algorithms: Prediction and Feature Selection for Clinical Refracture after Surgically Treated Fragility Fracture. J Clin Med 2022; 11:jcm11072021. [PMID: 35407629 PMCID: PMC8999234 DOI: 10.3390/jcm11072021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 03/08/2022] [Accepted: 04/02/2022] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND The number of patients with fragility fracture has been increasing. Although the increasing number of patients with fragility fracture increased the rate of fracture (refracture), the causes of refracture are multifactorial, and its predictors are still not clarified. In this issue, we collected a registry-based longitudinal dataset that contained more than 7000 patients with fragility fractures treated surgically to detect potential predictors for clinical refracture. METHODS Based on the fact that machine learning algorithms are often used for the analysis of a large-scale dataset, we developed automatic prediction models and clarified the relevant features for patients with clinical refracture. Formats of input data containing perioperative clinical information were table data. Clinical refracture was documented as the primary outcome if the diagnosis of fracture was made at postoperative outpatient care. A decision-tree-based model, LightGBM, had moderate accuracy for the prediction in the test and the independent dataset, whereas the other models had poor accuracy or worse. RESULTS From a clinical perspective, rheumatoid arthritis (RA) and chronic kidney disease (CKD) were noted as the relevant features for patients with clinical refracture, both of which were associated with secondary osteoporosis. CONCLUSION The decision-tree-based algorithm showed the precise prediction of clinical refracture, in which RA and CKD were detected as the potential predictors. Understanding these predictors may improve the management of patients with fragility fractures.
Collapse
|
12
|
|
13
|
Okagbue HI, Oguntunde PE, Adamu PI, Adejumo AO. Unique clusters of patterns of breast cancer survivorship. HEALTH AND TECHNOLOGY 2022. [DOI: 10.1007/s12553-021-00637-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
14
|
Sundus KI, Hammo BH, Al-Zoubi MB, Al-Omari A. Solving the multicollinearity problem to improve the stability of machine learning algorithms applied to a fully annotated breast cancer dataset. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.101088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022] Open
|
15
|
Yang PT, Wu WS, Wu CC, Shih YN, Hsieh CH, Hsu JL. Breast cancer recurrence prediction with ensemble methods and cost-sensitive learning. Open Med (Wars) 2021; 16:754-768. [PMID: 34027105 PMCID: PMC8122465 DOI: 10.1515/med-2021-0282] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 03/11/2021] [Accepted: 04/03/2021] [Indexed: 11/15/2022] Open
Abstract
Breast cancer is one of the most common cancers in women all over the world. Due to the improvement of medical treatments, most of the breast cancer patients would be in remission. However, the patients have to face the next challenge, the recurrence of breast cancer which may cause more severe effects, and even death. The prediction of breast cancer recurrence is crucial for reducing mortality. This paper proposes a prediction model for the recurrence of breast cancer based on clinical nominal and numeric features. In this study, our data consist of 1,061 patients from Breast Cancer Registry from Shin Kong Wu Ho-Su Memorial Hospital between 2011 and 2016, in which 37 records are denoted as breast cancer recurrence. Each record has 85 features. Our approach consists of three stages. First, we perform data preprocessing and feature selection techniques to consolidate the dataset. Among all features, six features are identified for further processing in the following stages. Next, we apply resampling techniques to resolve the issue of class imbalance. Finally, we construct two classifiers, AdaBoost and cost-sensitive learning, to predict the risk of recurrence and carry out the performance evaluation in three-fold cross-validation. By applying the AdaBoost method, we achieve accuracy of 0.973 and sensitivity of 0.675. By combining the AdaBoost and cost-sensitive method of our model, we achieve a reasonable accuracy of 0.468 and substantially high sensitivity of 0.947 which guarantee almost no false dismissal. Our model can be used as a supporting tool in the setting and evaluation of the follow-up visit for early intervention and more advanced treatments to lower cancer mortality.
Collapse
Affiliation(s)
- Pei-Tse Yang
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Wen-Shuo Wu
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Chia-Chun Wu
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Yi-Nuo Shih
- Department of Occupational Therapy, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Chung-Ho Hsieh
- Department of General Surgery, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, Taiwan, Republic of China
| | - Jia-Lien Hsu
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| |
Collapse
|
16
|
Lou SJ, Hou MF, Chang HT, Chiu CC, Lee HH, Yeh SCJ, Shi HY. Machine Learning Algorithms to Predict Recurrence within 10 Years after Breast Cancer Surgery: A Prospective Cohort Study. Cancers (Basel) 2020; 12:cancers12123817. [PMID: 33348826 PMCID: PMC7765963 DOI: 10.3390/cancers12123817] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 12/11/2020] [Accepted: 12/14/2020] [Indexed: 02/07/2023] Open
Abstract
No studies have discussed machine learning algorithms to predict recurrence within 10 years after breast cancer surgery. This study purposed to compare the accuracy of forecasting models to predict recurrence within 10 years after breast cancer surgery and to identify significant predictors of recurrence. Registry data for breast cancer surgery patients were allocated to a training dataset (n = 798) for model development, a testing dataset (n = 171) for internal validation, and a validating dataset (n = 171) for external validation. Global sensitivity analysis was then performed to evaluate the significance of the selected predictors. Demographic characteristics, clinical characteristics, quality of care, and preoperative quality of life were significantly associated with recurrence within 10 years after breast cancer surgery (p < 0.05). Artificial neural networks had the highest prediction performance indices. Additionally, the surgeon volume was the best predictor of recurrence within 10 years after breast cancer surgery, followed by hospital volume and tumor stage. Accurate recurrence within 10 years prediction by machine learning algorithms may improve precision in managing patients after breast cancer surgery and improve understanding of risk factors for recurrence within 10 years after breast cancer surgery.
Collapse
Affiliation(s)
- Shi-Jer Lou
- Graduate Institute of Technological and Vocational Education, National Pingtung University of Science and Technology, Pingtung 91201, Taiwan;
| | - Ming-Feng Hou
- College of Medicine, Kaohsiung Medical University, Kaohsiung 80708, Taiwan;
- Department of Surgery, Kaohsiung Medical University Hospital, Kaohsiung 80756, Taiwan
| | - Hong-Tai Chang
- Department of Surgery, Kaohsiung Municipal United Hospital, Kaohsiung 80457, Taiwan;
| | - Chong-Chi Chiu
- Department of General Surgery, E-Da Cancer Hospital, I-Shou University, Kaohsiung 82445, Taiwan;
- School of Medicine, College of Medicine, I-Shou University, Kaohsiung 82445, Taiwan
| | - Hao-Hsien Lee
- Department of General Surgery, Chi Mei Medical Center, Liouying, Tainan 73657, Taiwan;
| | - Shu-Chuan Jennifer Yeh
- Department of Healthcare Administration and Medical Informatics, Kaohsiung Medical University, Kaohsiung 80708, Taiwan;
- Department of Business Management, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
| | - Hon-Yi Shi
- Department of Healthcare Administration and Medical Informatics, Kaohsiung Medical University, Kaohsiung 80708, Taiwan;
- Department of Business Management, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
- Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung 80708, Taiwan
- Department of Medical Research, China Medical University Hospital, China Medical University, Taichung 40402, Taiwan
- Correspondence: ; Tel.: +886-7-321-1101 (ext. 2648); Fax: +886-7-313-7487
| |
Collapse
|