1
|
Azeroual S, Ben-Bouazza FE, Naqi A, Sebihi R. Predicting disease recurrence in breast cancer patients using machine learning models with clinical and radiomic characteristics: a retrospective study. J Egypt Natl Canc Inst 2024; 36:20. [PMID: 38853190 DOI: 10.1186/s43046-024-00222-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 04/06/2024] [Indexed: 06/11/2024] Open
Abstract
BACKGROUND The goal is to use three different machine learning models to predict the recurrence of breast cancer across a very heterogeneous sample of patients with varying disease kinds and stages. METHODS A heterogeneous group of patients with varying cancer kinds and stages, including both triple-negative breast cancer (TNBC) and non-triple-negative breast cancer (non-TNBC), was examined. Three distinct models were created using the following five machine learning techniques: Adaptive Boosting (AdaBoost), Random Under-sampling Boosting (RUSBoost), Extreme Gradient Boosting (XGBoost), support vector machines (SVM), and Logistic Regression. The clinical model used both clinical and pathology data in conjunction with the machine learning algorithms. The machine learning algorithms were combined with dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) imaging characteristics in the radiomic model, and the merged model combined the two types of data. Each technique was evaluated using several criteria, including the receiver operating characteristic (ROC) curve, precision, recall, and F1 score. RESULTS The results suggest that the integration of clinical and radiomic data improves the predictive accuracy in identifying instances of breast cancer recurrence. The XGBoost algorithm is widely recognized as the most effective algorithm in terms of performance. CONCLUSION The findings presented in this study offer significant contributions to the field of breast cancer research, particularly in relation to the prediction of cancer recurrence. These insights hold great potential for informing future investigations and clinical interventions that seek to enhance the accuracy and effectiveness of recurrence prediction in breast cancer patients.
Collapse
Affiliation(s)
- Saadia Azeroual
- LPHE-Modeling and Simulations, Faculty of Science, Mohammed V University in Rabat, Rabat, Morocco.
| | - Fatima-Ezzahraa Ben-Bouazza
- Faculty of Sciences and Technology, Hassan First University, Settat, Morocco
- LaMSN (La Maison Des Sciences Num´Eriques), Saint-Denis, France
| | - Amine Naqi
- Mohammed VI University of Sciences and Health, Casablanca, Morocco
| | - Rajaa Sebihi
- LPHE-Modeling and Simulations, Faculty of Science, Mohammed V University in Rabat, Rabat, Morocco
| |
Collapse
|
2
|
Ma W, Li M, Chu Z, Chen H. Smart Biosensor for Breast Cancer Survival Prediction Based on Multi-View Multi-Way Graph Learning. SENSORS (BASEL, SWITZERLAND) 2024; 24:3289. [PMID: 38894082 PMCID: PMC11174864 DOI: 10.3390/s24113289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Revised: 05/17/2024] [Accepted: 05/19/2024] [Indexed: 06/21/2024]
Abstract
Biosensors play a crucial role in detecting cancer signals by orchestrating a series of intricate biological and physical transduction processes. Among various cancers, breast cancer stands out due to its genetic underpinnings, which trigger uncontrolled cell proliferation, predominantly impacting women, and resulting in significant mortality rates. The utilization of biosensors in predicting survival time becomes paramount in formulating an optimal treatment strategy. However, conventional biosensors employing traditional machine learning methods encounter challenges in preprocessing features for the learning task. Despite the potential of deep learning techniques to automatically extract useful features, they often struggle to effectively leverage the intricate relationships between features and instances. To address this challenge, our study proposes a novel smart biosensor architecture that integrates a multi-view multi-way graph learning (MVMWGL) approach for predicting breast cancer survival time. This innovative approach enables the assimilation of insights from gene interactions and biosensor similarities. By leveraging real-world data, we conducted comprehensive evaluations, and our experimental results unequivocally demonstrate the superiority of the MVMWGL approach over existing methods.
Collapse
Affiliation(s)
- Wenming Ma
- School of Computer and Control Engineering, Yantai University, Yantai 264005, China; (M.L.); (Z.C.); (H.C.)
| | | | | | | |
Collapse
|
3
|
Nazari E, Naderi H, Tabadkani M, ArefNezhad R, Farzin AH, Dashtiahangar M, Khazaei M, Ferns GA, Mehrabian A, Tabesh H, Avan A. Breast cancer prediction using different machine learning methods applying multi factors. J Cancer Res Clin Oncol 2023; 149:17133-17146. [PMID: 37773467 DOI: 10.1007/s00432-023-05388-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Accepted: 09/01/2023] [Indexed: 10/01/2023]
Abstract
OBJECTIVE Breast cancer (BC) is a multifactorial disease and is one of the most common cancers globally. This study aimed to compare different machine learning (ML) techniques to develop a comprehensive breast cancer risk prediction model based on features of various factors. METHODS The population sample contained 810 records (115 cancer patients and 695 healthy individuals). 45 attributes out of 85 were selected based on the opinion of experts. These selected attributes are in genetic, biochemical, biomarker, gender, demographic and pathological factors. 13 Machine learning models were trained with proposed attributes and coefficient of attributes and internal relationships were calculated. RESULT Compared to other methods random forest (RF) has higher performance (accuracy 99.26%, precision 99%, and area under the curve (AUC) 99%). The results of assessing the impact and correlation of variables using the RF method based on PCA indicated that pathology, biomarker, biochemistry, gene, and demographic factors with a coefficient of 0.35, 0.23, 0.15, 0.14, and 0.13 respectively, affected the risk of BC (r2 = 0.54). CONCLUSION Breast cancer has several risk factors. Medical experts use these risk factors for early diagnosis. Therefore, identifying related risk factors and their effect can increase the accuracy of diagnosis. Considering the broad features for predicting breast cancer leads to the development of a comprehensive prediction model. In this study, using RF technique a breast cancer prediction model with 99.3% accuracy was developed based on multifactorial features.
Collapse
Affiliation(s)
- Elham Nazari
- Faculty of Medicine, Department of Medical Informatics, Mashhad University of Medical Sciences, Mashhad, Iran
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
- Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hamid Naderi
- Faculty of Medicine, Department of Medical Informatics, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mahla Tabadkani
- Student Research Committee, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Reza ArefNezhad
- Halal Research Center of IRI, FDA, Tehran, Iran
- Department of Anatomy, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| | | | | | - Majid Khazaei
- Student Research Committee, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Gordon A Ferns
- Division of Medical Education, Brighton & Sussex Medical School, Falmer, Brighton, BN1 9PH, Sussex, UK
| | - Amin Mehrabian
- Warwick Medical School, University of Warwick, Coventry, UK
| | - Hamed Tabesh
- Faculty of Medicine, Department of Medical Informatics, Mashhad University of Medical Sciences, Mashhad, Iran.
| | - Amir Avan
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.
- Faculty of Health, School of Biomedical Sciences, Queensland University of Technology, Brisbane, QLD, Australia.
- College of Medicine, University of Warith Al-Anbiyaa, Karbala, Iraq.
| |
Collapse
|
4
|
Yan S, Li J, Wu W. Artificial intelligence in breast cancer: application and future perspectives. J Cancer Res Clin Oncol 2023; 149:16179-16190. [PMID: 37656245 DOI: 10.1007/s00432-023-05337-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 08/24/2023] [Indexed: 09/02/2023]
Abstract
Breast cancer is one of the most common cancers and is one of the leading causes of cancer-related deaths in women worldwide. Early diagnosis and treatment are the key for a favorable prognosis. The application of artificial intelligence technology in the medical field is increasingly extensive, including image analysis, automated diagnosis, intelligent pharmaceutical system, personalized treatment and so on. AI-based breast cancer imaging, pathology and adjuvant therapy technology cannot only reduce the workload of clinicians, but also continuously improve the accuracy and sensitivity of breast cancer diagnosis and treatment. This paper reviews the application of AI in breast cancer, as well as looks ahead and poses challenges to the future development of AI for breast cancer detection and therapeutic, so as to provide ideas for future research.
Collapse
Affiliation(s)
- Shuixin Yan
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315000, Zhejiang, China
| | - Jiadi Li
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315000, Zhejiang, China
| | - Weizhu Wu
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315000, Zhejiang, China.
| |
Collapse
|
5
|
El Haji H, Souadka A, Patel BN, Sbihi N, Ramasamy G, Patel BK, Ghogho M, Banerjee I. Evolution of Breast Cancer Recurrence Risk Prediction: A Systematic Review of Statistical and Machine Learning-Based Models. JCO Clin Cancer Inform 2023; 7:e2300049. [PMID: 37566789 DOI: 10.1200/cci.23.00049] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 05/11/2023] [Accepted: 06/14/2023] [Indexed: 08/13/2023] Open
Abstract
PURPOSE Selection of appropriate adjuvant therapy to ultimately reduce the risk of breast cancer (BC) recurrence is a challenge for medical oncologists. Several automated risk prediction models have been developed using retrospective clinical data and have evolved significantly over the years in terms of predictors of recurrence, data usage, and predictive techniques (statistical/machine learning [ML]). METHODS Following PRISMA guidelines, we performed a systematic literature review of the aforementioned statistical and ML models published between January 2008 and December 2022 through searching five digital databases-PubMed, ScienceDirect, Scopus, Cochrane, and Web of Science. The comprehensive search yielded a total of 163 papers and after a screening process focusing on papers that dealt exclusively with statistical/ML methods, only 23 papers were deemed appropriate for further analysis. We benchmarked the studies on the basis of development, evaluation metrics, and validation strategy with an added emphasis on racial diversity of patients included in the studies. RESULTS In total, 30.4% of the included studies use statistical techniques, while 69.6% are ML-based. Among these, traditional ML models (support vector machines, decision tree, logistic regression, and naïve Bayes) are the most frequently used (26.1%) along with deep learning (26.1%). Deep learning and ensemble learning provide the most accurate predictions (AUC = 0.94 each). CONCLUSION ML-based prediction models exhibit outstanding performance, yet their practical applicability might be hindered by limited interpretability and reduced generalization. Moreover, predictive models for BC recurrence often focus on limited variables related to tumor, treatment, molecular, and clinical features. Imbalanced classes and the lack of open-source data sets impede model development and validation. Furthermore, existing models predominantly overlook African and Middle Eastern populations, as they are trained and validated mainly on Caucasian and Asian patients.
Collapse
Affiliation(s)
- Hasna El Haji
- Department of Radiology, Mayo Clinic, Phoenix, AZ
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ
- International University of Rabat, TICLab, Rabat, Morocco
| | - Amine Souadka
- Surgical Oncology Department, National Institute of Oncology, Mohammed V University in Rabat, Rabat, Morocco
| | - Bhavik N Patel
- Department of Radiology, Mayo Clinic, Phoenix, AZ
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ
| | - Nada Sbihi
- International University of Rabat, TICLab, Rabat, Morocco
| | - Gokul Ramasamy
- Department of Radiology, Mayo Clinic, Phoenix, AZ
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ
| | | | - Mounir Ghogho
- International University of Rabat, TICLab, Rabat, Morocco
- University of Leeds, Faculty of Engineering, Leeds, United Kingdom
| | - Imon Banerjee
- Department of Radiology, Mayo Clinic, Phoenix, AZ
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ
| |
Collapse
|
6
|
Su F, Chao J, Liu P, Zhang B, Zhang N, Luo Z, Han J. Prognostic models for breast cancer: based on logistics regression and Hybrid Bayesian Network. BMC Med Inform Decis Mak 2023; 23:120. [PMID: 37443001 PMCID: PMC10347801 DOI: 10.1186/s12911-023-02224-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 07/03/2023] [Indexed: 07/15/2023] Open
Abstract
BACKGROUND To construct two prognostic models to predict survival in breast cancer patients; to compare the efficacy of the two models in the whole group and the advanced human epidermal growth factor receptor-2-positive (HER2+) subgroup of patients; to conclude whether the Hybrid Bayesian Network (HBN) model outperformed the logistics regression (LR) model. METHODS In this paper, breast cancer patient data were collected from the SEER database. Data processing and analysis were performed using Rstudio 4.2.0, including data preprocessing, model construction and validation. The L_DVBN algorithm in Julia0.4.7 and bnlearn package in R was used to build and evaluate the HBN model. Data with a diagnosis time of 2018(n = 23,384) were distributed randomly as training and testing sets in the ratio of 7:3 using the leave-out method for model construction and internal validation. External validation of the model was done using the dataset of 2019(n = 8128). Finally, the late HER2 + patients(n = 395) was selected for subgroup analysis. Accuracy, calibration and net benefit of clinical decision making were evaluated for both models. RESULTS The HBN model showed that seventeen variables were associated with survival outcome, including age, tumor size, site, histologic type, radiotherapy, surgery, chemotherapy, distant metastasis, subtype, clinical stage, ER receptor, PR receptor, clinical grade, race, marital status, tumor laterality, and lymph node. The AUCs for the internal validation of the LR and HBN models were 0.831 and 0.900; The AUCs for the external validation of the LR and HBN models on the whole population were 0.786 and 0.871; the AUCs for the external validation of the two models on the subgroup population were 0.601 and 0.813. CONCLUSION The accuracy, net clinical benefit, and calibration of the HBN model were better than LR model. The predictive efficacy of both models decreased and the difference was greater in advanced HER2 + patients, which means the HBN model had higher robustness and more stable predictive performance in the subgroup.
Collapse
Affiliation(s)
- Fan Su
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| | - Jianqian Chao
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
- Department of Medical Insurance, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| | - Pei Liu
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| | - Bowen Zhang
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| | - Na Zhang
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| | - Zongyu Luo
- Department of Medical Insurance, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| | - Jiaying Han
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, No. 87 Ding Jia Qiao, Central Gate Street, Gulou District, Nanjing, Jiangsu China
| |
Collapse
|
7
|
Zou S, Lin Y, Yu X, Eriksson M, Lin M, Fu F, Yang H. Genetic and lifestyle factors for breast cancer risk assessment in Southeast China. Cancer Med 2023; 12:15504-15514. [PMID: 37264741 PMCID: PMC10417168 DOI: 10.1002/cam4.6198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Revised: 04/01/2023] [Accepted: 05/23/2023] [Indexed: 06/03/2023] Open
Abstract
BACKGROUND Despite the rising incidence and mortality of breast cancer among women in China, there are currently few predictive models for breast cancer in the Chinese population and with low accuracy. This study aimed to identify major genetic and life-style risk factors in a Chinese population for potential application in risk assessment models. METHODS A case-control study in southeast China was conducted including 1321 breast cancer patients and 2045 controls during 2013-2016, in which the data were randomly divided into a training set and a test set on a 7:3 scale. The association between genetic and life-style factors and breast cancer was examined using logistic regression models. Using AUC curves, we also compared the performance of the logistic model to machine learning models, namely LASSO regression model and support vector machine (SVM), and the scores calculated from CKB, Gail and Tyrer-Cuzick models in the test set. RESULTS Among all factors considered, the best model was achieved when polygenetic risk score, lifestyle, and reproductive factors were considered jointly in the logistic regression model (AUC = 0.73; 95% CI: 0.70-0.77). The models created in this study performed better than those using scores calculated from the CKB, Gail, and Tyrer-Cuzick models. However, the logistic model and machine learning models did not significantly differ from one another. CONCLUSION In summary, we have found genetic and lifestyle risk predictors for breast cancer with moderate discrimination, which might provide reference for breast cancer screening in southeast China. Further population-based studies are needed to validate the model for future applications in personalized breast cancer screening programs.
Collapse
Affiliation(s)
- Shuqing Zou
- Department of Epidemiology and Health Statistics, School of Public HealthFujian Medical UniversityFuzhouChina
| | - Yuxiang Lin
- Department of Breast SurgeryFujian Medical University Union HospitalFuzhouChina
- Department of General SurgeryFujian Medical University Union HospitalFuzhouChina
- Breast Cancer Institute, Fujian Medical UniversityFuzhouChina
| | - Xingxing Yu
- Department of Epidemiology and Health Statistics, School of Public HealthFujian Medical UniversityFuzhouChina
| | - Mikael Eriksson
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetStockholmSweden
| | | | - Fangmeng Fu
- Department of Breast SurgeryFujian Medical University Union HospitalFuzhouChina
- Department of General SurgeryFujian Medical University Union HospitalFuzhouChina
- Breast Cancer Institute, Fujian Medical UniversityFuzhouChina
| | - Haomin Yang
- Department of Epidemiology and Health Statistics, School of Public HealthFujian Medical UniversityFuzhouChina
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetStockholmSweden
| |
Collapse
|
8
|
C Manikis G, Simos NJ, Kourou K, Kondylakis H, Poikonen-Saksela P, Mazzocco K, Pat-Horenczyk R, Sousa B, Oliveira-Maia AJ, Mattson J, Roziner I, Marzorati C, Marias K, Nuutinen M, Karademas E, Fotiadis D. Personalized Risk Analysis to Improve the Psychological Resilience of Women Undergoing Treatment for Breast Cancer: Development of a Machine Learning-Driven Clinical Decision Support Tool. J Med Internet Res 2023; 25:e43838. [PMID: 37307043 PMCID: PMC10337304 DOI: 10.2196/43838] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 04/04/2023] [Accepted: 04/19/2023] [Indexed: 06/13/2023] Open
Abstract
BACKGROUND Health professionals are often faced with the need to identify women at risk of manifesting poor psychological resilience following the diagnosis and treatment of breast cancer. Machine learning algorithms are increasingly used to support clinical decision support (CDS) tools in helping health professionals identify women who are at risk of adverse well-being outcomes and plan customized psychological interventions for women at risk. Clinical flexibility, cross-validated performance accuracy, and model explainability permitting person-specific identification of risk factors are highly desirable features of such tools. OBJECTIVE This study aimed to develop and cross-validate machine learning models designed to identify breast cancer survivors at risk of poor overall mental health and global quality of life and identify potential targets of personalized psychological interventions according to an extensive set of clinical recommendations. METHODS A set of 12 alternative models was developed to improve the clinical flexibility of the CDS tool. All models were validated using longitudinal data from a prospective, multicenter clinical pilot at 5 major oncology centers in 4 countries (Italy, Finland, Israel, and Portugal; the Predicting Effective Adaptation to Breast Cancer to Help Women to BOUNCE Back [BOUNCE] project). A total of 706 patients with highly treatable breast cancer were enrolled shortly after diagnosis and before the onset of oncological treatments and were followed up for 18 months. An extensive set of demographic, lifestyle, clinical, psychological, and biological variables measured within 3 months after enrollment served as predictors. Rigorous feature selection isolated key psychological resilience outcomes that could be incorporated into future clinical practice. RESULTS Balanced random forest classifiers were successful at predicting well-being outcomes, with accuracies ranging between 78% and 82% (for 12-month end points after diagnosis) and between 74% and 83% (for 18-month end points after diagnosis). Explainability and interpretability analyses built on the best-performing models were used to identify potentially modifiable psychological and lifestyle characteristics that, if addressed systematically in the context of personalized psychological interventions, would be most likely to promote resilience for a given patient. CONCLUSIONS Our results highlight the clinical utility of the BOUNCE modeling approach by focusing on resilience predictors that can be readily available to practicing clinicians at major oncology centers. The BOUNCE CDS tool paves the way for personalized risk assessment methods to identify patients at high risk of adverse well-being outcomes and direct valuable resources toward those most in need of specialized psychological interventions.
Collapse
Affiliation(s)
- Georgios C Manikis
- Foundation for Research and Technology-Hellas, Institute of Computer Science, Heraklion, Greece
| | - Nicholas J Simos
- Foundation for Research and Technology-Hellas, Institute of Computer Science, Heraklion, Greece
| | - Konstantina Kourou
- Department of Materials Science and Engineering, University of Ioannina, Ioannina, Greece
- Foundation for Research and Technology-Hellas, Biomedical Research Institute, Ioannina, Greece
| | - Haridimos Kondylakis
- Foundation for Research and Technology-Hellas, Institute of Computer Science, Heraklion, Greece
| | - Paula Poikonen-Saksela
- Comprehensive Cancer Center, Helsinki University Hospital, University of Helsinki, Helsinki, Finland
| | - Ketti Mazzocco
- Applied Research Division for Cognitive and Psychological Science, European Institute of Oncology Istituto di Ricovero e Cura a Carattere Scientifico, Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| | - Ruth Pat-Horenczyk
- School of Social Work and Social Welfare,The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Berta Sousa
- Breast Unit, Champalimaud Research and Clinical Centre, Champalimaud Foundation, Lisbon, Portugal
| | | | - Johanna Mattson
- Comprehensive Cancer Center, Helsinki University Hospital, University of Helsinki, Helsinki, Finland
| | - Ilan Roziner
- Department of Communication Disorders, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Chiara Marzorati
- Applied Research Division for Cognitive and Psychological Science, European Institute of Oncology Istituto di Ricovero e Cura a Carattere Scientifico, Milan, Italy
| | - Kostas Marias
- Foundation for Research and Technology-Hellas, Institute of Computer Science, Heraklion, Greece
| | | | - Evangelos Karademas
- Foundation for Research and Technology-Hellas, Institute of Computer Science, Heraklion, Greece
| | - Dimitrios Fotiadis
- Department of Materials Science and Engineering, University of Ioannina, Ioannina, Greece
- Foundation for Research and Technology-Hellas, Biomedical Research Institute, Ioannina, Greece
| |
Collapse
|
9
|
Silva-Aravena F, Núñez Delafuente H, Gutiérrez-Bahamondes JH, Morales J. A Hybrid Algorithm of ML and XAI to Prevent Breast Cancer: A Strategy to Support Decision Making. Cancers (Basel) 2023; 15:cancers15092443. [PMID: 37173910 PMCID: PMC10177162 DOI: 10.3390/cancers15092443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 04/15/2023] [Accepted: 04/18/2023] [Indexed: 05/15/2023] Open
Abstract
Worldwide, the coronavirus has intensified the management problems of health services, significantly harming patients. Some of the most affected processes have been cancer patients' prevention, diagnosis, and treatment. Breast cancer is the most affected, with more than 20 million cases and at least 10 million deaths by 2020. Various studies have been carried out to support the management of this disease globally. This paper presents a decision support strategy for health teams based on machine learning (ML) tools and explainability algorithms (XAI). The main methodological contributions are: first, the evaluation of different ML algorithms that allow classifying patients with and without cancer from the available dataset; and second, an ML methodology mixed with an XAI algorithm, which makes it possible to predict the disease and interpret the variables and how they affect the health of patients. The results show that first, the XGBoost Algorithm has a better predictive capacity, with an accuracy of 0.813 for the train data and 0.81 for the test data; and second, with the SHAP algorithm, it is possible to know the relevant variables and their level of significance in the prediction, and to quantify the impact on the clinical condition of the patients, which will allow health teams to offer early and personalized alerts for each patient.
Collapse
Affiliation(s)
- Fabián Silva-Aravena
- Facultad de Ciencias Sociales y Económicas, Universidad Católica del Maule, Avenida San Miguel 3605, Talca 3460000, Chile
| | - Hugo Núñez Delafuente
- Doctorado en Sistemas de Ingeniería, Facultad de Ingeniería, Universidad de Talca, Camino Los Niches Km 1, Curicó 3340000, Chile
| | - Jimmy H Gutiérrez-Bahamondes
- Doctorado en Sistemas de Ingeniería, Facultad de Ingeniería, Universidad de Talca, Camino Los Niches Km 1, Curicó 3340000, Chile
| | - Jenny Morales
- Facultad de Ciencias Sociales y Económicas, Universidad Católica del Maule, Avenida San Miguel 3605, Talca 3460000, Chile
| |
Collapse
|
10
|
Bigarré C, Bertucci F, Finetti P, Macgrogan G, Muracciole X, Benzekry S. Mechanistic modeling of metastatic relapse in early breast cancer to investigate the biological impact of prognostic biomarkers. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 231:107401. [PMID: 36804267 DOI: 10.1016/j.cmpb.2023.107401] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 01/12/2023] [Accepted: 02/01/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND AND OBJECTIVE Estimating the risk of metastatic relapse is a major challenge to decide adjuvant treatment options in early-stage breast cancer (eBC). To date, distant metastasis-free survival (DMFS) analysis mainly relies on classical, agnostic, statistical models (e.g., Cox regression). Instead, we propose here to derive mechanistic models of DMFS. METHODS The present series consisted of eBC patients who did not receive adjuvant systemic therapy from three datasets, composed respectively of 692 (Bergonié Institute), 591 (Paoli-Calmettes Institute, IPC), and 163 (Public Hospital Marseille, AP-HM) patients with routine clinical annotations. The last dataset also contained expression of three non-routine biomarkers. Our mechanistic model of DMFS relies on two mathematical parameters that represent growth (α) and dissemination (μ). We identified their population distributions using mixed-effects modeling. Critically, we propose a novel variable selection procedure allowing to: (i) identify the association of biological parameters with either α, μ or both, and (ii) generate an optimal candidate model for DMFS prediction. RESULTS We found that Ki67 and Thymidine Kinase-1 were associated with α, and nodal status and Plasminogen Activator Inhibitor-1 with μ. The predictive performances of the model were excellent in calibration but moderate in discrimination, with c-indices of 0.72 (95% CI [0.48, 0.95], AP-HM), 0.63 ([0.44, 0.83], Bergonié) and 0.60 (95% CI [0.54, 0.80], IPC). CONCLUSIONS Overall, we demonstrate that our novel method combining mechanistic and advanced statistical modeling is able to unravel the biological roles of clinicopathological parameters from DMFS data.
Collapse
Affiliation(s)
- Célestin Bigarré
- COMPO, Inria Méditerranée, Cancer Research Center of Marseille, Inserm UMR1068, CNRS UMR7258, Aix Marseille University UM105, 13385 Marseille, France.
| | - François Bertucci
- Predictive Oncology Laboratory, Marseille Cancer Research Centre (CRCM), Inserm U1068, CNRS UMR7258, Institut Paoli-Calmettes, Equipe labellisée Ligue Nationale Contre Le Cancer, Aix-Marseille University, Marseille, France; Department of Medical Oncology, CRCM, Institut Paoli-Calmettes, CNRS, Inserm, Aix-Marseille University, Marseille, France
| | - Pascal Finetti
- Predictive Oncology Laboratory, Marseille Cancer Research Centre (CRCM), Inserm U1068, CNRS UMR7258, Institut Paoli-Calmettes, Equipe labellisée Ligue Nationale Contre Le Cancer, Aix-Marseille University, Marseille, France
| | - Gaëtan Macgrogan
- Department of Biopathology, Institut Bergonié, Regional Comprehensive Cancer Centre, Bordeaux, France; Inserm U1218, Bordeaux Public Health, University of Bordeaux, Bordeaux, France
| | - Xavier Muracciole
- COMPO, Inria Méditerranée, Cancer Research Center of Marseille, Inserm UMR1068, CNRS UMR7258, Aix Marseille University UM105, 13385 Marseille, France; Radiotherapy Department, Assistance Publique - Hôpitaux de Marseille, Aix Marseille University, Marseille, France
| | - Sébastien Benzekry
- COMPO, Inria Méditerranée, Cancer Research Center of Marseille, Inserm UMR1068, CNRS UMR7258, Aix Marseille University UM105, 13385 Marseille, France
| |
Collapse
|
11
|
Sarkar S, Mali K. Firefly-SVM predictive model for breast cancer subgroup classification with clinicopathological parameters. Digit Health 2023; 9:20552076231207203. [PMID: 37860702 PMCID: PMC10583530 DOI: 10.1177/20552076231207203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 09/26/2023] [Indexed: 10/21/2023] Open
Abstract
Background Breast cancer is a highly predominant destructive disease among women characterised with varied tumour biology, molecular subgroups and diverse clinicopathological specifications. The potentiality of machine learning to transform complex medical data into meaningful knowledge has led to its application in breast cancer detection and prognostic evaluation. Objective The emergence of data-driven diagnostic model for assisting clinicians in diagnostic decision making has gained an increasing curiosity in breast cancer identification and analysis. This motivated us to develop a breast cancer data-driven model for subtype classification more accurately. Method In this article, we proposed a firefly-support vector machine (SVM) breast cancer predictive model that uses clinicopathological and demographic data gathered from various tertiary care cancer hospitals or oncological centres to distinguish between patients with triple-negative breast cancer (TNBC) and non-triple-negative breast cancer (non-TNBC). Results The results of the firefly-support vector machine (firefly-SVM) predictive model were distinguished from the traditional grid search-support vector machine (Grid-SVM) model, particle swarm optimisation-support vector machine (PSO-SVM) and genetic algorithm-support vector machine (GA-SVM) hybrid models through hyperparameter tuning. The findings show that the recommended firefly-SVM classification model outperformed other existing models in terms of prediction accuracy (93.4%, 86.6%, 69.6%) for automated SVM parameter selection. The effectiveness of the prediction model was also evaluated using well-known metrics, such as the F1-score, mean square error, area under the ROC curve, logarithmic loss and precision-recall curve. Conclusion Firefly-SVM predictive model may be treated as an alternate tool for breast cancer subgroup classification that would benefit the clinicians for managing the patient with proper treatment and diagnostic outcome.
Collapse
Affiliation(s)
- Suvobrata Sarkar
- Department of Computer Science and Engineering, Dr. B.C. Roy Engineering College, Durgapur, West Bengal, India
| | - Kalyani Mali
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, India
| |
Collapse
|
12
|
Huang MJ, Ye L, Yu KX, Liu J, Li K, Wang XD, Li JP. Development of prediction model of low anterior resection syndrome for colorectal cancer patients after surgery based on machine-learning technique. Cancer Med 2023; 12:1501-1519. [PMID: 35899858 PMCID: PMC9883536 DOI: 10.1002/cam4.5041] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 06/18/2022] [Accepted: 07/07/2022] [Indexed: 02/05/2023] Open
Abstract
BACKGROUND Low anterior resection syndrome (LARS) is a common postoperative complication in patients with colorectal cancer, which seriously affects their postoperative quality of life. At present, the aetiology of LARS is still unclear, but some risk factors have been studied. Accurate prediction and early management of medical intervention are keys to improving the quality of life of such high-risk patients. OBJECTIVES Based on machine-learning methods, this study used the follow-up results of postoperative patients with colorectal cancer to develop prediction models for LARS and conducted a comparative analysis between the different models. METHODS A total of 382 patients diagnosed with colorectal cancer and undergoing surgery at West China Hospital from April 2017 to December 2020 were retrospectively selected as the development cohort. Logistic regression, support vector machine, decision tree, random forest and artificial neural network algorithms were used to construct the prediction models of the obtained dataset. The models were internally validated using cross-validation. The area under the curve and Brier score measures were used to evaluate and compare the differentiation and calibration degrees of the models. The sensitivity, specificity, positive predictive value and negative predictive value of the different models were described for clinical use. RESULTS A total of 342 patients were included, the incidence of LARS being 47.4% (162/342) during the six-month follow-up. After feature selection, the factors influencing the occurrence of LARS were found to be location, distance, diverting stoma, exsufflation and surgical type. The prediction models based on five machine-learning methods all showed acceptable performance. CONCLUSIONS The five models developed based on the machine-learning methods showed good prediction performance. However, considering the simplicity of clinical use of the model results, the logistic regression model is most recommended. The clinical applicability of these models will also need to be evaluated with external cohort data.
Collapse
Affiliation(s)
- Ming Jun Huang
- West China School of Nursing/Day Surgery Center, West China Hospital, Sichuan University, Chengdu, China
| | - Lin Ye
- West China School of Stomatology, Sichuan University, Chengdu, China.,Division of Gastrointestinal Surgery, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, China
| | - Ke Xin Yu
- Division of Gastrointestinal Surgery, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, China.,West China Medical School, Sichuan University, Chengdu, China
| | - Jing Liu
- Division of Gastrointestinal Surgery, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, China.,West China School of Nursing/West China Hospital, Sichuan University, Chengdu, China
| | - Ka Li
- West China School of Nursing/West China Hospital, Sichuan University, Chengdu, China
| | - Xiao Dong Wang
- Division of Gastrointestinal Surgery, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, China
| | - Ji Ping Li
- Nursing Department, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
13
|
Cè M, Caloro E, Pellegrino ME, Basile M, Sorce A, Fazzini D, Oliva G, Cellina M. Artificial intelligence in breast cancer imaging: risk stratification, lesion detection and classification, treatment planning and prognosis-a narrative review. EXPLORATION OF TARGETED ANTI-TUMOR THERAPY 2022; 3:795-816. [PMID: 36654817 PMCID: PMC9834285 DOI: 10.37349/etat.2022.00113] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 09/28/2022] [Indexed: 12/28/2022] Open
Abstract
The advent of artificial intelligence (AI) represents a real game changer in today's landscape of breast cancer imaging. Several innovative AI-based tools have been developed and validated in recent years that promise to accelerate the goal of real patient-tailored management. Numerous studies confirm that proper integration of AI into existing clinical workflows could bring significant benefits to women, radiologists, and healthcare systems. The AI-based approach has proved particularly useful for developing new risk prediction models that integrate multi-data streams for planning individualized screening protocols. Furthermore, AI models could help radiologists in the pre-screening and lesion detection phase, increasing diagnostic accuracy, while reducing workload and complications related to overdiagnosis. Radiomics and radiogenomics approaches could extrapolate the so-called imaging signature of the tumor to plan a targeted treatment. The main challenges to the development of AI tools are the huge amounts of high-quality data required to train and validate these models and the need for a multidisciplinary team with solid machine-learning skills. The purpose of this article is to present a summary of the most important AI applications in breast cancer imaging, analyzing possible challenges and new perspectives related to the widespread adoption of these new tools.
Collapse
Affiliation(s)
- Maurizio Cè
- Postgraduate School in Diagnostic and Interventional Radiology, University of Milan, 20122 Milan, Italy
| | - Elena Caloro
- Postgraduate School in Diagnostic and Interventional Radiology, University of Milan, 20122 Milan, Italy
| | - Maria E. Pellegrino
- Postgraduate School in Diagnostic and Interventional Radiology, University of Milan, 20122 Milan, Italy
| | - Mariachiara Basile
- Postgraduate School in Diagnostic and Interventional Radiology, University of Milan, 20122 Milan, Italy
| | - Adriana Sorce
- Postgraduate School in Diagnostic and Interventional Radiology, University of Milan, 20122 Milan, Italy
| | | | - Giancarlo Oliva
- Department of Radiology, ASST Fatebenefratelli Sacco, 20121 Milan, Italy
| | - Michaela Cellina
- Department of Radiology, ASST Fatebenefratelli Sacco, 20121 Milan, Italy
| |
Collapse
|
14
|
Yu W, Lu Y, Shou H, Xu H, Shi L, Geng X, Song T. A 5‐year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms. Cancer Med 2022; 12:6867-6876. [PMID: 36479910 PMCID: PMC10067071 DOI: 10.1002/cam4.5477] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2022] [Revised: 10/31/2022] [Accepted: 11/11/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Prediction models with high accuracy rates for nonmetastatic cervical cancer (CC) patients are limited. This study aimed to construct and compare predictive models on the basis of machine learning (ML) algorithms for predicting the 5-year survival status of CC patients through using the Surveillance, Epidemiology, and End Results public database of the National Cancer Institute. METHODS The data registered from 2004 to 2016 were extracted and randomly divided into training and validation cohorts (8:2). The least absolute shrinkage and selection operator (LASSO) regression was employed to identify significant factors. Then, four predictive models were constructed, including logistic regression (LR), random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost). The predictive models were evaluated and compared using Receiver-operating characteristics with areas under the curves (AUCs) and decision curve analysis (DCA), respectively. RESULTS A total of 13,802 patients were involved and classified into training (N = 11,041) and validation (N = 2761) cohorts. By using the LASSO regression method, seven factors were identified. In the training cohort, the XGBoost model showed the best performance (AUC = 0.8400) compared to the other three models (all p < 0.05 by Delong's test). In the validation cohort, the XGBoost model also demonstrated a superior prediction ability (AUC = 0.8365) than LR and SVM models (both p < 0.05 by Delong's test), although the difference was not statistically significant between the XGBoost and the RF models (p = 0.4251 by Delong's test). Based on the DCA results, the XGBoost model was also superior, and feature importance analysis indicated that the tumor stage was the most important variable among the seven factors. CONCLUSIONS The XGBoost model proved to be an effective algorithm with better prediction abilities. This model is proposed to support better decision-making for nonmetastatic CC patients in the future.
Collapse
Affiliation(s)
- Wenke Yu
- Department of Radiology Qingchun Hospital of Zhejiang Province Hangzhou Zhejiang China
| | - Yanwei Lu
- Cancer Center, Department of Radiation Oncology Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College Hangzhou Zhejiang China
| | - Huafeng Shou
- Department of Gynecology Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College Hangzhou Zhejiang China
| | - Hong’en Xu
- Cancer Center, Department of Radiation Oncology Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College Hangzhou Zhejiang China
| | - Lei Shi
- Cancer Center, Department of Radiation Oncology Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College Hangzhou Zhejiang China
| | - Xiaolu Geng
- Department of Radiology Qingchun Hospital of Zhejiang Province Hangzhou Zhejiang China
| | - Tao Song
- Cancer Center, Department of Radiation Oncology Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College Hangzhou Zhejiang China
| |
Collapse
|
15
|
Hanis TM, Ruhaiyem NIR, Arifin WN, Haron J, Wan Abdul Rahman WF, Abdullah R, Musa KI. Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records. Diagnostics (Basel) 2022; 12:diagnostics12112826. [PMID: 36428886 PMCID: PMC9689364 DOI: 10.3390/diagnostics12112826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 10/13/2022] [Accepted: 10/15/2022] [Indexed: 11/18/2022] Open
Abstract
This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer.
Collapse
Affiliation(s)
- Tengku Muhammad Hanis
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Correspondence: (T.M.H.); (K.I.M.)
| | | | - Wan Nor Arifin
- Biostatistics and Research Methodology Unit, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Juhara Haron
- Department of Radiology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Breast Cancer Awareness and Research Unit, Hospital Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Wan Faiziah Wan Abdul Rahman
- Breast Cancer Awareness and Research Unit, Hospital Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Department of Pathology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Rosni Abdullah
- School of Computer Sciences, Universiti Sains Malaysia, Gelugor 11800, Penang, Malaysia
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Correspondence: (T.M.H.); (K.I.M.)
| |
Collapse
|
16
|
Mazo C, Aura C, Rahman A, Gallagher WM, Mooney C. Application of Artificial Intelligence Techniques to Predict Risk of Recurrence of Breast Cancer: A Systematic Review. J Pers Med 2022; 12:1496. [PMID: 36143281 PMCID: PMC9500690 DOI: 10.3390/jpm12091496] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 09/05/2022] [Accepted: 09/09/2022] [Indexed: 12/31/2022] Open
Abstract
Breast cancer is the most common disease among women, with over 2.1 million new diagnoses each year worldwide. About 30% of patients initially presenting with early stage disease have a recurrence of cancer within 10 years. Predicting who will have a recurrence and who will not remains challenging, with consequent implications for associated treatment. Artificial intelligence strategies that can predict the risk of recurrence of breast cancer could help breast cancer clinicians avoid ineffective overtreatment. Despite its significance, most breast cancer recurrence datasets are insufficiently large, not publicly available, or imbalanced, making these studies more difficult. This systematic review investigates the role of artificial intelligence in the prediction of breast cancer recurrence. We summarise common techniques, features, training and testing methodologies, metrics, and discuss current challenges relating to implementation in clinical practice. We systematically reviewed works published between 1 January 2011 and 1 November 2021 using the methodology of Kitchenham and Charter. We leveraged Springer, Google Scholar, PubMed, and IEEE search engines. This review found three areas that require further work. First, there is no agreement on artificial intelligence methodologies, feature predictors, or assessment metrics. Second, issues such as sampling strategies, missing data, and class imbalance problems are rarely addressed or discussed. Third, representative datasets for breast cancer recurrence are scarce, which hinders model validation and deployment. We conclude that predicting breast cancer recurrence remains an open problem despite the use of artificial intelligence.
Collapse
Affiliation(s)
- Claudia Mazo
- UCD School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Claudia Aura
- UCD School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Arman Rahman
- UCD School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - William M. Gallagher
- UCD School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Catherine Mooney
- UCD School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland
| |
Collapse
|
17
|
Sarkar S, Mali K. Breast Cancer Subtypes Classification with Hybrid Machine Learning Model. Methods Inf Med 2022; 61:68-83. [PMID: 36096144 DOI: 10.1055/s-0042-1751043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
Abstract
BACKGROUND Breast cancer is the most prevailing heterogeneous disease among females characterized with distinct molecular subtypes and varied clinicopathological features. With the emergence of various artificial intelligence techniques especially machine learning, the breast cancer research has attained new heights in cancer detection and prognosis. OBJECTIVE Recent development in computer driven diagnostic system has enabled the clinicians to improve the accuracy in detecting various types of breast tumors. Our study is to develop a computer driven diagnostic system which will enable the clinicians to improve the accuracy in detecting various types of breast tumors. METHODS In this article, we proposed a breast cancer classification model based on the hybridization of machine learning approaches for classifying triple-negative breast cancer and non-triple negative breast cancer patients with clinicopathological features collected from multiple tertiary care hospitals/centers. RESULTS The results of genetic algorithm and support vector machine (GA-SVM) hybrid model was compared with classics feature selection SVM hybrid models like support vector machine-recursive feature elimination (SVM-RFE), LASSO-SVM, Grid-SVM, and linear SVM. The classification results obtained from GA-SVM hybrid model outperformed the other compared models when applied on two distinct hospital-based datasets of patients investigated with breast cancer in North West of African subcontinent. To validate the predictive model accuracy, 10-fold cross-validation method was applied on all models with the same multicentered datasets. The model performance was evaluated with well-known metrics like mean squared error, logarithmic loss, F1-score, area under the ROC curve, and the precision-recall curve. CONCLUSION The hybrid machine learning model can be employed for breast cancer subtypes classification that could help the medical practitioners in better treatment planning and disease outcome.
Collapse
Affiliation(s)
- Suvobrata Sarkar
- Department of Computer Science and Engineering, Dr. B.C. Roy Engineering College, Durgapur, West Bengal, India
| | - Kalyani Mali
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, India
| |
Collapse
|
18
|
Theophanous S, Lønne PI, Choudhury A, Berbee M, Dekker A, Dennis K, Dewdney A, Gambacorta MA, Gilbert A, Guren MG, Holloway L, Jadon R, Kochhar R, Mohamed AA, Muirhead R, Parés O, Raszewski L, Roy R, Scarsbrook A, Sebag-Montefiore D, Spezi E, Spindler KLG, van Triest B, Vassiliou V, Malinen E, Wee L, Appelt AL. Development and validation of prognostic models for anal cancer outcomes using distributed learning: protocol for the international multi-centre atomCAT2 study. Diagn Progn Res 2022; 6:14. [PMID: 35922837 PMCID: PMC9351222 DOI: 10.1186/s41512-022-00128-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 06/09/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Anal cancer is a rare cancer with rising incidence. Despite the relatively good outcomes conferred by state-of-the-art chemoradiotherapy, further improving disease control and reducing toxicity has proven challenging. Developing and validating prognostic models using routinely collected data may provide new insights for treatment development and selection. However, due to the rarity of the cancer, it can be difficult to obtain sufficient data, especially from single centres, to develop and validate robust models. Moreover, multi-centre model development is hampered by ethical barriers and data protection regulations that often limit accessibility to patient data. Distributed (or federated) learning allows models to be developed using data from multiple centres without any individual-level patient data leaving the originating centre, therefore preserving patient data privacy. This work builds on the proof-of-concept three-centre atomCAT1 study and describes the protocol for the multi-centre atomCAT2 study, which aims to develop and validate robust prognostic models for three clinically important outcomes in anal cancer following chemoradiotherapy. METHODS This is a retrospective multi-centre cohort study, investigating overall survival, locoregional control and freedom from distant metastasis after primary chemoradiotherapy for anal squamous cell carcinoma. Patient data will be extracted and organised at each participating radiotherapy centre (n = 18). Candidate prognostic factors have been identified through literature review and expert opinion. Summary statistics will be calculated and exchanged between centres prior to modelling. The primary analysis will involve developing and validating Cox proportional hazards models across centres for each outcome through distributed learning. Outcomes at specific timepoints of interest and factor effect estimates will be reported, allowing for outcome prediction for future patients. DISCUSSION The atomCAT2 study will analyse one of the largest available cross-institutional cohorts of patients with anal cancer treated with chemoradiotherapy. The analysis aims to provide information on current international clinical practice outcomes and may aid the personalisation and design of future anal cancer clinical trials through contributing to a better understanding of patient risk stratification.
Collapse
Affiliation(s)
- Stelios Theophanous
- Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK.
| | - Per-Ivar Lønne
- Department of Medical Physics, Oslo University Hospital, Oslo, Norway
| | - Ananya Choudhury
- MAASTRO (Dept of Radiotherapy), GROW School of Oncology and Developmental Biology, Maastricht University and Maastricht University Medical Centre+, P. Debyelaan 25, 6229, Maastricht, Netherlands
| | - Maaike Berbee
- MAASTRO (Dept of Radiotherapy), GROW School of Oncology and Developmental Biology, Maastricht University and Maastricht University Medical Centre+, P. Debyelaan 25, 6229, Maastricht, Netherlands
| | - Andre Dekker
- MAASTRO (Dept of Radiotherapy), GROW School of Oncology and Developmental Biology, Maastricht University and Maastricht University Medical Centre+, P. Debyelaan 25, 6229, Maastricht, Netherlands
| | | | | | | | - Alexandra Gilbert
- Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK
| | - Marianne Grønlie Guren
- Department of Oncology, Oslo University Hospital, and Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Lois Holloway
- Ingham Research Institute and Liverpool Hospital, Liverpool, New South Wales, Australia
| | | | | | | | | | | | | | - Rajarshi Roy
- Hull University Teaching Hospitals NHS Trust, Hull, UK
| | - Andrew Scarsbrook
- Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | | | | | | | - Baukelien van Triest
- The Netherlands Cancer Institute-Antoni van Leeuwenhoek (NKI-AVL), Amsterdam, The Netherlands
| | | | - Eirik Malinen
- Department of Medical Physics, Oslo University Hospital, Oslo, Norway
| | - Leonard Wee
- MAASTRO (Dept of Radiotherapy), GROW School of Oncology and Developmental Biology, Maastricht University and Maastricht University Medical Centre+, P. Debyelaan 25, 6229, Maastricht, Netherlands
| | - Ane L Appelt
- Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
| |
Collapse
|
19
|
Salman G, Aldujaily E, Jabardi M, Qassid OL. Investigating the clinical significance of EGFR expression using machine learning in a series of Iraqi patients with triple-negative breast cancer. J Med Life 2022; 15:967-978. [PMID: 36188649 PMCID: PMC9514808 DOI: 10.25122/jml-2021-0401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 03/14/2022] [Indexed: 11/05/2022] Open
Abstract
Breast cancer is a heterogeneous disease with a distinct profile of the expression of each tumor. Triple-negative breast cancer (TNBC) is a molecular subtype of breast cancer characterized by an aggressive clinical behavior linked to loss or reduced expression of estrogen, progesterone, and Her2/neu receptors. The study's main objective was to investigate the clinical significance of epidermal growth factor receptor (EGFR) overexpression in a series of Iraqi patients with TNBC. The sectional analytic study involved immunohistochemical analysis of EGFR expression in randomly selected 53 formalin fixed paraffin embedded tissue blocks of TNBC cases out of 127 Iraqi patients with TNBC and correlated expression data with clinicopathological parameters including survival time. Machine learning (statistical tests and principal component analysis (PCA)) was used to predict the outcome of the patients using EGFR expression data together with clinicopathological parameters. EGFR was expressed in approximately 28% of TNBC cases. We estimated the risk of mortality and distant metastasis based on EGFR expression and clinicopathologic factors using the principal component analysis (PCA) model. We found a substantial positive correlation between clinical stage and distant metastasis, clinical stage and death, death and distant metastasis, and death and positive EGFR expression. Overall, EGFR expression was linked to a poor prognosis and increased mortality. A higher risk of distant metastasis and death was associated with an advanced clinical stage of the tumor. Furthermore, the existence of distant metastases increased the risk of death. These findings raise the possibility of using EGFR expression data with other clinicopathological parameters to predict the outcome of patients with TNBC.
Collapse
Affiliation(s)
- Gufran Salman
- Department of Basic Science, Faculty of Dentistry, University of Kufa, Kufa, Iraq
| | - Esraa Aldujaily
- Department of Pathology and Forensic Medicine, Faculty of Medicine, University of Kufa, Kufa, Iraq,Corresponding Author: Esraa Aldujaily, Department of Pathology and Forensic Medicine, Faculty of Medicine, University of Kufa, Kufa, Iraq. E-mail:
| | - Mohammed Jabardi
- Department of Computer Science, College of Education, University of Kufa, Kufa, Iraq
| | - Omar Layth Qassid
- Cancer Research Center, University of Leicester, Leicester City, United Kingdom
| |
Collapse
|
20
|
Yu C, Wang J. Data mining and mathematical models in cancer prognosis and prediction. MEDICAL REVIEW (BERLIN, GERMANY) 2022; 2:285-307. [PMID: 37724193 PMCID: PMC10388766 DOI: 10.1515/mr-2021-0026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 12/29/2021] [Indexed: 09/20/2023]
Abstract
Cancer is a fetal and complex disease. Individual differences of the same cancer type or the same patient at different stages of cancer development may require distinct treatments. Pathological differences are reflected in tissues, cells and gene levels etc. The interactions between the cancer cells and nearby microenvironments can also influence the cancer progression and metastasis. It is a huge challenge to understand all of these mechanistically and quantitatively. Researchers applied pattern recognition algorithms such as machine learning or data mining to predict cancer types or classifications. With the rapidly growing and available computing powers, researchers begin to integrate huge data sets, multi-dimensional data types and information. The cells are controlled by the gene expressions determined by the promoter sequences and transcription regulators. For example, the changes in the gene expression through these underlying mechanisms can modify cell progressing in the cell-cycle. Such molecular activities can be governed by the gene regulations through the underlying gene regulatory networks, which are essential for cancer study when the information and gene regulations are clear and available. In this review, we briefly introduce several machine learning methods of cancer prediction and classification which include Artificial Neural Networks (ANNs), Decision Trees (DTs), Support Vector Machine (SVM) and naive Bayes. Then we describe a few typical models for building up gene regulatory networks such as Correlation, Regression and Bayes methods based on available data. These methods can help on cancer diagnosis such as susceptibility, recurrence, survival etc. At last, we summarize and compare the modeling methods to analyze the development and progression of cancer through gene regulatory networks. These models can provide possible physical strategies to analyze cancer progression in a systematic and quantitative way.
Collapse
Affiliation(s)
- Chong Yu
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
- Department of Statistics, JiLin University of Finance and Economics, Changchun, Jilin Province, China
| | - Jin Wang
- Department of Chemistry and of Physics and Astronomy, State University of New York, Stony Brook, NY, USA
| |
Collapse
|
21
|
Medical Internet-of-Things Based Breast Cancer Diagnosis Using Hyperparameter-Optimized Neural Networks. FUTURE INTERNET 2022. [DOI: 10.3390/fi14050153] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
In today’s healthcare setting, the accurate and timely diagnosis of breast cancer is critical for recovery and treatment in the early stages. In recent years, the Internet of Things (IoT) has experienced a transformation that allows the analysis of real-time and historical data using artificial intelligence (AI) and machine learning (ML) approaches. Medical IoT combines medical devices and AI applications with healthcare infrastructure to support medical diagnostics. The current state-of-the-art approach fails to diagnose breast cancer in its initial period, resulting in the death of most women. As a result, medical professionals and researchers are faced with a tremendous problem in early breast cancer detection. We propose a medical IoT-based diagnostic system that competently identifies malignant and benign people in an IoT environment to resolve the difficulty of identifying early-stage breast cancer. The artificial neural network (ANN) and convolutional neural network (CNN) with hyperparameter optimization are used for malignant vs. benign classification, while the Support Vector Machine (SVM) and Multilayer Perceptron (MLP) were utilized as baseline classifiers for comparison. Hyperparameters are important for machine learning algorithms since they directly control the behaviors of training algorithms and have a significant effect on the performance of machine learning models. We employ a particle swarm optimization (PSO) feature selection approach to select more satisfactory features from the breast cancer dataset to enhance the classification performance using MLP and SVM, while grid-based search was used to find the best combination of the hyperparameters of the CNN and ANN models. The Wisconsin Diagnostic Breast Cancer (WDBC) dataset was used to test the proposed approach. The proposed model got a classification accuracy of 98.5% using CNN, and 99.2% using ANN.
Collapse
|
22
|
Prognosis Model of Advanced Non-Small-Cell Lung Cancer Based on Max-Min Hill-Climbing Algorithm. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:9173913. [PMID: 35371284 PMCID: PMC8975666 DOI: 10.1155/2022/9173913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 12/12/2021] [Accepted: 03/07/2022] [Indexed: 11/17/2022]
Abstract
A safer and more effective treatment is need for the comprehensive treatment based on chemotherapy in patients with advanced non-small-cell lung cancer (NSCLC). The max-min hill-climbing (MMHC) is a common algorithm for disease prediction. This study is aimed at analyzing the efficacy of the MMHC algorithm in prognosis evaluation of advanced NSCLC. In this study, the prognosis model of lung cancer was first established by the MMHC algorithm. Then, according to the MMHC algorithm results, 40 patients with advanced NSCLC were divided into the research group and control group before anlotinib hydrochloride capsule combined with pemetrexed disodium chemotherapy. The diameter of solid tumor lesions, objective response rate (ORR), disease control rate (DCR), and progression-free survival (PFS) was compared between the two groups. The results showed that the MMHC model has a higher prediction accuracy of survival status of lung cancer patients. Under the guidance of the model, the research group has a smaller diameter of primary foci and metastatic foci, a higher ORR, DCR, and a longer PFS than the control group (P < 0.05). We can conclude that the MMHC algorithm can guide the maintenance treatment of advanced NSCLC, which is conducive to the prognosis judgment and treatment cost control.
Collapse
|
23
|
Li H, Liu RB, Long CM, Teng Y, Cheng L, Liu Y. Development and Validation of a New Multiparametric Random Survival Forest Predictive Model for Breast Cancer Recurrence with a Potential Benefit to Individual Outcomes. Cancer Manag Res 2022; 14:909-923. [PMID: 35256862 PMCID: PMC8898179 DOI: 10.2147/cmar.s346871] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 01/27/2022] [Indexed: 12/24/2022] Open
Abstract
Purpose Breast cancer (BC) is a multi-factorial disease. Its individual prognosis varies; thus, individualized patient profiling is instrumental to improving BC management and individual outcomes. An economical, multiparametric, and practical model to predict BC recurrence is needed. Patients and Methods We retrospectively investigated the clinical data of BC patients treated at the Third Affiliated Hospital of Sun Yat-sen University and Liuzhou Women and Children’s Medical Center from January 2013 to December 2020. Random forest-recursive feature elimination (run by R caret package) was used to determine the best variable set, and the random survival forest method was used to develop a predictive model for BC recurrence. Results The training and validations sets included 623 and 151 patients, respectively. We selected 14 variables, the pathological (TNM) stage, gamma-glutamyl transpeptidase, total cholesterol, Ki-67, lymphocyte count, low-density lipoprotein, age, apolipoprotein B, high-density lipoprotein, globulin, neutrophil count to lymphocyte count ratio, alanine aminotransferase, triglyceride, and albumin to globulin ratio, using random survival forest (RSF)-recursive feature elimination. We developed a recurrence prediction model using RSF. Using area under the receiver operating characteristic curve and Kaplan–Meier survival analyses, the model performance was determined to be accurate. C-indexes were 0.997 and 0.936 for the training and validation sets, respectively. Conclusion The model could accurately predict BC recurrence. It aids clinicians in identifying high-risk patients and making treatment decisions for Breast cancer patients in China. This new multiparametric RSF model is instrumental for breast cancer recurrence prediction and potentially improves individual outcomes.
Collapse
Affiliation(s)
- Huan Li
- Department of Thyroid and Breast Surgery, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, People’s Republic of China
| | - Ren-Bin Liu
- Department of Thyroid and Breast Surgery, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, People’s Republic of China
| | - Chen-Meng Long
- Department of Breast Surgery, Liuzhou Women and Children’s Medical Center, Liuzhou, Guangxi, People’s Republic Of China
| | - Yuan Teng
- Department of Breast Surgery, Guangzhou Women and Children’s Medical Center, Guangzhou, Guangdong, People’s Republic of China
| | - Lin Cheng
- Department of Thyroid and Breast Surgery, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, People’s Republic of China
| | - Yu Liu
- Department of Thyroid and Breast Surgery, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, People’s Republic of China
- Correspondence: Yu Liu, Tel +8613560170809, Fax +86 20 85252154, Email
| |
Collapse
|
24
|
Thomas LB, Mastorides SM, Viswanadhan NA, Jakey CE, Borkowski AA. Artificial Intelligence: Review of Current and Future Applications in Medicine. Fed Pract 2022; 38:527-538. [PMID: 35136337 DOI: 10.12788/fp.0174] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Background The role of artificial intelligence (AI) in health care is expanding rapidly. Currently, there are at least 29 US Food and Drug Administration-approved AI health care devices that apply to numerous medical specialties and many more are in development. Observations With increasing expectations for all health care sectors to deliver timely, fiscally-responsible, high-quality health care, AI has potential utility in numerous areas, such as image analysis, improved workflow and efficiency, public health, and epidemiology, to aid in processing large volumes of patient and medical data. In this review, we describe basic terminology, principles, and general AI applications relating to health care. We then discuss current and future applications for a variety of medical specialties. Finally, we discuss the future potential of AI along with the potential risks and limitations of current AI technology. Conclusions AI can improve diagnostic accuracy, increase patient safety, assist with patient triage, monitor disease progression, and assist with treatment decisions.
Collapse
Affiliation(s)
- L Brannon Thomas
- James A. Haley Veterans' Hospital, Tampa, Florida.,University of South Florida, Morsani College of Medicine, Tampa
| | - Stephen M Mastorides
- James A. Haley Veterans' Hospital, Tampa, Florida.,University of South Florida, Morsani College of Medicine, Tampa
| | | | - Colleen E Jakey
- James A. Haley Veterans' Hospital, Tampa, Florida.,University of South Florida, Morsani College of Medicine, Tampa
| | - Andrew A Borkowski
- James A. Haley Veterans' Hospital, Tampa, Florida.,University of South Florida, Morsani College of Medicine, Tampa
| |
Collapse
|
25
|
Rasool R, Ullah I, Mubeen B, Alshehri S, Imam SS, Ghoneim MM, Alzarea SI, Al-Abbasi FA, Murtaza BN, Kazmi I, Nadeem MS. Theranostic Interpolation of Genomic Instability in Breast Cancer. Int J Mol Sci 2022; 23:ijms23031861. [PMID: 35163783 PMCID: PMC8836911 DOI: 10.3390/ijms23031861] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Revised: 01/26/2022] [Accepted: 01/27/2022] [Indexed: 12/14/2022] Open
Abstract
Breast cancer is a diverse disease caused by mutations in multiple genes accompanying epigenetic aberrations of hazardous genes and protein pathways, which distress tumor-suppressor genes and the expression of oncogenes. Alteration in any of the several physiological mechanisms such as cell cycle checkpoints, DNA repair machinery, mitotic checkpoints, and telomere maintenance results in genomic instability. Theranostic has the potential to foretell and estimate therapy response, contributing a valuable opportunity to modify the ongoing treatments and has developed new treatment strategies in a personalized manner. “Omics” technologies play a key role while studying genomic instability in breast cancer, and broadly include various aspects of proteomics, genomics, metabolomics, and tumor grading. Certain computational techniques have been designed to facilitate the early diagnosis of cancer and predict disease-specific therapies, which can produce many effective results. Several diverse tools are used to investigate genomic instability and underlying mechanisms. The current review aimed to explore the genomic landscape, tumor heterogeneity, and possible mechanisms of genomic instability involved in initiating breast cancer. We also discuss the implications of computational biology regarding mutational and pathway analyses, identification of prognostic markers, and the development of strategies for precision medicine. We also review different technologies required for the investigation of genomic instability in breast cancer cells, including recent therapeutic and preventive advances in breast cancer.
Collapse
Affiliation(s)
- Rabia Rasool
- Institute of Molecular Biology and Biotechnology, The University of Lahore, Lahore 54000, Pakistan; (R.R.); (I.U.); (B.M.)
| | - Inam Ullah
- Institute of Molecular Biology and Biotechnology, The University of Lahore, Lahore 54000, Pakistan; (R.R.); (I.U.); (B.M.)
| | - Bismillah Mubeen
- Institute of Molecular Biology and Biotechnology, The University of Lahore, Lahore 54000, Pakistan; (R.R.); (I.U.); (B.M.)
| | - Sultan Alshehri
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia; (S.A.); (S.S.I.)
| | - Syed Sarim Imam
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia; (S.A.); (S.S.I.)
| | - Mohammed M. Ghoneim
- Department of Pharmacy Practice, College of Pharmacy, AlMaarefa University, Ad Diriyah 13713, Saudi Arabia;
| | - Sami I. Alzarea
- Department of Pharmacology, College of Pharmacy, Jouf University, Sakaka 72341, Saudi Arabia;
| | - Fahad A. Al-Abbasi
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
| | - Bibi Nazia Murtaza
- Department of Zoology, Abbottabad University of Science and Technology (AUST), Abbottabad 22310, Pakistan;
| | - Imran Kazmi
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
- Correspondence: (I.K.); (M.S.N.)
| | - Muhammad Shahid Nadeem
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
- Correspondence: (I.K.); (M.S.N.)
| |
Collapse
|
26
|
Kaur I, Doja M, Ahmad T. Data Mining and Machine Learning in Cancer Survival Research: An Overview and Future Recommendations. J Biomed Inform 2022; 128:104026. [DOI: 10.1016/j.jbi.2022.104026] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 02/07/2022] [Accepted: 02/09/2022] [Indexed: 12/29/2022]
|
27
|
Gasparini A, Humphreys K. Estimating latent, dynamic processes of breast cancer tumour growth and distant metastatic spread from mammography screening data. Stat Methods Med Res 2022; 31:862-881. [PMID: 35103530 PMCID: PMC9099158 DOI: 10.1177/09622802211072496] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
We propose a framework for jointly modelling tumour size at diagnosis and time to
distant metastatic spread, from diagnosis, based on latent dynamic sub-models of
growth of the primary tumour and of distant metastatic detection. The framework
also includes a sub-model for screening sensitivity as a function of latent
tumour size. Our approach connects post-diagnosis events to the natural history
of cancer and, once refined, may prove useful for evaluating new interventions,
such as personalised screening regimes. We evaluate our model-fitting procedure
using Monte Carlo simulation, showing that the estimation algorithm can retrieve
the correct model parameters, that key patterns in the data can be captured by
the model even with misspecification of some structural assumptions, and that,
still, with enough data it should be possible to detect strong
misspecifications. Furthermore, we fit our model to observational data from an
extension of a case-control study of post-menopausal breast cancer in Sweden,
providing model-based estimates of the probability of being free from detected
distant metastasis as a function of tumour size, mode of detection (of the
primary tumour), and screening history. For women with screen-detected cancer
and two previous negative screens, the probabilities of being free from detected
distant metastases 5 years after detection and removal of the primary tumour are
0.97, 0.89 and 0.59 for tumours of diameter 5, 15 and 35 mm, respectively. We
also study the probability of having latent/dormant metastases at detection of
the primary tumour, estimating that 33% of patients in our study had such
metastases.
Collapse
Affiliation(s)
- Alessandro Gasparini
- Alessandro Gasparini, Department of Medical
Epidemiology and Biostatistics, Karolinska Institutet, PO Box 281, SE-17177,
Stockholm, Sweden.
| | | |
Collapse
|
28
|
Park JS, Kim DW, Lee D, Lee T, Koo KC, Han WK, Chung BH, Lee KS. Development of prediction models of spontaneous ureteral stone passage through machine learning: Comparison with conventional statistical analysis. PLoS One 2021; 16:e0260517. [PMID: 34851999 PMCID: PMC8635399 DOI: 10.1371/journal.pone.0260517] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Accepted: 11/11/2021] [Indexed: 12/22/2022] Open
Abstract
Objectives To develop a prediction model of spontaneous ureteral stone passage (SSP) using machine learning and logistic regression and compare the performance of the two models. Indications for management of ureteral stones are unclear, and the clinician determines whether to wait for SSP or perform active treatment, especially in well-controlled patients, to avoid unwanted complications. Therefore, suggesting the possibility of SSP would help make a clinical decision regarding ureteral stones. Methods Patients diagnosed with unilateral ureteral stones at our emergency department between August 2014 and September 2018 were included and underwent non-contrast-enhanced computed tomography 4 weeks from the first stone episode. Predictors of SSP were applied to build and validate the prediction model using multilayer perceptron (MLP) with the Keras framework. Results Of 833 patients, SSP was observed in 606 (72.7%). SSP rates were 68.2% and 75.6% for stone sizes 5–10 mm and <5 mm, respectively. Stone opacity, location, and whether it was the first ureteral stone episode were significant predictors of SSP. Areas under the curve (AUCs) for receiver operating characteristic (ROC) curves for MLP, and logistic regression were 0.859 and 0.847, respectively, for stones <5 mm, and 0.881 and 0.817, respectively, for 5–10 mm stones. Conclusion SSP prediction models were developed in patients with well-controlled unilateral ureteral stones; the performance of the models was good, especially in identifying SSP for 5–10-mm ureteral stones without definite treatment guidelines. To further improve the performance of these models, future studies should focus on using machine learning techniques in image analysis.
Collapse
Affiliation(s)
- Jee Soo Park
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
- Department of Urology, Sorokdo National Hospital, Goheung, Korea
| | - Dong Wook Kim
- Department of Oral and Maxillofacial Surgery, Yonsei University College of Dentistry, Seoul, Korea
| | - Dongu Lee
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
| | - Taeju Lee
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
| | - Kyo Chul Koo
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
| | - Woong Kyu Han
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
| | - Byung Ha Chung
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
| | - Kwang Suk Lee
- Department of Urology, Yonsei University College of Medicine, Seoul, Korea
- Department of Mechanical Engineering, Yonsei University College of Engineering, Seoul, Korea
- * E-mail:
| |
Collapse
|
29
|
Emmert-Streib F, Manjang K, Dehmer M, Yli-Harja O, Auvinen A. Are There Limits in Explainability of Prognostic Biomarkers? Scrutinizing Biological Utility of Established Signatures. Cancers (Basel) 2021; 13:cancers13205087. [PMID: 34680236 PMCID: PMC8533990 DOI: 10.3390/cancers13205087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 10/01/2021] [Accepted: 10/05/2021] [Indexed: 11/30/2022] Open
Abstract
Prognostic biomarkers can have an important role in the clinical practice because they allow stratification of patients in terms of predicting the outcome of a disorder. Obstacles for developing such markers include lack of robustness when using different data sets and limited concordance among similar signatures. In this paper, we highlight a new problem that relates to the biological meaning of already established prognostic gene expression signatures. Specifically, it is commonly assumed that prognostic markers provide sensible biological information and molecular explanations about the underlying disorder. However, recent studies on prognostic biomarkers investigating 80 established signatures of breast and prostate cancer demonstrated that this is not the case. We will show that this surprising result is related to the distinction between causal models and predictive models and the obfuscating usage of these models in the biomedical literature. Furthermore, we suggest a falsification procedure for studies aiming to establish a prognostic signature to safeguard against false expectations with respect to biological utility.
Collapse
Affiliation(s)
- Frank Emmert-Streib
- Predictive Society and Data Analytics Lab, Faculty of Information Technology and Communication Sciences, Tampere University, 33720 Tampere, Finland;
- Correspondence:
| | - Kalifa Manjang
- Predictive Society and Data Analytics Lab, Faculty of Information Technology and Communication Sciences, Tampere University, 33720 Tampere, Finland;
| | - Matthias Dehmer
- Department of Computer Science, Swiss Distance University of Applied Sciences, 3900 Brig, Switzerland;
- Department of Mechatronics and Biomedical Computer Science, UMIT, 6060 Hall in Tyrol, Austria
- College of Artificial Intelligence, Nankai University, Tianjin 300350, China
| | - Olli Yli-Harja
- Computational Systems Biology, Faculty of Medicine and Health Technology, Tampere University, 33720 Tampere, Finland;
- Institute for Systems Biology, Seattle, WA 98195, USA
- Institute of Biosciences and Medical Technology, 33720 Tampere, Finland
| | - Anssi Auvinen
- Unit of Health Sciences, Faculty of Social Sciences, Tampere University, 33720 Tampere, Finland;
| |
Collapse
|
30
|
Hassanzadeh HR, Wang MD. An Integrated Deep Network for Cancer Survival Prediction Using Omics Data. Front Big Data 2021; 4:568352. [PMID: 34337396 PMCID: PMC8322661 DOI: 10.3389/fdata.2021.568352] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 06/01/2021] [Indexed: 12/22/2022] Open
Abstract
As a highly sophisticated disease that humanity faces, cancer is known to be associated with dysregulation of cellular mechanisms in different levels, which demands novel paradigms to capture informative features from different omics modalities in an integrated way. Successful stratification of patients with respect to their molecular profiles is a key step in precision medicine and in tailoring personalized treatment for critically ill patients. In this article, we use an integrated deep belief network to differentiate high-risk cancer patients from the low-risk ones in terms of the overall survival. Our study analyzes RNA, miRNA, and methylation molecular data modalities from both labeled and unlabeled samples to predict cancer survival and subsequently to provide risk stratification. To assess the robustness of our novel integrative analytics, we utilize datasets of three cancer types with 836 patients and show that our approach outperforms the most successful supervised and semi-supervised classification techniques applied to the same cancer prediction problems. In addition, despite the preconception that deep learning techniques require large size datasets for proper training, we have illustrated that our model can achieve better results for moderately sized cancer datasets.
Collapse
Affiliation(s)
- Hamid Reza Hassanzadeh
- School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA, United States
| | - May D. Wang
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, United States
| |
Collapse
|
31
|
Li X, Yang L, Yuan Z, Lou J, Fan Y, Shi A, Huang J, Zhao M, Wu Y. Multi-institutional development and external validation of machine learning-based models to predict relapse risk of pancreatic ductal adenocarcinoma after radical resection. J Transl Med 2021; 19:281. [PMID: 34193166 PMCID: PMC8243478 DOI: 10.1186/s12967-021-02955-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 06/19/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Surgical resection is the only potentially curative treatment for pancreatic ductal adenocarcinoma (PDAC) and the survival of patients after radical resection is closely related to relapse. We aimed to develop models to predict the risk of relapse using machine learning methods based on multiple clinical parameters. METHODS Data were collected and analysed of 262 PDAC patients who underwent radical resection at 3 institutions between 2013 and 2017, with 183 from one institution as a training set, 79 from the other 2 institution as a validation set. We developed and compared several predictive models to predict 1- and 2-year relapse risk using machine learning approaches. RESULTS Machine learning techniques were superior to conventional regression-based analyses in predicting risk of relapse of PDAC after radical resection. Among them, the random forest (RF) outperformed other methods in the training set. The highest accuracy and area under the receiver operating characteristic curve (AUROC) for predicting 1-year relapse risk with RF were 78.4% and 0.834, respectively, and for 2-year relapse risk were 95.1% and 0.998. However, the support vector machine (SVM) model showed better performance than the others for predicting 1-year relapse risk in the validation set. And the k neighbor algorithm (KNN) model achieved the highest accuracy and AUROC for predicting 2-year relapse risk. CONCLUSIONS By machine learning, this study has developed and validated comprehensive models integrating clinicopathological characteristics to predict the relapse risk of PDAC after radical resection which will guide the development of personalized surveillance programs after surgery.
Collapse
Affiliation(s)
- Xiawei Li
- Department of Surgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Cancer Institute, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China
- Cancer Center, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Litao Yang
- Department of Surgery, Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Hangzhou, 310000, Zhejiang, China
| | - Zheping Yuan
- Hessian Health Technology Co., Ltd, Beijing, 100007, China
| | - Jianyao Lou
- Department of Surgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Cancer Institute, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China
- Cancer Center, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Yiqun Fan
- Department of Surgery, Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, 322000, Zhejiang, China
| | - Aiguang Shi
- Department of Surgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Cancer Institute, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China
- Cancer Center, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Junjie Huang
- Department of Surgery, Changxing People's Hospital, Huzhou, 313100, Zhejiang, China
| | - Mingchen Zhao
- Hessian Health Technology Co., Ltd, Beijing, 100007, China
| | - Yulian Wu
- Department of Surgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China.
- Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Cancer Institute, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, Zhejiang, China.
- Cancer Center, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| |
Collapse
|
32
|
Yoo YC, Park S, Kim HJ, Jung HE, Kim JY, Kim MH. Preoperative Routine Laboratory Markers for Predicting Postoperative Recurrence and Death in Patients with Breast Cancer. J Clin Med 2021; 10:2610. [PMID: 34199276 PMCID: PMC8231951 DOI: 10.3390/jcm10122610] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 04/26/2021] [Accepted: 06/11/2021] [Indexed: 12/14/2022] Open
Abstract
Simple, convenient, and reliable preoperative prognostic indicators are needed to estimate the future risk of recurrences and guide the treatment decisions associated with breast cancer. We evaluated preoperative hematological markers related to recurrence and mortality and investigated independent risk factors for recurrence and mortality in patients after breast cancer surgery. We reviewed electronic medical records of patients with invasive breast cancer diagnosed at our tertiary institution between November 2005 and December 2010 and followed them until 2015. We compared two groups of patients classified according to recurrence or death and identified risk factors for postoperative outcomes. Data from 1783 patients were analyzed ultimately. Cancer antigen (CA) 15-3 and red cell distribution width (RDW) had the highest area under the curve values among several preoperative hematological markers for disease-free survival and overall survival (0.590 and 0.637, respectively). Patients with both preoperative CA 15-3 levels over 11.4 and RDW over 13.5 had a 1.7-fold higher risk of recurrence (hazard ratio (HR): 1.655; 95% confidence interval (CI): 1.154-2.374; p = 0.007) and mortality (HR: 1.723; 95% CI: 1.098-2.704; p = 0.019). In conclusion, relatively high preoperative RDW (>13.5) and CA 15-3 levels (>11.4) had the highest predictive power for mortality and recurrence, respectively. When RDW and CA 15-3 exceeded the cut-off value, the risk of recurrence and death also increased approximately 1.7 times.
Collapse
Affiliation(s)
- Young-Chul Yoo
- Department of Anesthesiology and Pain Medicine, Anesthesia and Pain Research Institute, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul 03722, Korea; (Y.-C.Y.); (H.-J.K.)
| | - Seho Park
- Devision of Breast Cancer, Department of General Surgery, Anesthesia and Pain Research Institute, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul 03722, Korea;
| | - Hyun-Joo Kim
- Department of Anesthesiology and Pain Medicine, Anesthesia and Pain Research Institute, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul 03722, Korea; (Y.-C.Y.); (H.-J.K.)
| | - Hyun-Eom Jung
- Department of Anesthesiology and Pain Medicine, Yonsei University College of Medicine, Gangnam Severance Hospital, 211 Eonju-ro, Gangnam-gu, Seoul 06273, Korea; (H.-E.J.); (J.-Y.K.)
| | - Ji-Young Kim
- Department of Anesthesiology and Pain Medicine, Yonsei University College of Medicine, Gangnam Severance Hospital, 211 Eonju-ro, Gangnam-gu, Seoul 06273, Korea; (H.-E.J.); (J.-Y.K.)
| | - Myoung-Hwa Kim
- Department of Anesthesiology and Pain Medicine, Anesthesia and Pain Research Institute, Yonsei University College of Medicine, Gangnam Severance Hospital, 211 Eonju-ro, Gangnam-gu, Seoul 06273, Korea
| |
Collapse
|
33
|
Yang PT, Wu WS, Wu CC, Shih YN, Hsieh CH, Hsu JL. Breast cancer recurrence prediction with ensemble methods and cost-sensitive learning. Open Med (Wars) 2021; 16:754-768. [PMID: 34027105 PMCID: PMC8122465 DOI: 10.1515/med-2021-0282] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 03/11/2021] [Accepted: 04/03/2021] [Indexed: 11/15/2022] Open
Abstract
Breast cancer is one of the most common cancers in women all over the world. Due to the improvement of medical treatments, most of the breast cancer patients would be in remission. However, the patients have to face the next challenge, the recurrence of breast cancer which may cause more severe effects, and even death. The prediction of breast cancer recurrence is crucial for reducing mortality. This paper proposes a prediction model for the recurrence of breast cancer based on clinical nominal and numeric features. In this study, our data consist of 1,061 patients from Breast Cancer Registry from Shin Kong Wu Ho-Su Memorial Hospital between 2011 and 2016, in which 37 records are denoted as breast cancer recurrence. Each record has 85 features. Our approach consists of three stages. First, we perform data preprocessing and feature selection techniques to consolidate the dataset. Among all features, six features are identified for further processing in the following stages. Next, we apply resampling techniques to resolve the issue of class imbalance. Finally, we construct two classifiers, AdaBoost and cost-sensitive learning, to predict the risk of recurrence and carry out the performance evaluation in three-fold cross-validation. By applying the AdaBoost method, we achieve accuracy of 0.973 and sensitivity of 0.675. By combining the AdaBoost and cost-sensitive method of our model, we achieve a reasonable accuracy of 0.468 and substantially high sensitivity of 0.947 which guarantee almost no false dismissal. Our model can be used as a supporting tool in the setting and evaluation of the follow-up visit for early intervention and more advanced treatments to lower cancer mortality.
Collapse
Affiliation(s)
- Pei-Tse Yang
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Wen-Shuo Wu
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Chia-Chun Wu
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Yi-Nuo Shih
- Department of Occupational Therapy, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| | - Chung-Ho Hsieh
- Department of General Surgery, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, Taiwan, Republic of China
| | - Jia-Lien Hsu
- Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, Republic of China
| |
Collapse
|
34
|
Rahman MM, Paul KC, Hossain MA, Ali GGMN, Rahman MS, Thill JC. Machine Learning on the COVID-19 Pandemic, Human Mobility and Air Quality: A Review. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2021; 9:72420-72450. [PMID: 34786314 PMCID: PMC8545207 DOI: 10.1109/access.2021.3079121] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Accepted: 05/07/2021] [Indexed: 05/19/2023]
Abstract
The ongoing COVID-19 global pandemic is touching every facet of human lives (e.g., public health, education, economy, transportation, and the environment). This novel pandemic and non-pharmaceutical interventions of lockdown and confinement implemented citywide, regionally or nationally are affecting virus transmission, people's travel patterns, and air quality. Many studies have been conducted to predict the diffusion of the COVID-19 disease, assess the impacts of the pandemic on human mobility and on air quality, and assess the impacts of lockdown measures on viral spread with a range of Machine Learning (ML) techniques. This literature review aims to analyze the results from past research to understand the interactions among the COVID-19 pandemic, lockdown measures, human mobility, and air quality. The critical review of prior studies indicates that urban form, people's socioeconomic and physical conditions, social cohesion, and social distancing measures significantly affect human mobility and COVID-19 viral transmission. During the COVID-19 pandemic, many people are inclined to use private transportation for necessary travel to mitigate coronavirus-related health problems. This review study also noticed that COVID-19 related lockdown measures significantly improve air quality by reducing the concentration of air pollutants, which in turn improves the COVID-19 situation by reducing respiratory-related sickness and deaths. It is argued that ML is a powerful, effective, and robust analytic paradigm to handle complex and wicked problems such as a global pandemic. This study also explores the spatio-temporal aspects of lockdown and confinement measures on coronavirus diffusion, human mobility, and air quality. Additionally, we discuss policy implications, which will be helpful for policy makers to take prompt actions to moderate the severity of the pandemic and improve urban environments by adopting data-driven analytic methods.
Collapse
Affiliation(s)
- Md. Mokhlesur Rahman
- The William States Lee College of EngineeringUniversity of North Carolina at CharlotteCharlotteNC28223USA
- Department of Urban and Regional PlanningKhulna University of Engineering and Technology (KUET)Khulna9203Bangladesh
| | - Kamal Chandra Paul
- Department of Electrical and Computer EngineeringThe William States Lee College of EngineeringUniversity of North Carolina at CharlotteCharlotteNC28223USA
| | - Md. Amjad Hossain
- Department of Computer Science, Mathematics and EngineeringShepherd UniversityShepherdstownWV25443USA
| | - G. G. Md. Nawaz Ali
- Department of Applied Computer ScienceUniversity of CharlestonCharlestonWV25304USA
| | - Md. Shahinoor Rahman
- Department of Earth and Environmental SciencesNew Jersey City UniversityJersey CityNJ07305USA
| | - Jean-Claude Thill
- Department of Geography and Earth SciencesSchool of Data ScienceUniversity of North Carolina at CharlotteCharlotteNC28223USA
| |
Collapse
|
35
|
Kim JY, Lee YS, Yu J, Park Y, Lee SK, Lee M, Lee JE, Kim SW, Nam SJ, Park YH, Ahn JS, Kang M, Im YH. Deep Learning-Based Prediction Model for Breast Cancer Recurrence Using Adjuvant Breast Cancer Cohort in Tertiary Cancer Center Registry. Front Oncol 2021; 11:596364. [PMID: 34017679 PMCID: PMC8129587 DOI: 10.3389/fonc.2021.596364] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 02/17/2021] [Indexed: 01/06/2023] Open
Abstract
Several prognosis prediction models have been developed for breast cancer (BC) patients with curative surgery, but there is still an unmet need to precisely determine BC prognosis for individual BC patients in real time. This is a retrospectively collected data analysis from adjuvant BC registry at Samsung Medical Center between January 2000 and December 2016. The initial data set contained 325 clinical data elements: baseline characteristics with demographics, clinical and pathologic information, and follow-up clinical information including laboratory and imaging data during surveillance. Weibull Time To Event Recurrent Neural Network (WTTE-RNN) by Martinsson was implemented for machine learning. We searched for the optimal window size as time-stamped inputs. To develop the prediction model, data from 13,117 patients were split into training (60%), validation (20%), and test (20%) sets. The median follow-up duration was 4.7 years and the median number of visits was 8.4. We identified 32 features related to BC recurrence and considered them in further analyses. Performance at a point of statistics was calculated using Harrell's C-index and area under the curve (AUC) at each 2-, 5-, and 7-year points. After 200 training epochs with a batch size of 100, the C-index reached 0.92 for the training data set and 0.89 for the validation and test data sets. The AUC values were 0.90 at 2-year point, 0.91 at 5-year point, and 0.91 at 7-year point. The deep learning-based final model outperformed three other machine learning-based models. In terms of pathologic characteristics, the median absolute error (MAE) and weighted mean absolute error (wMAE) showed great results of as little as 3.5%. This BC prognosis model to determine the probability of BC recurrence in real time was developed using information from the time of BC diagnosis and the follow-up period in RNN machine learning model.
Collapse
Affiliation(s)
- Ji-Yeon Kim
- Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Yong Seok Lee
- Digital Health Business Team, Samsung SDS, Seoul, South Korea
| | - Jonghan Yu
- Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Youngmin Park
- Digital Health Business Team, Samsung SDS, Seoul, South Korea
| | - Se Kyung Lee
- Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Minyoung Lee
- Digital Health Business Team, Samsung SDS, Seoul, South Korea
| | - Jeong Eon Lee
- Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Seok Won Kim
- Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Seok Jin Nam
- Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Yeon Hee Park
- Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Jin Seok Ahn
- Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Mira Kang
- Center for Health Promotion, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Young-Hyuck Im
- Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| |
Collapse
|
36
|
Elkhader J, Elemento O. Artificial intelligence in oncology: From bench to clinic. Semin Cancer Biol 2021; 84:113-128. [PMID: 33915289 DOI: 10.1016/j.semcancer.2021.04.013] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 03/22/2021] [Accepted: 04/15/2021] [Indexed: 02/07/2023]
Abstract
In the past few years, Artificial Intelligence (AI) techniques have been applied to almost every facet of oncology, from basic research to drug development and clinical care. In the clinical arena where AI has perhaps received the most attention, AI is showing promise in enhancing and automating image-based diagnostic approaches in fields such as radiology and pathology. Robust AI applications, which retain high performance and reproducibility over multiple datasets, extend from predicting indications for drug development to improving clinical decision support using electronic health record data. In this article, we review some of these advances. We also introduce common concepts and fundamentals of AI and its various uses, along with its caveats, to provide an overview of the opportunities and challenges in the field of oncology. Leveraging AI techniques productively to provide better care throughout a patient's medical journey can fuel the predictive promise of precision medicine.
Collapse
Affiliation(s)
- Jamal Elkhader
- HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Dept. of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, 10021, USA; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, 10065, USA; Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA
| | - Olivier Elemento
- HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Dept. of Physiology and Biophysics, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, 10021, USA; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, 10065, USA; Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA.
| |
Collapse
|
37
|
Banegas-Luna AJ, Peña-García J, Iftene A, Guadagni F, Ferroni P, Scarpato N, Zanzotto FM, Bueno-Crespo A, Pérez-Sánchez H. Towards the Interpretability of Machine Learning Predictions for Medical Applications Targeting Personalised Therapies: A Cancer Case Survey. Int J Mol Sci 2021; 22:4394. [PMID: 33922356 PMCID: PMC8122817 DOI: 10.3390/ijms22094394] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 04/16/2021] [Accepted: 04/20/2021] [Indexed: 12/18/2022] Open
Abstract
Artificial Intelligence is providing astonishing results, with medicine being one of its favourite playgrounds. Machine Learning and, in particular, Deep Neural Networks are behind this revolution. Among the most challenging targets of interest in medicine are cancer diagnosis and therapies but, to start this revolution, software tools need to be adapted to cover the new requirements. In this sense, learning tools are becoming a commodity but, to be able to assist doctors on a daily basis, it is essential to fully understand how models can be interpreted. In this survey, we analyse current machine learning models and other in-silico tools as applied to medicine-specifically, to cancer research-and we discuss their interpretability, performance and the input data they are fed with. Artificial neural networks (ANN), logistic regression (LR) and support vector machines (SVM) have been observed to be the preferred models. In addition, convolutional neural networks (CNNs), supported by the rapid development of graphic processing units (GPUs) and high-performance computing (HPC) infrastructures, are gaining importance when image processing is feasible. However, the interpretability of machine learning predictions so that doctors can understand them, trust them and gain useful insights for the clinical practice is still rarely considered, which is a factor that needs to be improved to enhance doctors' predictive capacity and achieve individualised therapies in the near future.
Collapse
Affiliation(s)
- Antonio Jesús Banegas-Luna
- Structural Bioinformatics and High-Performance Computing Research Group (BIO-HPC), Universidad Católica de Murcia (UCAM), 30107 Murcia, Spain; (J.P.-G.); (A.B.-C.)
| | - Jorge Peña-García
- Structural Bioinformatics and High-Performance Computing Research Group (BIO-HPC), Universidad Católica de Murcia (UCAM), 30107 Murcia, Spain; (J.P.-G.); (A.B.-C.)
| | - Adrian Iftene
- Faculty of Computer Science, Universitatea Alexandru Ioan Cuza (UAIC), 700505 Jashi, Romania;
| | - Fiorella Guadagni
- Interinstitutional Multidisciplinary Biobank (BioBIM), IRCCS San Raffaele Roma, 00166 Rome, Italy; (F.G.); (P.F.)
- Department of Human Sciences and Promotion of the Quality of Life, San Raffaele Roma Open University, 00166 Rome, Italy;
| | - Patrizia Ferroni
- Interinstitutional Multidisciplinary Biobank (BioBIM), IRCCS San Raffaele Roma, 00166 Rome, Italy; (F.G.); (P.F.)
- Department of Human Sciences and Promotion of the Quality of Life, San Raffaele Roma Open University, 00166 Rome, Italy;
| | - Noemi Scarpato
- Department of Human Sciences and Promotion of the Quality of Life, San Raffaele Roma Open University, 00166 Rome, Italy;
| | - Fabio Massimo Zanzotto
- Dipartimento di Ingegneria dell’Impresa “Mario Lucertini”, University of Rome Tor Vergata, 00133 Rome, Italy;
| | - Andrés Bueno-Crespo
- Structural Bioinformatics and High-Performance Computing Research Group (BIO-HPC), Universidad Católica de Murcia (UCAM), 30107 Murcia, Spain; (J.P.-G.); (A.B.-C.)
| | - Horacio Pérez-Sánchez
- Structural Bioinformatics and High-Performance Computing Research Group (BIO-HPC), Universidad Católica de Murcia (UCAM), 30107 Murcia, Spain; (J.P.-G.); (A.B.-C.)
| |
Collapse
|
38
|
|
39
|
Kim JY, Kim D, Jeon KJ, Kim H, Huh JK. Using deep learning to predict temporomandibular joint disc perforation based on magnetic resonance imaging. Sci Rep 2021; 11:6680. [PMID: 33758266 PMCID: PMC7988137 DOI: 10.1038/s41598-021-86115-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Accepted: 03/08/2021] [Indexed: 12/22/2022] Open
Abstract
The goal of this study was to develop a deep learning-based algorithm to predict temporomandibular joint (TMJ) disc perforation based on the findings of magnetic resonance imaging (MRI) and to validate its performance through comparison with previously reported results. The study objects were obtained by reviewing medical records from January 2005 to June 2018. 299 joints from 289 patients were divided into perforated and non-perforated groups based on the existence of disc perforation confirmed during surgery. Experienced observers interpreted the TMJ MRI images to extract features. Data containing those features were applied to build and validate prediction models using random forest and multilayer perceptron (MLP) techniques, the latter using the Keras framework, a recent deep learning architecture. The area under the receiver operating characteristic (ROC) curve (AUC) was used to compare the performances of the models. MLP produced the best performance (AUC 0.940), followed by random forest (AUC 0.918) and disc shape alone (AUC 0.791). The MLP and random forest were also superior to previously reported results using MRI (AUC 0.808) and MRI-based nomogram (AUC 0.889). Implementing deep learning showed superior performance in predicting disc perforation in TMJ compared to conventional methods and previous reports.
Collapse
Affiliation(s)
- Jae-Young Kim
- Department of Oral and Maxillofacial Surgery, Gangnam Severance Hospital, Yonsei University College of Dentistry, Seoul, Republic of Korea
| | - Dongwook Kim
- Department of Oral and Maxillofacial Surgery, Yonsei University College of Dentistry, Seoul, Republic of Korea
| | - Kug Jin Jeon
- Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, Seoul, Republic of Korea
| | - Hwiyoung Kim
- Department of Radiological Science, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jong-Ki Huh
- Department of Oral and Maxillofacial Surgery, Gangnam Severance Hospital, Yonsei University College of Dentistry, 211 Eonju-ro, Gangnam-gu, Seoul, 06273, Republic of Korea.
| |
Collapse
|
40
|
Massafra R, Latorre A, Fanizzi A, Bellotti R, Didonna V, Giotta F, La Forgia D, Nardone A, Pastena M, Ressa CM, Rinaldi L, Russo AOM, Tamborra P, Tangaro S, Zito A, Lorusso V. A Clinical Decision Support System for Predicting Invasive Breast Cancer Recurrence: Preliminary Results. Front Oncol 2021; 11:576007. [PMID: 33777733 PMCID: PMC7991309 DOI: 10.3389/fonc.2021.576007] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 01/22/2021] [Indexed: 12/20/2022] Open
Abstract
The mortality associated to breast cancer is in many cases related to metastasization and recurrence. Personalized treatment strategies are critical for the outcomes improvement of BC patients and the Clinical Decision Support Systems can have an important role in medical practice. In this paper, we present the preliminary results of a prediction model of the Breast Cancer Recurrence (BCR) within five and ten years after diagnosis. The main breast cancer-related and treatment-related features of 256 patients referred to Istituto Tumori “Giovanni Paolo II” of Bari (Italy) were used to train machine learning algorithms at the-state-of-the-art. Firstly, we implemented several feature importance techniques and then we evaluated the prediction performances of BCR within 5 and 10 years after the first diagnosis by means different classifiers. By using a small number of features, the models reached highly performing results both with reference to the BCR within 5 years and within 10 years with an accuracy of 77.50% and 80.39% and a sensitivity of 92.31% and 95.83% respectively, in the hold-out sample test. Despite validation studies are needed on larger samples, our results are promising for the development of a reliable prognostic supporting tool for clinicians in the definition of personalized treatment plans.
Collapse
Affiliation(s)
- Raffaella Massafra
- Struttura Semplice Dipartimentale di Fisica Sanitaria, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Agnese Latorre
- Unitá Opertiva Complessa di Oncologia Medica, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Annarita Fanizzi
- Struttura Semplice Dipartimentale di Fisica Sanitaria, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Roberto Bellotti
- Dipartimento di Fisica, Universitá degli Studi "Aldo Moro" e Istituto Nazionale di Fisica Nucleare - Sezione di Bari, Bari, Italy
| | - Vittorio Didonna
- Struttura Semplice Dipartimentale di Fisica Sanitaria, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Francesco Giotta
- Unitá Opertiva Complessa di Oncologia Medica, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Daniele La Forgia
- Struttura Semplice Dipartimentale di Radiologia Senologica, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Annalisa Nardone
- Unitá Opertiva Complessa di Radioterapia, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Maria Pastena
- Unitá Opertiva Complessa di Anatomia Patologica, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Cosmo Maurizio Ressa
- Unitá Opertiva Complessa di Chirurgia Plastica e Ricostruttiva, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Lucia Rinaldi
- Struttura Semplice Dipartimentale di Oncologia Per la Presa in Carico Globale del Paziente, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | | | - Pasquale Tamborra
- Struttura Semplice Dipartimentale di Fisica Sanitaria, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Sabina Tangaro
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Universitá degli Studi "Aldo Moro" e Istituto Nazionale di Fisica Nucleare - Sezione di Bari, Bari, Italy
| | - Alfredo Zito
- Unitá Opertiva Complessa di Anatomia Patologica, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Vito Lorusso
- Unitá Opertiva Complessa di Oncologia Medica, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| |
Collapse
|
41
|
Kourou K, Manikis G, Poikonen-Saksela P, Mazzocco K, Pat-Horenczyk R, Sousa B, Oliveira-Maia AJ, Mattson J, Roziner I, Pettini G, Kondylakis H, Marias K, Karademas E, Simos P, Fotiadis DI. A machine learning-based pipeline for modeling medical, socio-demographic, lifestyle and self-reported psychological traits as predictors of mental health outcomes after breast cancer diagnosis: An initial effort to define resilience effects. Comput Biol Med 2021; 131:104266. [PMID: 33607379 DOI: 10.1016/j.compbiomed.2021.104266] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 02/01/2021] [Accepted: 02/09/2021] [Indexed: 12/19/2022]
Abstract
Displaying resilience following a diagnosis of breast cancer is crucial for successful adaptation to illness, well-being, and health outcomes. Several theoretical and computational models have been proposed toward understanding the complex process of illness adaptation, involving a large variety of patient sociodemographic, lifestyle, medical, and psychological characteristics. To date, conventional multivariate statistical methods have been used extensively to model resilience. In the present work we describe a computational pipeline designed to identify the most prominent predictors of mental health outcomes following breast cancer diagnosis. A machine learning framework was developed and tested on the baseline data (recorded immediately post diagnosis) from an ongoing prospective, multinational study. This fully annotated dataset includes socio-demographic, lifestyle, medical and self-reported psychological characteristics of women recently diagnosed with breast cancer (N = 609). Nine different feature selection and cross-validated classification schemes were compared on their performance in classifying patients into low vs high depression symptom severity. Best-performing approaches involved a meta-estimator combined with a Support Vector Machines (SVMs) classification algorithm, exhibiting balanced accuracy of 0.825, and a fair balance between sensitivity (90%) and specificity (74%). These models consistently identified a set of psychological traits (optimism, perceived ability to cope with trauma, resilience as trait, ability to comprehend the illness), and subjective perceptions of personal functionality (physical, social, cognitive) as key factors accounting for concurrent depression symptoms. A comprehensive supervised learning pipeline is proposed for the identification of predictors of depression symptoms which could severely impede adaptation to illness.
Collapse
Affiliation(s)
- Konstantina Kourou
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, Greece; Foundation for Research and Technology-Hellas, Institute of Molecular Biology and Biotechnology, Department of Biomedical Research, Ioannina, Greece
| | - Georgios Manikis
- Computational Biomedicine Laboratory, FORTH-ICS, Heraklion, Greece
| | - Paula Poikonen-Saksela
- Helsinki University Hospital Comprehensive Cancer Center and Helsinki University, Finland
| | - Ketti Mazzocco
- Applied Research Division for Cognitive and Psychological Science, European Institute of Oncology, Milan, Italy; Department of Oncology and Hemato-oncology, University of Milan, Italy
| | - Ruth Pat-Horenczyk
- School of Social Work and Social Welfare,The Hebrew University of Jerusalem, Israel
| | - Berta Sousa
- Breast Unit, Champalimaud Clinical Centre/ Champalimaud Foundation, Champalimaud Research, Lisboa, Portugal
| | - Albino J Oliveira-Maia
- Champalimaud Research and Clinical Centre, Champalimaud Centre for the Unknown, Lisboa, Portugal; NOVA Medical School, NMS, Universidade Nova de Lisboa, Lisboa, Portugal
| | - Johanna Mattson
- Helsinki University Hospital Comprehensive Cancer Center and Helsinki University, Finland
| | - Ilan Roziner
- Department of Communication Disorders, Sackler Faculty of Medicine, Tel Aviv University, Israel
| | - Greta Pettini
- Applied Research Division for Cognitive and Psychological Science, European Institute of Oncology, Milan, Italy
| | | | - Kostas Marias
- Computational Biomedicine Laboratory, FORTH-ICS, Heraklion, Greece
| | - Evangelos Karademas
- Computational Biomedicine Laboratory, FORTH-ICS, Heraklion, Greece; Department of Psychology, University of Crete, Rethymno, Greece
| | - Panagiotis Simos
- Computational Biomedicine Laboratory, FORTH-ICS, Heraklion, Greece; School of Medicine, University of Crete, Heraklion, Greece
| | - Dimitrios I Fotiadis
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, Greece; Foundation for Research and Technology-Hellas, Institute of Molecular Biology and Biotechnology, Department of Biomedical Research, Ioannina, Greece.
| |
Collapse
|
42
|
Rao A, Dhahri AA, Razzaq H, Mokhtari E, Majeed A, Patel A. Algorithm-Based Online Software for Patients' Self-Referral to Breast Clinic as an Alternative to General Practitioner Referral Pathway. Cureus 2020; 12:e11740. [PMID: 33274168 PMCID: PMC7707138 DOI: 10.7759/cureus.11740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Introduction The study aimed to assess the accuracy of online software in the use of self-referral to breast surgery clinics for patients with new signs and symptoms. The study also evaluated the appropriateness of GP referrals to breast clinics and evaluated patients' perceptions of an online self-referral portal to the breast clinic for the assessment of breast signs and symptoms. Design and methods The pilot study was divided into two phases. In the first phase, prospective questionnaire-based data was collected from patients who were referred by a GP and presented to the regional breast unit with new signs and symptoms for breast conditions, Princess Alexandra Hospital NHS Trust (May - October 2018). The questionnaire assessed the time at each stage required by the patient to have a visit at the breast unit. It also asked the patient's opinion about an online self-referral portal to the surgical clinic. They were given hypothetical scenarios to evaluate their understanding of breast conditions. In the second phase, the patients presenting to symptomatic breast clinics were provided with the iPad to fill in their medical information in the online software. The data was collected between July and October 2019. The software algorithm was based on the National Institute of Clinical Health and Excellence (NICE) guidelines for breast conditions (2015). Breast surgeons’ recommendations acted as a standard to evaluate the accuracy of GPs' referrals and software outcome for each patient. Results There were 80 patients (mean age 49.1 [SD: 17.7], all females) included in the first phase of the study. The most common clinical presentation was a breast lump (47.6%), followed by breast pain (26.9%) and nipple changes (7.9%). Breast surgeons considered appropriate 75.6% of the referrals made by the GP. Seventy-two percent of the patients got an urgent appointment to see their GP, and 94.8% of the patients were urgently referred by their GP to see the breast surgeon. Only 37.8% of the urgent referrals were correctly referred as urgent. Having a direct online referral system for breast conditions will be beneficial for patients was agreed by 78.4%. The majority (98.1%) of the participants answered correctly for the hypothetical questions requiring breast surgeon review. In the second phase, there were a total of 86 patients with a mean age of 43.9 (SD: 13.3). The most common presentation was breast lump (n=68, 79.1%) and other presentations included breast pain, nipple changes, and discharge. The GPs' accuracy of correct referral was 69.1%. One third (30.9%) of the referrals could have been managed in the community or as a routine review by the breast surgeon. In comparison, the online software's accuracy was 85.1% accurate (p=0.001). The accuracy for detecting patients who needed urgent breast clinic review was 100% for online software. Conclusion A large proportion of referrals could have been dealt with in the community or referred routinely. Patients would prefer a direct online referral system to the breast clinic. They understand red flag signs and symptoms. Online software has the potential to streamline patients for symptomatic breast clinics. It can reduce the burden on the GPs who are constantly under pressure to diagnose patients accurately and refer to the correct specialty appropriately within a short time.
Collapse
Affiliation(s)
- Ahsan Rao
- Breast Surgery, Addenbrooke's Hospital, Cambridge University Hospitals NHS Foundation Trust, London, GBR
| | | | - Humayun Razzaq
- General Surgery, Southend University Hospital, Southend-on-Sea, GBR
| | - Eshagh Mokhtari
- Breast Surgery, Princess Alexandra Hospital NHS Trust, Harlow, GBR
| | - Azeem Majeed
- Primary Care and Public Health, Imperial College London School of Public Health, London, GBR
| | - Ashraf Patel
- Breast Surgery, Princess Alexandra Hospital NHS Trust, Harlow, GBR
| |
Collapse
|
43
|
Zhong X, Luo T, Deng L, Liu P, Hu K, Lu D, Zheng D, Luo C, Xie Y, Li J, He P, Pu T, Ye F, Bu H, Fu B, Zheng H. Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study. JMIR Med Inform 2020; 8:e19069. [PMID: 33164899 PMCID: PMC7683252 DOI: 10.2196/19069] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Revised: 08/07/2020] [Accepted: 09/16/2020] [Indexed: 02/05/2023] Open
Abstract
Background Current online prognostic prediction models for breast cancer, such as Adjuvant! Online and PREDICT, are based on specific populations. They have been well validated and widely used in the United States and Western Europe; however, several validation attempts in non-European countries have revealed suboptimal predictions. Objective We aimed to develop an advanced breast cancer prognosis model for disease progression, cancer-specific mortality, and all-cause mortality by integrating tumor, demographic, and treatment characteristics from a large breast cancer cohort in China. Methods This study was approved by the Clinical Test and Biomedical Ethics Committee of West China Hospital, Sichuan University on May 17, 2012. Data collection for this project was started in May 2017 and ended in March 2019. Data on 5293 women diagnosed with stage I to III invasive breast cancer between 2000 and 2013 were collected. Disease progression, cancer-specific mortality, all-cause mortality, and the likelihood of disease progression or death within a 5-year period were predicted. Extreme gradient boosting was used to develop the prediction model. Model performance was assessed by calculating the area under the receiver operating characteristic curve (AUROC), and the model was calibrated and compared with PREDICT. Results The training, test, and validation sets comprised 3276 (499 progressions, 202 breast cancer-specific deaths, and 261 all-cause deaths within 5-year follow-up), 1405 (211 progressions, 94 breast cancer-specific deaths, and 129 all-cause deaths), and 612 (109 progressions, 33 breast cancer-specific deaths, and 37 all-cause deaths) women, respectively. The AUROC values for disease progression, cancer-specific mortality, and all-cause mortality were 0.76, 0.88, and 0.82 for training set; 0.79, 0.80, and 0.83 for the test set; and 0.79, 0.84, and 0.88 for the validation set, respectively. Calibration analysis demonstrated good agreement between predicted and observed events within 5 years. Comparable AUROC and calibration results were confirmed in different age, residence status, and receptor status subgroups. Compared with PREDICT, our model showed similar AUROC and improved calibration values. Conclusions Our prognostic model exhibits high discrimination and good calibration. It may facilitate prognosis prediction and clinical decision making for patients with breast cancer in China.
Collapse
Affiliation(s)
- Xiaorong Zhong
- Department of Head, Neck and Mammary Gland Oncology, Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Ting Luo
- Department of Head, Neck and Mammary Gland Oncology, Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Ling Deng
- Laboratory of Molecular Diagnosis of Cancer, Clinical Research Center for Breast, West China Hospital, Sichuan University, Chengdu, China
| | - Pei Liu
- Big Data Research Center, School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Kejia Hu
- Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Donghao Lu
- Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Dan Zheng
- Laboratory of Molecular Diagnosis of Cancer, Clinical Research Center for Breast, West China Hospital, Sichuan University, Chengdu, China
| | - Chuanxu Luo
- Laboratory of Molecular Diagnosis of Cancer, Clinical Research Center for Breast, West China Hospital, Sichuan University, Chengdu, China
| | - Yuxin Xie
- Department of Head, Neck and Mammary Gland Oncology, Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Jiayuan Li
- Department of Epidemiology and Biostatistics, West China School of Public Health, Sichuan University, Chengdu, China
| | - Ping He
- Department of Head, Neck and Mammary Gland Oncology, Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Tianjie Pu
- Laboratory of Pathology, West China Hospital, Sichuan University, Chengdu, China
| | - Feng Ye
- Laboratory of Pathology, West China Hospital, Sichuan University, Chengdu, China
| | - Hong Bu
- Laboratory of Pathology, West China Hospital, Sichuan University, Chengdu, China
| | - Bo Fu
- Big Data Research Center, School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Hong Zheng
- Laboratory of Molecular Diagnosis of Cancer, Clinical Research Center for Breast, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
44
|
Akcay M, Etiz D, Celik O. Prediction of Survival and Recurrence Patterns by Machine Learning in Gastric Cancer Cases Undergoing Radiation Therapy and Chemotherapy. Adv Radiat Oncol 2020. [DOI: 10.1016/j.adro.2020.07.007 2452-1094/ 2020 the author(s).published b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/01/2022] Open
|
45
|
Prediction of Survival and Recurrence Patterns by Machine Learning in Gastric Cancer Cases Undergoing Radiation Therapy and Chemotherapy. Adv Radiat Oncol 2020. [PMID: 33305079 DOI: 10.1016/j.adro.2020.07.007 2452-1094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Purpose Radical surgery is the most important treatment modality in gastric cancer. Preoperative or postoperative radiation therapy (RT) and perioperative chemotherapy are the treatment options that should be added to surgery. This study aimed to evaluate the overall survival (OS) and recurrence patterns by machine learning in gastric cancer cases undergoing RT. Methods and Materials Between 2012 and 2019, the OS and recurrence patterns of 75 gastric cancer cases receiving RT ± chemotherapy at the Department of Radiation Oncology were evaluated by machine learning. Logistic regression, multilayer perceptron, XGBoost, support vector classification, random forest, and Gaussian Naive Bayes (GNB) algorithms were used to predict OS, hematogenous distant metastases, and peritoneal metastases. After the correlation analysis, the backward feature selection was performed as the variable selection method, and the variables with P values less than .005 were selected. Results Over the median 23-month follow-up, recurrence was seen in 33 cases, and 36 patients died. The median OS was 23 (min: 7; max: 82) months, and the disease-free survival was 18 (min: 5, max: 80) months. The most common recurrence pattern was hematogenous distant metastasis, followed by peritoneal metastasis. In this study, the most successful algorithms in the prediction of OS, distant metastases, and peritoneal metastases were found to be GNB with an accuracy of 81% (95% confidence interval [CI], 0.65-0.97, area under the curve [AUC]: 0.89), XGBoost with 86% accuracy (95% CI, 0.74-0.97, AUC: 0.86), and random forest with 97% accuracy (95% CI, 0.92-1.00, AUC: 0.97), respectively. Conclusions In gastric cancer, GNB, XGBoost, and random forest algorithms were determined to be the most successful algorithms for predicting OS, distant metastases, and peritoneal metastases, respectively. To determine the most accurate algorithm and perhaps make personalized treatments applicable, more precise machine learning studies are needed with an increased number of cases in the coming years.
Collapse
|
46
|
Nicolò C, Périer C, Prague M, Bellera C, MacGrogan G, Saut O, Benzekry S. Machine Learning and Mechanistic Modeling for Prediction of Metastatic Relapse in Early-Stage Breast Cancer. JCO Clin Cancer Inform 2020; 4:259-274. [PMID: 32213092 DOI: 10.1200/cci.19.00133] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
PURPOSE For patients with early-stage breast cancer, predicting the risk of metastatic relapse is of crucial importance. Existing predictive models rely on agnostic survival analysis statistical tools (eg, Cox regression). Here we define and evaluate the predictive ability of a mechanistic model for time to distant metastatic relapse. METHODS The data we used for our model consisted of 642 patients with 21 clinicopathologic variables. A mechanistic model was developed on the basis of two intrinsic mechanisms of metastatic progression: growth (parameter α) and dissemination (parameter μ). Population statistical distributions of the parameters were inferred using mixed-effects modeling. A random survival forest analysis was used to select a minimal set of five covariates with the best predictive power. These were further considered to individually predict the model parameters by using a backward selection approach. Predictive performances were compared with classic Cox regression and machine learning algorithms. RESULTS The mechanistic model was able to accurately fit the data. Covariate analysis revealed statistically significant association of Ki67 expression with α (P = .001) and EGFR expression with μ (P = .009). The model achieved a c-index of 0.65 (95% CI, 0.60 to 0.71) in cross-validation and had predictive performance similar to that of random survival forest (95% CI, 0.66 to 0.69) and Cox regression (95% CI, 0.62 to 0.67) as well as machine learning classification algorithms. CONCLUSION By providing informative estimates of the invisible metastatic burden at the time of diagnosis and forward simulations of metastatic growth, the proposed model could be used as a personalized prediction tool for routine management of patients with breast cancer.
Collapse
Affiliation(s)
- Chiara Nicolò
- Mathematical Modeling for Oncology Team, Inria Bordeaux Sud-Ouest, Talence, France.,Institut de Mathématiques de Bordeaux, UMR 5251, CNRS, Bordeaux, France
| | - Cynthia Périer
- Mathematical Modeling for Oncology Team, Inria Bordeaux Sud-Ouest, Talence, France.,Institut de Mathématiques de Bordeaux, UMR 5251, CNRS, Bordeaux, France
| | - Melanie Prague
- Statistics in Systems Biology and Translational Medicine Team, Inria Bordeaux Sud-Ouest, University of Bordeaux, Bordeaux, France.,INSERM U1219, Bordeaux Public Health, University of Bordeaux, Bordeaux, France
| | - Carine Bellera
- INSERM U1219, Bordeaux Public Health, University of Bordeaux, Bordeaux, France.,Department of Clinical Epidemiology and Clinical Research, Institut Bergonié, Regional Comprehensive Cancer Centre, Bordeaux, France
| | - Gaëtan MacGrogan
- Department of Biopathology, Institut Bergonié, Regional Comprehensive Cancer Centre, Bordeaux, France.,INSERM U1218, Bordeaux Public Health, University of Bordeaux, Bordeaux, France
| | - Olivier Saut
- Mathematical Modeling for Oncology Team, Inria Bordeaux Sud-Ouest, Talence, France.,Institut de Mathématiques de Bordeaux, UMR 5251, CNRS, Bordeaux, France
| | - Sébastien Benzekry
- Mathematical Modeling for Oncology Team, Inria Bordeaux Sud-Ouest, Talence, France.,Institut de Mathématiques de Bordeaux, UMR 5251, CNRS, Bordeaux, France
| |
Collapse
|
47
|
Trabeculae microstructure parameters serve as effective predictors for marginal bone loss of dental implant in the mandible. Sci Rep 2020; 10:18437. [PMID: 33116221 PMCID: PMC7595041 DOI: 10.1038/s41598-020-75563-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 10/15/2020] [Indexed: 12/27/2022] Open
Abstract
Marginal bone loss (MBL) is one of the leading causes of dental implant failure. This study aimed to investigate the feasibility of machine learning (ML) algorithms based on trabeculae microstructure parameters to predict the occurrence of severe MBL. Eighty-one patients (41 severe MBL cases and 40 normal controls) were involved in the current study. Four ML models, including support vector machine (SVM), artificial neural network (ANN), logistic regression (LR), and random forest (RF), were employed to predict severe MBL. The area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, and specificity were used to evaluate the performance of these models. At the early stage of functional loading, severe MBL cases showed a significant increase of structure model index and trabecular pattern factor in peri-implant alveolar bone. The SVM model exhibited the best outcome in predicting MBL (AUC = 0.967, sensitivity = 91.67%, specificity = 100.00%), followed by ANN (AUC = 0.928, sensitivity = 91.67%, specificity = 93.33%), LR (AUC = 0.906, sensitivity = 91.67%, specificity = 93.33%), RF (AUC = 0.842, sensitivity = 75.00%, specificity = 86.67%). Together, ML algorithms based on the morphological variation of trabecular bone can be used to predict severe MBL.
Collapse
|
48
|
Xie L, Chu R, Wang K, Zhang X, Li J, Zhao Z, Yao S, Wang Z, Dong T, Yang X, Su X, Qiao X, Song K, Kong B. Prognostic Assessment of Cervical Cancer Patients by Clinical Staging and Surgical-Pathological Factor: A Support Vector Machine-Based Approach. Front Oncol 2020; 10:1353. [PMID: 32850433 PMCID: PMC7419674 DOI: 10.3389/fonc.2020.01353] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 06/29/2020] [Indexed: 11/23/2022] Open
Abstract
Introduction: The International Federation of Gynecology and Obstetrics (FIGO) staging system is considered the most powerful prognostic factor in patients with cervical cancer. In addition, other surgical-pathological risk factors have been demonstrated to have significance in predicting the prognosis of patients. Therefore, the purpose of this study was to investigate the effects of the FIGO staging system and surgical-pathological risk factors on the prognosis of cervical cancer patients. Methods: A retrospective study was performed on patients diagnosed with cervical cancer at FIGO stage IB1–IIA2. Kaplan–Meier, Cox proportional hazards regression analysis and the support vector machine (SVM) algorithm were used to assess and validate the high-risk factors related to recurrence and death. Results: A total of 647 patients were included. Kaplan-Meier analysis showed that five high-risk factors, including FIGO stage, status of pelvic lymph node, parametrial involvement, tumor size, and depth of cervical cancer, had a significant effect on the prognosis of patients. In multivariate analysis, pelvic lymph node metastasis (hazard ratio [HR] 2.415, 95% confidence interval [CI] 1.471–3.965), parametrial involvement (HR 2.740, 95% CI 1.092–6.872) and >2/3 depth of cervical invasion (HR 2.263, 95% CI 1.045–4.902) were three independent risk factors of disease-free survival. Pelvic lymph node metastasis (HR 3.855, 95% CI 2.125–6.991) and parametrial involvement (HR 3.871, 95% CI 1.375–10.900) were two independent risk factors for overall survival. When all five high-risk factors were assembled and used for classification prediction through SVM, it achieved the highest prediction accuracy of recurrence (accuracy = 69.1%). The highest prediction accuracy for survival was 94.3% when only using the two independent predictors (the pathological status of lymph nodes and parametrium involvement) by SVM classifiers. Among the 13 groups of intermediate-risk factor, the combination of tumor size, histology and grade of differentiation was more accurate in predicting prognosis than the intermediate-risk factors in the Sedlis criteria (recurrence: 86.8% vs. 60.0%; death: 92.0% vs. 71.6%). Conclusions: The combination of FIGO stage and surgical-pathological risk factors can further enhance the prediction accuracy of the prognosis in patients with early-stage cervical cancer. Histology and grade of differentiation can further improve the prediction accuracy of intermediate-risk factors in the Sedlis criteria.
Collapse
Affiliation(s)
- Lin Xie
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China.,Department of Obstetrics and Gynecology, Jining No.1 People's Hospital, Jining, China
| | - Ran Chu
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Kai Wang
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Xi Zhang
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Jie Li
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Zhe Zhao
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Shu Yao
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Zhiwen Wang
- School of Control Science and Engineering, Shandong University, Jinan, China
| | - Taotao Dong
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Xingsheng Yang
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Xuantao Su
- School of Control Science and Engineering, Shandong University, Jinan, China
| | - Xu Qiao
- School of Control Science and Engineering, Shandong University, Jinan, China
| | - Kun Song
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China.,Gynecology Oncology Key Laboratory, Qilu Hospital, Shandong University, Jinan, China
| | - Beihua Kong
- Department of Obstetrics and Gynecology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, China.,Gynecology Oncology Key Laboratory, Qilu Hospital, Shandong University, Jinan, China
| |
Collapse
|
49
|
Akcay M, Etiz D, Celik O. Prediction of Survival and Recurrence Patterns by Machine Learning in Gastric Cancer Cases Undergoing Radiation Therapy and Chemotherapy. Adv Radiat Oncol 2020; 5:1179-1187. [PMID: 33305079 PMCID: PMC7718548 DOI: 10.1016/j.adro.2020.07.007] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 06/05/2020] [Accepted: 07/15/2020] [Indexed: 12/12/2022] Open
Abstract
Purpose Radical surgery is the most important treatment modality in gastric cancer. Preoperative or postoperative radiation therapy (RT) and perioperative chemotherapy are the treatment options that should be added to surgery. This study aimed to evaluate the overall survival (OS) and recurrence patterns by machine learning in gastric cancer cases undergoing RT. Methods and Materials Between 2012 and 2019, the OS and recurrence patterns of 75 gastric cancer cases receiving RT ± chemotherapy at the Department of Radiation Oncology were evaluated by machine learning. Logistic regression, multilayer perceptron, XGBoost, support vector classification, random forest, and Gaussian Naive Bayes (GNB) algorithms were used to predict OS, hematogenous distant metastases, and peritoneal metastases. After the correlation analysis, the backward feature selection was performed as the variable selection method, and the variables with P values less than .005 were selected. Results Over the median 23-month follow-up, recurrence was seen in 33 cases, and 36 patients died. The median OS was 23 (min: 7; max: 82) months, and the disease-free survival was 18 (min: 5, max: 80) months. The most common recurrence pattern was hematogenous distant metastasis, followed by peritoneal metastasis. In this study, the most successful algorithms in the prediction of OS, distant metastases, and peritoneal metastases were found to be GNB with an accuracy of 81% (95% confidence interval [CI], 0.65-0.97, area under the curve [AUC]: 0.89), XGBoost with 86% accuracy (95% CI, 0.74-0.97, AUC: 0.86), and random forest with 97% accuracy (95% CI, 0.92-1.00, AUC: 0.97), respectively. Conclusions In gastric cancer, GNB, XGBoost, and random forest algorithms were determined to be the most successful algorithms for predicting OS, distant metastases, and peritoneal metastases, respectively. To determine the most accurate algorithm and perhaps make personalized treatments applicable, more precise machine learning studies are needed with an increased number of cases in the coming years.
Collapse
Affiliation(s)
- Melek Akcay
- Department of Radiation Oncology, Medical Faculty of Osmangazi University, Eskişehir, Turkey
| | - Durmus Etiz
- Department of Radiation Oncology, Medical Faculty of Osmangazi University, Eskişehir, Turkey
| | - Ozer Celik
- Department of Mathematics-Computer, Eskisehir Osmangazi University, Eskişehir, Turkey
| |
Collapse
|
50
|
Sanchez-Ibarra HE, Jiang X, Gallegos-Gonzalez EY, Cavazos-González AC, Chen Y, Morcos F, Barrera-Saldaña HA. KRAS, NRAS, and BRAF mutation prevalence, clinicopathological association, and their application in a predictive model in Mexican patients with metastatic colorectal cancer: A retrospective cohort study. PLoS One 2020; 15:e0235490. [PMID: 32628708 PMCID: PMC7337295 DOI: 10.1371/journal.pone.0235490] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Accepted: 06/16/2020] [Indexed: 01/10/2023] Open
Abstract
Mutations in KRAS, NRAS, and BRAF (RAS/BRAF) genes are the main predictive biomarkers for the response to anti-EGFR monoclonal antibodies (MAbs) targeted therapy in metastatic colorectal cancer (mCRC). This retrospective study aimed to report the mutational status prevalence of these genes, explore their possible associations with clinicopathological features, and build and validate a predictive model. To achieve these objectives, 500 mCRC Mexican patients were screened for clinically relevant mutations in RAS/BRAF genes. Fifty-two percent of these specimens harbored clinically relevant mutations in at least one screened gene. Among these, 86% had a mutation in KRAS, 7% in NRAS, 6% in BRAF, and 2% in both NRAS and BRAF. Only tumor location in the proximal colon exhibited a significant correlation with KRAS and BRAF mutational status (p-value = 0.0414 and 0.0065, respectively). Further t-SNE analyses were made to 191 specimens to reveal patterns among patients with clinical parameters and KRAS mutational status. Then, directed by the results from classical statistical tests and t-SNE analysis, neural network models utilized entity embeddings to learn patterns and build predictive models using a minimal number of trainable parameters. This study could be the first step in the prediction for RAS/BRAF mutational status from tumoral features and could lead the way to a more detailed and more diverse dataset that could benefit from machine learning methods.
Collapse
Affiliation(s)
| | - Xianli Jiang
- Evolutionary Information Laboratory, Department of Biological Sciences, the University of Texas at Dallas, Richardson, Texas, United States of America
| | | | | | - Yenho Chen
- Evolutionary Information Laboratory, Department of Biological Sciences, the University of Texas at Dallas, Richardson, Texas, United States of America
| | - Faruck Morcos
- Evolutionary Information Laboratory, Department of Biological Sciences, the University of Texas at Dallas, Richardson, Texas, United States of America
| | | |
Collapse
|