101
|
Lombardi A, Arezzo F, Di Sciascio E, Ardito C, Mongelli M, Di Lillo N, Fascilla FD, Silvestris E, Kardhashi A, Putino C, Cazzolla A, Loizzi V, Cazzato G, Cormio G, Di Noia T. A human-interpretable machine learning pipeline based on ultrasound to support leiomyosarcoma diagnosis. Artif Intell Med 2023; 146:102697. [PMID: 38042596 DOI: 10.1016/j.artmed.2023.102697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 10/08/2023] [Accepted: 10/29/2023] [Indexed: 12/04/2023]
Abstract
The preoperative evaluation of myometrial tumors is essential to avoid delayed treatment and to establish the appropriate surgical approach. Specifically, the differential diagnosis of leiomyosarcoma (LMS) is particularly challenging due to the overlapping of clinical, laboratory and ultrasound features between fibroids and LMS. In this work, we present a human-interpretable machine learning (ML) pipeline to support the preoperative differential diagnosis of LMS from leiomyomas, based on both clinical data and gynecological ultrasound assessment of 68 patients (8 with LMS diagnosis). The pipeline provides the following novel contributions: (i) end-users have been involved both in the definition of the ML tasks and in the evaluation of the overall approach; (ii) clinical specialists get a full understanding of both the decision-making mechanisms of the ML algorithms and the impact of the features on each automatic decision. Moreover, the proposed pipeline addresses some of the problems concerning both the imbalance of the two classes by analyzing and selecting the best combination of the synthetic oversampling strategy of the minority class and the classification algorithm among different choices, and the explainability of the features at global and local levels. The results show very high performance of the best strategy (AUC = 0.99, F1 = 0.87) and the strong and stable impact of two ultrasound-based features (i.e., tumor borders and consistency of the lesions). Furthermore, the SHAP algorithm was exploited to quantify the impact of the features at the local level and a specific module was developed to provide a template-based natural language (NL) translation of the explanations for enhancing their interpretability and fostering the use of ML in the clinical setting.
Collapse
Affiliation(s)
- Angela Lombardi
- Department of Electrical and Information Engineering (DEI), Politecnico di Bari, Bari, Italy.
| | - Francesca Arezzo
- Gynecologic Oncology Unit, Interdisciplinar Department of Medicine, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Eugenio Di Sciascio
- Department of Electrical and Information Engineering (DEI), Politecnico di Bari, Bari, Italy
| | - Carmelo Ardito
- Department of Engineering, LUM "Giuseppe Degennaro" University, Casamassima, Bari, Italy
| | - Michele Mongelli
- Obstetrics and Gynecology Unit, Department of Biomedical Sciences and Human Oncology, University of Bari "Aldo Moro", Bari, Italy
| | - Nicola Di Lillo
- Obstetrics and Gynecology Unit, Department of Biomedical Sciences and Human Oncology, University of Bari "Aldo Moro", Bari, Italy
| | | | - Erica Silvestris
- Gynecologic Oncology Unit, Interdisciplinar Department of Medicine, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Anila Kardhashi
- Gynecologic Oncology Unit, Interdisciplinar Department of Medicine, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Carmela Putino
- Obstetrics and Gynecology Unit, Department of Biomedical Sciences and Human Oncology, University of Bari "Aldo Moro", Bari, Italy
| | - Ambrogio Cazzolla
- Gynecologic Oncology Unit, Interdisciplinar Department of Medicine, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | - Vera Loizzi
- Gynecologic Oncology Unit, Interdisciplinar Department of Medicine, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy; Interdisciplinar Department of Medicine, University of Bari "Aldo Moro", Bari, Italy
| | - Gerardo Cazzato
- Section of Pathology, Department of Emergency and Organ Transplantation (DETO), University of Bari "Aldo Moro", Bari, Italy
| | - Gennaro Cormio
- Gynecologic Oncology Unit, Interdisciplinar Department of Medicine, IRCCS Istituto Tumori "Giovanni Paolo II", Bari, Italy; Interdisciplinar Department of Medicine, University of Bari "Aldo Moro", Bari, Italy
| | - Tommaso Di Noia
- Department of Electrical and Information Engineering (DEI), Politecnico di Bari, Bari, Italy
| |
Collapse
|
102
|
Zhang W, Guan X, Jiao S, Wang G, Wang X. Development and validation of an artificial intelligence prediction model and a survival risk stratification for lung metastasis in colorectal cancer from highly imbalanced data: A multicenter retrospective study. EUROPEAN JOURNAL OF SURGICAL ONCOLOGY 2023; 49:107107. [PMID: 37883884 DOI: 10.1016/j.ejso.2023.107107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 09/08/2023] [Accepted: 10/02/2023] [Indexed: 10/28/2023]
Abstract
BACKGROUND To assist clinicians with diagnosis and optimal treatment decision-making, we attempted to develop and validate an artificial intelligence prediction model for lung metastasis (LM) in colorectal cancer (CRC) patients. METHODS The clinicopathological characteristics of 46037 CRC patients from the Surveillance, Epidemiology, and End Results (SEER) database and 2779 CRC patients from a multi-center external validation set were collected retrospectively. After feature selection by univariate and multivariate analyses, six machine learning (ML) models, including logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest, and balanced random forest (BRF), were developed and validated for the LM prediction. In addition, stratified LM patients by risk score were utilized for survival analysis. RESULTS Extremely low rates of LM with 2.59% and 4.50% were present in the development and validation set. As the imbalanced learning strategy, the BRF model with an Area under the receiver operating characteristic curve (AUC) of 0.874 and an average precision (AP) of 0.184 performed best compares with other models and clinical predictor. Patients with LM in the high-risk group had significantly poorer survival (P<0.001) and failed to benefit from resection (P = 0.125). CONCLUSIONS In summary, we have utilized the BRF algorithm to develop an effective, non-invasive, and practical model for predicting LM in CRC patients based on highly imbalanced datasets. In addition, we have implemented a novel approach to stratify the survival risk of CRC patients with LM based the output of the model.
Collapse
Affiliation(s)
- Weiyuan Zhang
- Department of Colorectal Cancer Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, 150000, China
| | - Xu Guan
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100000, China; Department of Colorectal Surgery, Shanxi Province Cancer Hospital/Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, 030000, China.
| | - Shuai Jiao
- Department of Colorectal Surgery, Shanxi Province Cancer Hospital/Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, 030000, China
| | - Guiyu Wang
- Department of Colorectal Cancer Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, 150000, China.
| | - Xishan Wang
- Department of Colorectal Cancer Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, 150000, China; Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100000, China; Department of Colorectal Surgery, Shanxi Province Cancer Hospital/Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, 030000, China.
| |
Collapse
|
103
|
Ushida T, Kotani T, Baba J, Imai K, Moriyama Y, Nakano-Kobayashi T, Iitani Y, Nakamura N, Hayakawa M, Kajiyama H. Antenatal prediction models for outcomes of extremely and very preterm infants based on machine learning. Arch Gynecol Obstet 2023; 308:1755-1763. [PMID: 36502513 DOI: 10.1007/s00404-022-06865-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Accepted: 11/17/2022] [Indexed: 12/14/2022]
Abstract
PURPOSE Predicting individual risks for adverse outcomes in preterm infants is necessary for perinatal management and antenatal counseling for their parents. To evaluate whether a machine learning approach can improve the prediction of severe infant outcomes beyond the performance of conventional logistic models, and to identify maternal and fetal factors that largely contribute to these outcomes. METHODS A population-based retrospective study was performed using clinical data of 31,157 infants born at < 32 weeks of gestation and weighing ≤ 1500 g, registered in the Neonatal Research Network of Japan between 2006 and 2015. We developed a conventional logistic model and 6 types of machine learning models based on 12 maternal and fetal factors. Discriminative ability was evaluated using the area under the receiver operating characteristic curves (AUROCs), and the importance of each factor in terms of its contribution to outcomes was evaluated using the SHAP (SHapley Additive exPlanations) value. RESULTS The AUROCs of the most discriminative machine learning models were better than those of the conventional models for all outcomes. The AUROCs for in-hospital death and short-term adverse outcomes in the gradient boosting decision tree were significantly higher than those in the conventional model (p = 0.015 and p = 0.002, respectively). The SHAP value analyses showed that gestational age, birth weight, and antenatal corticosteroid treatment were the three most important factors associated with severe infant outcomes. CONCLUSION Machine learning models improve the prediction of severe infant outcomes. Moreover, the machine learning approach provides insight into the potential risk factors for severe infant outcomes.
Collapse
Affiliation(s)
- Takafumi Ushida
- Department of Obstetrics and Gynecology, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan.
- Division of Perinatology, Center for Maternal-Neonatal Care, Nagoya University Hospital, Nagoya, Japan.
| | - Tomomi Kotani
- Department of Obstetrics and Gynecology, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
- Division of Perinatology, Center for Maternal-Neonatal Care, Nagoya University Hospital, Nagoya, Japan
| | - Joji Baba
- Education Software Co., Ltd, Tokyo, Japan
| | - Kenji Imai
- Department of Obstetrics and Gynecology, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Yoshinori Moriyama
- Department of Obstetrics and Gynecology, Fujita Health University School of Medicine, Toyoake, Japan
| | | | - Yukako Iitani
- Department of Obstetrics and Gynecology, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Noriyuki Nakamura
- Department of Obstetrics and Gynecology, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Masahiro Hayakawa
- Division of Neonatology, Center for Maternal-Neonatal Care, Nagoya University Hospital, Nagoya, Japan
| | - Hiroaki Kajiyama
- Department of Obstetrics and Gynecology, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| |
Collapse
|
104
|
Tian S, Yu R, Zhou F, Zhan N, Li J, Wang X, Peng X. Prediction of HER2 status via random forest in 3257 Chinese patients with gastric cancer. Clin Exp Med 2023; 23:5015-5024. [PMID: 37318648 DOI: 10.1007/s10238-023-01111-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 05/29/2023] [Indexed: 06/16/2023]
Abstract
The accurate evaluation of human epidermal growth factor receptor 2 (HER2) is crucial for successful trastuzumab-based therapy in individuals with gastric cancer (GC). The present study, involving a retrospective cohort (N = 2865) from Wuhan Union Hospital and a prospective cohort (N = 392) from Renmin Hospital of Wuhan University, evaluated the benefits of clinical features using random forest and logistic regression models for the detection of HER2 status in patients with GC. Patients from the Union cohort were randomly assigned to either a training (N = 2005) or an internal validation (N = 860) group. Data processing and feature selection were done in Python, which was also used to build random forest and logistic regression models for the prediction of HER2 overexpression. The Renmin cohort (N = 392) was used as the external validation group. Ten features were closely correlated with HER2 overexpression, including age, albumin/globulin ratio, globulin, activated partial thromboplastin time, tumor stage, node stage, tumor node metastasis stage, tumor size, tumor differentiation, and neuron-specific enolase (NSE). Random forest and logistic regression had areas under the curve (AUC) of 0.9995 and 0.6653 in the training group and 0.923 and 0.667 in the internal validation group, respectively. When the two predictive models were validated using data from the Renmin cohort, random forest and logistic regression had AUCs of 0.9994 and 0.627, respectively. This is the first multicenter study to predict HER2 overexpression in individuals with GC, based on clinical variables. The random forest model significantly outperformed the logistic regression model.
Collapse
Affiliation(s)
- Shan Tian
- Department of Infectious Disease, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Rong Yu
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, 430060, Hubei Province, China
| | - Fangfang Zhou
- School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, Hubei, China
| | - Na Zhan
- Department of Pathology, Renmin Hospital of Wuhan University, Wuhan, 430060, Hubei Province, China
| | - Jiao Li
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, 430060, Hubei Province, China
| | - Xia Wang
- Department of Pharmacy, The Second Affiliated Hospital of Jianghan University, No.122, Xianzheng Road, Hanyang District, Wuhan, 430050, Hubei, China.
| | - Xiulan Peng
- Department of Oncology, The Second Affiliated Hospital of Jianghan University, No.122, Xianzheng Road, Hanyang District, Wuhan, 430050, Hubei Province, China.
| |
Collapse
|
105
|
Yang S, Jang H, Park IK, Lee HS, Lee KY, Oh GE, Park C, Kang J. Machine-Learning Algorithms Using Systemic Inflammatory Markers to Predict the Oncologic Outcomes of Colorectal Cancer After Surgery. Ann Surg Oncol 2023; 30:8717-8726. [PMID: 37605080 DOI: 10.1245/s10434-023-14136-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 07/24/2023] [Indexed: 08/23/2023]
Abstract
BACKGROUND This study aimed to investigate the clinical significance of machine-learning (ML) algorithms based on serum inflammatory markers to predict survival outcomes for patients with colorectal cancer (CRC). METHODS The study included 941 patients with stages I to III CRC. Based on random forest algorithms using 15 compositions of inflammatory markers, four different prediction scores (DFS score-1, DFS score-2, DFS score-3, and DFS score-4) were developed for the Yonsei cohort (training set, n = 803) and tested in the Ulsan cohort (test set, n = 138). The Cox proportional hazards model was used to determine correlation between prediction scores and disease-free survival (DFS). Harrell's concordance index (C-index) was used to compare the predictive ability of prediction scores for each composition. RESULTS The multivariable analysis showed the DFS score-4 to be an independent prognostic factor after adjustment for clinicopathologic factors in both the training and test sets (hazard ratio [HR], 8.98; 95% confidence interval [CI] 6.7-12.04; P < 0.001 for the training set and HR, 2.55; 95% CI 1.1-5.89; P = 0.028 for the test set]. With regard to DFS, the highest C-index among single compositions was observed in the lymphocyte-to-C-reactive protein ratio (LCR) (0.659; 95% CI 0.656-0.662), and the C-index of DFS score-4 (0.727; 95% CI 0.724-0.729) was significantly higher than that of LCR in the test set. The C-index of DFS score-3 (0.725; 95% CI 0.723-0.728) was similar to that of DFS score-4, but higher than that of DFS score-2 (0.680; 95% CI 0.676-0.683). CONCLUSIONS The ML-based approaches showed prognostic utility in predicting DFS. They could enhance clinical use of inflammatory markers in patients with CRC.
Collapse
Affiliation(s)
- Songsoo Yang
- Department of Surgery, Ulsan University Hospital, University of Ulsan College of Medicine, Ulsan, Republic of Korea
| | - Hyosoon Jang
- Graduate School of Artificial Intelligence, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - In Kyu Park
- Department of Surgery, Ulsan University Hospital, University of Ulsan College of Medicine, Ulsan, Republic of Korea
| | - Hye Sun Lee
- Biostatistics Collaboration Unit, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Kang Young Lee
- Department of Surgery, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Ga Eul Oh
- Department of Artificial Intelligence, Yonsei University, Seoul, Republic of Korea
| | - Chihyun Park
- Department of Computer Science and Engineering, Kangwon National University, Chuncheon-si, Gangwon-do, Republic of Korea.
| | - Jeonghyun Kang
- Department of Surgery, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea.
| |
Collapse
|
106
|
Winchester LM, Harshfield EL, Shi L, Badhwar A, Khleifat AA, Clarke N, Dehsarvi A, Lengyel I, Lourida I, Madan CR, Marzi SJ, Proitsi P, Rajkumar AP, Rittman T, Silajdžić E, Tamburin S, Ranson JM, Llewellyn DJ. Artificial intelligence for biomarker discovery in Alzheimer's disease and dementia. Alzheimers Dement 2023; 19:5860-5871. [PMID: 37654029 PMCID: PMC10840606 DOI: 10.1002/alz.13390] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 06/13/2023] [Accepted: 06/19/2023] [Indexed: 09/02/2023]
Abstract
With the increase in large multimodal cohorts and high-throughput technologies, the potential for discovering novel biomarkers is no longer limited by data set size. Artificial intelligence (AI) and machine learning approaches have been developed to detect novel biomarkers and interactions in complex data sets. We discuss exemplar uses and evaluate current applications and limitations of AI to discover novel biomarkers. Remaining challenges include a lack of diversity in the data sets available, the sheer complexity of investigating interactions, the invasiveness and cost of some biomarkers, and poor reporting in some studies. Overcoming these challenges will involve collecting data from underrepresented populations, developing more powerful AI approaches, validating the use of noninvasive biomarkers, and adhering to reporting guidelines. By harnessing rich multimodal data through AI approaches and international collaborative innovation, we are well positioned to identify clinically useful biomarkers that are accurate, generalizable, unbiased, and acceptable in clinical practice. HIGHLIGHTS: Artificial intelligence and machine learning approaches may accelerate dementia biomarker discovery. Remaining challenges include data set suitability due to size and bias in cohort selection. Multimodal data, diverse data sets, improved machine learning approaches, real-world validation, and interdisciplinary collaboration are required.
Collapse
Affiliation(s)
| | - Eric L Harshfield
- Department of Clinical Neurosciences, Stroke Research Group, University of Cambridge, Cambridge, UK
| | - Liu Shi
- Novo Nordisk Research Centre Oxford (NNRCO), Headington, UK
| | - AmanPreet Badhwar
- Département de Pharmacologie et Physiologie, Institut de Génie Biomédical, Faculté de Médecine, Université de Montréal, Montreal, Canada
- Centre de recherche de l'Institut Universitaire de Gériatrie (CRIUGM), Montreal, Canada
| | - Ahmad Al Khleifat
- Department of Basic and Clinical Neuroscience, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Natasha Clarke
- Centre de recherche de l'Institut Universitaire de Gériatrie (CRIUGM), Montreal, Canada
| | - Amir Dehsarvi
- School of Medicine, Medical Sciences, and Nutrition, University of Aberdeen, Aberdeen, UK
| | - Imre Lengyel
- Wellcome-Wolfson Institute of Experimental Medicine, Queen's University, Belfast, UK
| | - Ilianna Lourida
- Health and Community Sciences, University of Exeter Medical School, Exeter, UK
| | | | - Sarah J Marzi
- UK Dementia Research Institute at Imperial College London, London, UK
- Department of Brain Sciences, Imperial College London, London, UK
| | - Petroula Proitsi
- Department of Basic and Clinical Neuroscience, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Anto P Rajkumar
- Institute of Mental Health, Mental Health and Clinical Neurosciences academic unit, University of Nottingham, Nottingham, UK, Mental health services of older people, Nottinghamshire healthcare NHS foundation trust, Nottingham, UK
| | - Timothy Rittman
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK
| | - Edina Silajdžić
- Department of Basic and Clinical Neuroscience, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Stefano Tamburin
- Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
| | - Janice M Ranson
- Health and Community Sciences, University of Exeter Medical School, Exeter, UK
| | - David J Llewellyn
- Health and Community Sciences, University of Exeter Medical School, Exeter, UK
- The Alan Turing Institute, London, UK
| |
Collapse
|
107
|
Zhou SN, Jv DW, Meng XF, Zhang JJ, Liu C, Wu ZY, Hong N, Lu YY, Zhang N. Feasibility of machine learning-based modeling and prediction using multiple centers data to assess intrahepatic cholangiocarcinoma outcomes. Ann Med 2023; 55:215-223. [PMID: 36576390 PMCID: PMC9809369 DOI: 10.1080/07853890.2022.2160008] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND AND AIMS Currently, there are still no definitive consensus in the treatment of intrahepatic cholangiocarcinoma (iCCA). This study aimed to build a clinical decision support tool based on machine learning using the Surveillance, Epidemiology, and End Results (SEER) database and the data from the Fifth Medical Center of the PLA General Hospital in China. METHODS 4,398 eligible patients from the SEER database and 504 eligible patients from the hospital data, who presented with histologically proven iCCA, were enrolled for modeling by cross-validation based on machine learning. All the models were trained using the open-source Python library scikit-survival version 0.16.0. Shapley additive explanations method was used to help clinicians better understand the obtained results. Permutation importance was calculated using library ELI5. RESULTS All involved treatment modalities could contribute to a better prognosis. Three models were derived and tested using different data sources, with concordance indices of 0.67, 0.69, and 0.73, respectively. The prediction results were consistent with those under actual situations involving randomly selected patients. Model 2, trained using the hospital data, was selected to develop an online tool, due to its advantage in predicting short-term prognosis. CONCLUSION The prediction model and tool established in this study can be applied to predict the prognosis of iCCA after treatment by inputting the patient's clinical parameters or TNM stages and treatment options, thus contributing to optimal clinical decisions.KEY MESSAGESA prognostic model related to disease staging and treatment mode was conducted using the method of machine learning, based on the big data of multi centers.The online calculator can predict the short-term survival prognosis of intrahepatic cholangiocarcinoma, thus, help to make the best clinical decision.The online calculator built to calculate the mortality risk and overall survival can be easily obtained and applied.
Collapse
Affiliation(s)
- Shuang-Nan Zhou
- Senior Department of Infectious Disease, The Fifth Medical Center of Chinese PLA General Hospital, Beijing, China
| | - Da-Wei Jv
- The Fifth Out-patient Department, Central Theater Command General Hospital of Chinese PLA, Wuhan, Hubei, China
| | - Xiang-Fei Meng
- Faculty of Hepato-Pancrato-Biliary Surgery, The First Medical Center of Chinese PLA General Hospital, Beijing, China
| | - Jing-Jing Zhang
- Senior Department of Liver Disease, The Fifth Medical Center of Chinese PLA General Hospital, Beijing, China
| | - Chun Liu
- Digital Health China Technologies Co., Ltd, Beijing, China
| | - Ze-Yi Wu
- Digital Health China Technologies Co., Ltd, Beijing, China
| | - Na Hong
- Digital Health China Technologies Co., Ltd, Beijing, China
| | - Yin-Ying Lu
- Senior Department of Liver Disease, The Fifth Medical Center of Chinese PLA General Hospital, Beijing, China
| | - Ning Zhang
- Senior Department of Liver Disease, The Fifth Medical Center of Chinese PLA General Hospital, Beijing, China
| |
Collapse
|
108
|
Wu Y, Seufert I, Al-Shaheri FN, Kurilov R, Bauer AS, Manoochehri M, Moskalev EA, Brors B, Tjaden C, Giese NA, Hackert T, Büchler MW, Hoheisel JD. DNA-methylation signature accurately differentiates pancreatic cancer from chronic pancreatitis in tissue and plasma. Gut 2023; 72:2344-2353. [PMID: 37709492 DOI: 10.1136/gutjnl-2023-330155] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 08/31/2023] [Indexed: 09/16/2023]
Abstract
OBJECTIVE Pancreatic ductal adenocarcinoma (PDAC) is a lethal malignancy. Differentiation from chronic pancreatitis (CP) is currently inaccurate in about one-third of cases. Misdiagnoses in both directions, however, have severe consequences for patients. We set out to identify molecular markers for a clear distinction between PDAC and CP. DESIGN Genome-wide variations of DNA-methylation, messenger RNA and microRNA level as well as combinations thereof were analysed in 345 tissue samples for marker identification. To improve diagnostic performance, we established a random-forest machine-learning approach. Results were validated on another 48 samples and further corroborated in 16 liquid biopsy samples. RESULTS Machine-learning succeeded in defining markers to differentiate between patients with PDAC and CP, while low-dimensional embedding and cluster analysis failed to do so. DNA-methylation yielded the best diagnostic accuracy by far, dwarfing the importance of transcript levels. Identified changes were confirmed with data taken from public repositories and validated in independent sample sets. A signature of six DNA-methylation sites in a CpG-island of the protein kinase C beta type gene achieved a validated diagnostic accuracy of 100% in tissue and in circulating free DNA isolated from patient plasma. CONCLUSION The success of machine-learning to identify an effective marker signature documents the power of this approach. The high diagnostic accuracy of discriminating PDAC from CP could have tremendous consequences for treatment success, once the result from still a limited number of liquid biopsy samples would be confirmed in a larger cohort of patients with suspected pancreatic cancer.
Collapse
Affiliation(s)
- Yenan Wu
- Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
| | - Isabelle Seufert
- Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
| | - Fawaz N Al-Shaheri
- Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Medical Faculty Heidelberg, University of Heidelberg, Heidelberg, Germany
| | - Roman Kurilov
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Andrea S Bauer
- Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Mehdi Manoochehri
- Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Evgeny A Moskalev
- Institute of Pathology, Universitätsklinikum Erlangen, Friedrich Alexander Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Benedikt Brors
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christin Tjaden
- Department of Surgery, Heidelberg University Hospital, Heidelberg, Germany
| | - Nathalia A Giese
- Department of Surgery, Heidelberg University Hospital, Heidelberg, Germany
| | - Thilo Hackert
- Department of Surgery, Heidelberg University Hospital, Heidelberg, Germany
| | - Markus W Büchler
- Department of Surgery, Heidelberg University Hospital, Heidelberg, Germany
| | - Jörg D Hoheisel
- Division of Functional Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
109
|
Michelutti L, Tel A, Zeppieri M, Ius T, Sembronio S, Robiony M. The Use of Artificial Intelligence Algorithms in the Prognosis and Detection of Lymph Node Involvement in Head and Neck Cancer and Possible Impact in the Development of Personalized Therapeutic Strategy: A Systematic Review. J Pers Med 2023; 13:1626. [PMID: 38138853 PMCID: PMC10745006 DOI: 10.3390/jpm13121626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 11/17/2023] [Accepted: 11/18/2023] [Indexed: 12/24/2023] Open
Abstract
Given the increasingly important role that the use of artificial intelligence algorithms is taking on in the medical field today (especially in oncology), the purpose of this systematic review is to analyze the main reports on such algorithms applied for the prognostic evaluation of patients with head and neck malignancies. The objective of this paper is to examine the currently available literature in the field of artificial intelligence applied to head and neck oncology, particularly in the prognostic evaluation of the patient with this kind of tumor, by means of a systematic review. The paper exposes an overview of the applications of artificial intelligence in deriving prognostic information related to the prediction of survival and recurrence and how these data may have a potential impact on the choice of therapeutic strategy, making it increasingly personalized. This systematic review was written following the PRISMA 2020 guidelines.
Collapse
Affiliation(s)
- Luca Michelutti
- Clinic of Maxillofacial Surgery, Head-Neck and NeuroScience Department, University Hospital of Udine, p.le S. Maria della Misericordia 15, 33100 Udine, Italy (A.T.)
| | - Alessandro Tel
- Clinic of Maxillofacial Surgery, Head-Neck and NeuroScience Department, University Hospital of Udine, p.le S. Maria della Misericordia 15, 33100 Udine, Italy (A.T.)
| | - Marco Zeppieri
- Department of Ophthalmology, University Hospital of Udine, Piazzale S. Maria della Misericordia 15, 33100 Udine, Italy
| | - Tamara Ius
- Neurosurgery Unit, Head-Neck and NeuroScience Department, University Hospital of Udine, p.le S. Maria della Misericordia 15, 33100 Udine, Italy
| | - Salvatore Sembronio
- Clinic of Maxillofacial Surgery, Head-Neck and NeuroScience Department, University Hospital of Udine, p.le S. Maria della Misericordia 15, 33100 Udine, Italy (A.T.)
| | - Massimo Robiony
- Clinic of Maxillofacial Surgery, Head-Neck and NeuroScience Department, University Hospital of Udine, p.le S. Maria della Misericordia 15, 33100 Udine, Italy (A.T.)
| |
Collapse
|
110
|
Wang S, Han J, Huang J, Islam K, Shi Y, Zhou Y, Kim D, Zhou J, Lian Z, Liu Y, Huang J. Deep learning-based predictive classification of functional subpopulations of hematopoietic stem cells and multipotent progenitors. RESEARCH SQUARE 2023:rs.3.rs-3332530. [PMID: 38014055 PMCID: PMC10680918 DOI: 10.21203/rs.3.rs-3332530/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Background Hematopoietic stem cells (HSCs) and multipotent progenitors (MPPs) play a pivotal role in maintaining lifelong hematopoiesis. The distinction between stem cells and other progenitors, as well as the assessment of their functions, has long been a central focus in stem cell research. In recent years, deep learning has emerged as a powerful tool for cell image analysis and classification/prediction. Methods In this study, we explored the feasibility of employing deep learning techniques to differentiate murine HSCs and MPPs based solely on their morphology, as observed through light microscopy (DIC) images. Results After rigorous training and validation using extensive image datasets, we successfully developed a three-class classifier, referred to as the LSM model, capable of reliably distinguishing long-term HSCs (LT-HSCs), short-term HSCs (ST-HSCs), and MPPs. The LSM model extracts intrinsic morphological features unique to different cell types, irrespective of the methods used for cell identification and isolation, such as surface markers or intracellular GFP markers. Furthermore, employing the same deep learning framework, we created a two-class classifier that effectively discriminates between aged HSCs and young HSCs. This discovery is particularly significant as both cell types share identical surface markers yet serve distinct functions. This classifier holds the potential to offer a novel, rapid, and efficient means of assessing the functional states of HSCs, thus obviating the need for time-consuming transplantation experiments. Conclusion Our study represents the pioneering use of deep learning to differentiate HSCs and MPPs under steady-state conditions. With ongoing advancements in model algorithms and their integration into various imaging systems, deep learning stands poised to become an invaluable tool, significantly impacting stem cell research.
Collapse
Affiliation(s)
- Shen Wang
- Lehigh University Department of Mechanical Engineering and Mechanics
| | | | | | - Khayrul Islam
- Lehigh University Department of Mechanical Engineering and Mechanics
| | - Yuheng Shi
- Shanghai Medical College of Fudan University: Fudan University School of Basic Medical Sciences
| | | | | | | | | | | | | |
Collapse
|
111
|
Huang Y, Li J, Li M, Aparasu RR. Application of machine learning in predicting survival outcomes involving real-world data: a scoping review. BMC Med Res Methodol 2023; 23:268. [PMID: 37957593 PMCID: PMC10641971 DOI: 10.1186/s12874-023-02078-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 10/20/2023] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND Despite the interest in machine learning (ML) algorithms for analyzing real-world data (RWD) in healthcare, the use of ML in predicting time-to-event data, a common scenario in clinical practice, is less explored. ML models are capable of algorithmically learning from large, complex datasets and can offer advantages in predicting time-to-event data. We reviewed the recent applications of ML for survival analysis using RWD in healthcare. METHODS PUBMED and EMBASE were searched from database inception through March 2023 to identify peer-reviewed English-language studies of ML models for predicting time-to-event outcomes using the RWD. Two reviewers extracted information on the data source, patient population, survival outcome, ML algorithms, and the Area Under the Curve (AUC). RESULTS Of 257 citations, 28 publications were included. Random survival forests (N = 16, 57%) and neural networks (N = 11, 39%) were the most popular ML algorithms. There was variability across AUC for these ML models (median 0.789, range 0.6-0.950). ML algorithms were predominately considered for predicting overall survival in oncology (N = 12, 43%). ML survival models were often used to predict disease prognosis or clinical events (N = 27, 96%) in the oncology, while less were used for treatment outcomes (N = 1, 4%). CONCLUSIONS The ML algorithms, random survival forests and neural networks, are mainly used for RWD to predict survival outcomes such as disease prognosis or clinical events in the oncology. This review shows that more opportunities remain to apply these ML algorithms to inform treatment decision-making in clinical practice. More methodological work is also needed to ensure the utility and applicability of ML models in survival outcomes.
Collapse
Affiliation(s)
- Yinan Huang
- Department of Pharmacy Administration, School of Pharmacy, University of Mississippi, University, MS, 38677, USA
| | - Jieni Li
- Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX, 77204, USA
| | - Mai Li
- Department of Industrial Engineering, Cullen College of Engineering, University of Houston, Houston, TX, USA
| | - Rajender R Aparasu
- Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX, 77204, USA.
| |
Collapse
|
112
|
Shi X, Sun Y, Tian H, Abhilash PM, Luo X, Liu H. Material Extrusion Filament Width and Height Prediction via Design of Experiment and Machine Learning. MICROMACHINES 2023; 14:2091. [PMID: 38004948 PMCID: PMC10673448 DOI: 10.3390/mi14112091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/01/2023] [Accepted: 11/09/2023] [Indexed: 11/26/2023]
Abstract
The dimensions of material extrusion 3D printing filaments play a pivotal role in determining processing resolution and efficiency and are influenced by processing parameters. This study focuses on four key process parameters, namely, nozzle diameter, nondimensional nozzle height, extrusion pressure, and printing speed. The design of experiment was carried out to determine the impact of various factors and interaction effects on filament width and height through variance analysis. Five machine learning models (support vector regression, backpropagation neural network, decision tree, random forest, and K-nearest neighbor) were built to predict the geometric dimension of filaments. The models exhibited good predictive performance. The coefficients of determination of the backpropagation neural network model for predicting line width and line height were 0.9025 and 0.9604, respectively. The effect of various process parameters on the geometric morphology based on the established prediction model was also studied. The order of influence on line width and height, ranked from highest to lowest, was as follows: nozzle diameter, printing speed, extrusion pressure, and nondimensional nozzle height. Different nondimensional nozzle height settings may cause the extruded material to be stretched or squeezed. The material being in a stretched state leads to a thin filament, and the regularity of processing parameters on the geometric size is not strong. Meanwhile, the nozzle diameter exhibits a significant impact on dimensions when the material is in a squeezing state. Thus, this study can be used to predict the size of printing filament structures, guide the selection of printing parameters, and determine the size of 3D printing layers.
Collapse
Affiliation(s)
- Xiaoquan Shi
- Department of Mechanical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China; (X.S.); (H.T.)
| | - Yazhou Sun
- Department of Mechanical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China; (X.S.); (H.T.)
| | - Haiying Tian
- Department of Mechanical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China; (X.S.); (H.T.)
| | - Puthanveettil Madathil Abhilash
- Centre for Precision Manufacturing, Department of Design, Manufacturing & Engineering Management, University of Strathclyde, Glasgow G1 1XJ, UK; (P.M.A.); (X.L.)
| | - Xichun Luo
- Centre for Precision Manufacturing, Department of Design, Manufacturing & Engineering Management, University of Strathclyde, Glasgow G1 1XJ, UK; (P.M.A.); (X.L.)
| | - Haitao Liu
- Department of Mechanical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China; (X.S.); (H.T.)
| |
Collapse
|
113
|
Wang Y, Liu L, Wang C. Trends in using deep learning algorithms in biomedical prediction systems. Front Neurosci 2023; 17:1256351. [PMID: 38027475 PMCID: PMC10665494 DOI: 10.3389/fnins.2023.1256351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 09/25/2023] [Indexed: 12/01/2023] Open
Abstract
In the domain of using DL-based methods in medical and healthcare prediction systems, the utilization of state-of-the-art deep learning (DL) methodologies assumes paramount significance. DL has attained remarkable achievements across diverse domains, rendering its efficacy particularly noteworthy in this context. The integration of DL with health and medical prediction systems enables real-time analysis of vast and intricate datasets, yielding insights that significantly enhance healthcare outcomes and operational efficiency in the industry. This comprehensive literature review systematically investigates the latest DL solutions for the challenges encountered in medical healthcare, with a specific emphasis on DL applications in the medical domain. By categorizing cutting-edge DL approaches into distinct categories, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), generative adversarial networks (GANs), long short-term memory (LSTM) models, support vector machine (SVM), and hybrid models, this study delves into their underlying principles, merits, limitations, methodologies, simulation environments, and datasets. Notably, the majority of the scrutinized articles were published in 2022, underscoring the contemporaneous nature of the research. Moreover, this review accentuates the forefront advancements in DL techniques and their practical applications within the realm of medical prediction systems, while simultaneously addressing the challenges that hinder the widespread implementation of DL in image segmentation within the medical healthcare domains. These discerned insights serve as compelling impetuses for future studies aimed at the progressive advancement of using DL-based methods in medical and health prediction systems. The evaluation metrics employed across the reviewed articles encompass a broad spectrum of features, encompassing accuracy, precision, specificity, F-score, adoptability, adaptability, and scalability.
Collapse
Affiliation(s)
- Yanbu Wang
- School of Strength and Conditioning, Beijing Sport University, Beijing, China
| | - Linqing Liu
- Department of Physical Education, Peking University, Beijing, China
| | - Chao Wang
- Institute of Competitive Sports, Beijing Sport University, Beijing, China
| |
Collapse
|
114
|
Li M, Jiang Y, Zhang Y, Zhu H. Medical image analysis using deep learning algorithms. Front Public Health 2023; 11:1273253. [PMID: 38026291 PMCID: PMC10662291 DOI: 10.3389/fpubh.2023.1273253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 10/05/2023] [Indexed: 12/01/2023] Open
Abstract
In the field of medical image analysis within deep learning (DL), the importance of employing advanced DL techniques cannot be overstated. DL has achieved impressive results in various areas, making it particularly noteworthy for medical image analysis in healthcare. The integration of DL with medical image analysis enables real-time analysis of vast and intricate datasets, yielding insights that significantly enhance healthcare outcomes and operational efficiency in the industry. This extensive review of existing literature conducts a thorough examination of the most recent deep learning (DL) approaches designed to address the difficulties faced in medical healthcare, particularly focusing on the use of deep learning algorithms in medical image analysis. Falling all the investigated papers into five different categories in terms of their techniques, we have assessed them according to some critical parameters. Through a systematic categorization of state-of-the-art DL techniques, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Generative Adversarial Networks (GANs), Long Short-term Memory (LSTM) models, and hybrid models, this study explores their underlying principles, advantages, limitations, methodologies, simulation environments, and datasets. Based on our results, Python was the most frequent programming language used for implementing the proposed methods in the investigated papers. Notably, the majority of the scrutinized papers were published in 2021, underscoring the contemporaneous nature of the research. Moreover, this review accentuates the forefront advancements in DL techniques and their practical applications within the realm of medical image analysis, while simultaneously addressing the challenges that hinder the widespread implementation of DL in image analysis within the medical healthcare domains. These discerned insights serve as compelling impetuses for future studies aimed at the progressive advancement of image analysis in medical healthcare research. The evaluation metrics employed across the reviewed articles encompass a broad spectrum of features, encompassing accuracy, sensitivity, specificity, F-score, robustness, computational complexity, and generalizability.
Collapse
Affiliation(s)
- Mengfang Li
- The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Yuanyuan Jiang
- Department of Cardiovascular Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Yanzhou Zhang
- Department of Cardiovascular Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Haisheng Zhu
- Department of Cardiovascular Medicine, Wencheng People’s Hospital, Wencheng, China
| |
Collapse
|
115
|
Al-Rajab M, Lu J, Xu Q, Kentour M, Sawsa A, Shuweikeh E, Joy M, Arasaradnam R. A hybrid machine learning feature selection model-HMLFSM to enhance gene classification applied to multiple colon cancers dataset. PLoS One 2023; 18:e0286791. [PMID: 37917732 PMCID: PMC10621932 DOI: 10.1371/journal.pone.0286791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 05/20/2023] [Indexed: 11/04/2023] Open
Abstract
Colon cancer is a significant global health problem, and early detection is critical for improving survival rates. Traditional detection methods, such as colonoscopies, can be invasive and uncomfortable for patients. Machine Learning (ML) algorithms have emerged as a promising approach for non-invasive colon cancer classification using genetic data or patient demographics and medical history. One approach is to use ML to analyse genetic data, or patient demographics and medical history, to predict the likelihood of colon cancer. However, due to the challenges imposed by variable gene expression and the high dimensionality of cancer-related datasets, traditional transductive ML applications have limited accuracy and risk overfitting. In this paper, we propose a new hybrid feature selection model called HMLFSM-Hybrid Machine Learning Feature Selection Model to improve colon cancer gene classification. We developed a multifilter hybrid model including a two-phase feature selection approach, combining Information Gain (IG) and Genetic Algorithms (GA), and minimum Redundancy Maximum Relevance (mRMR) coupling with Particle Swarm Optimization (PSO). We critically tested our model on three colon cancer genetic datasets and found that the new framework outperformed other models with significant accuracy improvements (95%, ~97%, and ~94% accuracies for datasets 1, 2, and 3 respectively). The results show that our approach improves the classification accuracy of colon cancer detection by highlighting important and relevant genes, eliminating irrelevant ones, and revealing the genes that have a direct influence on the classification process. For colon cancer gene analysis, and along with our experiments and literature review, we found that selective input feature extraction prior to feature selection is essential for improving predictive performance.
Collapse
Affiliation(s)
- Murad Al-Rajab
- College of Engineering, Abu Dhabi University, Abu Dhabi, United Arab Emirates
- School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom
| | - Joan Lu
- School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom
| | - Qiang Xu
- School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom
| | - Mohamed Kentour
- School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom
| | - Ahlam Sawsa
- School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom
- Bradford Teaching Hospitals NHS Foundation Trust, Bradford, United Kingdom
| | - Emad Shuweikeh
- School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom
| | - Mike Joy
- University of Warwick, Coventry, United Kingdom
| | | |
Collapse
|
116
|
Amintas S, Giraud N, Fernandez B, Dupin C, Denost Q, Garant A, Frulio N, Smith D, Rullier A, Rullier E, Vuong T, Dabernat S, Vendrely V. The Crying Need for a Better Response Assessment in Rectal Cancer. Curr Treat Options Oncol 2023; 24:1507-1523. [PMID: 37702885 PMCID: PMC10643426 DOI: 10.1007/s11864-023-01125-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/09/2023] [Indexed: 09/14/2023]
Abstract
OPINION STATEMENT Since total neoadjuvant treatment achieves almost 30% pathologic complete response, organ preservation has been increasingly debated for good responders after neoadjuvant treatment for patients diagnosed with rectal cancer. Two organ preservation strategies are available: a watch and wait strategy and a local excision strategy including patients with a near clinical complete response. A major issue is the selection of patients according to the initial tumor staging or the response assessment. Despite modern imaging improvement, identifying complete response remains challenging. A better selection could be possible by radiomics analyses, exploiting numerous image features to feed data characterization algorithms. The subsequent step is to include baseline and/or pre-therapeutic MRI, PET-CT, and CT radiomics added to the patients' clinicopathological data, inside machine learning (ML) prediction models, with predictive or prognostic purposes. These models could be further improved by the addition of new biomarkers such as circulating tumor biomarkers, molecular profiling, or pathological immune biomarkers.
Collapse
Affiliation(s)
- Samuel Amintas
- Tumor Biology and Tumor Bank Laboratory, CHU Bordeaux, F-33600, Pessac, France.
- BRIC (BoRdeaux Institute of onCology), UMR1312, INSERM, University of Bordeaux, F-33000, Bordeaux, France.
| | - Nicolas Giraud
- Department of Radiation Oncology, CHU Bordeaux, F-33000, Bordeaux, France
| | | | - Charles Dupin
- BRIC (BoRdeaux Institute of onCology), UMR1312, INSERM, University of Bordeaux, F-33000, Bordeaux, France
- Department of Radiation Oncology, CHU Bordeaux, F-33000, Bordeaux, France
| | - Quentin Denost
- Bordeaux Colorectal Institute, F-33000, Bordeaux, France
| | - Aurelie Garant
- UT Southwestern Department of Radiation Oncology, Dallas, USA
| | - Nora Frulio
- Radiology Department, CHU Bordeaux, F-33600, Pessac, France
| | - Denis Smith
- Department of Digestive Oncology, CHU Bordeaux, F-33600, Pessac, France
| | - Anne Rullier
- Histology Department, CHU Bordeaux, F-33000, Bordeaux, France
| | - Eric Rullier
- BRIC (BoRdeaux Institute of onCology), UMR1312, INSERM, University of Bordeaux, F-33000, Bordeaux, France
- Surgery Department, CHU Bordeaux, F-33600, Pessac, France
| | - Te Vuong
- Department of Radiation Oncology, McGill University, Jewish General Hospital, Montreal, Canada
| | - Sandrine Dabernat
- BRIC (BoRdeaux Institute of onCology), UMR1312, INSERM, University of Bordeaux, F-33000, Bordeaux, France
- Biochemistry Department, CHU Bordeaux, F-33000, Bordeaux, France
| | - Véronique Vendrely
- BRIC (BoRdeaux Institute of onCology), UMR1312, INSERM, University of Bordeaux, F-33000, Bordeaux, France
- Department of Radiation Oncology, CHU Bordeaux, F-33000, Bordeaux, France
| |
Collapse
|
117
|
Liu R, Wu S, Yu HY, Zeng K, Liang Z, Li S, Hu Y, Yang Y, Ye L. Prediction model for hepatocellular carcinoma recurrence after hepatectomy: Machine learning-based development and interpretation study. Heliyon 2023; 9:e22458. [PMID: 38034691 PMCID: PMC10687050 DOI: 10.1016/j.heliyon.2023.e22458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 09/10/2023] [Accepted: 11/13/2023] [Indexed: 12/02/2023] Open
Abstract
Background Identifying patients with hepatocellular carcinoma (HCC) at high risk of recurrence after hepatectomy can help to implement timely interventional treatment. This study aimed to develop a machine learning (ML) model to predict the recurrence risk of HCC patients after hepatectomy. Methods We retrospectively collected 315 HCC patients who underwent radical hepatectomy at the Third Affiliated Hospital of Sun Yat-sen University from April 2013 to October 2017, and randomly divided them into the training and validation sets at a ratio of 7:3. According to the postoperative recurrence of HCC patients, the patients were divided into recurrence group and non-recurrence group, and univariate and multivariate logistic regression were performed for the two groups. We applied six machine learning algorithms to construct the prediction models and performed internal validation by 10-fold cross-validation. Shapley additive explanations (SHAP) method was applied to interpret the machine learning model. We also built a web calculator based on the best machine learning model to personalize the assessment of the recurrence risk of HCC patients after hepatectomy. Results A total of 13 variables were included in the machine learning models. The multilayer perceptron (MLP) machine learning model was proved to achieve optimal predictive value in test set (AUC = 0.680). The SHAP method displayed that γ-glutamyl transpeptidase (GGT), fibrinogen, neutrophil, aspartate aminotransferase (AST) and total bilirubin (TB) were the top 5 important factors for recurrence risk of HCC patients after hepatectomy. In addition, we further demonstrated the reliability of the model by analyzing two patients. Finally, we successfully constructed an online web prediction calculator based on the MLP machine learning model. Conclusion MLP was an optimal machine learning model for predicting the recurrence risk of HCC patients after hepatectomy. This predictive model can help identify HCC patients at high recurrence risk after hepatectomy to provide early and personalized treatment.
Collapse
Affiliation(s)
- Rongqiang Liu
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
- Department of Hepatobiliary Surgery, Renmin Hospital of Wuhan University, Wuhan, Hubei Province, China
| | - Shinan Wu
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
- Institute of Xiamen University, School of Medicine, Xiamen University, Xiamen, Fujian, China
| | - Hao yuan Yu
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Kaining Zeng
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Zhixing Liang
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Siqi Li
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Yongwei Hu
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Yang Yang
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Linsen Ye
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
118
|
Takahashi S, Terai H, Hoshino M, Tsujio T, Kato M, Toyoda H, Suzuki A, Tamai K, Yabu A, Nakamura H. Machine-learning-based approach for nonunion prediction following osteoporotic vertebral fractures. EUROPEAN SPINE JOURNAL : OFFICIAL PUBLICATION OF THE EUROPEAN SPINE SOCIETY, THE EUROPEAN SPINAL DEFORMITY SOCIETY, AND THE EUROPEAN SECTION OF THE CERVICAL SPINE RESEARCH SOCIETY 2023; 32:3788-3796. [PMID: 36269421 DOI: 10.1007/s00586-022-07431-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 10/02/2022] [Accepted: 10/17/2022] [Indexed: 11/06/2022]
Abstract
PURPOSE An osteoporotic vertebral fracture (OVF) is a common disease that causes disabilities in elderly patients. In particular, patients with nonunion following an OVF often experience severe back pain and require surgical intervention. However, nonunion diagnosis generally takes more than six months. Although several studies have advocated the use of magnetic resonance imaging (MRI) observations as predictive factors, they exhibit insufficient accuracy. The purpose of this study was to create a predictive model for OVF nonunion using machine learning (ML). METHODS We used datasets from two prospective cohort studies for OVF nonunion prediction based on conservative treatment. Among 573 patients with acute OVFs exceeding 65 years in age enrolled in this study, 505 were analyzed. The demographic data, fracture type, and MRI observations of both studies were analyzed using ML. The ML architecture utilized in this study included a logistic regression model, decision tree, extreme gradient boosting (XGBoost), and random forest (RF). The datasets were processed using Python. RESULTS The two ML algorithms, XGBoost and RF, exhibited higher area under the receiver operating characteristic curves (AUCs) than the logistic regression and decision tree models (AUC = 0.860 and 0.845 for RF and XGBoost, respectively). The present study found that MRI findings, anterior height ratio, kyphotic angle, BMI, VAS, age, posterior wall injury, fracture level, and smoking habit ranked as important features in the ML algorithms. CONCLUSION ML-based algorithms might be more effective than conventional methods for nonunion prediction following OVFs.
Collapse
Affiliation(s)
- Shinji Takahashi
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan.
| | - Hidetomi Terai
- Department of Orthopaedic Surgery, Osaka City General Hospital, 2-15-16, Miyakojima Hon-Dori, Miyakojima-Ku, Osaka, Japan
| | - Masatoshi Hoshino
- Department of Orthopaedic Surgery, Osaka City General Hospital, 2-15-16, Miyakojima Hon-Dori, Miyakojima-Ku, Osaka, Japan
| | - Tadao Tsujio
- Department of Orthopaedic Surgery, Shiraniwa Hospital, 6-10-1. Shiraniwadai, Ikoma City, Nara, Japan
| | - Minori Kato
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan
| | - Hiromitsu Toyoda
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan
| | - Akinobu Suzuki
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan
| | - Koji Tamai
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan
| | - Akito Yabu
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan
| | - Hiroaki Nakamura
- Department of Orthopaedic Surgery, Osaka Metropolitan University Graduate School of Medicine, 1-4-3 Asahi-Machi, Abeno-Ku, Osaka, 545-8585, Japan
| |
Collapse
|
119
|
Garg P, Mohanty A, Ramisetty S, Kulkarni P, Horne D, Pisick E, Salgia R, Singhal SS. Artificial intelligence and allied subsets in early detection and preclusion of gynecological cancers. Biochim Biophys Acta Rev Cancer 2023; 1878:189026. [PMID: 37980945 DOI: 10.1016/j.bbcan.2023.189026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 11/09/2023] [Accepted: 11/14/2023] [Indexed: 11/21/2023]
Abstract
Gynecological cancers including breast, cervical, ovarian, uterine, and vaginal, pose the greatest threat to world health, with early identification being crucial to patient outcomes and survival rates. The application of machine learning (ML) and artificial intelligence (AI) approaches to the study of gynecological cancer has shown potential to revolutionize cancer detection and diagnosis. The current review outlines the significant advancements, obstacles, and prospects brought about by AI and ML technologies in the timely identification and accurate diagnosis of different types of gynecological cancers. The AI-powered technologies can use genomic data to discover genetic alterations and biomarkers linked to a particular form of gynecologic cancer, assisting in the creation of targeted treatments. Furthermore, it has been shown that the potential benefits of AI and ML technologies in gynecologic tumors can greatly increase the accuracy and efficacy of cancer diagnosis, reduce diagnostic delays, and possibly eliminate the need for needless invasive operations. In conclusion, the review focused on the integrative part of AI and ML based tools and techniques in the early detection and exclusion of various cancer types; together with a collaborative coordination between research clinicians, data scientists, and regulatory authorities, which is suggested to realize the full potential of AI and ML in gynecologic cancer care.
Collapse
Affiliation(s)
- Pankaj Garg
- Department of Chemistry, GLA University, Mathura, Uttar Pradesh 281406, India
| | - Atish Mohanty
- Departments of Medical Oncology & Therapeutics Research, Molecular Medicine, Beckman Research Institute of City of Hope, Comprehensive Cancer Center and National Medical Center, Duarte, CA 91010, USA
| | - Sravani Ramisetty
- Departments of Medical Oncology & Therapeutics Research, Molecular Medicine, Beckman Research Institute of City of Hope, Comprehensive Cancer Center and National Medical Center, Duarte, CA 91010, USA
| | - Prakash Kulkarni
- Departments of Medical Oncology & Therapeutics Research, Molecular Medicine, Beckman Research Institute of City of Hope, Comprehensive Cancer Center and National Medical Center, Duarte, CA 91010, USA
| | - David Horne
- Molecular Medicine, Beckman Research Institute of City of Hope, Comprehensive Cancer Center and National Medical Center, Duarte, CA 91010, USA
| | - Evan Pisick
- Department of Medical Oncology, City of Hope, Chicago, IL 60099, USA
| | - Ravi Salgia
- Departments of Medical Oncology & Therapeutics Research, Molecular Medicine, Beckman Research Institute of City of Hope, Comprehensive Cancer Center and National Medical Center, Duarte, CA 91010, USA
| | - Sharad S Singhal
- Departments of Medical Oncology & Therapeutics Research, Molecular Medicine, Beckman Research Institute of City of Hope, Comprehensive Cancer Center and National Medical Center, Duarte, CA 91010, USA.
| |
Collapse
|
120
|
Shen Y, Goparaju C, Yang Y, Babu BA, Gai W, Pass H, Jiang G. Recurrence prediction of lung adenocarcinoma using an immune gene expression and clinical data trained and validated support vector machine classifier. Transl Lung Cancer Res 2023; 12:2055-2067. [PMID: 38025809 PMCID: PMC10654435 DOI: 10.21037/tlcr-23-473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Accepted: 10/20/2023] [Indexed: 12/01/2023]
Abstract
Background Immune microenvironment plays a critical role in cancer from onset to relapse. Machine learning (ML) algorithm can facilitate the analysis of lab and clinical data to predict lung cancer recurrence. Prompt detection and intervention are crucial for long-term survival in lung cancer relapse. Our study aimed to evaluate the clinical and genomic prognosticators for lung cancer recurrence by comparing the predictive accuracy of four ML models. Methods A total of 41 early-stage lung cancer patients who underwent surgery between June 2007 and October 2014 at New York University Langone Medical Center were included (with recurrence, n=16; without recurrence, n=25). All patients had tumor tissue and buffy coat collected at the time of resection. The CIBERSORT algorithm quantified tumor-infiltrating immune cells (TIICs). Protein-protein interaction (PPI) network and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were conducted to unearth potential molecular drivers of tumor progression. The data was split into training (75%) and validation sets (25%). Ensemble linear kernel support vector machine (SVM) ML models were developed using optimized clinical and genomic features to predict tumor recurrence. Results Activated natural killer (NK) cells, M0 macrophages, and M1 macrophages showed a positive correlation with progression. Conversely, T CD4+ memory resting cells were negatively correlated. In the PPI network, TNF and IL6 emerged as prominent hub genes. Prediction models integrating clinicopathological prognostic factors, tumor gene expression (45 genes), and buffy coat gene expression (47 genes) yielded varying receiver operating characteristic (ROC)-area under the curves (AUCs): 62.7%, 65.4%, and 59.7% in the training set, 58.3%, 83.3%, and 75.0% in the validation set, respectively. Notably, merging gene expression with clinical data in a linear SVM model led to a significant accuracy boost, with an AUC of 92.0% in training and 91.7% in validation. Conclusions Using ML algorithm, immune gene expression data from tumor tissue and buffy coat may enhance the precision of lung cancer recurrence prediction.
Collapse
Affiliation(s)
- Yingran Shen
- Department of Thoracic Surgery, Tongji University Affiliated Shanghai Pulmonary Hospital, Shanghai, China
| | - Chandra Goparaju
- Division of Cardiothoracic Surgery, New York University Langone Medical Center, New York, NY, USA
| | - Yang Yang
- Department of Thoracic Surgery, Tongji University Affiliated Shanghai Pulmonary Hospital, Shanghai, China
| | - Benson A. Babu
- Good Samaritan Hospital, Westchester Medical Center Network, Valhalla, NY, USA
| | - Weiming Gai
- Division of Cardiothoracic Surgery, New York University Langone Medical Center, New York, NY, USA
| | - Harvey Pass
- Division of Cardiothoracic Surgery, New York University Langone Medical Center, New York, NY, USA
| | - Gening Jiang
- Department of Thoracic Surgery, Tongji University Affiliated Shanghai Pulmonary Hospital, Shanghai, China
| |
Collapse
|
121
|
Gholipour M, Khajouei R, Amiri P, Hajesmaeel Gohari S, Ahmadian L. Extracting cancer concepts from clinical notes using natural language processing: a systematic review. BMC Bioinformatics 2023; 24:405. [PMID: 37898795 PMCID: PMC10613366 DOI: 10.1186/s12859-023-05480-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 09/13/2023] [Indexed: 10/30/2023] Open
Abstract
BACKGROUND Extracting information from free texts using natural language processing (NLP) can save time and reduce the hassle of manually extracting large quantities of data from incredibly complex clinical notes of cancer patients. This study aimed to systematically review studies that used NLP methods to identify cancer concepts from clinical notes automatically. METHODS PubMed, Scopus, Web of Science, and Embase were searched for English language papers using a combination of the terms concerning "Cancer", "NLP", "Coding", and "Registries" until June 29, 2021. Two reviewers independently assessed the eligibility of papers for inclusion in the review. RESULTS Most of the software programs used for concept extraction reported were developed by the researchers (n = 7). Rule-based algorithms were the most frequently used algorithms for developing these programs. In most articles, the criteria of accuracy (n = 14) and sensitivity (n = 12) were used to evaluate the algorithms. In addition, Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT) and Unified Medical Language System (UMLS) were the most commonly used terminologies to identify concepts. Most studies focused on breast cancer (n = 4, 19%) and lung cancer (n = 4, 19%). CONCLUSION The use of NLP for extracting the concepts and symptoms of cancer has increased in recent years. The rule-based algorithms are well-liked algorithms by developers. Due to these algorithms' high accuracy and sensitivity in identifying and extracting cancer concepts, we suggested that future studies use these algorithms to extract the concepts of other diseases as well.
Collapse
Affiliation(s)
- Maryam Gholipour
- Student Research Committee, Kerman University of Medical Sciences, Kerman, Iran
| | - Reza Khajouei
- Department of Health Information Sciences, Faculty of Management and Medical Information Sciences, Kerman University of Medical Sciences, Kerman, Iran
| | - Parastoo Amiri
- Student Research Committee, Kerman University of Medical Sciences, Kerman, Iran
| | - Sadrieh Hajesmaeel Gohari
- Medical Informatics Research Center, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran
| | - Leila Ahmadian
- Department of Health Information Sciences, Faculty of Management and Medical Information Sciences, Kerman University of Medical Sciences, Kerman, Iran.
| |
Collapse
|
122
|
Lei YP, Song QZ, Liu S, Xie JY, Lv GQ. Predicting lymph node metastasis in colorectal cancer: An analysis of influencing factors to develop a risk model. World J Gastrointest Surg 2023; 15:2234-2246. [PMID: 37969707 PMCID: PMC10642478 DOI: 10.4240/wjgs.v15.i10.2234] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/07/2023] [Accepted: 09/14/2023] [Indexed: 10/27/2023] Open
Abstract
BACKGROUND Colorectal cancer (CRC) is a significant global health issue, and lymph node metastasis (LNM) is a crucial prognostic factor. Accurate prediction of LNM is essential for developing individualized treatment strategies for patients with CRC. However, the prediction of LNM is challenging and depends on various factors such as tumor histology, clinicopathological features, and molecular characteristics. The most reliable method to detect LNM is the histopathological examination of surgically resected specimens; however, this method is invasive, time-consuming, and subject to sampling errors and interobserver variability. AIM To analyze influencing factors and develop and validate a risk prediction model for LNM in CRC based on a large patient queue. METHODS This study retrospectively analyzed 300 patients who underwent CRC surgery at two Peking University Shenzhen hospitals between January and December 2021. A deep learning approach was used to extract features potentially associated with LNM from primary tumor histological images while a logistic regression model was employed to predict LNM in CRC using machine-learning-derived features and clinicopathological variables as predictors. RESULTS The prediction model constructed for LNM in CRC was based on a logistic regression framework that incorporated machine learning-extracted features and clinicopathological variables. The model achieved high accuracy (0.86), sensitivity (0.81), specificity (0.87), positive predictive value (0.66), negative predictive value (0.94), area under the curve for the receiver operating characteristic (0.91), and a low Brier score (0.10). The model showed good agreement between the observed and predicted probabilities of LNM across a range of risk thresholds, indicating good calibration and clinical utility. CONCLUSION The present study successfully developed and validated a potent and effective risk-prediction model for LNM in patients with CRC. This model utilizes machine-learning-derived features extracted from primary tumor histology and clinicopathological variables, demonstrating superior performance and clinical applicability compared to existing models. The study provides new insights into the potential of deep learning to extract valuable information from tumor histology, in turn, improving the prediction of LNM in CRC and facilitate risk stratification and decision-making in clinical practice.
Collapse
Affiliation(s)
- Yun-Peng Lei
- Department of Gastrointestinal Surgery, Peking University Shenzhen Hospital, Shenzhen 518036, Guangdong Province, China
| | - Qing-Zhi Song
- Department of Gastrointestinal Surgery, Peking University Shenzhen Hospital, Shenzhen 518036, Guangdong Province, China
| | - Shuang Liu
- Department of Gastrointestinal Surgery, Peking University Shenzhen Hospital, Shenzhen 518036, Guangdong Province, China
| | - Ji-Yan Xie
- Department of Gastrointestinal Surgery, Peking University Shenzhen Hospital, Shenzhen 518036, Guangdong Province, China
| | - Guo-Qing Lv
- Department of Gastrointestinal Surgery, Shenzhen Peking University-The Hong Kong University of Science and Technology Medical Center, Peking University Shenzhen Hospital, Shenzhen 518036, Guangdong Province, China
| |
Collapse
|
123
|
Karabacak M, Jagtiani P, Carrasquilla A, Germano IM, Margetis K. Prognosis Individualized: Survival predictions for WHO grade II and III gliomas with a machine learning-based web application. NPJ Digit Med 2023; 6:200. [PMID: 37884599 PMCID: PMC10603035 DOI: 10.1038/s41746-023-00948-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Accepted: 10/13/2023] [Indexed: 10/28/2023] Open
Abstract
WHO grade II and III gliomas demonstrate diverse biological behaviors resulting in variable survival outcomes. In the context of glioma prognosis, machine learning (ML) approaches could facilitate the navigation through the maze of factors influencing survival, aiding clinicians in generating more precise and personalized survival predictions. Here we report the utilization of ML models in predicting survival at 12, 24, 36, and 60 months following grade II and III glioma diagnosis. From the National Cancer Database, we analyze 10,001 WHO grade II and 11,456 grade III cranial gliomas. Using the area under the receiver operating characteristic (AUROC) values, we deploy the top-performing models in a web application for individualized predictions. SHapley Additive exPlanations (SHAP) enhance the interpretability of the models. Top-performing predictive models are the ones built with LightGBM and Random Forest algorithms. For grade II gliomas, the models yield AUROC values ranging from 0.813 to 0.896 for predicting mortality across different timeframes, and for grade III gliomas, the models yield AUROCs ranging from 0.855 to 0.878. ML models provide individualized survival forecasts for grade II and III glioma patients across multiple clinically relevant time points. The user-friendly web application represents a pioneering digital tool to potentially integrate predictive analytics into neuro-oncology clinical practice, to empower prognostication and personalize clinical decision-making.
Collapse
Affiliation(s)
- Mert Karabacak
- Department of Neurosurgery, Mount Sinai Health System, New York, 10029, NY, USA
| | - Pemla Jagtiani
- School of Medicine, SUNY Downstate Health Sciences University, New York, 11203, NY, USA
| | | | - Isabelle M Germano
- Department of Neurosurgery, Mount Sinai Health System, New York, 10029, NY, USA
| | | |
Collapse
|
124
|
Loperfido A, Celebrini A, Marzetti A, Bellocchi G. Current role of artificial intelligence in head and neck cancer surgery: a systematic review of literature. EXPLORATION OF TARGETED ANTI-TUMOR THERAPY 2023; 4:933-940. [PMID: 37970203 PMCID: PMC10645467 DOI: 10.37349/etat.2023.00174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2023] [Accepted: 07/19/2023] [Indexed: 11/17/2023] Open
Abstract
Aim Artificial intelligence (AI) is a new field of science in which computers will provide decisions-supporting tools to help doctors make difficult clinical choices. Recent AI applications in otolaryngology include head and neck oncology, rhinology, neurotology, and laryngology. The aim of this systematic review is to describe the potential uses of AI in head and neck oncology with a special focus on the surgical field. Methods The authors performed a systematic review, in accordance with the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines, in the main medical databases, including PubMed, Scopus, and Cochrane Library, considering all original studies published until February 2023 about the role of AI in head and neck cancer surgery. The search strategy included a combination of the following terms: "artificial intelligence" or "machine learning" and "head and neck cancer". Results Overall, 303 papers were identified and after duplicate removal (12 papers) and excluding papers not written in English (1 paper) and off-topic (4 papers), papers were assessed for eligibility; finally, only 12 papers were included. Three main fields of clinical interest were identified: the most widely investigated included the role of AI in surgical margins assessment (7 papers); the second most frequently evaluated topic was complications assessment (4 papers); finally, only one paper dealt with the indication of salvage laryngectomy after primary radiotherapy. Conclusions The authors report the first systematic review in the literature concerning the role of AI in head and neck cancer surgery. An increasing influx of AI applications to clinical problems in otolaryngology is expected, so specialists should be increasingly prepared to manage the constant changes. It will always remain critical for clinicians to use their skills and knowledge to critically evaluate the additional information provided by AI and make the final decisions on each patient.
Collapse
Affiliation(s)
| | | | - Andrea Marzetti
- Department of Otolaryngology Head and Neck Surgery, Fabrizio Spaziani Hospital, 03100 Frosinone, Italy
| | | |
Collapse
|
125
|
Fei H, Han X, Wang Y, Li S. Novel immune-related gene signature for risk stratification and prognosis prediction in ovarian cancer. J Ovarian Res 2023; 16:205. [PMID: 37858138 PMCID: PMC10585734 DOI: 10.1186/s13048-023-01289-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 09/28/2023] [Indexed: 10/21/2023] Open
Abstract
BACKGROUND The immune system played a multifaceted role in ovarian cancer (OC) and was a significant mediator of ovarian carcinogenesis. Various immune cells and immune gene products played an integrated role in ovarian cancer (OC) progression, proved the significance of the immune microenvironment in prognosis. Therefore, we aimed to establish and validate an immune gene prognostic signature for OC patients' prognosis prediction. METHODS Differently expressed Immune-related genes (DEIRGs) were identified in 428 OC and 77 normal ovary tissue specimens from 9 independent GEO datasets. The Cancer Genome Atlas (TCGA) cohort was used as a training cohort, Univariate Cox analysis was used to identify prognostic DEIRGs in TCGA cohort. Then, an immune gene-based risk model for prognosis prediction was constructed using the LASSO regression analysis, and validated the accuracy and stability of the model in 374 and 93 OC patients in TCGA training cohort and International Cancer Genome Consortium (ICGC) validation cohort respectively. Finally, the correlation among risk score model, clinicopathological parameters, and immune cell infiltration were analyzed. RESULTS Five DEIRGs were identified to establish the immune gene signature and divided OC patients into the low- and high-risk groups. In TCGA and ICGC datasets, patients in the low-risk group showed a substantially higher survival rate than high-risk group. Receiver operating characteristic (ROC) curves, t-distributed stochastic neighbor embedding (t-SNE) analysis and principal component analysis (PCA) showed the good performance of the risk model. Clinicopathological correlation analysis proved the risk score model could serve as an independent prognostic factor in 2 independent datasets. CONCLUSIONS The prognostic model based on immune-related genes can function as a superior prognostic indicator for OC patients, which could provide evidence for individualized treatment and clinical decision making.
Collapse
Affiliation(s)
- Hongjun Fei
- Department of Reproductive Genetics, International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory of Embryo Original Diseases, Shanghai Municipal Key Clinical Specialty, Shanghai Jiao Tong University School of Medicine, No.910, Hengshan Road, Shanghai, 200030, China.
| | - Xu Han
- Department of Reproductive Genetics, International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory of Embryo Original Diseases, Shanghai Municipal Key Clinical Specialty, Shanghai Jiao Tong University School of Medicine, No.910, Hengshan Road, Shanghai, 200030, China
| | - Yanlin Wang
- Department of Reproductive Genetics, International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory of Embryo Original Diseases, Shanghai Municipal Key Clinical Specialty, Shanghai Jiao Tong University School of Medicine, No.910, Hengshan Road, Shanghai, 200030, China
| | - Shuyuan Li
- Department of Reproductive Genetics, International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory of Embryo Original Diseases, Shanghai Municipal Key Clinical Specialty, Shanghai Jiao Tong University School of Medicine, No.910, Hengshan Road, Shanghai, 200030, China.
| |
Collapse
|
126
|
Padwal MK, Basu S, Basu B. Application of Machine Learning in Predicting Hepatic Metastasis or Primary Site in Gastroenteropancreatic Neuroendocrine Tumors. Curr Oncol 2023; 30:9244-9261. [PMID: 37887568 PMCID: PMC10605255 DOI: 10.3390/curroncol30100668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 10/16/2023] [Accepted: 10/16/2023] [Indexed: 10/28/2023] Open
Abstract
Gastroenteropancreatic neuroendocrine tumors (GEP-NETs) account for 80% of gastroenteropancreatic neuroendocrine neoplasms (GEP-NENs). GEP-NETs are well-differentiated tumors, highly heterogeneous in biology and origin, and are often diagnosed at the metastatic stage. Diagnosis is commonly through clinical symptoms, histopathology, and PET-CT imaging, while molecular markers for metastasis and the primary site are unknown. Here, we report the identification of multi-gene signatures for hepatic metastasis and primary sites through analyses on RNA-SEQ datasets of pancreatic and small intestinal NETs tissue samples. Relevant gene features, identified from the normalized RNA-SEQ data using the mRMRe algorithm, were used to develop seven Machine Learning models (LDA, RF, CART, k-NN, SVM, XGBOOST, GBM). Two multi-gene random forest (RF) models classified primary and metastatic samples with 100% accuracy in training and test cohorts and >90% accuracy in an independent validation cohort. Similarly, three multi-gene RF models identified the pancreas or small intestine as the primary site with 100% accuracy in training and test cohorts, and >95% accuracy in an independent cohort. Multi-label models for concurrent prediction of hepatic metastasis and primary site returned >98.42% and >87.42% accuracies on training and test cohorts, respectively. A robust molecular signature to predict liver metastasis or the primary site for GEP-NETs is reported for the first time and could complement the clinical management of GEP-NETs.
Collapse
Affiliation(s)
- Mahesh Kumar Padwal
- Molecular Biology Division, Bhabha Atomic Research Centre, Mumbai 400085, India;
- Homi Bhabha National Institute, Mumbai 400094, India;
| | - Sandip Basu
- Homi Bhabha National Institute, Mumbai 400094, India;
- Radiation Medicine Centre, Bhabha Atomic Research Centre, Tata Memorial Hospital Annexe, Mumbai 400012, India
| | - Bhakti Basu
- Molecular Biology Division, Bhabha Atomic Research Centre, Mumbai 400085, India;
- Homi Bhabha National Institute, Mumbai 400094, India;
| |
Collapse
|
127
|
Peisen F, Gerken A, Hering A, Dahm I, Nikolaou K, Gatidis S, Eigentler TK, Amaral T, Moltz JH, Othman AE. Can Whole-Body Baseline CT Radiomics Add Information to the Prediction of Best Response, Progression-Free Survival, and Overall Survival of Stage IV Melanoma Patients Receiving First-Line Targeted Therapy: A Retrospective Register Study. Diagnostics (Basel) 2023; 13:3210. [PMID: 37892030 PMCID: PMC10605712 DOI: 10.3390/diagnostics13203210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2023] [Revised: 10/06/2023] [Accepted: 10/12/2023] [Indexed: 10/29/2023] Open
Abstract
BACKGROUND The aim of this study was to investigate whether the combination of radiomics and clinical parameters in a machine-learning model offers additive information compared with the use of only clinical parameters in predicting the best response, progression-free survival after six months, as well as overall survival after six and twelve months in patients with stage IV malignant melanoma undergoing first-line targeted therapy. METHODS A baseline machine-learning model using clinical variables (demographic parameters and tumor markers) was compared with an extended model using clinical variables and radiomic features of the whole tumor burden, utilizing repeated five-fold cross-validation. Baseline CTs of 91 stage IV malignant melanoma patients, all treated in the same university hospital, were identified in the Central Malignant Melanoma Registry and all metastases were volumetrically segmented (n = 4727). RESULTS Compared with the baseline model, the extended radiomics model did not add significantly more information to the best-response prediction (AUC [95% CI] 0.548 (0.188, 0.808) vs. 0.487 (0.139, 0.743)), the prediction of PFS after six months (AUC [95% CI] 0.699 (0.436, 0.958) vs. 0.604 (0.373, 0.867)), or the overall survival prediction after six and twelve months (AUC [95% CI] 0.685 (0.188, 0.967) vs. 0.766 (0.433, 1.000) and AUC [95% CI] 0.554 (0.163, 0.781) vs. 0.616 (0.271, 1.000), respectively). CONCLUSIONS The results showed no additional value of baseline whole-body CT radiomics for best-response prediction, progression-free survival prediction for six months, or six-month and twelve-month overall survival prediction for stage IV melanoma patients receiving first-line targeted therapy. These results need to be validated in a larger cohort.
Collapse
Affiliation(s)
- Felix Peisen
- Department of Diagnostic and Interventional Radiology, Tuebingen University Hospital, Eberhard Karls University, Hoppe-Seyler-Straße 3, 72076 Tuebingen, Germany; (I.D.); (K.N.); (S.G.); (A.E.O.)
| | - Annika Gerken
- Fraunhofer MEVIS, Max-von-Laue-Straße 2, 28359 Bremen, Germany; (A.G.); (A.H.); (J.H.M.)
| | - Alessa Hering
- Fraunhofer MEVIS, Max-von-Laue-Straße 2, 28359 Bremen, Germany; (A.G.); (A.H.); (J.H.M.)
- Diagnostic Image Analysis Group, Radboud University Medical Center (Radboudumc), Geert Grooteplein Zuid 10, 6525 GA Nijmegen, The Netherlands
| | - Isabel Dahm
- Department of Diagnostic and Interventional Radiology, Tuebingen University Hospital, Eberhard Karls University, Hoppe-Seyler-Straße 3, 72076 Tuebingen, Germany; (I.D.); (K.N.); (S.G.); (A.E.O.)
| | - Konstantin Nikolaou
- Department of Diagnostic and Interventional Radiology, Tuebingen University Hospital, Eberhard Karls University, Hoppe-Seyler-Straße 3, 72076 Tuebingen, Germany; (I.D.); (K.N.); (S.G.); (A.E.O.)
- Image-Guided and Functionally Instructed Tumor Therapies (iFIT), The Cluster of Excellence (EXC 2180), 72076 Tuebingen, Germany
| | - Sergios Gatidis
- Department of Diagnostic and Interventional Radiology, Tuebingen University Hospital, Eberhard Karls University, Hoppe-Seyler-Straße 3, 72076 Tuebingen, Germany; (I.D.); (K.N.); (S.G.); (A.E.O.)
- Max Planck Institute for Intelligent Systems, Max-Planck-Ring 4, 72076 Tuebingen, Germany
| | - Thomas K. Eigentler
- Center of Dermato-Oncology, Department of Dermatology, Tuebingen University Hospital, Eberhard Karls University, Liebermeisterstraße 25, 72076 Tuebingen, Germany; (T.K.E.); (T.A.)
- Department of Dermatology, Venereology and Allergology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humbolt-Universität zu Berlin, Luisenstraße 2, 10117 Berlin, Germany
| | - Teresa Amaral
- Center of Dermato-Oncology, Department of Dermatology, Tuebingen University Hospital, Eberhard Karls University, Liebermeisterstraße 25, 72076 Tuebingen, Germany; (T.K.E.); (T.A.)
| | - Jan H. Moltz
- Fraunhofer MEVIS, Max-von-Laue-Straße 2, 28359 Bremen, Germany; (A.G.); (A.H.); (J.H.M.)
| | - Ahmed E. Othman
- Department of Diagnostic and Interventional Radiology, Tuebingen University Hospital, Eberhard Karls University, Hoppe-Seyler-Straße 3, 72076 Tuebingen, Germany; (I.D.); (K.N.); (S.G.); (A.E.O.)
- Institute of Neuroradiology, Johannes Gutenberg University Hospital Mainz, Langenbeckstraße 1, 55131 Mainz, Germany
| |
Collapse
|
128
|
Harmantepe AT, Dikicier E, Gönüllü E, Ozdemir K, Kamburoğlu MB, Yigit M. A different way to diagnosis acute appendicitis: machine learning. POLISH JOURNAL OF SURGERY 2023; 96:38-43. [PMID: 38629278 DOI: 10.5604/01.3001.0053.5994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
<b><br>Indroduction:</b> Machine learning is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns, and make decisions with minimal human intervention.</br> <b><br>Aim:</b> Our aim is to predict acute appendicitis, which is the most common indication for emergency surgery, using machine learning algorithms with an easy and inexpensive method.</br> <b><br>Materials and methods:</b> Patients who were treated surgically with a prediagnosis of acute appendicitis in a single center between 2011 and 2021 were analyzed. Patients with right lower quadrant pain were selected. A total of 189 positive and 156 negative appendectomies were found. Gender and hemogram were used as features. Machine learning algorithms and data analysis were made in Python (3.7) programming language.</br> <b><br>Results:</b> Negative appendectomies were found in 62% (n = 97) of the women and in 38% (n = 59) of the men. Positive appendectomies were present in 38% (n = 72) of the women and 62% (n = 117) of the men. The accuracy in the test data was 82.7% in logistic regression, 68.9% in support vector machines, 78.1% in k-nearest neighbors, and 83.9% in neural networks. The accuracy in the voting classifier created with logistic regression, k-nearest neighbor, support vector machines, and artificial neural networks was 86.2%. In the voting classifier, the sensitivity was 83.7% and the specificity was 88.6%.</br> <b><br>Conclusions:</b> The results of our study show that machine learning is an effective method for diagnosing acute appendicitis. This study presents a practical, easy, fast, and inexpensive method to predict the diagnosis of acute appendicitis.</br>.
Collapse
Affiliation(s)
| | - Enis Dikicier
- Sakarya University Faculty of Medicine, Department of General Surgery
| | - Emre Gönüllü
- Sakarya University Education and Research Hospital, Department of General Surgery
| | | | | | - Merve Yigit
- Sakarya University Education and Research Hospital, Department of General Surgery
| |
Collapse
|
129
|
Chen H, Cao B, Yang J, Ren H, Xia X, Zhang X, Yan W, Liang X, Li C. Construction and effect evaluation of prediction model for red blood cell transfusion requirement in cesarean section based on artificial intelligence. BMC Med Inform Decis Mak 2023; 23:213. [PMID: 37828543 PMCID: PMC10568840 DOI: 10.1186/s12911-023-02286-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 09/04/2023] [Indexed: 10/14/2023] Open
Abstract
OBJECTIVES This study intends to build an artificial intelligence model for obstetric cesarean section surgery to evaluate the intraoperative blood transfusion volume before operation, and compare the model prediction results with the actual results to evaluate the accuracy of the artificial intelligence prediction model for intraoperative red blood cell transfusion in obstetrics. The advantages and disadvantages of intraoperative blood demand and identification of high-risk groups for blood transfusion provide data support and improvement suggestions for the realization of accurate blood management of obstetric cesarean section patients during the perioperative period. METHODS Using a machine learning algorithm, an intraoperative blood transfusion prediction model was trained. The differences between the predicted results and the actual results were compared by means of blood transfusion or not, blood transfusion volume, and blood transfusion volume targeting postoperative hemoglobin (Hb). RESULTS Area under curve of the model is 0.89. The accuracy of the model for blood transfusion was 96.85%. The statistical standard for the accuracy of the model blood transfusion volume is the calculation of 1U absolute error, the accuracy rate is 86.56%, and the accuracy rate of the blood transfusion population is 45.00%. In the simulation prediction results, 93.67% of the predicted and actual cases in no blood transfusion surgery; 63.45% of the same predicted blood transfusion in blood transfusion surgery, and only 20.00% of the blood transfusion volume is the same. CONCLUSIONS In conclusion, this study used machine learning algorithm to process, analyze and predict the results of a large sample of cesarean section clinical data, and found that the important predictors of blood transfusion during cesarean section included preoperative RBC, surgical method, the site of surgery, coagulation-related indicators, and other factors. At the same time, it was found that the overall accuracy of the AI model was higher than actual blood using. Although the prediction of blood transfusion volume was not well matched with the actual blood using, the model provided a perspective of preoperative identification of high blood transfusion risks. The results can provide good auxiliary decision support for preoperative evaluation of obstetric cesarean section, and then promote the realization of accurate perioperative blood management for obstetric cesarean section patients.
Collapse
Affiliation(s)
- Hang Chen
- School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, Shaanxi, China
- Department of Information Service, Shaanxi Provincial People's Hospital, Xi'an, 710068, Shaanxi, China
| | - Bowei Cao
- Department of Information Service, Shaanxi Provincial People's Hospital, Xi'an, 710068, Shaanxi, China
| | - Jiangcun Yang
- Department of Transfusion Medicine, Shaanxi Provincial People's Hospital, Xi'an, 710068, Shaanxi, China
| | - He Ren
- Beijing HealSci Technology Co., Ltd, Beijing, 100022, China
| | - Xingqiu Xia
- Beijing HealSci Technology Co., Ltd, Beijing, 100022, China
| | - Xiaowen Zhang
- Department of Information Service, Shaanxi Provincial People's Hospital, Xi'an, 710068, Shaanxi, China
| | - Wei Yan
- Department of Information Service, Shaanxi Provincial People's Hospital, Xi'an, 710068, Shaanxi, China
| | - Xiaodan Liang
- Department of Information Service, Shaanxi Provincial People's Hospital, Xi'an, 710068, Shaanxi, China
| | - Chen Li
- School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, Shaanxi, China.
| |
Collapse
|
130
|
Chen H, Wang S, Zhang Y, Gao X, Guan Y, Wu N, Wang X, Zhou T, Zhang Y, Cui D, Wang M, Zhang D, Wang J. A prognostic mathematical model based on tumor microenvironment-related genes expression for breast cancer patients. Front Oncol 2023; 13:1209707. [PMID: 37860187 PMCID: PMC10583559 DOI: 10.3389/fonc.2023.1209707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 09/18/2023] [Indexed: 10/21/2023] Open
Abstract
Background Tumor microenvironment (TME) status is closely related to breast cancer (BC) prognosis and systemic therapeutic effects. However, to date studies have not considered the interactions of immune and stromal cells at the gene expression level in BC as a whole. Herein, we constructed a predictive model, for adjuvant decision-making, by mining TME molecular expression information related to BC patient prognosis and drug treatment sensitivity. Methods Clinical information and gene expression profiles were extracted from The Cancer Genome Atlas (TCGA), with patients divided into high- and low-score groups according to immune/stromal scores. TME-related prognostic genes were identified using Kaplan-Meier analysis, functional enrichment analysis, and protein-protein interaction (PPI) networks, and validated in the Gene Expression Omnibus (GEO) database. Least absolute shrinkage and selection operator (LASSO) Cox regression analysis was used to construct and verify a prognostic model based on TME-related genes. In addition, the patients' response to chemotherapy and immunotherapy was assessed by survival outcome and immunohistochemistry (IPS). Immunohistochemistry (IHC) staining laid a solid foundation for exploring the value of novel therapeutic target genes. Results By dividing patients into low- and high-risk groups, a significant distinction in overall survival was found (p < 0.05). The risk model was independent of multiple clinicopathological parameters and accurately predicted prognosis in BC patients (p < 0.05). The nomogram-integrated risk score had high prediction accuracy and applicability, when compared with simple clinicopathological features. As predicted by the risk model, regardless of the chemotherapy regimen, the survival advantage of the low-risk group was evident in those patients receiving chemotherapy (p < 0.05). However, in patients receiving anthracycline (A) therapy, outcomes were not significantly different when compared with those receiving no-A therapy (p = 0.24), suggesting these patients may omit from A-containing adjuvant chemotherapy. Our risk model also effectively predicted tumor mutation burden (TMB) and immunotherapy efficacy in BC patients (p < 0.05). Conclusion The prognostic score model based on TME-related genes effectively predicted prognosis and chemotherapy effects in BC patients. The model provides a theoretical basis for novel driver-gene discover in BC and guides the decision-making for the adjuvant treatment of early breast cancer (eBC).
Collapse
Affiliation(s)
- Hong Chen
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Shan Wang
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Yuting Zhang
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Xue Gao
- Department of Pathology, The First Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Yufu Guan
- Department of Breast and Thyroid Surgery, Affiliated Zhongshan Hospital of Dalian University, Dalian, China
| | - Nan Wu
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Xinyi Wang
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Tianyang Zhou
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Ying Zhang
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Di Cui
- Information Center, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Mijia Wang
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Dianlong Zhang
- Department of Breast and Thyroid Surgery, Affiliated Zhongshan Hospital of Dalian University, Dalian, China
| | - Jia Wang
- Department of Breast Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| |
Collapse
|
131
|
Tang Q, Su Q, Wei L, Wang K, Jiang T. Identifying potential biomarkers for non-obstructive azoospermia using WGCNA and machine learning algorithms. Front Endocrinol (Lausanne) 2023; 14:1108616. [PMID: 37854191 PMCID: PMC10579891 DOI: 10.3389/fendo.2023.1108616] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 09/08/2023] [Indexed: 10/20/2023] Open
Abstract
Objective The cause and mechanism of non-obstructive azoospermia (NOA) is complicated; therefore, an effective therapy strategy is yet to be developed. This study aimed to analyse the pathogenesis of NOA at the molecular biological level and to identify the core regulatory genes, which could be utilised as potential biomarkers. Methods Three NOA microarray datasets (GSE45885, GSE108886, and GSE145467) were collected from the GEO database and merged into training sets; a further dataset (GSE45887) was then defined as the validation set. Differential gene analysis, consensus cluster analysis, and WGCNA were used to identify preliminary signature genes; then, enrichment analysis was applied to these previously screened signature genes. Next, 4 machine learning algorithms (RF, SVM, GLM, and XGB) were used to detect potential biomarkers that are most closely associated with NOA. Finally, a diagnostic model was constructed from these potential biomarkers and visualised as a nomogram. The differential expression and predictive reliability of the biomarkers were confirmed using the validation set. Furthermore, the competing endogenous RNA network was constructed to identify the regulatory mechanisms of potential biomarkers; further, the CIBERSORT algorithm was used to calculate immune infiltration status among the samples. Results A total of 215 differentially expressed genes (DEGs) were identified between NOA and control groups (27 upregulated and 188 downregulated genes). The WGCNA results identified 1123 genes in the MEblue module as target genes that are highly correlated with NOA positivity. The NOA samples were divided into 2 clusters using consensus clustering; further, 1027 genes in the MEblue module, which were screened by WGCNA, were considered to be target genes that are highly correlated with NOA classification. The 129 overlapping genes were then established as signature genes. The XGB algorithm that had the maximum AUC value (AUC=0.946) and the minimum residual value was used to further screen the signature genes. IL20RB, C9orf117, HILS1, PAOX, and DZIP1 were identified as potential NOA biomarkers. This 5 biomarker model had the highest AUC value, of up to 0.982, compared to other single biomarker models; additionally, the results of this biomarker model were verified in the validation set. Conclusions As IL20RB, C9orf117, HILS1, PAOX, and DZIP1 have been determined to possess the strongest association with NOA, these five genes could be used as potential therapeutic targets for NOA patients. Furthermore, the model constructed using these five genes, which possessed the highest diagnostic accuracy, may be an effective biomarker model that warrants further experimental validation.
Collapse
Affiliation(s)
- Qizhen Tang
- Department of Urology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Quanxin Su
- Department of Urology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Letian Wei
- Department of Urology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Kenan Wang
- Department of Urology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Tao Jiang
- Department of Andrology, The Second Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| |
Collapse
|
132
|
Sahu A, Das PK, Meher S. Recent advancements in machine learning and deep learning-based breast cancer detection using mammograms. Phys Med 2023; 114:103138. [PMID: 37914431 DOI: 10.1016/j.ejmp.2023.103138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 07/22/2023] [Accepted: 09/14/2023] [Indexed: 11/03/2023] Open
Abstract
OBJECTIVE Mammogram-based automatic breast cancer detection has a primary role in accurate cancer diagnosis and treatment planning to save valuable lives. Mammography is one basic yet efficient test for screening breast cancer. Very few comprehensive surveys have been presented to briefly analyze methods for detecting breast cancer with mammograms. In this article, our objective is to give an overview of recent advancements in machine learning (ML) and deep learning (DL)-based breast cancer detection systems. METHODS We give a structured framework to categorize mammogram-based breast cancer detection techniques. Several publicly available mammogram databases and different performance measures are also mentioned. RESULTS After deliberate investigation, we find most of the works classify breast tumors either as normal-abnormal or malignant-benign rather than classifying them into three classes. Furthermore, DL-based features are more significant than hand-crafted features. However, transfer learning is preferred over others as it yields better performance in small datasets, unlike classical DL techniques. SIGNIFICANCE AND CONCLUSION In this article, we have made an attempt to give recent advancements in artificial intelligence (AI)-based breast cancer detection systems. Furthermore, a number of challenging issues and possible research directions are mentioned, which will help researchers in further scopes of research in this field.
Collapse
Affiliation(s)
- Adyasha Sahu
- Department of Electronics and Communication Engineering, National Institute of Technology, Rourkela, Odisha, 769008, India.
| | - Pradeep Kumar Das
- School of Electronics Engineering (SENSE), VIT Vellore, Tamil Nadu, 632014, India.
| | - Sukadev Meher
- Department of Electronics and Communication Engineering, National Institute of Technology, Rourkela, Odisha, 769008, India.
| |
Collapse
|
133
|
Sufyan M, Shokat Z, Ashfaq UA. Artificial intelligence in cancer diagnosis and therapy: Current status and future perspective. Comput Biol Med 2023; 165:107356. [PMID: 37688994 DOI: 10.1016/j.compbiomed.2023.107356] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 07/21/2023] [Accepted: 08/12/2023] [Indexed: 09/11/2023]
Abstract
Artificial intelligence (AI) in healthcare plays a pivotal role in combating many fatal diseases, such as skin, breast, and lung cancer. AI is an advanced form of technology that uses mathematical-based algorithmic principles similar to those of the human mind for cognizing complex challenges of the healthcare unit. Cancer is a lethal disease with many etiologies, including numerous genetic and epigenetic mutations. Cancer being a multifactorial disease is difficult to be diagnosed at an early stage. Therefore, genetic variations and other leading factors could be identified in due time through AI and machine learning (ML). AI is the synergetic approach for mining the drug targets, their mechanism of action, and drug-organism interaction from massive raw data. This synergetic approach is also facing several challenges in data mining but computational algorithms from different scientific communities for multi-target drug discovery are highly helpful to overcome the bottlenecks in AI for drug-target discovery. AI and ML could be the epicenter in the medical world for the diagnosis, treatment, and evaluation of almost any disease in the near future. In this comprehensive review, we explore the immense potential of AI and ML when integrated with the biological sciences, specifically in the context of cancer research. Our goal is to illuminate the many ways in which AI and ML are being applied to the study of cancer, from diagnosis to individualized treatment. We highlight the prospective role of AI in supporting oncologists and other medical professionals in making informed decisions and improving patient outcomes by examining the intersection of AI and cancer control. Although AI-based medical therapies show great potential, many challenges must be overcome before they can be implemented in clinical practice. We critically assess the current hurdles and provide insights into the future directions of AI-driven approaches, aiming to pave the way for enhanced cancer interventions and improved patient care.
Collapse
Affiliation(s)
- Muhammad Sufyan
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Pakistan.
| | - Zeeshan Shokat
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Pakistan.
| | - Usman Ali Ashfaq
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Pakistan.
| |
Collapse
|
134
|
Hui X, Wang Y, Li W, Yuan Y, Tao X, Lv R. Nd-Mn Molecular Cluster with Searched Targets for Oral Cancer Imaging. Mol Imaging Biol 2023; 25:875-886. [PMID: 37256508 DOI: 10.1007/s11307-023-01828-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 04/10/2023] [Accepted: 05/11/2023] [Indexed: 06/01/2023]
Abstract
In this research, we designed a novel NIR II luminescence imaging probe with targeting effect to accurately track oral squamous cell carcinoma (OSCC) cells. Massive gene expression data were processed by weighted gene co-expression network analysis to establish a network of relationships between genes. After clustering, correlation of clinical information, and gene functional enrichment analysis, MMP1 was predicted to be a biomarker/therapeutic target for OSCC cells. To obtain rare-earth probes with better luminescence in the NIR II region, we adjusted the doping ratio of the rare-earth element (Nd, Gd, Er, and Yb) fraction of the Nd-Mn molecular cluster to optimize its luminescence properties. The results of in vitro targeting experiments showed that Nd-Mn-MMP1Ab can target Cal-27 cells, demonstrating at the cellular level that the MMP1 gene is a biomarker for oral cancer, which also proves that the cancer targets predicted by the bioinformatics approach are correct.
Collapse
Affiliation(s)
- Xin Hui
- Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shanxi, China
| | - Yanxing Wang
- Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shanxi, China
| | - Wenjing Li
- Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shanxi, China
| | - Ying Yuan
- Department of Medical Interdisciplinary Research, Xi'an Ninth Hospital Affiliated to Medical College of Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China
| | - Xiaofeng Tao
- Department of Medical Interdisciplinary Research, Xi'an Ninth Hospital Affiliated to Medical College of Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China.
| | - Ruichan Lv
- Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shanxi, China.
| |
Collapse
|
135
|
El Naqa I, Karolak A, Luo Y, Folio L, Tarhini AA, Rollison D, Parodi K. Translation of AI into oncology clinical practice. Oncogene 2023; 42:3089-3097. [PMID: 37684407 DOI: 10.1038/s41388-023-02826-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 08/23/2023] [Accepted: 08/25/2023] [Indexed: 09/10/2023]
Abstract
Artificial intelligence (AI) is a transformative technology that is capturing popular imagination and can revolutionize biomedicine. AI and machine learning (ML) algorithms have the potential to break through existing barriers in oncology research and practice such as automating workflow processes, personalizing care, and reducing healthcare disparities. Emerging applications of AI/ML in the literature include screening and early detection of cancer, disease diagnosis, response prediction, prognosis, and accelerated drug discovery. Despite this excitement, only few AI/ML models have been properly validated and fewer have become regulated products for routine clinical use. In this review, we highlight the main challenges impeding AI/ML clinical translation. We present different clinical use cases from the domains of radiology, radiation oncology, immunotherapy, and drug discovery in oncology. We dissect the unique challenges and opportunities associated with each of these cases. Finally, we summarize the general requirements for successful AI/ML implementation in the clinic, highlighting specific examples and points of emphasis including the importance of multidisciplinary collaboration of stakeholders, role of domain experts in AI augmentation, transparency of AI/ML models, and the establishment of a comprehensive quality assurance program to mitigate risks of training bias and data drifts, all culminating toward safer and more beneficial AI/ML applications in oncology labs and clinics.
Collapse
Affiliation(s)
- Issam El Naqa
- Department of Machine Learning, Moffitt Cancer Center, Tampa, FL, 33612, USA.
| | - Aleksandra Karolak
- Department of Machine Learning, Moffitt Cancer Center, Tampa, FL, 33612, USA
| | - Yi Luo
- Department of Machine Learning, Moffitt Cancer Center, Tampa, FL, 33612, USA
| | - Les Folio
- Diagnostic Imaging & Interventional Radiology, Moffitt Cancer Center, Tampa, FL, 33612, USA
| | - Ahmad A Tarhini
- Cutaneous Oncology and Immunology, Moffitt Cancer Center, Tampa, FL, 33612, USA
| | - Dana Rollison
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, 33612, USA
| | - Katia Parodi
- Department of Medical Physics, Ludwig-Maximilians-Universität München, Munich, Germany
| |
Collapse
|
136
|
Liang X, Guan R, Zhu J, Meng Y, Zhu J, Yang Y, Cui Y, Dai J, Mao W, Lv L, Shen D, Guo R. A clinical decision support system to predict the efficacy for EGFR-TKIs based on artificial neural network. J Cancer Res Clin Oncol 2023; 149:12265-12274. [PMID: 37434091 DOI: 10.1007/s00432-023-05104-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 06/30/2023] [Indexed: 07/13/2023]
Abstract
BACKGROUND The efficacy of epidermal growth factor receptor (EGFR)-tyrosine kinase inhibitor (TKI) was affected by numerous factors. In the study, we developed and validated an artificial neural network (ANN) system based on clinical characteristics and next-generation sequencing (NGS) to support clinical decisions. METHODS A multicenter retrospective non-interventional study was conducted. 240 patients from three hospitals with advanced non-small cell lung cancer (NSCLC) and EGFR mutation were tested by NGS before the first treatment. All patients received formal EGFR-TKIs treatment. Five different models were individually trained to predict the efficacy of EGFR-TKIs based on one medical center with 188 patients. Two independent cohorts from other medical centers were collected for external validation. RESULTS Compared with logistic regression, four machine learning methods showed better predicting abilities for EGFR-TKIs. The inclusion of NGS tests improved the predictive power of models. ANN performed best on the dataset with mutations TP53, RB1, PIK3CA, EGFR mutation sites, and tumor mutation burden (TMB). The prediction accuracy, recall and AUC were 0.82, 0.82, and 0.82, respectively in our final model. In the external validation set, ANN still showed good performance and differentiated patients with poor outcomes. Finally, a clinical decision support software based on ANN was developed and provided a visualization interface for clinicians. CONCLUSION This study provides an approach to assess the efficacy of NSCLC patients with first-line EGFR-TKI treatment. Software is developed to support clinical decisions.
Collapse
Affiliation(s)
- Xiao Liang
- Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Department of Oncology, Jiangyin People's Hospital, Jiangyin, China
| | - Runwei Guan
- School of Electronics and Computer Science, University of Southampton, Southampton, UK
| | - Jiamin Zhu
- Department of Oncology, Jiangyin People's Hospital, Jiangyin, China
| | - Yue Meng
- Department of Oncology, Affiliated Hospital of Nantong University, Nantong, China
| | - Jing Zhu
- Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Department of Oncology, The Affiliated Jiangning Hospital With Nanjing Medical University, Nanjing, China
| | - Yuxiang Yang
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Yanan Cui
- Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Jiali Dai
- Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Weidong Mao
- Department of Oncology, Jiangyin People's Hospital, Jiangyin, China
| | - Liting Lv
- Department of Oncology, Affiliated Hospital of Nantong University, Nantong, China.
| | - Dong Shen
- Department of Oncology, Jiangyin People's Hospital, Jiangyin, China.
| | - Renhua Guo
- Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
| |
Collapse
|
137
|
Ma X, Pierce E, Anand H, Aviles N, Kunk P, Alemazkoor N. Early prediction of response to palliative chemotherapy in patients with stage-IV gastric and esophageal cancer. BMC Cancer 2023; 23:910. [PMID: 37759332 PMCID: PMC10536729 DOI: 10.1186/s12885-023-11422-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 09/20/2023] [Indexed: 09/29/2023] Open
Abstract
BACKGROUND The goal of therapy for many patients with advanced stage malignancies, including those with metastatic gastric and esophageal cancers, is to extend overall survival while also maintaining quality of life. After weighing the risks and benefits of treatment with palliative chemotherapy (PC) with non-curative intent, many patients decide to pursue treatment. It is known that a subset of patients who are treated with PC experience significant side effects without clinically significant survival benefits from PC. METHODS We use data from 150 patients with stage-IV gastric and esophageal cancers to train machine learning models that predict whether a patient with stage-IV gastric or esophageal cancers would benefit from PC, in terms of increased survival duration, at very early stages of the treatment. RESULTS Our findings show that machine learning can predict with high accuracy whether a patient will benefit from PC at the time of diagnosis. More accurate predictions can be obtained after only two cycles of PC (i.e., about 4 weeks after diagnosis). The results from this study are promising with regard to potential improvements in quality of life for patients near the end of life and a potential overall survival benefit by optimizing systemic therapy earlier in the treatment course of patients.
Collapse
Affiliation(s)
- Xiaoyuan Ma
- Department of Statistics, University of Virginia, Charlottesville, USA
| | - Eric Pierce
- School of Medicine, University of Virginia, Charlottesville, USA
| | - Harsh Anand
- System and Information Engineering, University of Virginia, Charlottesville, USA
| | - Natalie Aviles
- Department of Sociology, University of Virginia, Charlottesville, USA
| | - Paul Kunk
- School of Medicine, University of Virginia, Charlottesville, USA
| | - Negin Alemazkoor
- System and Information Engineering, University of Virginia, Charlottesville, USA.
| |
Collapse
|
138
|
Nazir A, Memon Z, Sadiq T, Rahman H, Khan IU. A Novel Feature-Selection Algorithm in IoT Networks for Intrusion Detection. SENSORS (BASEL, SWITZERLAND) 2023; 23:8153. [PMID: 37836983 PMCID: PMC10575335 DOI: 10.3390/s23198153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 09/19/2023] [Accepted: 09/25/2023] [Indexed: 10/15/2023]
Abstract
The Internet of Things (IoT) and network-enabled smart devices are crucial to the digitally interconnected society of the present day. However, the increased reliance on IoT devices increases their susceptibility to malicious activities within network traffic, posing significant challenges to cybersecurity. As a result, both system administrators and end users are negatively affected by these malevolent behaviours. Intrusion-detection systems (IDSs) are commonly deployed as a cyber attack defence mechanism to mitigate such risks. IDS plays a crucial role in identifying and preventing cyber hazards within IoT networks. However, the development of an efficient and rapid IDS system for the detection of cyber attacks remains a challenging area of research. Moreover, IDS datasets contain multiple features, so the implementation of feature selection (FS) is required to design an effective and timely IDS. The FS procedure seeks to eliminate irrelevant and redundant features from large IDS datasets, thereby improving the intrusion-detection system's overall performance. In this paper, we propose a hybrid wrapper-based feature-selection algorithm that is based on the concepts of the Cellular Automata (CA) engine and Tabu Search (TS)-based aspiration criteria. We used a Random Forest (RF) ensemble learning classifier to evaluate the fitness of the selected features. The proposed algorithm, CAT-S, was tested on the TON_IoT dataset. The simulation results demonstrate that the proposed algorithm, CAT-S, enhances classification accuracy while simultaneously reducing the number of features and the false positive rate.
Collapse
Affiliation(s)
- Anjum Nazir
- Department of Computer Science, National University of Computer and Emerging Sciences (NUCES—FAST), Karachi 75123, Pakistan; (A.N.); (Z.M.)
| | - Zulfiqar Memon
- Department of Computer Science, National University of Computer and Emerging Sciences (NUCES—FAST), Karachi 75123, Pakistan; (A.N.); (Z.M.)
| | - Touseef Sadiq
- Centre for Artificial Intelligence Research, Department of Information and Communication Technology, University of Agder, Jon Lilletuns vei 9, 4879 Grimstad, Norway
| | - Hameedur Rahman
- Department of Computer Games Development, Faculty of Computing & AI, Air University, E9, Islamabad 44400, Pakistan;
| | - Inam Ullah Khan
- Department of Electronic Engineering, School of Engineering & Applied Sciences (SEAS), Isra University, Islamabad Campus, Islamabad 44400, Pakistan;
| |
Collapse
|
139
|
Meng Y, Duan Q, Jiao K, Xue J. A screened predictive model for esophageal squamous cell carcinoma based on salivary flora data. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:18368-18385. [PMID: 38052562 DOI: 10.3934/mbe.2023816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
Esophageal squamous cell carcinoma (ESCC) is a malignant tumor of the digestive system in the esophageal squamous epithelium. Many studies have linked esophageal cancer (EC) to the imbalance of oral microecology. In this work, different machine learning (ML) models including Random Forest (RF), Gaussian mixture model (GMM), K-nearest neighbor (KNN), logistic regression (LR), support vector machine (SVM) and extreme gradient boosting (XGBoost) based on Genetic Algorithm (GA) optimization was developed to predict the relationship between salivary flora and ESCC by combining the relative abundance data of Bacteroides, Firmicutes, Proteobacteria, Fusobacteria and Actinobacteria in the saliva of patients with ESCC and healthy control. The results showed that the XGBoost model without parameter optimization performed best on the entire dataset for ESCC diagnosis by cross-validation (Accuracy = 73.50%). Accuracy and the other evaluation indicators, including Precision, Recall, F1-score and the area under curve (AUC) of the receiver operating characteristic (ROC), revealed XGBoost optimized by the GA (GA-XGBoost) achieved the best outcome on the testing set (Accuracy = 89.88%, Precision = 89.43%, Recall = 90.75%, F1-score = 90.09%, AUC = 0.97). The predictive ability of GA-XGBoost was validated in phylum-level salivary microbiota data from ESCC patients and controls in an external cohort. The results obtained in this validation (Accuracy = 70.60%, Precision = 46.00%, Recall = 90.55%, F1-score = 61.01%) illustrate the reliability of the predictive performance of the model. The feature importance rankings obtained by XGBoost indicate that Bacteroides and Actinobacteria are the two most important factors in predicting ESCC. Based on these results, GA-XGBoost can predict and diagnose ESCC according to the relative abundance of salivary flora, providing an effective tool for the non-invasive prediction of esophageal malignancies.
Collapse
Affiliation(s)
- Yunxiang Meng
- School of Mathematics and Statistics, Xi'an JiaoTong University, Xi'an, China
| | - Qihong Duan
- School of Mathematics and Statistics, Xi'an JiaoTong University, Xi'an, China
| | - Kai Jiao
- Department of Oral Mucosal Diseases, State Key Laboratory of Military Stomatology & National Clinical Research Center for Oral Diseases & Shaanxi Key Laboratory of Stomatology, School of Stomatology, The Fourth Military Medical University, Xi'an, China
| | - Jiang Xue
- School of Mathematics and Statistics, Xi'an JiaoTong University, Xi'an, China
| |
Collapse
|
140
|
Ospina R, Ferreira AGO, de Oliveira HM, Leiva V, Castro C. On the Use of Machine Learning Techniques and Non-Invasive Indicators for Classifying and Predicting Cardiac Disorders. Biomedicines 2023; 11:2604. [PMID: 37892978 PMCID: PMC10604302 DOI: 10.3390/biomedicines11102604] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 09/14/2023] [Accepted: 09/19/2023] [Indexed: 10/29/2023] Open
Abstract
This research aims to enhance the classification and prediction of ischemic heart diseases using machine learning techniques, with a focus on resource efficiency and clinical applicability. Specifically, we introduce novel non-invasive indicators known as Campello de Souza features, which require only a tensiometer and a clock for data collection. These features were evaluated using a comprehensive dataset of heart disease cases from a machine learning data repository. Our findings highlight the ability of machine learning algorithms to not only streamline diagnostic procedures but also reduce diagnostic errors and the dependency on extensive clinical testing. Three key features-mean arterial pressure, pulsatile blood pressure index, and resistance-compliance indicator-were found to significantly improve the accuracy of machine learning algorithms in binary heart disease classification. Logistic regression achieved the highest average accuracy among the examined classifiers when utilizing these features. While such novel indicators contribute substantially to the classification process, they should be integrated into a broader diagnostic framework that includes comprehensive patient evaluations and medical expertise. Therefore, the present study offers valuable insights for leveraging data science techniques in the diagnosis and management of cardiovascular diseases.
Collapse
Affiliation(s)
- Raydonal Ospina
- Department of Statistics, Universidade Federal da Bahia, Salvador 40110-909, Brazil
- Department of Statistics, CASTLab, Universidade Federal de Pernambuco, Recife 50670-901, Brazil
| | - Adenice G. O. Ferreira
- Department of Statistics, CASTLab, Universidade Federal de Pernambuco, Recife 50670-901, Brazil
| | - Hélio M. de Oliveira
- Department of Statistics, CASTLab, Universidade Federal de Pernambuco, Recife 50670-901, Brazil
| | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile
| | - Cecilia Castro
- Centre of Mathematics, Universidade do Minho, 4710-057 Braga, Portugal
| |
Collapse
|
141
|
Karim MR, Islam T, Shajalal M, Beyan O, Lange C, Cochez M, Rebholz-Schuhmann D, Decker S. Explainable AI for Bioinformatics: Methods, Tools and Applications. Brief Bioinform 2023; 24:bbad236. [PMID: 37478371 DOI: 10.1093/bib/bbad236] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 05/10/2023] [Accepted: 05/26/2023] [Indexed: 07/23/2023] Open
Abstract
Artificial intelligence (AI) systems utilizing deep neural networks and machine learning (ML) algorithms are widely used for solving critical problems in bioinformatics, biomedical informatics and precision medicine. However, complex ML models that are often perceived as opaque and black-box methods make it difficult to understand the reasoning behind their decisions. This lack of transparency can be a challenge for both end-users and decision-makers, as well as AI developers. In sensitive areas such as healthcare, explainability and accountability are not only desirable properties but also legally required for AI systems that can have a significant impact on human lives. Fairness is another growing concern, as algorithmic decisions should not show bias or discrimination towards certain groups or individuals based on sensitive attributes. Explainable AI (XAI) aims to overcome the opaqueness of black-box models and to provide transparency in how AI systems make decisions. Interpretable ML models can explain how they make predictions and identify factors that influence their outcomes. However, the majority of the state-of-the-art interpretable ML methods are domain-agnostic and have evolved from fields such as computer vision, automated reasoning or statistics, making direct application to bioinformatics problems challenging without customization and domain adaptation. In this paper, we discuss the importance of explainability and algorithmic transparency in the context of bioinformatics. We provide an overview of model-specific and model-agnostic interpretable ML methods and tools and outline their potential limitations. We discuss how existing interpretable ML methods can be customized and fit to bioinformatics research problems. Further, through case studies in bioimaging, cancer genomics and text mining, we demonstrate how XAI methods can improve transparency and decision fairness. Our review aims at providing valuable insights and serving as a starting point for researchers wanting to enhance explainability and decision transparency while solving bioinformatics problems. GitHub: https://github.com/rezacsedu/XAI-for-bioinformatics.
Collapse
Affiliation(s)
- Md Rezaul Karim
- Computer Science 5 - Information Systems and Databases, RWTH Aachen University, Germany
- Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Germany
| | - Tanhim Islam
- Computer Science 9 - Process and Data Science, RWTH Aachen University, Germany
| | | | - Oya Beyan
- Computer Science 5 - Information Systems and Databases, RWTH Aachen University, Germany
- University of Cologne, Faculty of Medicine and University Hospital Cologne, Institute for Medical Informatics, Germany
| | - Christoph Lange
- Computer Science 5 - Information Systems and Databases, RWTH Aachen University, Germany
- Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Germany
| | - Michael Cochez
- Department of Computer Science, Vrije Universiteit Amsterdam, the Netherlands
- Elsevier Discovery Lab, Amsterdam, the Netherlands
| | - Dietrich Rebholz-Schuhmann
- ZBMED - Information Center for Life Sciences, Cologne, Germany
- Faculty of Medicine, University of Cologne, Germany
| | - Stefan Decker
- Computer Science 5 - Information Systems and Databases, RWTH Aachen University, Germany
- Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Germany
| |
Collapse
|
142
|
Nakamura W, Sumitomo M, Zennami K, Takenaka M, Ichino M, Takahara K, Teramoto A, Shiroki R. Combination of deep learning and ensemble machine learning using intraoperative video images strongly predicts recovery of urinary continence after robot-assisted radical prostatectomy. Cancer Rep (Hoboken) 2023; 6:e1861. [PMID: 37449339 PMCID: PMC10480482 DOI: 10.1002/cnr2.1861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 06/20/2023] [Accepted: 06/25/2023] [Indexed: 07/18/2023] Open
Abstract
BACKGROUND We recently reported the importance of deep learning (DL) of pelvic magnetic resonance imaging in predicting the degree of urinary incontinence (UI) following robot-assisted radical prostatectomy (RARP). However, our results were limited because the prediction accuracy was approximately 70%. AIM To develop a more precise prediction model that can inform patients about UI recovery post-RARP surgery using a DL model based on intraoperative video images. METHODS AND RESULTS The study cohort comprised of 101 patients with localized prostate cancer undergoing RARP. Three snapshots from intraoperative video recordings showing the pelvic cavity (prior to bladder neck incision, immediately following prostate removal, and after vesicourethral anastomosis) were evaluated, including pre- and intraoperative parameters. We evaluated the DL model plus simple or ensemble machine learning (ML), and the area under the receiver operating characteristic curve (AUC) was analyzed through sensitivity and specificity. Of 101, 64 and 37 patients demonstrated "early continence (using 0 or 1 safety pad at 3 months post-RARP)" and "late continence (others)," respectively, at 3 months postoperatively. The combination of DL and simple ML using intraoperative video snapshots with clinicopathological parameters had a notably high performance (AUC, 0.683-0.749) to predict early recovery from UI after surgery. Furthermore, combining DL with ensemble artificial neural network using intraoperative video snapshots had the highest performance (AUC, 0.882; sensitivity, 92.2%; specificity, 78.4%; overall accuracy, 85.3%) to predict early recovery from post-RARP incontinence, with similar results by internal validation. The addition of clinicopathological parameters showed no additive effects for each analysis using DL, EL and simple ML. CONCLUSION Our findings suggest that the DL algorithm with intraoperative video imaging is a reliable method for informing patients about the severity of their recovery from UI after RARP, although it is not clear if our methods are reproducible for predicting long-term UI and pad-free continence.
Collapse
Affiliation(s)
- Wataru Nakamura
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
| | - Makoto Sumitomo
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
- Fujita Cancer CenterFujita Health UniversityToyoakeJapan
| | - Kenji Zennami
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
| | - Masashi Takenaka
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
| | - Manabu Ichino
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
| | - Kiyoshi Takahara
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
| | - Atsushi Teramoto
- Faculty of Radiological Technology, School of Medical SciencesFujita Health UniversityToyoakeJapan
- Faculty of Information EngineeringMeijo UniversityNagoyaJapan
| | - Ryoichi Shiroki
- Department of Urology, School of MedicineFujita Health UniversityToyoakeJapan
| |
Collapse
|
143
|
Karabacak M, Jagtiani P, Carrasquilla A, Shrivastava RK, Margetis K. Advancing personalized prognosis in atypical and anaplastic meningiomas through interpretable machine learning models. J Neurooncol 2023; 164:671-681. [PMID: 37768472 DOI: 10.1007/s11060-023-04463-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 09/23/2023] [Indexed: 09/29/2023]
Abstract
PURPOSE The primary purpose of this study was to utilize machine learning (ML) models to create a web application that can predict survival outcomes for patients diagnosed with atypical and anaplastic meningiomas. METHODS In this retrospective cohort study, patients diagnosed with WHO grade II and III meningiomas were selected from the National Cancer Database (NCDB) to analyze survival outcomes at 12, 36, and 60 months. Five machine learning algorithms - TabPFN, TabNet, XGBoost, LightGBM, and Random Forest were employed and optimized using the Optuna library for hyperparameter tuning. The top-performing models were then deployed into our web-based application. RESULTS From the NCDB, 12,197 adult patients diagnosed with histologically confirmed WHO grade II and III meningiomas were retrieved. The mean age was 61 (± 20), and 6,847 (56.1%) of these were females. Performance evaluation indicated that the top-performing models for each outcome were the models built with the TabPFN algorithm. The TabPFN models yielded area under the receiver operating characteristic (AUROC) values of 0.805, 0.781, and 0.815 in predicting 12-, 36-, and 60-month mortality, respectively. CONCLUSION With the continuous growth of neuro-oncology data, ML algorithms act as key tools in predicting survival outcomes for WHO grade II and III meningioma patients. By incorporating these interpretable models into a web application, we can practically utilize them to improve risk evaluation and prognosis for meningioma patients.
Collapse
Affiliation(s)
- Mert Karabacak
- Department of Neurosurgery, Mount Sinai Health System, New York, NY, USA
| | - Pemla Jagtiani
- School of Medicine, SUNY Downstate Health Sciences University, New York, NY, USA
| | | | - Raj K Shrivastava
- Department of Neurosurgery, Mount Sinai Health System, New York, NY, USA
| | | |
Collapse
|
144
|
Huang G, Liu H, Gong S, Ge Y. Survival Prediction After Transarterial Chemoembolization for Hepatocellular Carcinoma: a Deep Multitask Survival Analysis Approach. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023; 7:332-358. [PMID: 37637721 PMCID: PMC10449707 DOI: 10.1007/s41666-023-00139-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 02/20/2023] [Accepted: 07/16/2023] [Indexed: 08/29/2023]
Abstract
The accurate prediction of postoperative survival time of patients with Barcelona Clinic Liver Cancer (BCLC) stage B hepatocellular carcinoma (HCC) is important for postoperative health care. Survival analysis is a common method used to predict the occurrence time of events of interest in the medical field. At present, the mainstream survival analysis models, such as the Cox proportional risk model, should make strict assumptions about the potential random process to solve the censored data, thus potentially limiting their application in clinical practice. In this paper, we propose a novel deep multitask survival model (DMSM) to analyze HCC survival data. Specifically, DMSM transforms the traditional survival time prediction problem of patients with HCC into a survival probability prediction problem at multiple time points and applies entropy regularization and ranking loss to optimize a multitask neural network. Compared with the traditional methods of deleting censored data and strong hypothesis, DMSM makes full use of all the information in the censored data but does not need to make any assumption. In addition, we identify the risk factors affecting the prognosis of patients with HCC and visualize the importance of ranking these factors. On the basis of the analysis of a real dataset of patients with BCLC stage B HCC, experimental results on three different validation datasets show that the DMSM achieves competitive performance with concordance index of 0.779, 0.727, and 0.780 and integrated Brier score (IBS) of 0.172, 0.138, and 0.135, respectively. Our DMSM has a comparatively small standard deviation (0.002, 0.002, and 0.003) for IBS of bootstrapping 100 times. The DMSM we proposed can be utilized as an effective survival analysis model and provide an important means for the accurate prediction of postoperative survival time of patients with BCLC stage B HCC.
Collapse
Affiliation(s)
- Guo Huang
- College of Computer Science, Chongqing University, Chongqing, 400044 China
| | - Huijun Liu
- College of Computer Science, Chongqing University, Chongqing, 400044 China
| | - Shu Gong
- Department of Gastroenterology, Children’s Hospital of Chongqing Medical University, Chongqing, 400044 China
- Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing, 400044 China
- Chongqing Key Laboratory of Pediatrics, Chongqing, 400044 China
| | - Yongxin Ge
- School of Big Data & Software Engineering, Chongqing University, Chongqing, 401331 China
| |
Collapse
|
145
|
Pham TD. Prediction of Five-Year Survival Rate for Rectal Cancer Using Markov Models of Convolutional Features of RhoB Expression on Tissue Microarray. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3195-3204. [PMID: 37155403 DOI: 10.1109/tcbb.2023.3274211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The ability to predict survival in cancer is clinically important because the finding can help patients and physicians make optimal treatment decisions. Artificial intelligence in the context of deep learning has been increasingly realized by the informatics-oriented medical community as a powerful machine-learning technology for cancer research, diagnosis, prediction, and treatment. This paper presents the combination of deep learning, data coding, and probabilistic modeling for predicting five-year survival in a cohort of patients with rectal cancer using images of RhoB expression on biopsies. Using about one-third of the patients' data for testing, the proposed approach achieved 90% prediction accuracy, which is much higher than the direct use of the best pretrained convolutional neural network (70%) and the best coupling of a pretrained model and support vector machines (70%).
Collapse
|
146
|
Ogundokun RO, Li A, Babatunde RS, Umezuruike C, Sadiku PO, Abdulahi AT, Babatunde AN. Enhancing Skin Cancer Detection and Classification in Dermoscopic Images through Concatenated MobileNetV2 and Xception Models. Bioengineering (Basel) 2023; 10:979. [PMID: 37627864 PMCID: PMC10451641 DOI: 10.3390/bioengineering10080979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 08/04/2023] [Accepted: 08/17/2023] [Indexed: 08/27/2023] Open
Abstract
One of the most promising research initiatives in the healthcare field is focused on the rising incidence of skin cancer worldwide and improving early discovery methods for the disease. The most significant factor in the fatalities caused by skin cancer is the late identification of the disease. The likelihood of human survival may be significantly improved by performing an early diagnosis followed by appropriate therapy. It is not a simple process to extract the elements from the photographs of the tumors that may be used for the prospective identification of skin cancer. Several deep learning models are widely used to extract efficient features for a skin cancer diagnosis; nevertheless, the literature demonstrates that there is still room for additional improvements in various performance metrics. This study proposes a hybrid deep convolutional neural network architecture for identifying skin cancer by adding two main heuristics. These include Xception and MobileNetV2 models. Data augmentation was introduced to balance the dataset, and the transfer learning technique was utilized to resolve the challenges of the absence of labeled datasets. It has been detected that the suggested method of employing Xception in conjunction with MobileNetV2 attains the most excellent performance, particularly concerning the dataset that was evaluated: specifically, it produced 97.56% accuracy, 97.00% area under the curve, 100% sensitivity, 93.33% precision, 96.55% F1 score, and 0.0370 false favorable rates. This research has implications for clinical practice and public health, offering a valuable tool for dermatologists and healthcare professionals in their fight against skin cancer.
Collapse
Affiliation(s)
- Roseline Oluwaseun Ogundokun
- Department of Computer Science, Landmark University, Omu Aran 251103, Nigeria
- Department of Multimedia Engineering, Kaunas University of Technology, 44249 Kaunas, Lithuania
| | - Aiman Li
- School of Marxism, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | | | | | - Peter O. Sadiku
- Department of Computer Science, University of Ilorin, Ilorin 240003, Nigeria
| | | | | |
Collapse
|
147
|
Han S, Sohn TJ, Ng BP, Park C. Predicting unplanned readmission due to cardiovascular disease in hospitalized patients with cancer: a machine learning approach. Sci Rep 2023; 13:13491. [PMID: 37596346 PMCID: PMC10439193 DOI: 10.1038/s41598-023-40552-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Accepted: 08/12/2023] [Indexed: 08/20/2023] Open
Abstract
Cardiovascular disease (CVD) in cancer patients can affect the risk of unplanned readmissions, which have been reported to be costly and associated with worse mortality and prognosis. We aimed to demonstrate the feasibility of using machine learning techniques in predicting the risk of unplanned 180-day readmission attributable to CVD among hospitalized cancer patients using the 2017-2018 Nationwide Readmissions Database. We included hospitalized cancer patients, and the outcome was unplanned hospital readmission due to any CVD within 180 days after discharge. CVD included atrial fibrillation, coronary artery disease, heart failure, stroke, peripheral artery disease, cardiomegaly, and cardiomyopathy. Decision tree (DT), random forest, extreme gradient boost (XGBoost), and AdaBoost were implemented. Accuracy, precision, recall, F2 score, and receiver operating characteristic curve (AUC) were used to assess the model's performance. Among 358,629 hospitalized patients with cancer, 5.86% (n = 21,021) experienced unplanned readmission due to any CVD. The three ensemble algorithms outperformed the DT, with the XGBoost displaying the best performance. We found length of stay, age, and cancer surgery were important predictors of CVD-related unplanned hospitalization in cancer patients. Machine learning models can predict the risk of unplanned readmission due to CVD among hospitalized cancer patients.
Collapse
Affiliation(s)
- Sola Han
- Health Outcomes Division, College of Pharmacy, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Ted J Sohn
- Health Outcomes Division, College of Pharmacy, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Boon Peng Ng
- College of Nursing, University of Central Florida, Orlando, FL, USA
- Disability, Aging, and Technology Cluster, University of Central Florida, Orlando, FL, USA
| | - Chanhyun Park
- Health Outcomes Division, College of Pharmacy, The University of Texas at Austin, Austin, TX, 78712, USA.
| |
Collapse
|
148
|
Prabhakaran S, Choong KWK, Prabhakaran S, Choy KT, Kong JC. Accuracy of deep neural learning models in the imaging prediction of pathological complete response after neoadjuvant chemoradiotherapy for locally advanced rectal cancer: a systematic review. Langenbecks Arch Surg 2023; 408:321. [PMID: 37594552 DOI: 10.1007/s00423-023-03039-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Accepted: 08/01/2023] [Indexed: 08/19/2023]
Abstract
PURPOSE Up to 15-27% of patients achieve pathologic complete response (pCR) following neoadjuvant chemoradiotherapy (CRT) for locally advanced rectal cancer (LARC). Deep neural learning (DL) algorithms have been suggested to be a useful adjunct to allow accurate prediction of pCR and to identify patients who could potentially avoid surgery. This systematic review aims to interrogate the accuracy of DL algorithms at predicting pCR. METHODS Embase (PubMed, MEDLINE) databases and Google Scholar were searched to identify eligible English-language studies, with the search concluding in July 2022. Studies reporting on the accuracy of DL models in predicting pCR were selected for review and information pertaining to study characteristics and diagnostic measures was extracted from relevant studies. Risk of bias was evaluated using the Newcastle-Ottawa scale (NOS). RESULTS Our search yielded 85 potential publications. Nineteen full texts were reviewed, and a total of 12 articles were included in this systematic review. There were six retrospective and six prospective cohort studies. The most common DL algorithm used was the Convolutional Neural Network (CNN). Performance comparison was carried out via single modality comparison. The median performance for each best-performing algorithm was an AUC of 0.845 (range 0.71-0.99) and Accuracy of 0.85 (0.83-0.98). CONCLUSIONS There is a promising role for DL models in the prediction of pCR following neoadjuvant-CRT for LARC. Further studies are needed to provide a standardised comparison in order to allow for large-scale clinical application. PROPERO REGISTRATION PROSPERO 2021 CRD42021269904 Available from: https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42021269904 .
Collapse
Affiliation(s)
- Sowmya Prabhakaran
- Department of General Surgical Specialties, The Royal Melbourne Hospital, Melbourne, Victoria, Australia.
| | | | - Swetha Prabhakaran
- Department of Colorectal Surgery, Alfred Hospital, Melbourne, Victoria, Australia
| | - Kay Tai Choy
- Department of Surgery, Austin Health, Melbourne, Victoria, Australia
| | - Joseph Ch Kong
- Department of Colorectal Surgery, Alfred Hospital, Melbourne, Victoria, Australia
- Central Clinical School, Monash University, Melbourne, Victoria, Australia
- Department of Surgical Oncology, Peter MacCallum Cancer Centre, Melbourne, Victoria, Australia
| |
Collapse
|
149
|
Shahab M, Zheng G, Khan A, Wei D, Novikov AS. Machine Learning-Based Virtual Screening and Molecular Simulation Approaches Identified Novel Potential Inhibitors for Cancer Therapy. Biomedicines 2023; 11:2251. [PMID: 37626747 PMCID: PMC10452548 DOI: 10.3390/biomedicines11082251] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 08/07/2023] [Accepted: 08/08/2023] [Indexed: 08/27/2023] Open
Abstract
Cyclin-dependent kinase 2 (CDK2) is a promising target for cancer treatment, developing new effective CDK2 inhibitors is of great significance in anticancer therapy. The involvement of CDK2 in tumorigenesis has been debated, but recent evidence suggests that specifically inhibiting CDK2 could be beneficial in treating certain tumors. This approach remains attractive in the development of anticancer drugs. Several small-molecule inhibitors targeting CDK2 have reached clinical trials, but a selective inhibitor for CDK2 is yet to be discovered. In this study, we conducted machine learning-based drug designing to search for a drug candidate for CDK2. Machine learning models, including k-NN, SVM, RF, and GNB, were created to detect active and inactive inhibitors for a CDK2 drug target. The models were assessed using 10-fold cross-validation to ensure their accuracy and reliability. These methods are highly suitable for classifying compounds as either active or inactive through the virtual screening of extensive compound libraries. Subsequently, machine learning techniques were employed to analyze the test dataset obtained from the zinc database. A total of 25 compounds with 98% accuracy were predicted as active against CDK2. These compounds were docked into CDK2's active site. Finally, three compounds were selected based on good docking score, and, along with a reference compound, underwent MD simulation. The Gaussian naïve Bayes model yielded superior results compared to other models. The top three hits exhibited enhanced stability and compactness compared to the reference compound. In conclusion, our study provides valuable insights for identifying and refining lead compounds as CDK2 inhibitors.
Collapse
Affiliation(s)
- Muhammad Shahab
- State Key Laboratories of Chemical Resources Engineering, Beijing University of Chemical Technology, Beijing 100029, China;
| | - Guojun Zheng
- State Key Laboratories of Chemical Resources Engineering, Beijing University of Chemical Technology, Beijing 100029, China;
| | - Abbas Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China; (A.K.); (D.W.)
| | - Dongqing Wei
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China; (A.K.); (D.W.)
| | - Alexander S. Novikov
- Institute of Chemistry, Saint Petersburg State University, Saint Petersburg 199034, Russia
- Research Institute of Chemistry, Peoples’ Friendship University of Russia (RUDN University), Moscow 117198, Russia
| |
Collapse
|
150
|
Reddaway J, Richardson PE, Bevan RJ, Stoneman J, Palombo M. Microglial morphometric analysis: so many options, so little consistency. Front Neuroinform 2023; 17:1211188. [PMID: 37637472 PMCID: PMC10448193 DOI: 10.3389/fninf.2023.1211188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 07/05/2023] [Indexed: 08/29/2023] Open
Abstract
Quantification of microglial activation through morphometric analysis has long been a staple of the neuroimmunologist's toolkit. Microglial morphological phenomics can be conducted through either manual classification or constructing a digital skeleton and extracting morphometric data from it. Multiple open-access and paid software packages are available to generate these skeletons via semi-automated and/or fully automated methods with varying degrees of accuracy. Despite advancements in methods to generate morphometrics (quantitative measures of cellular morphology), there has been limited development of tools to analyze the datasets they generate, in particular those containing parameters from tens of thousands of cells analyzed by fully automated pipelines. In this review, we compare and critique the approaches using cluster analysis and machine learning driven predictive algorithms that have been developed to tackle these large datasets, and propose improvements for these methods. In particular, we highlight the need for a commitment to open science from groups developing these classifiers. Furthermore, we call attention to a need for communication between those with a strong software engineering/computer science background and neuroimmunologists to produce effective analytical tools with simplified operability if we are to see their wide-spread adoption by the glia biology community.
Collapse
Affiliation(s)
- Jack Reddaway
- Division of Neuroscience, School of Biosciences, Cardiff University, Cardiff, United Kingdom
- Hodge Centre for Neuropsychiatric Immunology, Neuroscience and Mental Health Innovation Institute (NMHII), Cardiff University, Cardiff, United Kingdom
| | | | - Ryan J. Bevan
- UK Dementia Research Institute, Cardiff University, Cardiff, United Kingdom
| | - Jessica Stoneman
- Division of Neuroscience, School of Biosciences, Cardiff University, Cardiff, United Kingdom
| | - Marco Palombo
- Cardiff University Brain Research Imaging Centre (CUBRIC), School of Psychology, Cardiff University, Cardiff, United Kingdom
- School of Computer Science and Informatics, Cardiff University, Cardiff, United Kingdom
| |
Collapse
|