1
|
Norman PA, Li W, Jiang W, Chen BE. deepAFT: A nonlinear accelerated failure time model with artificial neural network. Stat Med 2024; 43:3689-3701. [PMID: 38894557 DOI: 10.1002/sim.10152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 03/14/2024] [Accepted: 06/07/2024] [Indexed: 06/21/2024]
Abstract
The Cox regression model or accelerated failure time regression models are often used for describing the relationship between survival outcomes and potential explanatory variables. These models assume the studied covariates are connected to the survival time or its distribution or their transformations through a function of a linear regression form. In this article, we propose nonparametric, nonlinear algorithms (deepAFT methods) based on deep artificial neural networks to model survival outcome data in the broad distribution family of accelerated failure time models. The proposed methods predict survival outcomes directly and tackle the problem of censoring via an imputation algorithm as well as re-weighting and transformation techniques based on the inverse probabilities of censoring. Through extensive simulation studies, we confirm that the proposed deepAFT methods achieve accurate predictions. They outperform the existing regression models in prediction accuracy, while being flexible and robust in modeling covariate effects of various nonlinear forms. Their prediction performance is comparable to other established deep learning methods such as deepSurv and random survival forest methods. Even though the direct output is the expected survival time, the proposed AFT methods also provide predictions for distributional functions such as the cumulative hazard and survival functions without additional learning efforts. For situations where the popular Cox regression model may not be appropriate, the deepAFT methods provide useful and effective alternatives, as shown in simulations, and demonstrated in applications to a lymphoma clinical trial study.
Collapse
Affiliation(s)
- Patrick A Norman
- Kingston General Health Research Institute, Queen's University, Kingston, Ontario, Canada
| | - Wanlu Li
- Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
| | - Wenyu Jiang
- Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
| | - Bingshu E Chen
- Department of Public Health Sciences and Canadian Cancer Trials Group, Queen's University, Kingston, Ontario, Canada
| |
Collapse
|
2
|
Verma S, Magazzù G, Eftekhari N, Lou T, Gilhespy A, Occhipinti A, Angione C. Cross-attention enables deep learning on limited omics-imaging-clinical data of 130 lung cancer patients. CELL REPORTS METHODS 2024; 4:100817. [PMID: 38981473 PMCID: PMC11294841 DOI: 10.1016/j.crmeth.2024.100817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 04/18/2024] [Accepted: 06/17/2024] [Indexed: 07/11/2024]
Abstract
Deep-learning tools that extract prognostic factors derived from multi-omics data have recently contributed to individualized predictions of survival outcomes. However, the limited size of integrated omics-imaging-clinical datasets poses challenges. Here, we propose two biologically interpretable and robust deep-learning architectures for survival prediction of non-small cell lung cancer (NSCLC) patients, learning simultaneously from computed tomography (CT) scan images, gene expression data, and clinical information. The proposed models integrate patient-specific clinical, transcriptomic, and imaging data and incorporate Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome pathway information, adding biological knowledge within the learning process to extract prognostic gene biomarkers and molecular pathways. While both models accurately stratify patients in high- and low-risk groups when trained on a dataset of only 130 patients, introducing a cross-attention mechanism in a sparse autoencoder significantly improves the performance, highlighting tumor regions and NSCLC-related genes as potential biomarkers and thus offering a significant methodological advancement when learning from small imaging-omics-clinical samples.
Collapse
Affiliation(s)
- Suraj Verma
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
| | | | | | - Thai Lou
- Gateshead Health NHS Foundation Trust, Gateshead, UK
| | - Alex Gilhespy
- South Tyneside and Sunderland NHS Foundation Trust, Sunderland, UK
| | - Annalisa Occhipinti
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK; Centre for Digital Innovation, Teesside University, Middlesbrough, UK; National Horizons Centre, Teesside University, Darlington, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK; Centre for Digital Innovation, Teesside University, Middlesbrough, UK; National Horizons Centre, Teesside University, Darlington, UK.
| |
Collapse
|
3
|
Kim I, Seok J, Kim Y. CTIVA: Censored time interval variable analysis. PLoS One 2023; 18:e0294513. [PMID: 37972018 PMCID: PMC10653491 DOI: 10.1371/journal.pone.0294513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 11/02/2023] [Indexed: 11/19/2023] Open
Abstract
Traditionally, datasets with multiple censored time-to-events have not been utilized in multivariate analysis because of their high level of complexity. In this paper, we propose the Censored Time Interval Analysis (CTIVA) method to address this issue. It estimates the joint probability distribution of actual event times in the censored dataset by implementing a statistical probability density estimation technique on the dataset. Based on the acquired event time, CTIVA investigates variables correlated with the interval time of events via statistical tests. The proposed method handles both categorical and continuous variables simultaneously-thus, it is suitable for application on real-world censored time-to-event datasets, which include both categorical and continuous variables. CTIVA outperforms traditional censored time-to-event data handling methods by 5% on simulation data. The average area under the curve (AUC) of the proposed method on the simulation dataset exceeds 0.9 under various conditions. Further, CTIVA yields novel results on National Sample Cohort Demo (NSCD) and proteasome inhibitor bortezomib dataset, a real-world censored time-to-event dataset of medical history of beneficiaries provided by the National Health Insurance Sharing Service (NHISS) and National Center for Biotechnology Information (NCBI). We believe that the development of CTIVA is a milestone in the investigation of variables correlated with interval time of events in presence of censoring.
Collapse
Affiliation(s)
- Insoo Kim
- School of Electrical Engineering, Korea University, Seoul, Republic of Korea
| | - Junhee Seok
- School of Electrical Engineering, Korea University, Seoul, Republic of Korea
| | - Yoojoong Kim
- School of Computer Science and Information Engineering, The Catholic University of Korea, Bucheon, Republic of Korea
| |
Collapse
|
4
|
Sun B, Chen L. Interpretable deep learning for improving cancer patient survival based on personal transcriptomes. Sci Rep 2023; 13:11344. [PMID: 37443344 PMCID: PMC10344908 DOI: 10.1038/s41598-023-38429-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 07/07/2023] [Indexed: 07/15/2023] Open
Abstract
Precision medicine chooses the optimal drug for a patient by considering individual differences. With the tremendous amount of data accumulated for cancers, we develop an interpretable neural network to predict cancer patient survival based on drug prescriptions and personal transcriptomes (CancerIDP). The deep learning model achieves 96% classification accuracy in distinguishing short-lived from long-lived patients. The Pearson correlation between predicted and actual months-to-death values is as high as 0.937. About 27.4% of patients may survive longer with an alternative medicine chosen by our deep learning model. The median survival time of all patients can increase by 3.9 months. Our interpretable neural network model reveals the most discriminating pathways in the decision-making process, which will further facilitate mechanistic studies of drug development for cancers.
Collapse
Affiliation(s)
- Bo Sun
- Department of Quantitative and Computational Biology, University of Southern California, 1050 Childs Way, Los Angeles, CA, 90089, USA
| | - Liang Chen
- Department of Quantitative and Computational Biology, University of Southern California, 1050 Childs Way, Los Angeles, CA, 90089, USA.
| |
Collapse
|
5
|
Zhai YJ, Zhang Y, Liu HZ, Zhang ZR. Multi-angle Support Vector Survival Analysis with Neural Tangent Kernel Study. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2023. [DOI: 10.1007/s13369-022-07540-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
6
|
Neural Networks for Survival Prediction in Medicine Using Prognostic Factors: A Review and Critical Appraisal. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:1176060. [PMID: 36238497 PMCID: PMC9553343 DOI: 10.1155/2022/1176060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 08/26/2022] [Accepted: 09/13/2022] [Indexed: 11/17/2022]
Abstract
Survival analysis deals with the expected duration of time until one or more events of interest occur. Time to the event of interest may be unobserved, a phenomenon commonly known as right censoring, which renders the analysis of these data challenging. Over the years, machine learning algorithms have been developed and adapted to right-censored data. Neural networks have been repeatedly employed to build clinical prediction models in healthcare with a focus on cancer and cardiology. We present the first ever attempt at a large-scale review of survival neural networks (SNNs) with prognostic factors for clinical prediction in medicine. This work provides a comprehensive understanding of the literature (24 studies from 1990 to August 2021, global search in PubMed). Relevant manuscripts are classified as methodological/technical (novel methodology or new theoretical model; 13 studies) or applications (11 studies). We investigate how researchers have used neural networks to fit survival data for prediction. There are two methodological trends: either time is added as part of the input features and a single output node is specified, or multiple output nodes are defined for each time interval. A critical appraisal of model aspects that should be designed and reported more carefully is performed. We identify key characteristics of prediction models (i.e., number of patients/predictors, evaluation measures, calibration), and compare ANN's predictive performance to the Cox proportional hazards model. The median sample size is 920 patients, and the median number of predictors is 7. Major findings include poor reporting (e.g., regarding missing data, hyperparameters) as well as inaccurate model development/validation. Calibration is neglected in more than half of the studies. Cox models are not developed to their full potential and claims for the performance of SNNs are exaggerated. Light is shed on the current state of art of SNNs in medicine with prognostic factors. Recommendations are made for the reporting of clinical prediction models. Limitations are discussed, and future directions are proposed for researchers who seek to develop existing methodology.
Collapse
|
7
|
Lin C, Qiao N, Zhang W, Li Y, Ma S. Default risk prediction and feature extraction using a penalized deep neural network. STATISTICS AND COMPUTING 2022; 32:76. [PMID: 36124203 PMCID: PMC9476445 DOI: 10.1007/s11222-022-10140-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 08/26/2022] [Indexed: 06/15/2023]
Abstract
Online peer-to-peer lending platforms provide loans directly from lenders to borrowers without passing through traditional financial institutions. For lenders on these platforms to avoid loss, it is crucial that they accurately assess default risk so that they can make appropriate decisions. In this study, we develop a penalized deep learning model to predict default risk based on survival data. As opposed to simply predicting whether default will occur, we focus on predicting the probability of default over time. Moreover, by adding an additional one-to-one layer in the neural network, we achieve feature selection and estimation simultaneously by incorporating an L 1 -penalty into the objective function. The minibatch gradient descent algorithm makes it possible to handle massive data. An analysis of a real-world loan data and simulations demonstrate the model's competitive practical performance, which suggests favorable potential applications in peer-to-peer lending platforms.
Collapse
Affiliation(s)
- Cunjie Lin
- Center for Applied Statistics, Renmin University of China, Beijing, 100872 China
- School of Statistics, Renmin University of China, Beijing, 100872 China
| | - Nan Qiao
- School of Statistics, Renmin University of China, Beijing, 100872 China
| | - Wenli Zhang
- School of Statistics, Renmin University of China, Beijing, 100872 China
| | - Yang Li
- Center for Applied Statistics, Renmin University of China, Beijing, 100872 China
- School of Statistics, Renmin University of China, Beijing, 100872 China
- Statistical Consulting Center, Renmin University of China, Beijing, 100872 China
| | - Shuangge Ma
- School of Statistics, Renmin University of China, Beijing, 100872 China
- Department of Biostatistics, Yale University, New Haven, CT 06511 USA
| |
Collapse
|
8
|
Kim SI, Kang JW, Eun YG, Lee YC. Prediction of survival in oropharyngeal squamous cell carcinoma using machine learning algorithms: A study based on the surveillance, epidemiology, and end results database. Front Oncol 2022; 12:974678. [PMID: 36072804 PMCID: PMC9441569 DOI: 10.3389/fonc.2022.974678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Accepted: 08/08/2022] [Indexed: 11/28/2022] Open
Abstract
Background We determined appropriate survival prediction machine learning models for patients with oropharyngeal squamous cell carcinoma (OPSCC) using the “Surveillance, Epidemiology, and End Results” (SEER) database. Methods In total, 4039 patients diagnosed with OPSCC between 2004 and 2016 were enrolled in this study. In particular, 13 variables were selected and analyzed: age, sex, tumor grade, tumor size, neck dissection, radiation therapy, cancer directed surgery, chemotherapy, T stage, N stage, M stage, clinical stage, and human papillomavirus (HPV) status. The T-, N-, and clinical staging were reconstructed based on the American Joint Committee on Cancer (AJCC) Staging Manual, 8th Edition. The patients were randomly assigned to a development or test dataset at a 7:3 ratio. The extremely randomized survival tree (EST), conditional survival forest (CSF), and DeepSurv models were used to predict the overall and disease-specific survival in patients with OPSCC. A 10-fold cross-validation on a development dataset was used to build the training and internal validation data for all models. We evaluated the predictive performance of each model using test datasets. Results A higher c-index value and lower integrated Brier score (IBS), root mean square error (RMSE), and mean absolute error (MAE) indicate a better performance from a machine learning model. The C-index was the highest for the DeepSurv model (0.77). The IBS was also the lowest in the DeepSurv model (0.08). However, the RMSE and RAE were the lowest for the CSF model. Conclusions We demonstrated various machine-learning-based survival prediction models. The CSF model showed a better performance in predicting the survival of patients with OPSCC in terms of the RMSE and RAE. In this context, machine learning models based on personalized survival predictions can be used to stratify various complex risk factors. This could help in designing personalized treatments and predicting prognoses for patients.
Collapse
|
9
|
MSGWO-MKL-SVM: A Missing Link Prediction Method for UAV Swarm Network Based on Time Series. MATHEMATICS 2022. [DOI: 10.3390/math10142535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Missing link prediction technology (MLP) is always a hot research area in the field of complex networks, and it has been extensively utilized in UAV swarm network reconstruction recently. UAV swarm is an artificial network with strong randomness, in the face of which prediction methods based on network similarity often perform poorly. To solve those problems, this paper proposes a Multi Kernel Learning algorithm with a multi-strategy grey wolf optimizer based on time series (MSGWO-MKL-SVM). The Multiple Kernel Learning (MKL) method is adopted in this algorithm to extract the advanced features of time series, and the Support Vector Machine (SVM) algorithm is used to determine the hyperplane of threshold value in nonlinear high dimensional space. Besides that, we propose a new measurable indicator of Multiple Kernel Learning based on cluster, transforming a Multiple Kernel Learning problem into a multi-objective optimization problem. Some adaptive neighborhood strategies are used to enhance the global searching ability of grey wolf optimizer algorithm (GWO). Comparison experiments were conducted on the standard UCI datasets and the professional UAV swarm datasets. The classification accuracy of MSGWO-MKL-SVM on UCI datasets is improved by 6.2% on average, and the link prediction accuracy of MSGWO-MKL-SVM on professional UAV swarm datasets is improved by 25.9% on average.
Collapse
|
10
|
Hao R, Zheng H, Yang X. Data augmentation based estimation for the censored composite quantile regression neural network model. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
11
|
Eminaga O, Shkolyar E, Breil B, Semjonow A, Boegemann M, Xing L, Tinay I, Liao JC. Artificial Intelligence-Based Prognostic Model for Urologic Cancers: A SEER-Based Study. Cancers (Basel) 2022; 14:cancers14133135. [PMID: 35804904 PMCID: PMC9264864 DOI: 10.3390/cancers14133135] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Revised: 06/22/2022] [Accepted: 06/23/2022] [Indexed: 12/11/2022] Open
Abstract
Simple Summary We describe a risk profile reconstruction model for cancer-specific survival estimation for continuous time points after urologic cancer diagnosis. We used artificial intelligence (AI)-based algorithms, a national cancer registry data, and accessible clinical parameters for the risk-profile reconstruction. We derived a risk stratification model and estimated the minimum follow-up duration and the likelihood for risk stability in prostate, kidney, and testicular cancers. The estimated follow-up duration was in alignment with recognized clinical guidelines for these cancers. Moreover, the estimated follow-up duration was differed by the cancer origin and the disease dissemination status. Overall, the reconstruction of the population’s risk profile for the cancer-specific prognostic score estimation is feasible using AI and has potential application in clinical settings to improve risk stratification and surveillance management. Abstract Background: Prognostication is essential to determine the risk profile of patients with urologic cancers. Methods: We utilized the SEER national cancer registry database with approximately 2 million patients diagnosed with urologic cancers (penile, testicular, prostate, bladder, ureter, and kidney). The cohort was randomly divided into the development set (90%) and the out-held test set (10%). Modeling algorithms and clinically relevant parameters were utilized for cancer-specific mortality prognosis. The model fitness for the survival estimation was assessed using the differences between the predicted and observed Kaplan–Meier estimates on the out-held test set. The overall concordance index (c-index) score estimated the discriminative accuracy of the survival model on the test set. A simulation study assessed the estimated minimum follow-up duration and time points with the risk stability. Results: We achieved a well-calibrated prognostic model with an overall c-index score of 0.800 (95% CI: 0.795–0.805) on the representative out-held test set. The simulation study revealed that the suggestions for the follow-up duration covered the minimum duration and differed by the tumor dissemination stages and affected organs. Time points with a high likelihood for risk stability were identifiable. Conclusions: A personalized temporal survival estimation is feasible using artificial intelligence and has potential application in clinical settings, including surveillance management.
Collapse
Affiliation(s)
- Okyaz Eminaga
- Department of Urology, Stanford University School of Medicine, Stanford, CA 94305, USA; (E.S.); (J.C.L.)
- Correspondence:
| | - Eugene Shkolyar
- Department of Urology, Stanford University School of Medicine, Stanford, CA 94305, USA; (E.S.); (J.C.L.)
| | - Bernhard Breil
- Faculty of Health Care, Hochschule Niederrhein, University of Applied Sciences, 47805 Krefeld, Germany;
| | - Axel Semjonow
- Prostate Center, Department of Urology, University Hospital Muenster, 48149 Muenster, Germany; (A.S.); (M.B.)
| | - Martin Boegemann
- Prostate Center, Department of Urology, University Hospital Muenster, 48149 Muenster, Germany; (A.S.); (M.B.)
| | - Lei Xing
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA 94305, USA;
| | - Ilker Tinay
- Department of Urology, Marmara University School of Medicine, Istanbul 34854, Turkey;
| | - Joseph C. Liao
- Department of Urology, Stanford University School of Medicine, Stanford, CA 94305, USA; (E.S.); (J.C.L.)
| |
Collapse
|
12
|
Appelt AL, Elhaminia B, Gooya A, Gilbert A, Nix M. Deep Learning for Radiotherapy Outcome Prediction Using Dose Data - A Review. Clin Oncol (R Coll Radiol) 2022; 34:e87-e96. [PMID: 34924256 DOI: 10.1016/j.clon.2021.12.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Revised: 11/26/2021] [Accepted: 12/01/2021] [Indexed: 12/23/2022]
Abstract
Artificial intelligence, and in particular deep learning using convolutional neural networks, has been used extensively for image classification and segmentation, including on medical images for diagnosis and prognosis prediction. Use in radiotherapy prognostic modelling is still limited, however, especially as applied to toxicity and tumour response prediction from radiation dose distributions. We review and summarise studies that applied deep learning to radiotherapy dose data, in particular studies that utilised full three-dimensional dose distributions. Ten papers have reported on deep learning models for outcome prediction utilising spatial dose information, whereas four studies used reduced dimensionality (dose volume histogram) information for prediction. Many of these studies suffer from the same issues that plagued early normal tissue complication probability modelling, including small, single-institutional patient cohorts, lack of external validation, poor data and model reporting, use of late toxicity data without taking time-to-event into account, and nearly exclusive focus on clinician-reported complications. They demonstrate, however, how radiation dose, imaging and clinical data may be technically integrated in convolutional neural networks-based models; and some studies explore how deep learning may help better understand spatial variation in radiosensitivity. In general, there are a number of issues specific to the intersection of radiotherapy outcome modelling and deep learning, for example translation of model developments into treatment plan optimisation, which will require further combined effort from the radiation oncology and artificial intelligence communities.
Collapse
Affiliation(s)
- A L Appelt
- Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK.
| | - B Elhaminia
- Centre for Computational Imaging and Simulation Technologies in Biomedicine (CISTIB), Department of Electronic and Electrical Engineering, University of Leeds, Leeds, UK
| | - A Gooya
- Centre for Computational Imaging and Simulation Technologies in Biomedicine (CISTIB), Department of Electronic and Electrical Engineering, University of Leeds, Leeds, UK
| | - A Gilbert
- Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK
| | - M Nix
- Department of Medical Physics and Engineering, Leeds Cancer Centre, St James's University Hospital, Leeds, UK
| |
Collapse
|
13
|
Andersen PK. Fifty Years with the Cox Proportional Hazards Regression Model. J Indian Inst Sci 2022. [DOI: 10.1007/s41745-021-00283-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|