Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: El Naqa I, Bradley JD, Lindsay PE, Hope AJ, Deasy JO. Predicting radiotherapy outcomes using statistical learning techniques. Phys Med Biol 2009;54:S9-S30. [PMID: 19687564 DOI: 10.1088/0031-9155/54/18/s02] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

For:	El Naqa I, Bradley JD, Lindsay PE, Hope AJ, Deasy JO. Predicting radiotherapy outcomes using statistical learning techniques. Phys Med Biol 2009;54:S9-S30. [PMID: 19687564 DOI: 10.1088/0031-9155/54/18/s02] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Number

Cited by Other Article(s)

Ozkan EE, Serel TA, Soyupek AS, Kaymak ZA. Utilization of machine learning methods for prediction of acute and late rectal toxicity due to curative prostate radiotherapy. RADIATION PROTECTION DOSIMETRY 2024;200:1244-1250. [PMID: 38932433 DOI: 10.1093/rpd/ncae154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 04/17/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024]

Maragno D, Buti G, Birbil Şİ, Liao Z, Bortfeld T, den Hertog D, Ajdari A. Embedding machine learning based toxicity models within radiotherapy treatment plan optimization. Phys Med Biol 2024;69:075003. [PMID: 38412530 DOI: 10.1088/1361-6560/ad2d7e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 02/27/2024] [Indexed: 02/29/2024]

Abstract

Objective.This study addresses radiation-induced toxicity (RIT) challenges in radiotherapy (RT) by developing a personalized treatment planning framework. It leverages patient-specific data and dosimetric information to create an optimization model that limits adverse side effects using constraints learned from historical data.Approach.The study uses the optimization with constraint learning (OCL) framework, incorporating patient-specific factors into the optimization process. It consists of three steps: optimizing the baseline treatment plan using population-wide dosimetric constraints; training a machine learning (ML) model to estimate the patient's RIT for the baseline plan; and adapting the treatment plan to minimize RIT using ML-learned patient-specific constraints. Various predictive models, including classification trees, ensembles of trees, and neural networks, are applied to predict the probability of grade 2+ radiation pneumonitis (RP2+) for non-small cell lung (NSCLC) cancer patients three months post-RT. The methodology is assessed with four high RP2+ risk NSCLC patients, with the goal of optimizing the dose distribution to constrain the RP2+ outcome below a pre-specified threshold. Conventional and OCL-enhanced plans are compared based on dosimetric parameters and predicted RP2+ risk. Sensitivity analysis on risk thresholds and data uncertainty is performed using a toy NSCLC case.Main results.Experiments show the methodology's capacity to directly incorporate all predictive models into RT treatment planning. In the four patients studied, mean lung dose and V20 were reduced by an average of 1.78 Gy and 3.66%, resulting in an average RP2+ risk reduction from 95% to 42%. Notably, this reduction maintains tumor coverage, although in two cases, sparing the lung slightly increased spinal cord max-dose (0.23 and 0.79 Gy).Significance.By integrating patient-specific information into learned constraints, the study significantly reduces adverse side effects like RP2+ without compromising target coverage. This unified framework bridges the gap between predicting toxicities and optimizing treatment plans in personalized RT decision-making.

Collapse

Hatt M. "Fuzzy" radiomics: the way forward for nuclear medicine imaging applications? Eur J Nucl Med Mol Imaging 2023;50:1558-1559. [PMID: 36951992 DOI: 10.1007/s00259-023-06201-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]

Bang C, Bernard G, Le WT, Lalonde A, Kadoury S, Bahig H. Artificial intelligence to predict outcomes of head and neck radiotherapy. Clin Transl Radiat Oncol 2023;39:100590. [PMID: 36935854 PMCID: PMC10014342 DOI: 10.1016/j.ctro.2023.100590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 01/28/2023] [Accepted: 01/28/2023] [Indexed: 02/01/2023] Open

Quintero P, Benoit D, Cheng Y, Moore C, Beavis A. Machine learning-based predictions of gamma passing rates for virtual specific-plan verification based on modulation maps, monitor unit profiles, and composite dose images. Phys Med Biol 2022;67. [PMID: 36384046 DOI: 10.1088/1361-6560/aca38a] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022]

Chen M, Wang Z, Jiang S, Sun J, Wang L, Sahoo N, Brandon Gunn G, Frank SJ, Xu C, Chen J, Nguyen QN, Chang JY, Liao Z, Ronald Zhu X, Zhang X. Predictive performance of different NTCP techniques for radiation-induced esophagitis in NSCLC patients receiving proton radiotherapy. Sci Rep 2022;12:9178. [PMID: 35655073 PMCID: PMC9163134 DOI: 10.1038/s41598-022-12898-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 05/18/2022] [Indexed: 11/24/2022] Open

Abstract

This study aimed to compare the predictive performance of different modeling methods in developing normal tissue complication probability (NTCP) models for predicting radiation-induced esophagitis (RE) in non–small cell lung cancer (NSCLC) patients receiving proton radiotherapy. The dataset was composed of 328 NSCLC patients receiving passive-scattering proton therapy and 41.6% of the patients experienced ≥ grade 2 RE. Five modeling methods were used to build NTCP models: standard Lyman–Kutcher–Burman (sLKB), generalized LKB (gLKB), multivariable logistic regression using two variable selection procedures-stepwise forward selection (Stepwise-MLR), and least absolute shrinkage and selection operator (LASSO-MLR), and support vector machines (SVM). Predictive performance was internally validated by a bootstrap approach for each modeling method. The overall performance, discriminative ability, and calibration were assessed using the Negelkerke R², area under the receiver operator curve (AUC), and Hosmer–Lemeshow test, respectively. The LASSO-MLR model showed the best discriminative ability with an AUC value of 0.799 (95% confidence interval (CI): 0.763–0.854), and the best overall performance with a Negelkerke R² value of 0.332 (95% CI: 0.266–0.486). Both of the optimism-corrected Negelkerke R² values of the SVM and sLKB models were 0.301. The optimism-corrected AUC of the gLKB model (0.796) was higher than that of the SVM model (0.784). The sLKB model had the smallest optimism in the model variation and discriminative ability. In the context of classification and probability estimation for predicting the NTCP for radiation-induced esophagitis, the MLR model developed with LASSO provided the best predictive results. The simplest LKB modeling had similar or even better predictive performance than the most complex SVM modeling, and it was least likely to overfit the training data. The advanced machine learning approach might have limited applicability in clinical settings with a relatively small amount of data.

Collapse

Affiliation(s)

Mei Chen Department of Radiation Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.,Department of Radiation Physics, Unit 1150, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA
Zeming Wang Department of Radiation Physics, Unit 1150, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA
Shengpeng Jiang Department of Radiation Physics, Unit 1150, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA.,Department of Radiation Oncology, Tianjin Medical University Cancer Institute and Hospital, Tianjin, 30060, China
Jian Sun Department of Radiation Oncology, Tianjin Medical University Cancer Institute and Hospital, Tianjin, 30060, China.,Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Li Wang Department of Experimental Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Narayan Sahoo Department of Radiation Physics, Unit 1150, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA
G Brandon Gunn Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Steven J Frank Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Cheng Xu Department of Radiation Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
Jiayi Chen Department of Radiation Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
Quynh-Nhu Nguyen Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Joe Y Chang Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Zhongxing Liao Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
X Ronald Zhu Department of Radiation Physics, Unit 1150, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA
Xiaodong Zhang Department of Radiation Physics, Unit 1150, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA.

Collapse

Cui S, Ten Haken RK, El Naqa I. Integrating Multiomics Information in Deep Learning Architectures for Joint Actuarial Outcome Prediction in Non-Small Cell Lung Cancer Patients After Radiation Therapy. Int J Radiat Oncol Biol Phys 2021;110:893-904. [PMID: 33539966 PMCID: PMC8180510 DOI: 10.1016/j.ijrobp.2021.01.042] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 11/10/2020] [Accepted: 01/23/2021] [Indexed: 12/14/2022]

Abstract

PURPOSE

Novel actuarial deep learning neural network (ADNN) architectures are proposed for joint prediction of radiation therapy outcomes-radiation pneumonitis (RP) and local control (LC)-in stage III non-small cell lung cancer (NSCLC) patients. Unlike normal tissue complication probability/tumor control probability models that use dosimetric information solely, our proposed models consider complex interactions among multiomics information including positron emission tomography (PET) radiomics, cytokines, and miRNAs. Additional time-to-event information is also used in the actuarial prediction.

METHODS AND MATERIALS

Three architectures were investigated: ADNN-DVH considered dosimetric information only; ADNN-com integrated multiomics information; and ADNN-com-joint combined RP2 (RP grade ≥2) and LC prediction. In these architectures, differential dose-volume histograms (DVHs) were fed into 1D convolutional neural networks (CNN) for extracting reduced representations. Variational encoders were used to learn representations of imaging and biological data. Reduced representations were fed into Surv-Nets to predict time-to-event probabilities for RP2 and LC independently and jointly by incorporating time information into designated loss functions.

RESULTS

Models were evaluated on 117 retrospective patients and were independently tested on 25 newly accrued patients prospectively. A multi-institutional RTOG0617 data set of 327 patients was used for external validation. ADNN-DVH yielded cross-validated c-indexes (95% confidence intervals) of 0.660 (0.630-0.690) for RP2 prediction and 0.727 (0.700-0.753) for LC prediction, outperforming a generalized Lyman model for RP2 (0.613 [0.583-0.643]) and a generalized log-logistic model for LC (0.569 [0.545-0.594]). The independent internal test and external validation yielded similar results. ADNN-com achieved an even better performance than ADNN-DVH on both cross-validation and independent internal test. Furthermore, ADNN-com-joint, which yielded performance similar to ADNN-com, realized joint prediction with c-indexes of 0.705 (0.676-0.734) for RP2 and 0.740 (0.714-0.765) for LC and achieved an area under a free-response receiving operator characteristic curve (AU-FROC) of 0.729 (0.697-0.773) for the joint prediction of RP2 and LC.

CONCLUSION

Novel deep learning architectures that integrate multiomics information outperformed traditional normal tissue complication probability/tumor control probability models in actuarial prediction of RP2 and LC.

Collapse

Field M, Hardcastle N, Jameson M, Aherne N, Holloway L. Machine learning applications in radiation oncology. PHYSICS & IMAGING IN RADIATION ONCOLOGY 2021;19:13-24. [PMID: 34307915 PMCID: PMC8295850 DOI: 10.1016/j.phro.2021.05.007] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 05/19/2021] [Accepted: 05/22/2021] [Indexed: 12/23/2022]

Ebert MA, Gulliford S, Acosta O, de Crevoisier R, McNutt T, Heemsbergen WD, Witte M, Palma G, Rancati T, Fiorino C. Spatial descriptions of radiotherapy dose: normal tissue complication models and statistical associations. Phys Med Biol 2021;66:12TR01. [PMID: 34049304 DOI: 10.1088/1361-6560/ac0681] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 05/28/2021] [Indexed: 12/20/2022]

Karadaghy OA, Shew M, New J, Bur AM. Machine Learning to Predict Treatment in Oropharyngeal Squamous Cell Carcinoma. ORL J Otorhinolaryngol Relat Spec 2021;84:39-46. [PMID: 33730728 DOI: 10.1159/000515334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Accepted: 02/06/2021] [Indexed: 11/19/2022]

Grand challenges for medical physics in radiation oncology. Radiother Oncol 2020;153:7-14. [DOI: 10.1016/j.radonc.2020.10.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 10/02/2020] [Accepted: 10/02/2020] [Indexed: 12/12/2022]

Romero M, Interian Y, Solberg T, Valdes G. Targeted transfer learning to improve performance in small medical physics datasets. Med Phys 2020;47:6246-6256. [PMID: 33007112 DOI: 10.1002/mp.14507] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 07/13/2020] [Accepted: 08/04/2020] [Indexed: 11/05/2022] Open

Abstract

PURPOSE

To perform an in-depth evaluation of current state of the art techniques in training neural networks to identify appropriate approaches in small datasets.

METHOD

In total, 112,120 frontal-view X-ray images from the NIH ChestXray14 dataset were used in our analysis. Two tasks were studied: unbalanced multi-label classification of 14 diseases, and binary classification of pneumonia vs non-pneumonia. All datasets were randomly split into training, validation, and testing (70%, 10%, and 20%). Two popular convolution neural networks (CNNs), DensNet121 and ResNet50, were trained using PyTorch. We performed several experiments to test: (a) whether transfer learning using pretrained networks on ImageNet are of value to medical imaging/physics tasks (e.g., predicting toxicity from radiographic images after training on images from the internet), (b) whether using pretrained networks trained on problems that are similar to the target task helps transfer learning (e.g., using X-ray pretrained networks for X-ray target tasks), (c) whether freeze deep layers or change all weights provides an optimal transfer learning strategy, (d) the best strategy for the learning rate policy, and (e) what quantity of data is needed in order to appropriately deploy these various strategies (N = 50 to N = 77 880).

RESULTS

In the multi-label problem, DensNet121 needed at least 1600 patients to be comparable to, and 10 000 to outperform, radiomics-based logistic regression. In classifying pneumonia vs non-pneumonia, both CNN and radiomics-based methods performed poorly when N < 2000. For small datasets ( < 2000), however, a significant boost in performance (>15% increase on AUC) comes from a good selection of the transfer learning dataset, dropout, cycling learning rate, and freezing and unfreezing of deep layers as training progresses. In contrast, if sufficient data are available (>35 000), little or no tweaking is needed to obtain impressive performance. While transfer learning using X-ray images from other anatomical sites improves performance, we also observed a similar boost by using pretrained networks from ImageNet. Having source images from the same anatomical site, however, outperforms every other methodology, by up to 15%. In this case, DL models can be trained with as little as N = 50.

CONCLUSIONS

While training DL models in small datasets (N < 2000) is challenging, no tweaking is necessary for bigger datasets (N > 35 000). Using transfer learning with images from the same anatomical site can yield remarkable performance in new tasks with as few as N = 50. Surprisingly, we did not find any advantage for using images from other anatomical sites over networks that have been trained using ImageNet. This indicates that features learned may not be as general as currently believed, and performance decays rapidly even by just changing the anatomical site of the images.

Collapse

Isaksson LJ, Pepa M, Zaffaroni M, Marvaso G, Alterio D, Volpe S, Corrao G, Augugliaro M, Starzyńska A, Leonardi MC, Orecchia R, Jereczek-Fossa BA. Machine Learning-Based Models for Prediction of Toxicity Outcomes in Radiotherapy. Front Oncol 2020;10:790. [PMID: 32582539 PMCID: PMC7289968 DOI: 10.3389/fonc.2020.00790] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 04/22/2020] [Indexed: 12/20/2022] Open

Mund K, Wu J, Liu C, Yan G. Evaluation of a neural network‐based photon beam profile deconvolution method. J Appl Clin Med Phys 2020;21:53-62. [PMID: 32227629 PMCID: PMC7324697 DOI: 10.1002/acm2.12865] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Revised: 01/17/2020] [Accepted: 02/26/2020] [Indexed: 01/14/2023] Open

Abstract

Purpose

The authors have previously shown the feasibility of using an artificial neural network (ANN) to eliminate the volume average effect (VAE) of scanning ionization chambers (ICs). The purpose of this work was to evaluate the method when applied to beams of different energies (6 and 10 MV) and modalities [flattened (FF) vs unflattened (FFF)], measured with ICs of various sizes.

Methods

The three‐layer ANN extracted data from transverse photon beam profiles using a sliding window, and output deconvolved value corresponding to the location at the center of the window. Beam profiles of seven fields ranging from 2 × 2 to 10 × 10 cm² at four depths (1.5, 5, 10 and 20 cm) were measured with three ICs (CC04, CC13, and FC65‐P) and an EDGE diode detector for 6 MV FF and FFF. Similar data for the 10 MV FF beam was also collected with CC13 and EDGE. The EDGE‐measured profiles were used as reference data to train and test the ANNs. Separate ANNs were trained by using the data of each beam energy and modality. Combined ANNs were also trained by combining data of different beam energies and/or modalities. The ANN's performance was quantified and compared by evaluating the penumbra width difference (PWD) between the deconvolved and reference profiles.

Results

Excellent agreement between the deconvolved and reference profiles was achieved with both separate and combined ANNs for all studied ICs, beam energies, beam modalities, and geometries. After deconvolution, the average PWD decreased from 1–3 mm to under 0.15 mm with separate ANNs and to under 0.20 mm with combined ANN.

Conclusions

The ANN‐based deconvolution method can be effectively applied to beams of different energies and modalities measured with ICs of various sizes. Separate ANNs yielded marginally better results than combined ANNs. An IC‐specific, combined ANN can provide clinically acceptable results as long as the training data includes data of each beam energy and modality.

Collapse

El Naqa I, Haider MA, Giger ML, Ten Haken RK. Artificial Intelligence: reshaping the practice of radiological sciences in the 21st century. Br J Radiol 2020;93:20190855. [PMID: 31965813 PMCID: PMC7055429 DOI: 10.1259/bjr.20190855] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 01/12/2020] [Accepted: 01/13/2020] [Indexed: 12/15/2022] Open

Mizutani T, Magome T, Igaki H, Haga A, Nawa K, Sekiya N, Nakagawa K. Optimization of treatment strategy by using a machine learning model to predict survival time of patients with malignant glioma after radiotherapy. JOURNAL OF RADIATION RESEARCH 2019;60:818-824. [PMID: 31665445 PMCID: PMC7357235 DOI: 10.1093/jrr/rrz066] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Revised: 07/25/2019] [Indexed: 05/05/2023]

Cui S, Luo Y, Tseng HH, Ten Haken RK, El Naqa I. Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage. Med Phys 2019;46:2497-2511. [PMID: 30891794 DOI: 10.1002/mp.13497] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 02/18/2019] [Accepted: 03/08/2019] [Indexed: 12/23/2022] Open

Abstract

PURPOSE

There has been burgeoning interest in applying machine learning methods for predicting radiotherapy outcomes. However, the imbalanced ratio of a large number of variables to a limited sample size in radiation oncology constitutes a major challenge. Therefore, dimensionality reduction methods can be a key to success. The study investigates and contrasts the application of traditional machine learning methods and deep learning approaches for outcome modeling in radiotherapy. In particular, new joint architectures based on variational autoencoder (VAE) for dimensionality reduction are presented and their application is demonstrated for the prediction of lung radiation pneumonitis (RP) from a large-scale heterogeneous dataset.

METHODS

A large-scale heterogeneous dataset containing a pool of 230 variables including clinical factors (e.g., dose, KPS, stage) and biomarkers (e.g., single nucleotide polymorphisms (SNPs), cytokines, and micro-RNAs) in a population of 106 nonsmall cell lung cancer (NSCLC) patients who received radiotherapy was used for modeling RP. Twenty-two patients had grade 2 or higher RP. Four methods were investigated, including feature selection (case A) and feature extraction (case B) with traditional machine learning methods, a VAE-MLP joint architecture (case C) with deep learning and lastly, the combination of feature selection and joint architecture (case D). For feature selection, Random forest (RF), Support Vector Machine (SVM), and multilayer perceptron (MLP) were implemented to select relevant features. Specifically, each method was run for multiple times to rank features within several cross-validated (CV) resampled sets. A collection of ranking lists were then aggregated by top 5% and Kemeny graph methods to identify the final ranking for prediction. A synthetic minority oversampling technique was applied to correct for class imbalance during this process. For deep learning, a VAE-MLP joint architecture where a VAE aimed for dimensionality reduction and an MLP aimed for classification was developed. In this architecture, reconstruction loss and prediction loss were combined into a single loss function to realize simultaneous training and weights were assigned to different classes to mitigate class imbalance. To evaluate the prediction performance and conduct comparisons, the area under receiver operating characteristic curves (AUCs) were performed for nested CVs for both handcrafted feature selections and the deep learning approach. The significance of differences in AUCs was assessed using the DeLong test of U-statistics.

RESULTS

An MLP-based method using weight pruning (WP) feature selection yielded the best performance among the different hand-crafted feature selection methods (case A), reaching an AUC of 0.804 (95% CI: 0.761-0.823) with 29 top features. A VAE-MLP joint architecture (case C) achieved a comparable but slightly lower AUC of 0.781 (95% CI: 0.737-0.808) with the size of latent dimension being 2. The combination of handcrafted features (case A) and latent representation (case D) achieved a significant AUC improvement of 0.831 (95% CI: 0.805-0.863) with 22 features (P-value = 0.000642 compared with handcrafted features only (Case A) and P-value = 0.000453 compared to VAE alone (Case C)) with an MLP classifier.

CONCLUSION

The potential for combination of traditional machine learning methods and deep learning VAE techniques has been demonstrated for dealing with limited datasets in modeling radiotherapy toxicities. Specifically, latent variables from a VAE-MLP joint architecture are able to complement handcrafted features for the prediction of RP and improve prediction over either method alone.

Collapse

Luna JM, Chao HH, Diffenderfer ES, Valdes G, Chinniah C, Ma G, Cengel KA, Solberg TD, Berman AT, Simone CB. Predicting radiation pneumonitis in locally advanced stage II-III non-small cell lung cancer using machine learning. Radiother Oncol 2019;133:106-112. [PMID: 30935565 DOI: 10.1016/j.radonc.2019.01.003] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 01/04/2019] [Accepted: 01/07/2019] [Indexed: 12/23/2022]

El Naqa I, Kosorok MR, Jin J, Mierzwa M, Ten Haken RK. Prospects and challenges for clinical decision support in the era of big data. JCO Clin Cancer Inform 2018;2:CCI.18.00002. [PMID: 30613823 PMCID: PMC6317743 DOI: 10.1200/cci.18.00002] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open

Heethuis SE, Goense L, van Rossum PSN, Borggreve AS, Mook S, Voncken FEM, Bartels-Rutten A, Aleman BMP, van Hillegersberg R, Ruurda JP, Meijer GJ, Lagendijk JJW, van Lier ALHMW. DW-MRI and DCE-MRI are of complementary value in predicting pathologic response to neoadjuvant chemoradiotherapy for esophageal cancer. Acta Oncol 2018;57:1201-1208. [PMID: 29781342 DOI: 10.1080/0284186x.2018.1473637] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

El Naqa I, Ruan D, Valdes G, Dekker A, McNutt T, Ge Y, Wu QJ, Oh JH, Thor M, Smith W, Rao A, Fuller C, Xiao Y, Manion F, Schipper M, Mayo C, Moran JM, Ten Haken R. Machine learning and modeling: Data, validation, communication challenges. Med Phys 2018;45:e834-e840. [PMID: 30144098 DOI: 10.1002/mp.12811] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Revised: 12/28/2017] [Accepted: 01/22/2018] [Indexed: 11/06/2022] Open

Affiliation(s)

Issam El Naqa Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, USA
Dan Ruan Department of Radiation Oncology, University of California Los Angeles, Los Angeles, CA, USA
Gilmer Valdes Department of Radiation Oncology, University of California Los San Francisco, San Francisco, CA, USA
Andre Dekker GROW-School for Oncology and Developmental Biology, Department of Radiation Oncology (MAASTRO), Maastricht University Medical Center, Maastricht, The Netherlands
Todd McNutt Department of Radiation Oncology, John Hopkins University, Baltimore, MD, USA
Yaorong Ge Department of Software and Information Systems, University of North Carolina, Charlotte, NC, USA
Q Jackie Wu Department of Radiation Oncology, Duke University Medical Center, Durham, NC, USA
Jung Hun Oh Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Maria Thor Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Wade Smith Department of Radiation Oncology, University of Washington, Seattle, WA, USA
Arvind Rao Department of Radiation Oncology, MD Anderson, Houston, TX, USA.,Department of Bioinformatics and Computational Biology, MD Anderson, Houston, TX, USA
Clifton Fuller Department of Radiation Oncology, MD Anderson, Houston, TX, USA
Ying Xiao Department of Radiation Oncology, University of Pennsylvania, Philadelphia, PA, USA
Frank Manion Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, USA
Matthew Schipper Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, USA
Charles Mayo Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, USA
Jean M Moran Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, USA
Randall Ten Haken Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, USA

Collapse

Carrara M, Massari E, Cicchetti A, Giandini T, Avuzzi B, Palorini F, Stucchi C, Fellin G, Gabriele P, Vavassori V, Degli Esposti C, Cozzarini C, Pignoli E, Fiorino C, Rancati T, Valdagni R. Development of a Ready-to-Use Graphical Tool Based on Artificial Neural Network Classification: Application for the Prediction of Late Fecal Incontinence After Prostate Cancer Radiation Therapy. Int J Radiat Oncol Biol Phys 2018;102:1533-1542. [PMID: 30092335 DOI: 10.1016/j.ijrobp.2018.07.2014] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 06/19/2018] [Accepted: 07/26/2018] [Indexed: 12/13/2022]

Abstract

PURPOSE

This study was designed to apply artificial neural network (ANN) classification methods for the prediction of late fecal incontinence (LFI) after high-dose prostate cancer radiation therapy and to develop a ready-to-use graphical tool.

MATERIALS AND METHODS

In this study, 598 men recruited in 2 national multicenter trials were analyzed. Information was recorded on comorbidity, previous abdominal surgery, use of drugs, and dose distribution. Fecal incontinence was prospectively evaluated through self-reported questionnaires. To develop the ANN, the study population was randomly split into training (n = 300), validation (n = 149), and test (n = 149) sets. Mean grade of longitudinal LFI (ie, expressed as the average incontinence grade over the first 3 years after radiation therapy) ≥1 was considered the endpoint. A suitable subset of variables able to better predict LFI was selected by simulating 100,000 ANN configurations. The search for the definitive ANN was then performed by varying the number of inputs and hidden neurons from 4 to 5 and from 1 to 9, respectively. A final classification model was established as the average of the best 5 among 500 ANNs with the same architecture. An ANN-based graphical method to compute LFI prediction was developed to include one continuous and n dichotomous variables.

RESULTS

An ANN architecture was selected, with 5 input variables (mean dose, previous abdominal surgery, use of anticoagulants, use of antihypertensive drugs, and use of neoadjuvant and adjuvant hormone therapy) and 4 hidden neurons. The developed classification model correctly identified patients with LFI with 80.8% sensitivity and 63.7% ± 1.0% specificity and an area under the curve of 0.78. The developed graphical tool may efficiently classify patients in low, intermediate, and high LFI risk classes.

CONCLUSIONS

An ANN-based model was developed to predict LFI. The model was translated in a ready-to-use graphical tool for LFI risk classification, with direct interpretation of the role of the predictors.

Collapse

Affiliation(s)

Mauro Carrara Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.
Eleonora Massari Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Alessandro Cicchetti Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Tommaso Giandini Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Barbara Avuzzi Department of Radiation Oncology 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Federica Palorini Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Claudio Stucchi Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Giovanni Fellin Department of Radiation Oncology, Ospedale Santa Chiara, Trento, Italy
Pietro Gabriele Department of Radiation Oncology, Istituto di Candiolo-Fondazione del Piemonte per l'Oncologia IRCCS, Candiolo, Italy
Vittorio Vavassori Department of Radiation Oncology, Cliniche Gavazzeni-Humanitas, Bergamo, Italy
Claudio Degli Esposti Department of Radiation Oncology, Ospedale Bellaria, Bologna, Italy
Cesare Cozzarini Department of Radiation Oncology, San Raffaele Scientific Institute, Milano, Italy
Emanuele Pignoli Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Claudio Fiorino Department of Medical Physics, San Raffaele Scientific Institute, Milano, Italy
Tiziana Rancati Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
Riccardo Valdagni Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy; Department of Radiation Oncology 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy; Department of Oncology and Hemato-oncology, Università degli Studi di Milano, Milan, Italy

Collapse

Anacleto A, Dias J. Data Analysis in Radiotherapy Treatments. INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS 2018. [DOI: 10.4018/ijehmc.2018070103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Kang J, Rancati T, Lee S, Oh JH, Kerns SL, Scott JG, Schwartz R, Kim S, Rosenstein BS. Machine Learning and Radiogenomics: Lessons Learned and Future Directions. Front Oncol 2018;8:228. [PMID: 29977864 PMCID: PMC6021505 DOI: 10.3389/fonc.2018.00228] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 06/04/2018] [Indexed: 12/25/2022] Open

Abstract

Due to the rapid increase in the availability of patient data, there is significant interest in precision medicine that could facilitate the development of a personalized treatment plan for each patient on an individual basis. Radiation oncology is particularly suited for predictive machine learning (ML) models due to the enormous amount of diagnostic data used as input and therapeutic data generated as output. An emerging field in precision radiation oncology that can take advantage of ML approaches is radiogenomics, which is the study of the impact of genomic variations on the sensitivity of normal and tumor tissue to radiation. Currently, patients undergoing radiotherapy are treated using uniform dose constraints specific to the tumor and surrounding normal tissues. This is suboptimal in many ways. First, the dose that can be delivered to the target volume may be insufficient for control but is constrained by the surrounding normal tissue, as dose escalation can lead to significant morbidity and rare. Second, two patients with nearly identical dose distributions can have substantially different acute and late toxicities, resulting in lengthy treatment breaks and suboptimal control, or chronic morbidities leading to poor quality of life. Despite significant advances in radiogenomics, the magnitude of the genetic contribution to radiation response far exceeds our current understanding of individual risk variants. In the field of genomics, ML methods are being used to extract harder-to-detect knowledge, but these methods have yet to fully penetrate radiogenomics. Hence, the goal of this publication is to provide an overview of ML as it applies to radiogenomics. We begin with a brief history of radiogenomics and its relationship to precision medicine. We then introduce ML and compare it to statistical hypothesis testing to reflect on shared lessons and to avoid common pitfalls. Current ML approaches to genome-wide association studies are examined. The application of ML specifically to radiogenomics is next presented. We end with important lessons for the proper integration of ML into radiogenomics.

Collapse

Gabryś HS, Buettner F, Sterzing F, Hauswald H, Bangert M. Design and Selection of Machine Learning Methods Using Radiomics and Dosiomics for Normal Tissue Complication Probability Modeling of Xerostomia. Front Oncol 2018;8:35. [PMID: 29556480 PMCID: PMC5844945 DOI: 10.3389/fonc.2018.00035] [Citation(s) in RCA: 94] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 02/01/2018] [Indexed: 01/13/2023] Open

Abstract

Purpose

The purpose of this study is to investigate whether machine learning with dosiomic, radiomic, and demographic features allows for xerostomia risk assessment more precise than normal tissue complication probability (NTCP) models based on the mean radiation dose to parotid glands.

Material and methods

A cohort of 153 head-and-neck cancer patients was used to model xerostomia at 0–6 months (early), 6–15 months (late), 15–24 months (long-term), and at any time (a longitudinal model) after radiotherapy. Predictive power of the features was evaluated by the area under the receiver operating characteristic curve (AUC) of univariate logistic regression models. The multivariate NTCP models were tuned and tested with single and nested cross-validation, respectively. We compared predictive performance of seven classification algorithms, six feature selection methods, and ten data cleaning/class balancing techniques using the Friedman test and the Nemenyi post hoc analysis.

Results

NTCP models based on the parotid mean dose failed to predict xerostomia (AUCs < 0.60). The most informative predictors were found for late and long-term xerostomia. Late xerostomia correlated with the contralateral dose gradient in the anterior–posterior (AUC = 0.72) and the right–left (AUC = 0.68) direction, whereas long-term xerostomia was associated with parotid volumes (AUCs > 0.85), dose gradients in the right–left (AUCs > 0.78), and the anterior–posterior (AUCs > 0.72) direction. Multivariate models of long-term xerostomia were typically based on the parotid volume, the parotid eccentricity, and the dose–volume histogram (DVH) spread with the generalization AUCs ranging from 0.74 to 0.88. On average, support vector machines and extra-trees were the top performing classifiers, whereas the algorithms based on logistic regression were the best choice for feature selection. We found no advantage in using data cleaning or class balancing methods.

Conclusion

We demonstrated that incorporation of organ- and dose-shape descriptors is beneficial for xerostomia prediction in highly conformal radiotherapy treatments. Due to strong reliance on patient-specific, dose-independent factors, our results underscore the need for development of personalized data-driven risk profiles for NTCP models of xerostomia. The facilitated machine learning pipeline is described in detail and can serve as a valuable reference for future work in radiomic and dosiomic NTCP modeling.

Collapse

Toesca DAS, Ibragimov B, Koong AJ, Xing L, Koong AC, Chang DT. Strategies for prediction and mitigation of radiation-induced liver toxicity. JOURNAL OF RADIATION RESEARCH 2018;59:i40-i49. [PMID: 29432550 PMCID: PMC5868188 DOI: 10.1093/jrr/rrx104] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2017] [Revised: 12/12/2017] [Indexed: 05/07/2023]

Valdes G, Simone CB, Chen J, Lin A, Yom SS, Pattison AJ, Carpenter CM, Solberg TD. Clinical decision support of radiotherapy treatment planning: A data-driven machine learning strategy for patient-specific dosimetric decision making. Radiother Oncol 2017;125:392-397. [PMID: 29162279 DOI: 10.1016/j.radonc.2017.10.014] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Revised: 10/10/2017] [Accepted: 10/10/2017] [Indexed: 02/06/2023]

Abstract

BACKGROUND AND PURPOSE

Clinical decision support systems are a growing class of tools with the potential to impact healthcare. This study investigates the construction of a decision support system through which clinicians can efficiently identify which previously approved historical treatment plans are achievable for a new patient to aid in selection of therapy.

MATERIAL AND METHODS

Treatment data were collected for early-stage lung and postoperative oropharyngeal cancers treated using photon (lung and head and neck) and proton (head and neck) radiotherapy. Machine-learning classifiers were constructed using patient-specific feature-sets and a library of historical plans. Model accuracy was analyzed using learning curves, and historical treatment plan matching was investigated.

RESULTS

Learning curves demonstrate that for these datasets, approximately 45, 60, and 30 patients are needed for a sufficiently accurate classification model for radiotherapy for early-stage lung, postoperative oropharyngeal photon, and postoperative oropharyngeal proton, respectively. The resulting classification model provides a database of previously approved treatment plans that are achievable for a new patient. An exemplary case, highlighting tradeoffs between the heart and chest wall dose while holding target dose constant in two historical plans is provided.

CONCLUSIONS

We report on the first artificial-intelligence based clinical decision support system that connects patients to past discrete treatment plans in radiation oncology and demonstrate for the first time how this tool can enable clinicians to use past decisions to help inform current assessments. Clinicians can be informed of dose tradeoffs between critical structures early in the treatment process, enabling more time spent on finding the optimal course of treatment for individual patients.

Collapse

El Naqa I, Kerns SL, Coates J, Luo Y, Speers C, West CML, Rosenstein BS, Ten Haken RK. Radiogenomics and radiotherapy response modeling. Phys Med Biol 2017;62:R179-R206. [PMID: 28657906 PMCID: PMC5557376 DOI: 10.1088/1361-6560/aa7c55] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Predicting xerostomia after IMRT treatments: a data mining approach. HEALTH AND TECHNOLOGY 2017. [DOI: 10.1007/s12553-017-0204-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Wang G, Kalra M, Orton CG. Machine learning will transform radiology significantly within the next 5 years. Med Phys 2017;44:2041-2044. [PMID: 28295412 DOI: 10.1002/mp.12204] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 03/02/2017] [Indexed: 01/14/2023] Open

Yahya N, Ebert MA, Bulsara M, House MJ, Kennedy A, Joseph DJ, Denham JW. Statistical-learning strategies generate only modestly performing predictive models for urinary symptoms following external beam radiotherapy of the prostate: A comparison of conventional and machine-learning methods. Med Phys 2017;43:2040. [PMID: 27147316 DOI: 10.1118/1.4944738] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open

Abstract

PURPOSE

Given the paucity of available data concerning radiotherapy-induced urinary toxicity, it is important to ensure derivation of the most robust models with superior predictive performance. This work explores multiple statistical-learning strategies for prediction of urinary symptoms following external beam radiotherapy of the prostate.

METHODS

The performance of logistic regression, elastic-net, support-vector machine, random forest, neural network, and multivariate adaptive regression splines (MARS) to predict urinary symptoms was analyzed using data from 754 participants accrued by TROG03.04-RADAR. Predictive features included dose-surface data, comorbidities, and medication-intake. Four symptoms were analyzed: dysuria, haematuria, incontinence, and frequency, each with three definitions (grade ≥ 1, grade ≥ 2 and longitudinal) with event rate between 2.3% and 76.1%. Repeated cross-validations producing matched models were implemented. A synthetic minority oversampling technique was utilized in endpoints with rare events. Parameter optimization was performed on the training data. Area under the receiver operating characteristic curve (AUROC) was used to compare performance using sample size to detect differences of ≥0.05 at the 95% confidence level.

RESULTS

Logistic regression, elastic-net, random forest, MARS, and support-vector machine were the highest-performing statistical-learning strategies in 3, 3, 3, 2, and 1 endpoints, respectively. Logistic regression, MARS, elastic-net, random forest, neural network, and support-vector machine were the best, or were not significantly worse than the best, in 7, 7, 5, 5, 3, and 1 endpoints. The best-performing statistical model was for dysuria grade ≥ 1 with AUROC ± standard deviation of 0.649 ± 0.074 using MARS. For longitudinal frequency and dysuria grade ≥ 1, all strategies produced AUROC>0.6 while all haematuria endpoints and longitudinal incontinence models produced AUROC<0.6.

CONCLUSIONS

Logistic regression and MARS were most likely to be the best-performing strategy for the prediction of urinary symptoms with elastic-net and random forest producing competitive results. The predictive power of the models was modest and endpoint-dependent. New features, including spatial dose maps, may be necessary to achieve better models.

Collapse

Predictive modelling analysis for development of a radiotherapy decision support system in prostate cancer: a preliminary study. JOURNAL OF RADIOTHERAPY IN PRACTICE 2017. [DOI: 10.1017/s1460396916000583] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Perspectives on making big data analytics work for oncology. Methods 2016;111:32-44. [PMID: 27586524 DOI: 10.1016/j.ymeth.2016.08.010] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Revised: 08/19/2016] [Accepted: 08/25/2016] [Indexed: 12/31/2022] Open

Abstract

Oncology, with its unique combination of clinical, physical, technological, and biological data provides an ideal case study for applying big data analytics to improve cancer treatment safety and outcomes. An oncology treatment course such as chemoradiotherapy can generate a large pool of information carrying the 5Vs hallmarks of big data. This data is comprised of a heterogeneous mixture of patient demographics, radiation/chemo dosimetry, multimodality imaging features, and biological markers generated over a treatment period that can span few days to several weeks. Efforts using commercial and in-house tools are underway to facilitate data aggregation, ontology creation, sharing, visualization and varying analytics in a secure environment. However, open questions related to proper data structure representation and effective analytics tools to support oncology decision-making need to be addressed. It is recognized that oncology data constitutes a mix of structured (tabulated) and unstructured (electronic documents) that need to be processed to facilitate searching and subsequent knowledge discovery from relational or NoSQL databases. In this context, methods based on advanced analytics and image feature extraction for oncology applications will be discussed. On the other hand, the classical p (variables)≫n (samples) inference problem of statistical learning is challenged in the Big data realm and this is particularly true for oncology applications where p-omics is witnessing exponential growth while the number of cancer incidences has generally plateaued over the past 5-years leading to a quasi-linear growth in samples per patient. Within the Big data paradigm, this kind of phenomenon may yield undesirable effects such as echo chamber anomalies, Yule-Simpson reversal paradox, or misleading ghost analytics. In this work, we will present these effects as they pertain to oncology and engage small thinking methodologies to counter these effects ranging from incorporating prior knowledge, using information-theoretic techniques to modern ensemble machine learning approaches or combination of these. We will particularly discuss the pros and cons of different approaches to improve mining of big data in oncology.

Collapse

Valdes G, Solberg TD, Heskel M, Ungar L, Simone CB. Using machine learning to predict radiation pneumonitis in patients with stage I non-small cell lung cancer treated with stereotactic body radiation therapy. Phys Med Biol 2016;61:6105-20. [PMID: 27461154 DOI: 10.1088/0031-9155/61/16/6105] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Abstract

To develop a patient-specific 'big data' clinical decision tool to predict pneumonitis in stage I non-small cell lung cancer (NSCLC) patients after stereotactic body radiation therapy (SBRT). 61 features were recorded for 201 consecutive patients with stage I NSCLC treated with SBRT, in whom 8 (4.0%) developed radiation pneumonitis. Pneumonitis thresholds were found for each feature individually using decision stumps. The performance of three different algorithms (Decision Trees, Random Forests, RUSBoost) was evaluated. Learning curves were developed and the training error analyzed and compared to the testing error in order to evaluate the factors needed to obtain a cross-validated error smaller than 0.1. These included the addition of new features, increasing the complexity of the algorithm and enlarging the sample size and number of events. In the univariate analysis, the most important feature selected was the diffusion capacity of the lung for carbon monoxide (DLCO adj%). On multivariate analysis, the three most important features selected were the dose to 15 cc of the heart, dose to 4 cc of the trachea or bronchus, and race. Higher accuracy could be achieved if the RUSBoost algorithm was used with regularization. To predict radiation pneumonitis within an error smaller than 10%, we estimate that a sample size of 800 patients is required. Clinically relevant thresholds that put patients at risk of developing radiation pneumonitis were determined in a cohort of 201 stage I NSCLC patients treated with SBRT. The consistency of these thresholds can provide radiation oncologists with an estimate of their reliability and may inform treatment planning and patient counseling. The accuracy of the classification is limited by the number of patients in the study and not by the features gathered or the complexity of the algorithm.

Collapse

Li H, Becker N, Raman S, Chan TCY, Bissonnette JP. The value of nodal information in predicting lung cancer relapse using 4DPET/4DCT. Med Phys 2016;42:4727-33. [PMID: 26233200 DOI: 10.1118/1.4926755] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open

Abstract

PURPOSE

There is evidence that computed tomography (CT) and positron emission tomography (PET) imaging metrics are prognostic and predictive in nonsmall cell lung cancer (NSCLC) treatment outcomes. However, few studies have explored the use of standardized uptake value (SUV)-based image features of nodal regions as predictive features. The authors investigated and compared the use of tumor and node image features extracted from the radiotherapy target volumes to predict relapse in a cohort of NSCLC patients undergoing chemoradiation treatment.

METHODS

A prospective cohort of 25 patients with locally advanced NSCLC underwent 4DPET/4DCT imaging for radiation planning. Thirty-seven image features were derived from the CT-defined volumes and SUVs of the PET image from both the tumor and nodal target regions. The machine learning methods of logistic regression and repeated stratified five-fold cross-validation (CV) were used to predict local and overall relapses in 2 yr. The authors used well-known feature selection methods (Spearman's rank correlation, recursive feature elimination) within each fold of CV. Classifiers were ranked on their Matthew's correlation coefficient (MCC) after CV. Area under the curve, sensitivity, and specificity values are also presented.

RESULTS

For predicting local relapse, the best classifier found had a mean MCC of 0.07 and was composed of eight tumor features. For predicting overall relapse, the best classifier found had a mean MCC of 0.29 and was composed of a single feature: the volume greater than 0.5 times the maximum SUV (N).

CONCLUSIONS

The best classifier for predicting local relapse had only tumor features. In contrast, the best classifier for predicting overall relapse included a node feature. Overall, the methods showed that nodes add value in predicting overall relapse but not local relapse.

Collapse

Li G, Wei J, Huang H, Gaebler CP, Yuan A, Deasy JO. Automatic assessment of average diaphragm motion trajectory from 4DCT images through machine learning. Biomed Phys Eng Express 2015;1. [PMID: 27110388 DOI: 10.1088/2057-1976/1/4/045015] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Abstract

To automatically estimate average diaphragm motion trajectory (ADMT) based on four-dimensional computed tomography (4DCT), facilitating clinical assessment of respiratory motion and motion variation and retrospective motion study. We have developed an effective motion extraction approach and a machine-learning-based algorithm to estimate the ADMT. Eleven patients with 22 sets of 4DCT images (4DCT1 at simulation and 4DCT2 at treatment) were studied. After automatically segmenting the lungs, the differential volume-per-slice (dVPS) curves of the left and right lungs were calculated as a function of slice number for each phase with respective to the full-exhalation. After 5-slice moving average was performed, the discrete cosine transform (DCT) was applied to analyze the dVPS curves in frequency domain. The dimensionality of the spectrum data was reduced by using several lowest frequency coefficients (f_v) to account for most of the spectrum energy (Σf_v²). Multiple linear regression (MLR) method was then applied to determine the weights of these frequencies by fitting the ground truth-the measured ADMT, which are represented by three pivot points of the diaphragm on each side. The 'leave-one-out' cross validation method was employed to analyze the statistical performance of the prediction results in three image sets: 4DCT1, 4DCT2, and 4DCT1 + 4DCT2. Seven lowest frequencies in DCT domain were found to be sufficient to approximate the patient dVPS curves (R = 91%-96% in MLR fitting). The mean error in the predicted ADMT using leave-one-out method was 0.3 ± 1.9 mm for the left-side diaphragm and 0.0 ± 1.4 mm for the right-side diaphragm. The prediction error is lower in 4DCT2 than 4DCT1, and is the lowest in 4DCT1 and 4DCT2 combined. This frequency-analysis-based machine learning technique was employed to predict the ADMT automatically with an acceptable error (0.2 ± 1.6 mm). This volumetric approach is not affected by the presence of the lung tumors, providing an automatic robust tool to evaluate diaphragm motion.

Collapse

Kang J, Schwartz R, Flickinger J, Beriwal S. Machine Learning Approaches for Predicting Radiation Therapy Outcomes: A Clinician's Perspective. Int J Radiat Oncol Biol Phys 2015;93:1127-35. [PMID: 26581149 DOI: 10.1016/j.ijrobp.2015.07.2286] [Citation(s) in RCA: 115] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Revised: 07/21/2015] [Accepted: 07/27/2015] [Indexed: 02/06/2023]

Zhao X, Kong D, Jozsef G, Chang J, Wong EK, Formenti SC, Wang Y. Automated beam placement for breast radiotherapy using a support vector machine based algorithm. Med Phys 2012;39:2536-43. [PMID: 22559624 DOI: 10.1118/1.3700736] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open

Abstract

PURPOSE

To develop an automated beam placement technique for whole breast radiotherapy using tangential beams. We seek to find optimal parameters for tangential beams to cover the whole ipsilateral breast (WB) and minimize the dose to the organs at risk (OARs).

METHODS

A support vector machine (SVM) based method is proposed to determine the optimal posterior plane of the tangential beams. Relative significances of including/avoiding the volumes of interests are incorporated into the cost function of the SVM. After finding the optimal 3-D plane that separates the whole breast (WB) and the included clinical target volumes (CTVs) from the OARs, the gantry angle, collimator angle, and posterior jaw size of the tangential beams are derived from the separating plane equation. Dosimetric measures of the treatment plans determined by the automated method are compared with those obtained by applying manual beam placement by the physicians. The method can be further extended to use multileaf collimator (MLC) blocking by optimizing posterior MLC positions.

RESULTS

The plans for 36 patients (23 prone- and 13 supine-treated) with left breast cancer were analyzed. Our algorithm reduced the volume of the heart that receives >500 cGy dose (V5) from 2.7 to 1.7 cm(3) (p = 0.058) on average and the volume of the ipsilateral lung that receives >1000 cGy dose (V10) from 55.2 to 40.7 cm(3) (p = 0.0013). The dose coverage as measured by volume receiving >95% of the prescription dose (V95%) of the WB without a 5 mm superficial layer decreases by only 0.74% (p = 0.0002) and the V95% for the tumor bed with 1.5 cm margin remains unchanged.

CONCLUSIONS

This study has demonstrated the feasibility of using a SVM-based algorithm to determine optimal beam placement without a physician's intervention. The proposed method reduced the dose to OARs, especially for supine treated patients, without any relevant degradation of dose homogeneity and coverage in general.

Collapse

El Naqa I, Pater P, Seuntjens J. Monte Carlo role in radiobiological modelling of radiotherapy outcomes. Phys Med Biol 2012;57:R75-97. [PMID: 22571871 DOI: 10.1088/0031-9155/57/11/r75] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

van der Schaaf A, Xu CJ, van Luijk P, Van't Veld AA, Langendijk JA, Schilstra C. Multivariate modeling of complications with data driven variable selection: guarding against overfitting and effects of data set size. Radiother Oncol 2012;105:115-21. [PMID: 22264894 DOI: 10.1016/j.radonc.2011.12.006] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2011] [Revised: 11/03/2011] [Accepted: 12/12/2011] [Indexed: 11/27/2022]

Abstract

PURPOSE

Multivariate modeling of complications after radiotherapy is frequently used in conjunction with data driven variable selection. This study quantifies the risk of overfitting in a data driven modeling method using bootstrapping for data with typical clinical characteristics, and estimates the minimum amount of data needed to obtain models with relatively high predictive power.

MATERIALS AND METHODS

To facilitate repeated modeling and cross-validation with independent datasets for the assessment of true predictive power, a method was developed to generate simulated data with statistical properties similar to real clinical data sets. Characteristics of three clinical data sets from radiotherapy treatment of head and neck cancer patients were used to simulate data with set sizes between 50 and 1000 patients. A logistic regression method using bootstrapping and forward variable selection was used for complication modeling, resulting for each simulated data set in a selected number of variables and an estimated predictive power. The true optimal number of variables and true predictive power were calculated using cross-validation with very large independent data sets.

RESULTS

For all simulated data set sizes the number of variables selected by the bootstrapping method was on average close to the true optimal number of variables, but showed considerable spread. Bootstrapping is more accurate in selecting the optimal number of variables than the AIC and BIC alternatives, but this did not translate into a significant difference of the true predictive power. The true predictive power asymptotically converged toward a maximum predictive power for large data sets, and the estimated predictive power converged toward the true predictive power. More than half of the potential predictive power is gained after approximately 200 samples. Our simulations demonstrated severe overfitting (a predicative power lower than that of predicting 50% probability) in a number of small data sets, in particular in data sets with a low number of events (median: 7, 95th percentile: 32). Recognizing overfitting from an inverted sign of the estimated model coefficients has a limited discriminative value.

CONCLUSIONS

Despite considerable spread around the optimal number of selected variables, the bootstrapping method is efficient and accurate for sufficiently large data sets, and guards against overfitting for all simulated cases with the exception of some data sets with a particularly low number of events. An appropriate minimum data set size to obtain a model with high predictive power is approximately 200 patients and more than 32 events. With fewer data samples the true predictive power decreases rapidly, and for larger data set sizes the benefit levels off toward an asymptotic maximum predictive power.

Collapse

Pella A, Cambria R, Riboldi M, Jereczek-Fossa BA, Fodor C, Zerini D, Torshabi AE, Cattani F, Garibaldi C, Pedroli G, Baroni G, Orecchia R. Use of machine learning methods for prediction of acute toxicity in organs at risk following prostate radiotherapy. Med Phys 2011;38:2859-67. [PMID: 21815361 DOI: 10.1118/1.3582947] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open

Buettner F, Gulliford SL, Webb S, Partridge M. Modeling late rectal toxicities based on a parameterized representation of the 3D dose distribution. Phys Med Biol 2011;56:2103-18. [PMID: 21386140 DOI: 10.1088/0031-9155/56/7/013] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Abstract

Many models exist for predicting toxicities based on dose-volume histograms (DVHs) or dose-surface histograms (DSHs). This approach has several drawbacks as firstly the reduction of the dose distribution to a histogram results in the loss of spatial information and secondly the bins of the histograms are highly correlated with each other. Furthermore, some of the complex nonlinear models proposed in the past lack a direct physical interpretation and the ability to predict probabilities rather than binary outcomes. We propose a parameterized representation of the 3D distribution of the dose to the rectal wall which explicitly includes geometrical information in the form of the eccentricity of the dose distribution as well as its lateral and longitudinal extent. We use a nonlinear kernel-based probabilistic model to predict late rectal toxicity based on the parameterized dose distribution and assessed its predictive power using data from the MRC RT01 trial (ISCTRN 47772397). The endpoints under consideration were rectal bleeding, loose stools, and a global toxicity score. We extract simple rules identifying 3D dose patterns related to a specifically low risk of complication. Normal tissue complication probability (NTCP) models based on parameterized representations of geometrical and volumetric measures resulted in areas under the curve (AUCs) of 0.66, 0.63 and 0.67 for predicting rectal bleeding, loose stools and global toxicity, respectively. In comparison, NTCP models based on standard DVHs performed worse and resulted in AUCs of 0.59 for all three endpoints. In conclusion, we have presented low-dimensional, interpretable and nonlinear NTCP models based on the parameterized representation of the dose to the rectal wall. These models had a higher predictive power than models based on standard DVHs and their low dimensionality allowed for the identification of 3D dose patterns related to a low risk of complication.

Collapse

Arimura H, Magome T, Anai S, Shioyama Y, Nakamura K. [Medical imaging processing and evaluation in radiation therapy]. Nihon Hoshasen Gijutsu Gakkai Zasshi 2011;67:76-83. [PMID: 21301175 DOI: 10.6009/jjrt.67.76] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Huang EX, Hope AJ, Lindsay PE, Trovo M, El Naqa I, Deasy JO, Bradley JD. Heart irradiation as a risk factor for radiation pneumonitis. Acta Oncol 2011;50:51-60. [PMID: 20874426 DOI: 10.3109/0284186x.2010.521192] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Roelofs E, Persoon L, Qamhiyeh S, Verhaegen F, De Ruysscher D, Scholz M, Iancu G, Engelsman M, Rasch C, Zijp L, Meerleer GD, Coghe M, Langendijk J, Schilstra C, Pijls-Johannesma M, Lambin P. Design of and technical challenges involved in a framework for multicentric radiotherapy treatment planning studies. Radiother Oncol 2010;97:567-71. [PMID: 20864198 DOI: 10.1016/j.radonc.2010.08.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2009] [Revised: 04/06/2010] [Accepted: 08/12/2010] [Indexed: 12/25/2022]

Jayasurya K, Fung G, Yu S, Dehing-Oberije C, De Ruysscher D, Hope A, De Neve W, Lievens Y, Lambin P, Dekker ALAJ. Comparison of Bayesian network and support vector machine models for two-year survival prediction in lung cancer patients treated with radiotherapy. Med Phys 2010;37:1401-7. [PMID: 20443461 DOI: 10.1118/1.3352709] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open

Abstract

PURPOSE

Classic statistical and machine learning models such as support vector machines (SVMs) can be used to predict cancer outcome, but often only perform well if all the input variables are known, which is unlikely in the medical domain. Bayesian network (BN) models have a natural ability to reason under uncertainty and might handle missing data better. In this study, the authors hypothesize that a BN model can predict two-year survival in non-small cell lung cancer (NSCLC) patients as accurately as SVM, but will predict survival more accurately when data are missing.

METHODS

A BN and SVM model were trained on 322 inoperable NSCLC patients treated with radiotherapy from Maastricht and validated in three independent data sets of 35, 47, and 33 patients from Ghent, Leuven, and Toronto. Missing variables occurred in the data set with only 37, 28, and 24 patients having a complete data set.

RESULTS

The BN model structure and parameter learning identified gross tumor volume size, performance status, and number of positive lymph nodes on a PET as prognostic factors for two-year survival. When validated in the full validation set of Ghent, Leuven, and Toronto, the BN model had an AUC of 0.77, 0.72, and 0.70, respectively. A SVM model based on the same variables had an overall worse performance (AUC 0.71, 0.68, and 0.69) especially in the Ghent set, which had the highest percentage of missing the important GTV size data. When only patients with complete data sets were considered, the BN and SVM model performed more alike.

CONCLUSIONS

Within the limitations of this study, the hypothesis is supported that BN models are better at handling missing data than SVM models and are therefore more suitable for the medical domain. Future works have to focus on improving the BN performance by including more patients, more variables, and more diversity.

Collapse