Yoganathan SA, Ahmed S, Paloor S, Torfeh T, Aouadi S, Al-Hammadi N, Hammoud R. Virtual pretreatment patient-specific quality assurance of volumetric modulated arc therapy using deep learning.
Med Phys 2023;
50:7891-7903. [PMID:
37379068 DOI:
10.1002/mp.16567]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 05/16/2023] [Accepted: 06/02/2023] [Indexed: 06/29/2023] Open
Abstract
BACKGROUND
Automatic patient-specific quality assurance (PSQA) is recently explored using artificial intelligence approaches, and several studies reported the development of machine learning models for predicting the gamma pass rate (GPR) index only.
PURPOSE
To develop a novel deep learning approach using a generative adversarial network (GAN) to predict the synthetic measured fluence.
METHODS AND MATERIALS
A novel training method called "dual training," which involves the training of the encoder and decoder separately, was proposed and evaluated for cycle GAN (cycle-GAN) and conditional GAN (c-GAN). A total of 164 VMAT treatment plans, including 344 arcs (training data: 262, validation data: 30, and testing data: 52) from various treatment sites, were selected for prediction model development. For each patient, portal-dose-image-prediction fluence from TPS was used as input, and measured fluence from EPID was used as output/response for model training. Predicted GPR was derived by comparing the TPS fluence with the synthetic measured fluence generated by the DL models using gamma evaluation of criteria 2%/2 mm. The performance of dual training was compared against the traditional single-training approach. In addition, we also developed a separate classification model specifically designed to detect automatically three types of errors (rotational, translational, and MU-scale) in the synthetic EPID-measured fluence.
RESULTS
Overall, the dual training improved the prediction accuracy of both cycle-GAN and c-GAN. Predicted GPR results of single training were within 3% for 71.2% and 78.8% of test cases for cycle-GAN and c-GAN, respectively. Moreover, similar results for dual training were 82.7% and 88.5% for cycle-GAN and c-GAN, respectively. The error detection model showed high classification accuracy (>98%) for detecting errors related to rotational and translational errors. However, it struggled to differentiate the fluences with "MU scale error" from "error-free" fluences.
CONCLUSION
We developed a method to automatically generate the synthetic measured fluence and identify errors within them. The proposed dual training improved the PSQA prediction accuracy of both the GAN models, with c-GAN demonstrating superior performance over the cycle-GAN. Our results indicate that the c-GAN with dual training approach combined with error detection model, can accurately generate the synthetic measured fluence for VMAT PSQA and identify the errors. This approach has the potential to pave the way for virtual patient-specific QA of VMAT treatments.
Collapse