1
|
Khelil MM. [Six Sigma driven QC management in hepatitis C serology]. Ann Biol Clin (Paris) 2024; 82:215-224. [PMID: 38702909 DOI: 10.1684/abc.2024.1885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/06/2024]
Abstract
The Westgard quality control (QC) rules are often applied in infectious diseases serology to validate the quality of results, but this requires a reasonable tradeoff between maximum sensitivity to errors and minimum false rejections. This article, in addition to illustrate the six sigma methodology in the QC management of the (anti-HCV Architect®) test, it discusses the main influencing factors on sigma value. Data from low positive and in-kit control materials spreading over 6 months and using four reagent kits, were used to calculate the precision of the test. The difference between the control material reactivity and the cut-off defined the error budget. Sigma values were > 6, which indicates that the method produces four erroneous results per million tests. The application of the six sigma concept made it possible to argue the choice of the new QC strategy (use of 13S rule with one positive control) and to relax the existing QC rules. This work provides a framework for infectious diseases serology laboratories to evaluate tests performances against a quality requirement and design an optimal QC strategy.
Collapse
|
2
|
Müller S. Objektivierung der Versorgungsqualität in der Viszeralchirurgie. Zentralbl Chir 2024; 149:213-215. [PMID: 38838695 DOI: 10.1055/a-2280-5655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]
|
3
|
Basaula D, Hay B, Wright M, Hall L, Easdon A, McWiggan P, Yeo A, Ungureanu E, Kron T. Additive manufacturing of patient specific bolus for radiotherapy: large scale production and quality assurance. Phys Eng Sci Med 2024; 47:551-561. [PMID: 38285272 PMCID: PMC11166743 DOI: 10.1007/s13246-024-01385-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Accepted: 01/07/2024] [Indexed: 01/30/2024]
Abstract
Bolus is commonly used to improve dose distributions in radiotherapy in particular if dose to skin must be optimised such as in breast or head and neck cancer. We are documenting four years of experience with 3D printed bolus at a large cancer centre. In addition to this we review the quality assurance (QA) program developed to support it. More than 2000 boluses were produced between Nov 2018 and Feb 2023 using fused deposition modelling (FDM) printing with polylactic acid (PLA) on up to five Raise 3D printers. Bolus is designed in the radiotherapy treatment planning system (Varian Eclipse), exported to an STL file followed by pre-processing. After checking each bolus with CT scanning initially we now produce standard quality control (QC) wedges every month and whenever a major change in printing processes occurs. A database records every bolus printed and manufacturing details. It takes about 3 days from designing the bolus in the planning system to delivering it to treatment. A 'premium' PLA material (Spidermaker) was found to be best in terms of homogeneity and CT number consistency (80 HU +/- 8HU). Most boluses were produced for photon beams (93.6%) with the rest used for electrons. We process about 120 kg of PLA per year with a typical bolus weighing less than 500 g and the majority of boluses 5 mm thick. Print times are proportional to bolus weight with about 24 h required for 500 g material deposited. 3D printing using FDM produces smooth and reproducible boluses. Quality control is essential but can be streamlined.
Collapse
|
4
|
Yan B, Shi J, Xue X, Peng H, Wu A, Wang X, Ma C. Error detection using a multi-channel hybrid network with a low-resolution detector in patient-specific quality assurance. J Appl Clin Med Phys 2024; 25:e14327. [PMID: 38488663 PMCID: PMC11163496 DOI: 10.1002/acm2.14327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 02/15/2024] [Accepted: 02/20/2024] [Indexed: 06/11/2024] Open
Abstract
PURPOSE This study aimed to develop a hybrid multi-channel network to detect multileaf collimator (MLC) positional errors using dose difference (DD) maps and gamma maps generated from low-resolution detectors in patient-specific quality assurance (QA) for Intensity Modulated Radiation Therapy (IMRT). METHODS A total of 68 plans with 358 beams of IMRT were included in this study. The MLC leaf positions of all control points in the original IMRT plans were modified to simulate four types of errors: shift error, opening error, closing error, and random error. These modified plans were imported into the treatment planning system (TPS) to calculate the predicted dose, while the PTW seven29 phantom was utilized to obtain the measured dose distributions. Based on the measured and predicted dose, DD maps and gamma maps, both with and without errors, were generated, resulting in a dataset with 3222 samples. The network's performance was evaluated using various metrics, including accuracy, sensitivity, specificity, precision, F1-score, ROC curves, and normalized confusion matrix. Besides, other baseline methods, such as single-channel hybrid network, ResNet-18, and Swin-Transformer, were also evaluated as a comparison. RESULTS The experimental results showed that the multi-channel hybrid network outperformed other methods, demonstrating higher average precision, accuracy, sensitivity, specificity, and F1-scores, with values of 0.87, 0.89, 0.85, 0.97, and 0.85, respectively. The multi-channel hybrid network also achieved higher AUC values in the random errors (0.964) and the error-free (0.946) categories. Although the average accuracy of the multi-channel hybrid network was only marginally better than that of ResNet-18 and Swin Transformer, it significantly outperformed them regarding precision in the error-free category. CONCLUSION The proposed multi-channel hybrid network exhibits a high level of accuracy in identifying MLC errors using low-resolution detectors. The method offers an effective and reliable solution for promoting quality and safety of IMRT QA.
Collapse
|
5
|
Berk K, Kron T, Hardcastle N, Yeo AU. Efficacious patient-specific QA for Vertebra SBRT using a high-resolution detector array SRS MapCHECK: AAPM TG-218 analysis. J Appl Clin Med Phys 2024; 25:e14276. [PMID: 38414322 PMCID: PMC11163485 DOI: 10.1002/acm2.14276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/01/2023] [Accepted: 12/22/2023] [Indexed: 02/29/2024] Open
Abstract
PURPOSE Patient-specific quality assurance (PSQA) for vertebra stereotactic body radiation therapy (SBRT) presents challenges due to highly modulated small fields with high-dose gradients between the target and spinal cord. This study aims to explore the use of the SRS MapCHECK® (SRSMC) for vertebra SBRT PSQA. METHODS Twenty vertebra SBRT treatment plans including prescriptions 20 Gy/1 fraction and 24 Gy/2 fractions were selected for each of Millennium (M)-Multileaf Collimator (MLC), and high-definition (HD)-MLC. All 40 plans were measured using Gafchromic EBT3 film (film) and SRSMC, using the StereoPHAN phantom. Plan complexity was assessed using modulation complexity score (MCS), edge metric (EM) (mm-1), modulation factor (MU/cGy), and average leaf pair opening (ALPO) (mm) and its correlation with gamma-pass rate was investigated. The high dose gradient between the target and the spinal cord was analyzed for film and SRSMC and compared against the treatment planning system (TPS). Applying the methodology proposed by AAPM TG-218, action and tolerance values specific to the SRSMC for vertebra SBRT were determined for β values ranging from 5 to 8. RESULTS Film and SRSMC gamma-pass rates showed no correlation (p > 0.05). A moderate negative correlation (R = -0.57, p = 0.01) is present between EM and SRSMC 3%/1 mm gamma-pass rate for HD-MLC plans. Both film and SRSMC accurately measured high dose gradients between the target and the spinal cord (R2 > 0.86, p ≤ 0.05). Notably, dose-gradient of HD-MLC plans is 22% steeper and has a smaller standard deviation to M-MLC plans (p ≤ 0.05). Applying TG-218, the film tolerance limit was 96% with action limit 95% for 5%/1 mm (β = 6) and for the SRSMC tolerance limit was 97% with an action limit of 96% for 4%/1 mm (β = 6). CONCLUSION Our findings suggest that universal TG-218 limits may not be suitable for vertebra SBRT PSQA. This study demonstrates that SRSMC is a viable tool for vertebra SBRT PSQA, supported by TG-218 implementation of process-based tolerance and action limits.
Collapse
|
6
|
Erickson B, Cui Y, Alber M, Wang C, Fang Yin F, Kirkpatrick J, Adamson J. Independent Monte Carlo dose calculation identifies single isocenter multi-target radiosurgery targets most likely to fail pre-treatment measurement. J Appl Clin Med Phys 2024; 25:e14290. [PMID: 38289874 PMCID: PMC11163499 DOI: 10.1002/acm2.14290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 01/04/2024] [Accepted: 01/08/2024] [Indexed: 02/01/2024] Open
Abstract
PURPOSE For individual targets of single isocenter multi-target (SIMT) Stereotactic radiosurgery (SRS), we assess dose difference between the treatment planning system (TPS) and independent Monte Carlo (MC), and demonstrate persistence into the pre-treatment Quality Assurance (QA) measurement. METHODS Treatment plans from 31 SIMT SRS patients were recalculated in a series of scenarios designed to investigate sources of discrepancy between TPS and independent MC. Targets with > 5% discrepancy in DMean[Gy] after progressing through all scenarios were measured with SRS MapCHECK. A matched pair analysis was performed comparing SRS MapCHECK results for these targets with matched targets having similar characteristics (volume & distance from isocenter) but no such MC dose discrepancy. RESULTS Of 217 targets analyzed, individual target mean dose (DMean[Gy]) fell outside a 5% threshold for 28 and 24 targets before and after removing tissue heterogeneity effects, respectively, while only 5 exceeded the threshold after removing effect of patient geometry (via calculation on StereoPHAN geometry). Significant factors affecting agreement between the TPS and MC included target distance from isocenter (0.83% decrease in DMean[Gy] per 2 cm), volume (0.15% increase per cc), and degree of plan modulation (0.37% increase per 0.01 increase in modulation complexity score). SRS MapCHECK measurement had better agreement with MC than with TPS (2%/1 mm / 10% threshold gamma pass rate (GPR) = 99.4 ± 1.9% vs. 93.1 ± 13.9%, respectively). In the matched pair analysis, targets exceeding 5% for MC versus TPS also had larger discrepancies between TPS and measurement with no GPR (2%/1 mm / 10% threshold) exceeding 90% (71.5% ± 16.1%); whereas GPR was high for matched targets with no such MC versus TPS difference (96.5% ± 3.3%, p = 0.01). CONCLUSIONS Independent MC complements pre-treatment QA measurement for SIMT SRS by identifying problematic individual targets prior to pre-treatment measurement, thus enabling plan modifications earlier in the planning process and guiding selection of targets for pre-treatment QA measurement.
Collapse
|
7
|
Misa J, McCarthy S, Clair WS, Pokhrel D. Stereotactic radiotherapy of intracranial tumor beds on a ring-mounted Halcyon LINAC. J Appl Clin Med Phys 2024; 25:e14281. [PMID: 38277473 PMCID: PMC11163492 DOI: 10.1002/acm2.14281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/21/2023] [Accepted: 01/03/2024] [Indexed: 01/28/2024] Open
Abstract
PURPOSE This study sought to evaluate the feasibility and efficacy of the Halcyon Ring Delivery System (RDS) for delivering stereotactic radiotherapy (SRT) treatments for intracranial tumors beds. METHODS Ten previously treated brain SRT patients for 30 Gy in five fractions with non-coplanar HyperArc plans on TrueBeam (6MV-FFF) were replanned on Halcyon (6MV-FFF) using the same number of arcs and Eclipse's AcurosXB dose engine. Plan quality evaluation metrics per SRT protocol included: PTV coverage, GTV dose (minimum and mean), target conformity indices (CI), heterogeneity index (HI), gradient index (GI), maximum dose 2 cm away from the PTV (D2cm), and doses to organs-at-risk (OAR). Additionally, patient-specific quality assurance (QA) results and beam-on-time (BOT) were analyzed. RESULTS The Halcyon RDS provided highly conformal SRT plans for intracranial tumor beds with similar dose to target. When benchmarked against clinically delivered HyperArc plans, target coverage, CI(s) and HI were statistically similar. The Halcyon plans saw no statistical difference in maximum OAR doses to the brainstem, spinal cord, and cochlea. Due to the machine's coplanar geometry, the Halcyon plans showed a decrease in optic pathway dose (0.75 Gy vs. 2.08 Gy, p = 0.029). Overall, Halcyon's coplanar geometry resulted in a larger GI (3.33 vs. 2.72, p = 0.008) and a larger D2cm (39.59% vs. 29.07%, p < 0.001). In this cohort, multiple cases had the PTV and the optic pathway in the same axial plane. In one such instance, the PTV was <2 cm away from the optic pathway but even at this close proximity OAR, Halcyon still adequately spared the optic pathway. Additionally, the Halcyon's geometry provided slightly larger amount of normal brain dose receiving 24.4 Gy (8.99 cc vs. 7.36 cc) and 28.8 Gy (2.9 cc vs. 2.5 cc), although statistically insignificant. The Halcyon plans achieved similar delivery accuracy, quantified by patient-specific QA results evaluated with a 2%/2 mm gamma criteria (99.42% vs. 99.70%). For both plans, independent Monte Carlo second checks calculation agreed within 1%. Average Halcyon BOT was slightly higher by 0.35 min (p = 0.045), however, due to the one-step patient set-up and verification overall estimated treatment times on Halcyon were lower compared to HyperArc treatments (7.61 min vs. 10.26 min, p < 0.001). CONCLUSIONS When benchmarked against clinically delivered HyperArc treatments, the Halcyon brain SRT plans provided similar plan quality and delivery accuracy but achieved faster overall treatment times. We have started treating select brain SRT patients on the Halcyon RDS for patients having tumor beds greater than 1 cm in diameter with the closest OAR distance of greater than 2 cm away from the target. We recommend other clinics to consider commissioning SRT treatments on their Halcyon systems-allowing including remote Halcyon-only clinics to provide exceptionally high-quality therapeutic brain SRT treatments to an otherwise underserved patient cohort.
Collapse
|
8
|
Kaidar-Person O, Meattini I, Boersma LJ, Becherini C, Cortes J, Curigliano G, de Azambuja E, Harbeck N, Rugo HS, Del Mastro L, Gennari A, Isacke CM, Vestmø Maraldo M, Marangoni E, Nader Marta G, Mjaaland I, Salvestrini V, Spanic T, Visani L, Morandi A, Lambertini M, Livi L, Coles CE, Poortmans P, Offersen BV. Essential requirements for reporting radiation therapy in breast cancer clinical trials: An international multi-disciplinary consensus endorsed by the European Society for Radiotherapy and Oncology (ESTRO). Radiother Oncol 2024; 195:110060. [PMID: 38122852 DOI: 10.1016/j.radonc.2023.110060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 12/12/2023] [Accepted: 12/14/2023] [Indexed: 12/23/2023]
Abstract
The European Society for Radiotherapy and Oncology (ESTRO) has advocated the establishment of guidelines to optimise precision radiotherapy (RT) in conjunction with contemporary therapeutics for cancer care. Quality assurance in RT (QART) plays a pivotal role in influencing treatment outcomes. Clinical trials incorporating QART protocols have demonstrated improved survival rates with minimal associated toxicity. Nonetheless, in routine clinical practice, there can be variability in the indications for RT, dosage, fractionation, and treatment planning, leading to uncertainty. In pivotal trials reporting outcomes of systemic therapy for breast cancer, there is limited information available regarding RT, and the potential interaction between modern systemic therapy and RT remains largely uncharted. This article is grounded in a consensus recommendation endorsed by ESTRO, formulated by international breast cancer experts. The consensus was reached through a modified Delphi process and was presented at an international meeting convened in Florence, Italy, in June 2023. These recommendations are regarded as both optimal and essential standards, with the latter aiming to define the minimum requirements. A template for a case report form (CRF) has been devised, which can be utilised by all clinical breast cancer trials involving RT. Optimal requirements include adherence to predefined RT planning protocols and centralised QART. Essential requirements aim to reduce variations and deviations from the guidelines in RT, even when RT is not the primary focus of the trial. These recommendations underscore the significance of implementing these practices in both clinical trials and daily clinical routines to generate high-quality data.
Collapse
|
9
|
Lutaka M, Speelman A, Naidoo S, Hamunyela R. Quality audits of nuclear medicine practices in a middle-income African setting. J Med Radiat Sci 2024; 71:186-193. [PMID: 38009579 DOI: 10.1002/jmrs.743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 11/10/2023] [Indexed: 11/29/2023] Open
Abstract
INTRODUCTION The International Atomic Energy Agency (IAEA) introduced a Quality Management Audits in Nuclear Medicine (QUANUM) programme, to improve nuclear medicine practice standards aligned with international standards through self-assessments. The absence of quality management audits in nuclear medicine departments could potentially result in a compromise in the safety and quality of patient care. To date, there is no evidence that quality audits have been conducted in nuclear medicine departments of this middle-income country. This quality audit, therefore, assessed conformance to the IAEA QUANUM programme in four nuclear medicine departments. METHODS The study adopted a quantitative methodological exploratory approach. The IAEA QUANUM programme was used to audit nuclear medicine services' overall activity such as clinical practice, management, radiopharmacy, general and radiation safety, quality assurance, operations and services. The data was collected via document analysis in four nuclear medicine department identified as Sites A-D. RESULTS Overall results showed that Site A conformed with 247 out of 370 (67%) counts and non-conformed with 123 out of 370 (33%) counts whilst Site B conformed with 205 out of 342 (60%) counts and non-conformed with 137 out of 342 counts (40%). Site C conformed with 259 out of 345 (75%) counts and non-conformed with 86 out of 345 (25%) counts. Site D conformed with 166 out of 349 (48%) counts and non-conformed with 183 out of 349 (52%) counts. The study yielded 125 overall recommendations. CONCLUSIONS All the sites demonstrated good compliance to international standards in radionuclide therapy. Site A complied poorly in strategies and policies, whilst Site B complied poorly in quality control of equipment. Site C showed poor compliance to human resource development and Site D showed aspects pertaining to administration and management as well as evaluation of quality systems.
Collapse
|
10
|
Mukwada G, Hirst A, Rowshanfarzad P, Ebert MA. Development of a 3D printed phantom for commissioning and quality assurance of multiple brain targets stereotactic radiosurgery. Phys Eng Sci Med 2024; 47:455-463. [PMID: 38285271 PMCID: PMC11166808 DOI: 10.1007/s13246-023-01374-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 12/18/2023] [Indexed: 01/30/2024]
Abstract
Single plan techniques for multiple brain targets (MBT) stereotactic radiosurgery (SRS) are now routine. Patient specific quality assurance (QA) for MBT poses challenges due to the limited capabilities of existing QA tools which necessitates several plan redeliveries. This study sought to develop an SRS QA phantom that enables flexible MBT patient specific QA in a single delivery, along with complex SRS commissioning. PLA marble and PLA StoneFil materials were selected based on the literature and previous research conducted in our department. The HU numbers were investigated to determine the appropriate percentage infill for skull and soft-tissue equivalence. A Prusa MK3S printer in conjunction with the above-mentioned filaments were used to print the SRS QA phantom. Quality control (QC) was performed on the printed skull, film inserts and plugs for point dose measurements. EBT3 film and point dose measurements were performed using a CC04 ionisation chamber. QC demonstrated that the SRS QA phantom transverse, coronal and sagittal film planes were orthogonal within 0.5°. HU numbers for the skull, film inserts and plugs were 858 ± 20 and 35 ± 12 respectively. Point and EBT3 film dose measurements were within 2.5% and 3%/2 mm 95% gamma pass rate, respectively except one Gross Tumour Volume (GTV) that had a slightly lower gamma pass rate. Dose distributions to five GTVs were measured with EBT3 film in a single plan delivery on CyberKnife. In conclusion, an SRS QA phantom was designed, and 3D printed and its use for performing complex MBT patient specific QA in a single delivery was demonstrated.
Collapse
|
11
|
Deng J, Liu S, Huang Y, Li X, Wu X. Evaluating AAPM-TG-218 recommendations: Gamma index tolerance and action limits in IMRT and VMAT quality assurance using SunCHECK. J Appl Clin Med Phys 2024; 25:e14277. [PMID: 38243604 PMCID: PMC11163510 DOI: 10.1002/acm2.14277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 10/23/2023] [Accepted: 12/17/2023] [Indexed: 01/21/2024] Open
Abstract
PURPOSE This study aimed to improve the safety and accuracy of radiotherapy by establishing tolerance (TL) and action (AL) limits for the gamma index in patient-specific quality assurance (PSQA) for intensity-modulated radiation therapy (IMRT) and volumetric-modulated arc therapy (VMAT) using SunCHECK software, as per AAPM TG-218 report recommendations. METHODS The study included 125 patients divided into six groups by treatment regions (H&N, thoracic and pelvic) and techniques (VMAT, IMRT). SunCHECK was used to calculate the gamma passing rate (%GP) and dose error (%DE) for each patient, for the planning target volume and organs at risk (OARs). The TL and AL were then determined for each group according to TG-218 recommendations. We conducted a comprehensive analysis to compare %DE among different groups and examined the relationship between %GP and %DE. RESULTS The TL and AL of all groups were more stringent than the common standard as defined by the TG218 report. The TL and AL values of the groups differed significantly, and the values for the thoracic groups were lower for both VMAT and IMRT. The %DE of the parameters D95%, D90%, and Dmean in the planning target volume, and Dmean and Dmax in OARs were significantly different. The dose deviation of VMAT was larger than IMRT, especially in the thoracic group. A %GP and %DE correlation analysis showed a strong correlation for the planning target volume, but a weak correlation for the OARs. Additionally, a significant correlation existed between %GP of SunCHECK and Delta4. CONCLUSION The study established TL and AL values tailored to various anatomical regions and treatment techniques at our institution. Establishing PSQA workflows for VMAT and IMRT offers valuable clinical insights and guidance. We also suggest developing a standard combining clinically relevant metrics with %GP to evaluate PSQA results comprehensively.
Collapse
|
12
|
Pathmaraj K. The importance of quality management systems in nuclear medicine departments. J Med Radiat Sci 2024; 71:167-169. [PMID: 38641981 DOI: 10.1002/jmrs.793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 04/07/2024] [Indexed: 04/21/2024] Open
Abstract
Quality management systems (QMS) in nuclear medicine is an essential component of the Quality program and is instrumental in the safe delivery of a high standard clinical service. The IAEA QUANUM program is a nuclear medicine specific audit program that can be used to assess the standards of a nuclear medicine department and its service delivery. Regular internal and external audits are encouraged as part of the QMS.
Collapse
|
13
|
Dreindl R, Bolsa‐Ferruz M, Fayos‐Sola R, Padilla Cabal F, Scheuchenpflug L, Elia A, Amico A, Carlino A, Stock M, Grevillot L. Commissioning and clinical implementation of an independent dose calculation system for scanned proton beams. J Appl Clin Med Phys 2024; 25:e14328. [PMID: 38553788 PMCID: PMC11087175 DOI: 10.1002/acm2.14328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 01/31/2024] [Accepted: 02/07/2024] [Indexed: 05/12/2024] Open
Abstract
PURPOSE Experimental patient-specific QA (PSQA) is a time and resource-intensive process, with a poor sensitivity in detecting errors. Radiation therapy facilities aim to substitute it by means of independent dose calculation (IDC) in combination with a comprehensive beam delivery QA program. This paper reports on the commissioning of the IDC software tool myQA iON (IBA Dosimetry) for proton therapy and its clinical implementation at the MedAustron Ion Therapy Center. METHODS The IDC commissioning work included the validation of the beam model, the implementation and validation of clinical CT protocols, and the evaluation of patient treatment data. Dose difference maps, gamma index distributions, and pass rates (GPR) have been reviewed. The performance of the IDC tool has been assessed and clinical workflows, simulation settings, and GPR tolerances have been defined. RESULTS Beam model validation showed agreement of ranges within ± 0.2 mm, Bragg-Peak widths within ± 0.1 mm, and spot sizes at various air gaps within ± 5% compared to physical measurements. Simulated dose in 2D reference fields deviated by -0.3% ± 0.5%, while 3D dose distributions differed by 1.8% on average to measurements. Validation of the CT calibration resulted in systematic differences of 2.0% between IDC and experimental data for tissue like samples. GPRs of 99.4 ± 0.6% were found for head, head and neck, and pediatric CT protocols on a 2%/2 mm gamma criterion. GPRs for the adult abdomen protocol were at 98.9% on average with 3%/3 mm. Root causes of GPR outliers, for example, implants were identified and evaluated. CONCLUSION IDC has been successfully commissioned and integrated into the MedAustron clinical workflow for protons in 2021. IDC has been stepwise and safely substituting experimental PSQA since February 2021. The initial reduction of proton experimental PSQA was about 25% and reached up to 90% after 1 year.
Collapse
|
14
|
Ludmir EB, Hoffman KE, Jhingran A, Kouzy R, Ip MCP, Sturdevant L, Ning MS, Minsky BD, McAleer MF, Chronowski GM, Arzu IY, Reed VK, Garg AK, Roberts T, Eastwick GA, Olson MR, Selek U, Gabel M, Koong AC, Kupferman ME, Kuban DA. Implementation and Efficacy of a Large-Scale Radiation Oncology Case-Based Peer-Review Quality Program across a Multinational Cancer Network. Pract Radiat Oncol 2024; 14:e173-e179. [PMID: 38176466 DOI: 10.1016/j.prro.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 12/05/2023] [Accepted: 12/14/2023] [Indexed: 01/06/2024]
Abstract
PURPOSE With expansion of academic cancer center networks across geographically-dispersed sites, ensuring high-quality delivery of care across all network affiliates is essential. We report on the characteristics and efficacy of a radiation oncology peer-review quality assurance (QA) system implemented across a large-scale multinational cancer network. METHODS AND MATERIALS Since 2014, weekly case-based peer-review QA meetings have been standard for network radiation oncologists with radiation oncology faculty at a major academic center. This radiotherapy (RT) QA program involves pre-treatment peer-review of cases by disease site, with disease-site subspecialized main campus faculty members. This virtual QA platform involves direct review of the proposed RT plan as well as supporting data, including relevant pathology and imaging studies for each patient. Network RT plans were scored as being concordant or nonconcordant based on national guidelines, institutional recommendations, and/or expert judgment when considering individual patient-specific factors for a given case. Data from January 1, 2014, through December 31, 2019, were aggregated for analysis. RESULTS Between 2014 and 2019, across 8 network centers, a total of 16,601 RT plans underwent peer-review. The network-based peer-review case volume increased over the study period, from 958 cases in 2014 to 4,487 in 2019. A combined global nonconcordance rate of 4.5% was noted, with the highest nonconcordance rates among head-and-neck cases (11.0%). For centers that joined the network during the study period, we observed a significant decrease in the nonconcordance rate over time (3.1% average annual decrease in nonconcordance, P = 0.01); among centers that joined the network prior to the study period, nonconcordance rates remained stable over time. CONCLUSIONS Through a standardized QA platform, network-based multinational peer-review of RT plans can be achieved. Improved concordance rates among newly added network affiliates over time are noted, suggesting a positive impact of network membership on the quality of delivered cancer care.
Collapse
|
15
|
Petragallo R, Luximon DC, Neylon J, Bardach NS, Ritter T, Lamb JM. Clinical physicists' perceptions of weekly chart checks and the potential role for automated image review assessed by structured interviews. J Appl Clin Med Phys 2024; 25:e14313. [PMID: 38650177 PMCID: PMC11087166 DOI: 10.1002/acm2.14313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 11/21/2023] [Accepted: 01/23/2024] [Indexed: 04/25/2024] Open
Abstract
BACKGROUND This study utilizes interviews of clinical medical physicists to investigate self-reported shortcomings of the current weekly chart check workflow and opportunities for improvement. METHODS Nineteen medical physicists were recruited for a 30-minute semi-structured interview, with a particular focus placed on image review and the use of automated tools for image review in weekly checks. Survey-type questions were used to gather quantitative information about chart check practices and importance placed on reducing chart check workloads versus increasing chart check effectiveness. Open-ended questions were used to probe respondents about their current weekly chart check workflow, opinions of the value of weekly chart checks and perceived shortcomings, and barriers and facilitators to the implementation of automated chart check tools. Thematic analysis was used to develop common themes across the interviews. RESULTS Physicists ranked highly the value of reducing the time spent on weekly chart checks (average 6.3 on a scale from 1 to 10), but placed more value on increasing the effectiveness of checks with an average of 9.2 on a 1-10 scale. Four major themes were identified: (1) weekly chart checks need to adapt to an electronic record-and-verify chart environment, (2) physicists could add value to patient care by analyzing images without duplicating the work done by physicians, (3) greater support for trending analysis is needed in weekly checks, and (4) automation has the potential to increase the value of physics checks. CONCLUSION This study identified several key shortcomings of the current weekly chart check process from the perspective of the clinical medical physicist. Our results show strong support for automating components of the weekly check workflow in order to allow for more effective checks that emphasize follow-up, trending, failure modes and effects analysis, and allow time to be spent on other higher value tasks that improve patient safety.
Collapse
|
16
|
Gogineni E, Schaefer D, Ewing A, Andraos T, DiCostanzo D, Weldon M, Christ D, Baliga S, Jhawar S, Mitchell D, Grecula J, Konieczkowski DJ, Palmer J, Jahraus T, Dibs K, Chakravarti A, Martin D, Gamez ME, Blakaj D. Systematic Implementation of Effective Quality Assurance Processes for the Assessment of Radiation Target Volumes in Head and Neck Cancer. Pract Radiat Oncol 2024; 14:e205-e213. [PMID: 38237893 DOI: 10.1016/j.prro.2023.12.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/17/2023] [Accepted: 12/01/2023] [Indexed: 02/26/2024]
Abstract
PURPOSE Significant heterogeneity exists in clinical quality assurance (QA) practices within radiation oncology departments, with most chart rounds lacking prospective peer-reviewed contour evaluation. This has the potential to significantly affect patient outcomes, particularly for head and neck cancers (HNC) given the large variance in target volume delineation. With this understanding, we incorporated a prospective systematic peer contour-review process into our workflow for all patients with HNC. This study aims to assess the effectiveness of implementing prospective peer review into practice for our National Cancer Institute Designated Cancer Center and to report factors associated with contour modifications. METHODS AND MATERIALS Starting in November 2020, our department adopted a systematic QA process with real-time metrics, in which contours for all patients with HNC treated with radiation therapy were prospectively peer reviewed and graded. Contours were graded with green (unnecessary), yellow (minor), or red (major) colors based on the degree of peer-recommended modifications. Contours from November 2020 through September 2021 were included for analysis. RESULTS Three hundred sixty contours were included. Contour grades were made up of 89.7% green, 8.9% yellow, and 1.4% red grades. Physicians with >12 months of clinical experience were less likely to have contour changes requested than those with <12 months (8.3% vs 40.9%; P < .001). Contour grades were significantly associated with physician case load, with physicians presenting more than the median number of 50 cases having significantly less modifications requested than those presenting <50 (6.7% vs 13.3%; P = .013). Physicians working with a resident or fellow were less likely to have contour changes requested than those without a trainee (5.2% vs 12.6%; P = .039). Frequency of major modification requests significantly decreased over time after adoption of prospective peer contour review, with no red grades occurring >6 months after adoption. CONCLUSIONS This study highlights the importance of prospective peer contour-review implementation into systematic clinical QA processes for HNC. Physician experience proved to be the highest predictor of approved contours. A growth curve was demonstrated, with major modifications declining after prospective contour review implementation. Even within a high-volume academic practice with subspecialist attendings, >10% of patients had contour changes made as a direct result of prospective peer review.
Collapse
|
17
|
Ji M, Li Z, Tian Y, Zhang K, Li M, Chen Y. A virtual phantom for patient-specific QA On A 1.5T MR-linac. J Appl Clin Med Phys 2024; 25:e14264. [PMID: 38252813 PMCID: PMC11087164 DOI: 10.1002/acm2.14264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 12/19/2023] [Accepted: 12/22/2023] [Indexed: 01/24/2024] Open
Abstract
Create a virtual ArcCHECK-MR phantom, customized for a 1.5T MR-linac, with consideration of the different density regions within the quality assurance (QA) phantom, aiming to streamline the utilization of this specialized QA device. A virtual phantom was constructed in the treatment planning system (TPS) to replicate the ArcCHECK-MR's composition, consisting of five distinct layers: "Outer" (representing the outer PMMA ring), "Complex" (simulating the printed circuit boards), "Detectors" (encompassing the detector area), "Inner" (signifying the inner PMMA ring) and "Insert" (representing the PMMA insert). These layers were defined based on geometric data and represented as contour points on a set of dummy CT images. Additionally, a setup platform was integrated as contoured structures. To determine the relative electron density (RED) values of the external and internal PMMA components, measurements were taken at 25 points in the insert using an ion chamber. A novel method for establishing the exit/entrance dose ratio (EEDR) for ArcCHECK-MR was introduced. The RED of higher density region was derived by evaluating the local gamma index passing rate results with criteria of 2% dose difference and 2 mm distance-to-agreement. The performance of the virtual phantom was assessed for Unity 7 FFF beams with a 1.5T magnetic field. The radii of the five ring structures within the virtual phantom measured 133.0 mm, 110.0 mm, 103.4 mm, 100.0 mm, and 75.0 mm for the "Outer," "Complex," "Detectors," "Inner" and "Insert" regions, respectively. The RED values were as follows: ArcCHECK-MR PMMA had a RED of 1.130, "Detectors" were assumed to have a RED of 1.000, "Complex" had a RED of 1.200, and the setup QA phantom justified a RED of 1.350. Early validation results demonstrate that the 5-layer virtual phantom, when compared to the commonly used bulk overridden phantom, offers improved capability in MR-linac environments. This enhancement led to an increase in passing rates for the local gamma index by approximately 5 ∼ 6%, when applying the criteria of 2%, 2 mm. We have successfully generated a virtual representation of the distinct regions within the ArcCHECK-MR using a TPS, addressing the challenges associated with its use in conjunction with a 1.5T MR-linac. We consistently observed favorable local gamma index passing rates across two 1.5T MR-linac and ArcCHECK-MR unit combinations. This approach has the potential to minimize uncertainties in the creation of the QA phantom for ArcCHECK-MR across various institutions.
Collapse
|
18
|
Boywitt D, Kähler M, Bungard S, Höhle M, Rauh J. [Reliability of peer review-like dialogue in the German statutory quality assurance program]. ZEITSCHRIFT FUR EVIDENZ, FORTBILDUNG UND QUALITAT IM GESUNDHEITSWESEN 2024; 186:18-26. [PMID: 38580502 DOI: 10.1016/j.zefq.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 01/17/2024] [Accepted: 02/19/2024] [Indexed: 04/07/2024]
Abstract
BACKGROUND Quality measurement in the German statutory program for quality in health care follows a two-step process. For selected areas of health care, quality is measured via performance indicators (first step). Providers failing to achieve benchmarks in these indicators subsequently enter into a peer review process (second step) and are asked by the respective regional authority to provide a written statement regarding their indicator results. The statements are then evaluated by peers, with the goal to assess the provider's quality of care. In the past, similar peer review-based approaches to the measurement of health care quality in other countries have shown a tendency to lack reliability. So far, the reliability of this component of the German statutory program for quality in health care has not been investigated. METHOD Using logistic regression models, the influence of the respective regional authority on the peer review component of health care quality measurement in Germany was investigated using three exemplary indicators and data from 2016. RESULTS Both the probability that providers are asked to provide a statement as well as the results produced by the peer review process significantly depend on the regional authority in charge. This dependence cannot be fully explained by differences in the indicator results or by differences in case volume. CONCLUSIONS The present results are in accordance with earlier findings, which show low reliability for peer review-based approaches to quality measurement. Thus, different results produced by the peer review component of the quality measurement process may in part be due to differences in the way the review process is conducted. This heterogeneity among the regional authorities limits the reliability of this process. In order to increase reliability, the peer review process should be standardized to a higher degree, with clear review criteria, and the peers should undergo comprehensive training for the review process. Alternatively, the future peer review component could be adapted to focus rather on identification of improvement strategies than on reliable provider comparisons.
Collapse
|
19
|
Mehl C, Müller T, Nau T, Bachmann C, Geraedts M. [Development of an indicator set for the evaluation of the quality of routine ambulatory health care for common disorders in children and adolescents]. ZEITSCHRIFT FUR EVIDENZ, FORTBILDUNG UND QUALITAT IM GESUNDHEITSWESEN 2024; 186:69-76. [PMID: 38631959 DOI: 10.1016/j.zefq.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 02/28/2024] [Accepted: 03/13/2024] [Indexed: 04/19/2024]
Abstract
BACKGROUND In Germany, no consented quality indicator set (QI set) exists to date that can be used to assess the quality of pediatric care. Therefore, the aim of the project "Assessment of the quality of routine ambulatory health care for common disorders in children and adolescents" (QualiPäd) funded by the Innovation Committee of the Federal Joint Committee (grant no.: 01VSF19035) was to develop a QI set for the diseases asthma, atopic eczema, otitis media, tonsillitis, attention-deficit hyperactivity disorder (ADHD), depression and conduct disorder. METHODS For the observation period 2018/2019, quality indicators (QIs) were searched in indicator databases, guidelines and literature databases and complemented in part by newly formulated QIs (e.g., derived from guideline recommendations). The QIs were then assigned to content categories and dimensions according to Donabedian and OECD and reduced by removing duplicates. Finally, a panel of experts consulted the QIs using the modified RAND-UCLA Appropriateness Method (RAM). RESULTS The search resulted in a preliminary QI set of 2324 QIs. After the reduction steps and the evaluation of the experts, 282 QIs were included in the QI set (asthma: 72 QIs, atopic eczema: 25 QIs, otitis media: 31 QIs, tonsillitis: 12 QIs, ADHD: 53 QIs, depression: 43 QIs, conduct disorder: 46 QIs). The QIs are distributed among the following different categories: Therapy (138 QIs), Diagnostics (95 QIs), Patient-reported outcome measures/Patient-reported experience measures (PROM/PREM) (45 QIs), Practice management (31 QIs), and Health reporting (4 QIs). In the Donabedian model, 89% of the QIs capture process quality, 9% outcome quality, and 2% structural quality; according to the OECD classification, 61% measure effectiveness, 23% patient-centeredness, and 16% safety of care. CONCLUSION The consented QI set is currently being tested and can subsequently be used (possibly modified) to measure the quality of routine outpatient care for children and adolescents in Germany, in order to indicate the status quo and potential areas for improvement in outpatient care.
Collapse
|
20
|
Kim E, Park YK, Zhao T, Laugeman E, Zhao XN, Hao Y, Chung Y, Lee H. Image quality characterization of an ultra-high-speed kilovoltage cone-beam computed tomography imaging system on an O-ring linear accelerator. J Appl Clin Med Phys 2024; 25:e14337. [PMID: 38576183 PMCID: PMC11087174 DOI: 10.1002/acm2.14337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/23/2024] [Accepted: 03/06/2024] [Indexed: 04/06/2024] Open
Abstract
PURPOSE The quality of on-board imaging systems, including cone-beam computed tomography (CBCT), plays a vital role in image-guided radiation therapy (IGRT) and adaptive radiotherapy. Recently, there has been an upgrade of the CBCT systems fused in the O-ring linear accelerators called HyperSight, featuring a high imaging performance. As the characterization of a new imaging system is essential, we evaluated the image quality of the HyperSight system by comparing it with Halcyon 3.0 CBCT and providing benchmark data for routine imaging quality assurance. METHODS The HyperSight features ultra-fast scan time, a larger kilovoltage (kV) detector, a more substantial kV tube, and an advanced reconstruction algorithm. Imaging protocols in the two modes of operation, treatment mode with IGRT and the CBCT for planning (CBCTp) mode were evaluated and compared with Halcyon 3.0 CBCT. Image quality metrics, including spatial resolution, contrast resolution, uniformity, noise, computed tomography (CT) number linearity, and calibration error, were assessed using a Catphan and an electron density phantom and analyzed with TotalQA software. RESULTS HyperSight demonstrated substantial improvements in contrast-to-noise ratio and noise in both IGRT and CBCTp modes compared to Halcyon 3.0 CBCT. CT number calibration error of HyperSight CBCTp mode (1.06%) closely matches that of a full CT scanner (0.72%), making it suitable for adaptive planning. In addition, the advanced hardware of HyperSight, such as ultra-fast scan time (5.9 s) or 2.5 times larger heat unit capacity, enhanced the clinical efficiency in our experience. CONCLUSIONS HyperSight represented a significant advancement in CBCT imaging. With its image quality, CT number accuracy, and ultra-fast scans, HyperSight has a potential to transform patient care and treatment outcomes. The enhanced scan speed and image quality of HyperSight are expected to significantly improve the quality and efficiency of treatment, particularly benefiting patients.
Collapse
|
21
|
Dunn L, Tamborriello A, Subramanian B, Xu X, Ruruku TT. Assessing the sensitivity and suitability of a range of detectors for SIMT PSQA. J Appl Clin Med Phys 2024; 25:e14343. [PMID: 38569013 PMCID: PMC11087180 DOI: 10.1002/acm2.14343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/07/2024] [Accepted: 03/14/2024] [Indexed: 04/05/2024] Open
Abstract
PURPOSE Single-isocenter multi-target intracranial stereotactic radiotherapy (SIMT) is an effective treatment for brain metastases with complex treatment plans and delivery optimization necessitating rigorous quality assurance. This work aims to assess five methods for quality assurance of SIMT treatment plans in terms of their suitability and sensitivity to delivery errors. METHODS Sun Nuclear ArcCHECK and SRS MapCHECK, GafChromic EBT Radiochromic Film, machine log files, and Varian Portal Dosimetry were all used to measure 15 variations of a single SIMT plan. Variations of the original plan were created with Python. They comprised various degrees of systematic MLC offsets per leaf up to 2 mm, random per-leaf variations with differing minimum and maximum magnitudes, simulated collimator, and dose miscalibrations (MU scaling). The erroneous plans were re-imported into Eclipse and plan-quality degradation was assessed by comparing each plan variation to the original clinical plan in terms of the percentage of clinical goals passing relative to the original plan. Each erroneous plan could be then ranked by the plan-quality degradation percentage following recalculation in the TPS so that the effects of each variation could be correlated with γ pass rates and detector suitability. RESULTS & CONCLUSIONS It was found that 2%/1 mm is a good starting point for the ArcCHECK, Portal Dosimetry, and the SRS MapCHECK methods, respectively, and provides clinically relevant error detection sensitivity. Looser dose criteria of 5%/1 mm or 5%/1.5 mm are suitable for film dosimetry and log-file-based methods. The statistical methods explored can be expanded to other areas of patient-specific QA and detector assessment.
Collapse
|
22
|
Muthu S, Mudhana G. Dosimetric Systems in Pre-Treatment QA for Stereotactic Treatments: Correlation Agreements and Target Volume Dependency. Asian Pac J Cancer Prev 2024; 25:1425-1432. [PMID: 38680004 PMCID: PMC11162722 DOI: 10.31557/apjcp.2024.25.4.1425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Accepted: 04/11/2024] [Indexed: 05/01/2024] Open
Abstract
AIM This study comprehensively investigated pre-treatment quality assurance (QA) for 100 cancer patients undergoing stereotactic treatments (SRS/SRT) using various detectors. METHODS The study conducted QA for SRS/SRT treatments planned with a 6MV SRS beam at a dose rate of 1,000 MU/min, utilizing Eclipse v13.6 Treatment Planning System (TPS). Point dose measurements employed 0.01cm3 and 0.13cm3 cylindrical ionization chambers, while planar dose verification utilized Gafchromic EBT-XD Film and Portal Imager (aS1000). Plans were categorized by target volume, and a thorough analysis compared point dose agreements, planar dose gamma pass rates, and their correlations with chamber volume mean dose, detector type, and point dose agreement. Additionally, the consistency between different ionization chambers was assessed. RESULTS Point dose agreement generally improved with increasing target volume, except for volumes over 10cm3 with 0.01cm3 chambers, showing a contrary trend. Significant differences (p<0.05) were observed between TPS and measured doses for both chambers. Gamma pass rate improved with increasing target volume in EBT XD and aS1000 analyses, except for the >10cm3 group in EBT XD. EBT XD demonstrated better agreement with TPS for target volumes up to 10cm3 compared to aS1000, with a statistically significant difference (p<0.05) between the detectors. Strong correlations were found between chamber point dose and chamber volume mean dose agreement, as well as between the two gamma criteria analyses of the same detector type in the planar dose correlation analysis. However, weak correlations were discovered for other analyses. CONCLUSION This study found weak correlation between different detector types in pre-treatment QA for point dose and planar dose evaluation. However, within a specific detector type, strong correlation was observed for different point dose evaluation methods and gamma criteria. This highlights the importance of cautious interpretation of QA results, particularly for SRS QA, due to the lack of correlation between detector types.
Collapse
|
23
|
Zaman S, Vimalesvaran K, Chappell D, Varela M, Peters NS, Shiwani H, Knott KD, Davies RH, Moon JC, Bharath AA, Linton NW, Francis DP, Cole GD, Howard JP. Quality assurance of late gadolinium enhancement cardiac magnetic resonance images: a deep learning classifier for confidence in the presence or absence of abnormality with potential to prompt real-time image optimization. J Cardiovasc Magn Reson 2024; 26:101040. [PMID: 38522522 PMCID: PMC11129090 DOI: 10.1016/j.jocmr.2024.101040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 03/10/2024] [Accepted: 03/19/2024] [Indexed: 03/26/2024] Open
Abstract
BACKGROUND Late gadolinium enhancement (LGE) of the myocardium has significant diagnostic and prognostic implications, with even small areas of enhancement being important. Distinguishing between definitely normal and definitely abnormal LGE images is usually straightforward, but diagnostic uncertainty arises when reporters are not sure whether the observed LGE is genuine or not. This uncertainty might be resolved by repetition (to remove artifact) or further acquisition of intersecting images, but this must take place before the scan finishes. Real-time quality assurance by humans is a complex task requiring training and experience, so being able to identify which images have an intermediate likelihood of LGE while the scan is ongoing, without the presence of an expert is of high value. This decision-support could prompt immediate image optimization or acquisition of supplementary images to confirm or refute the presence of genuine LGE. This could reduce ambiguity in reports. METHODS Short-axis, phase-sensitive inversion recovery late gadolinium images were extracted from our clinical cardiac magnetic resonance (CMR) database and shuffled. Two, independent, blinded experts scored each individual slice for "LGE likelihood" on a visual analog scale, from 0 (absolute certainty of no LGE) to 100 (absolute certainty of LGE), with 50 representing clinical equipoise. The scored images were split into two classes-either "high certainty" of whether LGE was present or not, or "low certainty." The dataset was split into training, validation, and test sets (70:15:15). A deep learning binary classifier based on the EfficientNetV2 convolutional neural network architecture was trained to distinguish between these categories. Classifier performance on the test set was evaluated by calculating the accuracy, precision, recall, F1-score, and area under the receiver operating characteristics curve (ROC AUC). Performance was also evaluated on an external test set of images from a different center. RESULTS One thousand six hundred and forty-five images (from 272 patients) were labeled and split at the patient level into training (1151 images), validation (247 images), and test (247 images) sets for the deep learning binary classifier. Of these, 1208 images were "high certainty" (255 for LGE, 953 for no LGE), and 437 were "low certainty". An external test comprising 247 images from 41 patients from another center was also employed. After 100 epochs, the performance on the internal test set was accuracy = 0.94, recall = 0.80, precision = 0.97, F1-score = 0.87, and ROC AUC = 0.94. The classifier also performed robustly on the external test set (accuracy = 0.91, recall = 0.73, precision = 0.93, F1-score = 0.82, and ROC AUC = 0.91). These results were benchmarked against a reference inter-expert accuracy of 0.86. CONCLUSION Deep learning shows potential to automate quality control of late gadolinium imaging in CMR. The ability to identify short-axis images with intermediate LGE likelihood in real-time may serve as a useful decision-support tool. This approach has the potential to guide immediate further imaging while the patient is still in the scanner, thereby reducing the frequency of recalls and inconclusive reports due to diagnostic indecision.
Collapse
|
24
|
Shah NH, Halamka JD, Saria S, Pencina M, Tazbaz T, Tripathi M, Callahan A, Hildahl H, Anderson B. A Nationwide Network of Health AI Assurance Laboratories. JAMA 2024; 331:245-249. [PMID: 38117493 DOI: 10.1001/jama.2023.26930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Importance Given the importance of rigorous development and evaluation standards needed of artificial intelligence (AI) models used in health care, nationwide accepted procedures to provide assurance that the use of AI is fair, appropriate, valid, effective, and safe are urgently needed. Observations While there are several efforts to develop standards and best practices to evaluate AI, there is a gap between having such guidance and the application of such guidance to both existing and new AI models being developed. As of now, there is no publicly available, nationwide mechanism that enables objective evaluation and ongoing assessment of the consequences of using health AI models in clinical care settings. Conclusion and Relevance The need to create a public-private partnership to support a nationwide health AI assurance labs network is outlined here. In this network, community best practices could be applied for testing health AI models to produce reports on their performance that can be widely shared for managing the lifecycle of AI models over time and across populations and sites where these models are deployed.
Collapse
|
25
|
Huang Y, Pi Y, Ma K, Miao X, Fu S, Feng A, Duan Y, Kong Q, Zhuo W, Xu Z. Predicting the error magnitude in patient-specific QA during radiotherapy based on ResNet. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2024; 32:797-807. [PMID: 38457139 DOI: 10.3233/xst-230251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
BACKGROUND The error magnitude is closely related to patient-specific dosimetry and plays an important role in evaluating the delivery of the radiotherapy plan in QA. No previous study has investigated the feasibility of deep learning to predict error magnitude. OBJECTIVE The purpose of this study was to predict the error magnitude of different delivery error types in radiotherapy based on ResNet. METHODS A total of 34 chest cancer plans (172 fields) of intensity-modulated radiation therapy (IMRT) from Eclipse were selected, of which 30 plans (151 fields) were used for model training and validation, and 4 plans including 21 fields were used for external testing. The collimator misalignment (COLL), monitor unit variation (MU), random multi-leaf collimator shift (MLCR), and systematic MLC shift (MLCS) were introduced. These dose distributions of portal dose predictions for the original plans were defined as the reference dose distribution (RDD), while those for the error-introduced plans were defined as the error-introduced dose distribution (EDD). Different inputs were used in the ResNet for predicting the error magnitude. RESULTS In the test set, the accuracy of error type prediction based on the dose difference, gamma distribution, and RDD + EDD was 98.36%, 98.91%, and 100%, respectively; the root mean squared error (RMSE) was 1.45-1.54, 0.58-0.90, 0.32-0.36, and 0.15-0.24; the mean absolute error (MAE) was 1.06-1.18, 0.32-0.78, 0.25-0.27, and 0.11-0.18, respectively, for COLL, MU, MLCR and MLCS. CONCLUSIONS In this study, error magnitude prediction models with dose difference, gamma distribution, and RDD + EDD are established based on ResNet. The accurate prediction of the error magnitude under different error types can provide reference for error analysis in patient-specific QA.
Collapse
|