1
|
Hein D, Holmin S, Szczykutowicz T, Maltz JS, Danielsson M, Wang G, Persson M. Noise suppression in photon-counting computed tomography using unsupervised Poisson flow generative models. Vis Comput Ind Biomed Art 2024; 7:24. [PMID: 39311990 PMCID: PMC11420411 DOI: 10.1186/s42492-024-00175-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 09/01/2024] [Indexed: 09/26/2024] Open
Abstract
Deep learning (DL) has proven to be important for computed tomography (CT) image denoising. However, such models are usually trained under supervision, requiring paired data that may be difficult to obtain in practice. Diffusion models offer unsupervised means of solving a wide range of inverse problems via posterior sampling. In particular, using the estimated unconditional score function of the prior distribution, obtained via unsupervised learning, one can sample from the desired posterior via hijacking and regularization. However, due to the iterative solvers used, the number of function evaluations (NFE) required may be orders of magnitudes larger than for single-step samplers. In this paper, we present a novel image denoising technique for photon-counting CT by extending the unsupervised approach to inverse problem solving to the case of Poisson flow generative models (PFGM)++. By hijacking and regularizing the sampling process we obtain a single-step sampler, that is NFE = 1. Our proposed method incorporates posterior sampling using diffusion models as a special case. We demonstrate that the added robustness afforded by the PFGM++ framework yields significant performance gains. Our results indicate competitive performance compared to popular supervised, including state-of-the-art diffusion-style models with NFE = 1 (consistency models), unsupervised, and non-DL-based image denoising techniques, on clinical low-dose CT data and clinical images from a prototype photon-counting CT system developed by GE HealthCare.
Collapse
Affiliation(s)
- Dennis Hein
- Department of Physics, KTH Royal Institute of Technology, Stockholm, 1142, Sweden.
- MedTechLabs, Karolinska University Hospital, Stockholm, 17164, Sweden.
| | - Staffan Holmin
- Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, 17164, Sweden
- Department of Neuroradiology, Karolinska University Hospital, Stockholm, 17164, Sweden
| | - Timothy Szczykutowicz
- Department of Radiology, School of Medicine and Public Health, University of Wisconsin, Madison, WI, 53705, United States
| | | | - Mats Danielsson
- Department of Physics, KTH Royal Institute of Technology, Stockholm, 1142, Sweden
- MedTechLabs, Karolinska University Hospital, Stockholm, 17164, Sweden
| | - Ge Wang
- Department of Biomedical Engineering, School of Engineering, Biomedical Imaging Center, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY, 12180, United States
| | - Mats Persson
- Department of Physics, KTH Royal Institute of Technology, Stockholm, 1142, Sweden
- MedTechLabs, Karolinska University Hospital, Stockholm, 17164, Sweden
| |
Collapse
|
2
|
Hsieh SS, Rajbhandary PL. Existence, uniqueness, and efficiency of numerically unbiased attenuation pathlength estimators for photon counting detectors at low count rates. Med Phys 2024. [PMID: 39287477 DOI: 10.1002/mp.17406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 08/21/2024] [Accepted: 08/26/2024] [Indexed: 09/19/2024] Open
Abstract
BACKGROUND The first step in computed tomography (CT) reconstruction is to estimate attenuation pathlength. Usually, this is done with a logarithm transformation, which is the direct solution to the Beer-Lambert Law. At low signals, however, the logarithm estimator is biased. Bias arises both from the curvature of the logarithm and from the possibility of detecting zero counts, so a data substitution strategy may be employed to avoid the singularity of the logarithm. Recent progress has been made by Li et al. [IEEE Trans Med Img 42:6, 2023] to modify the logarithm estimator to eliminate curvature bias, but the optimal strategy for mitigating bias from the singularity remains unknown. PURPOSE The purpose of this study was to use numerical techniques to construct unbiased attenuation pathlength estimators that are alternatives to the logarithm estimator, and to study the uniqueness and optimality of possible solutions, assuming a photon counting detector. METHODS Formally, an attenuation pathlength estimator is a mapping from integer detector counts to real pathlength values. We constrain our focus to only the small signal inputs that are problematic for the logarithm estimator, which we define as inputs of <100 counts, and we consider estimators that use only a single input and that are not informed by adjacent measurements (e.g., adaptive smoothing). The set of all possible pathlength estimators can then be represented as points in a 100-dimensional vector space. Within this vector space, we use optimization to select the estimator that (1) minimizes mean squared error and (2) is unbiased. We define "unbiased" as satisfying the numerical condition that the maximum bias be less than 0.001 across a continuum of 1000 object thicknesses that span the desired operating range. Because the objective function is convex and the constraints are affine, optimization is tractable and guaranteed to converge to the global minimum. We further examine the nullspace of the constraint matrix to understand the uniqueness of possible solutions, and we compare the results to the Cramér-Rao bound of the variance. RESULTS We first show that an unbiased attenuation pathlength estimator does not exist if very low mean detector signals (equivalently, very thick objects) are permitted. It is necessary to select a minimum mean detector signal for which unbiased behavior is desired. If we select two counts, the optimal estimator is similar to Li's estimator. If we select one count, the optimal estimator becomes non-monotonic. The oscillations cause the unbiased estimator to be noise amplifying. The nullspace of the constraint matrix is high-dimensional, so that unbiased solutions are not unique. The Cramér-Rao bound of the variance matches well with the expectedI - 0.5 ${{I}^{ - 0.5}}$ scaling law and cannot be attained. CONCLUSION If arbitrarily thick objects are permitted, an unbiased attenuation pathlength estimator does not exist. If the maximum thickness is restricted, an unbiased estimator exists but is not unique. An optimal estimator can be selected that minimizes variance, but a bias-variance tradeoff exists where a larger domain of unbiased behavior requires increased variance.
Collapse
Affiliation(s)
- Scott S Hsieh
- Department of Radiology, Mayo Clinic, Rochester, Minnesota, USA
| | | |
Collapse
|
3
|
Zhou Z, Gong H, Hsieh S, McCollough CH, Yu L. Image quality evaluation in deep-learning-based CT noise reduction using virtual imaging trial methods: Contrast-dependent spatial resolution. Med Phys 2024; 51:5399-5413. [PMID: 38555876 PMCID: PMC11321944 DOI: 10.1002/mp.17029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 02/19/2024] [Accepted: 02/26/2024] [Indexed: 04/02/2024] Open
Abstract
BACKGROUND Deep-learning-based image reconstruction and noise reduction methods (DLIR) have been increasingly deployed in clinical CT. Accurate image quality assessment of these methods is challenging as the performance measured using physical phantoms may not represent the true performance of DLIR in patients since DLIR is trained mostly on patient images. PURPOSE In this work, we aim to develop a patient-data-based virtual imaging trial framework and, as a first application, use it to measure the spatial resolution properties of a DLIR method. METHODS The patient-data-based virtual imaging trial framework consists of five steps: (1) insertion of lesions into projection domain data using the acquisition geometry of the patient exam to simulate different lesion characteristics; (2) insertion of noise into projection domain data using a realistic photon statistical model of the CT system to simulate different dose levels; (3) creation of DLIR-processed images from projection or image data; (4) creation of ensembles of DLIR-processed patient images from a large number of noise and lesion realizations; and (5) evaluation of image quality using ensemble DLIR images. This framework was applied to measure the spatial resolution of a ResNet based deep convolutional neural network (DCNN) trained on patient images. Lesions in a cylindrical shape and different contrast levels (-500, -100, -50, -20, -10 HU) were inserted to the lower right lobe of the liver in a patient case. Multiple dose levels were simulated (50%, 25%, 12.5%). Each lesion and dose condition had 600 noise realizations. Multiple reconstruction and denoising methods were used on all the noise realizations, including the original filtered-backprojection (FBP), iterative reconstruction (IR), and the DCNN method with three different strength setting (DCNN-weak, DCNN-medium, and DCNN-strong). Mean lesion signal was calculated by performing ensemble averaging of all the noise realizations for each lesion and dose condition and then subtracting the lesion-present images from the lesion absent images. Modulation transfer functions (MTFs) both in-plane and along the z-axis were calculated based on the mean lesion signals. The standard deviations of MTFs at each condition were estimated with bootstrapping: randomly sampling (with replacement) all the DLIR/FBP/IR images from the ensemble data (600 samples) at each condition. The impact of varying lesion contrast, dose levels, and denoising strengths were evaluated. Statistical analysis with paired t-test was used to compare the z-axis and in-plane spatial resolution of five algorithms for five different contrasts and three dose levels. RESULTS The in-plane and z-axis spatial resolution degradation of DCNN becomes more severe as the contrast or radiation dose decreased, or DCNN denoising strength increased. In comparison with FBP, a 59.5% and 4.1% reduction of in-plane and z-axis MTF (in terms of spatial frequencies at 50% MTF), respectively, was observed at low contrast (-10 HU) for DCNN with the highest denoising strength at 25% routine dose level. When the dose level reduces from 50% to 12.5% of routine dose, the in-plane and z-axis MTFs reduces from 92.1% to 76.3%, and from 98.9% to 95.5%, respectively, at contrast of -100 HU, using FBP as the reference. For most conditions of contrasts and dose levels, significant differences were found among the five algorithms, with the following relationship in both in-plane and cross-plane spatial resolution: FBP > DCNN-Weak > IR > DCNN-Medium > DCNN-Strong. The spatial resolution difference among algorithms decreases at higher contrast or dose levels. CONCLUSIONS A patient-data-based virtual imaging trial framework was developed and applied to measuring the spatial resolution properties of a DCNN noise reduction method at different contrast and dose levels using real patient data. As with other non-linear image reconstruction and post-processing techniques, the evaluated DCNN method degraded the in-plane and z-axis spatial resolution at lower contrast levels, lower radiation dose, and higher denoising strength.
Collapse
Affiliation(s)
| | - Hao Gong
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| | - Scott Hsieh
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| | | | - Lifeng Yu
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| |
Collapse
|
4
|
Yang CC, Lin KW. Improving the detection of hypo-vascular liver metastases in multiphase contrast-enhanced CT with slice thickness less than 5 mm using DenseNet. Radiography (Lond) 2024; 30:759-769. [PMID: 38458104 DOI: 10.1016/j.radi.2024.02.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 02/17/2024] [Accepted: 02/27/2024] [Indexed: 03/10/2024]
Abstract
INTRODUCTION Thinner slices are more susceptible in detecting small lesions but suffer from higher statistical fluctuation. This work aimed to reduce image noise in multiphase contrast-enhanced CT reconstructed with slice thickness thinner than the clinical setting (i.e., 5 mm) using convolutional neural network (CNN) for enabling better detection of hypo-vascular liver metastasis. METHODS A DenseNet model was used to generate noise map for multiphase CT reconstructed with slice thickness of 2.5 mm and 1.25 mm. Image denoising was conducted by subtracting the CNN-generated noise map from CT images with reduced photon flux due to thinner slice thickness. The performance of DenseNet was evaluated on CT scans of electron density phantoms and patients with hypovascular liver metastases less than 1.5 cm in terms of Hounsfield Unit (HU) variation, statistical fluctuation, and contrast-to-noise ratio (CNR). RESULTS The phantom study demonstrated that the CNN-based denoising method was able to reduce statistical fluctuation in CT images reconstructed with slice thickness of 2.5 mm and 1.25 mm without causing significant edge blurring or variation in HU values. With regards to patient study, it was found that the denoised 2.5-mm and 1.25-mm slices had higher CNR than the conventional 5-mm slices for hypo-vascular liver metastases in all 4 phases of multiphase CT. CONCLUSION Our results demonstrated that the detection of hypo-vascular liver metastases in multiphase contrast-enhanced CT with slice thickness less than 5 mm could be improved by using the CNN-based denoising method. IMPLICATIONS FOR PRACTICE Reconstruction slice thickness has a strong influence on the image quality of CT imaging. A CNN-based denoising method was used in this work to reduce the image noise in multiphase contrast-enhanced CT reconstructed with slice thickness less than 5 mm.
Collapse
Affiliation(s)
- C-C Yang
- Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan; Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan.
| | - K-W Lin
- Department of Radiology, E-Da Dachang Hospital, Kaohsiung, Taiwan
| |
Collapse
|
5
|
Nelson BJ, Gomez-Cardona DG, Thorne JE, Huber NR, Yu L, Leng S, McCollough CH, Missert AD. Multiple Kernel Synthesis of Head CT Using a Task-Based Loss Function. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:864-872. [PMID: 38343252 DOI: 10.1007/s10278-023-00959-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 10/30/2023] [Accepted: 11/02/2023] [Indexed: 04/20/2024]
Abstract
In CT imaging of the head, multiple image series are routinely reconstructed with different kernels and slice thicknesses. Reviewing the redundant information is an inefficient process for radiologists. We address this issue with a convolutional neural network (CNN)-based technique, synthesiZed Improved Resolution and Concurrent nOise reductioN (ZIRCON), that creates a single, thin, low-noise series that combines the favorable features from smooth and sharp head kernels. ZIRCON uses a CNN model with an autoencoder U-Net architecture that accepts two input channels (smooth- and sharp-kernel CT images) and combines their salient features to produce a single CT image. Image quality requirements are built into a task-based loss function with a smooth and sharp loss terms specific to anatomical regions. The model is trained using supervised learning with paired routine-dose clinical non-contrast head CT images as training targets and simulated low-dose (25%) images as training inputs. One hundred unique de-identified clinical exams were used for training, ten for validation, and ten for testing. Visual comparisons and contrast measurements of ZIRCON revealed that thinner slices and the smooth-kernel loss function improved gray-white matter contrast. Combined with lower noise, this increased visibility of small soft-tissue features that would be otherwise impaired by partial volume averaging or noise. Line profile analysis showed that ZIRCON images largely retained sharpness compared to the sharp-kernel input images. ZIRCON combined desirable image quality properties of both smooth and sharp input kernels into a single, thin, low-noise series suitable for both brain and skull imaging.
Collapse
Affiliation(s)
- Brandon J Nelson
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
| | - Daniel G Gomez-Cardona
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
- Department of Imaging, Gundersen Health System, La Crosse, WI, USA
| | - Jamison E Thorne
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
| | - Nathan R Huber
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
| | - Lifeng Yu
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
| | - Shuai Leng
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
| | - Cynthia H McCollough
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA
| | - Andrew D Missert
- Department of Radiology, Mayo Clinic, 200 First Street SW, 55905, Rochester, MN, USA.
| |
Collapse
|
6
|
Zhou Z, Hsieh SS, Gong H, McCollough CH, Yu L. Evaluation of data uncertainty for deep-learning-based CT noise reduction using ensemble patient data and a virtual imaging trial framework. PROCEEDINGS OF SPIE--THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING 2024; 12925:1292508. [PMID: 38605999 PMCID: PMC11008675 DOI: 10.1117/12.3008581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
Deep learning-based image reconstruction and noise reduction (DLIR) methods have been increasingly deployed in clinical CT. Accurate assessment of their data uncertainty properties is essential to understand the stability of DLIR in response to noise. In this work, we aim to evaluate the data uncertainty of a DLIR method using real patient data and a virtual imaging trial framework and compare it with filtered-backprojection (FBP) and iterative reconstruction (IR). The ensemble of noise realizations was generated by using a realistic projection domain noise insertion technique. The impact of varying dose levels and denoising strengths were investigated for a ResNet-based deep convolutional neural network (DCNN) model trained using patient images. On the uncertainty maps, DCNN shows more detailed structures than IR although its bias map has less structural dependency, which implies that DCNN is more sensitive to small changes in the input. Both visual examples and histogram analysis demonstrated that hotspots of uncertainty in DCNN may be associated with a higher chance of distortion from the truth than IR, but it may also correspond to a better detection performance for some of the small structures.
Collapse
Affiliation(s)
- Zhongxing Zhou
- Department of Radiology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Scott S. Hsieh
- Department of Radiology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Hao Gong
- Department of Radiology, Mayo Clinic, Rochester, MN, 55905, USA
| | | | - Lifeng Yu
- Department of Radiology, Mayo Clinic, Rochester, MN, 55905, USA
| |
Collapse
|
7
|
Nelson BJ, Kc P, Badal A, Jiang L, Masters SC, Zeng R. Pediatric evaluations for deep learning CT denoising. Med Phys 2024; 51:978-990. [PMID: 38127330 DOI: 10.1002/mp.16901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 11/13/2023] [Accepted: 12/04/2023] [Indexed: 12/23/2023] Open
Abstract
BACKGROUND Deep learning (DL) CT denoising models have the potential to improve image quality for lower radiation dose exams. These models are generally trained with large quantities of adult patient image data. However, CT, and increasingly DL denoising methods, are used in both adult and pediatric populations. Pediatric body habitus and size can differ significantly from adults and vary dramatically from newborns to adolescents. Ensuring that pediatric subgroups of different body sizes are not disadvantaged by DL methods requires evaluations capable of assessing performance in each subgroup. PURPOSE To assess DL CT denoising in pediatric and adult-sized patients, we built a framework of computer simulated image quality (IQ) control phantoms and evaluation methodology. METHODS The computer simulated IQ phantoms in the framework featured pediatric-sized versions of standard CatPhan 600 and MITA-LCD phantoms with a range of diameters matching the mean effective diameters of pediatric patients ranging from newborns to 18 years old. These phantoms were used in simulating CT images that were then inputs for a DL denoiser to evaluate performance in different sized patients. Adult CT test images were simulated using standard-sized phantoms scanned with adult scan protocols. Pediatric CT test images were simulated with pediatric-sized phantoms and adjusted pediatric protocols. The framework's evaluation methodology consisted of denoising both adult and pediatric test images then assessing changes in image quality, including noise, image sharpness, CT number accuracy, and low contrast detectability. To demonstrate the use of the framework, a REDCNN denoising model trained on adult patient images was evaluated. To validate that the DL model performance measured with the proposed pediatric IQ phantoms was representative of performance in more realistic patient anatomy, anthropomorphic pediatric XCAT phantoms of the same age range were also used to compare noise reduction performance. RESULTS Using the proposed pediatric-sized IQ phantom framework, size differences between adult and pediatric-sized phantoms were observed to substantially influence the adult trained DL denoising model's performance. When applied to adult images, the DL model achieved a 60% reduction in noise standard deviation without substantial loss in sharpness in mid or high spatial frequencies. However, in smaller phantoms the denoising performance dropped due to different image noise textures resulting from the smaller field of view (FOV) between adult and pediatric protocols. In the validation study, noise reduction trends in the pediatric-sized IQ phantoms were found to be consistent with those found in anthropomorphic phantoms. CONCLUSION We developed a framework of using pediatric-sized IQ phantoms for pediatric subgroup evaluation of DL denoising models. Using the framework, we found the performance of an adult trained DL denoiser did not generalize well in the smaller diameter phantoms corresponding to younger pediatric patient sizes. Our work suggests noise texture differences from FOV changes between adult and pediatric protocols can contribute to poor generalizability in DL denoising and that the proposed framework is an effective means to identify these performance disparities for a given model.
Collapse
Affiliation(s)
- Brandon J Nelson
- Center for Devices and Radiological Health, Office of Science and Engineering Labs, Division of Imaging, Diagnostics, and Software Reliability, U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Prabhat Kc
- Center for Devices and Radiological Health, Office of Science and Engineering Labs, Division of Imaging, Diagnostics, and Software Reliability, U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Andreu Badal
- Center for Devices and Radiological Health, Office of Science and Engineering Labs, Division of Imaging, Diagnostics, and Software Reliability, U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Lu Jiang
- Center for Devices and Radiological Health, Office of Product Evaluation and Quality, Office of Radiological Health, U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Shane C Masters
- Center for Drug Evaluation and Research, Office of Specialty Medicine, Division of Imaging and Radiation Medicine, U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| | - Rongping Zeng
- Center for Devices and Radiological Health, Office of Science and Engineering Labs, Division of Imaging, Diagnostics, and Software Reliability, U.S. Food and Drug Administration, Silver Spring, Maryland, USA
| |
Collapse
|
8
|
Sadia RT, Chen J, Zhang J. CT image denoising methods for image quality improvement and radiation dose reduction. J Appl Clin Med Phys 2024; 25:e14270. [PMID: 38240466 PMCID: PMC10860577 DOI: 10.1002/acm2.14270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 12/15/2023] [Accepted: 12/28/2023] [Indexed: 02/13/2024] Open
Abstract
With the ever-increasing use of computed tomography (CT), concerns about its radiation dose have become a significant public issue. To address the need for radiation dose reduction, CT denoising methods have been widely investigated and applied in low-dose CT images. Numerous noise reduction algorithms have emerged, such as iterative reconstruction and most recently, deep learning (DL)-based approaches. Given the rapid advancements in Artificial Intelligence techniques, we recognize the need for a comprehensive review that emphasizes the most recently developed methods. Hence, we have performed a thorough analysis of existing literature to provide such a review. Beyond directly comparing the performance, we focus on pivotal aspects, including model training, validation, testing, generalizability, vulnerability, and evaluation methods. This review is expected to raise awareness of the various facets involved in CT image denoising and the specific challenges in developing DL-based models.
Collapse
Affiliation(s)
- Rabeya Tus Sadia
- Department of Computer ScienceUniversity of KentuckyLexingtonKentuckyUSA
| | - Jin Chen
- Department of Medicine‐NephrologyUniversity of Alabama at BirminghamBirminghamAlabamaUSA
| | - Jie Zhang
- Department of RadiologyUniversity of KentuckyLexingtonKentuckyUSA
| |
Collapse
|
9
|
Chang S, Huber NR, Marsh JF, Koons EK, Gong H, Yu L, McCollough CH, Leng S. Pie-Net: Prior-information-enabled deep learning noise reduction for coronary CT angiography acquired with a photon counting detector CT. Med Phys 2023; 50:6283-6295. [PMID: 37042049 PMCID: PMC10564970 DOI: 10.1002/mp.16411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 03/10/2023] [Accepted: 03/29/2023] [Indexed: 04/13/2023] Open
Abstract
BACKGROUND Photon-counting-detector CT (PCD-CT) enables the production of virtual monoenergetic images (VMIs) at a high spatial resolution (HR) via simultaneous acquisition of multi-energy data. However, noise levels in these HR VMIs are markedly increased. PURPOSE To develop a deep learning technique that utilizes a lower noise VMI as prior information to reduce image noise in HR, PCD-CT coronary CT angiography (CTA). METHODS Coronary CTA exams of 10 patients were acquired using PCD-CT (NAEOTOM Alpha, Siemens Healthineers). A prior-information-enabled neural network (Pie-Net) was developed, treating one lower-noise VMI (e.g., 70 keV) as a prior input and one noisy VMI (e.g., 50 keV or 100 keV) as another. For data preprocessing, noisy VMIs were reconstructed by filtered back-projection (FBP) and iterative reconstruction (IR), which were then subtracted to generate "noise-only" images. Spatial decoupling was applied to the noise-only images to mitigate overfitting and improve randomization. Thicker slice averaging was used for the IR and prior images. The final training inputs for the convolutional neural network (CNN) inside the Pie-Net consisted of thicker-slice signal images with the reinsertion of spatially decoupled noise-only images and the thicker-slice prior images. The CNN training labels consisted of the corresponding thicker-slice label images without noise insertion. Pie-Net's performance was evaluated in terms of image noise, spatial detail preservation, and quantitative accuracy, and compared to a U-net-based method that did not include prior information. RESULTS Pie-Net provided strong noise reduction, by 95 ± 1% relative to FBP and by 60 ± 8% relative to IR. For HR VMIs at different keV (e.g., 50 keV or 100 keV), Pie-Net maintained spatial and spectral fidelity. The inclusion of prior information from the PCD-CT data in the spectral domain was able to improve a robust deep learning-based denoising performance compared to the U-net-based method, which caused some loss of spatial detail and introduced some artifacts. CONCLUSION The proposed Pie-Net achieved substantial noise reduction while preserving HR VMI's spatial and spectral properties.
Collapse
Affiliation(s)
- Shaojie Chang
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| | | | | | | | - Hao Gong
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| | - Lifeng Yu
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| | | | - Shuai Leng
- Department of Radiology, Mayo Clinic, Rochester, MN, US
| |
Collapse
|
10
|
Missert AD, Hsieh SS, Ferrero A, McCollough CH. Supervised Learning for CT Denoising and Deconvolution Without High-Resolution Reference Images. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.08.31.23294861. [PMID: 37693583 PMCID: PMC10491378 DOI: 10.1101/2023.08.31.23294861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Purpose Convolutional neural networks (CNNs) have been proposed for super-resolution in CT, but training of CNNs requires high-resolution reference data. Higher spatial resolution can also be achieved using deconvolution, but conventional deconvolution approaches amplify noise. We develop a CNN that mitigates increasing noise and that does not require higher-resolution reference images. Methods Our model includes a noise reduction CNN and a deconvolution CNN that are separately trained. The noise reduction CNN is a U-Net, similar to other noise reduction CNNs found in the literature. The deconvolution CNN uses an autoencoder, where the decoder is fixed and provided as a hyperparameter that represents the system point spread function. The encoder is trained to provide a deconvolution that does not amplify noise. Ringing can occur from deconvolution but is controlled with a difference of gradients loss function term. Our technique was demonstrated on a variety of patient images and on ex vivo kidney stones. Results The noise reduction and deconvolution CNNs produced visually sharper images at low noise. In ex vivo mixed kidney stones, better visual delineation of the kidney stone components could be seen. Conclusions A noise reduction and deconvolution CNN improves spatial resolution and reduces noise without requiring higher-resolution reference images.
Collapse
|
11
|
Zhou Z, Inoue A, McCollough CH, Yu L. Self-trained deep convolutional neural network for noise reduction in CT. J Med Imaging (Bellingham) 2023; 10:044008. [PMID: 37636895 PMCID: PMC10449263 DOI: 10.1117/1.jmi.10.4.044008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2023] [Revised: 08/04/2023] [Accepted: 08/08/2023] [Indexed: 08/29/2023] Open
Abstract
Purpose Supervised deep convolutional neural network (CNN)-based methods have been actively used in clinical CT to reduce image noise. The networks of these methods are typically trained using paired high- and low-quality data from a massive number of patients and/or phantom images. This training process is tedious, and the network trained under a given condition may not be generalizable to patient images acquired and reconstructed under different conditions. We propose a self-trained deep CNN (ST_CNN) method for noise reduction in CT that does not rely on pre-existing training datasets. Approach The ST_CNN training was accomplished using extensive data augmentation in the projection domain, and the inference was applied to the data itself. Specifically, multiple independent noise insertions were applied to the original patient projection data to generate multiple realizations of low-quality projection data. Then, rotation augmentation was adopted for both the original and low-quality projection data by applying the rotation angle directly on the projection data so that images were rotated at arbitrary angles without introducing additional bias. A large number of paired low- and high-quality images from the same patient were reconstructed and paired for training the ST_CNN model. Results No significant difference was found between the ST_CNN and conventional CNN models in terms of the peak signal-to-noise ratio and structural similarity index measure. The ST_CNN model outperformed the conventional CNN model in terms of noise texture and homogeneity in liver parenchyma as well as better subjective visualization of liver lesions. The ST_CNN may sacrifice the sharpness of vessels slightly compared to the conventional CNN model but without affecting the visibility of peripheral vessels or diagnosis of vascular pathology. Conclusions The proposed ST_CNN method trained from the data itself may achieve similar image quality in comparison with conventional deep CNN denoising methods pre-trained on external datasets.
Collapse
Affiliation(s)
- Zhongxing Zhou
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | - Akitoshi Inoue
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | | | - Lifeng Yu
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| |
Collapse
|
12
|
Huber NR, Kim J, Leng S, McCollough CH, Yu L. Deep Learning-Based Image Noise Quantification Framework for Computed Tomography. J Comput Assist Tomogr 2023; 47:603-607. [PMID: 37380148 PMCID: PMC10363183 DOI: 10.1097/rct.0000000000001469] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2023]
Abstract
OBJECTIVE Noise quantification is fundamental to computed tomography (CT) image quality assessment and protocol optimization. This study proposes a deep learning-based framework, Single-scan Image Local Variance EstimatoR (SILVER), for estimating the local noise level within each region of a CT image. The local noise level will be referred to as a pixel-wise noise map. METHODS The SILVER architecture resembled a U-Net convolutional neural network with mean-square-error loss. To generate training data, 100 replicate scans were acquired of 3 anthropomorphic phantoms (chest, head, and pelvis) using a sequential scan mode; 120,000 phantom images were allocated into training, validation, and testing data sets. Pixel-wise noise maps were calculated for the phantom data by taking the per-pixel SD from the 100 replicate scans. For training, the convolutional neural network inputs consisted of phantom CT image patches, and the training targets consisted of the corresponding calculated pixel-wise noise maps. Following training, SILVER noise maps were evaluated using phantom and patient images. For evaluation on patient images, SILVER noise maps were compared with manual noise measurements at the heart, aorta, liver, spleen, and fat. RESULTS When tested on phantom images, the SILVER noise map prediction closely matched the calculated noise map target (root mean square error <8 Hounsfield units). Within 10 patient examinations, SILVER noise map had an average percent error of 5% relative to manual region-of-interest measurements. CONCLUSION The SILVER framework enabled accurate pixel-wise noise level estimation directly from patient images. This method is widely accessible because it operates in the image domain and requires only phantom data for training.
Collapse
Affiliation(s)
- Nathan R. Huber
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | - Jiwoo Kim
- Columbia University, New York, NY, 10027, USA
| | - Shuai Leng
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | | | - Lifeng Yu
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| |
Collapse
|
13
|
Dunning CAS, Marsh J, Winfree T, Rajendran K, Leng S, Levin DL, Johnson TF, Fletcher JG, McCollough CH, Yu L. Accuracy of Nodule Volume and Airway Wall Thickness Measurement Using Low-Dose Chest CT on a Photon-Counting Detector CT Scanner. Invest Radiol 2023; 58:283-292. [PMID: 36525385 PMCID: PMC10023282 DOI: 10.1097/rli.0000000000000933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
OBJECTIVES A comparison of high-resolution photon-counting detector computed tomography (PCD-CT) versus energy-integrating detector (EID) CT via a phantom study using low-dose chest CT to evaluate nodule volume and airway wall thickness quantification. MATERIALS AND METHODS Twelve solid and ground-glass lung nodule phantoms with 3 diameters (5 mm, 8 mm, and 10 mm) and 2 shapes (spherical and star-shaped) and 12 airway tube phantoms (wall thicknesses, 0.27-1.54 mm) were placed in an anthropomorphic chest phantom. The phantom was scanned with EID-CT and PCD-CT at 5 dose levels (CTDI vol = 0.1-0.8 mGy at Sn-100 kV, 7.35 mGy at 120 kV). All images were iteratively reconstructed using matched kernels for EID-CT and medium-sharp kernel (MK) PCD-CT and an ultra-sharp kernel (USK) PCD-CT kernel, and image noise at each dose level was quantified. Nodule volumes were measured using semiautomated segmentation software, and the accuracy was expressed as the percentage error between segmented and reference volumes. Airway wall thicknesses were measured, and the root-mean-square error across all tubes was evaluated. RESULTS MK PCD-CT images had the lowest noise. At 0.1 mGy, the mean volume accuracy for the solid and ground-glass nodules was improved in USK PCD-CT (3.1% and 3.3% error) compared with MK PCD-CT (9.9% and 10.2% error) and EID-CT images (11.4% and 9.2% error), respectively. At 0.2 mGy and 0.8 mGy, the wall thickness root-mean-square error values were 0.42 mm and 0.41 mm for EID-CT, 0.54 mm and 0.49 mm for MK PCD-CT, and 0.23 mm and 0.16 mm for USK PCD-CT. CONCLUSIONS USK PCD-CT provided more accurate lung nodule volume and airway wall thickness quantification at lower radiation dose compared with MK PCD-CT and EID-CT.
Collapse
Affiliation(s)
- Chelsea A. S. Dunning
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Jeffrey Marsh
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Timothy Winfree
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Kishore Rajendran
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Shuai Leng
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - David L. Levin
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Tucker F. Johnson
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Joel G. Fletcher
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Cynthia H. McCollough
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| | - Lifeng Yu
- Department of Radiology, Mayo Clinic, 200 First St SW Rochester, MN, United States 55905
| |
Collapse
|
14
|
Huber NR, Missert AD, Gong H, Leng S, Yu L, McCollough CH. Technical note: Phantom-based training framework for convolutional neural network CT noise reduction. Med Phys 2023; 50:821-830. [PMID: 36385704 PMCID: PMC9931634 DOI: 10.1002/mp.16093] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 10/28/2022] [Accepted: 10/31/2022] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Deep artificial neural networks such as convolutional neural networks (CNNs) have been shown to be effective models for reducing noise in CT images while preserving anatomic details. A practical bottleneck for developing CNN-based denoising models is the procurement of training data consisting of paired examples of high-noise and low-noise CT images. Obtaining these paired data are not practical in a clinical setting where the raw projection data is not available. This work outlines a technique to optimize CNN denoising models using methods that are available in a routine clinical setting. PURPOSE To demonstrate a phantom-based training framework for CNN noise reduction that can be efficiently implemented on any CT scanner. METHODS The phantom-based training framework uses supervised learning in which training data are synthesized using an image-based noise insertion technique. Ten patient image series were used for training and validation (9:1) and noise-only images obtained from anthropomorphic phantom scans. Phantom noise-only images were superimposed on patient images to imitate low-dose CT images for use in training. A modified U-Net architecture was used with mean-squared-error and feature reconstruction loss. The training framework was tested for clinically indicated whole-body-low-dose CT images, as well as routine abdomen-pelvis exams for which projection data was unavailable. Performance was assessed based on root-mean-square error, structural similarity, line profiles, and visual assessment. RESULTS When the CNN was tested on five reserved quarter-dose whole-body-low-dose CT images, noise was reduced by 75%, root-mean-square-error reduced by 34%, and structural similarity increased by 60%. Visual analysis and line profiles indicated that the method significantly reduced noise while maintaining spatial resolution of anatomic features. CONCLUSION The proposed phantom-based training framework demonstrated strong noise reduction while preserving spatial detail. Because this method is based within the image domain, it can be easily implemented without access to projection data.
Collapse
Affiliation(s)
| | | | - Hao Gong
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | - Shuai Leng
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | - Lifeng Yu
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | | |
Collapse
|
15
|
Zhou Z, Huber NR, Inoue A, McCollough CH, Yu L. Multislice input for 2D and 3D residual convolutional neural network noise reduction in CT. J Med Imaging (Bellingham) 2023; 10:014003. [PMID: 36743869 PMCID: PMC9888548 DOI: 10.1117/1.jmi.10.1.014003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 01/09/2023] [Indexed: 02/03/2023] Open
Abstract
Purpose Deep convolutional neural network (CNN)-based methods are increasingly used for reducing image noise in computed tomography (CT). Current attempts at CNN denoising are based on 2D or 3D CNN models with a single- or multiple-slice input. Our work aims to investigate if the multiple-slice input improves the denoising performance compared with the single-slice input and if a 3D network architecture is better than a 2D version at utilizing the multislice input. Approach Two categories of network architectures can be used for the multislice input. First, multislice images can be stacked channel-wise as the multichannel input to a 2D CNN model. Second, multislice images can be employed as the 3D volumetric input to a 3D CNN model, in which the 3D convolution layers are adopted. We make performance comparisons among 2D CNN models with one, three, and seven input slices and two versions of 3D CNN models with seven input slices and one or three output slices. Evaluation was performed on liver CT images using three quantitative metrics with full-dose images as reference. Visual assessment was made by an experienced radiologist. Results When the input channels of the 2D CNN model increases from one to three to seven, a trend of improved performance was observed. Comparing the three models with the seven-slice input, the 3D CNN model with a one-slice output outperforms the other models in terms of noise texture and homogeneity in liver parenchyma as well as subjective visualization of vessels. Conclusions We conclude the that multislice input is an effective strategy for improving performance for 2D deep CNN denoising models. The pure 3D CNN model tends to have a better performance than the other models in terms of continuity across axial slices, but the difference was not significant compared with the 2D CNN model with the same number of slices as the input.
Collapse
Affiliation(s)
- Zhongxing Zhou
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | - Nathan R. Huber
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | - Akitoshi Inoue
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | | | - Lifeng Yu
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| |
Collapse
|
16
|
Huber NR, Ferrero A, Rajendran K, Baffour F, Glazebrook KN, Diehn FE, Inoue A, Fletcher JG, Yu L, Leng S, McCollough CH. Dedicated convolutional neural network for noise reduction in ultra-high-resolution photon-counting detector computed tomography. Phys Med Biol 2022; 67:10.1088/1361-6560/ac8866. [PMID: 35944556 PMCID: PMC9444982 DOI: 10.1088/1361-6560/ac8866] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 08/09/2022] [Indexed: 01/13/2023]
Abstract
Objective.To develop a convolutional neural network (CNN) noise reduction technique for ultra-high-resolution photon-counting detector computed tomography (UHR-PCD-CT) that can be efficiently implemented using only clinically available reconstructed images. The developed technique was demonstrated for skeletal survey, lung screening, and head angiography (CTA).Approach. There were 39 participants enrolled in this study, each received a UHR-PCD and an energy integrating detector (EID) CT scan. The developed CNN noise reduction technique uses image-based noise insertion and UHR-PCD-CT images to train a U-Net via supervised learning. For each application, 13 patient scans were reconstructed using filtered back projection (FBP) and iterative reconstruction (IR) and allocated into training, validation, and testing datasets (9:1:3). The subtraction of FBP and IR images resulted in approximately noise-only images. The 5-slice average of IR produced a thick reference image. The CNN training input consisted of thick reference images with reinsertion of spatially decoupled noise-only images. The training target consisted of the corresponding thick reference images without noise insertion. Performance was evaluated based on difference images, line profiles, noise measurements, nonlinear perturbation assessment, and radiologist visual assessment. UHR-PCD-CT images were compared with EID images (clinical standard).Main results.Up to 89% noise reduction was achieved using the proposed CNN. Nonlinear perturbation assessment indicated reasonable retention of 1 mm radius and 1000 HU contrast signals (>80% for skeletal survey and head CTA, >50% for lung screening). A contour plot indicated reduced retention for small-radius and low contrast perturbations. Radiologists preferred CNN over IR for UHR-PCD-CT noise reduction. Additionally, UHR-PCD-CT with CNN was preferred over standard resolution EID-CT images.Significance.CT images reconstructed with very sharp kernels and/or thin sections suffer from increased image noise. Deep learning noise reduction can be used to offset noise level and increase utility of UHR-PCD-CT images.
Collapse
Affiliation(s)
- Nathan R. Huber
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | - Andrea Ferrero
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | | | - Francis Baffour
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | | | - Felix E. Diehn
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | - Akitoshi Inoue
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | | | - Lifeng Yu
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | - Shuai Leng
- Department of Radiology, Mayo Clinic, Rochester, MN, 55901, USA
| | | |
Collapse
|
17
|
Clinical evaluation of a phantom-based deep convolutional neural network for whole-body-low-dose and ultra-low-dose CT skeletal surveys. Skeletal Radiol 2022; 51:145-151. [PMID: 34114078 DOI: 10.1007/s00256-021-03828-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Revised: 05/17/2021] [Accepted: 05/23/2021] [Indexed: 02/02/2023]
Abstract
OBJECTIVE This study evaluated the clinical utility of a phantom-based convolutional neural network noise reduction framework for whole-body-low-dose CT skeletal surveys. MATERIALS AND METHODS The CT exams of ten patients with multiple myeloma were retrospectively analyzed. Exams were acquired with routine whole-body-low-dose CT protocol and projection noise insertion was used to simulate 25% dose exams. Images were reconstructed with either iterative reconstruction or filtered back projection with convolutional neural network post-processing. Diagnostic quality and structure visualization were blindly rated (subjective scale ranging from 0 [poor] to 100 [excellent]) by three musculoskeletal radiologists for iterative reconstruction and convolutional neural network images at routine whole-body-low-dose and 25% dose CT. RESULTS For the diagnostic quality rating, the convolutional neural network outscored iterative reconstruction at routine whole-body-low-dose CT (convolutional neural network: 95 ± 5, iterative reconstruction: 85 ± 8) and at the 25% dose level (convolutional neural network: 79 ± 10, iterative reconstruction: 22 ± 13). Convolutional neural network applied to 25% dose was rated inferior to iterative reconstruction applied to routine dose. Similar trends were observed in rating experiments focusing on structure visualization. CONCLUSION Results indicate that the phantom-based convolutional neural network noise reduction framework can improve visualization of critical structures within CT skeletal surveys. At matched dose level, the convolutional neural network outscored iterative reconstruction for all conditions studied. The image quality improvement of convolutional neural network applied to 25% dose indicates a potential for dose reduction; however, the 75% dose reduction condition studied is not currently recommended for clinical implementation.
Collapse
|
18
|
Zeng R, Lin CY, Li Q, Lu J, Skopec M, Fessler JA, Myers KJ. Performance of a deep learning-based CT image denoising method: Generalizability over dose, reconstruction kernel and slice thickness. Med Phys 2021; 49:836-853. [PMID: 34954845 DOI: 10.1002/mp.15430] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 11/22/2021] [Accepted: 12/08/2021] [Indexed: 11/11/2022] Open
Abstract
PURPOSE Deep learning (DL) is rapidly finding applications in low-dose CT image denoising. While having the potential to improve image quality (IQ) over the filtered back projection method (FBP) and produce images quickly, performance generalizability of the data-driven DL methods is not fully understood yet. The main purpose of this work is to investigate the performance generalizability of a low-dose CT image denoising neural network in data acquired under different scan conditions, particularly relating to these three parameters: reconstruction kernel, slice thickness and dose (noise) level. A secondary goal is to identify any underlying data property associated with the CT scan settings that might help predict the generalizability of the denoising network. METHODS We select the residual encoder-decoder convolutional neural network (REDCNN) as an example of a low-dose CT image denoising technique in this work. To study how the network generalizes on the three imaging parameters, we grouped the CT volumes in the Low-Dose Grand Challenge (LDGC) data into three pairs of training datasets according to their imaging parameters, changing only one parameter in each pair. We trained REDCNN with them to obtain six denoising models. We test each denoising model on datasets of matching and mismatching parameters with respect to its training sets regarding dose, reconstruction kernel and slice thickness, respectively, to evaluate the denoising performance changes. Denoising performances are evaluated on patient scans, simulated phantom scans and physical phantom scans using IQ metrics including mean squared error (MSE), contrast-dependent modulation transfer function (MTF), pixel-level noise power spectrum (pNPS) and low-contrast lesion detectability (LCD). RESULTS REDCNN had larger MSE when the testing data was different from the training data in reconstruction kernel, but no significant MSE difference when varying slice thickness in the testing data. REDCNN trained with quarter-dose data had slightly worse MSE in denoising higher-dose images than that trained with mixed-dose data (17-80%). The MTF tests showed that REDCNN trained with the two reconstruction kernels and slice thicknesses yielded images of similar image resolution. However, REDCNN trained with mixed-dose data preserved the low-contrast resolution better compared to REDCNN trained with quarter-dose data. In the pNPS test, it was found that REDCNN trained with smooth-kernel data could not remove high-frequency noise in the test data of sharp kernel, possibly because the lack of high-frequency noise in the smooth-kernel data limited the ability of the trained model in removing high-frequency noise. Finally, in the LCD test, REDCNN improved the lesion detectability over the original FBP images regardless of whether the training and testing data had matching reconstruction kernels. CONCLUSIONS REDCNN is observed to be poorly generalizable between reconstruction kernels, more robust in denoising data of arbitrary dose levels when trained with mixed-dose data, and not highly sensitive to slice thickness. It is known that reconstruction kernel affects the in-plane pNPS shape of a CT image whereas slice thickness and dose level do not, so it is possible that the generalizability performance of this CT image denoising network highly correlates to the pNPS similarity between the testing and training data. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Rongping Zeng
- Center for Devices and Radiological Health, US Food and Drug Administration (FDA), Silver Spring, MD, 20993, USA
| | | | - Qin Li
- AstraZeneca, Waltham, MA, 02451, USA
| | - Jiang Lu
- Center for Devices and Radiological Health, US Food and Drug Administration (FDA), Silver Spring, MD, 20993, USA
| | | | - Jeffrey A Fessler
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Kyle J Myers
- Center for Devices and Radiological Health, US Food and Drug Administration (FDA), Silver Spring, MD, 20993, USA
| |
Collapse
|