Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Abbey CK, Barrett HH. Human- and model-observer performance in ramp-spectrum noise: effects of regularization and object variability. J Opt Soc Am A Opt Image Sci Vis 2001;18:473-88. [PMID: 11265678 PMCID: PMC2943344 DOI: 10.1364/josaa.18.000473] [Citation(s) in RCA: 179] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]

For:	Abbey CK, Barrett HH. Human- and model-observer performance in ramp-spectrum noise: effects of regularization and object variability. J Opt Soc Am A Opt Image Sci Vis 2001;18:473-88. [PMID: 11265678 PMCID: PMC2943344 DOI: 10.1364/josaa.18.000473] [Citation(s) in RCA: 179] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]

Number

Cited by Other Article(s)

Shao M, Byrd DW, Mitra J, Behnia F, Lee JH, Iravani A, Sadic M, Chen DL, Wollenweber SD, Abbey CK, Kinahan PE, Ahn S. A deep learning anthropomorphic model observer for a detection task in PET. Med Phys 2024. [PMID: 39008812 DOI: 10.1002/mp.17303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 05/15/2024] [Accepted: 06/24/2024] [Indexed: 07/17/2024] Open

Abstract

BACKGROUND

Lesion detection is one of the most important clinical tasks in positron emission tomography (PET) for oncology. An anthropomorphic model observer (MO) designed to replicate human observers (HOs) in a detection task is an important tool for assessing task-based image quality. The channelized Hotelling observer (CHO) has been the most popular anthropomorphic MO. Recently, deep learning MOs (DLMOs), mostly based on convolutional neural networks (CNNs), have been investigated for various imaging modalities. However, there have been few studies on DLMOs for PET.

PURPOSE

The goal of the study is to investigate whether DLMOs can predict HOs better than conventional MOs such as CHO in a two-alternative forced-choice (2AFC) detection task using PET images with real anatomical variability.

METHODS

Two types of DLMOs were implemented: (1) CNN DLMO, and (2) CNN-SwinT DLMO that combines CNN and Swin Transformer (SwinT) encoders. Lesion-absent PET images were reconstructed from clinical data, and lesion-present images were reconstructed with adding simulated lesion sinogram data. Lesion-present and lesion-absent PET image pairs were labeled by eight HOs consisting of four radiologists and four image scientists in a 2AFC detection task. In total, 2268 pairs of lesion-present and lesion-absent images were used for training, 324 pairs for validation, and 324 pairs for test. CNN DLMO, CNN-SwinT DLMO, CHO with internal noise, and non-prewhitening matched filter (NPWMF) were compared in the same train-test paradigm. For comparison, six quantitative metrics including prediction accuracy, mean squared errors (MSEs) and correlation coefficients, which measure how well a MO predicts HOs, were calculated in a 9-fold cross-validation experiment.

RESULTS

In terms of the accuracy and MSE metrics, CNN DLMO and CNN-SwinT DLMO showed better performance than CHO and NPWMF, and CNN-SwinT DLMO showed the best performance among the MOs evaluated.

CONCLUSIONS

DLMO can predict HOs more accurately than conventional MOs such as CHO in PET lesion detection. Combining SwinT and CNN encoders can improve the DLMO prediction performance compared to using CNN only.

Collapse

Zhang C, Li K, Zhang R, Chen GH. Experimental measurement of local noise power spectrum (NPS) in photon counting detector-CT (PCD-CT) using a single data acquisition. Med Phys 2024;51:4081-4094. [PMID: 38703355 PMCID: PMC11147724 DOI: 10.1002/mp.17110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 03/09/2024] [Accepted: 03/28/2024] [Indexed: 05/06/2024] Open

Rahman MA, Yu Z, Laforest R, Abbey CK, Siegel BA, Jha AK. DEMIST: A Deep-Learning-Based Detection-Task-Specific Denoising Approach for Myocardial Perfusion SPECT. IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES 2024;8:439-450. [PMID: 38766558 PMCID: PMC11101197 DOI: 10.1109/trpms.2024.3379215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]

Abstract

There is an important need for methods to process myocardial perfusion imaging (MPI) single-photon emission computed tomography (SPECT) images acquired at lower-radiation dose and/or acquisition time such that the processed images improve observer performance on the clinical task of detecting perfusion defects compared to low-dose images. To address this need, we build upon concepts from model-observer theory and our understanding of the human visual system to propose a detection task-specific deep-learning-based approach for denoising MPI SPECT images (DEMIST). The approach, while performing denoising, is designed to preserve features that influence observer performance on detection tasks. We objectively evaluated DEMIST on the task of detecting perfusion defects using a retrospective study with anonymized clinical data in patients who underwent MPI studies across two scanners (N = 338). The evaluation was performed at low-dose levels of 6.25%, 12.5%, and 25% and using an anthropomorphic channelized Hotelling observer. Performance was quantified using area under the receiver operating characteristics curve (AUC). Images denoised with DEMIST yielded significantly higher AUC compared to corresponding low-dose images and images denoised with a commonly used task-agnostic deep learning-based denoising method. Similar results were observed with stratified analysis based on patient sex and defect type. Additionally, DEMIST improved visual fidelity of the low-dose images as quantified using root mean squared error and structural similarity index metric. A mathematical analysis revealed that DEMIST preserved features that assist in detection tasks while improving the noise properties, resulting in improved observer performance. The results provide strong evidence for further clinical evaluation of DEMIST to denoise low-count images in MPI SPECT.

Collapse

Pouget E, Dedieu V. Applying Self-Supervised Learning to Image Quality Assessment in Chest CT Imaging. Bioengineering (Basel) 2024;11:335. [PMID: 38671757 PMCID: PMC11048026 DOI: 10.3390/bioengineering11040335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 03/25/2024] [Accepted: 03/27/2024] [Indexed: 04/28/2024] Open

Mehta R, Kawakita TA, Pineda AR. Modeling human observer detection for varying data acquisition in undersampled MRI for two-alternative forced choice (2-AFC) and forced localization tasks. PROCEEDINGS OF SPIE--THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING 2024;12929:1292907. [PMID: 38799476 PMCID: PMC11128320 DOI: 10.1117/12.3005839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]

Nelson BJ, Kc P, Badal A, Jiang L, Masters SC, Zeng R. Pediatric evaluations for deep learning CT denoising. Med Phys 2024;51:978-990. [PMID: 38127330 DOI: 10.1002/mp.16901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 11/13/2023] [Accepted: 12/04/2023] [Indexed: 12/23/2023] Open

Abstract

BACKGROUND

Deep learning (DL) CT denoising models have the potential to improve image quality for lower radiation dose exams. These models are generally trained with large quantities of adult patient image data. However, CT, and increasingly DL denoising methods, are used in both adult and pediatric populations. Pediatric body habitus and size can differ significantly from adults and vary dramatically from newborns to adolescents. Ensuring that pediatric subgroups of different body sizes are not disadvantaged by DL methods requires evaluations capable of assessing performance in each subgroup.

PURPOSE

To assess DL CT denoising in pediatric and adult-sized patients, we built a framework of computer simulated image quality (IQ) control phantoms and evaluation methodology.

METHODS

The computer simulated IQ phantoms in the framework featured pediatric-sized versions of standard CatPhan 600 and MITA-LCD phantoms with a range of diameters matching the mean effective diameters of pediatric patients ranging from newborns to 18 years old. These phantoms were used in simulating CT images that were then inputs for a DL denoiser to evaluate performance in different sized patients. Adult CT test images were simulated using standard-sized phantoms scanned with adult scan protocols. Pediatric CT test images were simulated with pediatric-sized phantoms and adjusted pediatric protocols. The framework's evaluation methodology consisted of denoising both adult and pediatric test images then assessing changes in image quality, including noise, image sharpness, CT number accuracy, and low contrast detectability. To demonstrate the use of the framework, a REDCNN denoising model trained on adult patient images was evaluated. To validate that the DL model performance measured with the proposed pediatric IQ phantoms was representative of performance in more realistic patient anatomy, anthropomorphic pediatric XCAT phantoms of the same age range were also used to compare noise reduction performance.

RESULTS

Using the proposed pediatric-sized IQ phantom framework, size differences between adult and pediatric-sized phantoms were observed to substantially influence the adult trained DL denoising model's performance. When applied to adult images, the DL model achieved a 60% reduction in noise standard deviation without substantial loss in sharpness in mid or high spatial frequencies. However, in smaller phantoms the denoising performance dropped due to different image noise textures resulting from the smaller field of view (FOV) between adult and pediatric protocols. In the validation study, noise reduction trends in the pediatric-sized IQ phantoms were found to be consistent with those found in anthropomorphic phantoms.

CONCLUSION

We developed a framework of using pediatric-sized IQ phantoms for pediatric subgroup evaluation of DL denoising models. Using the framework, we found the performance of an adult trained DL denoiser did not generalize well in the smaller diameter phantoms corresponding to younger pediatric patient sizes. Our work suggests noise texture differences from FOV changes between adult and pediatric protocols can contribute to poor generalizability in DL denoising and that the proposed framework is an effective means to identify these performance disparities for a given model.

Collapse

Göppel M, Anton M, Gala HDLH, Giussani A, Trinkl S, Renger B, Brix G. Dose-efficiency quantification of computed tomography systems using a model-observer. Med Phys 2023;50:7594-7605. [PMID: 37183490 DOI: 10.1002/mp.16441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 04/01/2023] [Accepted: 04/17/2023] [Indexed: 05/16/2023] Open

Abstract

BACKGROUND

Recent advances in computed tomography (CT) technology have considerably improved the quality of CT images and reduced radiation exposure in patients. At present, however, there is no generally accepted figure of merit (FOM) for comparing the dose efficiencies of CT systems.

PURPOSE

(i) To establish an FOM that characterizes the quality of CT images in relation to the radiation dose by means of a mathematical model observer and (ii) to evaluate the new FOM on different CT systems and image reconstruction algorithms.

METHODS

Images of a homogeneous phantom with four low-contrast inserts were acquired using three different CT systems at three dose levels and a representative protocol for CT imaging of low-contrast objects in the abdomen. The images were reconstructed using filtered-back projection and iterative algorithms. A channelized hotelling observer with difference-of-Gaussian channels was applied to compute the detectability (d ' $d^{\prime}$ ). This was done for each insert and each of the considered imaging conditions from square regions of interest (ROIs) that were (semi-)automatically centered on the inserts. The estimated detectabilities (d ' $d^{\prime}$ ) were averaged in the first step over the three dose levels (⟨ d ' ⟩ $\langle {d^{\prime}} \rangle $ ), and subsequently over the four contrast inserts (⟨ d ' ⟩ w ${\langle {d^{\prime}} \rangle _{\rm{w}}}$ ). All calculation steps included a dedicated assessment of the related uncertainties following accepted metrological guidelines.

RESULTS

The determined detectabilities (d ' $d^{\prime}$ ) varied considerably with the contrast and diameter of the four inserts, as well as with the radiation doses and reconstruction algorithms used for image generation (d ' $d^{\prime}\;$ = 1.3-5.5). Thus, the specification of a single detectability as an FOM is not well suited for comprehensively characterizing the dose efficiency of a CT system. A more comprehensive and robust characterization was provided by the averaged detectabilities⟨ d ' ⟩ $\langle {d^{\prime}} \rangle $ and, in particular,⟨ d ' ⟩ w ${\langle {d^{\prime}} \rangle _{\rm{w}}}$ . Our analysis reveals that the model observer analysis is very sensitive to the exact position of the ROIs.

CONCLUSIONS

The presented automatable software approach yielded with the weighted detectability⟨ d ' ⟩ w ${\langle {d^{\prime}} \rangle _{\rm{w}}}$ an objective FOM to benchmark different CT systems and reconstruction algorithms in a robust and reliable manner. An essential advantage of the proposed model-observer approach is that uncertainties in the FOM can be provided, which is an indispensable prerequisite for type testing.

Collapse

Kim G, Baek J. Power-law spectrum-based objective function to train a generative adversarial network with transfer learning for the synthetic breast CT image. Phys Med Biol 2023;68:205007. [PMID: 37722388 DOI: 10.1088/1361-6560/acfadf] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 09/18/2023] [Indexed: 09/20/2023]

Abstract

Objective.This paper proposes a new objective function to improve the quality of synthesized breast CT images generated by the GAN and compares the GAN performances on transfer learning datasets from different image domains.Approach.The proposed objective function, named beta loss function, is based on the fact that x-ray-based breast images follow the power-law spectrum. Accordingly, the exponent of the power-law spectrum (beta value) for breast CT images is approximately two. The beta loss function is defined in terms of L1 distance between the beta value of synthetic images and validation samples. To compare the GAN performances for transfer learning datasets from different image domains, ImageNet and anatomical noise images are used in the transfer learning dataset. We employ styleGAN2 as the backbone network and add the proposed beta loss function. The patient-derived breast CT dataset is used as the training and validation dataset; 7355 and 212 images are used for network training and validation, respectively. We use the beta value evaluation and Fréchet inception distance (FID) score for quantitative evaluation.Main results.For qualitative assessment, we attempt to replicate the images from the validation dataset using the trained GAN. Our results show that the proposed beta loss function achieves a more similar beta value to real images and a lower FID score. Moreover, we observe that the GAN pretrained with anatomical noise images achieves better equality than ImageNet for beta value evaluation and FID score. Finally, the beta loss function with anatomical noise as the transfer learning dataset achieves the lowest FID score.Significance.Overall, the GAN using the proposed beta loss function with anatomical noise images as the transfer learning dataset provides the lowest FID score among all tested cases. Hence, this work has implications for developing GAN-based breast image synthesis methods for medical imaging applications.

Collapse

Jang H, Baek J. Convolutional neural network-based model observer for signal known statistically task in breast tomosynthesis images. Med Phys 2023;50:6390-6408. [PMID: 36971505 DOI: 10.1002/mp.16395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 02/20/2023] [Accepted: 03/22/2023] [Indexed: 03/29/2023] Open

Abstract

BACKGROUND

Since human observer studies are resource-intensive, mathematical model observers are frequently used to assess task-based image quality. The most common implementation of these model observers assume that the signal information is exactly known. However, these tasks cannot thoroughly represent situations where the signal information is not exactly known in terms of size and shape.

PURPOSE

Considering the limitations of the tasks for which signal information is exactly known, we proposed a convolutional neural network (CNN)-based model observer for signal known statistically (SKS) and background known statistically (BKS) detection tasks in breast tomosynthesis images.

METHODS

A wide parameter search was conducted from six different acquisition angles (i.e., 10°, 20°, 30°, 40°, 50°, and 60°) within the same dose level (i.e., 2.3 mGy) under two separate acquisition schemes: (1) constant total number of projections, and (2) constant angular separation between projections. Two different types of signals: spherical (i.e., SKE tasks) and spiculated (i.e., SKS tasks) were used. The detection performance of the CNN-based model observer was compared with that of the Hotelling observer (HO) instead of the IO. Pixel-wise gradient-weighted class activation mapping (pGrad-CAM) map was extracted from each reconstructed tomosynthesis image to provide an intuitive understanding of the trained CNN-based model observer.

RESULTS

The CNN-based model observer achieved a higher detection performance compared to that of the HO for all tasks. Moreover, the improvement in its detection performance was greater for SKS tasks compared to that for SKE tasks. These results demonstrated that the addition of nonlinearity improved the detection performance owing to the variation of the background and signal. Interestingly, the pGrad-CAM results effectively localized the class-specific discriminative region, further supporting the quantitative evaluation results of the CNN-based model observer. In addition, we verified that the CNN-based model observer required fewer images to achieve the detection performance of the HO.

CONCLUSIONS

In this work, we proposed a CNN-based model observer for SKS and BKS detection tasks in breast tomosynthesis images. Throughout the study, we demonstrated that the detection performance of the proposed CNN-based model observer was superior to that of the HO.

Collapse

Klein DS, Lago MA, Abbey CK, Eckstein MP. A 2D Synthesized Image Improves the 3D Search for Foveated Visual Systems. IEEE TRANSACTIONS ON MEDICAL IMAGING 2023;42:2176-2188. [PMID: 37027767 PMCID: PMC10476603 DOI: 10.1109/tmi.2023.3246005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

Abbey CK, Samuelson FW, Zeng R, Boone JM, Myers KJ, Eckstein MP. Discrimination tasks in simulated low-dose CT noise. Med Phys 2023;50:4151-4172. [PMID: 37057360 PMCID: PMC11181787 DOI: 10.1002/mp.16412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 03/21/2023] [Accepted: 03/22/2023] [Indexed: 04/15/2023] Open

Abstract

BACKGROUND

This study reports the results of a set of discrimination experiments using simulated images that represent the appearance of subtle lesions in low-dose computed tomography (CT) of the lungs. Noise in these images has a characteristic ramp-spectrum before apodization by noise control filters. We consider three specific diagnostic features that determine whether a lesion is considered malignant or benign, two system-resolution levels, and four apodization levels for a total of 24 experimental conditions.

PURPOSE

The goal of the investigation is to better understand how well human observers perform subtle discrimination tasks like these, and the mechanisms of that performance. We use a forced-choice psychophysical paradigm to estimate observer efficiency and classification images. These measures quantify how effectively subjects can read the images, and how they use images to perform discrimination tasks across the different imaging conditions.

MATERIALS AND METHODS

The simulated CT images used as stimuli in the psychophysical experiments are generated from high-resolution objects passed through a modulation transfer function (MTF) before down-sampling to the image-pixel grid. Acquisition noise is then added with a ramp noise-power spectrum (NPS), with subsequent smoothing through apodization filters. The features considered are lesion size, indistinct lesion boundary, and a nonuniform lesion interior. System resolution is implemented by an MTF with resolution (10% max.) of 0.47 or 0.58 cyc/mm. Apodization is implemented by a Shepp-Logan filter (Sinc profile) with various cutoffs. Six medically naïve subjects participated in the psychophysical studies, entailing training and testing components for each condition. Training consisted of staircase procedures to find the 80% correct threshold for each subject, and testing involved 2000 psychophysical trials at the threshold value for each subject. Human-observer performance is compared to the Ideal Observer to generate estimates of task efficiency. The significance of imaging factors is assessed using ANOVA. Classification images are used to estimate the linear template weights used by subjects to perform these tasks. Classification-image spectra are used to analyze subject weights in the spatial-frequency domain.

RESULTS

Overall, average observer efficiency is relatively low in these experiments (10%-40%) relative to detection and localization studies reported previously. We find significant effects for feature type and apodization level on observer efficiency. Somewhat surprisingly, system resolution is not a significant factor. Efficiency effects of the different features appear to be well explained by the profile of the linear templates in the classification images. Increasingly strong apodization is found to both increase the classification-image weights and to increase the mean-frequency of the classification-image spectra. A secondary analysis of "Unapodized" classification images shows that this is largely due to observers undoing (inverting) the effects of apodization filters.

CONCLUSIONS

These studies demonstrate that human observers can be relatively inefficient at feature-discrimination tasks in ramp-spectrum noise. Observers appear to be adapting to frequency suppression implemented in apodization filters, but there are residual effects that are not explained by spatial weighting patterns. The studies also suggest that the mechanisms for improving performance through the application of noise-control filters may require further investigation.

Collapse

Skwirczyński M, Tabor Z, Lasek J, Schneider Z, Gibała S, Kucybała I, Urbanik A, Obuchowicz R. Deep Learning Algorithm for Differentiating Patients with a Healthy Liver from Patients with Liver Lesions Based on MR Images. Cancers (Basel) 2023;15:3142. [PMID: 37370752 DOI: 10.3390/cancers15123142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 06/05/2023] [Accepted: 06/09/2023] [Indexed: 06/29/2023] Open

Onwanna J, Chantadisai M, Chaiwatanarat T, Rakvongthai Y. Dual-Tracer Parathyroid Imaging Using Joint SPECT Reconstruction. Nucl Med Mol Imaging 2023;57:126-136. [PMID: 37187950 PMCID: PMC10172461 DOI: 10.1007/s13139-022-00787-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 11/11/2022] [Accepted: 12/27/2022] [Indexed: 01/18/2023] Open

Anton M, Mäder U, Schopphoven S, Reginatto M. A nonparametric measure of noise in x-ray diagnostic images-mammography. Phys Med Biol 2023;68. [PMID: 36652714 DOI: 10.1088/1361-6560/acb485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/18/2023] [Indexed: 01/20/2023]

Pouget E, Dedieu V. Comparison of supervised-learning approaches for designing a channelized observer for image quality assessment in CT. Med Phys 2023. [PMID: 36647620 DOI: 10.1002/mp.16227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 12/29/2022] [Accepted: 01/03/2023] [Indexed: 01/18/2023] Open

Abstract

BACKGROUND

The current paradigm for evaluating computed tomography (CT) system performance relies on a task-based approach. As the Hotelling observer (HO) provides an upper bound of observer performances in specific signal detection tasks, the literature advocates HO use for optimization purposes. However, computing the HO requires calculating the inverse of the image covariance matrix, which is often intractable in medical applications. As an alternative, dimensionality reduction has been extensively investigated to extract the task-relevant features from the raw images. This can be achieved by using channels, which yields the channelized-HO (CHO). The channels are only considered efficient when the channelized observer (CO) can approximate its unconstrained counterpart. Previous work has demonstrated that supervised learning-based methods can usually benefit CO design, either for generating efficient channels using partial least squares (PLS) or for replacing the Hotelling detector with machine-learning (ML) methods.

PURPOSE

Here we investigated the efficiency of a supervised ML-algorithm used to design a CO for predicting the performance of unconstrained HO. The ML-algorithm was applied either (1) in the estimator for dimensionality reduction, or (2) in the detector function.

METHODS

A channelized support vector machine (CSVM) was employed and compared against the CHO in terms of ability to predict HO performances. Both the CSVM and the CHO were estimated with channels derived from the singular value decomposition (SVD) of the system operator, principal component analysis (PCA), and PLS. The huge variety of regularization strategies proposed by CT system vendors for statistical image reconstruction (SIR) make the generalization capability of an observer a key point to consider upfront of implementation in clinical practice. To evaluate the generalization properties of the observers, we adopted a 2-step testing process: (1) achieved with the same regularization strategy (as in the training phase) and (2) performed using different reconstruction properties. We generated simulated- signal-known-exactly/background-known-exactly (SKE/BKE) tasks in which different noise structures were generated using Markov random field (MRF) regularizations using either a Green or a quadratic, function.

RESULTS

The CSVM outperformed the CHO for all types of channels and regularization strategies. Furthermore, even though both COs generalized well to images reconstructed with the same regularization strategy as the images considered in the training phase, the CHO failed to generalize to images reconstructed differently whereas the CSVM managed to successfully generalize. Lastly, the proposed CSVM observer used with PCA channels outperformed the CHO with PLS channels while using a smaller training data set.

CONCLUSION

These results argue for introducing the supervised-learning paradigm in the detector function rather than in the operator of the channels when designing a CO to provide an accurate estimate of HO performance. The CSVM with PCA channels proposed here could be used as a surrogate for HO in image quality assessment.

Collapse

O’Neill AG, Valdez EL, Lingala SG, Pineda AR. Modeling human observer detection in undersampled magnetic resonance imaging reconstruction with total variation and wavelet sparsity regularization. J Med Imaging (Bellingham) 2023;10:015502. [PMID: 36852415 PMCID: PMC9961227 DOI: 10.1117/1.jmi.10.1.015502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 02/06/2023] [Indexed: 02/27/2023] Open

Abstract

Purpose

Task-based assessment of image quality in undersampled magnetic resonance imaging provides a way of evaluating the impact of regularization on task performance. In this work, we evaluated the effect of total variation (TV) and wavelet regularization on human detection of signals with a varying background and validated a model observer in predicting human performance.

Approach

Human observer studies used two-alternative forced choice (2-AFC) trials with a small signal known exactly task but with varying backgrounds for fluid-attenuated inversion recovery images reconstructed from undersampled multi-coil data. We used a 3.48 undersampling factor with TV and a wavelet sparsity constraints. The sparse difference-of-Gaussians (S-DOG) observer with internal noise was used to model human observer detection. The internal noise for the S-DOG was chosen to match the average percent correct (PC) in 2-AFC studies for four observers using no regularization. That S-DOG model was used to predict the PC of human observers for a range of regularization parameters.

Results

We observed a trend that the human observer detection performance remained fairly constant for a broad range of values in the regularization parameter before decreasing at large values. A similar result was found for the normalized ensemble root mean squared error. Without changing the internal noise, the model observer tracked the performance of the human observers as the regularization was increased but overestimated the PC for large amounts of regularization for TV and wavelet sparsity, as well as the combination of both parameters.

Conclusions

For the task we studied, the S-DOG observer was able to reasonably predict human performance with both TV and wavelet sparsity regularizers over a broad range of regularization parameters. We observed a trend that task performance remained fairly constant for a range of regularization parameters before decreasing for large amounts of regularization.

Collapse

Itoh T, Noguchi K. Evaluation of the quantitative performance of non-enhanced dual-energy CT X-map in detecting acute ischemic brain stroke: A model observer study using computer simulation. Phys Med 2022;104:85-92. [PMID: 36371946 DOI: 10.1016/j.ejmp.2022.10.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 09/02/2022] [Accepted: 10/30/2022] [Indexed: 11/11/2022] Open

Jha AK, Bradshaw TJ, Buvat I, Hatt M, Kc P, Liu C, Obuchowski NF, Saboury B, Slomka PJ, Sunderland JJ, Wahl RL, Yu Z, Zuehlsdorff S, Rahmim A, Boellaard R. Nuclear Medicine and Artificial Intelligence: Best Practices for Evaluation (the RELAINCE Guidelines). J Nucl Med 2022;63:1288-1299. [PMID: 35618476 PMCID: PMC9454473 DOI: 10.2967/jnumed.121.263239] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 05/11/2022] [Indexed: 01/26/2023] Open

Funama Y, Shirasaka T, Goto T, Aoki Y, Tanaka K, Yoshida R. Iterative reconstruction with multifrequency signal recognition technology to improve low-contrast detectability: A phantom study. Acta Radiol Open 2022;11:20584601221109919. [PMID: 35747445 PMCID: PMC9209785 DOI: 10.1177/20584601221109919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 06/08/2022] [Indexed: 11/24/2022] Open

Lee C, Baek J. Effect of optical blurring of X-ray source on breast tomosynthesis image quality: Modulation transfer function, anatomical noise power spectrum, and signal detectability perspectives. PLoS One 2022;17:e0267850. [PMID: 35587494 PMCID: PMC9119460 DOI: 10.1371/journal.pone.0267850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 03/24/2022] [Indexed: 11/19/2022] Open

Abstract

We investigated the effect of the optical blurring of X-ray source on digital breast tomosynthesis (DBT) image quality using well-designed DBT simulator and table-top experimental systems. To measure the in-plane modulation transfer function (MTF), we used simulated sphere phantom and Teflon sphere phantom and generated their projection data using two acquisition modes (i.e., step-and-shoot mode and continuous mode). After reconstruction, we measured the in-plane MTF using reconstructed sphere phantom images. In addition, we measured the anatomical noise power spectrum (aNPS) and signal detectability. We constructed simulated breast phantoms with a 50% volume glandular fraction (VGF) of breast anatomy using the power law spectrum and inserted spherical objects with 1 mm, 2 mm, and 5 mm diameters as breast masses. Projection data were acquired using two acquisition modes, and in-plane breast images were reconstructed using the Feldkamp-Davis-Kress (FDK) algorithm. For the experimental study, we used BR3D breast phantom with 50% VGF and obtained projection data using a table-top experimental system. To compare the detection performance of the two acquisition modes, we calculated the signal detectability using the channelized Hotelling observer (CHO) with Laguerre-Gauss (LG) channels. Our results show that spatial resolution of in-plane image in continuous mode was degraded due to the optical blurring of X-ray source. This blurring effect was reflected in aNPS, resulting in large β values. From a signal detectability perspective, the signal detectability in step-and-shoot mode is higher than that in continuous mode for small spherical signals but not large spherical signals. Although the step-and-shoot mode has disadvantage in terms of scan time compared to the continuous mode, scanning in step-and-shoot mode is better for detecting small signals, indicating that there is a tradeoff between scan time and image quality.

Collapse

Barufaldi B, Vent TL, Bakic PR, Maidment ADA. Computer Simulations of Case Difficulty in Digital Breast Tomosynthesis Using Virtual Clinical Trials. Med Phys 2022;49:2220-2232. [PMID: 35212403 DOI: 10.1002/mp.15553] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 02/07/2022] [Accepted: 02/13/2022] [Indexed: 11/10/2022] Open

Abstract

PURPOSE

Virtual clinical trials (VCTs) require computer simulations of representative patients and images to evaluate and compare changes in performance of imaging technologies. The simulated images are usually interpreted by model observers whose performance depends upon the selection of imaging cases used in training evaluation models. This work proposes an efficient method to simulate and calibrate soft tissue lesions, which matches the detectability threshold of virtual and human readings.

METHODS

Anthropomorphic breast phantoms were used to evaluate the simulation of four mass models (I-IV) that vary in shape and composition of soft tissue. Ellipsoidal (I) and spiculated (II-IV) masses were simulated using composite voxels with partial volumes. Digital breast tomosynthesis projections and reconstructions of a clinical system were simulated. Channelized Hotelling observers (CHOs) were evaluated using reconstructed slices of masses that varied in shape, composition, and density of surrounded tissue. The detectability threshold of each mass model was evaluated using receiver operating characteristic (ROC) curves calculated with the CHO's scores.

RESULTS

The area under the curve (AUC) of each calibrated mass model were within the 95% confidence interval (mean AUC [95% CI]) reported in a previous reader study (0.93 [0.89, 0.97]). The mean AUC [95% CI] obtained were 0.94 [0.93, 0.96], 0.92 [0.90, 0.93], 0.92 [0.90, 0.94], 0.93 [0.92, 0.95] for models I to IV, respectively. The mean AUC results varied substantially as a function of shape, composition, and density of surrounded tissue.

CONCLUSIONS

For successful VCTs, lesions composed of soft tissue should be calibrated to simulate imaging cases that match the case difficulty predicted by human readers. Lesion composition, shape, and size are parameters that should be carefully selected to calibrate VCTs. This article is protected by copyright. All rights reserved.

Collapse

O'Neill AG, Lingala SG, Pineda AR. Predicting human detection performance in magnetic resonance imaging (MRI) with total variation and wavelet sparsity regularizers. PROCEEDINGS OF SPIE--THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING 2022;12035:1203511. [PMID: 36267385 PMCID: PMC9581458 DOI: 10.1117/12.2608986] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Onwanna J, Chantadisai M, Tepmongkol S, Fahey F, Ouyang J, Rakvongthai Y. Impact of reconstruction parameters on lesion detection and localization in joint ictal/inter-ictal SPECT reconstruction. Ann Nucl Med 2021;36:24-32. [PMID: 34559366 DOI: 10.1007/s12149-021-01680-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 09/14/2021] [Indexed: 10/20/2022]

Abstract

PURPOSE

Previously, a joint ictal/inter-ictal SPECT reconstruction was proposed to reconstruct a differential image representing the change of brain SPECT image from an inter-ictal to an ictal study. The so-called joint method yielded better performance for epileptic foci localization than the conventional subtraction method. In this study, we evaluated the performance of different reconstruction settings of the joint reconstruction of ictal/inter-ictal SPECT data, which creates a differential image showing the difference between ictal and inter-ictal images, in lesion detection and localization in epilepsy imaging.

METHODS

Differential images reconstructed from phantom data using the joint and the subtraction methods were compared based on lesion detection performance (channelized Hotelling observer signal-to-noise ratio (SNR_CHO) averaged across four lesion-to-background contrast levels) at the optimal iteration. The joint-initial method which was the joint method that was initialized by the subtraction method at optimal iteration was also used to reconstruct differential images. These three methods with respective optimal iteration and the subtraction method with four iterations were applied to epileptic patient datasets. A human observer lesion localization study was performed based on localization receiver operating characteristic (LROC) analysis.

RESULTS

From the phantom study, at their respective optimal iteration, the joint method yielded an improvement in lesion detection performance over the subtraction method of 26%, which increased to 145% when using the joint-initial method. From the patient study, the joint-initial method yielded the highest area under the LROC curve as compared with those of the joint and the subtraction methods with optimal iteration and with 4 iterations (0.44 vs 0.41, 0.39 and 0.36, respectively).

CONCLUSIONS

In lesion detection and localization, the joint method at optimal iteration outperformed the subtraction method at optimal iteration and at iteration typically used in clinical practice. Furthermore, initialization by the subtraction method improved the performance of the joint method.

Collapse

Jha AK, Myers KJ, Obuchowski NA, Liu Z, Rahman MA, Saboury B, Rahmim A, Siegel BA. Objective Task-Based Evaluation of Artificial Intelligence-Based Medical Imaging Methods:: Framework, Strategies, and Role of the Physician. PET Clin 2021;16:493-511. [PMID: 34537127 DOI: 10.1016/j.cpet.2021.06.013] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Li K, Zhou W, Li H, Anastasio MA. Assessing the Impact of Deep Neural Network-Based Image Denoising on Binary Signal Detection Tasks. IEEE TRANSACTIONS ON MEDICAL IMAGING 2021;40:2295-2305. [PMID: 33929958 PMCID: PMC8673589 DOI: 10.1109/tmi.2021.3076810] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]

Lago MA, Abbey CK, Eckstein MP. Medical image quality metrics for foveated model observers. J Med Imaging (Bellingham) 2021;8:041209. [PMID: 34423070 DOI: 10.1117/1.jmi.8.4.041209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 07/20/2021] [Indexed: 11/14/2022] Open

Pineda AR, Miedema H, Lingala SG, Nayak KS. Optimizing constrained reconstruction in magnetic resonance imaging for signal detection. Phys Med Biol 2021;66:10.1088/1361-6560/ac1021. [PMID: 34192682 PMCID: PMC9169904 DOI: 10.1088/1361-6560/ac1021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Accepted: 06/30/2021] [Indexed: 11/11/2022]

Abstract

Constrained reconstruction in magnetic resonance imaging (MRI) allows the use of prior information through constraints to improve reconstructed images. These constraints often take the form of regularization terms in the objective function used for reconstruction. Constrained reconstruction leads to images which appear to have fewer artifacts than reconstructions without constraints but because the methods are typically nonlinear, the reconstructed images have artifacts whose structure is hard to predict. In this work, we compared different methods of optimizing the regularization parameter using a total variation (TV) constraint in the spatial domain and sparsity in the wavelet domain for one-dimensional (2.56×) undersampling using variable density undersampling. We compared the mean squared error (MSE), structural similarity (SSIM), L-curve and the area under the receiver operating characteristic (AUC) using a linear discriminant for detecting a small and a large signal. We used a signal-known-exactly task with varying backgrounds in a simulation where the anatomical variation was the major source of clutter for the detection task. Our results show that the AUC dependence on regularization parameters varies with the imaging task (i.e. the signal being detected). The choice of regularization parameters for MSE, SSIM, L-curve and AUC were similar. We also found that a model-based reconstruction including TV and wavelet sparsity did slightly better in terms of AUC than just enforcing data consistency but using these constraints resulted in much better MSE and SSIM. These results suggest that the increased performance in MSE and SSIM over-estimate the improvement in detection performance for the tasks in this paper. The MSE and SSIM metrics show a big difference in performance where the difference in AUC is small. To our knowledge, this is the first time that signal detection with varying backgrounds has been used to optimize constrained reconstruction in MRI.

Collapse

Pouget E, Dedieu V. Impact of iterative reconstruction algorithms on the applicability of Fourier-based detectability index for x-ray CT imaging. Med Phys 2021;48:4229-4241. [PMID: 34075595 DOI: 10.1002/mp.15015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 05/17/2021] [Accepted: 05/23/2021] [Indexed: 11/08/2022] Open

Abstract

PURPOSE

The increasing application of iterative reconstruction algorithms in clinical computed tomography to improve image quality and reduce radiation dose, elicits strong interest, and needs model observers to optimize CT scanning protocols objectively and efficiently. The current paradigm for evaluating imaging system performance relies on Fourier methods, which presuppose a linear, wide-sense stationary system. Long-range correlations introduced by iterative reconstruction algorithms may narrow the applicability of Fourier techniques. Differences in the implementation of reconstruction algorithms between manufacturers add further complexity. The present work set out to quantify the errors entailed by the use of Fourier methods, which can lead to design decisions that do not correlate with detectability.

METHODS

To address this question, we evaluated the noise properties and the detectability index of the ideal linear observer using the spatial approach and the Fourier-based approach. For this purpose, a homogeneous phantom was imaged on two scanners: the Revolution CT (GE Healthcare) and the Somatom Definition AS+ (Siemens Healthcare) at different exposure levels. Images were reconstructed using different strength levels of IR algorithms available on the systems considered: Adaptative Statistical Iterative Reconstruction (ASIR-V) and Sinogram Affirmed Iterative Reconstruction (SAFIRE).

RESULTS

Our findings highlight that the spatial domain estimate of the detectability index is higher than the Fourier domain estimate. This trend is found to be dependent on the specific regularization used by IR algorithms as well as the signal to be detected. The eigenanalysis of the noise covariance matrix and of its circulant approximation yields explanation about the evoked trends. In particular, this analysis suggests that the predictive power of the Fourier-based ideal linear observer depends on the ability of each basis analyzed to be relevant to the signal to be detected.

CONCLUSION

The applicability of Fourier techniques is dependent on the specific regularization used by IR algorithms. These results argue for verifying the assumptions made when using Fourier methods since Fourier-task-based detectability index does not always correlate with signal detectability.

Collapse

Sidky EY, Phillips JP, Zhou W, Ongie G, Cruz-Bastida JP, Reiser IS, Anastasio MA, Pan X. A signal detection model for quantifying overregularization in nonlinear image reconstruction. Med Phys 2021;48:6312-6323. [PMID: 34169538 DOI: 10.1002/mp.14703] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Revised: 12/09/2020] [Accepted: 12/21/2020] [Indexed: 11/08/2022] Open

Abstract

Many useful image quality metrics for evaluating linear image reconstruction techniques do not apply to or are difficult to interpret for nonlinear image reconstruction. The vast majority of metrics employed for evaluating nonlinear image reconstruction are based on some form of global image fidelity, such as image root mean square error (RMSE). Use of such metrics can lead to overregularization in the sense that they can favor removal of subtle details in the image. To address this shortcoming, we develop an image quality metric based on signal detection that serves as a surrogate to the qualitative loss of fine image details. The metric is demonstrated in the context of a breast CT simulation, where different equal-dose configurations are considered. The configurations differ in the number of projections acquired. Image reconstruction is performed with a nonlinear algorithm based on total variation constrained least-squares (TV-LSQ). The resulting images are studied as a function of three parameters: number of views acquired, total variation constraint value, and number of iterations. The images are evaluated visually, with image RMSE, and with the proposed signal-detection-based metric. The latter uses a small signal, and computes detectability in the sinogram and in the reconstructed image. Loss of signal detectability through the image reconstruction process is taken as a quantitative measure of loss of fine details in the image. Loss of signal detectability is seen to correlate well with the blocky or patchy appearance due to overregularization with TV-LSQ, and this trend runs counter to the image RMSE metric, which tends to favor the over-regularized images. The proposed signal detection-based metric provides an image quality assessment that is complimentary to that of image RMSE. Using the two metrics in concert may yield a useful prescription for determining CT algorithm and configuration parameters when nonlinear image reconstruction is used.

Collapse

Lago MA, Abbey CK, Eckstein MP. Foveated Model Observers for Visual Search in 3D Medical Images. IEEE TRANSACTIONS ON MEDICAL IMAGING 2021;40:1021-1031. [PMID: 33315556 PMCID: PMC7994931 DOI: 10.1109/tmi.2020.3044530] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

O'Neill AG, Valdez EL, Lingala SG, Pineda AR. Modeling human observer detection in undersampled magnetic resonance imaging (MRI). PROCEEDINGS OF SPIE--THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING 2021;11599:115990H. [PMID: 36267661 PMCID: PMC9579838 DOI: 10.1117/12.2581076] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Lacy T, Ding A, Minkemeyer V, Frush D, Samei E. Patient-based Performance Assessment for Pediatric Abdominal CT: An Automated Monitoring System Based on Lesion Detectability and Radiation Dose. Acad Radiol 2021;28:217-224. [PMID: 32063494 DOI: 10.1016/j.acra.2020.01.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 01/18/2020] [Accepted: 01/18/2020] [Indexed: 12/12/2022]

Han M, Baek J. A convolutional neural network-based anthropomorphic model observer for signal-known-statistically and background-known-statistically detection tasks. ACTA ACUST UNITED AC 2020;65:225025. [DOI: 10.1088/1361-6560/abbf9d] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Obuchowicz R, Oszust M, Piorkowski A. Interobserver variability in quality assessment of magnetic resonance images. BMC Med Imaging 2020;20:109. [PMID: 32962651 PMCID: PMC7509933 DOI: 10.1186/s12880-020-00505-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 09/01/2020] [Indexed: 11/10/2022] Open

Abstract

Background

The perceptual quality of magnetic resonance (MR) images influences diagnosis and may compromise the treatment. The purpose of this study was to evaluate how the image quality changes influence the interobserver variability of their assessment.

Methods

For the variability evaluation, a dataset containing distorted MRI images was prepared and then assessed by 31 experienced medical professionals (radiologists). Differences between observers were analyzed using the Fleiss’ kappa. However, since the kappa evaluates the agreement among radiologists taking into account aggregated decisions, a typically employed criterion of the image quality assessment (IQA) performance was used to provide a more thorough analysis. The IQA performance of radiologists was evaluated by comparing the Spearman correlation coefficients, ρ, between individual scores with the mean opinion scores (MOS) composed of the subjective opinions of the remaining professionals.

Results

The experiments show that there is a significant agreement among radiologists (κ=0.12; 95% confidence interval [CI]: 0.118, 0.121; P<0.001) on the quality of the assessed images. The resulted κ is strongly affected by the subjectivity of the assigned scores, separately presenting close scores. Therefore, the ρ was used to identify poor performance cases and to confirm the consistency of the majority of collected scores (ρ_mean = 0.5706). The results for interns (ρ_mean = 0.6868) supports the finding that the quality assessment of MR images can be successfully taught.

Conclusions

The agreement observed among radiologists from different imaging centers confirms the subjectivity of the perception of MR images. It was shown that the image content and severity of distortions affect the IQA. Furthermore, the study highlights the importance of the psychosomatic condition of the observers and their attitude.

Collapse

Tseng HW, Vedantham S, Cho SH, Karellas A. Joint Optimization of Collimator and Reconstruction Parameters in X-Ray Fluorescence Computed Tomography Using Analytical Point Spread Function and Model Observer. IEEE Trans Biomed Eng 2020;67:2443-2452. [PMID: 31899411 PMCID: PMC7326652 DOI: 10.1109/tbme.2019.2963040] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Abadi E, Segars WP, Tsui BMW, Kinahan PE, Bottenus N, Frangi AF, Maidment A, Lo J, Samei E. Virtual clinical trials in medical imaging: a review. J Med Imaging (Bellingham) 2020;7:042805. [PMID: 32313817 PMCID: PMC7148435 DOI: 10.1117/1.jmi.7.4.042805] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 03/23/2020] [Indexed: 12/13/2022] Open

Zeng R, Samuelson FW, Sharma D, Badal A, Christian GG, Glick SJ, Myers KJ, Badano A. Computational reader design and statistical performance evaluation of an in-silico imaging clinical trial comparing digital breast tomosynthesis with full-field digital mammography. J Med Imaging (Bellingham) 2020;7:042802. [PMID: 32118094 PMCID: PMC7043285 DOI: 10.1117/1.jmi.7.4.042802] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 01/07/2020] [Indexed: 01/15/2023] Open

Abstract

A recent study reported on an in-silico imaging trial that evaluated the performance of digital breast tomosynthesis (DBT) as a replacement for full-field digital mammography (FFDM) for breast cancer screening. In this in-silico trial, the whole imaging chain was simulated, including the breast phantom generation, the x-ray transport process, and computational readers for image interpretation. We focus on the design and performance characteristics of the computational reader in the above-mentioned trial. Location-known lesion (spiculated mass and clustered microcalcifications) detection tasks were used to evaluate the imaging system performance. The computational readers were designed based on the mechanism of a channelized Hotelling observer (CHO), and the reader models were selected to trend human performance. Parameters were tuned to ensure stable lesion detectability. A convolutional CHO that can adapt a round channel function to irregular lesion shapes was compared with the original CHO and was found to be suitable for detecting clustered microcalcifications but was less optimal in detecting spiculated masses. A three-dimensional CHO that operated on the multiple slices was compared with a two-dimensional (2-D) CHO that operated on three versions of 2-D slabs converted from the multiple slices and was found to be optimal in detecting lesions in DBT. Multireader multicase reader output analysis was used to analyze the performance difference between FFDM and DBT for various breast and lesion types. The results showed that DBT was more beneficial in detecting masses than detecting clustered microcalcifications compared with FFDM, consistent with the finding in a clinical imaging trial. Statistical uncertainty smaller than 0.01 standard error for the estimated performance differences was achieved with a dataset containing approximately 3000 breast phantoms. The computational reader design methodology presented provides evidence that model observers can be useful in-silico tools for supporting the performance comparison of breast imaging systems.

Collapse

Vaishnav JY, Ghammraoui B, Leifer M, Zeng R, Jiang L, Myers KJ. CT metal artifact reduction algorithms: Toward a framework for objective performance assessment. Med Phys 2020;47:3344-3355. [PMID: 32406534 PMCID: PMC7496341 DOI: 10.1002/mp.14231] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 04/05/2020] [Accepted: 04/29/2020] [Indexed: 12/26/2022] Open

Abstract

Purpose

Although several metal artifact reduction (MAR) algorithms for computed tomography (CT) scanning are commercially available, no quantitative, rigorous, and reproducible method exists for assessing their performance. The lack of assessment methods poses a challenge to regulators, consumers, and industry. We explored a phantom‐based framework for assessing an important aspect of MAR performance: how applying MAR in the presence of metal affects model observer performance at a low‐contrast detectability (LCD) task This work is, to our knowledge, the first model observer–based framework for the evaluation of MAR algorithms in the published literature.

Methods

We designed a numerical head phantom with metal implants. In order to incorporate an element of randomness, the phantom included a rotatable inset with an inhomogeneous background. We generated simulated projection data for the phantom. We applied two variants of a simple MAR algorithm, sinogram inpainting, to the projection data, that we reconstructed using filtered backprojection. To assess how MAR affected observer performance, we examined the detectability of a signal at the center of a region of interest (ROI) by a channelized Hotelling observer (CHO). As a figure of merit, we used the area under the ROC curve (AUC).

Results

We used simulation to test our framework on two variants of the MAR technique of sinogram inpainting. We found that our method was able to resolve the difference in two different MAR algorithms’ effect on LCD task performance, as well as the difference in task performances when MAR was applied, vs not.

Conclusion

We laid out a phantom‐based framework for objective assessment of how MAR impacts low‐contrast detectability, that we tested on two MAR algorithms. Our results demonstrate the importance of testing MAR performance over a range of object and imaging parameters, since applying MAR does not always improve the quality of an image for a given diagnostic task. Our framework is an initial step toward developing a more comprehensive objective assessment method for MAR, which would require developing additional phantoms and methods specific to various clinical applications of MAR, and increasing study efficiency.

Collapse

Anton M, Veldkamp WJH, Hernandez-Giron I, Elster C. RDI[Formula: see text]a regression detectability index for quality assurance in: x-ray imaging. Phys Med Biol 2020;65:085017. [PMID: 32109907 DOI: 10.1088/1361-6560/ab7b2e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Lee C, Han M, Baek J. Human observer performance on in-plane digital breast tomosynthesis images: Effects of reconstruction filters and data acquisition angles on signal detection. PLoS One 2020;15:e0229915. [PMID: 32163472 PMCID: PMC7067468 DOI: 10.1371/journal.pone.0229915] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 02/17/2020] [Indexed: 11/29/2022] Open

Ortenzia O, Trojani V, Bertolini M, Nitrosi A, Iori M, Ghetti C. Radiation dose reduction and static image quality assessment using a channelized hotelling observer on an angiography system upgraded with clarity IQ. Biomed Phys Eng Express 2020;6:025008. [PMID: 33438634 DOI: 10.1088/2057-1976/ab73f6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abbey CK, Samuelson FW, Zeng R, Boone JM, Eckstein MP, Myers KJ. Human observer templates for lesion discrimination tasks. PROCEEDINGS OF SPIE--THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING 2020;11316:113160U. [PMID: 33384465 PMCID: PMC7773010 DOI: 10.1117/12.2549119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Viswanath V, Daube Witherspoon ME, Karp JS, Surti S. Numerical observer study of lesion detectability for a long axial field-of-view whole-body PET imager using the PennPET Explorer. Phys Med Biol 2020;65:035002. [PMID: 31816616 PMCID: PMC7261597 DOI: 10.1088/1361-6560/ab6011] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Samei E, Bakalyar D, Boedeker KL, Brady S, Fan J, Leng S, Myers KJ, Popescu LM, Ramirez Giraldo JC, Ranallo F, Solomon J, Vaishnav J, Wang J. Performance evaluation of computed tomography systems: Summary of AAPM Task Group 233. Med Phys 2019;46:e735-e756. [DOI: 10.1002/mp.13763] [Citation(s) in RCA: 80] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 07/30/2019] [Accepted: 08/08/2019] [Indexed: 11/09/2022] Open

Kim B, Han M, Shim H, Baek J. A performance comparison of convolutional neural network-based image denoising methods: The effect of loss functions on low-dose CT images. Med Phys 2019;46:3906-3923. [PMID: 31306488 DOI: 10.1002/mp.13713] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 07/03/2019] [Accepted: 07/05/2019] [Indexed: 02/04/2023] Open

Abstract

PURPOSE

Convolutional neural network (CNN)-based image denoising techniques have shown promising results in low-dose CT denoising. However, CNN often introduces blurring in denoised images when trained with a widely used pixel-level loss function. Perceptual loss and adversarial loss have been proposed recently to further improve the image denoising performance. In this paper, we investigate the effect of different loss functions on image denoising performance using task-based image quality assessment methods for various signals and dose levels.

METHODS

We used a modified version of U-net that was effective at reducing the correlated noise in CT images. The loss functions used for comparison were two pixel-level losses (i.e., the mean-squared error and the mean absolute error), Visual Geometry Group network-based perceptual loss (VGG loss), adversarial loss used to train the Wasserstein generative adversarial network with gradient penalty (WGAN-GP), and their weighted summation. Each image denoising method was applied to reconstructed images and sinogram images independently and validated using the extended cardiac-torso (XCAT) simulation and Mayo Clinic datasets. In the XCAT simulation, we generated fan-beam CT datasets with four different dose levels (25%, 50%, 75%, and 100% of a normal-dose level) using 10 XCAT phantoms and inserted signals in a test set. The signals had two different shapes (spherical and spiculated), sizes (4 and 12 mm), and contrast levels (60 and 160 HU). To evaluate signal detectability, we used a detection task SNR (tSNR) calculated from a non-prewhitening model observer with an eye filter. We also measured the noise power spectrum (NPS) and modulation transfer function (MTF) to compare the noise and signal transfer properties.

RESULTS

Compared to CNNs without VGG loss, VGG-loss-based CNNs achieved a more similar tSNR to that of the normal-dose CT for all signals at different dose levels except for a small signal at the 25% dose level. For a low-contrast signal at 25% or 50% dose, adding other losses to the VGG loss showed more improved performance than only using VGG loss. The NPS shapes from VGG-loss-based CNN closely matched that of normal-dose CT images while CNN without VGG loss overly reduced the mid-high-frequency noise power at all dose levels. MTF also showed VGG-loss-based CNN with better-preserved high resolution for all dose and contrast levels. It is also observed that additional WGAN-GP loss helps improve the noise and signal transfer properties of VGG-loss-based CNN.

CONCLUSIONS

The evaluation results using tSNR, NPS, and MTF indicate that VGG-loss-based CNNs are more effective than those without VGG loss for natural denoising of low-dose images and WGAN-GP loss improves the denoising performance of VGG-loss-based CNNs, which corresponds with the qualitative evaluation.

Collapse

Balta C, Bouwman RW, Broeders MJM, Karssemeijer N, Veldkamp WJH, Sechopoulos I, van Engen RE. Optimization of the difference-of-Gaussian channel sets for the channelized Hotelling observer. J Med Imaging (Bellingham) 2019;6:035501. [PMID: 31572746 PMCID: PMC6763759 DOI: 10.1117/1.jmi.6.3.035501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 08/30/2019] [Indexed: 10/15/2023] Open

Han M, Baek J. A performance comparison of anthropomorphic model observers for breast cone beam CT images: A single‐slice and multislice study. Med Phys 2019;46:3431-3441. [DOI: 10.1002/mp.13598] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Revised: 04/22/2019] [Accepted: 05/13/2019] [Indexed: 12/28/2022] Open

Channelized Hotelling observer correlation with human observers for low-contrast detection in liver CT images. J Med Imaging (Bellingham) 2019;6:025501. [DOI: 10.1117/1.jmi.6.2.025501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2018] [Accepted: 04/15/2019] [Indexed: 11/14/2022] Open

Bertolini M, Trojani V, Nitrosi A, Iori M, Sassatelli R, Ortenzia O, Ghetti C. Characterization of GE discovery IGS 740 angiography system by means of channelized Hotelling observer (CHO). ACTA ACUST UNITED AC 2019;64:095002. [DOI: 10.1088/1361-6560/ab144c] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Gong H, Yu L, Leng S, Dilger SK, Ren L, Zhou W, Fletcher JG, McCollough CH. A deep learning- and partial least square regression-based model observer for a low-contrast lesion detection task in CT. Med Phys 2019;46:2052-2063. [PMID: 30889282 DOI: 10.1002/mp.13500] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Revised: 02/15/2019] [Accepted: 03/12/2019] [Indexed: 12/27/2022] Open

Abstract

PURPOSE

This work aims to develop a new framework of image quality assessment using deep learning-based model observer (DL-MO) and to validate it in a low-contrast lesion detection task that involves CT images with patient anatomical background.

METHODS

The DL-MO was developed using the transfer learning strategy to incorporate a pretrained deep convolutional neural network (CNN), a partial least square regression discriminant analysis (PLS-DA) model and an internal noise component. The CNN was previously trained to achieve the state-of-the-art classification accuracy over a natural image database. The earlier layers of the CNN were used as a deep feature extractor, with the assumption that similarity exists between the CNN and the human visual system. The PLSR model was used to further engineer the deep feature for the lesion detection task in CT images. The internal noise component was incorporated to model the inefficiency and variability of human observer (HO) performance, and to generate the ultimate DL-MO test statistics. Seven abdominal CT exams were retrospectively collected from the same type of CT scanners. To compare DL-MO with HO, 12 experimental conditions with varying lesion size, lesion contrast, radiation dose, and reconstruction types were generated, each condition with 154 trials. CT images of a real liver metastatic lesion were numerically modified to generate lesion models with four lesion sizes (5, 7, 9, and 11 mm) and three contrast levels (15, 20, and 25 HU). The lesions were inserted into patient liver images using a projection-based method. A validated noise insertion tool was used to synthesize CT exams with 50% and 25% of routine radiation dose level. CT images were reconstructed using the weighted filtered back projection algorithm and an iterative reconstruction algorithm. Four medical physicists performed a two-alternative forced choice (2AFC) detection task (with multislice scrolling viewing mode) on patient images across the 12 experimental conditions. DL-MO was operated on the same datasets. Statistical analyses were performed to evaluate the correlation and agreement between DL-MO and HO.

RESULTS

A statistically significant positive correlation was observed between DL-MO and HO for the 2AFC low-contrast detection task that involves patient liver background. The corresponding Pearson product moment correlation coefficient was 0.986 [95% confidence interval (0.950, 0.996)]. Bland-Altman agreement analysis did not indicate statistically significant differences.

CONCLUSIONS

The proposed DL-MO is highly correlated with HO in a low-contrast detection task that involves realistic patient liver background. This study demonstrated the potential of the proposed DL-MO to assess image quality directly based on patient images in realistic, clinically relevant CT tasks.

Collapse