1
|
Dong Y, Xiong R, Zhao J, Zhang J, Fan X, Zhu S, Huang T. Learning a Deep Demosaicing Network for Spike Camera With Color Filter Array. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2024; 33:3634-3647. [PMID: 38809732 DOI: 10.1109/tip.2024.3403050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
For capturing dynamic scenes with ultra-fast motion, neuromorphic cameras with extremely high temporal resolution have demonstrated their great capability and potential. Different from the event cameras that only record relative changes in light intensity, spike camera fires a stream of spikes according to a full-time accumulation of photons so that it can recover the texture details for both static areas and dynamic areas. Recently, color spike camera has been invented to record color information of dynamic scenes using a color filter array (CFA). However, demosaicing for color spike cameras is an open and challenging problem. In this paper, we develop a demosaicing network, called CSpkNet, to reconstruct dynamic color visual signals from the spike stream captured by the color spike camera. Firstly, we develop a light inference module to convert binary spike streams to intensity estimates. In particular, a feature-based channel attention module is proposed to reduce the noises caused by quantization errors. Secondly, considering both the Bayer configuration and object motion, we propose a motion-guided filtering module to estimate the missing pixels of each color channel, without undesired motion blur. Finally, we design a refinement module to improve the intensity and details, utilizing the color correlation. Experimental results demonstrate that CSpkNet can reconstruct color images from the Bayer-pattern spike stream with promising visual quality.
Collapse
|
2
|
Nogami H, Kanetaka Y, Naganawa Y, Maeda Y, Fukushima N. Decomposed Multilateral Filtering for Accelerating Filtering with Multiple Guidance Images. SENSORS (BASEL, SWITZERLAND) 2024; 24:633. [PMID: 38276325 PMCID: PMC10820609 DOI: 10.3390/s24020633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/08/2024] [Accepted: 01/17/2024] [Indexed: 01/27/2024]
Abstract
This paper proposes an efficient algorithm for edge-preserving filtering with multiple guidance images, so-called multilateral filtering. Multimodal signal processing for sensor fusion is increasingly important in image sensing. Edge-preserving filtering is available for various sensor fusion applications, such as estimating scene properties and refining inverse-rendered images. The main application is joint edge-preserving filtering, which can preferably reflect the edge information of a guidance image from an additional sensor. The drawback of edge-preserving filtering lies in its long computational time; thus, many acceleration methods have been proposed. However, most accelerated filtering cannot handle multiple guidance information well, although the multiple guidance information provides us with various benefits. Therefore, we extend the efficient edge-preserving filters so that they can use additional multiple guidance images. Our algorithm, named decomposes multilateral filtering (DMF), can extend the efficient filtering methods to the multilateral filtering method, which decomposes the filter into a set of constant-time filtering. Experimental results show that our algorithm performs efficiently and is sufficient for various applications.
Collapse
Affiliation(s)
- Haruki Nogami
- Department of Computer Science, Faculty of Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan (Y.K.)
| | - Yamato Kanetaka
- Department of Computer Science, Faculty of Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan (Y.K.)
| | - Yuki Naganawa
- Department of Computer Science, Faculty of Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan (Y.K.)
| | - Yoshihiro Maeda
- Department of Electrical Engineering, Faculty of Engineering, Tokyo University of Science, Tokyo 125-8585, Japan;
| | - Norishige Fukushima
- Department of Computer Science, Faculty of Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan (Y.K.)
| |
Collapse
|
3
|
Stojkovic A, Aelterman J, Van Hamme D, Shopovska I, Philips W. Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems. SENSORS (BASEL, SWITZERLAND) 2023; 23:8507. [PMID: 37896600 PMCID: PMC10611084 DOI: 10.3390/s23208507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2023] [Revised: 09/27/2023] [Accepted: 10/09/2023] [Indexed: 10/29/2023]
Abstract
High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes, and (HDR) object perception techniques that can deal with this variety in illumination is highly desirable. Although progress has been made in both HDR imaging solutions and object detection algorithms in the recent years, they have progressed independently of each other. This has led to a situation in which object detection algorithms are typically designed and constantly improved to operate on 8 bit per channel content. This makes these algorithms not ideally suited for use in HDR data processing, which natively encodes to a higher bit-depth (12 bits/16 bits per channel). In this paper, we present and evaluate two novel convolutional neural network (CNN) architectures that intelligently convert high bit depth HDR images into 8-bit images. We attempt to optimize reconstruction quality by focusing on ADS object detection quality. The first research novelty is to jointly perform tone-mapping with demosaicing by additionally successfully suppressing noise and demosaicing artifacts. The first CNN performs tone-mapping with noise suppression on a full-color HDR input, while the second performs joint demosaicing and tone-mapping with noise suppression on a raw HDR input. The focus is to increase the detectability of traffic-related objects in the reconstructed 8-bit content, while ensuring that the realism of the standard dynamic range (SDR) content in diverse conditions is preserved. The second research novelty is that for the first time, to the best of our knowledge, a thorough comparative analysis against the state-of-the-art tone-mapping and demosaicing methods is performed with respect to ADS object detection accuracy on traffic-related content that abounds with diverse challenging (i.e., boundary cases) scenes. The evaluation results show that the two proposed networks have better performance in object detection accuracy and image quality, than both SDR content and content obtained with the state-of-the-art tone-mapping and demosaicing algorithms.
Collapse
|
4
|
Huang F, Chen Y, Wang X, Wang S, Wu X. Spectral Clustering Super-Resolution Imaging Based on Multispectral Camera Array. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023; 32:1257-1271. [PMID: 37022799 DOI: 10.1109/tip.2023.3242589] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Although multispectral and hyperspectral imaging acquisitions are applied in numerous fields, the existing spectral imaging systems suffer from either low temporal or spatial resolution. In this study, a new multispectral imaging system-camera array based multispectral super resolution imaging system (CAMSRIS) is proposed that can simultaneously achieve multispectral imaging with high temporal and spatial resolutions. The proposed registration algorithm is used to align pairs of different peripheral and central view images. A novel, super-resolution, spectral-clustering-based image reconstruction algorithm was developed for the proposed CAMSRIS to improve the spatial resolution of the acquired images and preserve the exact spectral information without introducing false information. The reconstructed results showed that the spatial and spectral quality and operational efficiency of the proposed system are better than those of a multispectral filter array (MSFA) based on different multispectral datasets. The PSNR of the multispectral super-resolution images obtained by the proposed method were respectively higher by 2.03 and 1.93 dB than those of GAP-TV and DeSCI, and the execution time was significantly shortened by approximately 54.55 s and 9820.19 s when the CAMSI dataset was used. The feasibility of the proposed system was verified in practical applications based on different scenes captured by the self-built system.
Collapse
|
5
|
Papanikolaou A, Garbat P, Kujawinska M. Metrological Evaluation of the Demosaicking Effect on Colour Digital Image Correlation with Application in Monitoring of Paintings. SENSORS (BASEL, SWITZERLAND) 2022; 22:7359. [PMID: 36236458 PMCID: PMC9573450 DOI: 10.3390/s22197359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 09/22/2022] [Accepted: 09/23/2022] [Indexed: 06/16/2023]
Abstract
A modified 3D colour digital image correlation method (3D cDIC) is proposed for efficient displacement measurements of colour objects with natural texture. The method is using a separate analysis of correlation coefficient (sigma) value in the RGB channels of CCD cameras by utilising local information from the channel with the minimum sigma. In this way, merged U, V and W displacement maps are generated based on the local correlation quality. As the proposed method applies to colour filter array cameras, the images in RGB channels have to undergo a demosaicking procedure which directly influences the accuracy of displacement measurements. In the paper, the best performing demosaicking methods are selected. The metrological analysis of their influence on the results of canvas paintings investigations obtained by unmodified and modified 3D cDIC processing is presented.
Collapse
Affiliation(s)
| | - Piotr Garbat
- Faculty of Electronics and Information Technology, Warsaw University of Technology, 00-665 Warsaw, Poland
| | | |
Collapse
|
6
|
Liu X, Yang L, Wang L. Modified Newton-residual interpolation for division of focal plane polarization image demosaicking. OPTICS EXPRESS 2022; 30:33048-33067. [PMID: 36242354 DOI: 10.1364/oe.460495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 07/08/2022] [Indexed: 06/16/2023]
Abstract
With the improvement of semiconductor processing technology, polarization sensors using division of focal plane have gradually become the mainstream method of polarization imaging. Similar to the color restoration method of the Bayer array sensor, the spatial information of polarized image is also recovered through the polarization demosaicking algorithm. In this paper, we propose a new modified Newton-residual interpolation polarization image demosaicking algorithm based on residual interpolation, which is suitable for a monochrome or color polarization filter array. First, we use the modified Newton interpolation method to generate edge-sensitive guiding images. Then, we carry out the improvement of the guide process during the residual interpolation by performing variance statistics on the local window image in the guiding process, so that the edges and flat image blocks have different guiding weights. Finally, we obtain edge-preserving results by applying these two improvements, which reduces the zipper effect and edge confusion. We compare the results of various algorithms on experimental data, demonstrating that our algorithm has impactful improvements in the evaluation metrics based on the ground-truth images.
Collapse
|
7
|
Yang J, Jin W, Qiu S, Xue F, Wang M. Residual Interpolation Integrated Pixel-by-Pixel Adaptive Iterative Process for Division of Focal Plane Polarimeters. SENSORS 2022; 22:s22041529. [PMID: 35214435 PMCID: PMC8874419 DOI: 10.3390/s22041529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Revised: 02/08/2022] [Accepted: 02/09/2022] [Indexed: 11/16/2022]
Abstract
Residual interpolations are effective methods to reduce the instantaneous field-of-view error of division of focal plane (DoFP) polarimeters. However, their guide-image selection strategies are improper, and do not consider the DoFP polarimeters’ spatial sampling modes. Thus, we propose a residual interpolation method with a new guide-image selection strategy based on the spatial layout of the pixeled polarizer array to improve the sampling rate of the guide image. The interpolation performance is also improved by the proposed pixel-by-pixel, adaptive iterative process and the weighted average fusion of the results of the minimized residual and minimized Laplacian energy guide filters. Visual and objective evaluations demonstrate the proposed method’s superiority to the existing state-of-the-art methods. The proposed method proves that considering the spatial layout of the pixeled polarizer array on the physical level is vital to improving the performance of interpolation methods for DoFP polarimeters.
Collapse
|
8
|
Hounsou N, Sanda Mahama AT, Gouton P. Extension of luminance component based demosaicking algorithm to 4- and 5-band multispectral images. ARRAY 2021. [DOI: 10.1016/j.array.2021.100088] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
9
|
Bian L, Wang Y, Zhang J. Generalized MSFA Engineering With Structural and Adaptive Nonlocal Demosaicing. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:7867-7877. [PMID: 34487494 DOI: 10.1109/tip.2021.3108913] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The emerging multispectral-filter-array (MSFA) cameras require generalized demosaicing for MSFA engineering. The existing interpolation, compressive sensing and deep learning based methods suffer from either limited reconstruction accuracy or poor generalization. In this work, we report a generalized demosaicing method with structural and adaptive nonlocal optimization, enabling boosted reconstruction accuracy for different MSFAs. The advantages lie in the following three aspects. First, the nonlocal low-rank optimization is applied and extended to the multiple spatial-spectral-temporal dimensions to exploit more crucial details. Second, the block matching accuracy is promoted by employing a novel structural similarity metric instead of the conventional Euclidean distance. Third, the running efficiency is boosted by an adaptive iteration strategy. We built a prototype system to capture raw mosaic images under different MSFAs, and used the technique as an off-the-shelf tool to demonstrate MSFA engineering. The experiments show that the binary tree (BT) based filter array produces higher accuracy than the random and regular ones for different number of channels.
Collapse
|
10
|
A Brief Review of Some Interesting Mars Rover Image Enhancement Projects. COMPUTERS 2021. [DOI: 10.3390/computers10090111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The Curiosity rover has landed on Mars since 2012. One of the instruments onboard the rover is a pair of multispectral cameras known as Mastcams, which act as eyes of the rover. In this paper, we summarize our recent studies on some interesting image processing projects for Mastcams. In particular, we will address perceptually lossless compression of Mastcam images, debayering and resolution enhancement of Mastcam images, high resolution stereo and disparity map generation using fused Mastcam images, and improved performance of anomaly detection and pixel clustering using combined left and right Mastcam images. The main goal of this review paper is to raise public awareness about these interesting Mastcam projects and also stimulate interests in the research community to further develop new algorithms for those applications.
Collapse
|
11
|
Wang S, Zhao M, Dou R, Yu S, Liu L, Wu N. A Compact High-Quality Image Demosaicking Neural Network for Edge-Computing Devices. SENSORS 2021; 21:s21093265. [PMID: 34066794 PMCID: PMC8125912 DOI: 10.3390/s21093265] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 05/03/2021] [Accepted: 05/05/2021] [Indexed: 11/16/2022]
Abstract
Image demosaicking has been an essential and challenging problem among the most crucial steps of image processing behind image sensors. Due to the rapid development of intelligent processors based on deep learning, several demosaicking methods based on a convolutional neural network (CNN) have been proposed. However, it is difficult for their networks to run in real-time on edge computing devices with a large number of model parameters. This paper presents a compact demosaicking neural network based on the UNet++ structure. The network inserts densely connected layer blocks and adopts Gaussian smoothing layers instead of down-sampling operations before the backbone network. The densely connected blocks can extract mosaic image features efficiently by utilizing the correlation between feature maps. Furthermore, the block adopts depthwise separable convolutions to reduce the model parameters; the Gaussian smoothing layer can expand the receptive fields without down-sampling image size and discarding image information. The size constraints on the input and output images can also be relaxed, and the quality of demosaicked images is improved. Experiment results show that the proposed network can improve the running speed by 42% compared with the fastest CNN-based method and achieve comparable reconstruction quality as it on four mainstream datasets. Besides, when we carry out the inference processing on the demosaicked images on typical deep CNN networks, Mobilenet v1 and SSD, the accuracy can also achieve 85.83% (top 5) and 75.44% (mAP), which performs comparably to the existing methods. The proposed network has the highest computing efficiency and lowest parameter number through all methods, demonstrating that it is well suitable for applications on modern edge computing devices.
Collapse
Affiliation(s)
- Shuyu Wang
- State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China; (S.W.); (M.Z.); (R.D.); (S.Y.); (N.W.)
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mingxin Zhao
- State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China; (S.W.); (M.Z.); (R.D.); (S.Y.); (N.W.)
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Runjiang Dou
- State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China; (S.W.); (M.Z.); (R.D.); (S.Y.); (N.W.)
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shuangming Yu
- State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China; (S.W.); (M.Z.); (R.D.); (S.Y.); (N.W.)
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liyuan Liu
- State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China; (S.W.); (M.Z.); (R.D.); (S.Y.); (N.W.)
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
- Correspondence:
| | - Nanjian Wu
- State Key Laboratory of Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China; (S.W.); (M.Z.); (R.D.); (S.Y.); (N.W.)
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
- Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Beijing 100083, China
| |
Collapse
|
12
|
Genser N, Seiler J, Kaup A. Camera Array for Multi-Spectral Imaging. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; PP:9234-9249. [PMID: 32970597 DOI: 10.1109/tip.2020.3024738] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Recently, many new applications arose for multispectral and hyper-spectral imaging. Besides modern biometric systems for identity verification, also agricultural and medical applications came up, which measure the health condition of plants and humans. Despite the growing demand, the acquisition of multi-spectral data is up to the present complicated. Often, expensive, inflexible, or low resolution acquisition setups are only obtainable for specific professional applications. To overcome these limitations, a novel camera array for multi-spectral imaging is presented in this article for generating consistent multispectral videos. As differing spectral images are acquired at various viewpoints, a geometrically constrained multi-camera sensor layout is introduced, which enables the formulation of novel registration and reconstruction algorithms to globally set up robust models. On average, the novel acquisition approach achieves a gain of 2.5 dB PSNR compared to recently published multi-spectral filter array imaging systems. At the same time, the proposed acquisition system ensures not only a superior spatial, but also a high spectral, and temporal resolution, while filters are flexibly exchangeable by the user depending on the application. Moreover, depth information is generated, so that 3D imaging applications, e.g., for augmented or virtual reality, become possible. The proposed camera array for multi-spectral imaging can be set up using off-the-shelf hardware, which allows for a compact design and employment in, e.g., mobile devices or drones, while being cost-effective.
Collapse
|
13
|
Kim H, Lee S, Kang MG. Demosaicing of RGBW Color Filter Array Based on Rank Minimization with Colorization Constraint. SENSORS 2020; 20:s20164458. [PMID: 32785041 PMCID: PMC7472487 DOI: 10.3390/s20164458] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 08/07/2020] [Accepted: 08/07/2020] [Indexed: 11/16/2022]
Abstract
Recently, the white (w) channel has been incorporated in various forms into color filter
arrays (CFAs). The advantage of using theWchannel is thatWpixels have less noise than red (R),
green (G), or blue (B) (RGB) pixels; therefore, under low-light conditions, pixels with high fidelity
can be obtained. However, RGBW CFAs normally suffer from spatial resolution degradation due
to a smaller number of color pixels than in RGB CFAs. Therefore, even though the reconstructed
colors have higher sensitivity, which results in larger Color Peak Signal-to-Noise Ratio (CPSNR)
values, there are some color aliasing artifacts due to a low resolution. In this paper, we propose a
rank minimization-based color interpolation method with a colorization constraint for the RGBW
format with a large number ofWpixels. The rank minimization can achieve a broad interpolation
and preserve the structure in the image, and it thereby eliminates the color artifacts. However, the
colors fade from this global process. Therefore, we further incorporate a colorization constraint into
the rank minimization process for the better reproduction of the colors. The experimental results
show that the images can be reconstructed well, even from noisy pattern images obtained under
low-light conditions.
Collapse
Affiliation(s)
- Hansol Kim
- School of Electrical and Electronic Engineering, Yonsei University, Seoul 03722, Korea
| | - Sukho Lee
- Division of Computer Engineering, Dongseo University, Busan 47011, Korea
| | - Moon Gi Kang
- School of Electrical and Electronic Engineering, Yonsei University, Seoul 03722, Korea
| |
Collapse
|
14
|
Kwan C, Larkin J, Ayhan B. Demosaicing of CFA 3.0 with Applications to Low Lighting Images. SENSORS (BASEL, SWITZERLAND) 2020; 20:E3423. [PMID: 32560500 PMCID: PMC7349740 DOI: 10.3390/s20123423] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 06/01/2020] [Accepted: 06/15/2020] [Indexed: 11/16/2022]
Abstract
Low lighting images usually contain Poisson noise, which is pixel amplitude-dependent. More panchromatic or white pixels in a color filter array (CFA) are believed to help the demosaicing performance in dark environments. In this paper, we first introduce a CFA pattern known as CFA 3.0 that has 75% white pixels, 12.5% green pixels, and 6.25% of red and blue pixels. We then present algorithms to demosaic this CFA, and demonstrate its performance for normal and low lighting images. In addition, a comparative study was performed to evaluate the demosaicing performance of three CFAs, namely the Bayer pattern (CFA 1.0), the Kodak CFA 2.0, and the proposed CFA 3.0. Using a clean Kodak dataset with 12 images, we emulated low lighting conditions by introducing Poisson noise into the clean images. In our experiments, normal and low lighting images were used. For the low lighting conditions, images with signal-to-noise (SNR) of 10 dBs and 20 dBs were studied. We observed that the demosaicing performance in low lighting conditions was improved when there are more white pixels. Moreover, denoising can further enhance the demosaicing performance for all CFAs. The most important finding is that CFA 3.0 performs better than CFA 1.0, but is slightly inferior to CFA 2.0, in low lighting images.
Collapse
Affiliation(s)
- Chiman Kwan
- Applied Research LLC; Rockville, MD 20850, USA; (J.L.); (B.A.)
| | | | | |
Collapse
|
15
|
Shopovska I, Jovanov L, Philips W. Efficient Training Procedures for Multi-Spectra Demosaicing. SENSORS (BASEL, SWITZERLAND) 2020; 20:s20102850. [PMID: 32429529 PMCID: PMC7287920 DOI: 10.3390/s20102850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 05/12/2020] [Accepted: 05/15/2020] [Indexed: 06/11/2023]
Abstract
The simultaneous acquisition of multi-spectral images on a single sensor can be efficiently performed by single shot capture using a mutli-spectral filter array. This paper focused on the demosaicing of color and near-infrared bands and relied on a convolutional neural network (CNN). To train the deep learning model robustly and accurately, it is necessary to provide enough training data, with sufficient variability. We focused on the design of an efficient training procedure by discovering an optimal training dataset. We propose two data selection strategies, motivated by slightly different concepts. The general term that will be used for the proposed models trained using data selection is data selection-based multi-spectral demosaicing (DSMD). The first idea is clustering-based data selection (DSMD-C), with the goal to discover a representative subset with a high variance so as to train a robust model. The second is an adaptive-based data selection (DSMD-A), a self-guided approach that selects new data based on the current model accuracy. We performed a controlled experimental evaluation of the proposed training strategies and the results show that a careful selection of data does benefit the speed and accuracy of training. We are still able to achieve high reconstruction accuracy with a lightweight model.
Collapse
|
16
|
Ni Z, Ma KK, Zeng H, Zhong B. Color Image Demosaicing Using Progressive Collaborative Representation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; 29:4952-4964. [PMID: 32149636 DOI: 10.1109/tip.2020.2975978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In this paper, a progressive collaborative representation (PCR) framework is proposed that is able to incorporate any existing color image demosaicing method for further boosting its demosaicing performance. Our PCR consists of two phases: (i) offline training and (ii) online refinement. In phase (i), multiple training-and-refining stages will be performed. In each stage, a new dictionary will be established through the learning of a large number of feature-patch pairs, extracted from the demosaicked images of the current stage and their corresponding original full-color images. After training, a projection matrix will be generated and exploited to refine the current demosaicked image. The updated image with improved image quality will be used as the input for the next training-and-refining stage and performed the same processing likewise. At the end of phase (i), all the projection matrices generated as above-mentioned will be exploited in phase (ii) to conduct online demosaicked image refinement of the test image. Extensive simulations conducted on two commonly-used test datasets (i.e., the IMAX and Kodak) for evaluating the demosaicing algorithms have clearly demonstrated that our proposed PCR framework is able to constantly boost the performance of any image demosaicing method we experimented, in terms of the objective and subjective performance evaluations.
Collapse
|
17
|
Stojkovic A, Shopovska I, Luong H, Aelterman J, Jovanov L, Philips W. The Effect of the Color Filter Array Layout Choice on State-of-the-Art Demosaicing. SENSORS (BASEL, SWITZERLAND) 2019; 19:E3215. [PMID: 31330923 PMCID: PMC6679506 DOI: 10.3390/s19143215] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 07/18/2019] [Accepted: 07/18/2019] [Indexed: 11/17/2022]
Abstract
Interpolation from a Color Filter Array (CFA) is the most common method for obtaining full color image data. Its success relies on the smart combination of a CFA and a demosaicing algorithm. Demosaicing on the one hand has been extensively studied. Algorithmic development in the past 20 years ranges from simple linear interpolation to modern neural-network-based (NN) approaches that encode the prior knowledge of millions of training images to fill in missing data in an inconspicious way. CFA design, on the other hand, is less well studied, although still recognized to strongly impact demosaicing performance. This is because demosaicing algorithms are typically limited to one particular CFA pattern, impeding straightforward CFA comparison. This is starting to change with newer classes of demosaicing that may be considered generic or CFA-agnostic. In this study, by comparing performance of two state-of-the-art generic algorithms, we evaluate the potential of modern CFA-demosaicing. We test the hypothesis that, with the increasing power of NN-based demosaicing, the influence of optimal CFA design on system performance decreases. This hypothesis is supported with the experimental results. Such a finding would herald the possibility of relaxing CFA requirements, providing more freedom in the CFA design choice and producing high-quality cameras.
Collapse
Affiliation(s)
- Ana Stojkovic
- TELIN-IPI, Ghent University-imec, 9000 Ghent, Belgium.
| | | | - Hiep Luong
- TELIN-IPI, Ghent University-imec, 9000 Ghent, Belgium
| | - Jan Aelterman
- TELIN-IPI, Ghent University-imec, 9000 Ghent, Belgium.
| | | | | |
Collapse
|
18
|
Weights-Based Image Demosaicking Using Posteriori Gradients and the Correlation of R–B Channels in High Frequency. Symmetry (Basel) 2019. [DOI: 10.3390/sym11050600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
In this paper, we propose a weights-based image demosaicking algorithm which is based on the Bayer pattern color filter array (CFA). When reconstructing the missing G components, the proposed algorithm uses weights based on posteriori gradients to mitigate color artifacts and distortions. Furthermore, the proposed algorithm makes full use of the correlation of R–B channels in high frequency when interpolating R/B values at B/R positions. Experimental results show that the proposed algorithm is superior to previous similar algorithms in composite peak signal-to-noise ratio (CPSNR) and subjective visual effect. The biggest advantage of the proposed algorithm is the use of posteriori gradients and the correlation of R–B channels in high frequency.
Collapse
|
19
|
Comparison of Deep Learning and Conventional Demosaicing Algorithms for Mastcam Images. ELECTRONICS 2019. [DOI: 10.3390/electronics8030308] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Bayer pattern filters have been used in many commercial digital cameras. In National Aeronautics and Space Administration’s (NASA) mast camera (Mastcam) imaging system, onboard the Mars Science Laboratory (MSL) rover Curiosity, a Bayer pattern filter is being used to capture the RGB (red, green, and blue) color of scenes on Mars. The Mastcam has two cameras: left and right. The right camera has three times better resolution than that of the left. It is well known that demosaicing introduces color and zipper artifacts. Here, we present a comparative study of demosaicing results using conventional and deep learning algorithms. Sixteen left and 15 right Mastcam images were used in our experiments. Due to a lack of ground truth images for Mastcam data from Mars, we compared the various algorithms using a blind image quality assessment model. It was observed that no one algorithm can work the best for all images. In particular, a deep learning-based algorithm worked the best for the right Mastcam images and a conventional algorithm achieved the best results for the left Mastcam images. Moreover, subjective evaluation of five demosaiced Mastcam images was also used to compare the various algorithms.
Collapse
|
20
|
Bayer Image Demosaicking Using Eight-Directional Weights Based on the Gradient of Color Difference. Symmetry (Basel) 2018. [DOI: 10.3390/sym10060222] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
21
|
An Effective Directional Residual Interpolation Algorithm for Color Image Demosaicking. APPLIED SCIENCES-BASEL 2018. [DOI: 10.3390/app8050680] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|