1
Zhang Y, Ding K, Li N, Wang H, Huang X, Kuo CCJ. Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression. IEEE Transactions on Image Processing 2023; 32:5933-5947. PMID: 37903048. DOI: 10.1109/TIP.2023.3327003.
Abstract
A dynamic point cloud is volumetric visual data that represents realistic 3D scenes for virtual reality and augmented reality applications. However, its large data volume is a bottleneck for processing, transmission, and storage, which calls for effective compression. In this paper, we propose a Perceptually Weighted Rate-Distortion Optimization (PWRDO) scheme for Video-based Point Cloud Compression (V-PCC), which aims to minimize the perceptual distortion of the reconstructed point cloud at a given bit rate. Firstly, we propose a general framework for perceptually optimized V-PCC that exploits visual redundancies in point clouds. Secondly, a multi-scale Projection-based Point Cloud quality Metric (PPCM) is proposed to measure the perceptual quality of 3D point clouds. The PPCM model comprises 3D-to-2D patch projection, multi-scale structural distortion measurement, and a fusion model. Approximations and simplifications of the PPCM are also presented for V-PCC integration and low complexity. Thirdly, based on the simplified PPCM model, we propose a PWRDO scheme with Lagrange multiplier adaptation, which is incorporated into V-PCC to enhance coding efficiency. Experimental results show that the proposed PPCM models can serve as standalone quality metrics and achieve higher consistency with human subjective scores than state-of-the-art objective visual quality metrics. Compared with the latest V-PCC reference model, the proposed PWRDO-based V-PCC scheme achieves average bit rate reductions of 13.52%, 8.16%, 10.56%, and 9.54% in terms of four objective visual quality metrics for point clouds, significantly outperforming state-of-the-art coding algorithms. The proposed PWRDO increases the computational complexity of the V-PCC encoder and decoder by only 1.71% and 0.05% on average, respectively, which is negligible. The source code of the PPCM and PWRDO schemes is available at https://github.com/VVCodec/PPCM-PWRDO.
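As a rough illustration of the weighting idea, the sketch below scores candidate coding modes with a perceptually weighted rate-distortion cost; the weight heuristic and the mode list are hypothetical stand-ins, not the paper's PPCM-derived weights.

```python
# Minimal sketch of perceptually weighted RD optimization (assumed heuristic,
# not the actual PWRDO scheme: real weights come from the PPCM metric).

def perceptual_weight(block_variance, mean_variance):
    """Toy weight: smooth blocks are treated as perceptually more sensitive."""
    return mean_variance / (block_variance + 1e-6)

def rd_cost(distortion, rate, lam, w):
    # Weighted cost J = w * D + lambda * R; dividing lambda by w is an
    # equivalent view, which is how Lagrange multiplier adaptation is realized.
    return w * distortion + lam * rate

def choose_mode(candidates, lam, w):
    """candidates: list of (mode, distortion, rate) triples."""
    return min(candidates, key=lambda c: rd_cost(c[1], c[2], lam, w))

if __name__ == "__main__":
    lam = 0.85
    w = perceptual_weight(block_variance=4.0, mean_variance=16.0)
    modes = [("intra", 120.0, 300.0), ("inter", 90.0, 520.0), ("skip", 200.0, 10.0)]
    print(choose_mode(modes, lam, w))  # picks the lowest weighted cost
```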
2
Wu C, He G, Lai X, Li Y. MPCNet: Compressed multi-view video restoration via motion-parallax complementation network. Neural Networks 2023; 167:601-614. PMID: 37713766. DOI: 10.1016/j.neunet.2023.08.037.
Abstract
The performance of existing learning-based methods in restoring compressed multi-view video (MVV) is limited because they utilize only information from temporally adjacent frames or parallax-neighboring views. However, the compression artifacts caused by multi-view coding (MVC) may stem from intra-frame, inter-frame, and inter-view reference errors. In this paper, by carefully exploiting stereo information from both the temporal and parallax domains, a motion-parallax complementation network (MPCNet) is proposed to restore the quality of compressed MVV more efficiently. First, we introduce a motion-parallax complementation strategy consisting of a coarse stage and a fine stage. By mutually compensating features extracted from multiple domains, useful multi-frame information is efficiently preserved and aggregated step by step. Second, an attention-based feature filtering and modulation module (AFFM) is proposed, which fuses two features efficiently by suppressing misleading information. Deploying it in most submodules of the proposed approach improves the representational ability of MPCNet and yields stronger restoration performance. Experimental results demonstrate the effectiveness of MPCNet, with average gains of 1.978 dB in PSNR and 0.0282 in MS-SSIM; the BD-rate reduction reaches 47.342% on average. Subjective quality is greatly improved, and many compression distortions are eliminated. This work also improves accuracy on high-level vision tasks: semantic segmentation reaches an mIoU of 0.352 and object detection an mAP of 51.71. Quantitative and qualitative analyses demonstrate that MPCNet outperforms state-of-the-art approaches.
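To give a flavor of attention-based feature filtering and fusion, this minimal sketch gates an auxiliary feature map with a learned sigmoid mask before merging it into the main feature; the layer shapes and structure are assumptions, not the published AFFM design.

```python
import torch
import torch.nn as nn

class AttentionFusion(nn.Module):
    """Toy attention-based fusion of two feature maps (not the exact AFFM).

    A mask is predicted from the concatenated features and used to suppress
    misleading information in the auxiliary feature before fusion.
    """
    def __init__(self, channels: int):
        super().__init__()
        self.mask = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1),
            nn.Sigmoid(),  # per-pixel, per-channel gate in [0, 1]
        )

    def forward(self, main_feat, aux_feat):
        gate = self.mask(torch.cat([main_feat, aux_feat], dim=1))
        return main_feat + gate * aux_feat  # keep main, add filtered auxiliary

# Example: fuse temporal and parallax features of shape (N, C, H, W)
fusion = AttentionFusion(channels=64)
out = fusion(torch.randn(1, 64, 32, 32), torch.randn(1, 64, 32, 32))
print(out.shape)  # torch.Size([1, 64, 32, 32])
```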
Affiliation(s)
- Chang Wu — School of Electronic Engineering, Xidian University, Xi'an, Shaanxi 710071, China
- Gang He — School of Telecommunications Engineering, Xidian University, Xi'an, Shaanxi 710071, China
- Xinquan Lai — School of Electronic Engineering, Xidian University, Xi'an, Shaanxi 710071, China
- Yunsong Li — School of Telecommunications Engineering, Xidian University, Xi'an, Shaanxi 710071, China
3
Liu W, Ma L, Qiu B, Cui M. Stereoscopic view synthesis with progressive structure reconstruction and scene constraints. PLoS One 2022; 17:e0279249. PMID: 36534690. PMCID: PMC9762595. DOI: 10.1371/journal.pone.0279249.
Abstract
Depth image-based rendering (DIBR) is an important technology in 2D-to-3D conversion. It uses texture images and associated depth maps to render virtual views. However, current DIBR systems still face challenging problems, such as disocclusions. Inpainting methods based on deep learning have recently shown significant improvements and can generate plausible images. However, most of these methods do not handle the disocclusion holes in synthesized views well: on the one hand, they treat the issue as generative inpainting after 3D warping rather than following the full DIBR processing procedure; on the other hand, the holes in virtual views cluster around the transition regions between foreground and background, which makes them difficult to distinguish without special constraints. Motivated by these observations, this paper proposes a novel learning-based method for stereoscopic view synthesis in which disocclusion regions are restored by a progressive structure reconstruction strategy instead of direct texture inpainting. Additionally, special cues in the synthesized scenes are exploited as constraints for the network to alleviate hallucinated structure mixtures among different layers. Extensive empirical evaluations and comparisons validate the strengths of the proposed approach and demonstrate that the model is well suited to stereoscopic synthesis in 2D-to-3D conversion applications.
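For context, the sketch below shows where disocclusion holes originate: a naive forward DIBR warp for a horizontal camera shift, under an assumed inverse depth-to-disparity mapping, leaves unfilled target pixels that a synthesis method must restore.

```python
import numpy as np

def dibr_warp(texture, depth, max_disp=8):
    """Naive forward warping for a horizontal camera shift (no hole filling).

    Disparity is taken as inversely proportional to depth, so near pixels
    shift more; a z-buffer keeps the nearest point when pixels collide.
    Target positions that receive no source pixel are disocclusion holes.
    """
    h, w = depth.shape
    disparity = np.round(max_disp * depth.min() / depth).astype(int)
    virtual = np.zeros_like(texture)
    zbuf = np.full((h, w), np.inf)
    filled = np.zeros((h, w), dtype=bool)
    for y in range(h):
        for x in range(w):
            xv = x - disparity[y, x]
            if 0 <= xv < w and depth[y, x] < zbuf[y, xv]:
                virtual[y, xv] = texture[y, x]
                zbuf[y, xv] = depth[y, x]
                filled[y, xv] = True
    return virtual, ~filled  # ~filled marks the regions to be restored

tex = np.random.rand(64, 64)
dep = np.tile(np.linspace(1.0, 10.0, 64), (64, 1))  # depth grows left to right
view, holes = dibr_warp(tex, dep)
print(holes.sum(), "disoccluded pixels")
```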
Affiliation(s)
- Wei Liu (corresponding author) — Electromechanic Engineering College, Nanyang Normal University, Nanyang, Henan, China
- Liyan Ma — Computer Engineering and Science College, Shanghai University, Shanghai, China
- Bo Qiu (corresponding author) — Electronic and Information Engineering College, Hebei University of Technology, Tianjin, China
- Mingyue Cui — Electromechanic Engineering College, Nanyang Normal University, Nanyang, Henan, China
4
Zhang H, Cao J, Zheng D, Yao X, Ling BWK. Deep Learning-Based Synthesized View Quality Enhancement with DIBR Distortion Mask Prediction Using Synthetic Images. Sensors (Basel) 2022; 22:8127. PMID: 36365828. PMCID: PMC9656180. DOI: 10.3390/s22218127.
Abstract
Recently, deep learning-based image quality enhancement models have been proposed to improve the perceptual quality of distorted synthesized views impaired by compression and the Depth Image-Based Rendering (DIBR) process in multi-view video systems. However, due to the scarcity of Multi-view Video plus Depth (MVD) data, the training data for quality enhancement models is small, which limits the performance and progress of these models. Augmenting the training data is a feasible way to strengthen synthesized view quality enhancement (SVQE) models. In this paper, a deep learning-based SVQE model trained on additional synthetic synthesized view images (SVIs) is proposed. To simulate the irregular geometric displacement of DIBR distortion, a random irregular polygon-based SVI synthesis method is proposed that builds on existing large-scale RGB/RGBD data, and a synthetic synthesized view database is constructed comprising synthetic SVIs and the corresponding DIBR distortion masks. Moreover, to guide the SVQE models to focus more precisely on DIBR distortion, a DIBR distortion mask prediction network, which predicts the position and variance of DIBR distortion, is embedded into the SVQE models. Experimental results on public MVD sequences demonstrate that the PSNR of existing SVQE models, e.g., DnCNN, NAFNet, and TSAN, pre-trained on NYU-based synthetic SVIs improves by 0.51, 0.36, and 0.26 dB on average, respectively, while MPPSNRr improves by 0.86, 0.25, and 0.24 on average, respectively. In addition, with the DIBR distortion mask prediction network, the SVI quality obtained by DnCNN and NAFNet pre-trained on NYU-based synthetic SVIs is further enhanced by 0.02 and 0.03 dB on average in PSNR and by 0.004 and 0.121 on average in MPPSNRr.
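The polygon-based simulation idea can be sketched as follows; the vertex counts, radii, and the crude pixel-displacement corruption are illustrative assumptions, not the parameters of the paper's database.

```python
import numpy as np
import cv2  # OpenCV, used here only for polygon rasterization

def random_polygon_mask(h, w, n_polygons=3, n_vertices=8, radius=20, rng=None):
    """Rasterize random irregular polygons as a binary distortion mask."""
    rng = np.random.default_rng(rng)
    mask = np.zeros((h, w), dtype=np.uint8)
    for _ in range(n_polygons):
        cx, cy = rng.integers(0, w), rng.integers(0, h)
        angles = np.sort(rng.uniform(0, 2 * np.pi, n_vertices))
        radii = rng.uniform(0.3, 1.0, n_vertices) * radius  # irregular outline
        pts = np.stack([cx + radii * np.cos(angles),
                        cy + radii * np.sin(angles)], axis=1).astype(np.int32)
        cv2.fillPoly(mask, [pts], 1)
    return mask

# Corrupt an image inside the mask to mimic geometric displacement artifacts
img = np.random.randint(0, 256, (128, 128, 3), dtype=np.uint8)
m = random_polygon_mask(128, 128, rng=0)
corrupted = img.copy()
corrupted[m == 1] = np.roll(img, 5, axis=1)[m == 1]  # crude displacement stand-in
```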
5
Lei J, Zhang Z, Pan Z, Liu D, Liu X, Chen Y, Ling N. Disparity-Aware Reference Frame Generation Network for Multiview Video Coding. IEEE Transactions on Image Processing 2022; 31:4515-4526. PMID: 35727785. DOI: 10.1109/TIP.2022.3183436.
Abstract
Multiview video coding (MVC) compresses multiview video by eliminating video redundancies, and the quality of the reference frame directly affects compression efficiency. In this paper, we propose a deep virtual reference frame generation method based on a disparity-aware reference frame generation network (DAG-Net), which models the disparity relationship between viewpoints to generate a more reliable reference frame. The proposed DAG-Net consists of a multi-level receptive field module, a disparity-aware alignment module, and a fusion reconstruction module. First, the multi-level receptive field module is designed to enlarge the receptive field and extract multi-scale deep features from the temporal and inter-view reference frames. Then, the disparity-aware alignment module learns the disparity relationship and applies a disparity shift to the inter-view reference frame to align it with the temporal reference frame. Finally, the fusion reconstruction module fuses the complementary information to generate a more reliable virtual reference frame. Experiments demonstrate that the proposed reference frame generation method achieves superior performance for multiview video coding.
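To make the disparity-shift idea concrete, this minimal sketch predicts a per-pixel horizontal disparity from both features and warps the inter-view feature toward the temporal one via grid sampling; the single-convolution estimator is a placeholder, not the actual alignment module.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DisparityShift(nn.Module):
    """Toy horizontal alignment: predict per-pixel disparity from both views
    and warp the inter-view feature toward the temporal one (a placeholder
    for a disparity-aware alignment module)."""
    def __init__(self, channels: int):
        super().__init__()
        self.disp = nn.Conv2d(2 * channels, 1, kernel_size=3, padding=1)

    def forward(self, inter_view, temporal):
        n, _, h, w = inter_view.shape
        d = self.disp(torch.cat([inter_view, temporal], dim=1))  # (N,1,H,W)
        # Base sampling grid in normalized [-1, 1] coordinates.
        ys, xs = torch.meshgrid(torch.linspace(-1, 1, h),
                                torch.linspace(-1, 1, w), indexing="ij")
        grid = torch.stack([xs, ys], dim=-1).expand(n, h, w, 2).clone()
        grid[..., 0] = grid[..., 0] + d.squeeze(1) * (2.0 / w)  # shift in x
        return F.grid_sample(inter_view, grid, align_corners=True)

shift = DisparityShift(channels=32)
aligned = shift(torch.randn(1, 32, 24, 24), torch.randn(1, 32, 24, 24))
print(aligned.shape)  # torch.Size([1, 32, 24, 24])
```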
6
Sheng X, Li L, Liu D, Xiong Z. Attribute Artifacts Removal for Geometry-Based Point Cloud Compression. IEEE Transactions on Image Processing 2022; 31:3399-3413. PMID: 35503831. DOI: 10.1109/TIP.2022.3170722.
Abstract
Geometry-based point cloud compression (G-PCC) achieves remarkable compression efficiency for point clouds, but it still introduces serious attribute compression artifacts, especially at low bitrates. In this paper, we propose a Multi-Scale Graph Attention Network (MS-GAT) to remove the artifacts of point cloud attributes compressed by G-PCC. We first construct a graph based on point cloud geometry coordinates and then use Chebyshev graph convolutions to extract features of point cloud attributes. Considering that a point may be correlated with points both near to and far from it, we propose a multi-scale scheme to capture short- and long-range correlations between the current point and its neighboring and distant points. To address the problem that points may exhibit different degrees of artifacts due to adaptive quantization, we feed the per-point quantization step as an extra input to the proposed network. We also incorporate a weighted graph attentional layer that pays special attention to points with more attribute artifacts. To the best of our knowledge, this is the first attribute artifacts removal method for G-PCC. We validate the effectiveness of our method on various point clouds. Objective comparisons show that the proposed method achieves average BD-rate reductions of 9.74% over Predlift and 10.13% over RAHT. Subjective comparisons show that visual artifacts such as color shifting, blurring, and quantization noise are reduced.
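A bare-bones version of a Chebyshev graph convolution over a geometry-defined kNN graph is sketched below; the graph construction, order K = 3, and random weights are illustrative choices, not the MS-GAT configuration.

```python
import numpy as np

def knn_graph_laplacian(coords, k=8):
    """Scaled Laplacian L_hat = L - I of a kNN graph on point coordinates."""
    n = coords.shape[0]
    d2 = ((coords[:, None, :] - coords[None, :, :]) ** 2).sum(-1)
    np.fill_diagonal(d2, np.inf)
    nbrs = np.argsort(d2, axis=1)[:, :k]
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), nbrs.ravel()] = 1.0
    A = np.maximum(A, A.T)  # symmetrize the kNN adjacency
    deg = A.sum(1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(np.maximum(deg, 1e-12)))
    L = np.eye(n) - D_inv_sqrt @ A @ D_inv_sqrt  # normalized Laplacian
    return L - np.eye(n)  # crude rescaling, assumes lambda_max ~ 2

def cheb_conv(L_hat, x, weights):
    """y = sum_k T_k(L_hat) x W_k via the Chebyshev recurrence."""
    t_prev, t_cur = x, L_hat @ x          # T_0 x and T_1 x
    out = t_prev @ weights[0] + t_cur @ weights[1]
    for k in range(2, len(weights)):
        t_prev, t_cur = t_cur, 2 * L_hat @ t_cur - t_prev  # T_k recurrence
        out = out + t_cur @ weights[k]
    return out

pts = np.random.rand(100, 3)                           # geometry coordinates
attr = np.random.rand(100, 3)                          # RGB attributes
W = [np.random.randn(3, 16) * 0.1 for _ in range(3)]   # K = 3 Chebyshev taps
print(cheb_conv(knn_graph_laplacian(pts), attr, W).shape)  # (100, 16)
```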
7
Zhang M, Chen Y, Pan Y, Zeng Z. A Fast Image Deformity Correction Algorithm for Underwater Turbulent Image Distortion. Sensors (Basel) 2019; 19:3818. PMID: 31487831. PMCID: PMC6766914. DOI: 10.3390/s19183818.
Abstract
An algorithm that corrects distortion by estimating pixel shift is proposed for the degradation caused by underwater turbulence. The distorted image is restored and reconstructed through reference frame selection and two-dimensional pixel registration. A support vector machine-based kernel correlation filtering algorithm is proposed and applied to improve the speed and efficiency of the correction. To validate the algorithm, laboratory experiments on a controlled turbulent-water simulation system and field experiments in rivers and oceans were carried out, and the results were compared with traditional, theoretical model-based, and particle image velocimetry-based restoration and reconstruction algorithms. Subjective visual evaluation shows that image distortion is effectively suppressed, and an objective statistical performance analysis shows that the measured values surpass those of the traditional and previously studied restoration and reconstruction algorithms. The proposed method is also much faster than the other algorithms. It can be concluded that the proposed algorithm effectively corrects underwater turbulence-degraded images and offers a potential technique for accurate real-time underwater target detection.
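The select-register-average pipeline can be sketched with off-the-shelf dense optical flow standing in for the paper's SVM-based kernel correlation filter; choosing the reference frame as the one closest to the temporal mean is likewise a common heuristic, not necessarily the authors' criterion.

```python
import numpy as np
import cv2

def correct_turbulence(frames):
    """Warp all frames onto a reference and average, suppressing pixel shifts.

    frames: list of uint8 grayscale images with identical shape.
    """
    stack = np.stack(frames).astype(np.float32)
    mean = stack.mean(axis=0)
    # Heuristic reference selection: the frame closest to the temporal mean.
    ref_idx = int(np.argmin(((stack - mean) ** 2).sum(axis=(1, 2))))
    ref = frames[ref_idx]
    h, w = ref.shape
    xs, ys = np.meshgrid(np.arange(w, dtype=np.float32),
                         np.arange(h, dtype=np.float32))
    acc = np.zeros((h, w), np.float32)
    for f in frames:
        # Dense pixel-shift estimate (stand-in for the SVM-KCF registration).
        flow = cv2.calcOpticalFlowFarneback(ref, f, None, 0.5, 3, 15, 3, 5, 1.2, 0)
        warped = cv2.remap(f.astype(np.float32), xs + flow[..., 0],
                           ys + flow[..., 1], cv2.INTER_LINEAR)
        acc += warped
    return (acc / len(frames)).astype(np.uint8)

frames = [np.random.randint(0, 256, (64, 64), dtype=np.uint8) for _ in range(5)]
restored = correct_turbulence(frames)
```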
Affiliation(s)
- Min Zhang — School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
- Yuzhang Chen — School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
- Yongcai Pan — School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
- Zhangfan Zeng — School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
8
Liu H, Zhang Y, Zhang H, Fan C, Kwong S, Kuo CCJ, Fan X. Deep Learning based Picture-Wise Just Noticeable Distortion Prediction Model for Image Compression. IEEE Transactions on Image Processing 2019; 29:641-656. PMID: 31425033. DOI: 10.1109/TIP.2019.2933743.
Abstract
Picture-Wise Just Noticeable Difference (PW-JND), the minimum difference in a picture that the human visual system can perceive, can be widely used in perception-oriented image and video processing. However, conventional Just Noticeable Difference (JND) models compute the JND threshold for each pixel or sub-band separately, which may not accurately reflect the total masking effect of a picture. In this paper, we propose a deep learning-based PW-JND prediction model for image compression. Firstly, we formulate PW-JND prediction as a multi-class classification problem and propose a framework that reduces it to a binary classification problem solved by a single binary classifier. Secondly, we construct a deep learning-based binary classifier, a perceptually lossy/lossless predictor, which predicts whether one image is perceptually lossy with respect to another. Finally, we propose a sliding window-based search strategy that predicts PW-JND from the outputs of the perceptually lossy/lossless predictor. Experimental results show that the mean accuracy of the perceptually lossy/lossless predictor reaches 92%, and the absolute prediction error of the proposed PW-JND model is 0.79 dB on average, demonstrating its superiority over conventional JND models.
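The search strategy can be illustrated abstractly: given a binary predictor that decides whether a compressed image is perceptually lossy with respect to the original, scan distortion levels window by window to locate the PW-JND point. The oracle below is a toy stand-in for the paper's learned classifier.

```python
def find_pw_jnd(levels, is_perceptually_lossy, window=3):
    """Return the first distortion level judged perceptually lossy.

    levels: distortion levels ordered from least to most compressed.
    is_perceptually_lossy: predicate level -> bool (in the paper, a learned
    binary classifier; any callable works here).
    window: number of consecutive levels examined per coarse step; the
    PW-JND point is refined inside the first window with a lossy decision.
    """
    for start in range(0, len(levels), window):
        block = levels[start:start + window]
        if any(is_perceptually_lossy(q) for q in block):  # coarse scan
            for q in block:                               # fine scan in window
                if is_perceptually_lossy(q):
                    return q
    return None  # perceptually lossless across the whole range

# Toy oracle: pretend distortion above level 42 crosses the JND threshold.
print(find_pw_jnd(list(range(30, 52, 2)), lambda q: q > 42))  # 44
```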