1
|
Jia Y, Yu W, Chen G, Zhao L. Nighttime road scene image enhancement based on cycle-consistent generative adversarial network. Sci Rep 2024; 14:14375. [PMID: 38909068 PMCID: PMC11193765 DOI: 10.1038/s41598-024-65270-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Accepted: 06/18/2024] [Indexed: 06/24/2024] Open
Abstract
During nighttime road scenes, images are often affected by contrast distortion, loss of detailed information, and a significant amount of noise. These factors can negatively impact the accuracy of segmentation and object detection in nighttime road scenes. A cycle-consistent generative adversarial network has been proposed to address this issue to improve the quality of nighttime road scene images. The network includes two generative networks with identical structures and two adversarial networks with identical structures. The generative network comprises an encoder network and a corresponding decoder network. A context feature extraction module is designed as the foundational element of the encoder-decoder network to capture more contextual semantic information with different receptive fields. A receptive field residual module is also designed to increase the receptive field in the encoder network.The illumination attention module is inserted between the encoder and decoder to transfer critical features extracted by the encoder to the decoder. The network also includes a multiscale discriminative network to discriminate better whether the image is a real high-quality or generated image. Additionally, an improved loss function is proposed to enhance the efficacy of image enhancement. Compared to state-of-the-art methods, the proposed approach achieves the highest performance in enhancing nighttime images, making them clearer and more natural.
Collapse
Affiliation(s)
- Yanfei Jia
- College of Electrical and Information Engineering, Beihua University, Jilin, 132013, China
| | - Wenshuo Yu
- College of Electrical Engineering, Northeast Electric Power University, Jilin, 132012, China
| | - Guangda Chen
- College of Electrical and Information Engineering, Beihua University, Jilin, 132013, China.
| | - Liquan Zhao
- College of Electrical Engineering, Northeast Electric Power University, Jilin, 132012, China
| |
Collapse
|
2
|
Zhang C, Gu Y, Yang GZ. Contrastive Adversarial Learning for Endomicroscopy Imaging Super-Resolution. IEEE J Biomed Health Inform 2023; 27:3994-4005. [PMID: 37171919 DOI: 10.1109/jbhi.2023.3275563] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Endomicroscopy is an emerging imaging modality for real-time optical biopsy. One limitation of existing endomicroscopy based on coherent fibre bundles is that the image resolution is intrinsically limited by the number of fibres that can be practically integrated within the small imaging probe. To improve the image resolution, Super-Resolution (SR) techniques combined with image priors can enhance the clinical utility of endomicroscopy whereas existing SR algorithms suffer from the lack of explicit guidance from ground truth high-resolution (HR) images. In this article, we propose an unsupervised SR pipeline to allow stable offline and kernel-generic learning. Our method takes advantage of both internal statistics and external cross-modality priors. To improve the joint learning process, we present a Sharpness-aware Contrastive Generative Adversarial Network (SCGAN) with two dedicated modules, a sharpness-aware generator and a contrastive-learning discriminator. In the generator, an auxiliary task of sharpness discrimination is formulated to facilitate internal learning by considering the rankings of training instances in various sharpness levels. In the discriminator, we design a contrastive-learning module to mitigate the ill-posed nature of SR tasks via constraints from both positive and negative images. Experiments on multiple datasets demonstrate that SCGAN reduces the performance gap between previous unsupervised approaches and the upper bounds defined in supervised settings by more than 50%, delivering a new state-of-the-art performance score for endomicroscopy super-resolution. Further application on a realistic Voronoi-based pCLE downsampling kernel proves that SCGAN attains PSNR of 35.851 dB, improving 5.23 dB compared with the traditional Delaunay interpolation.
Collapse
|
3
|
Hao S, Huang C, Heidari AA, Xu Z, Chen H, Althobaiti MM, Mansour RF, Chen X. Performance optimization of water cycle algorithm for multilevel lupus nephritis image segmentation. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
4
|
Zhou Y, Xu C, Zhao L, Zhu A, Hu F, Li Y. CSI-Former: Pay More Attention to Pose Estimation with WiFi. ENTROPY (BASEL, SWITZERLAND) 2022; 25:20. [PMID: 36673161 PMCID: PMC9858036 DOI: 10.3390/e25010020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 12/11/2022] [Accepted: 12/18/2022] [Indexed: 06/17/2023]
Abstract
Cross-modal human pose estimation has a wide range of applications. Traditional image-based pose estimation will not work well in poor light or darkness. Therefore, some sensors such as LiDAR or Radio Frequency (RF) signals are now using to estimate human pose. However, it limits the application that these methods require much high-priced professional equipment. To address these challenges, we propose a new WiFi-based pose estimation method. Based on the Channel State Information (CSI) of WiFi, a novel architecture CSI-former is proposed to innovatively realize the integration of the multi-head attention in the WiFi-based pose estimation network. To evaluate the performance of CSI-former, we establish a span-new dataset Wi-Pose. This dataset consists of 5 GHz WiFi CSI, the corresponding images, and skeleton point annotations. The experimental results on Wi-Pose demonstrate that CSI-former can significantly improve the performance in wireless pose estimation and achieve more remarkable performance over traditional image-based pose estimation. To better benefit future research on the WiFi-based pose estimation, Wi-Pose has been made publicly available.
Collapse
|
5
|
Xu Y, Lv Y, Zhu X, Liu S, Sun Y, Wang Y. Hyperspectral image super-resolution reconstruction based on image partition and detail enhancement. Soft comput 2022. [DOI: 10.1007/s00500-022-07723-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
6
|
Context receptive field and adaptive feature fusion for fabric defect detection. Soft comput 2022. [DOI: 10.1007/s00500-022-07675-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
|
7
|
Underwater Backscatter Recognition Using Deep Fuzzy Extreme Convolutional Neural Network Optimized via Hunger Games Search. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-11068-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
8
|
Li F, Zhu A, Li J, Xu Y, Zhang Y, Yin H, Hua G. Frequency-driven channel attention-augmented full-scale temporal modeling network for skeleton-based action recognition. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109854] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
9
|
Remote sensing image super-resolution using multi-scale convolutional sparse coding network. PLoS One 2022; 17:e0276648. [PMID: 36288378 PMCID: PMC9605020 DOI: 10.1371/journal.pone.0276648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 10/11/2022] [Indexed: 11/24/2022] Open
Abstract
With the development of convolutional neural networks, impressive success has been achieved in remote sensing image super-resolution. However, the performance of super-resolution reconstruction is unsatisfactory due to the lack of details in remote sensing images when compared to natural images. Therefore, this paper presents a novel multiscale convolutional sparse coding network (MCSCN) to carry out the remote sensing images SR reconstruction with rich details. The MCSCN, which consists of a multiscale convolutional sparse coding module (MCSCM) with dictionary convolution units, can improve the extraction of high frequency features. We can obtain more plentiful feature information by combining multiple sizes of sparse features. Finally, a layer based on sub-pixel convolution that combines global and local features takes as the reconstruction block. The experimental results show that the MCSCN gains an advantage over several existing state-of-the-art methods in terms of peak signal-to-noise ratio and structural similarity.
Collapse
|
10
|
Sun T, Wang X, Zhang K, Jiang D, Lin D, Jv X, Ding B, Zhu W. Medical Image Authentication Method Based on the Wavelet Packet and Energy Entropy. ENTROPY 2022; 24:e24060798. [PMID: 35741519 PMCID: PMC9223134 DOI: 10.3390/e24060798] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 05/31/2022] [Accepted: 06/06/2022] [Indexed: 11/18/2022]
Abstract
The transmission of digital medical information is affected by data compression, noise, scaling, labeling, and other factors. At the same time, medical data may be illegally copied and maliciously tampered with without authorization. Therefore, the copyright protection and integrity authentication of medical information are worthy of attention. In this paper, based on the wavelet packet and energy entropy, a new method of medical image authentication is designed. The proposed method uses the sliding window to measure the energy of the detail information. In the time–frequency data distribution, the local details of the data are mined. The complexity of energy is quantitatively described to highlight the valuable information. Based on the energy weight, the local energy entropy is constructed and normalized. The adjusted entropy value is used as the feature vector of the authentication information. A series of experiments show that the authentication method has good robustness against shearing attacks, median filtering, contrast enhancement, brightness enhancement, salt-and-pepper noise, Gaussian noise, multiplicative noise, image rotation, scaling attacks, sharpening, JPEG compression, and other attacks.
Collapse
Affiliation(s)
- Tiankai Sun
- Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China;
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| | - Xingyuan Wang
- Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China;
- School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
- Correspondence:
| | - Kejun Zhang
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| | - Daihong Jiang
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| | - Da Lin
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| | - Xunguang Jv
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| | - Bin Ding
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| | - Weidong Zhu
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221008, China; (K.Z.); (D.J.); (D.L.); (X.J.); (B.D.); (W.Z.)
| |
Collapse
|
11
|
Li W, Zhu A, Xu Y, Yin H, Hua G. A Fast Multi-Scale Generative Adversarial Network for Image Compressed Sensing. ENTROPY 2022; 24:e24060775. [PMID: 35741496 PMCID: PMC9222711 DOI: 10.3390/e24060775] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 05/20/2022] [Accepted: 05/23/2022] [Indexed: 02/01/2023]
Abstract
Recently, deep neural network-based image compressed sensing methods have achieved impressive success in reconstruction quality. However, these methods (1) have limitations in sampling pattern and (2) usually have the disadvantage of high computational complexity. To this end, a fast multi-scale generative adversarial network (FMSGAN) is implemented in this paper. Specifically, (1) an effective multi-scale sampling structure is proposed. It contains four different kernels with varying sizes so that decompose, and sample images effectively, which is capable of capturing different levels of spatial features at multiple scales. (2) An efficient lightweight multi-scale residual structure for deep image reconstruction is proposed to balance receptive field size and computational complexity. The key idea is to apply smaller convolution kernel sizes in the multi-scale residual structure to reduce the number of operations while maintaining the receptive field. Meanwhile, the channel attention structure is employed for enriching useful information. Moreover, perceptual loss is combined with MSE loss and adversarial loss as the optimization function to recover a finer image. Numerous experiments show that our FMSGAN achieves state-of-the-art image reconstruction quality with low computational complexity.
Collapse
Affiliation(s)
- Wenzong Li
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China; (W.L.); (Y.X.); (H.Y.)
| | - Aichun Zhu
- School of Computer Science and Technology, Nanjing Tech University, Nanjing 211800, China;
| | - Yonggang Xu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China; (W.L.); (Y.X.); (H.Y.)
| | - Hongsheng Yin
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China; (W.L.); (Y.X.); (H.Y.)
| | - Gang Hua
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China; (W.L.); (Y.X.); (H.Y.)
- Correspondence:
| |
Collapse
|
12
|
Enhanced Gamma-Ray Attenuation-Based Detection System Using an Artificial Neural Network. PHOTONICS 2022. [DOI: 10.3390/photonics9060382] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Scale deposition is the accumulation of various materials in the walls of transmission lines and unwanted parts in the oil and gas production system. It is a leading moot point in all transmission lines, tanks, and petroleum equipment. Scale deposition leads to drastic detrimental problems, reduced permeability, pressure and production losses, and direct financial losses due to the failure of some equipment. The accumulation of oil and gas leads to clogged pores and obstruction of fluid flow. Considering the passage of a two-phase flow, our study determines the thickness of the scale, and the flow regime is detected with the help of two Multilayer Perceptron (MLP) networks. First, the diagnostic system consisting of a dual-energy source, a steel pipe, and a NaI detector was implemented, using the Monte Carlo N Particle Code (MCNP). Subsequently, the received signals were processed, and properties were extracted using the wavelet transform technique. These features were considered as inputs of an Artificial Neural Network (ANN) model used to determine the type of flow regimes and predict the scale thickness. By accurately classifying the flow regimes and determining the scale inside the pipe, our proposed method provides a platform that could enhance many areas of the oil industry.
Collapse
|
13
|
Optimization of Spatial Resolution and Image Reconstruction Parameters for the Small-Animal Metis™ PET/CT System. ELECTRONICS 2022. [DOI: 10.3390/electronics11101542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Purpose: The aim of this study was to investigate the optimization of the spatial resolution and image reconstruction parameters related to image quality in an iterative reconstruction algorithm for the small-animal Metis™ PET/CT system. Methods: We used a homemade Derenzo phantom to evaluate the image quality using visual assessment, the signal-to-noise ratio, the contrast, the coefficient of variation, and the contrast-to-noise ratio of the 0.8 mm hot rods of eight slices in the center of the phantom PET images. A healthy mouse study was performed to analyze the influence of the optimal reconstruction parameters and the Gaussian post-filter FWHM. Results: In the phantom study, the image quality was the best when the phantom was placed at the end, keeping the central axis parallel to the X-axis of the system, and selecting between 30 and 40 iterations, a 0.314 mm reconstructed voxel size, and a 1.57 mm Gaussian post-filter FWHM. The optimization of the spatial resolution could reach 0.6 mm. In the animal study, it was suitable to choose a voxel size of 0.472 mm, between 30 and 40 iterations, and a 2.36 mm Gaussian post-filter FWHM. Conclusions: Our results indicate that the optimal imaging conditions and reconstruction parameters are very necessary to obtain high-resolution images and quantitative accuracy, especially for the high-precision recognition of tiny lesions.
Collapse
|