Discover the SciOpen Platform and Achieve Your Research Goals with Ease.
Search articles, authors, keywords, DOl and etc.
We propose a normalizing flow based on the wavelet framework for super-resolution (SR) called WDFSR. It learns the conditional distribution mapping between low-resolution images in the RGB domain and high-resolution images in the wavelet domain to simultaneously generate high-resolution images of different styles. To address the issue of some flow-based models being sensitive to datasets, which results in training fluctuations that reduce the mapping ability of the model and weaken generalization, we designed a method that combines a T-distribution and QR decomposition layer. Our method alleviates this problem while maintaining the ability of the model to map different distributions and produce higher-quality images. Good contextual conditional features can promote model training and enhance the distribution mapping capabilities for conditional distribution mapping. Therefore, we propose a Refinement layer combined with an attention mechanism to refine and fuse the extracted condition features to improve image quality. Extensive experiments on several SR datasets demonstrate that WDFSR outperforms most general CNN- and flow-based models in terms of PSNR value and perception quality. We also demonstrated that our framework works well for other low-level vision tasks, such as low-light enhancement. The pretrained models and source code with guidance for reference are available at https://github.com/Lisbegin/WDFSR.
Li, W.; Zhou, K.; Qi, L.; Lu, L.; Lu, J. Best-buddy GANs for highly detailed image super-resolution. Proceedings of the AAAI Conference on Artificial Intelligence Vol. 36, No. 2, 1412–1420, 2022.
Shannon, C. E. Communication in the presence of noise. Proceedings of the IRE Vol. 37, No. 1, 10–21, 1949.
Liu, S.; Gang, R.; Li, C.; Song, R. Adaptive deep residual network for single image super-resolution. Computational Visual Media Vol. 5, No. 4, 391–401, 2019.
Ma, C.; Rao, Y.; Lu, J.; Zhou, J. Structure-preserving image super-resolution. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 44, No. 11, 7898–7911, 2022.
Wu, J.; Cong, R.; Fang, L.; Guo, C.; Zhang, B.; Ghamisi, P. Unpaired remote sensing image super-resolution with content-preserving weak supervision neural network. Science China Information Sciences Vol. 66, No. 1, Article No. 119105, 2022.
Park, S. H.; Moon, Y. S.; Cho, N. I. Flexible style image super-resolution using conditional objective. IEEE Access Vol. 10, 9774–9792, 2022.
Grover, A.; Chute, C.; Shu R.; Cao Z.; Ermon, S. AlignFlow: Cycle consistent learning from multiple domains via normalizing flows. Proceedings of the AAAI Conference on Artificial Intelligence Vol. 34, No. 4, 4028–4035, 2020.
Gal, R.; Hochberg, D. C.; Bermano, A.; Cohen-Or, D. SWAGAN: A style-based wavelet-driven generative model. ACM Transactions on Graphics Vol. 40, No. 4, Article No. 134, 2021.
Hsu, W. Y.; Jian, P. W. Detail-enhanced wavelet residual network for single image super-resolution. IEEE Transactions on Instrumentation and Measurement Vol. 71, Article No. 5016913, 2022.
Guo, X.; Li, Y.; Ling, H. LIME: Low-light image enhancement via illumination map estimation. IEEE Transactions on Image Processing Vol. 26, No. 2, 982–993, 2017.
Jiang, Y.; Gong, X.; Liu, D.; Cheng, Y.; Fang, C.; Shen, X.; Yang, J.; Zhou, P.; Wang, Z. EnlightenGAN: Deep light enhancement without paired supervision. IEEE Transactions on Image Processing Vol. 30, 2340–2349, 2021.
Zhang, Y.; Guo, X.; Ma, J.; Liu, W.; Zhang, J. Beyond brightening low-light images. International Journal of Computer Vision Vol. 129, No. 4, 1013–1037, 2021.
Breckenridge, M. B.; Tallia, A. F.; Like, R. C. Display of small-area variation in health-related data: A methodology using resistant statistics. Social Science & Medicine Vol. 26, No. 1, 141–151, 1988.
Huang, Y.; Li, J.; Hu, Y.; Gao, X.; Huang, H. Transitional learning: Exploring the transition states of degradation for blind super-resolution. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 45, No. 5, 6495–6510, 2023.
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
To submit a manuscript, please go to https://jcvm.org.