
Uncertainty-guided hierarchical frequency domain Transformer for image restoration

  • Mingwen Shao
  • Yuanjian Qiao
  • Deyu Meng
  • Wangmeng Zuo
  • China University of Petroleum (East China)
  • Harbin Institute of Technology

Research output: Contribution to journal, Article, peer-reviewed

46 Citations (Scopus)

Abstract

Existing convolutional neural network (CNN)-based and vision Transformer (ViT)-based image restoration methods usually operate in the spatial domain. However, we employ Fourier analysis to show that these spatial-domain models cannot perceive the entire frequency spectrum of images, i.e., they mainly focus on either high-frequency components (CNN-based models) or low-frequency components (ViT-based models). This intrinsic limitation results in the partial loss of semantic information and the appearance of artifacts. To address this limitation, we propose a novel uncertainty-guided hierarchical frequency domain Transformer, named HFDT, that effectively learns both high- and low-frequency information while perceiving local and global features. Specifically, to aggregate semantic information from various frequency levels, we propose a dual-domain feature interaction mechanism, in which the global frequency information and local spatial features are extracted by corresponding branches. The frequency domain branch adopts the Fast Fourier Transform (FFT) to convert the features from the spatial domain to the frequency domain, where the global low- and high-frequency components are learned with log-linear complexity. Complementarily, an efficient convolution group is employed in the spatial domain branch to capture local high-frequency details. Moreover, we introduce an uncertainty degradation-guided strategy to efficiently represent degraded prior information, rather than simply distinguishing degraded and non-degraded regions in binary form. Our approach achieves competitive results in several degradation scenarios, including rain streaks, raindrops, motion blur, and defocus blur.
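
The dual-domain idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' HFDT implementation: the function names (`frequency_branch`, `spatial_branch`, `dual_domain_block`), the single-channel input, the fixed per-frequency filter standing in for learned weights, the 3x3 kernel standing in for the convolution group, and the residual sum used to fuse the branches are all assumptions made for illustration only.

```python
import numpy as np


def frequency_branch(x, freq_filter):
    """Global mixing: FFT, per-frequency complex scaling, inverse FFT.

    `freq_filter` is a hypothetical stand-in for learned frequency weights.
    The FFT costs O(HW log HW), which is the source of log-linear complexity.
    """
    spec = np.fft.rfft2(x)                 # complex spectrum, shape (H, W//2 + 1)
    spec = spec * freq_filter              # every frequency is touched at once
    return np.fft.irfft2(spec, s=x.shape)  # back to the spatial domain


def spatial_branch(x, kernel):
    """Local mixing: 3x3 cross-correlation with zero padding."""
    h, w = x.shape
    padded = np.pad(x, 1)                  # zero-pad one pixel on each side
    out = np.zeros_like(x)
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def dual_domain_block(x, freq_filter, kernel):
    """Fuse both branches with a residual sum (an assumed fusion rule)."""
    return x + frequency_branch(x, freq_filter) + spatial_branch(x, kernel)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat = rng.standard_normal((6, 8))     # toy single-channel feature map
    filt = np.ones((6, 5), dtype=complex)  # all-pass filter for the demo
    kern = np.full((3, 3), 1.0 / 9.0)      # box blur as a toy local kernel
    print(dual_domain_block(feat, filt, kern).shape)
```

Because the frequency branch scales every Fourier coefficient, each output pixel depends on the whole input, whereas the spatial branch only sees a 3x3 neighbourhood. That contrast between global and local receptive fields is the point of the sketch.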

Original language: English
Article number: 110306
Journal: Knowledge-Based Systems
Volume: 263
DOI
Publication status: Published - 5 Mar 2023
