Uncertainty-guided hierarchical frequency domain Transformer for image restoration

Authors: Mingwen Shao, Yuanjian Qiao, Deyu Meng, Wangmeng Zuo
Affiliations: China University of Petroleum (East China); Harbin Institute of Technology

Research output: Contribution to journal › Article › peer-review

46 Scopus citations

Abstract

Existing convolutional neural network (CNN)-based and vision Transformer (ViT)-based image restoration methods are usually explored in the spatial domain. However, we employ Fourier analysis to show that these spatial-domain models cannot perceive the entire frequency spectrum of images: they focus mainly on either high-frequency components (CNN-based models) or low-frequency components (ViT-based models). This intrinsic limitation results in a partial loss of semantic information and the appearance of artifacts. To address this limitation, we propose a novel uncertainty-guided hierarchical frequency domain Transformer, named HFDT, that effectively learns both high- and low-frequency information while perceiving local and global features. Specifically, to aggregate semantic information from various frequency levels, we propose a dual-domain feature interaction mechanism in which global frequency information and local spatial features are extracted by corresponding branches. The frequency domain branch adopts the Fast Fourier Transform (FFT) to convert the features from the spatial domain to the frequency domain, where the global low- and high-frequency components are learned with log-linear complexity. Complementarily, an efficient convolution group is employed in the spatial domain branch to capture local high-frequency details. Moreover, we introduce an uncertainty degradation-guided strategy to efficiently represent degraded prior information, rather than simply distinguishing degraded from non-degraded regions in binary form. Our approach achieves competitive results in several degradation scenarios, including rain streaks, raindrops, motion blur, and defocus blur.
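
The frequency domain branch described in the abstract can be illustrated with a minimal NumPy sketch. This is not the HFDT implementation: the function names (`frequency_branch`, `radial_low_pass_mask`), the fixed radial mask, and the cutoff value are assumptions made for illustration, and a trained model would learn its frequency weights rather than use a hand-set mask. The sketch only shows how an FFT moves features to the frequency domain, where a single element-wise product acts on the whole image at O(N log N) cost, and how the spectrum splits into low- and high-frequency parts.

```python
import numpy as np

def frequency_branch(x, weight):
    """Filter a (C, H, W) feature map globally in the frequency domain.

    `weight` is a hypothetical per-frequency filter of shape (H, W//2 + 1),
    or any shape broadcastable to the rFFT spectrum. In a trained network
    this would be a learned parameter; here it is supplied by the caller.
    """
    h, w = x.shape[-2:]
    spec = np.fft.rfft2(x, axes=(-2, -1))   # spatial domain -> frequency domain
    spec = spec * weight                     # element-wise product = global filtering
    return np.fft.irfft2(spec, s=(h, w), axes=(-2, -1))  # back to spatial domain

def radial_low_pass_mask(h, w, cutoff):
    """Binary mask keeping frequencies whose radius is at most `cutoff` (cycles/pixel)."""
    fy = np.fft.fftfreq(h)[:, None]          # vertical frequencies, shape (H, 1)
    fx = np.fft.rfftfreq(w)[None, :]         # horizontal frequencies, shape (1, W//2 + 1)
    radius = np.sqrt(fy ** 2 + fx ** 2)
    return (radius <= cutoff).astype(np.float64)

# Toy usage: split a random feature map into low- and high-frequency parts.
rng = np.random.default_rng(0)
x = rng.standard_normal((3, 32, 32))
low_mask = radial_low_pass_mask(32, 32, cutoff=0.1)  # cutoff chosen arbitrarily
low = frequency_branch(x, low_mask)
high = frequency_branch(x, 1.0 - low_mask)
```

Because the transform is linear and the two masks sum to one, `low + high` reconstructs `x` up to floating-point error, which makes the split easy to check. The spatial domain branch (the convolution group) and the uncertainty-guided strategy are not covered by this sketch.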

Original language: English
Article number: 110306
Journal: Knowledge-Based Systems
Volume: 263
State: Published - 5 Mar 2023

Keywords

  • Frequency-domain Transformer
  • Image restoration
  • Log-linear complexity
  • Uncertainty-guided
