CosAE: Learnable Fourier Series for Image Restoration

Abstract

In this paper, we introduce CosAE (Cosine Autoencoder), a novel, generic Autoencoder that seamlessly leverages the classic Fourier series with a feed-forward neural network. CosAE represents an input image as a series of 2D Cosine time series, each defined by a tuple of learnable frequency and Fourier coefficients. This method stands in contrast to a conventional Autoencoder that often sacrifices detail in their reduced-resolution bottleneck latent spaces. CosAE, however, encodes frequency coefficients, i.e., the amplitudes and phases, in its bottleneck. This encoding enables extreme spatial compression, e.g., 64x downsampled feature maps in the bottleneck, without losing detail upon decoding.

We showcase the advantage of CosAE via extensive experiments on flexible-resolution super-resolution and blind image restoration, two highly challenging tasks that demand the restoration network to effectively generalize to complex and even unknown image degradations. Our method surpasses state-of-the-art approaches, highlighting its capability to learn a generalizable representation for image restoration.

Video

Network Architecture

CosAE produce an extremely compressed representation of the input image in the bottleneck, while still being able to be expended via HCM, and decoded to a high-quality image. The HCM is a group of learnable Fourier basis functions, used to expand the compressed representation to the original image space.

Flexible-Resolution Super-Resolution

8x upsampled image. Drag to compare results

Blind Image Restoration

Blind image restoration with unknown degradations. Drag to compare.

BibTeX

@article{liu2024cosae,
  author    = {Sifei Liu, Shalini De Mello, Jan Kautz},
  title     = {CosAE: Learnable Fourier Series for Image Restoration},
  journal   = {NeurIPS},
  year      = {2024},
}