Spatially-Adaptive Pixelwise Networks for Fast Image Translation

Tamar Rott Shaham,Michael Gharbi,Richard Zhang,Eli Shechtman,Tomer Michaeli

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021（2021）

引用 69|浏览133

暂无评分

摘要

We introduce a new generator architecture, aimed at fast and efficient high-resolution image-to-image translation. We design the generator to be an extremely lightweight function of the full-resolution image. In fact, we use pixel-wise networks; that is, each pixel is processed independently of others, through a composition of simple affine transformations and nonlinearities. We take three important steps to equip such a seemingly simple function with adequate expressivity. First, the parameters of the pixel-wise networks are spatially varying, so they can represent a broader function class than simple 1 x 1 convolutions. Second, these parameters are predicted by a fast convolutional network that processes an aggressively low-resolution representation of the input. Third, we augment the input image by concatenating a sinusoidal encoding of spatial coordinates, which provides an effective inductive bias for generating realistic novel high-frequency image content. As a result, our model is up to 18x faster than state-of-the-art baselines. We achieve this speedup while generating comparable visual quality across different image resolutions and translation domains.

查看译文

关键词

image resolutions,translation domains,spatially-adaptive pixelwise networks,fast image translation,generator architecture,fast resolution image-to-image translation,efficient high-resolution image-to-image translation,extremely lightweight function,full-resolution image,pixel-wise networks,affine transformations,nonlinearities,broader function class,fast convolutional network,low-resolution representation,input image,spatial coordinates,high-frequency image content

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要