Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules

Cheng Zhengxue,Sun Heming,Takeuchi Masaru,Katto Jiro

CVPR（2020）

引用 672|浏览148

暂无评分

摘要

Image compression is a fundamental research field due to its significant influence on transmission and storage. Many well-known image compression standards have been developed and widely used for many decades. Recently, learned compression methods exhibit a fast development trend with promising results. However, there is still a performance gap between learned compression algorithms and reigning compression standards, especially in terms of widely used PSNR metric. In this paper, we explore the remaining redundancy of recent learned compression algorithms. We have found accurate entropy models for rate estimation largely affect the optimization of network parameters and thus affect the rate-distortion performance. Therefore, in this paper, we propose to use discretized Gaussian Mixture Likelihoods to parameterize the distributions of latent codes, which can achieve a more accurate and flexible entropy model. Besides, we take advantage of recent attention modules and incorporate them into the network architecture to enhance the performance. Experimental results demonstrate our proposed method achieves a state-of-the-art performance compared to existing learned compression methods on both Kodak and high-resolution datasets. To our knowledge our approach is the first work to achieve comparable performance with latest compression standard Versatile Video Coding (VVC) regarding PSNR. More importantly, our approach can generate more visually pleasant results when optimized by MS-SSIM.

查看译文

关键词

learned image compression,discretized Gaussian mixture likelihoods,rate-distortion performance,entropy model,versatile video coding,attention modules,MS-SSIM

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要