Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition(2017)

引用 421|浏览261
暂无评分
摘要
We propose a method for lossy image compression based on recurrent, convolutional neural networks that outperforms BPG (4:2:0 ), WebP, JPEG2000, and JPEG as measured by MS-SSIM. We introduce three improvements over previous research that lead to this state-of-the-art result. First, we show that training with a pixel-wise loss weighted by SSIM increases reconstruction quality according to several metrics. Second, we modify the recurrent architecture to improve spatial diffusion, which allows the network to more effectively capture and propagate image information through the network's hidden state. Finally, in addition to lossless entropy coding, we use a spatially adaptive bit allocation algorithm to more efficiently use the limited number of bits to encode visually complex image regions. We evaluate our method on the Kodak and Tecnick image sets and compare against standard codecs as well recently published methods based on deep neural networks.
更多
查看译文
关键词
lossless entropy coding,spatially adaptive bit allocation algorithm,visually complex image regions,deep neural networks,priming bit rates,spatially adaptive bit rates,recurrent networks,convolutional neural networks,JPEG,MS-SSIM,recurrent architecture,spatial diffusion,image information,lossy image compression
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要