Normalized Non-Negative Sparse Encoder for Fast Image Representation

Shizhou Zhang,Jinjun Wang,Weiwei Shi,Yihong Gong,Yong Xia,Yanning Zhanga

IEEE Transactions on Circuits and Systems for Video Technology（2019）

引用 7|浏览79

暂无评分

摘要

Image representation based on sparse coding generalizes the bag of words model. Although it reduces the reconstruction error for local features to achieve the state-of-the-art image classification performance, the large computational cost hinders the application of sparse coding-based image features. In this paper, we propose approximating a sparse code using the output of a simple neural network. The resulting parameter learning model for the neural network automatically incorporates non-negative and shift-invariant constraints, leading to an efficient normalized non-negative sparse coding (N ³ SC) sparse encoder. Without the use of the traditional iterative process to solve the sparse coding objective, the sparse encoder directly “converts” each local feature into a sparse code. We also introduce a method for training the encoder based on the auto-encoder method. In addition, we formally propose the corresponding sparse coding scheme called N ³ SC, which enforces both the non-negative constraint and the shift-invariant constraint in addition to the traditional sparse coding criteria. As demonstrated by several experiments, the obtained N ³ SC encoder requires only 3%–10% of the processing time for image feature extraction compared with the standard sparse coding scheme. At the same time, the features extracted using the exact solutions of the N ³ SC coding scheme and the N ³ SC encoder offer superior image classification accuracy compared to the accuracy of many existing sparse coding-based representations.

查看译文

关键词

Encoding,Image coding,Feature extraction,Computational modeling,Neural networks,Image reconstruction,Training

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要