Locality-constrained Linear Coding for image classification

CVPR(2010)

引用 4069|浏览908
暂无评分
摘要
The traditional SPM approach based on bag-of-features (BoF) requires nonlinear classifiers to achieve good image classification performance. This paper presents a simple but effective coding scheme called Locality-constrained Linear Coding (LLC) in place of the VQ coding in traditional SPM. LLC utilizes the locality constraints to project each descriptor into its local-coordinate system, and the projected coordinates are integrated by max pooling to generate the final representation. With linear classifier, the proposed approach performs remarkably better than the traditional nonlinear SPM, achieving state-of-the-art performance on several benchmarks. Compared with the sparse coding strategy [22], the objective function used by LLC has an analytical solution. In addition, the paper proposes a fast approximated LLC method by first performing a K-nearest-neighbor search and then solving a constrained least square fitting problem, bearing computational complexity of O(M + K2). Hence even with very large codebooks, our system can still process multiple frames per second. This efficiency significantly adds to the practical values of LLC for real applications.
更多
查看译文
关键词
k-nearest-neighbor search,sparse coding strategy,image coding,image matching,learning (artificial intelligence),nonlinear classifiers,locality-constrained linear coding,bag-of-features,vector quantisation,constrained least square fitting problem,vq coding,computational complexity,least squares approximations,image classification,spatial pyramid matching,histograms,coordinate system,feature extraction,sparse coding,learning artificial intelligence,encoding,frames per second,analytic solution,bismuth,k nearest neighbor,linear code,objective function
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要