DSP: Discriminative Spatial Part modeling for Fine-Grained Visual Categorization

Image and Vision Computing(2017)

引用 7|浏览58
暂无评分
摘要
Different from the basic-level classification, the Fine-Grained Visual Categorization (FGVC) aims to classify objects belonging to the same species. Therefore, it is more challenging than the basic-level classification. Recently, significant advances have been achieved in FGVC. However, most of the existing methods require bounding boxes or part annotations for training and testing, resulting in limited usability and flexibility. To conquer these limitations, we aim to automatically detect the bounding boxes and parts for FGVC. The bounding boxes are acquired by transferring bounding boxes from training images to testing images. Based on the generated bounding boxes, we employ a multiple-layer Orientational Spatial Part (OSP) model to learn local parts for the object. To achieve more discriminative part modeling, the Discriminative Spatial Part (DSP) model is proposed to select the discriminative parts from OSP. Finally, we employ Convolutional Neural Network (CNN) as the feature extractor and train a linear SVM as the classifier. Extensive experiments on public benchmark datasets manifest the impressive performance of our method, i.e., classification accuracy achieves 79.8% on CUB-200-2011 and 85.7% on Aircraft, which are higher than many existing methods using manual annotations.
更多
查看译文
关键词
Orientational Spatial Part model,Discriminative Spatial Part modeling,Fine-Grained Visual Categorization,CNN
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要