Weighted Feature Pooling Network In Template-Based Recognition

COMPUTER VISION - ACCV 2018, PT V(2018)

引用 1|浏览50
暂无评分
摘要
Many computer vision tasks are template-based learning tasks in which multiple instances of a specific concept (e.g. multiple images of a subject's face) are available at once to the learning algorithm. The template structure of the input data provides an opportunity for generating a robust and discriminative unified template-level representation that effectively exploits the inherent diversity of feature-level information across instances within a template. In contrast to other feature aggregation methods, we propose a new technique to dynamically predict weights that consider factors such as noise and redundancy in assessing the importance of image-level features and use those weights to appropriately aggregate the features into a single template-level representation. We present extensive experimental results on the MNIST, CIFAR10, UCF101, IJB-A, IJB-B, and Janus CS4 datasets to show that the new technique outperforms statistical feature pooling methods as well as other neural-network-based aggregation mechanisms on a broad set of tasks.
更多
查看译文
关键词
Template representation, Feature pooling, Attention network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要