Actionness Estimation Using Hybrid Fully Convolutional Networks

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016

Abstract
Actionness was introduced to quantify the likelihood that a specific location contains a generic action instance. Accurate and efficient estimation of actionness is important in video analysis and may benefit related tasks such as action recognition and action detection. This paper presents a new deep architecture for actionness estimation, called the hybrid fully convolutional network (H-FCN), which is composed of an appearance FCN (A-FCN) and a motion FCN (M-FCN). These two FCNs leverage the strong capacity of deep models to estimate actionness maps from the perspectives of static appearance and dynamic motion, respectively. In addition, the fully convolutional nature of H-FCN allows it to efficiently process videos of arbitrary size. Experiments on the challenging Stanford40, UCF Sports, and JHMDB datasets verify the effectiveness of H-FCN for actionness estimation and demonstrate that our method outperforms previous ones. Moreover, we apply the estimated actionness maps to action proposal generation and action detection, where they substantially advance the state-of-the-art performance.
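The two properties named in the abstract, size-agnostic fully convolutional processing and the pairing of an appearance map with a motion map, can be illustrated with a toy sketch. This is not the authors' network: `toy_fcn` is a hypothetical single-kernel stand-in for a trained FCN, and `fuse` assumes a simple per-pixel average of the two maps, which the abstract does not specify.

```python
import math


def conv2d_same(image, kernel):
    """Slide a small kernel over a 2D list with zero padding.

    The output has the same height and width as the input, so the same
    kernel applies to frames of any size (the fully convolutional property).
    """
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for i in range(kh):
                for j in range(kw):
                    yy, xx = y + i - ph, x + j - pw
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += kernel[i][j] * image[yy][xx]
            out[y][x] = acc
    return out


def toy_fcn(image, kernel):
    """Hypothetical stand-in for A-FCN or M-FCN: one convolution plus a sigmoid.

    Returns a per-pixel score in (0, 1), playing the role of an actionness map.
    """
    scores = conv2d_same(image, kernel)
    return [[1.0 / (1.0 + math.exp(-s)) for s in row] for row in scores]


def fuse(appearance_map, motion_map):
    """Assumed fusion rule: per-pixel average of the two actionness maps."""
    return [
        [0.5 * (a + m) for a, m in zip(row_a, row_m)]
        for row_a, row_m in zip(appearance_map, motion_map)
    ]
```

In this sketch the appearance input would be an image frame and the motion input a same-sized motion field such as one optical-flow channel; both pass through `toy_fcn` with their own kernels and the results go to `fuse`. Because no layer fixes the spatial size, a 4x6 frame and a 5x3 frame both yield maps of matching size.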
Keywords
actionness estimation, hybrid fully convolutional networks, HFCN, generic action instance, video analysis, appearance FCN, motion FCN, static appearance, dynamic motion, Stanford40 datasets, UCF Sports datasets, JHMDB datasets, action proposal generation, action detection, actionness maps