Non-local Neural Networks

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition(2018)

引用 10475|浏览911
暂无评分
摘要
Both convolutional and recurrent operations are building blocks that process one local neighborhood at a time. In this paper, we present non-local operations as a generic family of building blocks for capturing long-range dependencies. Inspired by the classical non-local means method in computer vision, our non-local operation computes the response at a position as a weighted sum of the features at all positions. This building block can be plugged into many computer vision architectures. On the task of video classification, even without any bells and whistles, our non-local models can compete or outperform current competition winners on both Kinetics and Charades datasets. In static image recognition, our non-local models improve object detection/segmentation and pose estimation on the COCO suite of tasks. Code is available at https://github.com/facebookresearch/video-nonlocal-net .
更多
查看译文
关键词
nonlocal neural networks,convolutional operations,recurrent operations,long-range dependencies,nonlocal means method,computer vision architectures,nonlocal models,nonlocal operation,computer vision,video classification,Charades datasets,Kinetics datasets,static image recognition,object detection,object segmentation,pose estimation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要