Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length.

Xuanhan Wang, Lianli Gao, Peng Wang, Xiaoshuai Sun, Xianglong Liu

IEEE Transactions on Multimedia (2018)

Citations: 248 | Views: 83

Abstract

3-D convolutional neural networks (3-D-convNets) have recently been proposed for action recognition in videos and have achieved promising results. However, existing 3-D-convNets have two “artificial” requirements that may reduce the quality of video analysis: 1) they require a fixed-size (e.g., $112 \times 112$) input video; and 2) most of them require a fixed-length input (i.e., v...

Keywords

Videos, Three-dimensional displays, Feature extraction, Convolution, Two dimensional displays, Computational modeling, Histograms
