Long-term Temporal Convolutions for Action Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (2018)

Citations: 1113 | Views: 382
Abstract
Typical human actions last several seconds and exhibit characteristic spatio-temporal structure. Recent methods attempt to capture this structure and learn action representations with convolutional neural networks. Such representations, however, are typically learned at the level of a few video frames, failing to model actions at their full temporal extent. In this work we learn video representatio...
Keywords
Spatial resolution, Optical imaging, Neural networks, Training, Estimation, Network architecture, Optical filters