Long-term Temporal Convolutions for Action Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence(2018)
摘要
Typical human actions last several seconds and exhibit characteristic spatio-temporal structure. Recent methods attempt to capture this structure and learn action representations with convolutional neural networks. Such representations, however, are typically learned at the level of a few video frames failing to model actions at their full temporal extent. In this work we learn video representatio...
更多查看译文
关键词
Spatial resolution,Optical imaging,Neural networks,Training,Estimation,Network architecture,Optical filters
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要