STDM-transformer: Space-time dual multi-scale transformer network for skeleton-based action recognition

Zhifu Zhao, Ziwei Chen,Jianan Li,Xuemei Xie,Kai Chen,Xiaotian Wang,Guangming Shi

NEUROCOMPUTING（2024）

引用 0|浏览17

暂无评分

摘要

Transformer-based methods have currently demonstrated impressive results in the field of skeleton-based action recognition. Nevertheless, how to effectively model multi-scale features with transformers remains a challenging problem, which is crucial to distinguish various actions. In this paper, we propose a Space- time Dual Multi-scale transformer (STDM-transformer) to explore the multi-scale collaborative representation employing both fine and coarse scale motion information. In contrast to existing approaches which typically propagate information between scales in a single fusion manner, our Space-time Dual Multi-scale method stratifies the space-time multi-scale into dual levels. One level is to construct fine-grained local motion interactions. In detail, the space-time multi-scale partition strategy and the novel intra-inter space-time transformer module are proposed to extract and aggregate the feature in part scale and body scale, respectively. The other is aimed at modeling coarse-grained global motion contexts, in which the layer-wise multi -scale progressive fusion strategy is designed. Extensive experimental results demonstrate that the proposed STDM-transformer achieves the SOTA performance on large-scale datasets.

查看译文

关键词

Skeleton-based action recognition,Transformer,Dual multi-scale network

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要