Point-level feature learning based on vision transformer for occluded person re-identification

Image and Vision Computing (2024)

Abstract
Person re-identification is challenging because variations in pose and occlusion significantly affect the matching of visual features across camera views. This paper proposes a novel method for occluded person re-identification that introduces point-level feature learning based on vision transformers. Our approach uses a pose estimator to detect the keypoints of the human body and employs these points to locate intermediate features. The intermediate features at the keypoints are fed to a pose-based transformer branch to learn point-level features. We then design a part-based transformer branch to learn part-level features that capture the visual features of different image parts, further enhancing the discriminative power of the learned features. Additionally, we employ a global branch that learns a global-level feature by treating the person's image as a single entity. Finally, we integrate the point-level, part-level, and global-level features to represent a person. Experimental results on occluded and partial person re-identification datasets demonstrate the effectiveness of the proposed approach, which shows potential for improving person re-identification in scenarios with occlusion and pose variations.
Keywords
Occluded person re-identification, Feature learning, Vision transformer, Pose estimation
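
The abstract describes using detected keypoints to locate intermediate features and then integrating point-level, part-level, and global-level features. The following is a minimal sketch of those two steps only, not the paper's implementation: the abstract does not say how features are sampled or fused, so the nearest-cell lookup, the confidence threshold for treating a keypoint as occluded, the mean pooling, and the concatenation are all assumptions. The transformer branches themselves are omitted, and every function and parameter name here is hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
FeatureMap = List[List[Vector]]  # indexed as [row][col] -> C-dimensional vector


def sample_keypoint_features(
    feature_map: FeatureMap,
    keypoints: Sequence[Tuple[float, float, float]],
    min_confidence: float = 0.2,  # assumed threshold, not from the paper
) -> List[Vector]:
    """Pick one feature vector per keypoint by nearest-cell lookup.

    Each keypoint is (x, y, confidence) with x and y normalized to [0, 1].
    Keypoints below min_confidence (assumed occluded) get a zero vector.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    channels = len(feature_map[0][0])
    sampled = []
    for x, y, confidence in keypoints:
        if confidence < min_confidence:
            sampled.append([0.0] * channels)
            continue
        # Map normalized coordinates to a grid cell, clamped to the map bounds.
        col = max(0, min(int(x * width), width - 1))
        row = max(0, min(int(y * height), height - 1))
        sampled.append(list(feature_map[row][col]))
    return sampled


def mean_pool(vectors: Sequence[Vector]) -> Vector:
    """Average a non-empty list of equal-length vectors element-wise."""
    count = len(vectors)
    return [sum(values) / count for values in zip(*vectors)]


def fuse_descriptors(
    global_feat: Vector,
    part_feats: Sequence[Vector],
    point_feats: Sequence[Vector],
) -> Vector:
    """Concatenate the global feature with pooled part and point features."""
    return list(global_feat) + mean_pool(part_feats) + mean_pool(point_feats)


if __name__ == "__main__":
    # Toy 2x2 feature map with 2 channels per cell.
    fmap = [[[1.0, 0.0], [0.0, 1.0]],
            [[2.0, 2.0], [3.0, 1.0]]]
    kps = [(0.1, 0.1, 0.9), (0.9, 0.9, 0.8), (0.5, 0.5, 0.05)]
    points = sample_keypoint_features(fmap, kps)
    print(points)
    print(fuse_descriptors([0.5, 0.5], [[1.0, 3.0], [3.0, 1.0]], points[:2]))
```

Zeroing low-confidence keypoints is one simple way to keep occluded joints from contributing to the descriptor; a learned model might instead mask the corresponding tokens inside the transformer branch.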