On Input/Output Architectures for Convolutional Neural Network-Based Cross-View Gait Recognition

Noriko Takemura,Yasushi Makihara,Daigo Muramatsu,Tomio Echigo,Yasushi Yagi

IEEE Transactions on Circuits and Systems for Video Technology（2019）

引用 181|浏览97

暂无评分

摘要

In this paper, we discuss input/output architectures for convolutional neural network (CNN)- based cross-view gait recognition. For this purpose, we consider two aspects: verification versus identifi- cation, and the trade-off between spatial displacement caused by subject difference and view difference. More specifically, we use the Siamese network with a pair of inputs and contrastive loss for verification, and a triplet network with a triplet of inputs and triplet ranking loss for identification. The aforementioned CNN architec- tures are insensitive to spatial displacement because the difference between a matching pair is calculated at the last layer after passing through the convolution and max pooling layers; hence, they are expected to work relatively well under large view differences. By contrast, because it is better to use the spatial displace- ment to its best advantage because of the subject dif- ference under small view differences, we also use CNN architectures where the difference between a matching pair is calculated at the input level to make them more sensitive to spatial displacement. We conducted experiments for cross-view gait recognition and con- firmed that the proposed architectures outperformed the state-of-the-art benchmarks in accordance with their suitable situations of verification/identification tasks and view differences.

查看译文

关键词

Gait recognition,Probes,Network architecture,Robustness,Performance evaluation,Neural networks

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要