Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation

Matteo Fabbri,Fabio Lanzi,Simone Calderara,Stefano Alletto,Rita Cucchiara

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)（2020）

引用 95|浏览322

暂无评分

摘要

In this paper we present a novel approach for bottom-up multi-person 3D human pose estimation from monocular RGB images. We propose to use high resolution volumetric heatmaps to model joint locations, devising a simple and effective compression method to drastically reduce the size of this representation. At the core of the proposed method lies our Volumetric Heatmap Autoencoder, a fully-convolutional network tasked with the compression of ground-truth heatmaps into a dense intermediate representation. A second model, the Code Predictor, is then trained to predict these codes, which can be decompressed at test time to re-obtain the original representation. Our experimental evaluation shows that our method performs favorably when compared to state of the art on both multi-person and single-person 3D human pose estimation datasets and, thanks to our novel compression strategy, can process full-HD images at the constant runtime of 8 fps regardless of the number of subjects in the scene. Code and models are publicly available.

查看译文

关键词

full-HD image processing,single-person 3D human pose estimation datasets,ground-truth heatmap compression,size reduction,bottom-up multiperson 3D human pose estimation,novel compression strategy,single-person,Code Predictor,dense intermediate representation,fully-convolutional network,Volumetric Heatmap Autoencoder,effective compression method,simple compression method,high resolution volumetric heatmaps,monocular RGB images

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要