Efficient Convnet-Based Marker-Less Motion Capture In General Scenes With A Low Number Of Cameras

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2015)

引用 182|浏览202
暂无评分
摘要
We present a novel method for accurate marker-less capture of articulated skeleton motion of several subjects in general scenes, indoors and outdoors, even from input filmed with as few as two cameras. Our approach unites a discriminative image-based joint detection method with a model-based generative motion tracking algorithm through a combined pose optimization energy. The discriminative part-based pose detection method, implemented using Convolutional Networks (ConvNet), estimates unary potentials for each joint of a kinematic skeleton model. These unary potentials are used to probabilistically extract pose constraints for tracking by using weighted sampling from a pose posterior guided by the model. In the final energy, these constraints are combined with an appearance-based model-to-image similarity term. Poses can be computed very efficiently using iterative local optimization, as ConvNet detection is fast, and our formulation yields a combined pose estimation energy with analytic derivatives. In combination, this enables to track full articulated joint angles at state-of-the-art accuracy and temporal stability with a very low number of cameras.
更多
查看译文
关键词
ConvNet-based marker-less motion capture,cameras,articulated skeleton motion,discriminative image-based joint detection method,model-based generative motion tracking algorithm,combined pose optimization energy,discriminative part-based pose detection method,convolutional networks,kinematic skeleton model,unary potentials,weighted sampling,pose posterior,appearance-based model-to-image similarity term,iterative local optimization,ConvNet detection,pose estimation energy,analytic derivatives,temporal stability
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要