LOMo: Latent Ordinal Model for Facial Analysis in Videos

Karan Sikka,Gaurav Sharma,Marian Bartlett

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)（2016）

引用 101|浏览47

暂无评分

摘要

We study the problem of facial analysis in videos. We propose a novel weakly supervised learning method that models the video event (expression, pain etc.) as a sequence of automatically mined, discriminative sub-events (eg. onset and offset phase for smile, brow lower and cheek raise for pain). The proposed model is inspired by the recent works on Multiple Instance Learning and latent SVM/HCRF- it extends such frameworks to model the ordinal or temporal aspect in the videos, approximately. We obtain consistent improvements over relevant competitive baselines on four challenging and publicly available video based facial analysis datasets for prediction of expression, clinical pain and intent in dyadic conversations. In combination with complimentary features, we report state-of-the-art results on these datasets.

查看译文

关键词

LOMo,latent ordinal model,weakly supervised learning,video event modeling,automatically mined discriminative subevents,multiple instance learning,latent SVM/HCRF,video based facial analysis datasets,facial expression prediction,clinical pain,dyadic conversations

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要