Long Video Question Answering: A Matching-guided Attention Model

Pattern Recognition (2020)

Abstract
• We study long video QA, a rarely investigated but practically important problem that is applicable to many long video tasks.
• We propose a Matching-guided Attention Model (MAM) for the long video QA problem, which jointly matches and regresses video snippets for each question and predicts answers from the attended visual features.
• We generate two new datasets (a simple one and a complex one) of long videos with paired questions and answers, which can be used to evaluate long video QA methods. Experimental results demonstrate the effectiveness of the proposed method in comparison with two short video QA methods and a baseline method.
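To make the idea of attending over video snippets by their match to a question more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the MAM architecture from the paper: the function names, the use of cosine similarity as the matching score, the softmax weighting, and the `temperature` parameter are all hypothetical choices, and the snippet regression and answer prediction components are omitted.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matching_guided_attention(question_vec, snippet_feats, temperature=1.0):
    """Weight snippet features by how well each snippet matches the question.

    Assumption: the matching score is cosine similarity; the paper's actual
    matching and regression modules are not reproduced here.
    """
    scores = [cosine(question_vec, f) / temperature for f in snippet_feats]
    weights = softmax(scores)
    dim = len(snippet_feats[0])
    # Attended feature: attention-weighted sum of the snippet features.
    attended = [
        sum(w * f[i] for w, f in zip(weights, snippet_feats)) for i in range(dim)
    ]
    return weights, attended


# Toy example: three snippets, the first one aligned with the question.
question = [1.0, 0.0]
snippets = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
weights, attended = matching_guided_attention(question, snippets)
```

In this toy example the first snippet receives the largest attention weight because it is most similar to the question vector. In a full model, the attended feature would then be passed to an answer predictor.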
Keywords
Long video QA, Matching-guided attention