Topic-aware video summarization using multimodal transformer.

Yubo Zhu,Wentian Zhao, Rui Hua,Xinxiao Wu

Pattern Recognit.(2023)

引用 1|浏览10
暂无评分
摘要
•The new task which aims to generate multiple video summaries to meet different user interests.•The new TopicSum dataset contains 136 videos to support the study of this new task.•Multimodal Transformer model to simultaneously predicts topics and generates topic-related summaries by fusing multimodal features.•Extensive experiments on TopicSum dataset that show the effectiveness of our method on both quantitative and qualitative evaluations.
更多
查看译文
关键词
topic-aware
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要