Video2Sentence and vice versa.
MM '13: ACM Multimedia Conference Barcelona Spain October, 2013(2013)
摘要
In this technical demonstration, we showcase a multimedia search engine that retrieves a video from a sentence, or a sentence from a video. The key novelty is our machine translation capability that exploits a cross-media representation for both the visual and textual modality using concept vocabularies. We will demonstrate the translations using arbitrary web videos and sentences related to everyday events. What is more, we will provide an automatically generated explanation, in terms of concept detectors, on why a particular video or sentence has been retrieved as the most likely translation.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络