Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.

IEEE Transactions on Circuits and Systems for Video Technology(2018)

引用 18|浏览147
暂无评分
摘要
For a number of important problems, isolated semantic representations of individual syntactic words or visual objects do not suffice, but instead a compositional semantic representation is required; for example, a literal phrase or a set of spatially concurrent objects. In this paper, we aim to harness the existing image-sentence databases to exploit the compositional nature of image-sentence data...
更多
查看译文
关键词
Semantics,Visualization,Bicycles,Correlation,Machine learning,Buildings,Feature extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要