Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.

Yueting Zhuang, Jun Song, Fei Wu, Xi Li, Zhongfei Zhang, Yong Rui

IEEE Transactions on Circuits and Systems for Video Technology (2018)

Citations: 18 | Views: 147

Abstract

For a number of important problems, isolated semantic representations of individual syntactic words or visual objects do not suffice, but instead a compositional semantic representation is required; for example, a literal phrase or a set of spatially concurrent objects. In this paper, we aim to harness the existing image-sentence databases to exploit the compositional nature of image-sentence data...

Keywords

Semantics, Visualization, Bicycles, Correlation, Machine learning, Buildings, Feature extraction
