Low-Rank Regularized Multimodal Representation For Micro-Video Event Detection

Jing Zhang,Yuting Wu, Jinghui Liu,Peiguang Jing,Yuting Su

IEEE ACCESS（2020）

引用 1|浏览60

暂无评分

摘要

Currently, micro-videos are becoming one of the most representative products in the new media age. Although the length of micro-videos is limited to cater to the fast pace of life and are beneficial for rapid distribution, micro-videos are usually recorded in specific scenarios and tend to convey relatively complete events. To more accurately obtain the event types of micro-videos to facilitate potential applications, we propose a low-rank regularized multimodal representation method for micro-video event detection. To solve the less descriptive power of each modality, the latent common representation of micro-videos is obtained by exploiting complementarity among modalities. A considerable gain in accuracy on this basis can be achieved by further considering the low-rank constraint for the lowest-rank intrinsic representation and a flexible label-relaxation strategy for mappings between representations and their correspondences. A newly constructed micro-video dataset is used to verify the advantages of our proposed model. The experimental results demonstrated the superior performance of our proposed method compared with state-of-the-art methods.

查看译文

关键词

Micro-video, event detection, multimodal, low-rank representation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要