Cross-media web video event mining based on multiple semantic-paths embedding

Xia Xiao, Mingyue Du, Shuyu Xu, Guoying Liu, Chengde Zhang

Neural Computing & Applications (2023)

Abstract
Web video event mining based on cross-media fusion has become a research hotspot. However, each video is described by only a dozen noisy words, which makes its textual features extremely unstable. Moreover, different people may describe the same video with completely different words. As a result, the semantic associations between textual and visual information are very sparse, which poses great challenges for web video event mining based on cross-media associations. To address this issue, this paper proposes a novel framework that enriches the associations between near-duplicate keyframes (NDKs) and terms based on multiple semantic-paths embedding. After data preprocessing, we build a heterogeneous information network to establish associations among NDKs, terms, and videos. Then, a semantic-path walk strategy is designed to generate meaningful semantic-node sequences for embedding. Next, an embedding fusion method is proposed to predict the distribution characteristics of each term in NDKs. Finally, multiple correspondence analysis is used to mine web video events. Experiments on web videos from YouTube show that the proposed method outperforms several state-of-the-art baseline models, with an average F1 score improvement of 19–50%.
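The abstract does not spell out the walk strategy, so the following is only a minimal sketch of one common way to generate type-constrained node sequences over a heterogeneous network of NDKs, terms, and videos. The toy edges, the node-naming scheme, the function names, and the example path "nvtvn" (NDK, video, term, video, NDK) are all assumptions made for illustration and are not taken from the paper.

```python
import random
from collections import defaultdict

# Toy heterogeneous network (made-up data). The first character of each
# node id encodes its type: "v" = video, "n" = NDK, "t" = term.
EDGES = [
    ("v1", "n1"), ("v1", "n2"), ("v2", "n2"), ("v3", "n3"),
    ("v1", "t_flood"), ("v2", "t_flood"),
    ("v2", "t_rescue"), ("v3", "t_rescue"),
]


def node_type(node):
    """Return the one-letter type tag of a node id."""
    return node[0]


def build_adjacency(edges):
    """Map node -> neighbour type -> list of neighbours (undirected)."""
    adj = defaultdict(lambda: defaultdict(list))
    for a, b in edges:
        adj[a][node_type(b)].append(b)
        adj[b][node_type(a)].append(a)
    return adj


def semantic_path_walk(adj, start, path, length, rng):
    """Random walk of up to `length` nodes whose types follow `path`.

    `path` is a string of type tags that starts and ends with the same
    type (hypothetical example: "nvtvn"), so it can be repeated cyclically.
    The walk stops early if no neighbour of the required type exists.
    """
    if path[0] != path[-1] or node_type(start) != path[0]:
        raise ValueError("path must be cyclic and match the start node type")
    cycle = path[:-1]  # drop the repeated end type before cycling
    walk = [start]
    while len(walk) < length:
        wanted = cycle[len(walk) % len(cycle)]
        candidates = adj[walk[-1]][wanted]
        if not candidates:
            break
        walk.append(rng.choice(candidates))
    return walk


adj = build_adjacency(EDGES)
rng = random.Random(0)  # fixed seed so repeated runs give the same walks
walks = [semantic_path_walk(adj, "n1", "nvtvn", 9, rng) for _ in range(3)]
for w in walks:
    print(" -> ".join(w))
```

Sequences like these could then be passed to a skip-gram style embedding model so that NDKs and terms sharing many paths end up close together. Whether the paper uses this exact scheme is not stated in the abstract.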
Keywords
event, cross-media, semantic-paths