Research on type-aware fashion compatibility prediction based on a hybrid attention mechanism

Yun Li, GuoXiang Li,Jing Zhang,Peiguang Jing, Xingyu Lu

Multimedia Tools and Applications(2024)

引用 0|浏览0
暂无评分
摘要
In recent years, the vigorous development of e-commerce has also greatly promoted the development of the fashion multimedia field. Fashion compatibility modeling plays a very important role in a series of applications, such as fashion retrieval and personalized recommendation. Because fashion compatibility prediction usually contains multiple modal information, in this paper, we present a type-aware fashion compatibility prediction model based on a hybrid attention mechanism that is centered on using a priori category information for the fusion of multimodal information. Specifically, the model first designs a hybrid attention mechanism to fully explore the inner dependency of the visual modality while enriching with textual modalities to obtain a joint feature representation of multimodal information. Furthermore, the model matches fashion items as a distance measurement problem, and the joint feature representation of multimodal information realizes type-aware characteristics by the design of a distance measurement function with category-aware embedding. The experimental results obtained on the publicly available Polyvore and Polyvore-D datasets demonstrate the effectiveness of the model.
更多
查看译文
关键词
Deep correlation,Fashion compatibility,Hybrid attention mechanism,Multimodal fusion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要