Image annotation of ancient chinese architecture based on visual attention mechanism and GCN

MULTIMEDIA TOOLS AND APPLICATIONS(2022)

引用 3|浏览9
暂无评分
摘要
Ancient Chinese architecture(ACA), especially like roof ridge decoration, vividly exhibits Chinese civilization and the typical, accurate image semantics can well reflect the historical style of ACA at that time. However, the current research on the 2D image annotation method of ACA lacks the annotation of historical and cultural information(such as dynasties, regions, e t c .). In addition, with the enrichment of ACA labels, the number of irrelevant labels will increase. To solve these problems, we propose an ACA image annotation method based on visual attention mechanism and graph convolutional network (GCN). Firstly, according to the uniqueness of the roof ridge decoration of ACA, we introduce the visual attention mechanism into the convolution neural network (CNN) to focus on the roof ridge decoration area and the corresponding image features are extracted. Secondly, to avoid the output of irrelevant labels, we construct a correlation matrix in the GCN to transfer the correlation between the labels of ACA and then obtain the label-related classifier. Finally, the classifier is applied to the extracted image features for multi-label loss training. Experiments on six types of ACA datasets demonstrate that the proposed method can effectively improve the annotation accuracy and enrich the semantic information of ACA.
更多
查看译文
关键词
Ancient Chinese architecture,Image annotation,Roof ridge decoration,Visual attention mechanism,Convolution neural network,Graph convolution network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要