Semantic Similarity-based Visual Reasoning without Language Information.

ChangSu Choi, HyeonSeok Lim, Hayoung Jang, Juhan Park,Eunkyung Kim,KyungTae Lim

ICAIIC(2023)

引用 0|浏览1
暂无评分
摘要
In this research, we propose new training data for the visual reasoning task based on semantic similarity and proposed a deep learning model that utilizes the data. The first contribution of this study is the construction of training data. Based on a total of 40 object attributes, we created a visual inference problem using only image data. As a result, a total of 6,000 datasets were built to create training and test data. We also propose a visual inference model as the second contribution of this work. The inference model shown in this study was evaluated for two tasks using ResNet50 and Vision Transformer, respectively. Based on the experimental evaluation results, we investigated the suitable pre-trained model for both single-choice binary reasoning and multiple-selection reasoning, respectively.
更多
查看译文
关键词
Visual Reasoning,Inference,Image similarity,Deep Learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要