谷歌浏览器插件
订阅小程序
在清言上使用

Transformer Region Proposal for Object Detection

ICCSIP(2020)

引用 0|浏览1
暂无评分
摘要
We present a method for detecting objects in images using Transformer and Region Proposal. Our method discards the traditional convolution neural network, obtain the positional information and category of the detection object by finding the correspondence between the images through the self-attention mechanism. We extract the region proposal by setting anchors, and combine the positional encoding obtained by the image auto-encoder. Then input the features into the transformer encoder and decoder network to obtain the corresponding information of the target object, and obtain the positional information and category of target object. The experimental results on the public dataset proved, our method’s detection speed can reach 26 frames per second, which can meet the needs of real-time detection. In terms of detection accuracy, our method’s average precision can reach the 31.4, and under the large object can reach 55.2, which can reach the detection effect of the current mainstream convolutional neural network detection algorithm.
更多
查看译文
关键词
Object detection,Region proposal,Transformer,Real-time
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要