Efficient Media Retrieval from Non-Cooperative Queries.

Kevin Shih,Wei Di,Vignesh Jagadeesh,Robinson Piramuthu

ICVS 2015 Proceedings of the 10th International Conference on Computer Vision Systems - Volume 9163（2015）

引用 1|浏览66

暂无评分

摘要

Text is ubiquitous in the artificial world and easily attainable when it comes to book title and author names. Using the images from the book cover set from the Stanford Mobile Visual Search dataset and additional book covers and metadata from openlibrary.org, we construct a large scale book cover retrieval dataset, complete with 100ï¾¿K distractor covers and title and author strings for each. Because our query images are poorly conditioned for clean text extraction, we propose a method for extracting a matching noisy and erroneous OCR readings and matching it against clean author and book title strings in a standard document look-up problem setup. Finally, we demonstrate how to use this text-matching as a feature in conjunction with popular retrieval features such as VLAD using a simple learning setupï¾¿to achieve significant improvements in retrieval accuracy over that of either VLAD or the text alone.

查看译文

关键词

Large scale, Media retrieval, Text

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要