Large-Scale Visual Font Recognition

CVPR(2014)

引用 55|浏览46
暂无评分
摘要
This paper addresses the large-scale visual font recogni- tion (VFR) problem, which aims at automatic identification of the typeface, weight, and slope of the text in an image or photo without any knowledge of content. Although vi- sual font recognition has many practical applications, it has largely been neglected by the vision community. To address the VFR problem, we construct a large-scale dataset con- taining 2, 420 font classes, which easily exceeds the scale of most image categorization datasets in computer vision. As font recognition is inherently dynamic and open-ended, i.e., new classes and data for existing categories are constantly added to the database over time, we propose a scalable so- lution based on the nearest class mean classifier (NCM). The core algorithm is built on local feature embedding, lo- cal feature metric learning and max-margin template se- lection, which is naturally amenable to NCM and thus to such open-ended classification problems. The new algo- rithm can generalize to new classes and new data at lit- tle added cost. Extensive experiments demonstrate that our approach is very effective on our synthetic test images, and achieves promising results on real world test images.
更多
查看译文
关键词
local feature embedding,local feature metric learning,font recognition,automatic identification,character recognition,fine-grained recognition, large-scale recognition, font recognition, character recognition,text detection,text weight identification,large scale visual font recognition,image classification,large-scale recognition,nearest class mean classifier,computer vision,core algorithm,fine-grained recognition,max-margin template selection,text typeface identification,text slope identification,image categorization,measurement,image recognition,visualization,vectors
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要