RECOGNITION OF TEXT IN 3-D SCENES

Gregory K Myers,Robert C Bolles,Quangtuan Luong,James A Herson

msra（2001）

引用 38|浏览37

暂无评分

摘要

Video is an increasingly important source of information to the intelligence analyst. Recognizing text that appears in real-world scenery is potentially useful for characterizing the contents of video imagery. Previous research in text recognition for both printed documents and other sources of imagery has generally assumed that the text lies in a plane that is oriented roughly perpendicular to the optical axis of the camera. However, text such as street signs, name plates, and billboards appearing in captured video imagery often lies in a plane that is oriented at an oblique angle. SRI International (SRI) is developing an approach that takes advantage of 3-D scene geometry to detect the orientation of the plane on which text is printed. The text recognition process will then be able to transform the video image of the text to a normalized coordinate system before performing OCR, yielding more robust recognition performance. Our approach applies full-perspective projections and image-to-image homographies that capture the appearance of a plane viewed through perspective optics. We describe our approach and present some preliminary results. PROBLEM STATEMENT

查看译文

关键词

coordinate system,perspective projection

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要