Separating Optical and Language Models Through Encoder-Decoder Strategy for Transferable Handwriting Recognition

Adeline Granet, Emmanuel Morin, Harold Mouchère, Solen Quiniou, Christian Viard-Gaudin

2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2018

Abstract

Lack of data can be an issue when beginning a new study on historical handwritten documents. To deal with this, we propose a deep-learning-based recognizer that separates the optical and language models so that they can be trained separately using different resources. In this work, we present the optical encoder part of a multilingual transductive transfer learning approach applied to historical handwriting recognition. The optical encoder transforms the input word image into a non-latent space that depends only on letter n-grams, which makes the encoder independent of the language. This transformation avoids embedding a language model in the encoder and allows transfer learning across languages that use the same alphabet. The language decoder generates a word, as a sequence of characters, from a vector of letter n-grams. Experiments show that separating the optical and language models can be a solution for multilingual transfer learning.
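
As an illustration of the letter-n-gram representation mentioned in the abstract, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not specify the n-gram orders, the alphabet, or whether the vector holds presence flags or counts, so the choices below (unigrams and bigrams, a binary presence vector, and the helper names `letter_ngrams`, `build_vocabulary`, and `encode`) are assumptions made only for this example.

```python
from itertools import product


def letter_ngrams(word, max_n=2):
    """Return the set of letter n-grams (n = 1..max_n) that occur in `word`."""
    return {
        word[i:i + n]
        for n in range(1, max_n + 1)
        for i in range(len(word) - n + 1)
    }


def build_vocabulary(alphabet, max_n=2):
    """Map every possible n-gram over `alphabet` to a fixed vector index.

    Assumption: the vocabulary is the full set of n-grams over the alphabet,
    so it depends on the alphabet only, not on any particular language.
    """
    grams = [
        "".join(chars)
        for n in range(1, max_n + 1)
        for chars in product(alphabet, repeat=n)
    ]
    return {gram: index for index, gram in enumerate(grams)}


def encode(word, vocab, max_n=2):
    """Encode `word` as a binary presence vector over the n-gram vocabulary."""
    vector = [0] * len(vocab)
    for gram in letter_ngrams(word, max_n):
        if gram in vocab:  # n-grams outside the alphabet are ignored
            vector[vocab[gram]] = 1
    return vector


if __name__ == "__main__":
    vocab = build_vocabulary("abc")
    print(encode("cab", vocab))
```

Under these assumptions, two languages written with the same alphabet share the same vector space, which is the property the abstract relies on for transfer across languages. In the proposed system, a vector of this kind would be the target of the optical encoder and the input of the language decoder.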

Keywords

Handwriting recognition, knowledge transfer, optical model, language model
