Differentiating Between Oriental And European Scripts By Statistical Features

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (1998)

Abstract
Two types of techniques are usually adopted in language differentiation: token matching and statistical analysis. In this paper we present a method that uses a combined analysis of several discriminating statistical features to differentiate between European and oriental language scripts. Applied to more than 23 languages, it has proved effective in differentiating between documents printed in these scripts.
Keywords
script classification, language differentiation, oriental languages, Asian scripts, Chinese characters, Roman scripts