Formal Concept Analysis for Evaluating Intrinsic Dimension of a Natural Language

Sergei O. Kuznetsov,Vasilii A. Gromov, Nikita S. Borodin, Andrei M. Divavin

PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023(2023)

引用 0|浏览9
暂无评分
摘要
Some results of a computational experiment for determining the intrinsic dimension of linguistic varieties for the Bengali and Russian languages are presented. At the same time, both sets of words and sets of bigrams in these languages were considered separately. The method used to solve this problem was based on formal concept analysis algorithms. It was found that the intrinsic dimensions of these languages are significantly less than the dimensions used in popular neural network models in natural language processing.
更多
查看译文
关键词
Intrinsic dimension,Formal concept analysis,Language manifold
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要