谷歌浏览器插件
订阅小程序
在清言上使用

A Language and Its Dimensions: Intrinsic Dimensions of Language Fractal Structures.

Vasilii A. Gromov, Nikita S. Borodin, Asel S. Yerbolova

COMPLEXITY(2024)

引用 0|浏览4
暂无评分
摘要
The present paper introduces a novel object of study - a language fractal structure. We hypothesize that a set of embeddings of all n-grams of a natural language constitutes a representative sample of this fractal set. (We use the term Hailonakea to refer to the sum total of all language fractal structures, over all n). The paper estimates intrinsic (genuine) dimensions of language fractal structures for the Russian and English languages. To this end, we employ methods based on (1) topological data analysis and (2) a minimum spanning tree of a data graph for a cloud of points considered (Steele theorem). For both languages, for all n, the intrinsic dimensions appear to be non-integer values (typical for fractal sets), close to 9 for both of the Russian and English language.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要