谷歌浏览器插件
订阅小程序
在清言上使用

New Estonian Words and Senses: Detection and Description

Margit Langemets,Jelena Kallas, Kaisa Norak, Indrek Hein

Dictionaries: journal of the Dictionary Society of North America(2020)

引用 1|浏览2
暂无评分
摘要
The Web era has intensified the need for the automatic monitoring of language, including the extraction of new words and senses. In this paper, we first give a brief overview of the unified dictionary system Ekilex, the starting point for all new lexicographic tasks at the Institute of the Estonian Language since 2019. We describe the existing databases meant for manual collecting and registering new words and meanings. Next we describe an experimental study on semi-automatic new word detection on the basis of the small media corpus and existing dictionaries carried out in 2018 at the Institute of the Estonian Language. The goal of the experiment was to develop a workflow for new word detection, to test the reliability of the tools for Estonian language processing, and to compile the new word candidate list. The experiment was focused on single word detection. The results revealed that in order to make new word discovery more effective we need more advanced tools for automatic language processing, and we perceive an urgent need to set up an infrastructure for (semi-) automatic new word detection.
更多
查看译文
关键词
estonian words,senses,detection,description
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要