Corpus Manager: A Tool for Multilingual Corpus Analysis

msra

Abstract
The vast amount of electronic text on the web makes it possible to create megacorpora in a very short time. However, as corpus size increases, the complexity of managing the corpus becomes a serious problem that limits its functionality. Furthermore, a great deal of corpus linguistics research is based on the quantitative comparison of small corpus samples, which are drawn from a larger general-language corpus according to specific criteria such as authorship, topic, genre, register, etc. (Biber 1993). For these reasons, a number of tools have already been developed to organize and handle texts in corpora (e.g. Christ 1994, Holmes-Higgin et al. 1994). However, most of these systems have a steep learning curve and offer limited flexibility regarding the metadata that can be used as subcorpus selection criteria.