RRGparbank: A Parallel Role and Reference Grammar Treebank.

International Conference on Language Resources and Evaluation (LREC)(2022)

引用 0|浏览3
暂无评分
摘要
This paper describes the first release of RRGparbank, a multilingual parallel treebank for Role and Reference Grammar (RRG) that contains annotations of George Orwell's novel 1984 and its translations. The release comprises the entire novel for English and a constructionally diverse, parallel "seed" sample for German, French, Russian, and Farsi. The paper gives an overview of the annotation decisions taken and describes the adopted treebanking methodology. As a possible application, a multilingual parser is trained on the treebank data. RRGparbank is one of the first resources for which RRG has been applied to large amounts of real-world data. It enables comparative and typological corpus studies in RRG and creates new possibilities of data-driven NLP applications based on RRG.
更多
查看译文
关键词
Syntax, Treebank, Parallel Corpus, Role and Reference Grammar, English, German, French, Russian, Farsi
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要