Comparing the Use of Edited and Unedited Text in Parser Self-Training.

IWPT '11: Proceedings of the 12th International Conference on Parsing Technologies(2011)

引用 4|浏览28
暂无评分
摘要
We compare the use of edited text in the form of newswire and unedited text in the form of discussion forum posts as sources for training material in a self-training experiment involving the Brown reranking parser and a test set of sentences from an online sports discussion forum. We find that grammars induced from the two automatically parsed corpora achieve similar Parseval f-scores, with the grammars induced from the discussion forum material being slightly superior. An error analysis reveals that the two types of grammars do behave differently.
更多
查看译文
关键词
discussion forum material,discussion forum post,online sports discussion forum,edited text,training material,unedited text,Brown reranking parser,error analysis,self-training experiment,similar Parseval f-scores,parser self-training
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要