Statistical modality tagging from rule-based annotations and crowdsourcing

ExProM '12: Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics(2015)

引用 33|浏览89
暂无评分
摘要
We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance.
更多
查看译文
关键词
training data,automatic modality tagger,high-recall simple rule-based modality,modality tagger,precise modality tagger,linguistic tagger,Mechanical Turk annotators,good performance,main hurdle,multi-class SVM,Statistical modality,rule-based annotation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要