Statistical modality tagging from rule-based annotations and crowdsourcing

Vinodkumar Prabhakaran,Michael Bloodgood,Mona Diab,Bonnie Dorr,Lori Levin,Christine D. Piatko,Owen Rambow,Benjamin Van Durme

ExProM '12: Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics（2015）

引用 33|浏览89

暂无评分

摘要

We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance.

查看译文

关键词

training data,automatic modality tagger,high-recall simple rule-based modality,modality tagger,precise modality tagger,linguistic tagger,Mechanical Turk annotators,good performance,main hurdle,multi-class SVM,Statistical modality,rule-based annotation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要