Extracting Opinion Expressions with semi-Markov Conditional Random Fields

Empirical Methods in Natural Language Processing(2012)

引用 39|浏览82
暂无评分
摘要
Extracting opinion expressions from text is usually formulated as a token-level sequence labeling task tackled using Conditional Random Fields (CRFs). CRFs, however, do not readily model potentially useful segment-level information like syntactic constituent structure. Thus, we propose a semi-CRF-based approach to the task that can perform sequence labeling at the segment level. We extend the original semi-CRF model (Sarawagi and Cohen, 2004) to allow the modeling of arbitrarily long expressions while accounting for their likely syntactic structure when modeling segment boundaries. We evaluate performance on two opinion extraction tasks, and, in contrast to previous sequence labeling approaches to the task, explore the usefulness of segmentlevel syntactic parse features. Experimental results demonstrate that our approach outperforms state-of-the-art methods for both opinion expression tasks.
更多
查看译文
关键词
Extracting opinion expression,likely syntactic structure,opinion expression task,opinion extraction task,previous sequence,segmentlevel syntactic parse feature,syntactic constituent structure,token-level sequence,original semi-CRF model,segment boundary,semi-Markov conditional random field
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要