Identification of Occupation Mentions in Clinical Narratives.

Lecture Notes in Computer Science(2016)

引用 8|浏览220
暂无评分
摘要
A patient's occupation is an important variable used for disease surveillance and modeling, but such information is often only available in free-text clinical narratives. We have developed a large occupation dictionary that is used as part of both knowledge- (dictionary and rules) and data-driven (machine-learning) methods for the identification of occupation mentions. We have evaluated the approaches on both public and non-public clinical datasets. A machine-learning method using linear chain conditional random fields trained on minimalistic set of features achieved up to 88 % F-1-measure (token-level), with the occupation feature derived from the knowledge- driven method showing a notable positive impact across the datasets (up to additional 32 % F-1-measure).
更多
查看译文
关键词
Information extraction,Natural language processing,Named entity recognition,Lexical resource,Occupation,Profession
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要