JHU1: an unsupervised approach to person name disambiguation using web snippets

SemEval@ACL(2007)

引用 31|浏览39
暂无评分
摘要
This paper presents an approach to person name disambiguation using K-means clustering on rich-feature-enhanced document vectors, augmented with additional web-extracted snippets surrounding the polysemous names to facilitate term bridging. This yields a significant F-measure improvement on the shared task training data set. The paper also illustrates the significant divergence between the properties of the training and test data in this shared task, substantially skewing results. Our system optimized on F0.2 rather than F0.5 would have achieved top performance in the shared task.
更多
查看译文
关键词
rich-feature-enhanced document vector,polysemous name,significant f-measure improvement,shared task training data,web snippet,skewing result,shared task,test data,additional web-extracted snippet,unsupervised approach,person name disambiguation,significant divergence,k means
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要