Generating word clusters by graph clustering based on Hearst patterns

Gaurav Saxena, Manraj Singh Grover, Shampa Chakervarty

2016 1st India International Conference on Information Processing (IICIP) (2016)

Abstract
Clustering similar words is crucial for a broad range of applications, such as text classification and word sense disambiguation. Several approaches for deriving word similarity have been proposed. Some, like latent semantic analysis, are derived from the distributional hypothesis. Others extract relationships between terms by drawing upon predefined linguistic patterns. In this work, we propose an approach that combines the essence of both. In the first phase, our algorithm generates a graphical model of terms and their interrelations with the help of special lexico-syntactic patterns called Hearst patterns. We then apply a graph clustering technique to find semantically related words.
Keywords
Hearst Patterns, Neural Probabilistic Language Model, Chinese Whispers Algorithm
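
To make the two phases in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes a single simplified Hearst pattern ("X such as A, B and C") over single-word terms, an undirected graph whose edge weights count how often a pair was extracted, and Chinese Whispers (named in the keywords) as the graph clustering step. The function names `extract_hearst_pairs`, `build_graph`, and `chinese_whispers`, along with the `iterations` and `seed` parameters, are hypothetical and chosen only for illustration.

```python
import random
import re
from collections import defaultdict

# Assumption: one simplified Hearst pattern, "<hypernym> such as <a>, <b> and <c>".
# Terms are single words here; a real system would match noun phrases.
SEP = r",? (?:and|or) |, "
SUCH_AS = re.compile(rf"(\w+),? such as ((?:\w+(?:{SEP}))*\w+)")


def extract_hearst_pairs(text):
    """Return (hypernym, hyponym) pairs matched by the 'such as' pattern."""
    pairs = []
    for match in SUCH_AS.finditer(text.lower()):
        hypernym = match.group(1)
        for hyponym in re.split(SEP, match.group(2)):
            pairs.append((hypernym, hyponym))
    return pairs


def build_graph(pairs):
    """Undirected weighted graph; weight = number of times a pair was extracted."""
    graph = defaultdict(lambda: defaultdict(int))
    for a, b in pairs:
        if a != b:
            graph[a][b] += 1
            graph[b][a] += 1
    return graph


def chinese_whispers(graph, iterations=20, seed=0):
    """Label propagation: each node adopts the heaviest label among its neighbours."""
    rng = random.Random(seed)
    labels = {node: node for node in graph}  # every node starts in its own class
    nodes = sorted(graph)
    for _ in range(iterations):
        rng.shuffle(nodes)  # nodes are visited in random order each round
        for node in nodes:
            scores = defaultdict(int)
            for neighbour, weight in graph[node].items():
                scores[labels[neighbour]] += weight
            if scores:
                labels[node] = max(scores, key=scores.get)
    return labels


text = ("Fruits such as apples, oranges and bananas are sweet. "
        "Metals such as iron and copper conduct heat.")
pairs = extract_hearst_pairs(text)
labels = chinese_whispers(build_graph(pairs))
```

In this toy input the two pattern matches produce two disconnected components, so a fruit term and a metal term can never end up with the same label, because labels only travel along edges. On a real corpus, shared terms would connect the components and the edge weights would decide where cluster boundaries fall.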