Term graph model for text classification

ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS(2005)

引用 68|浏览0
暂无评分
摘要
Most existing text classification methods (and text mining methods at large) are based on representing the documents using the traditional vector space model. We argue that important information, such as the relationship among words, is lost. We propose a term graph model to represent not only the content of a document but also the relationship among the keywords. We demonstrate that the new model enables us to define new similarity functions, such as considering rank correlation based on PageRank-style algorithms, for the classification purpose. Our preliminary results show promising results of our new model.
更多
查看译文
关键词
existing text classification method,traditional vector space model,preliminary result,new model,classification purpose,text mining method,important information,term graph model,pagerank-style algorithm,new similarity function,vector space model,rank correlation,text mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要