Entity Tracking in Real-Time Using Sub-topic Detection on Twitter.

ECIR 2014: Proceedings of the 36th European Conference on IR Research on Advances in Information Retrieval - Volume 8416(2014)

引用 2|浏览23
暂无评分
摘要
The velocity, volume and variety with which Twitter generates text is increasing exponentially. It is critical to determine latent sub-topics from such tweet data at any given point of time for providing better topic-wise search results relevant to users' informational needs. The two main challenges in mining sub-topics from tweets in real-time are 1 understanding the semantic and the conceptual representation of the tweets, and 2 the ability to determine when a new sub-topic or cluster appears in the tweet stream. We address these challenges by proposing two unsupervised clustering approaches. In the first approach, we generate a semantic space representation for each tweet by keyword expansion and keyphrase identification. In the second approach, we transform each tweet into a conceptual space that represents the latent concepts of the tweet. We empirically show that the proposed methods outperform the state-of-the-art methods.
更多
查看译文
关键词
Sub-Topic Detection, Clustering, Entity Tracking, Text Mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要