Word Embedding Based Event Detection On Social Media

Hybrid Artificial Intelligent Systems, HAIS 2017 (2017)

Abstract
Event detection from social media messages is conventionally based on clustering message contents. The most basic approach represents each message as a term vector constructed through traditional natural language processing (NLP) methods, with term weights generally assigned by frequency. In this study, we use a neural feature extraction approach and explore the performance of event detection with word embeddings. Using a corpus of tweets, message terms are embedded in a continuous space. Messages, represented as vectors of word embeddings, are then grouped using hierarchical clustering. The technique is applied to a set of Twitter messages posted in Turkish. Experimental results show that automatically extracted features detect the contextual similarities between tweets better than traditional feature extraction with term frequency-inverse document frequency (TF-IDF) based term vectors.
Keywords
Event detection, Neural feature extraction, Word embedding, Neural probabilistic language models
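
The pipeline described in the abstract (embed message terms, represent each message as a vector, group messages by hierarchical clustering) can be illustrated with a minimal sketch. The abstract does not state how word vectors are combined into a message vector, which linkage criterion is used, or where the dendrogram is cut, so the choices below (averaging word vectors, average linkage over cosine distance, a fixed distance threshold) are assumptions. The embedding table is a hand-made toy, not learned from any corpus, and all function names are hypothetical.

```python
import math
from itertools import combinations

# Hypothetical 3-dimensional embeddings, hand-made for illustration only.
# In practice these would be learned from a tweet corpus.
TOY_EMBEDDINGS = {
    "earthquake": [0.90, 0.10, 0.00],
    "quake":      [0.85, 0.15, 0.05],
    "tremor":     [0.80, 0.20, 0.00],
    "match":      [0.00, 0.90, 0.10],
    "goal":       [0.10, 0.85, 0.05],
    "stadium":    [0.05, 0.80, 0.20],
}


def message_vector(tokens, embeddings):
    """Average the embeddings of known tokens (assumed aggregation).

    Returns None when no token has an embedding.
    """
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine_distance(a, b):
    """Return 1 - cosine similarity; zero vectors are treated as maximally distant."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def agglomerative_clusters(vectors, threshold):
    """Average-linkage agglomerative clustering over cosine distance.

    Starts with one cluster per message and repeatedly merges the closest
    pair, stopping once the closest pair is farther apart than `threshold`.
    Returns clusters as lists of message indices.
    """
    clusters = [[i] for i in range(len(vectors))]

    def linkage(c1, c2):
        total = sum(cosine_distance(vectors[i], vectors[j])
                    for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        a, b = min(combinations(range(len(clusters)), 2),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        if linkage(clusters[a], clusters[b]) > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # safe: combinations guarantees a < b
    return clusters


# Four tokenized toy messages: two about an earthquake, two about a football match.
messages = [
    ["earthquake", "tremor"],
    ["quake", "earthquake"],
    ["match", "goal"],
    ["stadium", "goal", "match"],
]
vectors = [message_vector(m, TOY_EMBEDDINGS) for m in messages]
clusters = agglomerative_clusters(vectors, threshold=0.3)
print(clusters)
```

With these toy vectors the two earthquake messages and the two football messages end up in separate clusters, even though messages within a group share few exact terms. This is the kind of contextual similarity that a purely term-matching representation such as TF-IDF would tend to miss.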