Finding Key Terms Representing Events from Thai Twitter

Advances in Intelligent Systems and Computing: Advances in Natural Language Processing, Intelligent Informatics and Smart Technology (2018)

Abstract
In the era of fast, big data, we all want to grasp the trend or big picture of a story instantly. This work seeks an automatic approach for extracting good-enough key terms for each event that appears in the Thai Twitter community. The core idea is to reduce the time humans spend on key term extraction while keeping the quality of the selected key terms acceptable to human readers and better than that of our previous implementation. The approaches we study focus on the Thai language and cover preprocessing, feature selection, and weighting schemes, applied to three real Thai tweet events with different characteristics. Our experiments comprise four main approaches and a number of hypotheses. Our findings confirm the usefulness of hashtag terms that are five or more characters long, the benefit of bigrams that include stop words, and the importance of event characteristics. Indeed, we conclude that different approaches should be used for different types of events. Performance and rationale are evaluated by statistical analysis, evaluator voting, and F-score measurement, and the results are confirmed to be twice as good as those of previous work.
Keywords
Feature selection, Thai language, Twitter, Hashtag, Bigram, Unigram, Stop words
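
The feature choices named in the abstract (hashtags of five or more characters, bigrams that keep stop words, F-score evaluation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the assumption that tweets are already segmented into tokens, the choice to measure hashtag length without the leading `#`, and the simple frequency count used in place of the paper's weighting schemes are all hypothetical.

```python
from collections import Counter


def candidate_terms(tokenized_tweets, min_hashtag_len=5):
    """Count candidate key terms across tweets of one event.

    Assumes each tweet is already a list of tokens (Thai word
    segmentation is out of scope here). Hashtag length is measured
    in code points, excluding the leading '#'; both are assumptions.
    """
    counts = Counter()
    for tokens in tokenized_tweets:
        # Keep only hashtags whose body is long enough.
        for tok in tokens:
            if tok.startswith("#") and len(tok) - 1 >= min_hashtag_len:
                counts[tok] += 1
        # Build bigrams over the non-hashtag words WITHOUT removing
        # stop words, so pairs such as ("the", "flood") survive.
        words = [t for t in tokens if not t.startswith("#")]
        for first, second in zip(words, words[1:]):
            counts[(first, second)] += 1
    return counts


def f_score(selected, reference):
    """Balanced F-score of selected terms against a reference set."""
    selected, reference = set(selected), set(reference)
    true_positives = len(selected & reference)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(selected)
    recall = true_positives / len(reference)
    return 2 * precision * recall / (precision + recall)
```

In this sketch the highest-count entries of `candidate_terms(...)` would be the proposed key terms for an event, and `f_score` compares them with terms chosen by human evaluators.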