A Token Level Multi-target Stance Detection Dataset

2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC) (2020)

Abstract
More and more people express their opinions on social media platforms such as Twitter, Sina Weibo, and Facebook, which makes high-performance stance detection a meaningful and important application. Improving the accuracy of a stance detection model relies heavily on the quality of the training dataset. Traditional manually annotated social media text datasets are mostly tagged at the sentence level, which limits fine-grained analysis and the generalization ability of stance detection algorithms. We therefore propose a token-level stance detection dataset of 2025 labeled tweets. The multiple stance targets in each tweet are tagged at the token level, so the stance target is represented by token-level annotations rather than by a list of hashtags. Experiments show that our dataset can improve the classification accuracy of stance detection algorithms.
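To make the idea of token-level target annotation concrete, the following is a minimal sketch only. The abstract does not specify the dataset's file format or label set, so the `TargetSpan` class, the `to_token_tags` helper, the BIO-style tag scheme, the `FAVOR`/`AGAINST` labels, and the example tweet are all assumptions made up for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TargetSpan:
    """A hypothetical stance target marked directly on the tweet's tokens."""
    start: int   # index of the first token in the target (inclusive)
    end: int     # index one past the last token in the target (exclusive)
    stance: str  # assumed label set, e.g. "FAVOR" or "AGAINST"


def to_token_tags(tokens: List[str], spans: List[TargetSpan]) -> List[str]:
    """Convert target spans into one BIO-style tag per token.

    Tokens outside every target get "O"; the first token of a target
    gets "B-<stance>" and the remaining tokens get "I-<stance>".
    """
    tags = ["O"] * len(tokens)
    for span in spans:
        for i in range(span.start, span.end):
            prefix = "B" if i == span.start else "I"
            tags[i] = f"{prefix}-{span.stance}"
    return tags


# Invented example tweet with two targets carrying different stances.
tokens = ["I", "love", "solar", "power", "but", "coal", "must", "go"]
spans = [TargetSpan(2, 4, "FAVOR"), TargetSpan(5, 6, "AGAINST")]
tags = to_token_tags(tokens, spans)

for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Under these assumptions, a single tweet can carry several targets with different stances, which a sentence-level label or a list of hashtags cannot express.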
Keywords
stance detection, natural language processing, machine learning, social network