Sentiment Analysis For Brazilian Portuguese Over A Skewed Class Corpora

COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE (PROPOR 2016)(2016)

引用 3|浏览20
暂无评分
摘要
The goal of this paper is to compare existing sentiment analysis models, namely Doc2Vec and Recursive Neural Tensor Network, when applied to a skewed class corpus. Such setting is not uncommon, but the literature lacks results on it. We used two techniques to create more balance between classes: under-sampling and over-sampling the target corpora. Doc2Vec achieved the best result overall on the skewed classes, but performed poorly over small and sampled configurations. RNTN achieved the best result on the over-sampled corpus. The Naive Bayes baseline was not surpassed in the under-sampled corpus with Pos/Neg classes, which was the smallest corpus configuration.
更多
查看译文
关键词
Skewed Class, Sentiment Analysis (SA), Brazilian Portuguese Corpus, Recursive Neural Tensor Network (RNTN), Corporate Configurations
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要