Deep Learning Ensembles for Hate Speech Detection

Safa Alsafari,Samira Sadaoui,Malek Mouhoub

2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI)（2020）

引用 9|浏览10

暂无评分

摘要

Our study explores offensive and hate speech detection for the Arabic language, as previous studies are minimal. Based on two-class, three-class, and six-class Arabic-Twitter datasets, we develop single and ensemble CNN and BiLSTM classifiers that we train with non-contextual (Fasttext-SkipGram) and contextual (Multilingual Bert and AraBert) word-embedding models. For each hate/offensive classification task, we conduct a battery of experiments to evaluate the performance of single and ensemble classifiers on testing datasets. The average-based ensemble approach was found to be the best performing, as it returned F-scores of 91%, 84%, and 80% for two-class, three-class and six-class prediction tasks, respectively. We also perform an error analysis of the best ensemble model for each task.

查看译文

关键词

Hate and Offensive Speech,Word Embedding,CNN,BiLSTM,Ensemble Models,Error Analysis

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要