Advancing Arabic Hate Speech Detection via Neural Transfer Learning with BERT

Ezzaldeen Mahyoub Naji, Ajit A Maslekar,Zeyad A. T. Ahmed, Alhasan Alharbi, Belal Al-sellami, Mohammed Tawfik

2023 3rd International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON)(2023)

引用 0|浏览0
暂无评分
摘要
Online hate speech poses grave societal dangers, necessitating automatic detection systems. However, limited efforts have focused on Arabic's complex dialects. This study investigates neural transfer learning for dialect Arabic hate speech detection. A public Levantine Twitter dataset with 5,846 expert annotated tweets is compiled spanning normal, offensive, and hate speech. Traditional machine learning models including SVM, logistic regression, and gradient boosting are benchmarked, achieving 80-83% accuracy. However, these models fail to capture nuanced contextual differences between offensive and hateful language. To address this, transfer learning is proposed using the pretrained Arabic BERT model ArabERT. ArabERT leverages BERT's bidirectional representations to model linguistic context. ArabERT is fine-tuned on the Levantine dataset for hate speech classification. Results show ArabERT significantly outperforms machine learning models, attaining 90% accuracy and 94% Fl-score specifically for hate speech detection. Detailed analysis demonstrates ArabERT's contextual modeling enables nuanced discernment between offensive and hateful tweets. The outcomes provide strong evidence that transfer learning approaches like ArabERT are crucial for handling informal multi-dialect Arabic. This work makes three key contributions - introducing an effective neural framework for Arabic hate speech detection, rigorous benchmarking, and providing insights into deep learning strategies. The findings showcase transfer learning's efficacy for low-resource Arabic NLP and pave promising directions for future progress.
更多
查看译文
关键词
Hate Speech,Arabic language,NLP,ArabERT,Deep Learning Transformer,Machine Learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要