HateBERT: Retraining BERT for Abusive Language Detection in English
Abstract:
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we have collected and made available to the public. We present the results of a...More
Code:
Data:
Tags
Comments