BiCapsHate: Attention to the Linguistic Context of Hate via Bidirectional Capsules and Hatebase.

Ashraf Kamal, Tarique Anwar, Vineet Kumar Sejwal, Mohd Fazil

IEEE Trans. Comput. Soc. Syst. (2024)

Abstract
Online social media (OSM) communications sometimes turn into hate-filled and offensive comments or arguments. This not only disrupts the social fabric online but, in the worst cases, also leads to hate, violence, and crime in the physical world. The existing content moderation practices of OSM platforms often fail to control online hate. In this article, we develop a deep learning model called BiCapsHate to detect hate speech (HS) in OSM posts. The model consists of five neural network layers. An input layer processes the input text, and an embedding layer maps the text to a numeric representation. A BiCaps layer then learns the sequential and linguistic contextual representations, a dense layer prepares the model for final classification, and the output layer produces the resulting class as either HS or non-HS (NHS). The BiCaps layer, the most important component, uses capsule networks to learn contextual information with respect to different orientations in both the forward and backward directions of the input text. It is further aided by a rich set of hand-crafted shallow and deep auxiliary features, including the Hatebase lexicon, which makes the model well-informed. We conduct extensive experiments on five benchmark datasets to demonstrate the efficacy of the proposed BiCapsHate model. Overall, BiCapsHate outperforms existing state-of-the-art methods, including fBERT, HateBERT, and ToxicBERT. It achieves F-scores of up to 94% on balanced datasets and up to 92% on imbalanced datasets. Our complete source code is publicly available in the GitHub repository at https://github.com/Ashraf-Kamal/BiCapsHate.
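The abstract reports F-scores separately for balanced and imbalanced datasets. As a minimal sketch of why that distinction matters, the snippet below computes the F-score as the harmonic mean of precision and recall for the hate class and compares it with plain accuracy. It assumes a binary F1 on the positive class (the abstract does not say which F-score variant or averaging scheme is used), and the confusion-matrix counts are hypothetical, not taken from the paper.

```python
def f_score(tp: int, fp: int, fn: int) -> float:
    """F1 for the positive (hate) class: harmonic mean of precision and recall."""
    if tp == 0:
        return 0.0  # no true positives: treat both precision and recall as 0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Fraction of all posts that were classified correctly."""
    return (tp + tn) / (tp + fp + fn + tn)


# Hypothetical imbalanced test set: 100 hate posts and 900 non-hate posts.
# The classifier finds 50 of the hate posts and raises 25 false alarms.
tp, fp, fn, tn = 50, 25, 50, 875

print(f"accuracy = {accuracy(tp, fp, fn, tn):.3f}")
print(f"f-score  = {f_score(tp, fp, fn):.3f}")
```

In this made-up case, accuracy is dominated by the large non-hate class, while the F-score reflects that half of the hate posts were missed, which is one reason an F-score is the more informative number on imbalanced data.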
Keywords
Capsule networks, hate speech (HS) detection, Hatebase lexicon, long short-term memory (LSTM) networks, online social media (OSM)