谷歌浏览器插件
订阅小程序
在清言上使用

Gender-based Cyberbullying Detection for Under-resourced Bangla Language

Hasnain Karim Rabib, Mostafa Galib, Takia Mosharref Nobo, Tanjila Alam Sathi,Mohammed Saidul Islam,Abu Raihan Mostofa Kamal,Md Azam Hossain

2022 12th International Conference on Electrical and Computer Engineering (ICECE)(2022)

引用 0|浏览13
暂无评分
摘要
The present study explores the detection of gender discrimination based cyberbullying in under-resourced Bangla language. While being spoken by 230 million people globally and being rich in diversity, the Bengali language lacks computational models and annotated resources for cyberbullying detection. To address this, we created GenDisc, a corpus of Bangla Facebook comments featuring gender-based cyberbullying. This study also presents a framework for cyberbullying detection. In our proposed approach, we used four different models to train a gender-based discriminatory text classifier, followed by an ensembling technique on those four models. Then we compared the individual prediction accuracies with the ensembled prediction accuracy. While training the dataset, we followed the stratified k-fold cross validation technique. We demonstrated that integrating gender-based discrimination variables improve a classifier’s capacity to discriminate against cyberbullying. Our evaluations yielded an accuracy of 68% in gender-based speech detection during cross-validation tests.
更多
查看译文
关键词
Cyberbullying,Gender discrimination,Transformer,BERT,Ensemble
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要