SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore
arxiv(2024)
摘要
To address the limitations of current hate speech detection models, we
introduce , a novel framework designed for the linguistic
and cultural context of Singapore and Southeast Asia. It extends the functional
testing approach of HateCheck and MHC, employing large language models for
translation and paraphrasing into Singapore's main languages, and refining
these with native annotators. reveals critical flaws in
state-of-the-art models, highlighting their inadequacy in sensitive content
moderation. This work aims to foster the development of more effective hate
speech detection tools for diverse linguistic environments, particularly for
Singapore and Southeast Asia contexts.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要