Debiasing Multilingual Word Embeddings: A Case Study of Three Indian Languages

Hypertext and Hypermedia (2021)

Abstract
In this paper, we advance the current state-of-the-art method for debiasing monolingual word embeddings so that it generalizes well to a multilingual setting. We consider different methods to quantify bias and different debiasing approaches for monolingual as well as multilingual settings. We demonstrate the significance of our bias-mitigation approach on downstream NLP applications. Our proposed methods establish state-of-the-art performance for debiasing multilingual embeddings for three Indian languages (Hindi, Bengali, and Telugu) in addition to English. We believe that our work will open up new opportunities for building unbiased downstream NLP applications, which are inherently dependent on the quality of the word embeddings used.
Keywords
Debiasing multilingual embeddings, Gender debias, Debiasing Indian languages