Your fairness may vary: Group fairness of pretrained language models in toxic text classification

Ioana Baldini, Dennis Wei, Karthikeyan Natesan Ramamurthy, Mikhail Yurochkin, Moninder Singh

arXiv (2021)

Abstract

We study the performance-fairness trade-off in more than a dozen fine-tuned LMs for toxic text classification. We empirically show that no blanket statement can be made with respect to the bias of large versus regular versus compressed models. Moreover, we find that focusing on fairness-agnostic performance metrics can lead to models with varied fairness characteristics.
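The last claim can be illustrated with a minimal sketch on toy data. The abstract does not say which metrics the paper uses, so everything here is an assumption for illustration: plain accuracy stands in for the fairness-agnostic metric, the gap in false positive rate between groups stands in for a group fairness measure, and the labels, predictions, and group names are made up.

```python
# Illustrative only: toy labels and predictions, not data from the paper.
# Assumption: accuracy stands in for a fairness-agnostic metric, and the
# false positive rate (FPR) gap between groups for a group fairness metric.

def accuracy(y_true, y_pred):
    """Fraction of examples classified correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def false_positive_rate(y_true, y_pred):
    """Share of non-toxic (label 0) examples predicted toxic (1)."""
    negatives = [p for t, p in zip(y_true, y_pred) if t == 0]
    if not negatives:
        return float("nan")
    return sum(negatives) / len(negatives)

def fpr_gap(y_true, y_pred, groups):
    """Largest difference in FPR between any two groups."""
    by_group = {}
    for t, p, g in zip(y_true, y_pred, groups):
        labels, preds = by_group.setdefault(g, ([], []))
        labels.append(t)
        preds.append(p)
    rates = {g: false_positive_rate(ts, ps) for g, (ts, ps) in by_group.items()}
    return max(rates.values()) - min(rates.values()), rates

# 1 = toxic, 0 = non-toxic; "a" and "b" are hypothetical identity groups.
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
groups = ["a", "a", "b", "b", "a", "a", "b", "b"]

# Two hypothetical fine-tuned models, each making two errors.
model_1 = [1, 0, 0, 1, 1, 1, 1, 1]  # one false positive in each group
model_2 = [1, 1, 0, 0, 1, 1, 1, 1]  # both false positives in group "a"

acc_1, acc_2 = accuracy(y_true, model_1), accuracy(y_true, model_2)
gap_1, rates_1 = fpr_gap(y_true, model_1, groups)
gap_2, rates_2 = fpr_gap(y_true, model_2, groups)

print(f"model_1: accuracy={acc_1:.2f}, FPR gap={gap_1:.2f}")
print(f"model_2: accuracy={acc_2:.2f}, FPR gap={gap_2:.2f}")
```

Both toy models reach the same accuracy, yet one spreads its false positives evenly across groups while the other concentrates them in a single group. Selecting on accuracy alone would not distinguish the two.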
