Measuring and Improving Consistency in Pretrained Language Models

Yanai Elazar, Nora Kassner, Shauli Ravfogel, Abhilasha Ravichander, Eduard Hovy, Hinrich Schütze, Yoav Goldberg

Transactions of the Association for Computational Linguistics (2021)

Abstract

Consistency of a model, that is, the invariance of its behavior under meaning-preserving alternations in its input, is a highly desirable property in natural language processing. In this paper we study the question: Are Pretrained Language Models (PLMs) consistent with respect to factual knowledge? To this end, we create PARAREL, a high-quality resource of cloze-style query English paraphrases. It contains a total of 328 paraphrases for 38 relations. Using PARAREL, we show that the consistency of all PLMs we experiment with is poor, though with high variance between relations. Our analysis of the representational spaces of PLMs suggests that they have a poor structure and are currently not suitable for representing knowledge robustly. Finally, we propose a method for improving model consistency and experimentally demonstrate its effectiveness.
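
To make the notion of consistency concrete, here is a minimal sketch of one plausible way to score it: given a model's predictions for several paraphrases of the same query, compute the fraction of paraphrase pairs on which the predictions agree. The function name `pairwise_consistency` and the sample predictions are hypothetical, chosen for illustration only; the abstract does not specify the exact metric, so this should not be read as the authors' definition.

```python
from itertools import combinations


def pairwise_consistency(predictions):
    """Return the fraction of unordered prediction pairs that are identical.

    `predictions` holds one predicted answer per paraphrase of the same
    underlying query. This is an illustrative measure, not necessarily the
    metric used in the paper.
    """
    pairs = list(combinations(predictions, 2))
    if not pairs:
        # With fewer than two paraphrases there is nothing to disagree on.
        return 1.0
    agreeing = sum(1 for a, b in pairs if a == b)
    return agreeing / len(pairs)


# Hypothetical predictions for four paraphrases of one cloze-style query.
predictions = ["London", "London", "Paris", "London"]
score = pairwise_consistency(predictions)
print(f"pairwise consistency: {score:.2f}")
```

A score of 1.0 means the model answered every paraphrase identically; lower values indicate that rewording the query changed the answer. Note that this measures agreement only, independent of whether the answers are factually correct.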

Keywords

language models, consistency
