Human-Model Divergence In The Handling Of Vagueness

E Stengel-Eskin, J Guallar-Blasco,B Van Durme

UNIMPLICIT 2021: THE FIRST WORKSHOP ON UNDERSTANDING IMPLICIT AND UNDERSPECIFIED LANGUAGE(2021)

引用 0|浏览37
暂无评分
摘要
While aggregate performance metrics can generate valuable insights at a large scale, their dominance means more complex and nuanced language phenomena, such as vagueness, may be overlooked. Focusing on vague terms (e.g. sunny, cloudy, young, etc.) we inspect the behavior of visually grounded and text-only models, finding systematic divergences from human judgments even when a model's overall performance is high. To help explain this disparity, we identify two assumptions made by the datasets and models examined and, guided by the philosophy of vagueness, isolate cases where they do not hold.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要