Shared representations of human actions across vision and language

Diana C. Dima, Sugitha Janarthanan,Jody C. Culham,Yalda Mohsenzadeh

biorxiv（2024）

引用 0|浏览2

暂无评分

摘要

Humans can recognize and communicate about many actions performed by others. How are actions organized in the mind, and is this organization shared across vision and language? We collected similarity judgments of human actions depicted through naturalistic videos and sentences, and tested four models of action categorization, defining actions at different levels of abstraction ranging from specific (action verb) to broad (action target: whether an action is directed towards an object, another person, or the self). The similarity judgments reflected a shared organization of action representations across videos and sentences, determined mainly by the target of actions, even after accounting for other semantic features. Language model embeddings predicted the behavioral similarity of action videos and sentences, and captured information about the target of actions alongside unique semantic information. Together, our results show how action concepts are organized in the human mind and in large language model representations. ### Competing Interest Statement The authors have declared no competing interest. The data and results have been archived as an Open Science Framework repository ().

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要