Black Box Adversarial Prompting for Foundation Models

Natalie Maus, Patrick Chao, Eric T. Wong, Jacob R. Gardner

arXiv (Cornell University) (2023)

Abstract

Prompting interfaces allow users to quickly adjust the output of generative models in both vision and language. However, small changes and design choices in the prompt can lead to significant differences in the output. In this work, we develop a black-box framework for generating adversarial prompts for unstructured image and text generation. These prompts, which can be standalone or prepended to benign prompts, induce specific behaviors in the generative process, such as generating images of a particular object or generating high-perplexity text.

Keywords

black box adversarial prompting, foundation, models
