Chrome Extension
WeChat Mini Program
Use on ChatGLM

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Maksym Andriushchenko, Alexandra Souly, Mateusz Dziemian, Derek Duenas, Maxwell Lin, Justin Wang,Dan Hendrycks,Andy Zou,Zico Kolter,Matt Fredrikson, Eric Winsor, Jerome Wynne, Yarin Gal, Xander Davies

CoRR(2024)

Cited 0|Views7
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined