Enhancing Neural Network Transparency Through Representation Analysis

Andy Zou, Long Phan, Sarah Li Chen, James Campbell, Phillip Huang Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Annah Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Troy Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J Zico Kolter, Dan Hendrycks

ICLR 2024 (2024)

Key words
transparency, interpretability, monitoring, alignment, ML safety