The Thieves on Sesame Street are Polyglots Extracting Multilingual Models from Monolingual APIs
EMNLP 2020, pp. 6203-6207, 2020.
Pre-training in natural language processing makes it easier for an adversary with only query access to a victim model to reconstruct a local copy of the victim by training with gibberish input data paired with the victim’s labels for that data. We discover that this extraction process extends to local copies initialized from a pre-trained...More
Full Text (Upload PDF)
PPT (Upload PPT)