Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets
North American Chapter of the Association for Computational Linguistics, pp. 2171-2179, 2019.
Several datasets have recently been constructed to expose brittleness in models trained on existing benchmarks. While model performance on these challenge datasets is significantly lower compared to the original benchmark, it is unclear what particular weaknesses they reveal. For example, a challenge dataset may be difficult because it ta...More
PPT (Upload PPT)