Annotation Artifacts in Natural Language Inference Data
North American Chapter of the Association for Computational Linguistics, 2018.
Large-scale datasets for natural language inference are created by presenting crowd workers with a sentence (the premise) and asking them to generate three new sentences (hypotheses): one that the premise entails, one that it contradicts, and one that is logically neutral with respect to it. We show that, in a significant portion of such data, this protocol leaves clues that make...