Show Your Work: Improved Reporting of Experimental Results

Suchin Gururangan
Dallas Card

EMNLP/IJCNLP (1), pp. 2185-2194, 2019.

Abstract:

Research in natural language processing proceeds, in part, by demonstrating that new models achieve superior performance (e.g., accuracy) on held-out test data, compared to previous results. In this paper, we demonstrate that test-set performance scores alone are insufficient for drawing accurate conclusions about which model performs b...
