MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset
Abstract:
We present MLQE-PE, a new dataset for Machine Translation (MT) Quality Estimation (QE) and Automatic Post-Editing (APE). The dataset contains seven language pairs, with human labels for 9,000 translations per language pair in the following formats: sentence-level direct assessments and post-editing effort, and word-level good/bad labels...More
Code:
Data:
Full Text
Tags
Comments