Human annotation of ASR error regions: Is "gravity" a sharable concept for human annotators?

Daniel Luzzati, Cyril Grouin, Ioana Vasilescu, Martine Adda-Decker, Eric Bilinski, Nathalie Camelin, Juliette Kahn, Carole Lailler, Lori Lamel, Sophie Rosset

LREC 2014 - Ninth International Conference on Language Resources and Evaluation (2014)


Abstract

This paper is concerned with human assessments of the severity of errors in ASR outputs. We did not design any guidelines, so that each annotator involved in the study could judge the "seriousness" of an ASR error from their own scientific background. Eight human annotators took part in an annotation task on three distinct corpora; one of the corpora was annotated twice, without the annotators being told of the duplication. None of the computed results (inter-annotator agreement, edit distance, majority annotation) shows a strong correlation between the considered criteria and the level of seriousness, which underlines how difficult it is for a human to determine whether an ASR error is serious or not.


Keywords

Annotation, ASR Seriousness Errors, Speech Recognition
