Computer-assisted transcription of speech based on confusion network reordering

Acoustics, Speech and Signal Processing (2011)

Abstract
Large vocabulary automatic speech recognition (ASR) technologies perform well in known and controlled contexts. In less controlled conditions, however, human review is often necessary to check and correct the output of such systems so that it remains understandable. We propose a method for computer-assisted transcription of speech based on automatic reordering of confusion networks. The method is evaluated in terms of KSR (Keystroke Saving Rate) and WSR (Word Stroke Ratio). It significantly reduces the number of actions needed to correct ASR output: WSR computed before and after every network reordering shows a gain of about 17.7% (3.4 points).
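The WSR comparison described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the confusion network layout (a list of slots, each holding candidate words with scores), the slot-by-slot alignment with the reference, the WSR formula (corrected words divided by reference words), and the names `best_path`, `word_stroke_ratio`, and `relative_gain`. The "after" network is written by hand to mimic the effect of a reordering; the reordering method itself is not shown.

```python
from typing import List, Tuple

# Hypothetical layout: one slot per word position, each slot holding
# (candidate word, score) pairs. Not the paper's data structure.
Slot = List[Tuple[str, float]]


def best_path(network: List[Slot]) -> List[str]:
    """Return the highest-scoring candidate of every slot."""
    return [max(slot, key=lambda cand: cand[1])[0] for slot in network]


def word_stroke_ratio(hypothesis: List[str], reference: List[str]) -> float:
    """Fraction of reference words the user would have to correct.

    Assumes hypothesis and reference are already aligned one-to-one.
    """
    if len(hypothesis) != len(reference):
        raise ValueError("this sketch assumes a one-to-one alignment")
    corrections = sum(h != r for h, r in zip(hypothesis, reference))
    return corrections / len(reference)


def relative_gain(before: float, after: float) -> float:
    """Relative reduction of a ratio, e.g. 0.5 -> 0.25 gives 0.5."""
    return (before - after) / before


reference = ["the", "cat", "sat", "down"]

# Toy network before reordering: slots 2 and 4 rank a wrong word first.
network_before = [
    [("the", 0.9), ("a", 0.1)],
    [("cap", 0.6), ("cat", 0.4)],
    [("sat", 0.7), ("set", 0.3)],
    [("town", 0.5), ("down", 0.45)],
]

# Same network with slot 2 rescored by hand to mimic a reordering step.
network_after = [
    [("the", 0.9), ("a", 0.1)],
    [("cat", 0.7), ("cap", 0.3)],
    [("sat", 0.7), ("set", 0.3)],
    [("town", 0.5), ("down", 0.45)],
]

wsr_before = word_stroke_ratio(best_path(network_before), reference)
wsr_after = word_stroke_ratio(best_path(network_after), reference)
gain = relative_gain(wsr_before, wsr_after)
```

In this toy case the user corrects two of four words before reordering and one of four afterwards. If the reported 17.7% is read as a relative gain and 3.4 points as the absolute difference, which is an interpretation rather than something the abstract states, the baseline WSR would be roughly 3.4 / 0.177 ≈ 19.2%.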
Keywords
speech recognition, ASR, KSR, WSR, computer-assisted transcription, confusion network reordering, human review, keystroke saving rate, large vocabulary automatic speech recognition technology, word stroke ratio, automatic correction, cache models, confusion network