Vocabulary-independent search in spontaneous speech

Frank Seide,Peng Yu,Chengyuan Ma,Eric Chang

ICASSP '04). IEEE International Conference（2004）

引用 106|浏览37

暂无评分

摘要

For efficient organization of speech recordings - meetings, interviews, voice mails, lectures - the ability to search for spoken keywords is an essential capability. Today, most spoken-document retrieval systems use large-vocabulary recognition. For the above scenarios, such systems suffer from both the unpredictable vocabulary/domain and generally high word-error rates (WER). We present a vocabulary-independent system to index and to search rapidly spontaneous speech. A speech recognizer generates lattices of phonetic word fragments, against which keywords are matched phonetically. We first show the need to use recognition alternatives (lattices) in a high-WER context, on a word-based baseline. Then we introduce our new method of phonetic word-fragment lattice generation, which uses longer-span language knowledge than a phoneme recognizer. Last we introduce heuristics to compact the lattices to feasible sizes that can be searched efficiently. On the LDC voice mail corpus, we show that vocabulary/domain-independent phonetic search is as accurate as a vocabulary/domain-dependent word-lattice based baseline system for in-vocabulary keywords (FOMs of 74-75%), but nearly maintains this accuracy also for out-of-vocabulary keywords.

查看译文

关键词

error statistics,query processing,speech processing,speech recognition,large-vocabulary recognition,phonetic word fragment lattice generation,speech recognizer,speech recordings,spoken keywords,spoken-document retrieval systems,spontaneous speech,vocabulary-independent search,word-error rates

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要