End-to-End Transformer-Based Open-Vocabulary Keyword Spotting with Location-Guided Local Attention.

Bo Wei, Meirong Yang, Tao Zhang, Xiao Tang, Xing Huang,Kyuhong Kim,Jaeyun Lee,Kiho Cho,Sung-Un Park

Interspeech(2021)

引用 8|浏览6
暂无评分
摘要
Open-vocabulary keyword spotting (KWS) aims to detect arbitrary keywords from continuous speech, which allows users to define their personal keywords. In this paper, we propose a novel location guided end-to-end (E2E) keyword spotting system. Firstly, we predict endpoints of keyword in the entire speech based on attention mechanism. Secondly, we calculate the existence probability of keyword by fusing the located keyword speech segment and text with local attention. The results on Librispeech dataset and Google speech commands dataset show our proposed method significantly outperforms the baseline method and the latest small-footprint E2E KWS method.
更多
查看译文
关键词
Open-vocabulary,keyword spotting,end-to-end,keyword location,local attention
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要