Chrome Extension
WeChat Mini Program
Use on ChatGLM

Multilingual Bottleneck Features for Query by Example Spoken Term Detection

arXiv (Cornell University)(2019)

Cited 9|Views5
No score
Abstract
State of the art solutions to query by example spoken term detection (QbE-STD) rely on bottleneck feature representation of the query and audio document. Here, we present a study on QbE-STD performance using several monolingual as well as multilingual bottleneck features extracted from feed forward networks. In contrast to previous works, we use multitask learning to train the multilingual networks which perform significantly better than the concatenated monolingual features. Additionally, we propose to employ residual networks (ResNet) to estimate the bottleneck features and show significant improvements over the corresponding feed forward network based features. The neural networks are trained on GlobalPhone corpus and QbE-STD experiments are performed on a very challenging QUESST 2014 database
More
Translated text
Key words
Multilingual feature,Bottleneck feature,Residual network,Multitask learning,Query by example spoken term detection
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined