Using Monolingual Speech Recognition for Spoken Term Detection in Code-switched Hindi-English Speech.

ICDM Workshops (2019)

Abstract

Code-switching is the alternation of two or more languages within a single utterance or conversation, and it is prevalent in multilingual communities all over the world. Spoken Term Detection (STD) is the task of detecting a given word or phrase in audio, with applications in audio indexing and mining. In this work, we explore STD for code-switched conversational Hindi-English speech. Code-switching poses several challenges for this task: (1) a lack of training data to build robust code-switched Automatic Speech Recognition (ASR) systems, (2) non-standardized transcription due to borrowing and cross-transcription, and (3) the presence of translated or code-switched variants of the terms. We assume that a code-switched ASR system for Hindi-English does not exist, and use only a monolingual Hindi ASR system to retrieve audio containing Hindi and English keywords. We apply various techniques to normalize the output of the monolingual ASR system. We evaluate these techniques using Term Weighted Value (TWV) and find that phonetic matching of the query and the ASR hypotheses at the utterance level is the most promising approach.
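For readers unfamiliar with the metric, the sketch below shows how Term Weighted Value is commonly computed, following the definition in the NIST 2006 STD evaluation plan. It is an illustration only: the abstract does not say which TWV variant or parameters the authors used, and the names `TermCounts` and `term_weighted_value`, as well as the choice of one non-target trial per second of speech, are assumptions taken from the NIST definition rather than from this paper.

```python
from dataclasses import dataclass
from typing import Iterable

# Weight on false alarms in the NIST STD 2006 plan (assumed here, not from the paper).
BETA = 999.9


@dataclass
class TermCounts:
    """Per-term detection counts (hypothetical container for this sketch)."""
    n_true: int      # occurrences of the term in the reference
    n_correct: int   # occurrences the system detected correctly
    n_spurious: int  # detections with no matching reference occurrence


def term_weighted_value(counts: Iterable[TermCounts],
                        speech_seconds: float,
                        beta: float = BETA) -> float:
    """Return 1 minus the mean over terms of P_miss + beta * P_fa."""
    total = 0.0
    scored = 0
    for c in counts:
        if c.n_true == 0:
            # Terms with no reference occurrences have an undefined P_miss
            # and are left out of the average.
            continue
        p_miss = 1.0 - c.n_correct / c.n_true
        # NIST approximates the number of non-target trials as one per
        # second of speech, minus the true occurrences of the term.
        n_non_target = speech_seconds - c.n_true
        p_fa = c.n_spurious / n_non_target
        total += p_miss + beta * p_fa
        scored += 1
    if scored == 0:
        raise ValueError("no term has reference occurrences")
    return 1.0 - total / scored
```

Under this definition a system that finds every occurrence with no false alarms scores 1.0, a system that returns nothing scores 0.0, and false alarms can push the value below zero because of the large weight on them.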

Keywords

Spoken Term Detection, Code-Switching, Code-Mixing, Keyword Spotting, Low-Resource
