Noisy Word Recognition Using A Feature Based On Ternarized Spectral Slope

2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3(2007)

引用 1|浏览2
暂无评分
摘要
In previous paper, we proposed a feature FTTSS (Fourier Transform of Ternarized Spectral Slope) based on power spectrum derivatives with regard to frequency to develop a robust word recognition system under noisy environments, and we confirmed noise robustness of the feature compared with MFCC by applying it to word recognition with HMM. Generally, word recognition with HMM is improved by adding features that may express temporal variations, such as Delta MFCC or Delta FTTSS, because HMM can deal with only piecewise stationary signals. Actually, we have examined effectiveness of using Delta FTTSS in word recognition. It is supposed that features showing raw temporal variations of spectral power are effective in speech recognition and ternary conversion of features may decrease deteriorations of recognition performance by noise corruption. Therefore in this research, we propose a new feature FTTTS (Fourier Transform of Ternarized Temporal Slope) instead of Delta FTTSS. The FTTTS is defined by Fourier Transform along frequency of smoothed Ternarized Temporal variations of Spectral power at specific frequency. As a result, we have confirmed experimentally that the proposed feature FTTTS have noise robustness for SNR 0-20 dB compared with FTTSS+Delta FTTSS or the conventional feature MFCC+Delta MFCC by applying them to word recognition with HMIM.
更多
查看译文
关键词
word recognition,fourier transform,speech recognition,power spectrum,fourier transforms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要