A supervised signal-to-noise ratio estimation of speech signals

ICASSP(2014)

引用 25|浏览55
暂无评分
摘要
This paper introduces a supervised statistical framework for estimating the signal-to-noise (SNR) ratio of speech signals. Information on how noise corrupts a signal can help us compensate for its effects, especially in real life applications where the usual assumption of white Gaussian noise does not hold and speech boundaries in the signal are not known. We use features from which we can detect speech regions in a signal, without using Voice Activity Detection, and estimate the energies of those regions. Then we use these features to train ordinary least squares regression models for various noise types. We compare this supervised method with state-of-the-art SNR estimation algorithms and show its superior performance with respect to the tested noise types.
更多
查看译文
关键词
speech processing,supervised learning,snr ratio,voice activity detection,speech regions,learning (artificial intelligence),regression analysis,speech boundaries,supervised signal-to-noise ratio estimation,speech signal processing,least squares regression models,least squares approximations,supervised statistical framework,speech signals,signal-to-noise ratio estimation,gaussian noise,supervised method,estimation,nist,speech,robustness,signal to noise ratio,learning artificial intelligence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要