Logistic discriminative speech detectors using posterior SNR

ICASSP '04). IEEE International Conference(2004)

引用 8|浏览20
暂无评分
摘要
We introduce an elegant and novel design for a speech detector which estimates the probability of the presence of speech in each time-frequency bin, as well as in each frame. The proposed system uses discriminative estimators based on logistic regression, and incorporates spectral and temporal correlations in the same framework. The detector is flexible enough to be configured in a single level or a "stacked" bilevel architecture depending on the needs of the application. An important part of the proposed design is the use of a new set of features: the normalized logarithm of the estimated posterior signal-to-noise ratio. These can be easily and automatically generated by tracking the noise spectrum online. We present results on the AURORA database to demonstrate that the overall design is simple, flexible and effective.
更多
查看译文
关键词
acoustic signal detection,correlation methods,parameter estimation,probability,regression analysis,speech processing,discriminative estimators,logistic discriminative speech detectors,logistic regression,normalized logarithm,posterior SNR,posterior signal-to-noise ratio estimation,spectral correlation,stacked bilevel architecture,temporal correlation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要