Estimating the Quality of Synthesized and Natural Speech Transmitted Through Telephone Networks Using Single-ended Prediction Models

Acta Acustica United With Acustica(2008)

引用 23|浏览6
暂无评分
摘要
This paper reports on experiments to estimate the speech output quality of telephone services in an instrumental way, using single-ended quality prediction models. It addresses both naturally-produced as well as synthesized speech generated with a Text-To-Speech ( TTS) system. Three auditory tests have been carried out where typical speech samples have been transmitted over various telephone channels, and then judged by listeners with respect to their overall quality. The mean auditory ratings obtained in these tests have been compared to estimates provided by three different single-ended models, one of which is currently recommended by the International Telecommunication Union for predicting the quality of naturally-produced speech. Correlations between auditory and estimated quality scores vary considerably between experiments. It is concluded that the single-ended models mainly predict the effects of the transmission channel, but not of the ( naturally-produced or synthesized) source speech material.
更多
查看译文
关键词
prediction model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要