Latent-Based Neural Net for Non-Intrusive Speech Quality Assessment

Fredrik Cumlin, Christian Schüldt,Saikat Chatterjee

2023 31st European Signal Processing Conference (EUSIPCO)(2023)

引用 0|浏览0
暂无评分
摘要
For non-intrusive speech quality assessment, we treat the mean-opinion-score (MOS) of a speech signal as a latent, and propose a latent MOS network (LaMOSNet) to estimate the MOS. At the time of training, the proposed LaMOSNet has two parts in series, with the first part providing the latent estimate, i.e. the MOS of an input speech signal, and the second part providing an estimated score by a given judge. Only the first part is used for testing. We address two inherent aspects -limited-data and noisy-data aspects - in training using stochastic gradient noise and a student-teacher type of training, motivated by semi-supervised learning. It is shown that LaMOSNet provides good performance on the Voice Conversion Challenge 2018 dataset, and state-of-the-art correlation performance on the Voice Conversion Challenge 2016 dataset.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要