WaveNet-Based Zero-Delay Lossless Speech Coding.

Takenori Yoshimura,Kei Hashimoto,Keiichiro Oura,Yoshihiko Nankaku,Keiichi Tokuda

SLT（2018）

引用 6|浏览64

暂无评分

摘要

This paper presents a WaveNet-based zero-delay lossless speech coding technique for high-quality communications. The WaveNet generative model, which is a state-of-the-art model for neural-network-based speech waveform synthesis, is used in both the encoder and decoder. In the encoder, discrete speech signals are losslessly compressed using sample-by-sample entropy coding. The decoder fully reconstructs the original speech signals from the compressed signals without algorithmic delay. Experimental results show that the proposed coding technique can transmit speech audio waveforms with 50% their original bit rate and the WaveNet-based speech coder remains effective for unknown speakers.

查看译文

关键词

Speech coding,Decoding,Training,Delays,Probability distribution,Adaptation models

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要