Towards Flexible Speech Coding For Speech Synthesis: An Lf Plus Modulated Noise Vocoder
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5(2008)
摘要
This paper presents an ARX-LF-based model of speech that is amenable to low-bit-rate quantization and speech modifications directly at the parametric domain. The new model successfully addresses the non-deterministic part of voiced speech by modulating noise with the glottal flow, while unvoiced speech and transients are synthesized by modulating noise with a signal-derived time envelope. The presented work is essentially a high-quality vocoder that can be used for low complexity coding/synthesis/modification of speech suitable for embedded text-to-speech applications.
更多查看译文
关键词
speech coding, LF, LPC vocoder, embedded speech synthesis, text-to-speech, modulated noise, pitch/time scaling, speech transformation/modification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络