Talking Face Generation with Multilingual TTS

Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim

IEEE Conference on Computer Vision and Pattern Recognition (2022)

Abstract

Recent studies in talking face generation have focused on building a model that can generalize from any source speech to any target identity. A number of works have already claimed this functionality and have added that their models will also generalize to any language. However, we show, using languages from different language families, that these models do not translate well when the training language and the testing language are sufficiently different. We reduce the scope of the problem to building a language-robust talking face generation system on seen identities, i.e., the target identity is the same as the training identity. In this work, we introduce a talking face generation system that generalizes to different languages. We evaluate the efficacy of our system using a multilingual text-to-speech system. We present the joint text-to-speech system and the talking face generation system as a neural dubber system. Our demo is available at https://bit.ly/ml-face-generation-cvpr22-demo. Also, our screencast is uploaded at https://youtu.be/F6h0s0M4vBI.

Keywords

multilingual TTS, source speech, target identity, different language families, training language, testing language, language-robust talking face generation system, training identity, multilingual text-to-speech system, joint text-to-speech system
