Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model
INTERSPEECH, pp. 1123-1127, 2019.
We present an attention-based sequence-to-sequence neural network which can directly translate speech from one language into speech in another language, without relying on an intermediate text representation. The network is trained end-to-end, learning to map speech spectrograms into target spectrograms in another language, correspondin...More
PPT (Upload PPT)