Learning to Count Words in Fluent Speech enables Online Speech Recognition
SLT, pp. 38-45, 2021.
Sequence to Sequence models, in particular the Transformer, achieve state of the art results in Automatic Speech Recognition. Practical usage is however limited to cases where full utterance latency is acceptable. In this work we introduce Taris, a Transformer-based online speech recognition system aided by an auxiliary task of incremen...More
Full Text (Upload PDF)
PPT (Upload PPT)