When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing(2021)
Key words
Language Modeling,Statistical Language Modeling,Automatic Speech Recognition,Sequence-to-Sequence Learning,End-to-End Speech Recognition
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined