Data-Efficiency with a Single GPU: an Exploration of Transfer Methods for Small Language Models
Key words
Topic Modeling, Language Modeling, Machine Translation, Neural Machine Translation, Multilingual Neural Machine Translation
