On-the-Fly Attention Modularization for Neural Generation

Ximing Lu
Jena D. Hwang
Antoine Bosselut

Abstract:

Despite considerable advancements with deep neural language models (LMs), neural text generation still suffers from degeneration: generated text is repetitive, generic, self-inconsistent, and lacking commonsense. Empirical analyses of sentence-level attention patterns reveal that neural text degeneration may be associated with insufficient...
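The abstract above is truncated, so the paper's exact procedure is not spelled out here. Purely as an illustration of the general idea named in the title, the following is a minimal NumPy sketch of modulating self-attention on the fly at decoding time by adding a per-token emphasis term to the attention logits before the softmax. The function names, the emphasis scheme, and the scaling parameter are illustrative assumptions, not the authors' method.

```python
import numpy as np

# Hypothetical sketch: reweighting attention during decoding.
# This is NOT the paper's exact algorithm; it only shows the broad notion of
# modulating attention weights on the fly, e.g. sharpening attention over
# selected context tokens.

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def modulated_attention(scores, emphasis, alpha=1.0):
    """Reweight raw attention scores before the softmax.

    scores:   (num_queries, num_keys) raw attention logits
    emphasis: (num_keys,) per-token bonus, e.g. +1 for context tokens
              the decoder should attend to more strongly (an assumption)
    alpha:    scalar strength of the modulation (0 = unmodified attention)
    """
    return softmax(scores + alpha * emphasis[None, :], axis=-1)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    scores = rng.normal(size=(1, 6))               # one decoding step, 6 context tokens
    emphasis = np.array([0., 0., 0., 1., 1., 1.])  # favor the most recent tokens
    print("plain    :", softmax(scores).round(3))
    print("modulated:", modulated_attention(scores, emphasis, alpha=2.0).round(3))
```

With alpha set to 0 the output matches ordinary softmax attention, so a scheme of this shape can be applied at inference time without retraining the underlying LM.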
