An Empirical Investigation of Beam-Aware Training in Supertagging
EMNLP, pp. 4534-4542, 2020.
Structured prediction is often approached by training a locally normalized model with maximum likelihood and decoding approximately with beam search. This approach leads to mismatches as, during training, the model is not exposed to its mistakes and does not use beam search. Beam-aware training aims to address these problems, but unfort...More
PPT (Upload PPT)