Generating High-Quality Surface Realizations Using Data Augmentation and Factored Sequence Models.
MULTILINGUAL SURFACE REALISATION: SHARED TASK AND BEYOND(2018)
摘要
This work presents state of the art results in reconstruction of surface realizations from obfuscated text. We identify the lack of sufficient training data as the major obstacle to training high-performing models, and solve this issue by generating large amounts of synthetic training data. We also propose preprocessing techniques which make the structure contained in the input features more accessible to sequence models. Our models were ranked first on all evaluation metrics in the English portion of the 2018 Surface Realization shared task.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络