AliMe DA: A Data Augmentation Framework for Question Answering in Cold-start Scenarios

Research and Development in Information Retrieval (2021)

Abstract
Cold-start is the most difficult and time-consuming phase when building a question-answering-based chatbot for a new business scenario, because sufficient training data must first be collected. In this paper, we propose AliMe DA, a practical data augmentation (DA) framework consisting of data production, denoising, and consumption, to alleviate this problem. We show how our DA approach can substantially enhance annotation productivity and improve downstream model performance. More importantly, we provide best practices for data augmentation, including how to choose and employ appropriate methods at each stage of our framework, and share our observations on the scenarios where data augmentation is applicable in the era of pre-trained language models.
Keywords
Data Augmentation, Question Answering, Cold-start