What Do Questions Exactly Ask? Mfae: Duplicate Question Identification With Multi-Fusion Asking Emphasis

PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM)(2020)

引用 8|浏览65
暂无评分
摘要
Duplicate Question Identification (DQI) improves the processing efficiency and accuracy of large-scale community question answering and automatic QA system. The purpose of DQI task is to identify whether the paired questions are semantically equivalent. However, how to distinguish the synonyms or homonyms in paired questions is still challenging. Most previous works focus on the word-level or phrase-level semantic differences. We firstly propose to explore the asking emphasis of a question as a key factor in DQI. Asking emphasis bridges semantic equivalence between two questions. In this paper, we propose an attention model with multi-fusion asking emphasis (MFAE) for DQI. At first, BERT is used to obtain the dynamic pre-trained word embeddings. Then we get inter- and intra-asking emphasis by summing inter-attention and self-attention, respectively; the idea is that, the more a word interacts with others, the more important the word is. Finally, we use eight-way combinations to generate multi-fusion asking emphasis and multi-fusion word representation. Experimental results demonstrate that our model achieves state-of-the-art performance on both Quora Question Pairs and CQADupStack data. In addition, our model can also improve the results for natural language inference task on SNLI and MultiNLI datasets. The code is available at https://github.com/rzhangpku/MFAE.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要