Cross-Domain Text Classification Algorithm Based On Instance-Transfer Learning

Ruijun Liu,Jun Wang, Zhuo Yu,Yuqian Shi,Lun Zhang,Changjiang Ji,Xin Jin

INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2020（2020）

引用 0|浏览13

暂无评分

摘要

Cross-domain text classification has broad application prospects in the field of data mining. Since transfer learning can help target domain data to achieve the sharing and transfer of semantic information with the help of existing knowledge domains, transfer learning is generally used to achieve cross-domain text processing. Based on this, we propose a cross-domain text classification algorithm -MTrA. The algorithm is based on TrAdaBoost, taking into account the distribution differences between the source domain and the target domain. It uses the Maximum Mean Discrepancy (MMD) as the initial weight parameter of the two domains. MTrA adds a weight backfill factor that considers the accuracy of the source domain classification and balances the weight update method of the source domain data. Through the verification in the dataset 20 Newsgroups, compared with the traditional TrAdaBoost algorithm, it improves the classification accuracy by 9.4% on average. it proves the effectiveness and advantages of the algorithm.

查看译文

关键词

transfer learning, cross-domain, text classification

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要