Leveraging Interactive Knowledge And Unlabeled Data In Gender Classification With Co-Training

DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015(2015)

引用 7|浏览32
暂无评分
摘要
Conventional approaches to gender classification much rely on a large scale of labeled data, which is normally hard and expensive to obtain. In this paper, we propose a co-training approach to address this problem in gender classification. Specifically, we employ both non-interactive and interactive texts, i.e., the message and comment texts, as two different views in our co-training approach to well incorporate unlabeled data. Experimental results on a large data set from micro-blog demonstrate the appropriateness of leveraging interactive knowledge in gender classification and the effectiveness of the proposed co-training approach in gender classification.
更多
查看译文
关键词
Interactive knowledge, Gender classification, Co-training
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要