Character Feature Extraction for Novels Based on Text Analysis

Tingting Wu,Jianming Hu,Xin Zhu

2022 International Conference on Culture-Oriented Science and Technology (CoST)(2022)

引用 0|浏览0
暂无评分
摘要
Research on automatic character analysis of novels can help to achieve automatic Q & A with fictional characters. In this paper, a corpus containing 1435 novel texts was constructed with Chinese martial arts novel characters as the research object, and a total of 57026 characters were extracted. The character vectors were generated by Skip-gram model training, and the effect of applying the character vectors was explored. Similarity calculation and K-means clustering were performed on the persona vectors, and the experimental results showed that people from the same author usually have similarity. The gender classification prediction was performed using logistic regression and support vector machine for the persona vectors respectively, and the experimental results showed that both classification algorithms could predict the gender of the new sample characters well.
更多
查看译文
关键词
character vector,Skip-gram,character similarity calculation,character clustering,binary classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要