Deep learning based pipeline with multichannel inputs for patent classification

World Patent Information(2021)

Abstract
This work introduces a deep learning pipeline for automatic patent classification with multichannel inputs, based on LSTM networks and word vector embeddings. Sophisticated text mining methods are used to extract the most important segments from patent texts, and a domain-specific pre-trained word embedding model is developed, trained on a very large dataset of more than five million patents. The pipeline uses multiple parallel LSTM networks that read the source patent document through different input dimensions, namely embeddings of different segments of the patent text and sparse linear inputs of different metadata. Classifying patents into their corresponding technical fields is selected as a use case. In this use case, a series of patent classification experiments is conducted on different patent datasets, and the experimental results indicate that using the segments of patent texts together with the metadata as multichannel inputs for a deep neural network model achieves better performance than using a single input channel.
Keywords
Patent analysis, Neural network, Deep learning, Patent classification
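
The multichannel idea described in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation: it is a minimal, untrained forward pass in plain Python, assuming one LSTM encoder per text segment, a linear projection of sparse metadata, and a softmax layer over the concatenated channel outputs. The segment names (`title`, `abstract`, `claims`), all dimensions, the class names `TinyLSTM` and `MultiChannelClassifier`, and the random inputs are hypothetical choices made only for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLSTM:
    """Single-layer LSTM encoder that returns its final hidden state."""

    def __init__(self, input_dim, hidden_dim, rng):
        self.hidden_dim = hidden_dim
        # Rows are grouped by gate (input, forget, candidate, output);
        # each row acts on the concatenation [x_t, h_prev, 1].
        self.W = random_matrix(4 * hidden_dim, input_dim + hidden_dim + 1, rng)

    def encode(self, sequence):
        H = self.hidden_dim
        h, c = [0.0] * H, [0.0] * H
        for x in sequence:
            z = list(x) + h + [1.0]
            pre = [sum(w * v for w, v in zip(row, z)) for row in self.W]
            i = [sigmoid(p) for p in pre[0:H]]
            f = [sigmoid(p) for p in pre[H:2 * H]]
            g = [math.tanh(p) for p in pre[2 * H:3 * H]]
            o = [sigmoid(p) for p in pre[3 * H:4 * H]]
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        return h


class MultiChannelClassifier:
    """Hypothetical fusion model: parallel LSTM channels plus a metadata channel."""

    def __init__(self, segment_names, embed_dim, hidden_dim, meta_dim,
                 num_classes, seed=0):
        rng = random.Random(seed)
        self.encoders = {name: TinyLSTM(embed_dim, hidden_dim, rng)
                         for name in segment_names}
        self.meta_W = random_matrix(hidden_dim, meta_dim, rng)
        fused_dim = hidden_dim * (len(segment_names) + 1)
        self.out_W = random_matrix(num_classes, fused_dim, rng)

    def predict_proba(self, segments, metadata):
        fused = []
        # Each text segment is read by its own LSTM, in parallel channels.
        for name, encoder in self.encoders.items():
            fused += encoder.encode(segments[name])
        # Sparse metadata is a dict {feature_index: value}; the linear
        # projection only touches the non-zero entries.
        fused += [sum(row[j] * v for j, v in metadata.items())
                  for row in self.meta_W]
        logits = [sum(w * v for w, v in zip(row, fused)) for row in self.out_W]
        return softmax(logits)


# Stand-in inputs: random vectors in place of real word embeddings.
rng = random.Random(1)


def fake_embeddings(n_tokens, dim):
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_tokens)]


model = MultiChannelClassifier(["title", "abstract", "claims"], embed_dim=8,
                               hidden_dim=4, meta_dim=10, num_classes=3)
probs = model.predict_proba(
    {"title": fake_embeddings(5, 8),
     "abstract": fake_embeddings(20, 8),
     "claims": fake_embeddings(30, 8)},
    {2: 1.0, 7: 1.0},  # two active metadata features (hypothetical indices)
)
print(probs)
```

Because the weights are random and no training step is included, the printed probabilities carry no meaning; the sketch only shows how separate channels can be encoded independently and fused before classification. A single-channel baseline in the same style would simply pass one segment and omit the others.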