MetaPAD: Meta Pattern Discovery from Massive Text Corpora
KDD '17: The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 2017, pp. 877-886.
We developed an efficient framework, MetaPAD, to discover the meta patterns from massive corpora with three techniques, including a context-aware segmentation method to carefully determine the boundaries of the patterns with a learnt pattern quality assessment function, which avoids c...
Mining textual patterns in news, tweets, papers, and many other kinds of text corpora has been an active theme in text mining and NLP research. Previous studies adopt a dependency parsing-based pattern discovery approach. However, the parsing results lose rich context around entities in the patterns, and the process is costly for a corpus...