A Relevant Content Filtering Based Framework For Data Stream Summarization.
Lecture Notes in Computer Science(2016)
摘要
Social media platforms are a rich source of information these days, however, of all the available information, only a small fraction is of users' interest. To help users catch up with the latest topics of their interests from the large amount of information available in social media, we present a relevant content filtering based framework for data stream summarization. More specifically, given the topic or event of interest, this framework can dynamically discover and filter out relevant information from irrelevant information in the stream of text provided by social media platforms. It then captures the most representative and up-to-date information to generate a sequential summary or event story line along with the evolution of the topic or event. This framework does not depend on any labeled data, it instead uses the weak supervision provided by the user, which matches the real scenarios of users searching for information about an ongoing event. The experiments on two real events traced by Twitter verified the effectiveness of the proposed framework. The robustness of using the most easy-to-obtain weak supervision, i.e., trending topic or hashtag indicates that the framework can be easily integrated into social media platforms such as Twitter to generate sequential summaries for the events of interest.
更多查看译文
关键词
Social media,Data stream,Content filtering,Summarization,Microblog,Twitter
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络