Temporal Dynamics Of On-Line Information Streams
DATA STREAM MANAGEMENT: PROCESSING HIGH-SPEED DATA STREAMS(2016)
摘要
A number of recent computing applications involve information arriving con- tinuously over time in the form of a data stream, and this has led to new ways of thinking about traditional problems in a variety of areas. In some cases, the rate and overall volume of data in the stream may be so great that it cannot all be stored for processing, and this leads to new requirements for eciency and scalability. In other cases, the quantities of information may still be manageable, but the data stream perspective takes what has generally been a static view of a problem and adds a strong temporal dimension to it. Our focus here is on some of the challenges that this latter issue raises in the settings of text mining, on-line information, and information retrieval. Many information sources have a stream-like structure, in which the way con- tent arrives over time carries an essential part of its meaning. News coverage is a basic example; understanding the pattern of a developing news story re- quires considering not just the content of the relevant articles but also how they evolve over time. Some of the other basic corpora of interest in infor- mation retrieval | for example, scientic papers and patents | show similar temporal evolution over time-scales that can last years and decades. And the proliferation of on-line information sources and on-line forms of communica- tion has led to numerous other examples: e-mail, chat, discussion boards, and weblogs (or \blogs") all represent personal information streams with intricate topic modulations over time. Indeed, all these information sources co-exist on-line | news, e-mail, dis- cussion, commentary, and the collective output of professional and research communities; they form much of the raw material through which Internet users navigate and search. They have also served to make the \time axis" of infor- mation increasingly visible. One could argue that these developments have led to a shift in our working metaphor for Internet and Web information, from a relatively static one, a \universal encyclopedia," to a much more dynamic
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络