Detecting Recurring and Novel Classes in Concept-Drifting Data Streams

Mohammad M. Masud, Tahseen M. Al-Khateeb, Latifur Khan, Charu Aggarwal, Jing Gao, Jiawei Han, Bhavani Thuraisingham

Data Mining (2011)

Abstract

Concept-evolution, which occurs when a new class emerges in the stream, is one of the major challenges in data stream classification. Most state-of-the-art techniques do not address this problem. A recurring class is a special case of concept-evolution: a class appears in the stream, disappears for a long time, and then reappears. Existing data stream classification techniques that address concept-evolution wrongly detect recurring classes as novel classes. This creates two main problems. First, resources are wasted in detecting a recurring class as a novel class, because novel class detection is much more computationally and memory intensive than simply recognizing an existing class. Second, when a novel class is identified, human experts are involved in collecting and labeling the instances of that class for future modeling. If a recurring class is reported as a novel class, human effort is wasted in determining whether it really is novel. In this paper, we address the recurring class issue and propose a more realistic novel class detection technique, which remembers a class and identifies it as "not novel" when it reappears after a long disappearance. Our approach has shown a significant reduction in classification error over state-of-the-art stream classification techniques on several benchmark data streams.
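The idea of remembering a class so that it is reported as "not novel" on its return can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes, purely for illustration, that each class is summarized by a centroid and a covering radius; the names ClassSummary, ClassMemory, learn, retire, and classify are hypothetical.

```python
import math
from dataclasses import dataclass, field


@dataclass
class ClassSummary:
    """Hypothetical per-class summary: a centroid and a covering radius."""
    centroid: tuple
    radius: float


@dataclass
class ClassMemory:
    """Keeps a summary of every class seen so far, including absent ones."""
    active: dict = field(default_factory=dict)    # classes in the current model
    archived: dict = field(default_factory=dict)  # classes that left the stream

    def learn(self, label, points):
        # Summarize labeled points as centroid + max distance to the centroid.
        dim = len(points[0])
        centroid = tuple(sum(p[i] for p in points) / len(points) for i in range(dim))
        radius = max(math.dist(p, centroid) for p in points)
        self.archived.pop(label, None)
        self.active[label] = ClassSummary(centroid, radius)

    def retire(self, label):
        # The class disappeared: archive its summary instead of discarding it.
        self.archived[label] = self.active.pop(label)

    def classify(self, x):
        # 1) Covered by an active class: an ordinary existing class.
        label = self._nearest_covering(self.active, x)
        if label is not None:
            return ("existing", label)
        # 2) Covered only by a remembered class: recurring, hence "not novel".
        label = self._nearest_covering(self.archived, x)
        if label is not None:
            return ("recurring", label)
        # 3) Covered by nothing: hand over to the costly novel class detector.
        return ("novel_candidate", None)

    @staticmethod
    def _nearest_covering(summaries, x):
        # Return the label of the closest summary whose radius covers x.
        best, best_d = None, math.inf
        for label, s in summaries.items():
            d = math.dist(x, s.centroid)
            if d <= s.radius and d < best_d:
                best, best_d = label, d
        return best


memory = ClassMemory()
memory.learn("A", [(0, 0), (1, 0), (0, 1)])
memory.learn("B", [(10, 10), (11, 10), (10, 11)])
memory.retire("B")                      # class B vanishes from the stream
print(memory.classify((0.3, 0.3)))      # covered by active class A
print(memory.classify((10.3, 10.3)))    # covered by archived class B
print(memory.classify((50, 50)))        # covered by no known class
```

In this sketch only instances that fall outside every remembered boundary reach the expensive novel class path, which mirrors the resource and labeling savings the abstract describes.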

Keywords

recurring class, concept-drifting data streams, novel class, novel class detection, recurrent class, recurring detection, pattern classification, stream classification, special case, realistic novel class detection, data stream classification, detecting recurring, concept-evolution, benchmark data stream, data stream classification techniques, data handling, new class evolves, existing class, data stream classification technique, novel classes, concept drift
