Rare category exploration via wavelet analysis

Expert Systems with Applications: An International Journal(2016)

引用 5|浏览73
暂无评分
摘要
We propose a novel approach RCEWA for RCE which achieves a linear time complexity.We provide theoretical proofs for the effectiveness of using wavelet analysis for RCE.Experiments show that RCEWA outperforms the existing algorithms w.r.t. F-score. Rare category exploration (in short as RCE) aims to discover all the remaining data examples of a rare category from a known data example of the rare category. A few approaches have been proposed to address this problem. Most of them, however, are on quadratic or even cubic time complexities w.r.t. data set size n. More importantly, the F-scores (harmonic mean of precision and recall) of the existing approaches are not satisfactory. Compared with the existing solutions to RCE, this paper proposes a novel approach with a linear time complexity and achieves a higher F-score of mining results. The key steps of our approach are to reduce search space by performing wavelet analysis on the data density function, and then refine the coarse mining result in the reduced search space via fine-grained metrics. A solid theoretical analysis is conducted to prove the feasibility of our solution, and extensive experiments on real data sets further verify its effectiveness and efficiency.
更多
查看译文
关键词
Rare category exploration,Wavelet transform,Linear time complexity,Bandwidth selection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要