Oblivious dimension reduction for k -means: beyond subspaces and the Johnson-Lindenstrauss lemma

Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing(2019)

引用 70|浏览103
暂无评分
摘要
We show that for n points in d-dimensional Euclidean space, a data oblivious random projection of the columns onto m∈ O((logk+loglogn)ε−6log1/ε) dimensions is sufficient to approximate the cost of all k-means clusterings up to a multiplicative (1±ε) factor. The previous-best upper bounds on m are O(logn· ε−2) given by a direct application of the Johnson-Lindenstrauss Lemma, and O(kε−2) given by [Cohen et al.-STOC’15].
更多
查看译文
关键词
dimension reduction, k-means, random projections
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要