谷歌浏览器插件
订阅小程序
在清言上使用

Centroidal Clustering of Noisy Observations by Using rth Power Distortion Measures

IEEE transactions on neural networks and learning systems(2024)

引用 2|浏览3
暂无评分
摘要
We consider the problem of clustering a dataset through multiple noisy observations of its members. The goal is to obtain a clustering that is as faithful to the clustering of the original dataset as possible. We propose a centroidal approach whose distortion measure is the sum of rth powers of the distances between the cluster center and the noisy observations. For r = 2, our scheme boils down to the well-known approach of clustering the average of noisy samples. First, we provide a mathematical analysis of our clustering scheme. In particular, we find formulas for the average distortion and the spatial distribution of the cluster centers in the asymptotic regime where the number of centers is large. We then provide an algorithm to numerically optimize the cluster centers in the finite regime. We extend our method to automatically assign weights to noisy observations. Finally, we show that for various practical noise models, with a suitable choice of r, our algorithms can outperform several other existing techniques over various datasets.
更多
查看译文
关键词
Centroidal clustering,high-resolution theory,noisy clustering,quantization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要