Efficient Privacy Preserving Distributed K-Means for Non-IID Data

André Brandão, Ricardo Mendes, João P. Vilela

ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021 (2021)

Abstract

Privacy is becoming a crucial requirement in many machine learning systems. In this paper we introduce an efficient and secure distributed K-Means algorithm that is robust to non-IID data. The core idea of our proposal is that each client runs the K-Means algorithm locally, with a variable number of clusters. The server then applies K-Means again to the resulting local centroids to discover the global centroids. To preserve the clients' privacy, homomorphic encryption and secure aggregation are used while learning the global centroids. The algorithm is efficient and reduces transmission costs, since only the local centroids are used to find the global centroids. In our experimental evaluation, we demonstrate that our strategy achieves performance similar to the centralized version, even when the data follows an extreme non-IID distribution.
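
The two-stage idea described in the abstract can be illustrated with a minimal plaintext sketch. This is not the authors' implementation: it omits the homomorphic encryption and secure aggregation steps entirely, and the function names (`kmeans`, `distributed_kmeans`), the choice of local cluster counts, and the synthetic data are all assumptions made for illustration. It only shows clients clustering locally and a server clustering the pooled local centroids.

```python
import numpy as np

def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's K-Means; returns a (k, d) array of centroids."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points (requires k <= len(points)).
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        # Assign each point to its nearest centroid.
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute each centroid; keep the old one if its cluster is empty.
        new_centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids

def distributed_kmeans(client_datasets, local_ks, global_k, seed=0):
    """Hypothetical helper: local K-Means per client, then K-Means on the centroids.

    Assumption: local centroids are sent in the clear. The paper protects this
    step with homomorphic encryption and secure aggregation, which is NOT
    modelled here.
    """
    # Stage 1: each client clusters its own data with its own number of clusters.
    local_centroids = [
        kmeans(data, k, seed=seed) for data, k in zip(client_datasets, local_ks)
    ]
    # Stage 2: the server clusters only the pooled local centroids.
    pooled = np.vstack(local_centroids)
    return kmeans(pooled, global_k, seed=seed)

# Synthetic extreme non-IID setting: each client holds points from one blob only.
rng = np.random.default_rng(42)
blob_centers = [[0.0, 0.0], [5.0, 5.0], [0.0, 5.0]]
clients = [rng.normal(loc=c, scale=0.3, size=(100, 2)) for c in blob_centers]

global_centroids = distributed_kmeans(clients, local_ks=[2, 2, 2], global_k=3)
print(global_centroids)
```

In this sketch the server sees six 2-D points instead of 300, which mirrors the transmission saving the abstract mentions. Because the initialisation is random, the result depends on the seed, as with ordinary K-Means.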

Keywords

Privacy, Distributed clustering, Federated learning, Homomorphic encryption, Secure aggregation
