Chrome Extension

WeChat Mini Program

Use on ChatGLM

Log in

Academic Profile User Profile

My Following Paper Collections Browse History

Accelerated Training Via Device Similarity in Federated Learning

Yuanli Wang,Joel Wolfrath,Nikhil Sreekumar,Dhruv Kumar,Abhishek Chandra

PROCEEDINGS OF THE 4TH INTERNATIONAL WORKSHOP ON EDGE SYSTEMS, ANALYTICS AND NETWORKING (EDGESYS'21)（2021）

Cited 11|Views36

Abstract

Federated Learning is a privacy-preserving, machine learning technique that generates a globally shared model with in-situ model training on distributed devices. These systems are often comprised of millions of user devices and only a subset of available devices can be used for training in each epoch. Designing a device selection strategy is challenging, given that devices are highly heterogeneous in both their system resources and training data. This heterogeneity makes device selection very crucial for timely model convergence and sufficient model accuracy. Existing approaches have addressed system heterogeneity for device selection but have largely ignored the data heterogeneity. In this work, we analyze the impact of data heterogeneity on device selection, model convergence, model accuracy, and fault tolerance in a federated learning setting. Based on our analysis, we propose that clustering devices with similar data distributions followed by selecting the devices with the best processing capacity from each cluster can significantly improve the model convergence without compromising model accuracy. This clustering also guides us in designing policies for fault tolerance in the system. We propose three methods for identifying groups of devices with similar data distributions. We also identify and discuss rich trade-offs between privacy, bandwidth consumption, and computation overhead for each of these proposed methods. Our preliminary experiments show that the proposed methods can provide a 46% - 58% reduction in training time compared to existing approaches in reaching the same accuracy.

More

Translated text

Bibtex

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Related Papers

Reference papers

Cited Papers

The Divergence and Bhattacharyya Distance Measures in Signal Selection

1967

被引用1593 | 浏览

The Diver,gence and Bhattacharyya Distance Measures in Signal Selection

BHAT-

1967

被引用1162 | 浏览

Gradient-based Learning Applied to Document Recognition

Y Lecun,L Bottou,Y Bengio,P Haffner

1998

被引用70455 | 浏览

Federated Multi-Task Learning.

Virginia Smith,Chao-Kai Chiang,Maziar Sanjabi,Ameet Talwalkar

2017

被引用2356 | 浏览

Gaia: Geo-Distributed Machine Learning Approaching Lan Speeds

Kevin Hsieh,Aaron Harlap,Nandita Vijaykumar,Dimitris Konomis,Gregory R. Ganger,Phillip B. Gibbons,Onur Mutlu

2017

被引用527 | 浏览

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training.

Yujun Lin,Song Han,Huizi Mao,Yu Wang,Bill Dally

2018

被引用1731 | 浏览

When Edge Meets Learning: Adaptive Control for Resource-Constrained Distributed Machine Learning.

Shiqiang Wang,Tiffany Tuor,Theodoros Salonidis,Kin K. Leung,Christian Makaya,Ting He,Kevin Chan

2018

被引用568 | 浏览

LEAF: A Benchmark for Federated Settings.

Sebastian Caldas, Sai Meher Karthik Duddu,Peter Wu,Tian Li,Jakub Konečný,H. Brendan McMahan,Virginia Smith,Ameet Talwalkar

2018

被引用1481 | 浏览

Applied Federated Learning: Improving Google Keyboard Query Suggestions

Timothy Yang,Galen Andrew,Hubert Eichner,Haicheng Sun, Wei Li, Nicholas Kong,Daniel Ramage,Françoise Beaufays

2018

被引用730 | 浏览

Federated Learning with Non-IID Data

Yue Zhao,Meng Li,Liangzhen Lai,Naveen Suda,Damon Civin,Vikas Chandra

2018

被引用2970 | 浏览

Exploring Federated Learning on Battery-Powered Devices

Zichen Xu, Li,Wenting Zou

2019

被引用24 | 浏览

The Non-IID Data Quagmire of Decentralized Machine Learning

Kevin Hsieh,Amar Phanishayee,Onur Mutlu,Phillip B. Gibbons

2019

被引用699 | 浏览

Advances and Open Problems in Federated Learning

Peter Kairouz,H. Brendan McMahan,Brendan Avent,Aurélien Bellet,Mehdi Bennis,Arjun Nitin Bhagoji,Keith Bonawitz,Zachary Charles,Graham Cormode,Rachel Cummings,Rafael G. L. D'Oliveira,Salim El Rouayheb,

2019

被引用6405 | 浏览

FedVision: an Online Visual Object Detection Platform Powered by Federated Learning.

Yang Liu,Anbu Huang,Yun Luo,He Huang,Youzhi Liu,Yuanyuan Chen,Lican Feng,Tianjian Chen,Han Yu,Qiang Yang

2020

被引用373 | 浏览

TiFL: A Tier-based Federated Learning System.

Zheng Chai,Ahsan Ali,Syed Zawad, Stacey Treux,Ali Anwar,Nathalie Barcaldo,Yi Zhou,Heiko Ludwig,Feng Yan,Yue Cheng

2020

被引用314 | 浏览

Experience-Driven Computational Resource Allocation of Federated Learning by Deep Reinforcement Learning

Yufeng Zhan,Peng Li,Song Guo

2020

被引用135 | 浏览

Oort: Efficient Federated Learning Via Guided Participant Selection

Fan Lai,Xiangfeng Zhu,Harsha Madhyastha,Mosharaf Chowdhury

2021

被引用502 | 浏览

Poster: Exploiting Data Heterogeneity for Performance and Reliability in Federated Learning

Yuanli Wang,Dhruv Kumar,Abhishek Chandra

2020

被引用2 | 浏览

GradientBased Learning Applied to Document Recognition

S. Haykin,Bart Kosko

2009

被引用27367 | 浏览

Data Disclaimer

The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn

Chat Paper

【要点】：该论文提出了一种基于设备相似性的联邦学习训练加速方法，通过分析数据异质性对设备选择、模型收敛性、准确性和容错能力的影响，提出了一种新的设备选择策略，以提高模型训练的效率而不牺牲准确性。

【方法】：提出三种识别具有相似数据分布的设备组的方法，并对每种方法的隐私保护、带宽消耗和计算开销进行了详细的权衡分析。

【实验】：初步实验结果表明，与现有方法相比，所提出的方法可以减少46%至58%的训练时间，同时达到相同的准确性。实验使用的数据集未在摘要中明确提及。

去 AI 文献库对话