PerfEstimator: A Generic and Extensible Performance Estimator for Data Parallel DNN Training

2021 IEEE/ACM International Workshop on Cloud Intelligence (CloudIntelligence)(2021)

引用 1|浏览21
暂无评分
摘要
Understanding the performance of data parallel DNN training at large-scale is crucial for supporting efficient DNN cloud deployment as well as facilitating the design and optimization of scalable DNN systems. Existing works adopt analytical modeling, which may fall short in capturing the system behaviors resulting from the fast evolving DNN systems and constantly proposed optimizations. In this pa...
更多
查看译文
关键词
system,profiling,modeling,machine-learning,cloud-computation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要