PerfEstimator: A Generic and Extensible Performance Estimator for Data Parallel DNN Training
2021 IEEE/ACM International Workshop on Cloud Intelligence (CloudIntelligence), 2021
Abstract
Understanding the performance of data parallel DNN training at large scale is crucial for supporting efficient DNN cloud deployment as well as facilitating the design and optimization of scalable DNN systems. Existing works adopt analytical modeling, which may fall short in capturing the system behaviors resulting from the fast-evolving DNN systems and constantly proposed optimizations. In this pa...
Keywords
system, profiling, modeling, machine-learning, cloud-computation