Performance Modelling of Deep Learning on Intel Many Integrated Core Architectures

2019 International Conference on High Performance Computing & Simulation (HPCS)(2019)

引用 0|浏览1
暂无评分
摘要
Many complex problems, such as natural language processing or visual object detection, are solved using deep learning. However, efficient training of complex deep convolutional neural networks for large data sets is computationally demanding and requires parallel computing resources. In this paper, we present two parameterized performance models for estimation of execution time of training convolutional neural networks on the Intel many integrated core architecture. While for the first performance model we minimally use measurement techniques for parameter value estimation, in the second model we estimate more parameters based on measurements. We evaluate the prediction accuracy of performance models in the context of training three different convolutional neural network architectures on the Intel Xeon Phi. The achieved average performance prediction accuracy is about 15% for the first model and 11% for second model.
更多
查看译文
关键词
Deep Learning,Convolutional Neural Network (CNN),Performance Modelling,Intel Many Integrated Core (MIC) Architecture,Intel Xeon Phi
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要