Performance Modelling of Deep Learning on Intel Many Integrated Core Architectures

Andre Viebke,Sabri Pllana,Suejb Memeti,Joanna Kolodziej

2019 International Conference on High Performance Computing & Simulation (HPCS)（2019）

引用 0|浏览1

暂无评分

摘要

Many complex problems, such as natural language processing or visual object detection, are solved using deep learning. However, efficient training of complex deep convolutional neural networks for large data sets is computationally demanding and requires parallel computing resources. In this paper, we present two parameterized performance models for estimation of execution time of training convolutional neural networks on the Intel many integrated core architecture. While for the first performance model we minimally use measurement techniques for parameter value estimation, in the second model we estimate more parameters based on measurements. We evaluate the prediction accuracy of performance models in the context of training three different convolutional neural network architectures on the Intel Xeon Phi. The achieved average performance prediction accuracy is about 15% for the first model and 11% for second model.

查看译文

关键词

Deep Learning,Convolutional Neural Network (CNN),Performance Modelling,Intel Many Integrated Core (MIC) Architecture,Intel Xeon Phi

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要