An Experimental Perspective for Computation-Efficient Neural Networks Training.

ADVANCED COMPUTER ARCHITECTURE (2018)

Abstract
With the growing demand for computation-efficient neural networks that allow deep learning models to be deployed on inexpensive, widely used devices, many lightweight networks have been proposed, such as the MobileNet series and ShuffleNet. These computation-efficient models are designed for very limited computational budgets, e.g., 10-150 MFLOPs, and run efficiently on ARM-based devices. They also have a smaller CMR than large networks such as VGG, ResNet, and Inception. However, while they are quite efficient for inference on ARM, what about inference or training on a GPU? Unfortunately, although a compact model is fast thanks to its small size, it usually cannot fully utilize a GPU. In this paper, we present a series of extensive experiments on the training of compact models, covering training on a single host, with both GPU and CPU, and in a distributed environment. We then provide analysis of and suggestions for this training.
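As a minimal sketch (not from the paper) of the GPU-utilization claim above, the snippet below times training steps of a compact model (MobileNetV2) against a large one (ResNet-50) in PyTorch and reports images per second; the model choices, batch sizes, step counts, and input shape are all illustrative assumptions.

```python
# Illustrative benchmark: compare training throughput of a compact model
# versus a large model at two batch sizes. All settings here are assumptions
# for demonstration, not the paper's experimental protocol.
import time
import torch
import torchvision.models as models

def images_per_second(model, batch_size, steps=20, device="cuda"):
    model = model.to(device).train()
    opt = torch.optim.SGD(model.parameters(), lr=0.01)
    loss_fn = torch.nn.CrossEntropyLoss()
    # Synthetic ImageNet-shaped inputs and labels.
    x = torch.randn(batch_size, 3, 224, 224, device=device)
    y = torch.randint(0, 1000, (batch_size,), device=device)
    # Warm-up iterations so one-time CUDA setup does not skew the timing.
    for _ in range(3):
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
    torch.cuda.synchronize()
    start = time.time()
    for _ in range(steps):
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
    torch.cuda.synchronize()  # wait for queued GPU work before stopping the clock
    return steps * batch_size / (time.time() - start)

if __name__ == "__main__":
    for name, net in [("mobilenet_v2", models.mobilenet_v2()),
                      ("resnet50", models.resnet50())]:
        for bs in (32, 128):
            print(f"{name} bs={bs}: {images_per_second(net, bs):.1f} img/s")
```

If the compact model's throughput keeps climbing steeply with batch size while the large model's saturates early, that gap suggests the compact model's small kernels leave the GPU partially idle at small batches, which is the effect the abstract describes.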
Keywords
Neural networks training, Experiment, Distributed