
Constraints on Hyper-parameters in Deep Learning Convolutional Neural Networks

International Journal of Advanced Computer Science and Applications (2022)

Abstract
Convolutional Neural Networks (CNNs), a type of Deep Learning model, have far more hyper-parameters than Artificial Neural Networks (ANNs), which makes training a CNN more demanding. Hyper-parameter optimization is difficult in CNNs because of the huge search space spanned by hyper-parameters such as the number of layers, the number of neurons, the number of kernels, stride, padding, row or column truncation, the parameters of the backpropagation algorithm, etc. Moreover, most existing techniques in the literature select these parameters by ad hoc practice developed for specific datasets. In this work, we empirically show that CNN performance is linked not only to choosing the right hyper-parameters but also to how they are implemented. Specifically, performance also depends on how an implementation behaves when a CNN operation is configured with hyper-parameters that do not symmetrically fit the input volume. We examine two implementations: cropping the input volume to fit, and padding it. Our analysis shows that padding outperforms cropping in prediction accuracy (85.58% versus 82.62%) while also requiring less training time (8 minutes less).
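The crop-versus-pad choice the abstract compares can be illustrated with a minimal sketch. The snippet below is hypothetical (the function fit_input and its 1-D formulation are illustrative assumptions, not the authors' code); it shows the two ways an implementation can reconcile an input whose size does not evenly fit the kernel size and stride: truncate the trailing samples, or zero-pad up to the next valid size.

```python
import numpy as np

def fit_input(x, kernel, stride, mode="pad"):
    """Make a 1-D input compatible with a kernel/stride pair.

    Hypothetical helper sketching the two implementation choices
    the paper compares; not the authors' actual code.
    """
    # Samples left over after the last full sliding window.
    leftover = (len(x) - kernel) % stride
    if leftover == 0:
        return x  # already fits symmetrically
    if mode == "crop":
        # Drop the trailing samples the last window cannot cover.
        return x[:len(x) - leftover]
    # Zero-pad so one additional window fits exactly.
    return np.pad(x, (0, stride - leftover))

x = np.arange(10)                      # length 10, kernel 3, stride 2
print(len(fit_input(x, 3, 2, "crop")))  # 9  -> 4 windows
print(len(fit_input(x, 3, 2, "pad")))   # 11 -> 5 windows
```

Cropping discards border information while padding preserves it at the cost of a slightly larger input, which is consistent with the paper's finding that padding yields higher accuracy.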
Key words
Neural networks, convolution, pooling, hyper-parameters, CNN, deep learning, zero-padding, stride, back-propagation