Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems

The present paper considers an open problem of setting hyperparameters for convolutional neural networks aimed at image classification. Since selecting filter spatial extents for convolutional layers is a topical problem, it is approximately solved by accumulating statistics of the neural network pe...

Full description

Bibliographic Details
Main Author: Romanuke Vadim V.
Format: Article
Language:English
Published: Sciendo 2018-05-01
Series:Applied Computer Systems
Subjects:
Online Access:https://doi.org/10.2478/acss-2018-0007
id doaj-51ef1a2de305422ca68c128b77d1db84
record_format Article
spelling doaj-51ef1a2de305422ca68c128b77d1db842021-09-06T19:41:00ZengSciendoApplied Computer Systems2255-86912018-05-01231526210.2478/acss-2018-0007acss-2018-0007Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification ProblemsRomanuke Vadim V.0Polish Naval Academy, Gdynia, PolandThe present paper considers an open problem of setting hyperparameters for convolutional neural networks aimed at image classification. Since selecting filter spatial extents for convolutional layers is a topical problem, it is approximately solved by accumulating statistics of the neural network performance. The network architecture is taken on the basis of the MNIST database experience. The eight-layered architecture having four convolutional layers is nearly best suitable for classifying small and medium size images. Image databases are formed of grayscale images whose size range is 28 × 28 to 64 × 64 by step 2. Except for the filter spatial extents, the rest of those eight layer hyperparameters are unalterable, and they are chosen scrupulously based on rules of thumb. A sequence of possible filter spatial extents is generated for each size. Then sets of four filter spatial extents producing the best performance are extracted. The rule of this extraction that allows selecting the best filter spatial extents is formalized with two conditions. Mainly, difference between maximal and minimal extents must be as minimal as possible. No unit filter spatial extent is recommended. The secondary condition is that the filter spatial extents should constitute a non-increasing set. Validation on MNIST and CIFAR- 10 databases justifies such a solution, which can be extended for building convolutional neural network classifiers of colour and larger images.https://doi.org/10.2478/acss-2018-0007convolutional layerconvolutional neural networksfiltershyperparametersnetwork architecturesquare spatial extents of filters
collection DOAJ
language English
format Article
sources DOAJ
author Romanuke Vadim V.
spellingShingle Romanuke Vadim V.
Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems
Applied Computer Systems
convolutional layer
convolutional neural networks
filters
hyperparameters
network architecture
square spatial extents of filters
author_facet Romanuke Vadim V.
author_sort Romanuke Vadim V.
title Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems
title_short Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems
title_full Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems
title_fullStr Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems
title_full_unstemmed Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems
title_sort smooth non-increasing square spatial extents of filters in convolutional layers of cnns for image classification problems
publisher Sciendo
series Applied Computer Systems
issn 2255-8691
publishDate 2018-05-01
description The present paper considers an open problem of setting hyperparameters for convolutional neural networks aimed at image classification. Since selecting filter spatial extents for convolutional layers is a topical problem, it is approximately solved by accumulating statistics of the neural network performance. The network architecture is taken on the basis of the MNIST database experience. The eight-layered architecture having four convolutional layers is nearly best suitable for classifying small and medium size images. Image databases are formed of grayscale images whose size range is 28 × 28 to 64 × 64 by step 2. Except for the filter spatial extents, the rest of those eight layer hyperparameters are unalterable, and they are chosen scrupulously based on rules of thumb. A sequence of possible filter spatial extents is generated for each size. Then sets of four filter spatial extents producing the best performance are extracted. The rule of this extraction that allows selecting the best filter spatial extents is formalized with two conditions. Mainly, difference between maximal and minimal extents must be as minimal as possible. No unit filter spatial extent is recommended. The secondary condition is that the filter spatial extents should constitute a non-increasing set. Validation on MNIST and CIFAR- 10 databases justifies such a solution, which can be extended for building convolutional neural network classifiers of colour and larger images.
topic convolutional layer
convolutional neural networks
filters
hyperparameters
network architecture
square spatial extents of filters
url https://doi.org/10.2478/acss-2018-0007
work_keys_str_mv AT romanukevadimv smoothnonincreasingsquarespatialextentsoffiltersinconvolutionallayersofcnnsforimageclassificationproblems
_version_ 1717767207841169408