Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification

Validation data are often used to evaluate the performance of a trained neural network and used in the selection of a network deemed optimal for the task at-hand. Optimality is commonly assessed with a measure, such as overall classification accuracy. The latter is often calculated directly from a c...

Full description

Bibliographic Details
Main Author: Giles M. Foody
Format: Article
Language:English
Published: MDPI AG 2017-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/7/9/888
id doaj-0646651ea5cf4e769610409a9742c47f
record_format Article
spelling doaj-0646651ea5cf4e769610409a9742c47f2020-11-24T23:08:34ZengMDPI AGApplied Sciences2076-34172017-08-017988810.3390/app7090888app7090888Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network ClassificationGiles M. Foody0School of Geography, University of Nottingham, University Park, Nottingham NG7 2RD, UKValidation data are often used to evaluate the performance of a trained neural network and used in the selection of a network deemed optimal for the task at-hand. Optimality is commonly assessed with a measure, such as overall classification accuracy. The latter is often calculated directly from a confusion matrix showing the counts of cases in the validation set with particular labelling properties. The sample design used to form the validation set can, however, influence the estimated magnitude of the accuracy. Commonly, the validation set is formed with a stratified sample to give balanced classes, but also via random sampling, which reflects class abundance. It is suggested that if the ultimate aim is to accurately classify a dataset in which the classes do vary in abundance, a validation set formed via random, rather than stratified, sampling is preferred. This is illustrated with the classification of simulated and remotely-sensed datasets. With both datasets, statistically significant differences in the accuracy with which the data could be classified arose from the use of validation sets formed via random and stratified sampling (z = 2.7 and 1.9 for the simulated and real datasets respectively, for both p < 0.05%). The accuracy of the classifications that used a stratified sample in validation were smaller, a result of cases of an abundant class being commissioned into a rarer class. Simple means to address the issue are suggested.https://www.mdpi.com/2076-3417/7/9/888cross-validationmulti-layer perceptronremote sensingclassification errorsample designmachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Giles M. Foody
spellingShingle Giles M. Foody
Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
Applied Sciences
cross-validation
multi-layer perceptron
remote sensing
classification error
sample design
machine learning
author_facet Giles M. Foody
author_sort Giles M. Foody
title Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
title_short Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
title_full Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
title_fullStr Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
title_full_unstemmed Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
title_sort impacts of sample design for validation data on the accuracy of feedforward neural network classification
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2017-08-01
description Validation data are often used to evaluate the performance of a trained neural network and used in the selection of a network deemed optimal for the task at-hand. Optimality is commonly assessed with a measure, such as overall classification accuracy. The latter is often calculated directly from a confusion matrix showing the counts of cases in the validation set with particular labelling properties. The sample design used to form the validation set can, however, influence the estimated magnitude of the accuracy. Commonly, the validation set is formed with a stratified sample to give balanced classes, but also via random sampling, which reflects class abundance. It is suggested that if the ultimate aim is to accurately classify a dataset in which the classes do vary in abundance, a validation set formed via random, rather than stratified, sampling is preferred. This is illustrated with the classification of simulated and remotely-sensed datasets. With both datasets, statistically significant differences in the accuracy with which the data could be classified arose from the use of validation sets formed via random and stratified sampling (z = 2.7 and 1.9 for the simulated and real datasets respectively, for both p < 0.05%). The accuracy of the classifications that used a stratified sample in validation were smaller, a result of cases of an abundant class being commissioned into a rarer class. Simple means to address the issue are suggested.
topic cross-validation
multi-layer perceptron
remote sensing
classification error
sample design
machine learning
url https://www.mdpi.com/2076-3417/7/9/888
work_keys_str_mv AT gilesmfoody impactsofsampledesignforvalidationdataontheaccuracyoffeedforwardneuralnetworkclassification
_version_ 1725613551956000768