Understanding Data Augmentation for Classification: When to Warp? | IEEE Conference Publication | IEEE Xplore

Understanding Data Augmentation for Classification: When to Warp?


Abstract:

In this paper we investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier. Two approaches for creating ad...Show More

Abstract:

In this paper we investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier. Two approaches for creating additional training samples are data warping, which generates additional samples through transformations applied in the data-space, and synthetic over-sampling, which creates additional samples in feature-space. We experimentally evaluate the benefits of data augmentation for a convolutional backpropagation-trained neural network, a convolutional support vector machine and a convolutional extreme learning machine classifier, using the standard MNIST handwritten digit dataset. We found that while it is possible to perform generic augmentation in feature-space, if plausible transforms for the data are known then augmentation in data-space provides a greater benefit for improving performance and reducing overfitting.
Date of Conference: 30 November 2016 - 02 December 2016
Date Added to IEEE Xplore: 26 December 2016
ISBN Information:
Conference Location: Gold Coast, QLD, Australia

Contact IEEE to Subscribe

References

References is not available for this document.