Loading [MathJax]/extensions/MathMenu.js
Exploring Critical Aspects of CNN-based Keyword Spotting. A PHOCNet Study | IEEE Conference Publication | IEEE Xplore

Exploring Critical Aspects of CNN-based Keyword Spotting. A PHOCNet Study


Abstract:

Deep convolutional neural networks are today the new baseline for a wide range of machine vision tasks. The problem of keyword spotting is no exception to this rule. Many...Show More

Abstract:

Deep convolutional neural networks are today the new baseline for a wide range of machine vision tasks. The problem of keyword spotting is no exception to this rule. Many successful network architectures and learning strategies have been adapted from other vision tasks to create successful keyword spotting systems. In this paper, we argue that various details concerning this adaptation could be re-examined, to the end of building stronger spotting models. In particular, we examine the usefulness of a pyramidal spatial pooling layer versus a simpler approach, and show that a zoning strategy combined with fixed-size inputs can be just as effective while less computationally expensive. We also examine the usefulness of augmentation, class balancing and ensemble learning strategies and propose an improved network. Our hypotheses are tested with numerical experiments on the IAM document collection, where the proposed network outperforms all other existing models.
Date of Conference: 24-27 April 2018
Date Added to IEEE Xplore: 25 June 2018
ISBN Information:
Conference Location: Vienna, Austria

Contact IEEE to Subscribe

References

References is not available for this document.