Loading [MathJax]/extensions/MathMenu.js
Bird Species Classification with Audio-Visual Data using CNN and Multiple Kernel Learning | IEEE Conference Publication | IEEE Xplore

Bird Species Classification with Audio-Visual Data using CNN and Multiple Kernel Learning


Abstract:

Recently, deep convolutional neural networks (CNN) have become a new standard in many machine learning applications not only in image but also in audio processing. Howeve...Show More

Abstract:

Recently, deep convolutional neural networks (CNN) have become a new standard in many machine learning applications not only in image but also in audio processing. However, most of the studies only explore a single type of training data. In this paper, we present a study on classifying bird species by combining deep neural features of both visual and audio data using kernel-based fusion method. Specifically, we extract deep neural features based on the activation values of an inner layer of CNN. We combine these features by multiple kernel learning (MKL) to perform the final classification. In the experiment, we train and evaluate our method on a CUB-200-2011 standard data set combined with our originally collected audio data set with respect to 200 bird species (classes). The experimental results indicate that our CNN+MKL method which utilizes the combination of both categories of data outperforms single-modality methods, some simple kernel combination methods, and the conventional early fusion method.
Date of Conference: 02-04 October 2019
Date Added to IEEE Xplore: 05 December 2019
ISBN Information:

ISSN Information:

Conference Location: Kyoto, Japan

Contact IEEE to Subscribe

References

References is not available for this document.