Abstract:
Automatic Instrument recognition in sound recordings is a traditional method and has been gaining a lot of attention since the last decades due to the advent of music str...Show MoreMetadata
Abstract:
Automatic Instrument recognition in sound recordings is a traditional method and has been gaining a lot of attention since the last decades due to the advent of music streaming services like Spotify, Apple Music, Deezer etc. Distinction between similar instruments like Cello and Violin, Flute and Clarinet still remains a challenging task for machines and even humans. This research paper is an effort to identify 4 similar String instruments (Acoustic Guitar, Cello, Violin and Electric Guitar) in the audio recordings. We have used 1D MLPs and 2D CNN architectures for classifying the sounds and compared the performance on different audio features. Our experiments also show that using image based transfer learning models like Inception and VGG gives better results among the aforementioned architectures.
Date of Conference: 14-16 October 2020
Date Added to IEEE Xplore: 09 December 2020
ISBN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Transfer Learning ,
- Convolutional Neural Network ,
- Audio Recordings ,
- Multilayer Perceptron ,
- Convolutional Neural Network Architecture ,
- Similar Instruments ,
- Transfer Learning Model ,
- Clarinet ,
- String Instruments ,
- Neural Network ,
- Activation Function ,
- Training Data ,
- F1 Score ,
- Fast Fourier Transform ,
- Power Spectrum ,
- ImageNet ,
- Softmax Function ,
- Increase In Accuracy ,
- Max-pooling Layer ,
- Musical Instruments ,
- Mel-frequency Cepstral Coefficients ,
- Multilayer Perceptron Network ,
- Discrete Cosine Transform ,
- Temporal Model ,
- Deep Learning Architectures ,
- Human Hearing ,
- Human Ear ,
- Frequency Bins ,
- Audio Data
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Transfer Learning ,
- Convolutional Neural Network ,
- Audio Recordings ,
- Multilayer Perceptron ,
- Convolutional Neural Network Architecture ,
- Similar Instruments ,
- Transfer Learning Model ,
- Clarinet ,
- String Instruments ,
- Neural Network ,
- Activation Function ,
- Training Data ,
- F1 Score ,
- Fast Fourier Transform ,
- Power Spectrum ,
- ImageNet ,
- Softmax Function ,
- Increase In Accuracy ,
- Max-pooling Layer ,
- Musical Instruments ,
- Mel-frequency Cepstral Coefficients ,
- Multilayer Perceptron Network ,
- Discrete Cosine Transform ,
- Temporal Model ,
- Deep Learning Architectures ,
- Human Hearing ,
- Human Ear ,
- Frequency Bins ,
- Audio Data
- Author Keywords