Abstract:
Playing techniques contain distinctive information about musical expressivity and interpretation. Yet, current research in music signal analysis suffers from a scarcity o...Show MoreMetadata
Abstract:
Playing techniques contain distinctive information about musical expressivity and interpretation. Yet, current research in music signal analysis suffers from a scarcity of computational models for playing techniques, especially in the context of live performance. To address this problem, our paper develops a general framework for playing technique recognition. We propose the adaptive scattering transform, which refers to any scattering transform that includes a stage of data-driven dimensionality reduction over at least one of its wavelet variables, for representing playing techniques. Two adaptive scattering features are presented: frequency-adaptive scattering and direction-adaptive scattering. We analyse seven playing techniques: vibrato, tremolo, trill, flutter-tongue, acciaccatura, portamento, and glissando. To evaluate the proposed methodology, we create a new dataset containing full-length Chinese bamboo flute performances (CBFdataset) with expert playing technique annotations. Once trained on the proposed scattering representations, a support vector classifier achieves state-of-the-art results. We provide explanatory visualisations of scattering coefficients for each technique and verify the system over three additional datasets with various instrumental and vocal techniques: VPset, SOL, and VocalSet.
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing ( Volume: 30)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Support Vector Machine ,
- Additional Datasets ,
- Instrumental Techniques ,
- Scatter Characteristics ,
- Training Set ,
- Convolutional Neural Network ,
- Frequency Band ,
- Binary Classification ,
- Hidden Markov Model ,
- Multi-label ,
- Extent Of Variation ,
- Recognition System ,
- Scale In Sample ,
- Recognition Results ,
- Modulation Rate ,
- Frame Size ,
- Average Scale ,
- Mother Wavelet ,
- Real-world Performance ,
- Pitch Change ,
- Dominant Band ,
- Mel-frequency Cepstral Coefficients ,
- Local Invariance ,
- Harmonic Structure ,
- Complex Modulus ,
- Temporal Modulation ,
- Detection Results ,
- Morlet Wavelet ,
- Low-pass ,
- Time Axis
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Support Vector Machine ,
- Additional Datasets ,
- Instrumental Techniques ,
- Scatter Characteristics ,
- Training Set ,
- Convolutional Neural Network ,
- Frequency Band ,
- Binary Classification ,
- Hidden Markov Model ,
- Multi-label ,
- Extent Of Variation ,
- Recognition System ,
- Scale In Sample ,
- Recognition Results ,
- Modulation Rate ,
- Frame Size ,
- Average Scale ,
- Mother Wavelet ,
- Real-world Performance ,
- Pitch Change ,
- Dominant Band ,
- Mel-frequency Cepstral Coefficients ,
- Local Invariance ,
- Harmonic Structure ,
- Complex Modulus ,
- Temporal Modulation ,
- Detection Results ,
- Morlet Wavelet ,
- Low-pass ,
- Time Axis
- Author Keywords