Conventional and periodic N-grams in the transcription of drum sequences
Paulus, J.K.; Klapuri, A.P.
Multimedia and Expo, 2003. ICME apos;03. Proceedings. 2003 International Conference on
Volume 2, Issue , 6-9 July 2003 Page(s): II - 737-40 vol.2
Digital Object Identifier 10.1109/ICME.2003.1221722
Summary: In this paper, we describe a system for transcribing polyphonic drum sequences from an acoustic signal to a symbolic representation. Low-level signal analysis is done with an acoustic model consisting of a Gaussian mixture model and a support vector machine. For higher-level modelling, periodic N-grams are proposed to construct a "language model" for music, based on the repetitive nature of musical structure. Also, a technique for estimating relatively long N-grams is introduced. The performance of N-grams in the transcription was evaluated using a database of realistic drum sequences from different genres and yielded a performance increase of 7.6 % compared to a the use of only prior (unigram) probabilities with the acoustic model.
View citation and abstract |