Automatic recognition of musical patterns plays a crucial part in musicological and ethnomusicological research and can become an indispensable tool for the search and comparison of music extracts within a large multimedia database. This paper presents an efficient method for recognizing isolated musical patterns in a monophonic environment, using a novel extension of dynamic time warping, which we call context dependent dynamic time warping. Each pattern, to be recognized, is converted into a sequence of frequency jumps by means of a fundamental frequency tracking algorithm, followed by a quantizer. The resulting sequence of frequency jumps is presented to the input of the recognizer. The main characteristic of context dependent dynamic time warping is that it exploits the correlation exhibited among adjacent frequency jumps of the feature sequence. The methodology has been tested in the context of Greek traditional music, which exhibits certain characteristics that make the classification task harder, when compared with western musical tradition. A recognition rate higher than 95% was achieved.
Published in:
Speech and Audio Processing, IEEE Transactions on
(Volume:11
,
Issue:
3
)
Date of Publication: May 2003