A novel algorithm for automatic transcription of polyphonic piano music is proposed. It is based on a processing scheme that comprises the following subtasks: segmentation of notes in the time domain, estimation of frequency components based on the structure of the time segments, extraction of the pitches of the underlying notes, and tracking of notes to obtain the final music score. A combination of multiresolution techniques, such as the multiresolution Fourier transform and a maximum likelihood frequency estimator, makes it possible to cope with the problems of constant time-frequency resolution and frequency masking. The algorithm performs better than existing commercial software.
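To illustrate why a constant time-frequency resolution is a problem for piano transcription, the following minimal sketch computes, for a few piano notes, the shortest analysis window whose DFT bin spacing still separates the note from its upper semitone neighbour. It is not the algorithm of the paper. The function names, the 44.1 kHz sample rate, equal temperament, and the A4 = 440 Hz reference are all assumptions made for illustration.

```python
import math

# Assumptions for illustration only: equal temperament, A4 = 440 Hz,
# MIDI note numbering, and a 44.1 kHz sample rate.
A4_HZ = 440.0
A4_MIDI = 69
SAMPLE_RATE = 44100


def midi_to_hz(midi):
    """Fundamental frequency of a MIDI note number in equal temperament."""
    return A4_HZ * 2.0 ** ((midi - A4_MIDI) / 12.0)


def hz_to_midi(freq_hz):
    """Nearest MIDI note number for an estimated frequency in Hz."""
    return round(A4_MIDI + 12.0 * math.log2(freq_hz / A4_HZ))


def semitone_gap_hz(midi):
    """Distance in Hz between a note and the semitone directly above it."""
    return midi_to_hz(midi + 1) - midi_to_hz(midi)


def min_window_samples(midi, sample_rate=SAMPLE_RATE):
    """Shortest window length N whose DFT bin spacing (sample_rate / N)
    does not exceed the semitone gap at the given note."""
    return math.ceil(sample_rate / semitone_gap_hz(midi))


if __name__ == "__main__":
    # Lowest piano key, concert pitch, highest piano key.
    for name, midi in [("A0", 21), ("A4", 69), ("C8", 108)]:
        n = min_window_samples(midi)
        print(f"{name}: f0 = {midi_to_hz(midi):8.2f} Hz, "
              f"window >= {n} samples ({1000.0 * n / SAMPLE_RATE:.1f} ms)")
```

Under these assumptions, the lowest notes need windows hundreds of milliseconds long to be told apart, while the highest notes need only a few milliseconds. A single fixed window therefore either blurs note onsets or merges neighbouring low pitches, which is the trade-off that multiresolution analysis is meant to ease.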
Published in:
Image and Signal Processing and Analysis, 2005. ISPA 2005. Proceedings of the 4th International Symposium on
Date of Conference: 15-17 Sept. 2005