By Topic

Sparse Modeling for Lossless Audio Compression

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Florin Ghido ; Department of Signal Processing, Tampere University of Technology, Finland ; Ioan Tabus

We investigate the problem of sparse modeling for predictive coding and introduce an efficient algorithm for computing sparse stereo linear predictors for lossless audio compression. Sparse linear predictive coding offers both improved compression and reduction of decoding complexity compared with non-sparse linear predictive coding. The modeling part amounts to finding the optimal structure of a sparse linear predictor using a fully implementable minimum description length (MDL) approach. The MDL criterion, simplified conveniently under realistic assumptions, is approximately minimized by a greedy algorithm which solves sequentially least squares partial problems, where the LDLT factorization ensures numerically stable solutions and facilitates a quasi-optimal quantization of the parameter vector. The overall compression system built around this modeling tool is shown to achieve the main goals: improved compression and, even more importantly, faster decoding speeds than the state of the art lossless audio compression methods. The optimal MDL sparse predictors are shown to provide parametric spectra that constitute new alternative spectral descriptors, capturing important regularities missed by the optimal MDL non-sparse predictors.

Published in:

IEEE Transactions on Audio, Speech, and Language Processing  (Volume:21 ,  Issue: 1 )