By Topic

An Information-Geometric Approach to Real-Time Audio Segmentation

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

The purchase and pricing options are temporarily unavailable. Please try again later.
2 Author(s)
Arnaud Dessein ; MuTant Project-Team (INRIA) hosted by the Music Representations Team, UMR STMS 9912 (IRCAM, CNRS, UPMC), Paris, France ; Arshia Cont

We present a generic approach to real-time audio segmentation in the framework of information geometry for exponential families. The proposed system detects changes by monitoring the information rate of the signals as they arrive in time. We also address shortcomings of traditional cumulative sum approaches to change detection, which assume known parameters before change. This is done by considering exact generalized likelihood ratio test statistics, with a complete estimation of the unknown parameters in the respective hypotheses. We derive an efficient sequential scheme to compute these statistics through convex duality. We finally provide results for speech segmentation in speakers, and polyphonic music segmentation in note slices.

Published in:

IEEE Signal Processing Letters  (Volume:20 ,  Issue: 4 )