Skip to Main Content
Particle filter-based acoustic source tracking algorithms track (online and in real-time) the position of a sound source-a person speaking in a room-based on the current data from a distributed microphone array as well as the previously recorded data. This paper develops a multi-target tracking (MTT) methodology to allow for an unknown and time-varying number of speakers in a fully probabilistic manner and in doing so does not resort to independent modules for new target proposal or target number estimation as in previous works. The approach uses the concept of an existence grid to propose possible regions of activity before tracking is carried out with a variable dimension particle filter-which also explicitly supports the concept of a null particle, containing no target states, when no speakers are active. Examples demonstrate typical tracking performance in a number of different scenarios with simultaneously active speech sources.
Audio, Speech, and Language Processing, IEEE Transactions on (Volume:20 , Issue: 4 )
Date of Publication: May 2012