By Topic

Bayesian formulation of image patch matching using cross-correlation

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Hakan Ardo ; Centre for Mathematical Sciences, Lund University, Sweden ; Kalle Astrom

A classical solution for matching two image patches is to use the cross-correlation coefficient. This works well if there is a lot of structure within the patches, but not so well if the patches are close to uniform. This means that some patches are matched with more confidence than others. By estimating this uncertainty more weight can be put on the confident matches than those that are more uncertain. In this paper we present a system that can learn the distribution of the correlation coefficient from a video sequence of an empty scene. No manual annotation of the video is needed. Two distributions functions are learned for two different cases: i) the correlation between an estimated background image and the current frame showing that background and ii) the correlation between an estimated background image and an unrelated patch. Using these two distributions the patch matching problem is formulated as a binary classification problem and the probability of two patches matching is derived. The model depends on the signal to noise ratio. The noise level is reasonably invariant over time, while the signal level, represented by the amount of structure in the patch or it's spatial variance, has to be measured for every frame. A common application where this is useful is feature point matching between different images. Another application is background/foreground segmentation. In this paper it is shown how the theory can be used to implement a very fast background/foreground segmentation by transforming the calculations to the DCT-domain and processing a motion JPEG stream without uncompressing it. This allows the algorithm to be embedded on a 150 MHz ARM based network camera. It is also suggested to use recursive quantile estimation to estimate the background model. This gives very accurate background models even if there is a lot of foreground present during the initialisation of the model.

Published in:

Distributed Smart Cameras, 2009. ICDSC 2009. Third ACM/IEEE International Conference on

Date of Conference:

Aug. 30 2009-Sept. 2 2009