Skip to Main Content
The bag of visual words representation has attracted a lot of attention in the computer vision community. In particular, Probabilistic Latent Semantic Analysis (PLSA) has been applied to object recognition as an unsupervised technique built on top of the bag of visual words representation. PLSA, however, does not explicitly consider the spatial information of the visual words. In this paper, we propose an iterative technique, where a modified form of PLSA provides location and scale estimates of the foreground object through the estimated latent semantic. In return, the updated location and scale estimates will improve the estimate of the latent semantic. We call this iterative algorithm Semantic-Shift. We show results with significant improvements over PLSA.