Visual grouping and object recognition
Malik, J.
Image Analysis and Processing, 2001. Proceedings. 11th International Conference on
Volume , Issue , 26-28 Sep 2001 Page(s):612 - 621
Digital Object Identifier 10.1109/ICIAP.2001.957078
Summary:We develop a two-stage framework for parsing and understanding
images, a process of image segmentation grouping pixels to form regions
of coherent color and texture, and a process of recognition - comparing
assemblies of such regions, hypothesized to correspond to a single
object, with views of stored prototypes. We treat segmenting images into
regions as an optimization problem: partition the image into regions
such that there is high similarity within a region and low similarity
across regions. This is formalized as the minimization of the normalized
cut between regions. Using ideas from spectral graph theory, the
minimization can be set as an eigenvalue problem. Visual attributes such
as color, texture, contour and motion are encoded in this framework by
suitable specification of graph edge weights. The recognition problem
requires us to compare assemblies of image regions with previously
stored proto-typical views of known objects. We have devised a novel
algorithm for shape matching based on a relationship descriptor called
the shape context. This enables us to compute similarity measures
between shapes which, together with similarity measures for texture and
color, can be used for object recognition. The shape matching algorithm
has yielded excellent results on a variety of different 2D and 3D
recognition problems
View citation and abstract |