Notice
There is currently an issue with the citation download feature. Learn more

2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings.

2003

Go

Filter Results

Displaying Results 1 - 25 of 97
  • Object-specific figure-ground segregation

    Publication Year: 2003, Page(s):II - 39-45 vol.2
    Cited by:  Papers (28)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (924 KB) | HTML iconHTML

    We consider the problem of segmenting an image into foreground and background, with foreground containing solely objects of interest known a priori. We propose an integration model that incorporates both edge detection and object part detection results. It consists of two parallel processes: low-level pixel grouping and high-level patch grouping. We seek a solution that optimizes a joint grouping ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Object segmentation using graph cuts based active contours

    Publication Year: 2003, Page(s):II - 46-53 vol.2
    Cited by:  Papers (7)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (14119 KB) | HTML iconHTML

    In this paper we present a graph cuts based active contours (GCBAC) approach to object segmentation problems. Our method is a combination of active contours and the optimization tool of graph cuts and differs fundamentally from traditional active contours in that it uses graph cuts to iteratively deform the contour. Consequently, it has the following advantages. (1) It has the ability to jump over... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Proceedings 2003 IEEE Computer Society Conference On Computer Vision And Pattern Recognition

    Publication Year: 2003, Page(s):iii - xii
    Request permission for commercial reuse | PDF file iconPDF (327 KB)
    Freely Available from IEEE
  • Multi-modal image registration by minimizing Kullback-Leibler distance between expected and observed joint class histograms

    Publication Year: 2003, Page(s):II - 570
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (732 KB) | HTML iconHTML

    We present a new multimodal image registration method based on the a priori knowledge of the class label mappings between two segmented input images. A joint class histogram between the image pairs is estimated by assigning each bin value equal to the total number of occurrences of the corresponding class label pairs. The discrepancy between the observed and expected joint class histograms should ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 2003, Page(s):737 - 742
    Request permission for commercial reuse | PDF file iconPDF (170 KB)
    Freely Available from IEEE
  • Shape-time photography

    Publication Year: 2003, Page(s):II-151 - II-157 vol.2
    Cited by:  Papers (5)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (651 KB) | HTML iconHTML

    We introduce a new method to describe shape relationships over time in a photograph. We acquire both range and image information in a sequence of frames using a stationary stereo camera. From the pictures taken, we compute a composite image consisting of the pixels from the surfaces closest to the camera over all the time frames. Through occlusion cues, this composite reveals 3-D relationships bet... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Example-based style synthesis

    Publication Year: 2003, Page(s):II - 143-50 vol.2
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1034 KB) | HTML iconHTML

    We introduce an example-based synthesis technique that extrapolates novel styles for a given input image. The technique is based on separating the style and content of image fragments. Given an image with a new style and content, it is first adaptively partitioned into fragments. Stitching together novel fragments produces a coherent image in a new style for a given content. The aggregate of synth... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Editable dynamic textures

    Publication Year: 2003, Page(s):II - 137-42 vol.2
    Cited by:  Papers (27)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1643 KB) | HTML iconHTML

    We present a simple and efficient algorithm for modifying the temporal behavior of "dynamic textures," i.e. sequences of images that exhibit some form of temporal regularity, such as flowing water, steam, smoke, flames, foliage of trees in wind. The main goal is to design algorithms for synthesizing and editing realistic sequences of images of dynamic scenes that exhibit some form of temporal stat... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Advanced Gaussian MRF rotation-invariant texture features for classification of remote sensing imagery

    Publication Year: 2003, Page(s):II - 685-90 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (428 KB) | HTML iconHTML

    The features based on Markov random field (MRF) models are usually sensitive to the rotation of image textures. The paper develops an anisotropic circular Gaussian MRF (ACGMRF) model for modeling rotated image textures and retrieving rotation-invariant texture features. To overcome the singularity problem of the least squares estimate (LSE) method, an approximate least squares estimate (ALSE) meth... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Properties and applications of shape recipes

    Publication Year: 2003, Page(s):II - 383-90 vol.2
    Cited by:  Papers (18)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (3924 KB)

    In low-level vision, the representation of scene properties such as shape, albedo, etc., are very high dimensional as they have to describe complicated structures. The approach proposed here is to let the image itself bear as much of the representational burden as possible. In many situations, scene and image are closely related and it is possible to find a functional relationship between them. Th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Robust data clustering

    Publication Year: 2003, Page(s):II-128 - II-133 vol.2
    Cited by:  Papers (24)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (332 KB) | HTML iconHTML

    We address the problem of robust clustering by combining data partitions (forming a clustering ensemble) produced by multiple clusterings. We formulate robust clustering under an information-theoretical framework; mutual information is the underlying concept used in the definition of quantitative measures of agreement or consistency between data partitions. Robustness is assessed by variance of th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The effects of segmentation and feature choice in a translation model of object recognition

    Publication Year: 2003, Page(s):II - 675-82 vol.2
    Cited by:  Papers (20)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (399 KB) | HTML iconHTML

    We work with a model of object recognition where words must be placed on image regions. This approach means that large scale experiments are relatively easy, so we can evaluate the effects of various early and midlevel vision algorithms on recognition performance. We evaluate various image segmentation algorithms by determining word prediction accuracy for images segmented in various ways and repr... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Nearest neighbor search for relevance feedback

    Publication Year: 2003, Page(s):II - 643-8 vol.2
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (356 KB) | HTML iconHTML

    We introduce the problem of repetitive nearest neighbor search in relevance feedback and propose an efficient search scheme for high dimensional feature spaces. Relevance feedback learning is a popular scheme used in content based image and video retrieval to support high-level concept queries. The paper addresses those scenarios in which a similarity or distance matrix is updated during each iter... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Visual hull alignment and refinement across time: a 3D reconstruction algorithm combining shape-from-silhouette with stereo

    Publication Year: 2003, Page(s):II - 375-82 vol.2
    Cited by:  Papers (38)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (862 KB) | HTML iconHTML

    Visual hull (VH) construction from silhouette images is a popular method of shape estimation. The method, also known as shape-from-silhouette (SFS), is used in many applications such as non-invasive 3D model acquisition, obstacle avoidance, and more recently human motion tracking and analysis. One of the limitations of SFS, however, is that the approximated shape can be very coarse when there are ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • General c-means clustering model and its application

    Publication Year: 2003, Page(s):II-122 - II-127 vol.2
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (391 KB) | HTML iconHTML

    Many partitional clustering algorithms originated from the definition of mean.We propose a new clustering model - general c-means clustering algorithm (GCM). Generally, when the data set is clustered into c (c > 1) subsets, each subset is often expected to have a different prototype (or cluster center) than others. Therefore, we propose the definition of undesirable solution of clustering algorith... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Enhancing image and video retrieval: learning via equivalence constraints

    Publication Year: 2003, Page(s):II - 668-74 vol.2
    Cited by:  Papers (60)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (444 KB) | HTML iconHTML

    The paper is about learning using partial information in the form of equivalence constraints. Equivalence constraints provide relational information about the labels of data points, rather than the labels themselves. Our work is motivated by the observation that in many real life applications partial information about the data can be obtained with very little cost. For example, in video indexing w... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Activity recognition using the dynamics of the configuration of interacting objects

    Publication Year: 2003, Page(s):II - 633-40 vol.2
    Cited by:  Papers (55)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (518 KB) | HTML iconHTML

    Monitoring activities using video data is an important surveillance problem. A special scenario is to learn the pattern of normal activities and detect abnormal events from a very low resolution video where the moving objects are small enough to be modeled as point objects in a 2D plane. Instead of tracking each point separately, we propose to model an activity by the polygonal 'shape' of the conf... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Spacetime stereo: shape recovery for dynamic scenes

    Publication Year: 2003, Page(s):II - 367-74 vol.2
    Cited by:  Papers (99)  |  Patents (19)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1923 KB) | HTML iconHTML

    This paper extends the traditional binocular stereo problem into the spacetime domain, in which a pair of video streams is matched simultaneously instead of matching pairs of images frame by frame. Almost any existing stereo algorithm may be extended in this manner simply by replacing the image matching term with a spacetime term. By utilizing both spatial and temporal appearance variation, this m... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Practical non-parametric density estimation on a transformation group for vision

    Publication Year: 2003, Page(s):II - 114-21 vol.2
    Cited by:  Papers (6)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (394 KB) | HTML iconHTML

    It is now common practice in machine vision to define the variability in an object's appearance in a factored manner, as a combination of shape and texture transformations. In this context, we present a simple and practical method for estimating non-parametric probability densities over a group of linear shape deformations. Samples drawn from such a distribution do not lie in a Euclidean space, an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • What is the space of camera response functions?

    Publication Year: 2003, Page(s):II - 602-9 vol.2
    Cited by:  Papers (51)  |  Patents (12)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (609 KB) | HTML iconHTML

    Many vision applications require precise measurement of scene radiance. The function relating scene radiance to image brightness is called the camera response. We analyze the properties that all camera responses share. This allows us to find the constraints that any response function must satisfy. These constraints determine the theoretical space of all possible camera responses. We have collected... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Estimating 3D hand pose from a cluttered image

    Publication Year: 2003, Page(s):II - 432-9 vol.2
    Cited by:  Papers (105)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (523 KB) | HTML iconHTML

    A method is proposed that can generate a ranked list of plausible three-dimensional hand configurations that best match an input image. Hand pose estimation is formulated as an image database indexing problem, where the closest matches for an input hand image are retrieved from a large database of synthetic hand images. In contrast to previous approaches, the system can function in the presence of... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A new semi-supervised EM algorithm for image retrieval

    Publication Year: 2003, Page(s):II-662 - II-667 vol.2
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (426 KB) | HTML iconHTML

    One of the main tasks in content-based image retrieval (CBIR) is to reduce the gap between low-level visual features and high-level human concepts. This paper presents a new semi-supervised EM algorithm (NSSEM), where the image distribution in feature space is modeled as a mixture of Gaussian densities. Due to the statistical mechanism of accumulating and processing meta knowledge, the NSS-EM algo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Expectation grammars: leveraging high-level expectations for activity recognition

    Publication Year: 2003, Page(s):II-626 - II-632 vol.2
    Cited by:  Papers (23)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (333 KB) | HTML iconHTML

    Video-based recognition and prediction of a temporally extended activity can benefit from a detailed description of high-level expectations about the activity. Stochastic grammars allow for an efficient representation of such expectations and are well-suited for the specification of temporally well-ordered activities. In this paper, we extend stochastic grammars by adding event parameters, state c... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Document image enhancement using directional wavelet

    Publication Year: 2003, Page(s):II-534 - II-539 vol.2
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (926 KB) | HTML iconHTML

    This paper proposes an algorithm to clean up a large collection of historical handwritten documents kept up in the National Archives of Singapore. Due to the seepage of ink over long period of storage, the front page of each document has been severely marred by the reverse side writing. Earlier attempts have been made to match both sides of the page to identify the offending strokes originating fr... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Shadow elimination and occluder light suppression for multi-projector displays

    Publication Year: 2003, Page(s):II - 513-20 vol.2
    Cited by:  Papers (9)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (391 KB) | HTML iconHTML

    Two related problems of front projection displays, which occur when users obscure a projector, are: (i) undesirable shadows cast on the display by the users, and (ii) projected light falling on and distracting the users. This paper provides a computational framework for solving these two problems based on multiple overlapping projectors and cameras. The overlapping projectors are automatically ali... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.