By Topic

2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings.

2003

Go

Filter Results

Displaying Results 1 - 25 of 97
  • Object segmentation using graph cuts based active contours

    Publication Year: 2003, Page(s):II - 46-53 vol.2
    Cited by:  Papers (7)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (14119 KB) | HTML iconHTML

    In this paper we present a graph cuts based active contours (GCBAC) approach to object segmentation problems. Our method is a combination of active contours and the optimization tool of graph cuts and differs fundamentally from traditional active contours in that it uses graph cuts to iteratively deform the contour. Consequently, it has the following advantages. (1) It has the ability to jump over... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Object-specific figure-ground segregation

    Publication Year: 2003, Page(s):II - 39-45 vol.2
    Cited by:  Papers (28)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (924 KB) | HTML iconHTML

    We consider the problem of segmenting an image into foreground and background, with foreground containing solely objects of interest known a priori. We propose an integration model that incorporates both edge detection and object part detection results. It consists of two parallel processes: low-level pixel grouping and high-level patch grouping. We seek a solution that optimizes a joint grouping ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Proceedings 2003 IEEE Computer Society Conference On Computer Vision And Pattern Recognition

    Publication Year: 2003, Page(s):iii - xii
    Request permission for commercial reuse | PDF file iconPDF (327 KB)
    Freely Available from IEEE
  • Multi-modal image registration by minimizing Kullback-Leibler distance between expected and observed joint class histograms

    Publication Year: 2003, Page(s):II - 570
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (732 KB) | HTML iconHTML

    We present a new multimodal image registration method based on the a priori knowledge of the class label mappings between two segmented input images. A joint class histogram between the image pairs is estimated by assigning each bin value equal to the total number of occurrences of the corresponding class label pairs. The discrepancy between the observed and expected joint class histograms should ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 2003, Page(s):737 - 742
    Request permission for commercial reuse | PDF file iconPDF (170 KB)
    Freely Available from IEEE
  • Word image matching using dynamic time warping

    Publication Year: 2003, Page(s):II-521 - II-527 vol.2
    Cited by:  Papers (149)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (448 KB) | HTML iconHTML

    Libraries and other institutions are interested in providing access to scanned versions of their large collections of handwritten historical manuscripts on electronic media. Convenient access to a collection requires an index, which is manually created at great labor and expense. Since current handwriting recognizers do not perform well on historical documents, a technique called word spotting has... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fusing online and offline information for stable 3D tracking in real-time

    Publication Year: 2003, Page(s):II - 241-8 vol.2
    Cited by:  Papers (24)  |  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (951 KB) | HTML iconHTML

    We propose an efficient online real-time solution for single-camera 3D tracking of rigid objects that can handle large camera displacements, drastic aspect changes, and partial occlusions. While the offline camera registration problem can be considered as essentially solved, robust online tracking remains an open issue because many real-time algorithms described in the literature still lack robust... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Video content annotation using visual analysis and a large semantic knowledgebase

    Publication Year: 2003, Page(s):II-327 - II-334 vol.2
    Cited by:  Papers (14)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (787 KB) | HTML iconHTML

    We present a novel approach to automatically annotating broadcast video. To manage the enormous variety of objects, events and scenes in video problem domains such as news video, we couple generic image analysis with a semantic database, WordNet, containing huge amounts of real-world information. Object and event recognition are performed by searching WordNet for concepts jointly supported by imag... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Hue fields and color curvatures: a perceptual organization approach to color image denoising

    Publication Year: 2003, Page(s):II-713 - II-720 vol.2
    Cited by:  Papers (6)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1437 KB) | HTML iconHTML

    The denoising of color images is an increasingly studied problem whose state-of-the-art solutions employ a variety of diffusion schemes. Specifying the correct diffusion is difficult, however, in part because of the subtleties of color interactions. We address this difficulty by proposing a perceptual organization approach to color denoising based on the principle of good continuation. We exploit ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Face relighting with radiance environment maps

    Publication Year: 2003, Page(s):II - 158-65 vol.2
    Cited by:  Papers (17)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (679 KB) | HTML iconHTML

    A radiance environment map pre-integrates a constant surface reflectance with the lighting environment. It has been used to generate photo-realistic rendering at interactive speed. However, one of its limitations is that each radiance environment map can only render the object, which has the same surface reflectance as what it integrates. We present a ratio-image based technique to use a radiance ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Shadow elimination and occluder light suppression for multi-projector displays

    Publication Year: 2003, Page(s):II - 513-20 vol.2
    Cited by:  Papers (9)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (391 KB) | HTML iconHTML

    Two related problems of front projection displays, which occur when users obscure a projector, are: (i) undesirable shadows cast on the display by the users, and (ii) projected light falling on and distracting the users. This paper provides a computational framework for solving these two problems based on multiple overlapping projectors and cameras. The overlapping projectors are automatically ali... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Practical non-parametric density estimation on a transformation group for vision

    Publication Year: 2003, Page(s):II - 114-21 vol.2
    Cited by:  Papers (6)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (394 KB) | HTML iconHTML

    It is now common practice in machine vision to define the variability in an object's appearance in a factored manner, as a combination of shape and texture transformations. In this context, we present a simple and practical method for estimating non-parametric probability densities over a group of linear shape deformations. Samples drawn from such a distribution do not lie in a Euclidean space, an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Automated feature-based range registration of urban scenes of large scale

    Publication Year: 2003, Page(s):II-555 - Ii-561 vol.2
    Cited by:  Papers (40)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (842 KB) | HTML iconHTML

    We are building a system that can automatically acquire 3D range scans and 2D images to build geometrically and photometrically correct 3D models of urban environments. A major bottleneck in the process is the automated registration of a large number of geometrically complex 3D range scans in a common frame of reference. In this paper we provide a method for the accurate and efficient registration... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Mean-shift blob tracking through scale space

    Publication Year: 2003, Page(s):II - 234-40 vol.2
    Cited by:  Papers (312)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (473 KB) | HTML iconHTML

    The mean-shift algorithm is an efficient technique for tracking 2D blobs through an image. Although the scale of the mean-shift kernel is a crucial parameter, there is presently no clean mechanism for choosing or updating scale while tracking blobs that are changing in size. We adapt Lindeberg's (1998) theory of feature scale selection based on local maxima of differential scale-space filters to t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Object class recognition by unsupervised scale-invariant learning

    Publication Year: 2003, Page(s):II-264 - II-271 vol.2
    Cited by:  Papers (760)  |  Patents (71)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2336 KB) | HTML iconHTML

    We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A sparse texture representation using affine-invariant regions

    Publication Year: 2003, Page(s):II-319 - II-324 vol.2
    Cited by:  Papers (41)  |  Patents (15)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1970 KB) | HTML iconHTML

    This paper introduces a texture representation suitable for recognizing images of textured surfaces under a wide range of transformations, including viewpoint changes and nonrigid deformations. At the feature extraction stage, a sparse set of affine-invariant local patches is extracted from the image. This spatial selection process permits the computation of characteristic scale and neighborhood s... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Simultaneous structure and texture image inpainting

    Publication Year: 2003, Page(s):II - 707-12 vol.2
    Cited by:  Papers (40)  |  Patents (15)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (672 KB) | HTML iconHTML

    An algorithm for the simultaneous filling-in of texture and structure in regions of missing image information is presented. The basic idea is to first decompose the image into the sum of two functions with different basic characteristics, and then reconstruct each one of these functions separately with structure and texture filling-in algorithms. The first function used in the decomposition is of ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Shape-time photography

    Publication Year: 2003, Page(s):II-151 - II-157 vol.2
    Cited by:  Papers (5)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (651 KB) | HTML iconHTML

    We introduce a new method to describe shape relationships over time in a photograph. We acquire both range and image information in a sequence of frames using a stationary stereo camera. From the pictures taken, we compute a composite image consisting of the pixels from the surfaces closest to the camera over all the time frames. Through occlusion cues, this composite reveals 3-D relationships bet... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Robust crease detection in fingerprint images

    Publication Year: 2003, Page(s):II - 505-10 vol.2
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1991 KB) | HTML iconHTML

    We study a pattern in the fingerprint called a crease, a kind of stripe which irregularly crosses the normal fingerprint patterns (ridges and valleys). Creases will cause spurious minutiae when using a conventional feature detection algorithm, and therefore decreases the recognition rate of fingerprint identification. By representing the crease using a parameterized rectangle, we design an optimal... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Constrained subspace modeling

    Publication Year: 2003, Page(s):II - 106-13 vol.2
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (462 KB) | HTML iconHTML

    When performing subspace modeling of data using principal component analysis (PCA) it may be desirable to constrain certain directions to be more meaningful in the context of the problem being investigated. This need arises due to the data often being approximately isotropic along the lesser principal components, making the choice of directions for these components more-or-less arbitrary. Furtherm... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Practical super-resolution from dynamic video sequences

    Publication Year: 2003, Page(s):II-549 - II-554 vol.2
    Cited by:  Papers (7)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2578 KB) | HTML iconHTML

    This paper introduces a practical approach for superresolution, the process of reconstructing a high-resolution image from the low-resolution input ones. The emphasis of our work is to super-resolve frames from dynamic video sequences, which may contain significant object occlusion or scene changes. As the quality of super-resolved images highly relies on the correctness of image alignment between... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Polydioptric camera design and 3D motion estimation

    Publication Year: 2003, Page(s):II - 294-301 vol.2
    Cited by:  Papers (13)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (619 KB) | HTML iconHTML

    Most cameras used in computer vision applications are still based on the pinhole principle inspired by our own eyes. It has been found though that this is not necessarily the optimal image formation principle for processing visual information using a machine. We describe how to find the optimal camera for 3D motion estimation by analyzing the structure of the space formed by the light rays passing... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Learning object intrinsic structure for robust visual tracking

    Publication Year: 2003, Page(s):II-227 - II-233 vol.2
    Cited by:  Papers (27)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (409 KB) | HTML iconHTML

    In this paper, a novel method to learn the intrinsic object structure for robust visual tracking is proposed. The basic assumption is that the parameterized object state lies on a low dimensional manifold and can be learned from training data. Based on this assumption, firstly we derived the dimensionality reduction and density estimation algorithm for unsupervised learning of object intrinsic rep... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Range image segmentation by surface extraction using an improved robust estimator

    Publication Year: 2003, Page(s):II - 33-8 vol.2
    Cited by:  Papers (12)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (422 KB) | HTML iconHTML

    The paper presents a novel range image segmentation algorithm based on planar surface extraction. The algorithm was applied to common range image databases and was favorably compared against seven other segmentation algorithms using a popular evaluation framework. The experimental results show that, as compared to the other methods, our algorithm presents a good performance in preserving small reg... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Using multiple cues for hand tracking and model refinement

    Publication Year: 2003, Page(s):II - 443-50 vol.2
    Cited by:  Papers (24)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (991 KB) | HTML iconHTML

    We present a model based approach to the integration of multiple cues for tracking high degree of freedom articulated motions and model refinement. We then apply it to the problem of hand tracking using a single camera sequence. Hand tracking is particularly challenging because of occlusions, shading variations, and the high dimensionality of the motion. The novelty of our approach is in the combi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.