2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings.


Displaying Results 1 - 25 of 97
  • Object segmentation using graph cuts based active contours

    Publication Year: 2003, Page(s):II-46 - II-53 vol.2
    Cited by:  Papers (7)  |  Patents (1)

    In this paper we present a graph cuts based active contours (GCBAC) approach to object segmentation problems. Our method is a combination of active contours and the optimization tool of graph cuts and differs fundamentally from traditional active contours in that it uses graph cuts to iteratively deform the contour. Consequently, it has the following advantages. (1) It has the ability to jump over...
  • Object-specific figure-ground segregation

    Publication Year: 2003, Page(s):II-39 - II-45 vol.2
    Cited by:  Papers (31)  |  Patents (3)

    We consider the problem of segmenting an image into foreground and background, with foreground containing solely objects of interest known a priori. We propose an integration model that incorporates both edge detection and object part detection results. It consists of two parallel processes: low-level pixel grouping and high-level patch grouping. We seek a solution that optimizes a joint grouping ...
  • Proceedings 2003 IEEE Computer Society Conference On Computer Vision And Pattern Recognition

    Publication Year: 2003, Page(s):iii - xii
    Freely Available from IEEE
  • Multi-modal image registration by minimizing Kullback-Leibler distance between expected and observed joint class histograms

    Publication Year: 2003, Page(s):II - 570
    Cited by:  Papers (2)

    We present a new multimodal image registration method based on the a priori knowledge of the class label mappings between two segmented input images. A joint class histogram between the image pairs is estimated by assigning each bin value equal to the total number of occurrences of the corresponding class label pairs. The discrepancy between the observed and expected joint class histograms should ...
  • Author index

    Publication Year: 2003, Page(s):737 - 742
    Freely Available from IEEE
  • View invariants for human action recognition

    Publication Year: 2003, Page(s):II-613 - II-619 vol.2
    Cited by:  Papers (37)

    This paper presents two approaches for the representation and recognition of human action in video, aiming for view-point invariance. The paper first presents new results using a 2D approach presented earlier. Inherent limitations of the 2D approach are discussed and a new 3D approach that builds on recent work on 3D model-based invariants, is presented. Each action is represented as a unique curv...
  • On the fundamental performance for fingerprint matching

    Publication Year: 2003, Page(s):II-499 - II-504 vol.2
    Cited by:  Papers (2)

    Fingerprints have long been used for person authentication. However, there is not enough scientific research to explain the probability that two fingerprints, which are impressions of different fingers, may be taken as the same one. In this paper, we propose a formal framework to estimate the fundamental algorithm independent error rate of fingerprint matching. Unlike a previous work, which assume...
  • What is the space of camera response functions?

    Publication Year: 2003, Page(s):II-602 - II-609 vol.2
    Cited by:  Papers (52)  |  Patents (12)

    Many vision applications require precise measurement of scene radiance. The function relating scene radiance to image brightness is called the camera response. We analyze the properties that all camera responses share. This allows us to find the constraints that any response function must satisfy. These constraints determine the theoretical space of all possible camera responses. We have collected...
  • Image hallucination with primal sketch priors

    Publication Year: 2003, Page(s):II-729 - II-736 vol.2
    Cited by:  Papers (71)  |  Patents (3)

    We propose a Bayesian approach to image hallucination. Given a generic low resolution image, we hallucinate a high resolution image using a set of training images. Our work is inspired by recent progress on natural image statistics that the priors of image primitives can be well represented by examples. Specifically, primal sketch priors (e.g., edges, ridges and corners) are constructed and used t...
  • A Bayesian framework for fusing multiple word knowledge models in videotext recognition

    Publication Year: 2003, Page(s):II-528 - II-533 vol.2
    Cited by:  Papers (4)  |  Patents (2)

    Videotext recognition is challenging due to low resolution, diverse fonts/styles, and cluttered background. Past methods enhanced recognition by using multiple frame averaging, image interpolation and lexicon correction, but recognition using multi-modality language models has not been explored. In this paper, we present a formal Bayesian framework for videotext recognition by combining multiple k...
  • A novel model for orientation field of fingerprints

    Publication Year: 2003, Page(s):II-493 - II-498 vol.2
    Cited by:  Papers (5)

    As a global feature of the fingerprint, the orientation field is very important to an automatic fingerprint identification system (AFIS). Establishing an accurate and precise model for the orientation field will not only improve the performance of orientation estimation, but will also make it feasible to apply orientation information into the matching process. Such a novel model for the orientatio...
  • A sparse texture representation using affine-invariant regions

    Publication Year: 2003, Page(s):II-319 - II-324 vol.2
    Cited by:  Papers (44)  |  Patents (15)

    This paper introduces a texture representation suitable for recognizing images of textured surfaces under a wide range of transformations, including viewpoint changes and nonrigid deformations. At the feature extraction stage, a sparse set of affine-invariant local patches is extracted from the image. This spatial selection process permits the computation of characteristic scale and neighborhood s...
  • A perspective on distortions

    Publication Year: 2003, Page(s):II-594 - II-601 vol.2
    Cited by:  Papers (13)

    A framework for analyzing distortions in non-single viewpoint imaging systems is presented. Such systems possess loci of viewpoints called caustics. In general, perspective (or undistorted) views cannot be computed from images acquired with such systems without knowing scene structure. Views computed without scene structure will exhibit distortions, which we call caustic distortions. We first intr...
  • Shape-time photography

    Publication Year: 2003, Page(s):II-151 - II-157 vol.2
    Cited by:  Papers (5)  |  Patents (1)

    We introduce a new method to describe shape relationships over time in a photograph. We acquire both range and image information in a sequence of frames using a stationary stereo camera. From the pictures taken, we compute a composite image consisting of the pixels from the surfaces closest to the camera over all the time frames. Through occlusion cues, this composite reveals 3-D relationships bet...
  • Object removal by exemplar-based inpainting

    Publication Year: 2003, Page(s):II-721 - II-728 vol.2
    Cited by:  Papers (225)  |  Patents (39)

    A new algorithm is proposed for removing large objects from digital images. The challenge is to fill in the hole that is left behind in a visually plausible way. In the past, this problem has been addressed by two classes of algorithms: (i) "texture synthesis" algorithms for generating large image regions from sample textures, and (ii) "inpainting" techniques for filling in small image gaps. The f...
  • Activity recognition using the dynamics of the configuration of interacting objects

    Publication Year: 2003, Page(s):II-633 - II-640 vol.2
    Cited by:  Papers (57)

    Monitoring activities using video data is an important surveillance problem. A special scenario is to learn the pattern of normal activities and detect abnormal events from a very low resolution video where the moving objects are small enough to be modeled as point objects in a 2D plane. Instead of tracking each point separately, we propose to model an activity by the polygonal 'shape' of the conf...
  • Word image matching using dynamic time warping

    Publication Year: 2003, Page(s):II-521 - II-527 vol.2
    Cited by:  Papers (169)  |  Patents (3)

    Libraries and other institutions are interested in providing access to scanned versions of their large collections of handwritten historical manuscripts on electronic media. Convenient access to a collection requires an index, which is manually created at great labor and expense. Since current handwriting recognizers do not perform well on historical documents, a technique called word spotting has...
  • Implicit meshes for modeling and reconstruction

    Publication Year: 2003, Page(s):II-483 - II-490 vol.2
    Cited by:  Papers (2)

    Explicit surfaces, such as triangulations or wireframe models, have been extensively used to represent the deformable 3D models that are used to fit 3D point and 2D silhouette data. The resulting approaches, however, suffer from the fact that fitting typically involves finding the facets that are closest to the 3D data points or most likely to be silhouette facets. This requires searching, which i...
  • Visual landmarks detection and recognition for mobile robot navigation

    Publication Year: 2003, Page(s):II-313 - II-318 vol.2
    Cited by:  Papers (13)  |  Patents (3)

    This article describes visual functions dedicated to the extraction and recognition of planar quadrangles detected from a single camera. Extraction is based on a relaxation scheme with constraints between image segments, while the characterization we propose allows recognition to be achieved from different view-points and viewing conditions. We defined and evaluated several metrics on this represe...
  • Spacetime stereo: a unifying framework for depth from triangulation

    Publication Year: 2003, Page(s):II-359 - II-366 vol.2
    Cited by:  Papers (36)  |  Patents (10)

    Depth from triangulation has traditionally been treated in a number of separate threads in the computer vision literature, with methods like stereo, laser scanning, and coded structured light considered separately. In this paper, we propose a common framework, spacetime stereo, which unifies many of these previous methods. Viewing specific techniques as special cases of this general framework lead...
  • Automated feature-based range registration of urban scenes of large scale

    Publication Year: 2003, Page(s):II-555 - II-561 vol.2
    Cited by:  Papers (46)  |  Patents (1)

    We are building a system that can automatically acquire 3D range scans and 2D images to build geometrically and photometrically correct 3D models of urban environments. A major bottleneck in the process is the automated registration of a large number of geometrically complex 3D range scans in a common frame of reference. In this paper we provide a method for the accurate and efficient registration...
  • Using many cameras as one

    Publication Year: 2003, Page(s):II-587 - II-593 vol.2
    Cited by:  Papers (75)  |  Patents (1)

    We illustrate how to consider a network of cameras as a single generalized camera in a framework proposed by Nayar (2001). We derive the discrete structure from motion equations for generalized cameras, and illustrate the corollaries to epipolar geometry. This formal mechanism allows one to use a network of cameras as if they were a single imaging device, even when they do not share a common cente...
  • Example-based style synthesis

    Publication Year: 2003, Page(s):II-143 - II-150 vol.2
    Cited by:  Papers (4)

    We introduce an example-based synthesis technique that extrapolates novel styles for a given input image. The technique is based on separating the style and content of image fragments. Given an image with a new style and content, it is first adaptively partitioned into fragments. Stitching together novel fragments produces a coherent image in a new style for a given content. The aggregate of synth...
  • Bayesian human segmentation in crowded situations

    Publication Year: 2003, Page(s):II-459 - II-466 vol.2
    Cited by:  Papers (105)  |  Patents (8)

    The problem of segmenting individual humans in crowded situations from stationary video camera sequences is exacerbated by object inter-occlusion. We pose this problem as a "model-based segmentation" problem in which human shape models are used to interpret the foreground in a Bayesian framework. The solution is obtained by using an efficient Markov chain Monte Carlo (MCMC) method that uses domain...
  • Hue fields and color curvatures: a perceptual organization approach to color image denoising

    Publication Year: 2003, Page(s):II-713 - II-720 vol.2
    Cited by:  Papers (6)

    The denoising of color images is an increasingly studied problem whose state-of-the-art solutions employ a variety of diffusion schemes. Specifying the correct diffusion is difficult, however, in part because of the subtleties of color interactions. We address this difficulty by proposing a perceptual organization approach to color denoising based on the principle of good continuation. We exploit ...