By Topic

Circuits and Systems for Video Technology, IEEE Transactions on

Issue 11 • Date Nov. 2012

Filter Results

Displaying Results 1 - 15 of 15
  • Table of contents

    Publication Year: 2012 , Page(s): C1
    Save to Project icon | Request Permissions | PDF file iconPDF (233 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Circuits and Systems for Video Technology publication information

    Publication Year: 2012 , Page(s): C2
    Save to Project icon | Request Permissions | PDF file iconPDF (140 KB)  
    Freely Available from IEEE
  • Incremental Learning in Human Action Recognition Based on Snippets

    Publication Year: 2012 , Page(s): 1529 - 1541
    Cited by:  Papers (10)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (9942 KB) |  | HTML iconHTML  

    In this paper, we present a systematic framework for recognizing human actions without relying on impractical assumptions, such as processing of an entire video or requiring a large look-ahead of frames to label an incoming video. As a secondary goal, we examine incremental learning as an overlooked obstruction to the implementation of reliable real-time recognition. Assuming weak appearance constancy, the shape of an actor is approximated by adaptively changing intensity histograms to extract pyramid histograms of oriented gradient features. As action progresses, the shape update is carried out by adjustment of a few blocks within a tracking window to closely track evolving contours. The nonlinear dynamics of an action are learned using a recursive analytic approach, which transforms training into a simple linear representation. Such a learning strategy has two advantages: 1) minimized error rates, and significant savings in computational time; and 2) elimination of the widely accepted limitations of batch-mode training for action recognition. The effectiveness of our proposed framework is corroborated by experimental validation against the state of the art. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Camera-Driven Power Management of Computer Display

    Publication Year: 2012 , Page(s): 1542 - 1553
    Cited by:  Papers (1)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (13840 KB) |  | HTML iconHTML  

    Effective display power management is essential to reduce the cost of operating desktop computers as well as prolonging the lifetime of battery-operated notebooks and laptops between consecutive battery recharges. Although displays are dedicated for our eyes, existing display power management technologies “sense” only our fingers or temperature or motion through keyboard, mouse, or special sensors. Therefore, display may either switch off when its user looks at the screen or loose energy by keeping the screen on when nobody looks at it. This paper discusses a new display power management technology and outlines its implementation in personal computer systems. Unlike presently used power management, our technology “watches” the user through a single video camera. The technology tracks the user's eyes, keeping the display active only if the user looks at the screen. Otherwise, it dims the display down or even switches it off to save energy. We implemented the technology in field programmable gate array-based hardware and present results of its experimental evaluation. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Image Reconstruction From Random Samples With Multiscale Hybrid Parametric and Nonparametric Modeling

    Publication Year: 2012 , Page(s): 1554 - 1563
    Cited by:  Papers (3)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (16807 KB) |  | HTML iconHTML  

    Statistical image modeling is of central importance to many image-processing tasks that are ill-posed in nature. Existing image models can be categorized as parametric models and nonparametric models according to the statistical techniques used. In this paper, we develop a hybrid image reconstruction (HIR) algorithm from sparse random samples using parametric and nonparametric modeling of images. More specifically, the modeling strength of the parametric and nonparametric techniques are combined within a multiscale framework. The linear autoregressive parametric model and kernel regressive nonparametric models are used to explore the interscale and intrascale dependencies of the image, respectively. The proposed HIR algorithm is capable of recovering the image from very sparse samples (e.g., 5%), and experimental results suggest that the proposed algorithm achieves noticeable improvement over some of the existing approaches in terms of both peak signal-to-noise ratio and subjective qualities of the reconstruction results. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Region-Based Rate Control for H.264/AVC for Low Bit-Rate Applications

    Publication Year: 2012 , Page(s): 1564 - 1576
    Cited by:  Papers (4)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (12425 KB) |  | HTML iconHTML  

    Rate control plays an important role in video coding. However, in the conventional rate control algorithms, the number and position of macroblocks (MBs) inside one basic unit for rate control is inflexible and predetermined. The different characteristics of the MBs are not fully considered. Also, there is no overall optimization of the coding of basic units. This paper proposes a new region-based rate control scheme for H.264/advanced video coding to improve the coding efficiency. The inter-frame information is explored to objectively divide one frame into multiple regions based on their rate-distortion (R-D) behaviors. The MBs with similar characteristics are classified into the same region, and the entire region, instead of a single MB or a group of contiguous MBs, is treated as a basic unit for rate control. A linear rate-quantization stepsize model and a linear distortion-quantization stepsize model are proposed to accurately describe the R-D characteristics for the region-based basic units. Moreover, based on the above linear models, an overall optimization model is proposed to obtain suitable quantization parameters for the region-based basic units. Experimental results demonstrate that the proposed region-based rate control approach can achieve both better subjective and objective quality by performing the rate control adaptively with the content, compared to the conventional rate control approaches. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Analysis and VLSI Implementation of EWA Rendering for Real-Time HD Video Applications

    Publication Year: 2012 , Page(s): 1577 - 1589
    Cited by:  Papers (5)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (14211 KB) |  | HTML iconHTML  

    Nonlinear image warping or image resampling is a necessary step in many current and upcoming video applications, such as video retargeting, stereoscopic 3-D mapping, and multiview synthesis. The challenges for real-time resampling include not only image quality but also available energy and computational power of the employed device. In this paper, we employ an elliptical-weighted average (EWA) rendering approach to 2-D image resampling. We extend the classical EWA framework for increased visual quality and provide a very large scale integration architecture for efficient view rendering. The resulting architecture is able to render high-quality video sequences in real time targeted for low-power applications in end-user display devices. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Structure and Motion Recovery Based on Spatial-and-Temporal-Weighted Factorization

    Publication Year: 2012 , Page(s): 1590 - 1603
    Cited by:  Papers (3)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (27180 KB) |  | HTML iconHTML  

    This paper focuses on the problem of structure and motion recovery from uncalibrated image sequences. It has been empirically proven that image measurement uncertainties can be modeled spatially and temporally by virtue of reprojection residuals. Consequently, a spatial-and-temporal-weighted factorization (STWF) algorithm is proposed to handle significant noise contained in the tracking data. This paper presents three novelties and contributions. First, the image reprojection residual of a feature point is demonstrated to be generally proportional to the error magnitude associated with the image point. Second, the error distributions are estimated from a different perspective, that of the reprojection residuals. The image errors are modeled both spatially and temporally to cope with different kinds of uncertainties. Previous studies have considered only the spatial information. Third, based on the estimated error distributions, an STWF algorithm is proposed to improve the overall accuracy and robustness of traditional approaches. Unlike existing approaches, the proposed technique does not require prior information of image measurement and is easy to implement. Extensive experiments on synthetic data and real images validate the proposed method. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A 385 MHz 13.54 K Gates Delay Balanced Two-Level CAVLC Decoder for Ultra HD H.264/AVC Video

    Publication Year: 2012 , Page(s): 1604 - 1610
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (4333 KB) |  | HTML iconHTML  

    To satisfy the heavy performance requirement in real-time high-resolution H.264/AVC, very large-scale integrated implementation of the entropy decoder is necessary since it dominates the overall decoder throughput. In this paper, we propose a high-throughput delay balanced two-level context-based adaptive variable length coding (CAVLC) decoder with 21% shorter critical path delay in comparison to the traditional two-level decoder design. Furthermore, redundant decoding processes are removed by a skipping mechanism. The proposed CAVLC decoder only needs 127.13 cycles per macroblock on average to support level 5.1 decoding with 13.54k gate counts under 90-nm CMOS technology. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Outdoor Dynamic 3-D Scene Reconstruction

    Publication Year: 2012 , Page(s): 1611 - 1622
    Cited by:  Papers (1)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (10786 KB) |  | HTML iconHTML  

    Existing systems for 3-D reconstruction from multiple view video use controlled indoor environments with uniform illumination and backgrounds to allow accurate segmentation of dynamic foreground objects. In this paper, we present a portable system for 3-D reconstruction of dynamic outdoor scenes that require relatively large capture volumes with complex backgrounds and nonuniform illumination. This is motivated by the demand for 3-D reconstruction of natural outdoor scenes to support film and broadcast production. Limitations of existing multiple view 3-D reconstruction techniques for use in outdoor scenes are identified. Outdoor 3-D scene reconstruction is performed in three stages: 1) 3-D background scene modeling using spherical stereo image capture; 2) multiple view segmentation of dynamic foreground objects by simultaneous video matting across multiple views; and 3) robust 3-D foreground reconstruction and multiple view segmentation refinement in the presence of segmentation and calibration errors. Evaluation is performed on several outdoor productions with complex dynamic scenes including people and animals. Results demonstrate that the proposed approach overcomes limitations of previous indoor multiple view reconstruction approaches enabling high-quality free-viewpoint rendering and 3-D reference models for production. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Removing Object Reflections in Videos by Global Optimization

    Publication Year: 2012 , Page(s): 1623 - 1633
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (6304 KB) |  | HTML iconHTML  

    This paper presents a novel algorithm for the removal of reflections generated by objects on reflecting floors. The algorithm uses both chromatic properties of the reflections and geometrical constraints on their positions; however, it does not make use of a model of the reflected objects, and so it can be applied to scenes containing several kinds of objects (e.g., people, baggage, animals, vehicles, etc.). The proposed method has been validated by an extensive set of experiments on a large video database. In these experiments, the method has been compared to two other recent reflection removal algorithms. The experimental results show that the proposed method is fast and effective, both in absolute terms and in comparison with the other algorithms. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Implementation of Wireless Vision Sensor Node for Characterization of Particles in Fluids

    Publication Year: 2012 , Page(s): 1634 - 1643
    Cited by:  Papers (5)
    Save to Project icon | Request Permissions | Click to expandQuick Abstract | PDF file iconPDF (3572 KB) |  | HTML iconHTML  

    Wireless vision sensor networks (WVSNs) have a number of wireless vision sensor nodes (VSNs), often spread over a large geographical area. Each node has an image capturing unit, a battery or alternative energy source, a memory unit, a light source, a wireless link, and a processing unit. The challenges associated with WVSNs include low energy consumption, low bandwidth, limited memory, and processing capabilities. In order to meet these challenges, our paper is focused on the exploration of energy-efficient reconfigurable architectures for VSN. In this paper, the design and research challenges associated with the implementation of VSN on different computational platforms, such as microcontroller, field-programmable gate arrays, and server, are explored. In relation to this, the effect on the energy consumption and the design complexity at the node, when the functionality is moved from one platform to another, are analyzed. Based on the implementation of the VSN on embedded platforms, the lifetime of the VSN is predicted using the measured energy values of the platforms for different implementation strategies. The implementation results show that an architecture, where the compressed images after pixel-based operation are transmitted, realize a WVSN system with low energy consumption. Moreover, the complex postprocessing tasks are moved to a server which has less constraints. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Open Access [advertisement]

    Publication Year: 2012 , Page(s): 1644
    Save to Project icon | Request Permissions | PDF file iconPDF (1156 KB)  
    Freely Available from IEEE
  • IEEE Circuits and Systems Society Information

    Publication Year: 2012 , Page(s): C3
    Save to Project icon | Request Permissions | PDF file iconPDF (32 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Circuits and Systems for Video Technology information for authors

    Publication Year: 2012 , Page(s): C4
    Save to Project icon | Request Permissions | PDF file iconPDF (107 KB)  
    Freely Available from IEEE

Aims & Scope

The emphasis is focused on, but not limited to:
1. Video A/D and D/ A
2. Video Compression Techniques and Signal Processing
3. Multi-Dimensional Filters and Transforms
4. High Speed Real-Tune Circuits
5. Multi-Processors Systems—Hardware and Software
6. VLSI Architecture and Implementation for Video Technology 

 

Full Aims & Scope

Meet Our Editors

Editor-in-Chief
Dan Schonfeld
Multimedia Communications Laboratory
ECE Dept. (M/C 154)
University of Illinois at Chicago (UIC)
Chicago, IL 60607-7053
tcsvt-eic@tcad.polito.it

Managing Editor
Jaqueline Zelkowitz
tcsvt@tcad.polito.it