By Topic

Multimedia, IEEE Transactions on

Issue 2 • Date April 2011

Filter Results

Displaying Results 1 - 25 of 25
  • Table of contents

    Publication Year: 2011 , Page(s): C1 - C4
    Request Permissions | PDF file iconPDF (46 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Multimedia publication information

    Publication Year: 2011 , Page(s): C2
    Request Permissions | PDF file iconPDF (37 KB)  
    Freely Available from IEEE
  • Rate and Distortion Modeling of CGS Coded Scalable Video Content

    Publication Year: 2011 , Page(s): 165 - 180
    Cited by:  Papers (4)  |  Patents (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (2255 KB) |  | HTML iconHTML  

    In this paper, we derive single layer and scalable video rate and distortion models for video bitstreams encoded using the coarse grain quality scalability (CGS) feature of the scalable extension of H.264/AVC. In these models, we assume the source is Laplacian distributed and compensate for errors in the distribution assumption by linearly scaling the Laplacian parameter . Moreover, we present sim... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Geometric Invariant Audio Watermarking Based on an LCM Feature

    Publication Year: 2011 , Page(s): 181 - 190
    Cited by:  Papers (11)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (934 KB) |  | HTML iconHTML  

    The development of a geometric invariant audio watermarking scheme without degrading acoustical quality is challenging work. This paper proposes a multi-bit spread-spectrum audio watermarking scheme based on a geometric invariant log coordinate mapping (LCM) feature. The LCM feature is very robust to audio geometric distortions. The watermark is embedded in the LCM feature, but it is actually embe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Game-theoretic strategies and equilibriums in multimedia fingerprinting social networks

    Publication Year: 2011 , Page(s): 191 - 205
    Cited by:  Papers (5)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (820 KB) |  | HTML iconHTML  

    Multimedia social network is a network infrastructure in which the social network users share multimedia contents with all different purposes. Analyzing user behavior in multimedia social networks helps design more secured and efficient multimedia and networking systems. Multimedia fingerprinting protects multimedia from illegal alterations and multiuser collusion is a cost-effective attack. The c... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Unsupervised Alignment of News Video and Text Using Visual Patterns and Textual Concepts

    Publication Year: 2011 , Page(s): 206 - 215
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1388 KB) |  | HTML iconHTML  

    A brief preview of a news video can be generated by semantically aligning the textual sentences of the anchor report, summarized by the anchor, with the visual field shots. Since accurately detecting the object in a visual shot is difficult and a textual term may generally correspond to several synonyms, the alignment of an anchor sentence with a video shot remains challenging. In this study, the ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Audiovisual Discrimination Between Speech and Laughter: Why and When Visual Information Might Help

    Publication Year: 2011 , Page(s): 216 - 234
    Cited by:  Papers (8)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1713 KB) |  | HTML iconHTML  

    Past research on automatic laughter classification/detection has focused mainly on audio-based approaches. Here we present an audiovisual approach to distinguishing laughter from speech, and we show that integrating the information from audio and video channels may lead to improved performance over single-modal approaches. Both audio and visual channels consist of two streams (cues), facial expres... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multi-Core Platforms for Beamforming and Wave Field Synthesis

    Publication Year: 2011 , Page(s): 235 - 245
    Cited by:  Papers (5)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1140 KB) |  | HTML iconHTML  

    Immersive-Audio technologies are widely used to build experimental and commercial audio systems. However, most of them are based on standard PCs, which introduce performance limitations and excessive power consumption. To address these drawbacks, we explore the implementation prospectives of two Immersive-Audio technologies: the beamforming (BF) and the wave field synthesis (WFS). We target two po... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Nongeometric Distortion Smoothing Approach for Depth Map Preprocessing

    Publication Year: 2011 , Page(s): 246 - 254
    Cited by:  Papers (12)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1468 KB) |  | HTML iconHTML  

    Depth image-based rendering (DIBR) is a process that uses one 2-D color image and its associated depth map to render virtual view 3-D images. One of the main problems in DIBR is how to reduce holes that occur on the generated virtual view images. In general, preprocessing the whole depth image by smoothing filter before image warping can reduce the hole occurrence. However, smoothing the whole dep... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • MIMiC: Multimodal Interactive Motion Controller

    Publication Year: 2011 , Page(s): 255 - 265
    Multimedia
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1277 KB) |  | HTML iconHTML  

    We introduce a new algorithm for real-time interactive motion control and demonstrate its application to motion captured data, prerecorded videos, and HCI. Firstly, a data set of frames are projected into a lower dimensional space. An appearance model is learnt using a multivariate probability distribution. A novel approach to determining transition points is presented based on k-medoids, whereby ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Robust Camera Calibration and Player Tracking in Broadcast Basketball Video

    Publication Year: 2011 , Page(s): 266 - 279
    Cited by:  Papers (8)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1715 KB) |  | HTML iconHTML  

    With the growth of fandom population, a considerable amount of broadcast sports videos have been recorded, and a lot of research has focused on automatically detecting semantic events in the recorded video to develop an efficient video browsing tool for a general viewer. However, a professional sportsman or coach wonders about high level semantics in a different perspective, such as the offensive ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Optimizing Visual Search Reranking via Pairwise Learning

    Publication Year: 2011 , Page(s): 280 - 291
    Cited by:  Papers (5)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1087 KB) |  | HTML iconHTML  

    Visual search reranking is defined as reordering visual documents (images or video clips) based on the initial search results or some auxiliary knowledge to improve the search precision. Conventional approaches to visual search reranking empirically take the “classification performance” as the optimization objective, in which each visual document is determined relevant or not, follow... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Virtual Contour Guided Video Object Inpainting Using Posture Mapping and Retrieval

    Publication Year: 2011 , Page(s): 292 - 302
    Cited by:  Papers (4)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1425 KB) |  | HTML iconHTML  

    This paper presents a novel framework for object completion in a video. To complete an occluded object, our method first samples a 3-D volume of the video into directional spatio-temporal slices, and performs patch-based image inpainting to complete the partially damaged object trajectories in the 2-D slices. The completed slices are then combined to obtain a sequence of virtual contours of the da... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Survey of Audio-Based Music Classification and Annotation

    Publication Year: 2011 , Page(s): 303 - 319
    Cited by:  Papers (28)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (597 KB) |  | HTML iconHTML  

    Music information retrieval (MIR) is an emerging research area that receives growing attention from both the research community and music industry. It addresses the problem of querying and retrieving certain types of music from large music data set. Classification is a fundamental problem in MIR. Many tasks in MIR can be naturally cast in a classification setting, such as genre classification, moo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fast Visual Retrieval Using Accelerated Sequence Matching

    Publication Year: 2011 , Page(s): 320 - 329
    Cited by:  Papers (4)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1359 KB) |  | HTML iconHTML  

    We present an approach to represent, match, and index various types of visual data, with the primary goal of enabling effective and computationally efficient searches. In this approach, an image/video is represented by an ordered list of feature descriptors. Similarities between such representations are then measured by the approximate string matching technique. This approach unifies visual appear... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Learning Visual Contexts for Image Annotation From Flickr Groups

    Publication Year: 2011 , Page(s): 330 - 341
    Cited by:  Papers (12)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (979 KB) |  | HTML iconHTML  

    We present an extension of automatic image annotation that takes the context of a picture into account. Our core assumption is that users do not only provide individual images to be tagged, but group their pictures into batches (e.g., all snapshots taken over the same holiday trip), whereas the images within a batch are likely to have a common style. These batches are matched with categories learn... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Interactive Image Segmentation With Multiple Linear Reconstructions in Windows

    Publication Year: 2011 , Page(s): 342 - 352
    Cited by:  Papers (2)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1765 KB) |  | HTML iconHTML  

    This paper proposes an algorithm for interactive image segmentation. The task is formulated as a problem of graph-based transductive classification. Specifically, given an image window, the color of each pixel in it will be reconstructed linearly with those of the remaining pixels in this window. The optimal reconstruction weights will be kept unchanged to linearly reconstruct their class labels. ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Layered Multicast With Inter-Layer Network Coding for Multimedia Streaming

    Publication Year: 2011 , Page(s): 353 - 365
    Cited by:  Papers (9)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1079 KB) |  | HTML iconHTML  

    Multirate multicast is a powerful methodology of multimedia communication in heterogenous networks. A variant of multirate multicast motivated by scalable multimedia streaming is layered multicast, where the transmitted signal is presented in successive data layers. With recent advances of network coding theory, many layered multicast schemes using network coding have been proposed to improve the ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Optimal Bandwidth Assignment for Multiple-Description-Coded Video

    Publication Year: 2011 , Page(s): 366 - 375
    Cited by:  Papers (7)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (915 KB) |  | HTML iconHTML  

    In video streaming over multicast network, user bandwidth requirement is often heterogeneous possibly with orders of magnitude difference (say, from hundreds of kb/s for mobile devices to tens of Mb/s for high-definition TV). Multiple description coding (MDC) can be used to address this bandwidth heterogeneity issue. In MDC, the video source is encoded into multiple independent descriptions. A rec... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Superchunk-Based Efficient Search in P2P-VoD System

    Publication Year: 2011 , Page(s): 376 - 387
    Cited by:  Papers (5)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (770 KB) |  | HTML iconHTML  

    In this paper, we seek to provide reliable and fast content discovery in peer-to-peer (P2P) video-on-demand (VoD) system to enable user interactivity under peer dynamics. We first identify two characteristics of content discovery in P2P-VoD: real-time constraints and limited local cache. Tapping on these properties, we propose a hybrid content discovery mechanism: SUpeRchunk- based eFficient searc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Spatial Correlation-Based Image Compression Framework for Wireless Multimedia Sensor Networks

    Publication Year: 2011 , Page(s): 388 - 401
    Cited by:  Papers (10)  |  Patents (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (895 KB) |  | HTML iconHTML  

    Data redundancy caused by correlation has motivated the application of collaborative multimedia in-network processing for data filtering and compression in wireless multimedia sensor networks (WMSNs). This paper proposes an information theoretic image compression framework with an objective to maximize the overall compression of the visual information gathered in a WMSN. The novelty of this framew... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Scalable Video Multicast in Hybrid 3G/Ad-Hoc Networks

    Publication Year: 2011 , Page(s): 402 - 413
    Cited by:  Papers (23)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (1382 KB) |  | HTML iconHTML  

    Mobile video broadcasting service, or mobile TV, is expected to become a popular application for 3G wireless network operators. Most existing solutions for video Broadcast Multicast Services (BCMCS) in 3G networks employ a single transmission rate to cover all viewers. The system-wide video quality of the cell is therefore throttled by a few viewers close to the boundary, and is far from reaching ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • IEEE Transactions on Multimedia EDICS

    Publication Year: 2011 , Page(s): 414
    Request Permissions | PDF file iconPDF (16 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Multimedia Information for authors

    Publication Year: 2011 , Page(s): 415 - 416
    Request Permissions | PDF file iconPDF (46 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Multimedia society information

    Publication Year: 2011 , Page(s): C3
    Request Permissions | PDF file iconPDF (28 KB)  
    Freely Available from IEEE

Aims & Scope

The scope of the Periodical is the various aspects of research in multimedia technology and applications of multimedia.

Full Aims & Scope

Meet Our Editors

Editor-in-Chief
Chang Wen Chen
State University of New York at Buffalo