By Topic

2012 IEEE International Symposium on Multimedia

10-12 Dec. 2012

Filter Results

Displaying Results 1 - 25 of 107
  • [Front cover]

    Publication Year: 2012, Page(s): C4
    Request permission for commercial reuse | PDF file iconPDF (2696 KB)
    Freely Available from IEEE
  • [Title page i]

    Publication Year: 2012, Page(s): i
    Request permission for commercial reuse | PDF file iconPDF (18 KB)
    Freely Available from IEEE
  • [Title page iii]

    Publication Year: 2012, Page(s): iii
    Request permission for commercial reuse | PDF file iconPDF (142 KB)
    Freely Available from IEEE
  • [Copyright notice]

    Publication Year: 2012, Page(s): iv
    Request permission for commercial reuse | PDF file iconPDF (145 KB)
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2012, Page(s):v - xiii
    Request permission for commercial reuse | PDF file iconPDF (151 KB)
    Freely Available from IEEE
  • General Co-chairs' Foreword

    Publication Year: 2012, Page(s): xiv
    Request permission for commercial reuse | PDF file iconPDF (110 KB) | HTML iconHTML
    Freely Available from IEEE
  • Message from the Program Co-chairs

    Publication Year: 2012, Page(s): xv
    Request permission for commercial reuse | PDF file iconPDF (109 KB) | HTML iconHTML
    Freely Available from IEEE
  • Message from the Workshop Chairs

    Publication Year: 2012, Page(s): xvi
    Request permission for commercial reuse | PDF file iconPDF (114 KB) | HTML iconHTML
    Freely Available from IEEE
  • Organization

    Publication Year: 2012, Page(s):xvii - xviii
    Request permission for commercial reuse | PDF file iconPDF (118 KB)
    Freely Available from IEEE
  • Program Committee

    Publication Year: 2012, Page(s):xix - xxiii
    Request permission for commercial reuse | PDF file iconPDF (90 KB)
    Freely Available from IEEE
  • Reviewers

    Publication Year: 2012, Page(s): xxiv
    Request permission for commercial reuse | PDF file iconPDF (95 KB)
    Freely Available from IEEE
  • Multimodal Information Fusion of Audio Emotion Recognition Based on Kernel Entropy Component Analysis

    Publication Year: 2012, Page(s):1 - 8
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (868 KB) | HTML iconHTML

    This paper focuses on the application of novel information theoretic tools in the area of information fusion. Feature transformation and fusion is critical for the performance of information fusion, however the majority of the existing works depend on the second order statistics, which is only optimal for Gaussian-like distribution. In this paper, the integration of information fusion techniques a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Recognition and Summarization of Chord Progressions and Their Application to Music Information Retrieval

    Publication Year: 2012, Page(s):9 - 16
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (397 KB) | HTML iconHTML

    Accurate and compact representation of music signals is a key component of large-scale content-based music applications such as music content management and near duplicate audio detection. This problem is not well solved yet despite many research efforts in this field. In this paper, we suggest mid-level summarization of music signals based on chord progressions. More specially, in our proposed al... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Study on Difficulty Level Recognition of Piano Sheet Music

    Publication Year: 2012, Page(s):17 - 23
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (503 KB) | HTML iconHTML

    Looking for a piano sheet music with proper difficulty for a piano learner is always an important work to his/her teacher. In the paper, we study on a new and challenging issue of recognizing the difficulty level of piano sheet music. To analyze the semantic content of music, we focus on symbolic music, i.e., sheet music or score. Specifically, difficulty level recognition is formulated as a regre... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Spectral Noise Gate Technique Applied to Birdsong Preprocessing on Embedded Unit

    Publication Year: 2012, Page(s):24 - 27
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (576 KB) | HTML iconHTML

    This paper proposes an approach for audio preprocessing and noise removal from recordings obtained in natural environments. The method is inspired in the acoustic signature of the audio, and aims to preprocess the recordings of bird songs obtained directly in the field. Using the Spectral Noise Gate technique, the undesired noise is removed on a real application in real time during the recording u... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • High Capacity Logarithmic Audio Watermarking Based on the Human Auditory System

    Publication Year: 2012, Page(s):28 - 31
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (295 KB) | HTML iconHTML

    This paper proposes a high capacity audio watermarking algorithm in the logarithm domain based on the absolute threshold of hearing (ATH) of the human auditory system (HAS) which makes this scheme a novel technique. The key idea is to divide the selected frequency band into short frames and quantize the samples based on the HAS. Apart from remarkable capacity, transparency and robustness, this sch... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Music Part Segmentation in Music TV Programs Based on Chroma Vector Analysis

    Publication Year: 2012, Page(s):32 - 35
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (877 KB) | HTML iconHTML

    This paper presents a music part detection method incorporating chroma vector analysis for use with music TV programs. Results show that envelopes of chroma components of music signals tend to have horizontal (i.e. temporal) correlation in time-frequency representation because music signals have a periodic chord sequences. Based on this fact, we analyze time series of chroma components and attempt... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Discriminative Multiple Canonical Correlation Analysis for Multi-feature Information Fusion

    Publication Year: 2012, Page(s):36 - 43
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (395 KB) | HTML iconHTML

    This paper presents a novel approach for multi-feature information fusion. The proposed method is based on the Discriminative Multiple Canonical Correlation Analysis (DMCCA), which can extract more discriminative characteristics for recognition from multi-feature information representation. It represents the different patterns among multiple subsets of features identified by minimizing the Frobeni... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • ARtifact: Tablet-Based Augmented Reality for Interactive Analysis of Cultural Artifacts

    Publication Year: 2012, Page(s):44 - 49
    Cited by:  Papers (3)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2212 KB) | HTML iconHTML

    To ensure the preservation of cultural heritage, artifacts such as paintings must be analyzed to diagnose physical frailties that could result in permanent damage. Advancements in digital imaging techniques and computer-aided analysis have greatly aided in such diagnoses but can limit the ability to work directly with the artifact in the field. This paper presents the implementation and applicatio... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Interframe Coding of Canonical Patches for Mobile Augmented Reality

    Publication Year: 2012, Page(s):50 - 57
    Cited by:  Papers (7)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1160 KB) | HTML iconHTML

    Local features are widely used for content-based image retrieval and augmented reality applications. Typically, feature descriptors are calculated from the gradients of a canonical patch around a repeatable key point in the image. In previous work, we showed that one can alternatively transmit the compressed canonical patch and perform descriptor computation at the receiving end with comparable pe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • 3D Scene Generation by Learning from Examples

    Publication Year: 2012, Page(s):58 - 64
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (762 KB) | HTML iconHTML

    Due to overwhelming use of 3D models in video games and virtual environments, there is a growing interest in 3D scene generation, scene understanding and 3D model retrieval. In this paper, we introduce a data-driven 3D scene generation approach from a Maximum Entropy (MaxEnt) model selection perspective. Using this model selection criterion, new scenes can be sampled by matching a set of contextua... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Photo-Taking Point Recommendation with Nested Clustering

    Publication Year: 2012, Page(s):65 - 68
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (491 KB) | HTML iconHTML

    In this paper, we propose a novel recommendation method for photo-taking points from a large amount of social community photo collections. There are many research activities on photo-related recommendations from a lot of photos stored and managed by photo sharing web services, such as Flickr, Picas a and Panoramio, Although some methods, such as landmark recommendation, tag recommendation and phot... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-task Learning

    Publication Year: 2012, Page(s):69 - 72
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (496 KB) | HTML iconHTML

    In recent years, several methods have been proposed to exploit image context (such as location) that provides valuable cues complementary to the image content, i.e., image annotations and geotags have been shown to assist the prediction of each other. To exploit the useful interrelatedness between these two heterogeneous prediction tasks, we propose a new correlation guided structured sparse multi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Learning Multiple Sequence-Based Kernels for Video Concept Detection

    Publication Year: 2012, Page(s):73 - 77
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (279 KB) | HTML iconHTML

    Kernel based methods are widely applied to concept and event detection in video. Recently, kernels working on sequences of feature vectors of a video segment have been proposed for this problem, rather than treating feature vectors of individual frames independently. It has been shown that these sequence-based kernels (based e.g., on the dynamic time warping or edit distance paradigms) outperform ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Commonsense Knowledge for the Collection of Ground Truth Data on Semantic Descriptors

    Publication Year: 2012, Page(s):78 - 83
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (617 KB) | HTML iconHTML

    The coverage of the semantic gap in video indexing and retrieval has gone through a continuous increase of the vocabulary of high - level features or semantic descriptors, sometimes organized in light - scale, corpus - specific, computational ontologies. This paper presents a computer - supported manual annotation method that relies on a very large scale, shared, commonsense ontologies for the sel... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.