By Topic

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Date 14-19 May 2006

Go

Filter Results

Displaying Results 1 - 25 of 317
  • [Title page]

    Publication Year: 2006 , Page(s): C5
    Save to Project icon | Request Permissions | PDF file iconPDF (67 KB)  
    Freely Available from IEEE
  • 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

    Publication Year: 2006 , Page(s): i
    Save to Project icon | Request Permissions | PDF file iconPDF (59 KB)  
    Freely Available from IEEE
  • Copyright page

    Publication Year: 2006 , Page(s): ii
    Save to Project icon | Request Permissions | PDF file iconPDF (54 KB)  
    Freely Available from IEEE
  • IEEE Signal Processing Society 2006 Board of Governors

    Publication Year: 2006 , Page(s): iii
    Save to Project icon | Request Permissions | PDF file iconPDF (54 KB)  
    Freely Available from IEEE
  • ICASSP 2006 Conference Committee

    Publication Year: 2006 , Page(s): iv
    Save to Project icon | Request Permissions | PDF file iconPDF (56 KB)  
    Freely Available from IEEE
  • Technical Program Committee

    Publication Year: 2006 , Page(s): v - x
    Save to Project icon | Request Permissions | PDF file iconPDF (72 KB)  
    Freely Available from IEEE
  • General Chair's Welcome

    Publication Year: 2006 , Page(s): xi
    Save to Project icon | Request Permissions | PDF file iconPDF (57 KB) |  | HTML iconHTML  
    Freely Available from IEEE
  • Future ICASSP Conferences

    Publication Year: 2006 , Page(s): xii
    Save to Project icon | Request Permissions | PDF file iconPDF (39 KB)  
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2006 , Page(s): xiii - cxxv
    Save to Project icon | Request Permissions | PDF file iconPDF (382 KB)  
    Freely Available from IEEE
  • Generative Process Tracking for Audio Analysis

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (2)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (194 KB) |  | HTML iconHTML  

    The problem of generative process tracking involves detecting and adapting to changes in the underlying generative process that creates a time series of observations. It has been widely used for visual background modelling to adaptively track the generative process that generates the pixel intensities. In this paper, we extend this idea to audio background modelling and show its applications in su... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The Importance of Sequences in Musical Similarity

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (17)  |  Patents (2)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (604 KB) |  | HTML iconHTML  

    This paper demonstrates the importance of temporal sequences for passage-level music information retrieval. A number of audio analysis problems are solved successfully by using models that throw away the temporal sequence data. This paper suggests that we do not have this luxury when we consider a more difficult problem: that is finding musically similar passages within a narrow range of musical s... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Enhancing Similarity Matrices for Music Audio Analysis

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (8)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (345 KB) |  | HTML iconHTML  

    Similarity matrices have become an important tool in music audio analysis. However, the quadratic time and space complexity as well as the intricacy of extracting the desired structural information from these matrices are often prohibitive with regard to real-world applications. In this paper, we describe an approach for enhancing the structural properties of similarity matrices based on two conce... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Extraction of High-Level Musical Structure From Audio Data and Its Application to Thumbnail Generation

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (20)  |  Patents (1)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (176 KB) |  | HTML iconHTML  

    A method for segmenting musical audio with a hierarchical timbre model is introduced. New evidence is presented to show that music segmentation can be recast as clustering of timbre features, and a new clustering algorithm is described. A prototype thumbnail-generating application is described and evaluated. Experimental results are given, including comparison of machine and human segmentations View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Audio Elements Based Auditory Scene Segmentation

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (2)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (145 KB) |  | HTML iconHTML  

    Auditory scene segmentation is an important step in the process of high-level semantic inference from audio data streams, and in particular, a prerequisite for auditory scene categorization. In this paper, we analyze the limits of previous works on auditory scene segmentation, and then propose a novel method that, conceptually, is inspired by the ideas used in text and video scene segmentation, an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Comparing Audio and Video Segmentations for Music Videos Indexing

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (3)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (271 KB) |  | HTML iconHTML  

    Music videos are good examples of multimedia documents in which the structures of the audio and video streams are highly correlated. This paper presents a system that matches these structures and extracts audio-visual correlation measures. The audio and video streams are independently segmented at two-levels: shots (sections for audio) and events. Audio segmentation is performed at the event level... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Speech Acquisition and Enhancement in a Reverberant, Cocktail-Party-Like Environment

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (6)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (404 KB) |  | HTML iconHTML  

    Developing a successful multi-microphone speech acquisition system in a reverberant, cocktail-party-like environment is a very challenging problem since both interfering sources and reverberation need to be well controlled. In this paper, we propose an algorithm based on blind SIMO identification. We first blindly identify the channels from the interfering sources to all the microphones. Then we e... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Proportionate Frequency Domain Adaptive Algorithms for Blind Channel Identification

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (10)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (447 KB) |  | HTML iconHTML  

    We present fast-converging adaptive blind channel identification algorithms for acoustic room impulse responses. These new algorithms exploit the fast-convergence of the improved proportionate normalized least-mean-square (IPNLMS) algorithm and address the problem of delay inherent in frequency domain algorithms by employing the multi-delay filter (MDF) structure. Simulation results for both speec... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Doa Estimation for Multiple Sparse Sources with Normalized Observation Vector Clustering

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (16)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (282 KB) |  | HTML iconHTML  

    This paper presents a new method for estimating the direction of arrival (DOA) of source signals whose number N can exceed the number of sensors M. Subspace based methods, e.g., the MUSIC algorithm, have been widely studied, however, they are only applicable when M > N. Another conventional independent component analysis based method allows M ges N, however, it cannot be applied when M < N. ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Post-Processing for Convolutive Blind Source Separation

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (7)  |  Patents (6)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (240 KB) |  | HTML iconHTML  

    Convolutive blind source separation (BSS) aims at separating point sources from mixtures picked up by several sensors. In real-world environments moving speakers, background noise and long reverberation are encountered which often degrade the performance of BSS algorithms. In such cases, the application of a post-filter can improve the output signal quality by suppression of residual cross-talk an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Superdirective Beamforming Robust Against Microphone Mismatch

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (2)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (166 KB) |  | HTML iconHTML  

    Fixed superdirective beamformers using small-size microphone arrays are known to be highly sensitive to errors in the assumed microphone array characteristics. This paper discusses the design of robust superdirective beamformers by taking into account the statistics of the microphone characteristics. Different design procedures are considered: applying a white noise gain constraint, trading off th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Frequency Independent Flexible Spherical Beamforming Via Rbf Fitting

    Publication Year: 2006 , Page(s): V
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (242 KB) |  | HTML iconHTML  

    We describe a new method for sound analysis using a spherical microphone array without the use of quadrature over the sphere. Quadrature based solutions are very sensitive to the placement of microphones on the sphere, needing measurements to be made at exactly the quadrature positions. We propose to use fitting with band-limited radial basis functions (RBFs) rather than quadrature. Our approach r... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Acoustic Scene Analysis Based on Power Decomposition

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (1)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (89 KB) |  | HTML iconHTML  

    A method is proposed for the analysis of acoustic scenes. The contribution of each competing source is suppressed on the basis of harmonic structure or cross-sensor correlation, in such a way that other sources may be estimated. Successive suppression of sources allows the scene to be characterized. In the limit of purely periodic sources and no noise, the method provides accurate estimates of fun... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Music Pitch Representation by Periodicity Measures Based on Combined Temporal and Spectral Representations

    Publication Year: 2006 , Page(s): V
    Cited by:  Papers (8)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (222 KB) |  | HTML iconHTML  

    Periodicity estimation of an audio signal, for applications such as pitch, multiple pitch or tempo estimation is often problematic due to the presence of multiple harmonics in the audio signal producing octave errors. While pitch models or rhythm models can be used, they remain often dedicated to a specific problem. In this paper, we propose a straightforward approach for periodicity estimation ba... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multi-Scale Frame-Based Analysis of Audio Signals for Musical Transcription Using a Dictionary of Chromatic Waveforms

    Publication Year: 2006 , Page(s): V
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (397 KB) |  | HTML iconHTML  

    A new method for analyzing audio signals in the context of musical transcription is described. It consists of a frame-based expansion of the signal over a multi-scale time-frequency dictionary with a set of logarithmic discrete frequencies. This method, based on the matching pursuit algorithm, provides the same frequency resolution as a constant-Q filter-bank, but with a better time resolution, es... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Computationally Efficient Amplitude Modulated Sinusoidal Audio Coding Using Frequency-Domain Linear Prediction

    Publication Year: 2006 , Page(s): V
    Cited by:  Patents (2)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (178 KB) |  | HTML iconHTML  

    A method for amplitude modulated sinusoidal audio coding is presented that has low complexity and low delay. This is based on a sub-band processing system, where, in each subband, the signal is modeled as an amplitude modulated sum of sinusoids. The envelopes are estimated using frequency-domain linear prediction and the prediction coefficients are quantized. As a proof of concept, we evaluate dif... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.