By Topic

New Trends in Audio and Video / Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2008

Date 25-27 Sept. 2008

Filter Results

Displaying Results 1 - 25 of 44
  • [Front cover]

    Publication Year: 2008, Page(s): c1
    IEEE is not the copyright holder of this material | PDF file iconPDF (967 KB)
    Freely Available from IEEE
  • General information

    Publication Year: 2008, Page(s): 2
    Request permission for commercial reuse | PDF file iconPDF (32 KB) | HTML iconHTML
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2008, Page(s):3 - 4
    IEEE is not the copyright holder of this material | PDF file iconPDF (191 KB)
    Freely Available from IEEE
  • Parametric HMMs for movement recognition and synthesis

    Publication Year: 2008, Page(s):9 - 14
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1518 KB) | HTML iconHTML

    A common problem in human movement recognition is the recognition of movements of a particular type (semantic). E.g., grasping movements have a particular semantic (grasping) but the actual movements usually have very different appearances due to, e.g., different grasping directions. In this paper, we develop an exemplar-based parametric hidden Markov model (PHMM) that allows to represent movement... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Objects classification based on their physical sizes for detection of events in camera images

    Publication Year: 2008, Page(s):15 - 20
    Cited by:  Papers (1)
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (930 KB) | HTML iconHTML

    In the paper, a method of estimation of the physical sizes of the objects tracked in the video surveillance system, and a simple module for object classification based on the estimated physical sizes, are presented. The results of object classification are then used for automatic detection of various types of events in the camera image. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Video compression based on dictionary of Base Functions

    Publication Year: 2008, Page(s):21 - 26
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (550 KB) | HTML iconHTML

    In this paper video coder with Base Function Dictionary - BFD was presented. In classic approach prediction error signal is encoded with using block based transformation such as popular Discrete Cosine Transform - DCT. Coder proposed in this article uses BFD dictionary to encode prediction error signal. BFD was adapted to prediction error signal characteristic. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multi-view face detection and recognition under varying illumination conditions by designing an illumination effect cancelling filter

    Publication Year: 2008, Page(s):27 - 32
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (2720 KB) | HTML iconHTML

    This paper presents a novel approach for detection and recognition of multi-view faces whose location is unknown and the illumination conditions are varying. The detection of faces is accomplished after canceling the effect of the various illumination conditions by using a proposed filter. Because of the independency of the approach to skin color of face, the persons with every kind of skin colors... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Automatic detection of pancreatic islets in magnetic resonanse rat liver images

    Publication Year: 2008, Page(s):33 - 38
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (921 KB) | HTML iconHTML

    This work presents the automatic method of pancreatic islets detection in magnetic resonance rat liver images. These islets are able to produce insuline and may be transplanted into human liver. This can be a new treatment method of type-1 diabetes mellitus. Analysis of MR islets images is necessary to estimate the islets tolerance and rejection rate. Proposed technique was compared and discussed ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An FPGA-based sigma-delta audio DAC

    Publication Year: 2008, Page(s):39 - 42
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (240 KB) | HTML iconHTML

    This paper presents a sigma-delta audio digital-to-analog converter (DAC) implemented on a single field programmable gate array (FPGA) for non-commercial application. The simulations' results of the digital interpolation filter and digital sigma-delta modulator are shown. The results of an FPGA-based hardware implementation will be obtained and presented soon. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The influence of pinna flare angle on Head-Related Transfer Function

    Publication Year: 2008, Page(s):43 - 46
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (922 KB) | HTML iconHTML

    Head Related Transfer-Function (HRTF) describes the influence of torso, head and pinna on sound spectrum. The changes of sound spectrum at listener ear are one of the major cues for sound source localization. It is possible to recognize HRTF using numerical method, e.g. Boundary Elements Method (BEM). The numerical model used for calculation of HRTF is constructed by transfer geometrical shape of ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Comparative evaluation of the acoustic tube amplifier sound quality by means of listening tests

    Publication Year: 2008, Page(s):47 - 50
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (694 KB) | HTML iconHTML

    The study presents comparative analysis of basic electrical parameters and sound quality of six acoustic amplifiers: five acoustic vacuum tube amplifier and a transistor amplifier. For the measurement of basic electrical parameters, specialized computer stands and a computer psychoacoustic model have been used. Additionally, listening test has been carried out in accordance with the methods presen... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Reed function in clarinet physical model

    Publication Year: 2008, Page(s):51 - 54
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (159 KB) | HTML iconHTML

    In the paper a simplified physical model of clarinet for physical modeling synthesis is presented. Proposed system consist of two elements: sound generator and resonator. A generating element which corresponds to reed and its function in the sound production in instrument is implemented with the use of a reed function. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Selected aspects of the use of digital signal processing in the sound therapy

    Publication Year: 2008, Page(s):55 - 60
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (857 KB) | HTML iconHTML

    This paper presents a review of selected methods with use of sound in diagnosis and therapy. Methods proposed by the authors are based mainly on advanced methods of spectral analysis and use of disharmonic multitones, particularly those produced by special sound bowls. Usefulness of various DSP methods has been proven for creating and assessing the acoustic materials for the therapy. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On the use of instantaneous complex frequency for pitch and formant tracking

    Publication Year: 2008, Page(s):61 - 66
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (153 KB) | HTML iconHTML

    In the paper a method for pitch and formant tracking using instantaneous complex frequency (ICF) is presented. It includes estimating the frequencies, as well as the bandwidths of the speech formants. The formants are separated from the speech signal using complex passband Hilbert filters. The filters are simultaneously used to compute the Hilbertian signal in order to calculate the ICF of each fo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • LVCSR Speech Database - JURISDIC

    Publication Year: 2008, Page(s):67 - 72
    Cited by:  Papers (1)
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (1274 KB) | HTML iconHTML

    In the paper an overview of the Polish Speech Database for taking dictation of legal texts, created for the purpose of LVCSR system for Polish in the frame of Polish Platform for Homeland Security (PPBW) is presented. Basic information about the design of the database is provided as well as the applied method of the text corpora construction and the database structure. Fundamental details on the r... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Evaluation of existing Harmonic-to-Noise Ratio methods for voice assessment

    Publication Year: 2008, Page(s):73 - 78
    Cited by:  Papers (1)
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (909 KB) | HTML iconHTML

    In this paper, an evaluation of several methods allowing the estimation of the Harmonic-to-Noise Ratio (HNR) of sustained vowels was conducted. The HNR estimation methods are mainly based on time, spectral, and cepstral signal representations. An algorithm was implemented for each method and was tested with synthesized voice sounds in order to evaluate their accuracy. Tests were also conducted wit... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Speaker verification based on prosodic features

    Publication Year: 2008, Page(s):79 - 82
    Cited by:  Papers (1)
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (1369 KB) | HTML iconHTML

    In this paper accuracy of the speaker verification system based on prosodic features was evaluated. The efficiency of bigram models of the speaker rhythm was assessed. The results showed that in speech the rhythm carries speaker specific information. The optimal number quantization levels for of segments' duration was determined. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Silence/noise detection for speech and music signals

    Publication Year: 2008, Page(s):83 - 88
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (2360 KB) | HTML iconHTML

    This paper introduces a novel off-line algorithm for silence/noise detection in noisy signals. The main concept of the proposed algorithm is to provide noise patterns for further signals processing i.e. noise reduction for speech enhancement. The algorithm is based on frequency domain characteristics of signals. The examples of different types of noisy signals are presented. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Word endpoint detection/extraction method for speaker identification purposes

    Publication Year: 2008, Page(s):89 - 92
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (663 KB) | HTML iconHTML

    In the paper, a method for the word endpoints detection in the continuous speech is presented. The key element of the method is an adaptive finding of two thresholds to discriminate the time instants/sections in which the useful signal occurs. An algorithm for the threshold waveform computing based on the averaging of the speech signal envelope is proposed, and the envelope is found as a correlati... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Subband wavelet signal denoising for voice activity detection

    Publication Year: 2008, Page(s):93 - 96
    Cited by:  Papers (1)
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (1442 KB) | HTML iconHTML

    In this paper we propose a method for voice activity detection (VAD) in a speech signal recorded in the presence of noise. The so-called endpoint detection (EPD), i.e., detection of voice activity (speech) boundaries is very difficult if the signal is acquired in noisy environments. The proposed VAD method uses an additional stage of wavelet subband denoising. We compared this approach with other ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Research of the anisotropy of the sound bowls

    Publication Year: 2008, Page(s):97 - 102
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (2387 KB) | HTML iconHTML

    In this paper, the multimicrophone measurement position to the test of the sound bowls was presented. The results of the sound decay measurements and the sound energy spectra as a function of the position on the sound bowl circuit and the hit place into the bowl were shown. The anisotropy of the acoustic field generated by sound bowls observed by therapists and the influence of the hit place on th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Visualization of sound bowl acoustic field

    Publication Year: 2008, Page(s):103 - 108
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (1873 KB) | HTML iconHTML

    In this paper multi-microphone measurement post for testing special sound bowls is presented. The method of digital image generations showing spatial distribution of bowls' sound energy and examples of the energy distribution images are given. The influence of bowl's acoustic field on both time and space as well as and circular character of the acoustic field is confirmed. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Logic self repair based on regular building blocks

    Publication Year: 2008, Page(s):109 - 114
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (1347 KB) | HTML iconHTML

    The scalability of CMOS technology is apparently approaching physical limits. In particular, technology forecasts expect higher rates of permanent and transient faults, which make fault tolerant design and, eventually, built-in self repair (BISR) capabilities a necessity. While BISR works reasonably in regular structures such as memory blocks, BISR for random logic is by far an unsolved problem. T... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Performance of Wavelet-OFDM and Circular Wavelet-OFDM in power line communications

    Publication Year: 2008, Page(s):115 - 120
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (288 KB) | HTML iconHTML

    In the paper a circular version of wavelet orthogonal frequency division multiplexing (Circular Wavelet-OFDM), which can be called also a circular discrete wavelet multitone (Circular-DWMT), is introduced and its application in power line communications (PLC) is proposed and tested. Especially, the method robustness against harmful narrowband and impulsive disturbances is evaluated and compared wi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A concept of current-mode long-term analog memory for neural-network learning on silicon

    Publication Year: 2008, Page(s):121 - 126
    Cited by:  Papers (1)
    IEEE is not the copyright holder of this material | Click to expandAbstract | PDF file iconPDF (1464 KB) | HTML iconHTML

    A new concept and CMOS implementation of an analog current-mode memory with increased retention time is presented. Because the memory is of a capacitive type, there are difficulties with long-term storing the written information, when its basic form is used. To overcome this problem, we propose applying a positive feedback which ensures obtaining the same base potential of the memory sample & ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.