Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489)

4 May 2001

  • Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489)

    Publication Year: 2001
    Freely Available from IEEE
  • Author index

    Publication Year: 2001, Page(s):xx - xxii
    Freely Available from IEEE
  • Enhancement of fax documents using a binary angular representation

    Publication Year: 2001, Page(s):125 - 128
    Cited by:  Papers (1)

    In this paper, we explore a new approach to enhancing fax documents using a binary directional filter bank (DFB). The process of sending fax documents often results in distortions that are visible in the form of spurious point noise and ragged edges. We propose a new approach that remains in the binary domain for the entire process. Conventional directional filter banks provide representations tha...
  • Tunable and variable passive digital filters for multimedia signal processing

    Publication Year: 2001, Page(s):229 - 232
    Cited by:  Papers (2)  |  Patents (1)

    The paper introduces a class of tunable and variable digital filters, which are derived from passive digital filters. By changing the values of their filter coefficients, the frequency response characteristics of these passive digital filters can be tuned or varied. These tunable and variable passive digital filters include first-order and second-order lowpass and highpass digital filters, and sec...
  • Towards hybrid 2D phase unwrapping using fuzzy clustering and neuro-fuzzy learning for SAR images: a case study on IFSAR phase image

    Publication Year: 2001, Page(s):271 - 274

    In acknowledging that every phase unwrapping (PU) technique has its advantages and disadvantages, a hybrid PU scheme is proposed. There are three phases in this scheme. In phase I, phase images are segmented based on fuzzy clustering; in phase II, PU is performed on every segment and its results are evaluated; and in phase III, neuro-fuzzy training is used to map optimal PU technique to every clust...
  • Post-processing for removing coding artifacts using edge-preserving regularization

    Publication Year: 2001, Page(s):121 - 124
    Cited by:  Papers (2)

    In low bit rate image coding, the perceived quality of the decoded image is severely degraded by various artifacts, such as blocking, blotchiness and ringing effects. We propose an effective post-processing algorithm with low computational complexity for removing coding artifacts. We first give an image recovery model for post-processing. Based on this model, a smooth constraint is imposed, togeth...
  • PCR-based fair intelligent bandwidth allocation for rate adaptive video traffic

    Publication Year: 2001, Page(s):141 - 145

    In this paper, we propose a network bandwidth sharing algorithm, Peak Cell Rate (PCR)-based Fair Intelligent Bandwidth Allocation (PFIBA), for transporting rate-adaptive video traffic using feedback, and report on its performance under a general PCR-based share policy. Through extensive simulations, we obtained the following results. The PFIBA algorithm is capable of allocating bandwidth fairly for t...
  • Effect of channel quality estimation error on the performance of interactive mobile video system

    Publication Year: 2001, Page(s):312 - 315
    Cited by:  Papers (1)

    Adaptive modulation is an effective scheme to observably improve the performance of interactive video data transmission over mobile wireless channels. However, its effectiveness is greatly affected by the accuracy of the channel quality estimation. With the measurements of extra bit error rate cost (EBC) and potential theoretical channel capacity loss (PTCCL), we analyze the effect of channel qual...
  • Fast search algorithm for edge-oriented block matching algorithm

    Publication Year: 2001, Page(s):225 - 228

    The edge-oriented block matching algorithm (EOBMA) provides a better motion-compensated prediction along moving edges in comparison with the traditional intensity-based block motion estimation methods. However, the accuracy of the motion vectors of blocks containing the boundary between the moving objects and the still area is critical in the EOBMA. Consequently, the full search algorithm (FSA) wi...
  • Analog LSI design of data clustering/labeling method in speech recognition system

    Publication Year: 2001, Page(s):563 - 566

    This report first introduces a new speech recognition system. The system is designed as a mixed software and hardware system. For the hardware part, this report presents an analog LSI architecture. This architecture realizes data clustering/labeling, which is used for speech characteristic labeling. The clustering/labeling system which supports a quite simple Japanese vowel labeling mechanism is des...
  • CORBA based distributed computing model for multimodal speech recognition

    Publication Year: 2001, Page(s):417 - 420

    We present our work in progress on the computing infrastructure of the HMNI (Human-Machine Natural Interaction) research project. In a multimodal speech recognition system, various signals and a huge amount of data are collected and analyzed simultaneously. To make this kind of task realizable, we developed a practical distributed computing model complying with CORBA (Common Object Request Broker Arch...
  • Vehicle license image segmentation using wavelet transform

    Publication Year: 2001, Page(s):267 - 270
    Cited by:  Papers (1)

    A novel multiscale method of vehicle license image segmentation based on wavelet transform is proposed. This analysis utilizes the local wavelet transform modulus maxima as the image edge at multiple scales, and combines the multiscale edge information. Then a template matching method is applied to segment the vehicle license image based on edge density analysis and character edge spatial feature ...
  • Integer multiwavelet transform for lossless image coding

    Publication Year: 2001, Page(s):117 - 120
    Cited by:  Papers (1)

    The integer Haar wavelet transform or S-transform is used as the basic building block for many integer wavelet transforms. As an alternative, a new integer multiwavelet transform and its associated integer prefilter are designed based on a box-and-slope multi-scaling system. Both the transform and prefilter can be implemented with a simple integer Haar transform requiring only addition and bit shi...
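As a hedged illustration of the S-transform named in the abstract above, the following minimal Python sketch shows one level of the standard integer Haar transform on a list of integers. It is not the paper's multiwavelet transform or prefilter, whose details are not given in this listing. The function names `s_transform_forward` and `s_transform_inverse` are hypothetical, chosen for illustration only.

```python
def s_transform_forward(x):
    """One level of the integer Haar (S) transform on an even-length list of ints.

    Returns (s, d): truncated pairwise averages and pairwise differences.
    Only additions, subtractions and a one-bit right shift are used.
    """
    if len(x) % 2 != 0:
        raise ValueError("input length must be even")
    s = [(x[i] + x[i + 1]) >> 1 for i in range(0, len(x), 2)]  # floor((a + b) / 2)
    d = [x[i] - x[i + 1] for i in range(0, len(x), 2)]         # a - b
    return s, d


def s_transform_inverse(s, d):
    """Exactly invert s_transform_forward using integer arithmetic only."""
    x = []
    for si, di in zip(s, d):
        a = si + ((di + 1) >> 1)  # recovers the first sample of the pair
        b = a - di                # recovers the second sample of the pair
        x.extend([a, b])
    return x


samples = [5, 2, 2, 5, 0, 255, -7, 4]
averages, details = s_transform_forward(samples)
restored = s_transform_inverse(averages, details)
```

Because the rounding lost in the average is recoverable from the parity of the difference, `restored` equals `samples`, which is the property that makes such transforms usable for lossless coding.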
  • JPEG 2000

    Publication Year: 2001, Page(s):20 - 23

    JPEG 2000 Part I, which became an international standard (IS) in December 2000, is reviewed. Parts II through VII relating to extensions, motion JPEG 2000, conformance, reference software, file format and technical report are in various stages of development with IS targeted for the year 2001. JPEG 2000 focuses on efficient coding of grey level and binary still images (compound documents, text, anno...
  • Motion activity based wireless video quality perceptual metric

    Publication Year: 2001, Page(s):527 - 530
    Cited by:  Papers (3)

    Because wireless bandwidth is limited, lossy compressed video is very important in wireless video transmission. We propose a quality evaluation framework based on motion activity information, and two concepts, video definition and video smoothness, are introduced into digital video metrics. The motion activity description method is essential to both aspects. Two simple but effective methods describing moti...
  • An allocation algorithm for transporting compressed video

    Publication Year: 2001, Page(s):137 - 140

    In this paper, we propose a novel weight-based bandwidth allocation algorithm (WBA) for transporting compressed video traffic using feedback. Extensive simulation using a modified NIST simulator is conducted to evaluate its performance under a general weight-based share policy. Our results demonstrate that the WBA algorithm is capable of allocating bandwidth fairly for the minimum cell rate (MCR)...
  • Half-quadratic regularization, preconditioning and applications

    Publication Year: 2001, Page(s):32 - 35
    Cited by:  Papers (1)

    The article addresses a wide class of image deconvolution or reconstruction situations where a sought image is recovered from a degraded observed image. The sought solution is defined to be the minimizer of an objective function combining a data-fidelity term and an edge-preserving, convex regularization term. Our objective is to speed up the calculation of the solution in a wide range of situations...
  • Transmitting additional data of MPEG-2 compressed video to support interactive operations

    Publication Year: 2001, Page(s):308 - 311
    Cited by:  Papers (2)

    We address the problem of supporting interactive playout, both forward and backward, of an MPEG-2 encoded video stream. Typically, the forward and backward operations are used mainly in the form of fast forward (FF) and fast backward or fast rewind (FR). The proposed approach is based on transmitting additional data of the same movie from the server to the digital storage device (DSD) in a client sta...
  • Application of MLP and RBF networks to cloud detection

    Publication Year: 2001, Page(s):60 - 63
    Cited by:  Papers (1)

    The paper compares the performances of multilayer perceptrons (MLPs) and radial basis function (RBF) networks in detecting clouds in NOAA/AVHRR images. The main results show that the RBF networks are able to handle complex atmospheric and oceanographic phenomena while conventional rule-based systems and MLPs cannot. In particular, the experimental evaluations show that the RBF networks can converg...
  • Video compression based on selectively refining regions in difference frames

    Publication Year: 2001, Page(s):221 - 224

    We introduce a computationally simple video compression algorithm based on Y. Chiu and T. Berger's (1999) CU30 algorithm, suitable for video streaming and telephony. The authors concentrate on being able to identify and select regions in the difference frame domain in order that only the regions deemed important for quality and rate are chosen and encoded. A binary shape coder is designed to descr...
  • A speech recognition and speech corpus system based on Matlab

    Publication Year: 2001, Page(s):559 - 562
    Cited by:  Papers (1)

    In this paper, an educational Mandarin syllable speech recognition and speech corpus system based on Matlab and SQL is introduced. The system includes a set of mixture CDHMM training and recognition algorithms both in Matlab M-files and optimized MEX-files, which can deal with multi-observations and scaling problems; a database toolbox to save and retrieve binary speech samples to and from a SQL d...
  • A new speech enhancement method

    Publication Year: 2001, Page(s):92 - 94
    Cited by:  Papers (1)

    The paper proposes a speech enhancement method in the wavelet domain, that is, using a soft thresholding to remove noise components from the wavelet coefficients of noisy speech. To prevent the quality degradation of the unvoiced sounds during the denoising process, we first separate unvoiced region from noisy speech and then apply the thresholding method in a different way from other regions. Sim...
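The soft thresholding step named in the abstract above can be sketched in a few lines of Python. This shows only the generic soft-threshold rule applied to a list of coefficients. The wavelet decomposition, the voiced/unvoiced separation, and the threshold selection used in the paper are not described in this listing and are not modeled here. The function name `soft_threshold` and the sample values are hypothetical.

```python
def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; zero out anything within [-t, t]."""
    if t < 0:
        raise ValueError("threshold must be non-negative")
    out = []
    for w in coeffs:
        if w > t:
            out.append(w - t)    # large positive coefficient: shrink by t
        elif w < -t:
            out.append(w + t)    # large negative coefficient: shrink by t
        else:
            out.append(0.0)      # small coefficient: treated as noise
    return out


noisy_coeffs = [3.0, -0.5, -2.0, 0.25]
denoised = soft_threshold(noisy_coeffs, 1.0)
```

Small coefficients, which are assumed to be dominated by noise, are set to zero, while larger ones are kept but reduced in magnitude.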
  • An adaptive cumulation algorithm for video shot detection

    Publication Year: 2001, Page(s):296 - 299
    Cited by:  Papers (3)

    A cumulation algorithm for video shot detection is first introduced. The algorithm considers the properties of gradual transitions. In a gradual transition, there is only a small difference between consecutive frames. The algorithm can remember the differences between consecutive frames and cumulates them. When the accumulated difference exceeds a threshold, an occurrence of shot transition is de...
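The accumulate-and-threshold idea in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's algorithm: the frame difference measure (mean absolute pixel difference), the reset of the accumulator after a detection, the `noise_floor` parameter that clears the accumulator during static stretches, and the fixed (non-adaptive) threshold are all assumptions introduced here. The function names are hypothetical.

```python
def frame_difference(a, b):
    """Mean absolute difference between two frames given as flat lists of intensities."""
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)


def detect_transitions(frames, threshold, noise_floor=0.0):
    """Return indices of frames at which the accumulated difference exceeds threshold."""
    detections = []
    accumulated = 0.0
    for i in range(1, len(frames)):
        d = frame_difference(frames[i - 1], frames[i])
        if d <= noise_floor:
            accumulated = 0.0      # assumption: a static stretch clears the memory
            continue
        accumulated += d           # remember small consecutive differences
        if accumulated > threshold:
            detections.append(i)   # declare a shot transition at this frame
            accumulated = 0.0      # assumption: start over after a detection
    return detections


# A slow fade: each step changes intensity by 10, below the threshold of 25.
fade = [[0, 0], [0, 0], [10, 10], [20, 20], [30, 30], [40, 40], [40, 40]]
found = detect_transitions(fade, threshold=25, noise_floor=1)
```

No single frame-to-frame difference in `fade` exceeds the threshold, yet the accumulated difference does, which is why accumulation helps with gradual transitions.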
  • Computer recognition of human emotions

    Publication Year: 2001, Page(s):490 - 493
    Cited by:  Papers (4)

    We address an automatic computer system which can recognize a driver's behavior by analysis of his/her facial expressions for the purpose of driving safety. This system is intended to be used for monitoring whether the driver is sleepy, or if the driver is faced with very dangerous traffic situations. The output of this recognition system is fed into intelligent control systems to aid control of t...
  • Development of a speaker recognition system using wavelets and artificial neural networks

    Publication Year: 2001, Page(s):413 - 416

    We address the problem of speaker recognition from speech signals. The study focuses on the development of a speaker recognition system comprising two modules: a wavelet-based feature extractor, and a neural network-based classifier. We have conducted a number of experiments to investigate the applicability of a discrete wavelet transform (DWT) in extracting discriminative features from the speech...