By Topic

TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE

4-4 Dec. 1997

Go

Filter Results

Displaying Results 1 - 25 of 110
  • TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162)

    Publication Year: 1997
    Request permission for commercial reuse | PDF file iconPDF (1450 KB)
    Freely Available from IEEE
  • Modelling human limits for visual data assimilation

    Publication Year: 1997
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (153 KB)

    Summary form only given. Some aspects of the performance of the human visual system (HVS) have been quantified in psychology and neurology experiments and theories. Much of this knowledge does not extend beyond the early vision stages, and so does not provide an adequate basis for accurate models of how humans perceive particular instances of visual data, such as individual images or sequences of ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Speech and audio processing for multimedia communications

    Publication Year: 1997
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (49 KB)

    Summary form only given. Multimedia communication involves processing, storage, transmission forwarding, and presentation of audiovisual information, and establishing natural interfaces between systems and their users. The computing, communication and integration infrastructures needed to support multimedia applications are also of great interest. Three of the key technologies in realizing multime... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Speaker characteristics in speech and speaker recognition

    Publication Year: 1997
    Cited by:  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (38 KB)

    Summary form only given. This paper analyses the acoustic variability of speakers and its impact on the robustness of contemporary automatic speech recognition and speaker recognition systems. The physiological and behavioural constraints of individual speech articulation are reviewed and examples are given of how these constraints affect commonly used spectral and cepstral features. Speech data f... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Motion estimation in video coding

    Publication Year: 1997
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (46 KB)

    Summary form only given. Motion estimation (ME) is one of the most computational intensive operations in video compression. It can easily account for over 80% of the computation in an MPEG-2 video encoder. Most ME algorithms can be formulated as an optimization problem and the challenge is to avoid being trapped at local minima without the expense of a full search in the optimization space. In thi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • List of authors

    Publication Year: 1997, Page(s):851 - 854
    Request permission for commercial reuse | PDF file iconPDF (362 KB)
    Freely Available from IEEE
  • A generic implementation framework for FPGA based stereo matching

    Publication Year: 1997, Page(s):461 - 464 vol.2
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (368 KB)

    This paper investigates the real time implementation requirements for several stereo matching algorithms. The area-based matching techniques sum of absolute differences (SAD), sum of squared differences (SSD), and normalised cross correlation (NCC) are considered as well as the non-parametric census and rank methods. A generic stereo matching framework is presented and algorithm specific implement... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Application of noise reduction techniques for alaryngeal speech enhancement

    Publication Year: 1997, Page(s):491 - 494 vol.2
    Cited by:  Papers (10)  |  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (396 KB)

    A hybrid noise reduction method using both spectral subtraction and root cepstral subtraction procedures is applied to speech produced using an artificial larynx (electro-larynx). The noise reduction suppresses the direct path noise of the electro-larynx device and provides an improvement in mean opinion score View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • 0.5 μm GaAs MESFET 4.2 mW ultra-low power decision circuit for optical communications with 0.1 μm CMOS and HBT implementation

    Publication Year: 1997, Page(s):457 - 460 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (332 KB)

    The design specifications considering low-power and high-speed are presented. Several decision circuits for an optical communication receiver system, are designed and simulated using AIM-SPICE circuit simulation tools, based on extracted process parameter using an in-house characterization and modeling tool named CMLEE. Several circuits using 2×4 emitter-area HBT, 0.5 μm MESFET and compat... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On improving the intelligibility of synchronized over-lap-and-add (SOLA) at low TSM factor

    Publication Year: 1997, Page(s):487 - 490 vol.2
    Cited by:  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (624 KB)

    We propose an algorithm to modify the synchronized overlap-and-add (SOLA) technique. SOLA is a popular technique for time scale modification of speech and audio signal. It changes the time scale of the signal while maintaining the pitch information. However, when the time scale of the signal is compressed more, the synthesized speech of SOLA has high articulations rate that it becomes almost impos... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Guidelines for multimedia design and development

    Publication Year: 1997, Page(s):839 - 842 vol.2
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (404 KB)

    The purpose of this article has been to provide some guidelines and recommend tools to be used at different stages of the multimedia design and development process. It argues that the flexibility of present design tools means that they should not be considered as contributing solely to the building of the final multimedia product at the end of the development process, instead they can contribute v... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A new double talk detector using the lattice predictors for an acoustic echo canceller

    Publication Year: 1997, Page(s):483 - 486 vol.2
    Cited by:  Papers (2)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (348 KB)

    We propose a new intelligent double talk detector that can not only detect double talk fast but also distinguish the echo path change from the double talk without a decision delay. The detection mechanism is performed by observing the change rate of the reflection coefficients of the two lattice predictors that are placed on the near-end and far-end terminals. The excellence of the proposed method... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Nonlinear discrete Fourier transformer for the time series analysis and application to speech coding

    Publication Year: 1997, Page(s):613 - 616 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (396 KB)

    The primary motivation of the paper is to investigate waveform coding of a speech signal. The paper presents a new signal analyzing tool-the `nonlinear discrete Fourier transform (NDFT) which has an improved signal analysis performance. By virtue of the NDFT, waveform coding of the speech signal with a long segment (for example a segment with 512 or 1024 samples) is studied. The new coding method ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Analysis of real-time parallel programs using source-level timing schema

    Publication Year: 1997, Page(s):433 - 436 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (292 KB)

    The paper introduces deterministic timing schema or formulae for predicting the best and worst case execution times of real time parallel programs. Timing schema (J. Kim and A.C. Shaw; H.R. Callison and A.C. Shaw) are formulae based on source program elements to calculate the execution time of programs. The total execution time is computed from the schema provided for a variety of parallel program... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Application of ground penetrating radar for coal thickness measurement

    Publication Year: 1997, Page(s):835 - 838 vol.2
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (472 KB)

    Ground penetrating radar (GPR) represents a non-invasive sensing technique for subsurface imaging. This paper overviews an impulse GPR unit being developed and evaluated by the CSIRO for use in coal thickness measurement. The GPR unit has been designed to operate in an underground mining environment and so special considerations have been made to ensure that the radar equipment does not present a ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Pseudo-inverse filter design for improving the axial resolution of ultrasound images

    Publication Year: 1997, Page(s):703 - 706 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (864 KB)

    The resolution of medical ultrasound systems can be improved using filtering techniques. We present a filter for improving the axial resolution that approximates an inverse filter but which is less sensitive to noise than a standard inverse filter making it suitable for use with ultrasound. This filter produces a superior image resolution to that gained by matched filtering, and also shows advanta... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Wavelet for speech denoising

    Publication Year: 1997, Page(s):479 - 482 vol.2
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (308 KB)

    This paper presents the use of the wavelet transform for noise reduction in noisy speech signals. The use of different wavelets and different orders have been evaluated for their suitability as a transform for speech noise removal. The wavelets evaluated are the biorthogonal wavelets, Daubechies wavelets, coiflets as well as symlets. Also two different means of filtering the noise in the transform... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Packet voice transmission using Java programming language

    Publication Year: 1997, Page(s):629 - 632 vol.2
    Cited by:  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (324 KB)

    We investigate the suitability of Java in real time communication over the network. We describe the development of a voice transmission application that allows full-duplex communications over the Ethernet. The Ethernet (using CSMA/CD MAC protocol) is essentially a data network with no special scheme for real time traffic, therefore, it is up to the application layer to provide some strategies to c... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The effects of noise on the waveform interpolation speech coder

    Publication Year: 1997, Page(s):609 - 612 vol.2
    Cited by:  Papers (1)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (464 KB)

    A study of the effects of additive noise on the WI coder is presented. In WI, pitch-cycle waveforms (prototypes) are extracted from the residual signal. Since these prototypes evolve slowly when the speech is voiced and rapidly when unvoiced, simple filtering can be applied in the evolution domain to separate these components. The focus of this work is to investigate how this decomposition maps no... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Using fundamental electrical theory for varying time quantum uniprocessor scheduling

    Publication Year: 1997, Page(s):429 - 432 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (284 KB)

    Given the total number of instructions to be completed on a uniprocessor system and the cycle time per instruction we introduce a method of calculating time quantum allocation to individual fine grain tasks. The main theory behind our method is based on fundamental equations describing electrical phenomenon. We show how electric circuit analysis can be used to describe this fundamental problem, an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Instantaneous phase of the autocorrelation and robustness to signal-dependent noise

    Publication Year: 1997, Page(s):831 - 834 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (288 KB)

    The phase of an analytic signal constructed from the autocorrelation function of a signal contains significant information about the shape of the signal. Using Bedrosian's (1963) theorem for the Hilbert transform it is proved that this phase is robust to multiplicative noise if the signal is baseband and the spectra of the signal and the noise do not overlap. Higher-order spectral features are int... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Real-time retina tracking by a personal computer

    Publication Year: 1997, Page(s):699 - 702 vol.2
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (380 KB)

    Visual field observation and measurement provides important information regarding the diagnosis, progression, treatment, and management of many eye diseases. Before the advent of scanning laser ophthalmoscopes (SLO), these evaluations were made through a fundus camera. Fundus cameras make use of a conventional visual technique and observations are made either through an eyepiece or through recorde... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Adaptive tracking algorithm based on direction field using ML estimation in angiogram

    Publication Year: 1997, Page(s):671 - 675 vol.2
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (528 KB)

    We present a new tracking algorithm for the main artery contours in a digital angiogram. The proposed work extracts features and profiles the narrow blood vessel, mainly the blood vessel in the digital subtraction angiography image. A consecutive value is performed on the boundary detection by calculating maximum-likelihood (ML) estimation on adjacent pixels. The proposed algorithm adaptively dete... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Speech enhancement for forensic applications using dynamic time warping and wavelet packet analysis

    Publication Year: 1997, Page(s):475 - 478 vol.2
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (344 KB)

    This paper presents two novel speech enhancement techniques which are applicable to forensic audio recordings. The first technique is one for removing a background signal from a single channel recording in order to obtain the target signal. The method assumes that it is possible to obtain a copy of the background signal at a later instance. The copy of the background signal is aligned onto the sin... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Mixed wideband speech and music coding using a speech/music discriminator

    Publication Year: 1997, Page(s):605 - 608 vol.2
    Cited by:  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (424 KB)

    In multimedia applications such as videoconferencing, users are demanding higher quality speech/audio transmission than the POTS can offer. 7kHz wideband speech/audio offers a good compromise between bandwidth and sound quality. It improves the intelligibility and naturalness of speech and adds a feeling of transparent communication. Currently the only existing international standard for coding su... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.