Loading [a11y]/accessibility-menu.js
Nonlinear speech processing with oscillatory neural networks for speaker segregation | IEEE Conference Publication | IEEE Xplore

Nonlinear speech processing with oscillatory neural networks for speaker segregation


Abstract:

Nonlinear masking of space-time representations of speech is a universal technique for speech processing. In the present work we use an AM representation of cochlear filt...Show More

Abstract:

Nonlinear masking of space-time representations of speech is a universal technique for speech processing. In the present work we use an AM representation of cochlear filterbank signals in combination with a mask that is derived from a network of oscillatory neurons. The proposed approach does not need any training or learning and the mask takes into account the dependence between points from the auditory derived representation. A potential application is illustrated in the context of speaker segregation.
Date of Conference: 03-06 September 2002
Date Added to IEEE Xplore: 30 March 2015
Print ISSN: 2219-5491
Conference Location: Toulouse, France

2 Introduction

Speech enhancement, speaker segregation, speech recognition and coding can be viewed as the result of 3 successive processes.

Decomposition of the speech signal into an adequate space-time representation (auditory image representations, spectrograms, wavelet decompositions, etc.);

Selection of the relevant information from the chosen representation or masking of the irrelevant information;

Synthesis of the speech (enhancement, speaker segregation, coding) or extraction of the parameters from the selected relevant areas of the representation (speech recognition).

Contact IEEE to Subscribe

References

References is not available for this document.