By Topic

A study on different neural network architectures applied to text-to-phoneme mapping

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

The purchase and pricing options are temporarily unavailable. Please try again later.
3 Author(s)
Bilcu, E.B. ; Digital Comput. Syst. Lab., Tampere Univ. of Technol., Finland ; Suontausta, J. ; Saarinen, J.

In this paper, the problem of text-to-phoneme mapping of isolated words for the English language is studied. Multilayer perceptron, recurrent and bidirectional recurrent neural network architectures are compared in the text-to-phoneme mapping task. Multilayer perceptron neural networks utilize contextual information due to the orthography of a word. In our study, recurrent and bidirectional recurrent neural networks, on the other hand, do not take the letter context into account. Instead, these networks utilize the contextual information due to the previously transcribed phonemes as introduced by the feedback loop in the networks.

Published in:

Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the 3rd International Symposium on  (Volume:2 )

Date of Conference:

18-20 Sept. 2003