Word class driven synthesis of prosodic annotations
Arnfield, S.
Dept. of Linguistic Sci., Reading Univ.;
This paper appears in: Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Publication Date: 3-6 Oct 1996
Volume: 3,
On page(s): 1978-1980 vol.3
Meeting Date: 10/03/1996 - 10/06/1996
Location: Philadelphia, PA, USA
ISBN: 0-7803-3555-4
References Cited: 10
INSPEC Accession Number: 5679615
Digital Object Identifier: 10.1109/ICSLP.1996.608024
Current Version Published: 2002-08-06
Abstract
Prosody is an important aspect of speech that current text to
speech synthesis systems fail to mimic in a convincing or natural way.
The paper describes research on a partial system for prosodic synthesis
using easily derived low level syntactic information. A computer program
has been developed that can annotate unseen text with prosodic stress
and tone marks using the sequence of part of speech tags previously
assigned to each word by a tagging system. Training and testing material
was taken from the Lancaster/IBM Spoken English Corpus (SEC).
Co-occurrence measures were calculated relating stress and tone mark
annotations to the word class annotation information. A model was
developed around the statistical information which calculates a score
for all possible mappings between a given part of speech sequence and
all the potential stress/tone annotations. The highest scoring pattern
is selected as that which is the most likely “baseline”
annotation, according to the model. Performance figures attain up to 91%
agreement with the original corpus annotations
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.