By Topic

A statistical approach to the segmentation and broad classification of continuous speech into phrase-sized information units

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
D. Huber ; Dept. of Inf. Theory, Chalmers Univ. of Technol., Gothenburg

An algorithm is presented which uses the F0 tracings of a connected-speech utterance as input and performs speaker-independent segmentation into prosodically defined information units. Two global declination lines are computed by the linear regression method, which approximate the trends in time of the peaks (topline) and valleys (baseline) of F0 across the utterance. Computation is reiterated every time the Pearson product moment correlation coefficient for these declination lines drops below the present level of acceptability. Segmentation is thus performed without prior knowledge of higher level linguistic information, with the termination of one unit being determined by the general resetting of the intonation contour wherever in the utterance it may occur. The structure of the algorithm is described and its performance evaluated on three medium-sized Swedish texts read by four native speakers of standard Swedish

Published in:

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on

Date of Conference:

23-26 May 1989