By Topic

Investigation of an efficient representation of speech spectra for segmentation and classification of speech sounds

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
W. Beninghof ; Northeastern University, Boston, Mass ; M. Ross

A functional representation of speech sounds in orthogonal polynomial space is described and preliminary results are presented. Speech spectra are approximated by a linear combination of orthogonal polynomials which are found to be more efficient than a linear combination of trigonometric functions. The original spectra (100 samples in frequency) and the polynomial approximations are represented by points in their respective Hilbert spaces, the distance between successive points being a measure of the dissimilarity of successive spectra. Segment boundaries are indicated where the distance between successive spectra exceeds a threshold. The effectiveness in segmentation of connected utterances using these spectral forms is compared. Also, representing speech in orthogonal polynomial space appears to be applicable to clustering and separating transformations which yield simple decision boundaries for phoneme classification. Although only one polynomial class is investigated, the procedure is valid for other functional representations of speech data.

Published in:

IEEE Transactions on Audio and Electroacoustics  (Volume:18 ,  Issue: 1 )