Skip to Main Content
A connected digit recognizer is proposed in which a set of isolated word templates is used as reference patterns and an unconstrained dynamic time warping algorithm is used to literally "spot" the digits in the string. Segmentation boundaries between digits are obtained as the termination point of the dynamic path from the previous time warp. A region around the boundary is searched for the optimum starting point for the succeeding digit. At each stage the recognizer keeps track of a set of candidate digit strings for each test string. The string with the smallest accumulated distance is used as the preliminary string estimate. For variable length digit strings of from 2 to 5 digits (where the recognizer was not told the length of the string), word error rates of about 2-3% and string error rates on the order of 8% were obtained for both speaker dependent and speaker independent systems.
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '80. (Volume:5 )
Date of Conference: Apr 1980