Speech-Aware Multimedia

Authors:
Hemphill, C.T. (Texas Instrum. Inc., Dallas, TX, USA); Thrift, P.R.; Linn, J.C.

Computer users have long desired a personal software agent that could execute verbal commands. Today's World Wide Web (WWW or Web), with its point-and-click hypertext interface, makes a tremendous amount of information readily available online. A speech interface would make the Web even more powerful by letting users access information by voice. TI has developed Speech Aware Multimedia (SAM) with this in mind, to make information on the Web more accessible and useful. The developers combined an innovative speech recognition engine with the Web to let anyone browse arbitrary Web pages using only speech as the input medium. Speech brings added flexibility and power to the classical Web interface and makes information access more natural. Today's speech recognition capability is well matched to Web browsing. The Web page provides a natural, well-defined context for a speech recognition application. The recognition engine does not need to recognize any and all possible phrases, but only those phrases pertaining to the specific page in view at the moment. This context imposes limits that significantly aid recognition performance. Furthermore, the visual information on a page prompts the user on what to request and how to request it by voice.
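
The idea of limiting recognition to phrases on the page in view can be illustrated with a minimal sketch. It is not SAM's implementation, which the abstract does not describe. It assumes the "grammar" is simply the set of link texts on the current page, and it uses approximate string matching from Python's standard library as a stand-in for an acoustic recognizer. The names LinkCollector, page_grammar, and resolve_utterance, and the sample page, are hypothetical.

```python
from difflib import get_close_matches
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect link phrases and their targets from an HTML page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = {}      # normalized link phrase -> href
        self._href = None    # href of the <a> element currently open, if any
        self._text = []      # text fragments seen inside that element

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Lowercase and collapse whitespace so phrases compare cleanly.
            phrase = " ".join("".join(self._text).lower().split())
            if phrase:
                self.links[phrase] = self._href
            self._href = None


def page_grammar(html):
    """Build the page-specific vocabulary: one phrase per link on the page."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


def resolve_utterance(utterance, grammar, cutoff=0.6):
    """Map a (possibly misrecognized) utterance to a link target, or None.

    Only phrases from the current page are candidates, which is the
    context restriction the abstract describes. Assumption: text
    similarity stands in for acoustic scoring.
    """
    spoken = " ".join(utterance.lower().split())
    match = get_close_matches(spoken, list(grammar), n=1, cutoff=cutoff)
    return grammar[match[0]] if match else None


# Hypothetical page with three links.
PAGE = """
<html><body>
  <a href="/weather">Weather Forecast</a>
  <a href="/news">Top News Stories</a>
  <a href="/sports">Sports Scores</a>
</body></html>
"""

grammar = page_grammar(PAGE)
target = resolve_utterance("wether forecast", grammar)
print(target)  # -> /weather
```

Because the candidate set holds only three phrases, even a noisy transcript resolves to the intended link, while an utterance unrelated to the page falls below the cutoff and returns None. A larger vocabulary would make near-misses more ambiguous, which is the performance argument the abstract makes for page-level context.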

Published in:

IEEE MultiMedia (Volume: 3, Issue: 1)