By Topic

A study on sports video classification based on audio analysis and speech recognition

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Li Lu ; Think IT Speech Lab., Chinese Acad. of Sci., Beijing, China ; Qingwei Zhao ; Yonghong Yan ; Kun Liu

This paper proposes a method to deal with the problem of sports classification through audio analysis. First, a two-pass audio segmentation module is developed as the front-end to extract announcer's speech from the audio streams. Then speech recognition technology is employed on the speech segments to extract keywords which are used as features to distinguish different sports. Finally, based on the keyword spotting (KWS) results and specific keywords selected for each kind of sports, a score ranking strategy is designed for conducting classification automatically. For robust KWS in our system, adaptation techniques for acoustic model and language model are employed and both of them show significant improvements on the KWS performance. Fifteen games of seven kinds of sports are used to evaluate the system performance. By integrating all the techniques, an average figure of metric (FOM) of 70.74 is achieved on the KWS task, a 100% accuracy rate is achieved on sports classification task using all detected keywords of each game.

Published in:

Audio Language and Image Processing (ICALIP), 2010 International Conference on

Date of Conference:

23-25 Nov. 2010