Improving accuracy in behaviour identification for content-based retrieval by using audio and video information

1 Author(s)
H. Miyamori ; Commun. Res. Lab., Kyoto, Japan

Presents a method for accurately identifying human behaviours for content-based retrieval by using audio and video information. In conventional content-based retrieval, target events are identified by analyzing positional information about objects in the video, such as their loci, relative positions, and transitions. However, methods that rely only on indices obtained from video inherently fail to detect some important time points and positions because of tracking errors caused by occlusion, which leads to recognition failures and overlooked target events. Our approach combines audio information with conventional video methods in an integrated reasoning module that can recognize some events that conventional methods cannot identify. Based on the proposed method, we implemented a content-based retrieval system that can identify several actions in a real tennis video. The basic actions of a player, such as a forehand swing or an overhead swing, are identified by using information about the court and net lines, the players' positions, the ball positions, and the moments when the players hit the ball, which are called "impact points". Simulation results show that the detection rate of impact points affects the recognition rate of the players' basic actions. They also show that using audio information avoids some recognition problems.
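The abstract does not say how the integrated reasoning module combines the two modalities. As a purely illustrative sketch of the general idea, the Python below merges impact-point candidates from a video tracker with candidates from an audio hit-sound detector. The function name `fuse_impact_times`, the `tolerance` parameter, and the rule of preferring the audio timestamp are all assumptions made for this example and are not taken from the paper.

```python
from typing import List


def fuse_impact_times(video_times: List[float],
                      audio_times: List[float],
                      tolerance: float = 0.1) -> List[float]:
    """Merge impact-time candidates (in seconds) from video and audio.

    Hypothetical fusion rule, not the paper's method:
    - every audio candidate is kept, so impacts the video tracker
      missed (for example under occlusion) are still reported;
    - a video candidate is kept only if no audio candidate lies
      within `tolerance` seconds of it, to avoid duplicates.
    """
    fused = list(audio_times)
    for v in video_times:
        # Keep the video candidate only when audio has nothing nearby.
        if all(abs(v - a) > tolerance for a in audio_times):
            fused.append(v)
    return sorted(fused)


# Video misses the impact near 3.2 s; audio supplies it.
video = [1.02, 5.48]
audio = [1.0, 3.2, 5.5]
fused = fuse_impact_times(video, audio)
```

In this toy input the video tracker reports two impacts and misses a third, while the audio detector reports all three, so the fused list contains three impact times. A real system would also have to reject sounds that are not ball hits, which this sketch ignores.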

Published in:

Pattern Recognition, 2002. Proceedings. 16th International Conference on (Volume: 2)

Date of Conference: