Analysis of gesture and action in technical talks for video indexing
Ju, S.X.
Black, M.J.
Minneman, S.
Kimber, D.
Dept. of Comput. Sci., Toronto Univ., Ont.
This paper appears in: Computer Vision and Pattern Recognition, 1997. Proceedings., 1997 IEEE Computer Society Conference on
Publication Date: 17-19 Jun 1997
On page(s): 595-601
Meeting Date: 06/17/1997 - 06/19/1997
Location: San Juan, Puerto Rico
ISBN: 0-8186-7822-4
References Cited: 15
INSPEC Accession Number: 5644218
Digital Object Identifier: 10.1109/CVPR.1997.609386
Current Version Published: 2002-08-06
Abstract
We present an automatic system for analyzing and annotating video
sequences of technical talks. Our method uses a robust motion estimation
technique to detect key frames and segment the video sequence into
subsequences containing a single overhead slide. The subsequences are
stabilized to remove motion that occurs when the speaker adjusts their
slides. Any changes remaining between frames in the stabilized sequences
may be due to speaker gestures such as pointing or writing, and we use
active contours to automatically track these potential gestures. Given
the constrained domain, we define a simple “vocabulary” of
actions which can easily be recognized based on the active contour shape
and motion. The recognized actions provide a rich annotation of the
sequence that can be used to access a condensed version of the talk from
a web page.
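
The segmentation step described in the abstract can be illustrated with a toy sketch. This is not the authors' method: the paper uses a robust motion estimation technique, whereas the sketch below assumes a per-frame motion magnitude has already been computed and simply thresholds it. The function names, the threshold, and the minimum run length are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def segment_by_motion(
    motion: List[float],
    threshold: float = 1.0,  # hypothetical cutoff between "still" and "moving"
    min_len: int = 3,        # hypothetical minimum number of still frames
) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) frame ranges where motion stays below threshold.

    Each range stands in for a subsequence showing a single slide; frames with
    large motion are treated as slide changes and are excluded.
    """
    segments: List[Tuple[int, int]] = []
    start = None  # index where the current still run began, if any
    for i, m in enumerate(motion):
        if m < threshold:
            if start is None:
                start = i
        else:
            # A high-motion frame closes the current still run.
            if start is not None and i - start >= min_len:
                segments.append((start, i - 1))
            start = None
    # Close a still run that extends to the last frame.
    if start is not None and len(motion) - start >= min_len:
        segments.append((start, len(motion) - 1))
    return segments


def key_frames(segments: List[Tuple[int, int]]) -> List[int]:
    """Pick the middle frame of each segment as its key frame (an assumption)."""
    return [(s + e) // 2 for s, e in segments]


if __name__ == "__main__":
    # Made-up motion magnitudes: two still runs separated by slide changes.
    motion = [0.1, 0.2, 0.1, 0.0, 5.0, 6.0, 0.2, 0.1, 0.3, 4.0, 0.1]
    segs = segment_by_motion(motion)
    print(segs, key_frames(segs))
```

Choosing the middle frame as the key frame is only a convenient default here; the abstract does not say how key frames are selected within a subsequence.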