By Topic

Automatic indexing of key sentences for lecture archives

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

The purchase and pricing options are temporarily unavailable. Please try again later.
4 Author(s)
Kawahara, T. ; Sch. of Informatics, Kyoto Univ., Japan ; Shitaoka, K. ; Kitade, T. ; Nanjo, H.

Automatic extraction of key sentences from lecture audio archives is addressed. The method makes use of the characteristic expressions used in initial utterances of sections, which are defined as discourse markers and derived in an unsupervised manner based on word statistics. The statistics of the discourse markers is then used to define the importance of the sentences. It is also combined with the conventional tf-idf measure for content words. Experimental results confirm the effectiveness of the method using the discourse markers and its combination with the keyword-based method. We also present a statistical method for inserting periods into raw speech transcriptions for improving the readability.

Published in:

Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on

Date of Conference:

30 Nov.-3 Dec. 2003