Abstract:
We address the problem of automatic speech summarization on open-domain TED talks. The large vocabulary and diversity of topics from speaker-to-speaker presents significa...Show MoreMetadata
Abstract:
We address the problem of automatic speech summarization on open-domain TED talks. The large vocabulary and diversity of topics from speaker-to-speaker presents significant difficulties. The challenges increase not only how to handle disfluencies and fillers, but also how to extract topic-related meaningful messages within the free talks. Here, we propose to incorporate semantic and acoustic features within the speech summarization technique. In addition, we also propose a new evaluation method for speech summarization by checking semantic similarity between system and human summarization. Experiments results reveal that the proposed methods are effective in spontaneous speech summarization.
Published in: Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific
Date of Conference: 09-12 December 2014
Date Added to IEEE Xplore: 16 February 2015
Electronic ISBN:978-6-1636-1823-8