A Probabilistic Model of Meetings That Combines Words and Discourse Features
Dowman, M.
Savova, V.
Griffiths, T.L.
Kording, K.P.
Tenenbaum, J.B.
Purver, M.
Dept. of Gen. Syst. Studies, Univ. of Tokyo, Tokyo;
This paper appears in: Audio, Speech, and Language Processing, IEEE Transactions on
Publication Date: Sept. 2008
Volume: 16,
Issue: 7
On page(s): 1238-1248
ISSN: 1558-7916
INSPEC Accession Number: 10158378
Digital Object Identifier: 10.1109/TASL.2008.925867
Current Version Published: 2008-08-15
Abstract
In order to determine the points at which meeting discourse changes from one topic to another, probabilistic models were used to approximate the process through which meeting transcripts were produced. Gibbs sampling was used to estimate the values of random variables in the models, including the locations of topic boundaries. This paper shows how discourse features were integrated into the Bayesian model and reports empirical evaluations of the benefit obtained through the inclusion of each feature and of the suitability of alternative models of the placement of topic boundaries. It demonstrates how multiple cues to segmentation can be combined in a principled way, and empirical tests show a clear improvement over previous work.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.