Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings

Abstract:

This paper proposes a self-attention based model for predicting punctuation marks in word sequences. The model is trained on word and speech embedding features obtained from pre-trained Word2Vec and Speech2Vec models, respectively. Thus, the model can make use of any kind of textual and speech data. Experiments are conducted on the English IWSLT2011 datasets. The results show that the self-attention based model trained on word and speech embedding features outperforms the previous state-of-the-art single model by up to 7.8% absolute overall F₁-score. It also improves on the previous best ensemble model by up to 4.7% absolute overall F₁-score.
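
As a rough illustration of the idea in the abstract, the sketch below fuses a word embedding and a speech embedding for each token and passes the fused sequence through a single scaled dot-product self-attention step. It is a minimal sketch and not the authors' architecture: the abstract does not say how the two feature types are combined, so concatenation is an assumption here, the embedding values are made-up toy numbers rather than Word2Vec or Speech2Vec outputs, and the learned query/key/value projections and the punctuation classifier are omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(word_vecs, speech_vecs):
    """Concatenate word and speech embeddings token by token.

    Concatenation is an assumption made for this sketch; the abstract
    does not state how the two feature types are combined.
    """
    return [w + s for w, s in zip(word_vecs, speech_vecs)]


def self_attention(xs):
    """Single-head scaled dot-product self-attention.

    Queries, keys, and values are the inputs themselves (no learned
    projections), which keeps the example short.
    """
    dim = len(xs[0])
    outputs, weights = [], []
    for query in xs:
        # Similarity of this token to every token in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in xs
        ]
        row = softmax(scores)
        weights.append(row)
        # Weighted sum of all token vectors.
        outputs.append(
            [sum(row[j] * xs[j][i] for j in range(len(xs))) for i in range(dim)]
        )
    return outputs, weights


# Toy 2-dimensional embeddings for a 3-token sequence (hypothetical values).
word_vecs = [[0.1, 0.3], [0.7, -0.2], [0.0, 0.5]]
speech_vecs = [[0.2, 0.1], [-0.4, 0.6], [0.3, 0.3]]

fused = fuse(word_vecs, speech_vecs)
contextual, attn = self_attention(fused)
```

In a full model, each vector in `contextual` would feed a classifier that chooses a punctuation label for the corresponding word; that step is left out because the abstract gives no details about it.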

Published in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 12-17 May 2019

Date Added to IEEE Xplore: 17 April 2019
