Abstract:
The authors analyze three critical components in training word embeddings: model, corpus, and training parameters. They systematize existing neural-network-based word emb...Show MoreMetadata
Abstract:
The authors analyze three critical components in training word embeddings: model, corpus, and training parameters. They systematize existing neural-network-based word embedding methods and experimentally compare them using the same corpus. They then evaluate each word embedding in three ways: analyzing its semantic properties, using it as a feature for supervised tasks, and using it to initialize neural networks. They also provide several simple guidelines for training good word embeddings.
Published in: IEEE Intelligent Systems ( Volume: 31, Issue: 6, Nov.-Dec. 2016)
DOI: 10.1109/MIS.2016.45