Sequence labeling tasks, such as named entity recognition and part-of-speech tagging, are fundamental components of information extraction systems and have received considerable attention in recent years. This paper proposes k-similar conditional random fields for semi-supervised sequence labeling, which use unlabeled data to compute the similarity between words through distributional clustering. Named entity recognition experiments show that the method improves performance by exploiting unlabeled data.
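As a rough illustration of how unlabeled text can yield word similarities, the sketch below builds co-occurrence context vectors and ranks words by cosine similarity. It is not the paper's method: the abstract does not specify the clustering algorithm or how similarities enter the k-similar CRF, so the function names (`context_vectors`, `cosine`, `k_most_similar`), the window size, and the choice of cosine similarity are all assumptions made for this example.

```python
from collections import Counter
from math import sqrt


def context_vectors(sentences, window=1):
    """For each word, count the words seen within `window` positions of it.

    `sentences` is a list of token lists taken from unlabeled text.
    (Hypothetical helper; the window size is an assumption.)
    """
    vectors = {}
    for tokens in sentences:
        for i, word in enumerate(tokens):
            counts = vectors.setdefault(word, Counter())
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(count * v[key] for key, count in u.items())
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def k_most_similar(word, vectors, k=2):
    """Return the k words whose context vectors are closest to `word`'s.

    Ties are broken alphabetically so the ranking is deterministic.
    """
    target = vectors.get(word, Counter())
    scores = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scores.sort(key=lambda pair: (-pair[1], pair[0]))
    return scores[:k]
```

In this toy setting, words that occur in the same contexts (for example, two city names following the same verb) receive a high similarity even if neither appears in the labeled training data, which is the intuition behind drawing on unlabeled text.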
Date of Conference: 23-25 July 2008