By Topic

A Hybrid Model Based on CRFs for Chinese Named Entity Recognition

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Lishuang Li ; Dept. of Comput. Sci. & Eng., Dalian Univ. of Technol., Dalian ; Zhuoye Ding ; Degen Huang ; Huiwei Zhou

This paper presents a hybrid model and the corresponding algorithm combining conditional random fields (CRFs) with statistical methods to improve the performance of CRFs for the task of Chinese named entity recognition (NER). CRFs has a good performance in the task of sequence labeling. In the experiment of recognizing Chinese named entity with CRFs, it can be found that the wrong tags labeled by CRFs are mostly the ones which have lower marginal probabilities. A statistical model is introduced to compliment it. In the hybrid model, marginal probability of every label in CRFs is used to separate CRFs method and statistical method. If the probability is greater than the given threshold, the test sample is recognized by CRFs; otherwise, the statistical model is used. By integrating the advantages of two methods, the hybrid model achieves 93.61% F-measure for Chinese person names and 91.75% F-measure for Chinese location names on MSRA dataset.

Published in:

Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on

Date of Conference:

23-25 July 2008