By Topic

Distant BI-Gram model, collocation, and their applications in post-processing for Chinese character recognition

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Rui-Feng Xu ; Department of Computing, Hong Kong Polytech. Univ., China ; Qin Lu ; Yeung, D.S. ; Xi-Zhao Wang

In this paper, we present a distant BI-Gram model, which extended the regular BI-Gram model by considering the distance information and weight parameters, in order to describe the long-distance restrictions among the Chinese sentence. The extraction of the statistical information and weight parameters of this language model is discussed. Based on this work, the word combination strength and spread are employed to extract the recurrent word combinations, i.e. collocations. The distant BI-Gram model and collocation are applied to a statistic-based post-processing system for improving the recognition performance of Chinese characters. The experimental results show that by employing these two language models, the post-processing system achieves a higher improvement performance.

Published in:

Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on  (Volume:4 )

Date of Conference:

4-5 Nov. 2002