By Topic

Study of segment dictionary based on two-dimensional array

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Chengcheng Li ; Sch. of Comput. & Inf. Eng., Inner Mongolia Normal Univ., Hohhot, China ; Hong Wu

Chinese word automatic segmentation is the foundation of Chinese Information Processing, and it has widely application in many fields. In this paper, a new dictionary mechanism is presented: According to the Chinese characteristic of the high frequency of one word and two words we put forward such an idea that we can build up index table by the first two words as the keywords, and this index table is a two-dimensional array. This algorithm directly locates data by establishing a corresponding relationship between the first two Chinese characters' internal code. In this way, we can directly find out the two-word words by using the two-dimensional array. This approach can significantly reduce the times of queries, so as to further accelerate the speed of segmentation.

Published in:

Broadband Network and Multimedia Technology (IC-BNMT), 2010 3rd IEEE International Conference on

Date of Conference:

26-28 Oct. 2010