By Topic

Word sense disambiguation using multi-engine collaborative boostrapping

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

5 Author(s)
Jianyong Duan ; Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China ; Weilin Wu ; Yi Hu ; Yuquan Chen
more authors

In this paper we proposed a new word sense disambiguation method, called multi-engine collaborative bootstrapping (MCB) that combines different types of corpora and also uses two languages to bootstrapping. MCB contains the bilingual bootstrapping as its kernel algorithm that leads to incremental knowledge acquisition. EM model is performed to train parameters in base learner. Feature translation model is improved by semantic correlation estimation. In addition we use multi-engine to produce qualified starting seeds from parallel corpora and monolingual corpora. Those seeds that are generated through unsupervised machine learning approaches can also ensure bootstrapping effectiveness in contrast with manual selected seeds in spite of their different selection mechanisms. Experimental results prove the effectiveness of MCB. Some factors including feature space and starting seed number are concerned in our experiments because EM algorithm is sensible to starting values. Limitation of resources is also concerned.

Published in:

Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on

Date of Conference:

30 Oct.-1 Nov. 2005