By Topic

Constructing term thesaurus using text association rule mining

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Alisa Kongthon ; Human Language Technology (HLT) Laboratory, National Electronics and Computer Technology Center (NECTEC), Thailand Science Park, Klong Luang, Pathumthani 12120, Thailand ; Choochart Haruechaiyasak ; Santipong Thaiprayoon

This paper presents a new algorithm called ldquoconcept-groupingrdquo that adapts an association rule mining technique to construct term thesaurus for data preprocessing purpose. Similar terms, which are written differently, can be grouped together into the same concept based on their associations before they are used for subsequent analysis. This data preprocessing is important since it has an impact on the quality of other data mining techniques such as data clustering. The algorithm is applied to bibliographic databases such as INSPEC and EI Compendex toward the objective of enhancing traditional bibliometrics and content analysis. From the experiments with a set of publication abstracts, applying the proposed algorithm to combine similar terms into a pertinent concept before clustering process yields better cluster quality.

Published in:

Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, 2008. ECTI-CON 2008. 5th International Conference on  (Volume:1 )

Date of Conference:

14-17 May 2008