By Topic

A Term Cluster Query Expansion Model Based on Classification Information in Natural Language Information Retrieval

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

5 Author(s)
Jeon Wook Kang ; Hankuk Acad. of Foreign Studies, Yongin, South Korea ; Hyun-Kyu Kang ; Myeong-Cheol Ko ; Heung Seok Jeon
more authors

A natural language information retrieval system ranks related documents according to criteria based on user query keywords and document similarities. However, many efforts have been made to make more useful query keywords because users do not use many keywords in their natural language search query when retrieving information on the Web. Because a keyword does not provide much information, however, relevance feedback is generally used to complement the weakness of general retrieval methods. This paper proposes a term cluster query expansion model based on classification information of retrieved documents. This model generates classification information from the upper ranked n documents retrieved by retrieval system. On the basis of the extracted classification information, the term cluster (m) that represents each group is generated, and then the model allows user to select term cluster that corresponds to user information needs. The query keywords are expanded by using a relevance feedback algorithm based on the selected classification information. As a result of the experiments with test collection, the retrieval effectiveness was improved by 13.2% compared to the initial query when the Rocchio method was used.

Published in:

Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on  (Volume:2 )

Date of Conference:

23-24 Oct. 2010