By Topic

A Comparative Study on Feature Window Selection in Text Filtering

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Hu Quan ; Coll. of Phys. Sci. & Technol., Huazhong Normal Univ., Wuhan, China ; Xie Fang ; Liu Xiaoguang

Text representation is a preliminary step to text filtering, while VSM is the most commonly used method in this field. However, the document feature set, which produced by VSM, usually has a very high dimensionality. As a result, the distribution of feature value tends to be highly skewed. In this paper some new mechanisms are presented to abate such problems. Using these mechanisms, document features are extracted from some smaller feature windows rather than a full text, such as sentences, graphs and blocks, and the correlative texts are finally evaluated by local similarity. They are gotten by the analysis of documentpsilas linguistics structures in documents. As a result, it can give a remarkable effect on the precision of text filtering.

Published in:

Information Technology and Applications, 2009. IFITA '09. International Forum on  (Volume:3 )

Date of Conference:

15-17 May 2009