We are currently experiencing intermittent issues impacting performance. We apologize for the inconvenience.
By Topic

Constructing features for document classification by using temporal patterns of term usages

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Abe, H. ; Dept. of Med. Inf., Shimane Univ., Izumo, Japan ; Tsumoto, S.

In document classification method by using appeared words as features, it is important to determine keywords for the features to characterize each document. However, conventional methods select the keywords based on their frequency or/and particular importance index such as tf-idf, and cut-off the other appeared words by using a threshold value. This omits remaining information such as rare combinations of the appeared words and time dependent differences of their usages. In this paper, we present the availability of the features based on temporal patterns of the overall words and phrases for temporally published documents in one domain. Thus, the documents are characterized by the temporal patterns of one or more importance indices for considering temporal differences of the overall term usages. In the experiment, we compare document classification results of two sets of bibliographical documents on the time dependency by using the two types of the feature set. For an exploratory class labels, we show the availability for obtaining classification rules that mention the relationship between the class and the important temporal patterns for the prediction.

Published in:

Granular Computing (GrC), 2011 IEEE International Conference on

Date of Conference:

8-10 Nov. 2011