By Topic

Labeled and unlabeled data in text categorization

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
C. Silva ; Escola Superior de Tecnologia e Gestao, Instituto Politecnico de Leiria, Portugal ; B. Ribeiro

There is a growing interest in exploring the use of unlabeled data as a way to improve classification performance in text categorization. The ready availability of this kind of data in most applications makes it an appealing source of information. This work reports a study carried out on the Reuters-21578 corpus to evaluate the performance of support vector machines when unlabeled examples are introduced in the learning process. The improvement achieved, especially in false negative values and therefore in recall values, demonstrates that the use of unlabeled examples can be very important for small data sets.

Published in:

Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on  (Volume:4 )

Date of Conference:

25-29 July 2004