C4.5 decision forests

  • Download Citations
  • Email
  • Print
  • Rights And Permissions

Access The Full Text

Sign In:Full text access may be available with your subscription

Forgot Username/Password?Athens/Shibboleth Sign In


Tin Kam Ho;  
Lucent Technol., AT&T Bell Labs., Murray Hill, NJ 

This paper appears in: Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Issue Date: 16-20 Aug 1998
On page(s): 545 - 549 vol.1
Meeting Date: 16 Aug 1998 - 20 Aug 1998
Location: Brisbane, Qld. , Australia
Print ISBN: 0-8186-8512-3
Cited by : 2
INSPEC Accession Number: 6096499
Digital Object Identifier: 10.1109/ICPR.1998.711201 
Date of Current Version: 06 August 2002

Abstract

Much of previous attention on decision trees focuses on the splitting criteria and optimization of tree sizes. The dilemma between overfitting and achieving maximum accuracy is seldom resolved. We propose a method to construct a decision tree based classifier that maintains highest accuracy on training data and improves on generalization accuracy as it grows in complexity. Trees are generated using the well-known C4.5 algorithm, and the classifier consists of multiple trees constructed in pseudo-randomly selected subspaces of the given feature space. We compare the method to single-tree classifiers and other forest construction methods by experiments on four public data sets, where the method's superiority is demonstrated. A measure is given to describe the similarity between trees in a forest, and is related to the combined classification accuracy

Available to subscribers and IEEE members.

Available to subscribers and IEEE members.

Available to subscribers and IEEE members.



Indexed by Inspec

© Copyright 2012 IEEE – All Rights Reserved