1. Introduction
Decision trees have been widely used in classifications, and many algorithms have been proposed for the construction of decision trees. Classical decision tree algorithms include Concept Learning System (CLS), ID3 [1], [2], C4.5 [3] and Classification and Regression Trees (CART) [4] etc. The purpose of building decision trees is to predict the unknown samples correctly. The tree should reflect the distribution of the data and be as small as possible. It is generally recognized that smaller trees usually have stronger generalization abilities. Most decision trees are generalized using a top-down style method, choosing attributes as the nodes of decision tree under some selection criteria [5]. The growth of decision trees is a process of dividing the training set recursively, the procedure terminate when all samples of each subset belong to the same class.