A study of sample size with neural network
Ying-Jin Cui; Davis, S.; Chao-Kun Cheng; Xue Bai
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Volume 6, Issue , 26-29 Aug. 2004 Page(s): 3444 - 3448 vol.6
Digital Object Identifier
Summary: This study investigated sample complexity for a linearly separable dataset by training and testing a breast cancer database. This study considered two networks: a single layer network and a multilayer network. We observed that the training sample size could be 1 for both networks with good generalization results under different conditions. The multilayer network performed well with any training sample but the single layer network required selection of a training sample having informative class output value. When the multilayer network was trained with a small training sample and the threshold for the testing network output was set at an appropriate value the test error became as low as 2%. We concluded that for a linearly separable dataset it is possible achieve good performance by training a network with small sample size, such as 1 or 2.
View citation and abstract |