Skip to Main Content
This paper compares the performance of three clustering algorithms on the task of outlier's detection. The goal is to illustrate that better clustering indicates better detection of outliers. k-means (KM), Bisecting k-means (BKM) and the partitioning around medoids (PAM) algorithms are each combined with the clustering-based outliers detection (Find CBLOF) method. Undertaken experimental results over four gene expression datasets where outliers are presented show that the clustering solutions of the PAM algorithm enable the Find CBLOF algorithm to discover more outliers than those of both the k-means and the bisecting k-means algorithms. The main reason for this is that PAM provides better clustering quality than that of the other two clustering algorithms on the tested datasets measured by external and internal quality measures.
Date of Conference: June 29 2008-July 5 2008