Abstract:
Clustering is one of the most useful methods of intelligent engineering domain, in which a set of similar objects are categorized into clusters. Almost all of the well-kn...Show MoreMetadata
Abstract:
Clustering is one of the most useful methods of intelligent engineering domain, in which a set of similar objects are categorized into clusters. Almost all of the well-known clustering algorithms require input parameters which are hard to determine but have a significant influence on the clustering result. Furthermore, the majority is not robust enough towards noisy data. This paper presents an efficient and effective clustering technique, named DBSCAN-GM that combines Gaussian-Means and DBSCAN algorithms. The idea of DBSCAN-GM is to cover the limitations of DBSCAN, by exploring the benefits of Gaussian-Means: it runs Gaussian-Means to generate small clusters with determined cluster centers, in purpose to estimate the values of DBSAN's parameters. The results of our method show that it is efficient even for large data sets especially data with large dimension and capable to handle noises, contrary to partitioning algorithms such as K-Means or Gaussian-Means. Additionally, DBSCAN-GM does not necessitate any priori information, in contrast to the density clustering DBSCAN obliging two input parameters which are hard to guess, namely Eps (the radius that bounds the neighborhood region of an object) and MinPts (the minimum number of objects that must exist in the objects neighborhood region). Simulative experiments are carried out on a variety of datasets, which highlight the DBSCAN-GM's effectiveness and cluster validity to check the good quality of clustering results.
Date of Conference: 13-15 June 2012
Date Added to IEEE Xplore: 30 July 2012
ISBN Information:
Print ISSN: 1543-9259
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Clustering Method ,
- Input Parameters ,
- Clustering Algorithm ,
- Clustering Results ,
- Noisy Data ,
- Cluster Centers ,
- Dense Clusters ,
- Clustering Techniques ,
- Partitioning Algorithm ,
- Centroid ,
- Valid Measure ,
- Distancing Measures ,
- Subset Of Data ,
- Clusters Of Points ,
- Global Parameters ,
- Arbitrary Shape ,
- Clustering Quality ,
- Density-based Clustering ,
- Minimum Description Length ,
- Core Point ,
- Cluster Shape ,
- Ep Values ,
- Respective Clusters ,
- Choice Of Position
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Clustering Method ,
- Input Parameters ,
- Clustering Algorithm ,
- Clustering Results ,
- Noisy Data ,
- Cluster Centers ,
- Dense Clusters ,
- Clustering Techniques ,
- Partitioning Algorithm ,
- Centroid ,
- Valid Measure ,
- Distancing Measures ,
- Subset Of Data ,
- Clusters Of Points ,
- Global Parameters ,
- Arbitrary Shape ,
- Clustering Quality ,
- Density-based Clustering ,
- Minimum Description Length ,
- Core Point ,
- Cluster Shape ,
- Ep Values ,
- Respective Clusters ,
- Choice Of Position