Skip to Main Content
This paper proposes a new clustering algorithm referred to as the possibilitic latent variables (PLV) clustering algorithm. This algorithm provides a powerful tool for the analysis of complex data, such as clinical diagnosis and biological expressions data, due to its robustness to various data distributions and its accuracy in establishing appropriate groups from data. The algorithm combines a distribution model and the fuzzy degrees concept. Compared to the expectation-maximization (EM) algorithm, which is a well-known distribution estimating algorithm, the PLV algorithm has the considerable advantage that it can be applied to various data types, i.e. it is not restricted solely to Gaussian data distributions. Additionally, the proposed algorithm has a better performance than the well-known fuzzy clustering algorithm, i.e. the FCM algorithm, where it can address compact regions, other than simply dividing objects into several equal populations. The performance of the proposed algorithm is verified by conducting clustering tasks on the contents of several medical diagnosis and biological expressions datasets.
Date of Conference: 15-20 April 2007