Skip to Main Content
Social tagging is a major characteristic of Web 2.0. A social tagging system can be modeled with a tripartite network of users, resources, and tags. In this paper, we investigate how to enhance Web clustering by leveraging the tripartite network of social tagging systems. We propose a clustering method called “Tripartite Clustering” which clusters the three types of nodes (resources, users, and tags) simultaneously by only utilizing the links in the social tagging network. We also investigate two other approaches to exploit social tagging for clustering with K-means and Link K-means. All the clustering methods are experimented on a real-world social tagging data set sampled from del.icio.us. The clustering results are evaluated against a human-maintained Web directory. The experimental results show that the social tagging network is a very useful information source for document clustering. All social-annotation-based clustering methods can significantly improve the performance of content-based clustering. Compared to social-annotation-based K-means and Link K-means, Tripartite Clustering achieves equivalent or better performance and produces more useful information.
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on (Volume:41 , Issue: 5 )
Date of Publication: Sept. 2011