SCHAIN-IRAM: An Efficient and Effective Semi-Supervised Clustering Algorithm for Attributed Heterogeneous Information Networks


Abstract:

A heterogeneous information network (HIN) is one whose nodes model objects of different types and whose links model objects’ relationships. To enrich its information, objects in an HIN are typically associated with additional attributes. We call such an HIN an Attributed HIN, or AHIN. We study the problem of clustering objects in an AHIN, taking into account objects’ similarities with respect to both their attribute values and their structural connectedness in the network. We show how a supervision signal, expressed as a must-link set and a cannot-link set, can be leveraged to improve clustering results. We put forward the SCHAIN algorithm to solve the clustering problem, along with two highly efficient variants, SCHAIN-PI and SCHAIN-IRAM, which compute eigenvectors of a matrix using the power iteration method and the implicitly restarted Arnoldi method, respectively. We conduct extensive experiments comparing SCHAIN-based algorithms with other state-of-the-art clustering algorithms. Our results show that SCHAIN-IRAM outperforms its competitors in clustering effectiveness and is highly efficient.
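The abstract names power iteration as the eigenvector routine behind SCHAIN-PI. As a rough illustration of that general technique only, and not the authors' implementation, the Python sketch below estimates the dominant eigenvalue and eigenvector of a small symmetric matrix. The function name `power_iteration`, its parameters, and the 3x3 `similarity` matrix are all assumptions made up for this example; none of them come from the paper.

```python
import math


def power_iteration(matrix, num_iters=1000, tol=1e-12):
    """Estimate the dominant eigenpair of a square matrix (list of rows)."""
    n = len(matrix)
    # Start from a uniform unit vector; it must have some component
    # along the dominant eigenvector for the iteration to find it.
    vec = [1.0 / math.sqrt(n)] * n
    eigenvalue = 0.0
    for _ in range(num_iters):
        # Multiply by the matrix and renormalize to unit length.
        w = [sum(a * x for a, x in zip(row, vec)) for row in matrix]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            raise ValueError("iterate collapsed to the zero vector")
        new_vec = [x / norm for x in w]
        # Rayleigh quotient v^T A v gives the eigenvalue estimate.
        av = [sum(a * x for a, x in zip(row, new_vec)) for row in matrix]
        new_eigenvalue = sum(x * y for x, y in zip(new_vec, av))
        converged = abs(new_eigenvalue - eigenvalue) < tol
        vec, eigenvalue = new_vec, new_eigenvalue
        if converged:
            break
    return eigenvalue, vec


# Hypothetical 3-node symmetric matrix (illustrative values only).
similarity = [
    [2.0, 1.0, 0.0],
    [1.0, 2.0, 1.0],
    [0.0, 1.0, 2.0],
]
value, vector = power_iteration(similarity)
print(value, vector)
```

Power iteration recovers only the single dominant eigenpair, and it converges slowly when the two largest eigenvalues are close in magnitude. The implicitly restarted Arnoldi method, available for instance through ARPACK, can return several eigenpairs at once, which is one plausible reading of why the abstract presents it as a separate variant.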
Published in: IEEE Transactions on Knowledge and Data Engineering (Volume: 34, Issue: 4, 01 April 2022)
Page(s): 1980 - 1992
Date of Publication: 27 May 2020

Xiang Li
Department of Computer Science, The University of Hong Kong, Hong Kong
Xiang Li received the PhD degree from the University of Hong Kong, in 2018. From 2018 to 2019, he worked as a research scientist with the Data Science Lab at JD.com. He is currently a research associate with the University of Hong Kong. His research interests include data mining and machine learning applications.
Yao Wu
Twitter, San Francisco, CA, USA
Yao Wu received the PhD degree in computer science from Simon Fraser University, Canada, in 2016. He is currently a senior machine learning engineer at Twitter.
Martin Ester
School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
Martin Ester received the PhD degree in computer science from ETH Zurich, Switzerland, in 1989. Since November 2001, he has been with the School of Computing Science of Simon Fraser University, first as an associate professor and now as a full professor. His current research interests include social network analysis, recommender systems, biological network analysis, and data mining.
Ben Kao
Department of Computer Science, The University of Hong Kong, Hong Kong
Ben Kao received the BSc degree in computer science from the University of Hong Kong, in 1989, and the PhD degree in computer science from Princeton University, in 1995. He is currently a professor with the Department of Computer Science, University of Hong Kong. His research interests include database management systems, data mining, real-time systems, and information retrieval systems.
Xin Wang
Department of Computer Science and Technology, Tsinghua University, Beijing, China
Xin Wang received the BE and PhD degrees in computer science and technology from Zhejiang University, China, and the PhD degree in computing science from Simon Fraser University, Canada. He is currently an assistant professor with the Department of Computer Science and Technology, Tsinghua University. His research interests include cross-modal multimedia intelligence and recommendation.
Yudian Zheng
Twitter, San Francisco, CA, USA
Yudian Zheng received the PhD degree in computer science from the University of Hong Kong, in 2017. He is currently an engineer at Twitter, working on building large-scale online distributed machine learning systems. His main research interests include data analysis, machine learning, and crowdsourced data management.
