Abstract:
Emerging topic detection from microblogs has developed into an attractive task because events usually break on social channels. However, due to the features of high noise...Show MoreMetadata
Abstract:
Emerging topic detection from microblogs has developed into an attractive task because events usually break on social channels. However, due to the features of high noise, short length, fast arriving rate and irregular writing style of microblogs, it has been proven to be a challenge to detect emerging topics from microblog streams early and accurately in a scalable way. Several approaches have been proposed to tackle this problem and have achieved sound performance in some aspects. However, from the point of novelty and scalability, there is still considerable space for improvement. Inspired by the consideration, we propose an emerging topic detection framework based on emerging pattern mining. Via encoding the term novelty into an efficient high utility itemset mining (HUIM) algorithm, a group of emerging patterns which are concise and interpretive representations of topics can be first detected, decreasing the computational cost of the clustering part.
Published in: 2018 IEEE 22nd International Conference on Computer Supported Cooperative Work in Design ((CSCWD))
Date of Conference: 09-11 May 2018
Date Added to IEEE Xplore: 16 September 2018
ISBN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Pattern Mining ,
- Topic Detection ,
- Emerging Pattern Mining ,
- Scalable ,
- High Use ,
- Considerable Space ,
- Space For Improvement ,
- Scalable Way ,
- Sequential Pattern Mining ,
- Time Interval ,
- Social Media ,
- Topical ,
- Problem Definition ,
- Semantic Similarity ,
- Topic Modeling ,
- Word Embedding ,
- Term Frequency ,
- Short Text ,
- Hashtags ,
- Use Of Items ,
- Word Usage ,
- Topic Clusters ,
- Partitioning Method ,
- Clustering-based Methods ,
- Sina Weibo
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Pattern Mining ,
- Topic Detection ,
- Emerging Pattern Mining ,
- Scalable ,
- High Use ,
- Considerable Space ,
- Space For Improvement ,
- Scalable Way ,
- Sequential Pattern Mining ,
- Time Interval ,
- Social Media ,
- Topical ,
- Problem Definition ,
- Semantic Similarity ,
- Topic Modeling ,
- Word Embedding ,
- Term Frequency ,
- Short Text ,
- Hashtags ,
- Use Of Items ,
- Word Usage ,
- Topic Clusters ,
- Partitioning Method ,
- Clustering-based Methods ,
- Sina Weibo
- Author Keywords