New Event Detection is the task of automatically detecting novel events from a temporally-ordered stream of news story documents. Traditionally, on-line new event detection system determines whether the incoming document contains a new event based on the history of all processed documents. With the improvement of processed documents, the efficiency of on-line new event detection will decrease. In this paper, we apply a time window strategy to new event detection. A group of new incoming documents in the time window are firstly clustered to obtain candidate topics. Next, these candidate topics are compared with the previously identified topics to determine whether a new topic is detected. The first story of the new detected topic, in temporal order, is regarded as a new event. By analyzing the news story documents within the time window in group, the new event detection efficiency is improved. Furthermore, the evaluations show that the time window processing strategy is helpful to improve the accuracy of new event detection.
Published in:
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
(Volume:4
)
Date of Conference: 10-13 July 2011