Cart (Loading....) | Create Account
Close category search window
 

Event identification within news topics

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Xiangying Dai ; Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen, China ; Yunlian Sun

With the vast amount of information arriving each day, it is necessary to develop automatic techniques for analyzing and handling these huge volumes of information. This problem is addressed by Topic Detection and Tracking (TDT), which organizes news stories by topics, and each topic is viewed as a flat collection of news stories. However, a topic in news is not only a flat collection of news stories but also a set of events. Additionally, there exists a three-layer hierarchy (topic → event → story), which can make people hold the new things that happen in the news easily. Therefore, to recognize the events in topics is significant. Unfortunately, the similarity between two stories, which belong to different events in a topic, is usually high. This is induced by common words occurring in both the two stories. And these common words usually cause events in the same topic to be mutually confusing. To address this problem, we present a novel approach for event identification in this paper. First, we need to remove topic-specific stopwords from each story, then some named-entities are selected as part of features due to their high distinguishable characteristic for identifying events. There is another issue deserving of in-depth consideration. We know weights on different features were empirically determined in the previous work. In our work, we propose a new method to calculate these weights. The experiments are implemented on a Linguistic Data Consortium dataset. The experimental results show that our scheme for event identification has significant improvement over the previous methods.

Published in:

Intelligent Computing and Integrated Systems (ICISS), 2010 International Conference on

Date of Conference:

22-24 Oct. 2010

Need Help?


IEEE Advancing Technology for Humanity About IEEE Xplore | Contact | Help | Terms of Use | Nondiscrimination Policy | Site Map | Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest professional association for the advancement of technology.
© Copyright 2014 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.