Abstract:
Publicly available Web search engines suffer from several limitations, which significantly reduce usability in particular cases. The most important limitations are out-of...Show MoreMetadata
Abstract:
Publicly available Web search engines suffer from several limitations, which significantly reduce usability in particular cases. The most important limitations are out-of-date information, very simple query language and limited number of results. In many cases, users of the Internet are interested in finding new information which appear in the particular Web portal. In this paper, a system for monitoring of Web sites is presented. The system can continuously analyze the content of specified Web pages using advanced text processing algorithms. It actively notifies the user when required information is found in newly-added content. It can be deployed on a single PC as well as on a cluster of computers, providing good scalability. The paper presents an abstract architecture of the system, details of the implementation and real-life experiments results.
Date of Conference: 08-11 September 2013
Date Added to IEEE Xplore: 07 November 2013
Electronic ISBN:978-83-60810-52-1
Conference Location: Krakow, Poland