I. Introduction
Data streams naturally arise in countless domains, such as medical analysis [10], online text [7], social activity mining [17], and sensor network monitoring [12]. For example, consider an online web-click stream, where a huge collection of logging entries are generated every second, with information of millions of users and URLs. The web-site owners would like to detect intrusions or target designed advertisements by investigating the user-click patterns. In such a situation, the most fundamental requirement is the efficient monitoring of data streams. Since the data streams arrive online at high bit rates and are potentially unbounded in size, the algorithm should handle ‘big data streams’ of billions (or even trillions [28]) of entries with fast response times, that is, it cannot afford any post-processing. And in addition, since the sampling rates of streams are frequently different and their time periods vary in practical situations, the mechanism should be robust against noise and provide scaling of the time axis.