Sequential pattern mining is an active area of knowledge discovery that data mining researchers have studied for over a decade. With steady progress in hardware and software technologies, real-world applications such as network monitoring systems and sensor grids increasingly generate huge amounts of streaming data. Mining such data requires an efficient and scalable parallel algorithm. After examining the problems common to current sequential pattern mining algorithms and studying existing serial sequential pattern mining algorithms, this paper proposes a scalable parallel sequential pattern mining algorithm based on sequential patterns and projection databases. Theoretical analysis and experimental verification show that the parallel algorithm reduces computational and space complexity and improves mining efficiency on massive data.
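The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea behind projection-database mining, in the style of PrefixSpan, and not the paper's actual method. It assumes each sequence is a list of single items and that support is counted as the number of sequences containing a pattern. The function names (`project`, `frequent_items`, `prefixspan`, `mine_by_partition`) are hypothetical and chosen for illustration.

```python
from collections import Counter


def frequent_items(database, min_support):
    """Count, for each item, how many sequences contain it; keep the frequent ones."""
    counts = Counter()
    for seq in database:
        counts.update(set(seq))  # each sequence contributes at most once per item
    return {item: c for item, c in counts.items() if c >= min_support}


def project(database, item):
    """Projected database for `item`: the suffix after its first occurrence."""
    return [seq[seq.index(item) + 1:] for seq in database if item in seq]


def prefixspan(database, min_support, prefix=()):
    """Recursively grow `prefix` using only the projected database."""
    patterns = {}
    for item, support in frequent_items(database, min_support).items():
        new_prefix = prefix + (item,)
        patterns[new_prefix] = support
        patterns.update(
            prefixspan(project(database, item), min_support, new_prefix)
        )
    return patterns


def mine_by_partition(database, min_support):
    """Split the search by first item (assumed partitioning scheme).

    Each frequent first item yields an independent sub-task over its own
    projected database, so the loop body could be handed to separate workers.
    """
    patterns = {}
    for item, support in frequent_items(database, min_support).items():
        patterns[(item,)] = support
        patterns.update(
            prefixspan(project(database, item), min_support, prefix=(item,))
        )
    return patterns


# Toy database of four sequences (made up for illustration).
db = [list("abc"), list("acb"), list("ab"), list("bc")]
patterns = mine_by_partition(db, min_support=2)
```

Because each first-item partition touches only its own projected database, the sub-tasks share no state; this independence is what makes a scheme of this kind a natural candidate for parallel execution, though how the paper distributes the work is not stated in the abstract.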