Web sequential pattern mining is an important way to learn the access behavior of Web users. In this paper, we present an efficient method of Web sequential pattern mining in the e-learning environment. Different from traditional mining methods, we categorize the user sessions into human user sessions, crawler sessions and resource-download user sessions. Then we filter out the non-human user sessions, leaving the human user sessions for sequential pattern mining. With the purpose of mining users' meaningful sequential patterns, we identify users' transactions from the user sessions, and do the sequential pattern mining based on transaction level. We present a method of transaction identification based on users' access path tree. It can find out all the transactions effectively. We also make some improvements on PrefixSpan algorithm, which can reduce the memory space it takes and avoid generating duplicate projections. The experimental results of our mining method are very satisfactory. Using our mining method can efficiently learn the users' learning behavior in the e-learning environment.
Published in:
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Date of Conference: 11-13 Dec. 2009