Skip to Main Content
With the development of information technologies, web data mining has been put forward and in wide research. It is defined as the discovery, extraction and analysis of useful and potential information from the World Wide Web. But much of inhomogeneous and anomalistic and dynamic updated semi-structured data in web pages makes web data mining difficult. To solve this problem, on the basis of analyzing the characteristics of XML, the paper presents a web data mining model on XML, and introduces the method to implement the model with XML and Java technologies in detail with the combination of an instance. Finally, some valuable discussions are put forward on this model for its shortages.