Skip to Main Content
Web Wrapper extracts the data from the given Web sources according to the corresponding extraction rules of them. Its' design is a key technology for Web information extraction and integration. This paper describes the design and implementation of a kind of the Web wrapper which based on pre-defined schema. Then it validates the data extraction from the new books information Web pages of some publishing companies and analyses the extraction results with this kind of Web Wrapper. We find it can accurately extract the data from the Web source. So we can conclude that this kind of Web Wrapper which proposed in this paper is feasible, efficient and maintainable. It will be applied for Web data integration based on wrapper/mediator that we rely on to develop a Web application for book information integration and query system.