By analyzing the characteristics of Semi-structured data along with the actual Book Return Data (BokeDataInfo.xml) in the Auto department chain sales as an example, DOM objects extract, transform and load into the data tables of current level of detail of the Data Warehouse. The paper based XML Semi-structured data has designed and implemented a Data Warehouse ETL tool which based on the Semi-structured data. Meanwhile, it also cleans up the defect which the loading of Data Warehouse data can not directly loaded and extracted the XML documents by commercial ETL tool, and it also fathoms the practical exploration for the problem of extracting and loading the semi-structured XML data into the current level of detail of the Data Warehouse.
Published in:
Information Technology and Applications (IFITA), 2010 International Forum on
(Volume:3
)
Date of Conference: 16-18 July 2010