Skip to Main Content
Data transactions between business partners often include unstructured data such as invoices or purchase orders. In order to process such automatically, complex business entities need to be identified. Examples for complex entities are products, business partners and purchase orders which are stored in a supplier relationship management system. Both, structured records in the enterprise system and text data, describe these complex entities. A major challenge is to correctly associate entities recognized in unstructured data with entities stored in structured data, e.g. enterprise databases. We address that problem and propose a robust process methodology which includes three phases: candidate extraction from unstructured text, generation of initial mappings with structured data and disambiguation of the mappings exploiting relationships among the entities in the enterprise data and the documentspsila structure. We describe each step in detail, propose a common architecture and introduce to our data model and algorithms.