Processing math: 100%
Finding the PG schema of any (semi)structured dataset: a tale of graphs and abstraction | IEEE Conference Publication | IEEE Xplore

Finding the PG schema of any (semi)structured dataset: a tale of graphs and abstraction


Abstract:

Property Graphs (PGs) are an attractive data model both for business users, and for developers of data management tools. They combine the internal structure helpful in re...Show More

Abstract:

Property Graphs (PGs) are an attractive data model both for business users, and for developers of data management tools. They combine the internal structure helpful in relational databases, where each record has a clearly identified set of attributes, with the flexible structure and support for heterogeneity, common in graph databases. Several useful and/or interesting datasets are available in non-PG data models. These include legacy databases, created before the advent of the PG standards, as well as well-known benchmarks based on real and synthetic data, Open Data published in other formats such as XML, JSON or RDF, etc. Converting such datasets to Property Graphs would enable their exploitation under the PG model. In this work-in-progress paper, we describe an approach to derive, from any (semi)-structured dataset, a PG schema consisting of node types, edge types, and a graph type. Our approach builds on (i) ConnectionLens, a tool for converting (semi)structured datasets into simple data graphs, and (ii) Abstra, which, in a ConnectionLens graph, identifies a set of entities and relationships. This work is the first step towards a universal data migration tool from (semi)-structured data, to PGs.
Date of Conference: 13-16 May 2024
Date Added to IEEE Xplore: 17 June 2024
ISBN Information:

ISSN Information:

Conference Location: Utrecht, Netherlands

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.