In Situ Neural Relational Schema Matcher | IEEE Conference Publication | IEEE Xplore

In Situ Neural Relational Schema Matcher


Abstract:

The scarcity of training data restricts a neural network from capturing schema diversity and intricacies, which hinders the generalization of schema-matching models. In this paper, we propose ISResMat, a framework specifically designed to match the schemas of relational tables by fine-tuning a pre-trained language model. We first offer a training data construction method, Pairwise Sampling, which generates the training dataset from table data. Next, we design two loss functions (i.e., Meta-Matching Loss and Agent-Delegating Loss) to learn representations of table columns. With those representations, we can calculate matching scores between different table columns to deduce the matching candidates, which provides a novel approach to schema matching. Finally, we present two optimizations (i.e., Matching Rectification Loss and Distribution-Aware Fingerprint) to handle the problems of matching cardinality constraints and numerical columns, respectively. ISResMat is a flexible framework that supports instance-based, schema-based, and hybrid matching without significant modification. Experiments on 500+ fabricated and human-curated relation pairs spanning diverse domains and matching scenarios show that our approach outperforms existing state-of-the-art methods.
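The scoring step described in the abstract (column representations in, matching candidates out) can be illustrated with a minimal sketch. This is not ISResMat's implementation: the abstract does not state which similarity function or selection rule is used, so cosine similarity and a fixed threshold are assumptions here, and the column names, vectors, and the helpers `cosine` and `match_candidates` are hypothetical stand-ins for representations a fine-tuned language model would produce.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def match_candidates(source, target, threshold=0.5):
    """Score every (source column, target column) pair and keep the likely matches.

    source, target: dict mapping column name -> representation vector.
    Returns (source_column, target_column, score) tuples whose score is at
    least `threshold`, sorted by descending score.
    """
    scored = [
        (s_name, t_name, cosine(s_vec, t_vec))
        for s_name, s_vec in source.items()
        for t_name, t_vec in target.items()
    ]
    kept = [triple for triple in scored if triple[2] >= threshold]
    return sorted(kept, key=lambda triple: triple[2], reverse=True)


# Hypothetical column representations; a real system would learn these.
source = {"customer_name": [1.0, 0.0, 0.0], "zip": [0.0, 1.0, 0.0]}
target = {"client": [0.9, 0.1, 0.0], "postal_code": [0.1, 0.9, 0.0]}

candidates = match_candidates(source, target)
for s_name, t_name, score in candidates:
    print(f"{s_name} -> {t_name}: {score:.3f}")
```

A plain threshold does not enforce matching cardinality (for example, one-to-one); the abstract addresses that constraint separately through the Matching Rectification Loss, which this sketch does not attempt to reproduce.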
Date of Conference: 13-16 May 2024
Date Added to IEEE Xplore: 23 July 2024
Conference Location: Utrecht, Netherlands
The State Key Laboratory of Blockchain and Data Security, Zhejiang University, China
