Loading [a11y]/accessibility-menu.js
Automated extraction of concept matcher thesaurus from semi-structured catalogue-like sources of data on the web | IEEE Conference Publication | IEEE Xplore

Automated extraction of concept matcher thesaurus from semi-structured catalogue-like sources of data on the web


Abstract:

Ontology design and the process of populating a data-set with knowledge following the chosen or developed ontology to fit the principles of Semantic Web and Linked Open D...Show More

Abstract:

Ontology design and the process of populating a data-set with knowledge following the chosen or developed ontology to fit the principles of Semantic Web and Linked Open Data is a time-consuming and iterative process, requiring either expert knowledge or a set of tools for data scraping from web. A valid and consistent ontology and knowledge withing the data-set require unification of concepts which means overcoming ambiguity and synonymy of terms which become individuals of ontology. In this paper we spot on techniques used for organising a Russian food product data-set under a light-weight FOOD Ontology and concept matching in particular. Main approaches to data-set concept unification, synonymic term matching and ways to collect dictionaries for matcher are mentioned. The tool for catalogue-like semi-structured resources parsing and thesaurus extraction is developed and introduced for the task of on-the-fly concept matching.
Date of Conference: 18-22 April 2016
Date Added to IEEE Xplore: 08 September 2016
ISBN Information:
Electronic ISSN: 2305-7254
Conference Location: St. Petersburg, Russia

Contact IEEE to Subscribe

References

References is not available for this document.