By Topic

An Ensemble-Based Named Entity Recognition Solution for Detecting Consumer Products

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

The purchase and pricing options are temporarily unavailable. Please try again later.
1 Author(s)
Romaszko, L. ; Fac. of Math., Inf. & Mech., Univ. of Warsaw, Warsaw, Poland

This paper presents a technical description of a solution for International Conference on Data Mining 2012 Contest - Consumer Products number 1. The Contest provided a dataset including thousands of text items, a product catalog with over fifteen million products, and hundreds of manually annotated product mentions to support data-driven approaches. The task was to identify product mentions within a large user-generated web-based textual corpus and disambiguate the mentions against the large product catalog. The solution consists of an ensemble-based algorithm for processing a textual content. It uses Conditional Random Fields and a special approach which recognizes product mentions. This solution finished in the third place in the contest.

Published in:

Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on

Date of Conference:

10-10 Dec. 2012