Bridging Quantities in Tables and Text | IEEE Conference Publication | IEEE Xplore

Bridging Quantities in Tables and Text


Abstract:

There is a wealth of schema-free tables on the Web, holding valuable information about quantities on sales and costs, environmental footprint of cars, health data and mor...Show More

Abstract:

There is a wealth of schema-free tables on the Web, holding valuable information about quantities on sales and costs, environmental footprint of cars, health data and more. Table content can only be properly interpreted in conjunction with the textual context that surrounds the tables. This paper introduces the quantity alignment problem: bidirectional linking between textual mentions of quantities and the corresponding table cells, in order to support advanced content summarization and faster navigation between explanations in text and details in tables. We present the BriQ system for computing such alignments. BriQ is designed to cope with the specific challenges of approximate quantities, aggregated quantities, and calculated quantities in text that are common but cannot be directly matched in table cells. We judiciously combine feature-based classification with joint inference by random walks over candidate alignment graphs. Experiments with a large collection of tables from the Common Crawl project demonstrate the viability of our methods.
Date of Conference: 08-11 April 2019
Date Added to IEEE Xplore: 06 June 2019
ISBN Information:

ISSN Information:

Conference Location: Macao, China

Contact IEEE to Subscribe

References

References is not available for this document.